(e.g. earthquake magnitude on the Richter scale vs. energy released, crystal
population density vs. crystal size) are logarithms, squares or cubes. The
apparently linear pH scale is actually logarithmic – a pH of 4 indicates a ten-
fold difference from pH 5 and a 100-fold difference from pH 6. Therefore, in
many cases it is actually more appropriate to transform the data so they
reflect the underlying relationship.
Importantly, if you transform a set of data, you also need to transform
your null and alternate hypotheses. For example, if you were to hypothesize
that “the steepness of river banks is not related to river basin size” but
carried out a logarithmic transformation on your data before analysis, your
original hypothesis would also have to be transformed to “the steepness of
river banks is not related to the logarithm of river basin size”.
13.7 Tests for heteroscedasticity
There are several tests designed to examine whether two or more samples
appear to have come from populations with the same variance. As men-
tioned earlier, if you are only interested in whether the data are suitable for a
parametric analysis, the general rule that the ratio of the largest variance to
the smallest should not exceed 4 : 1 can be used. If this ratio is greater, it may
be useful to examine the data and see where the differences occur because it
may be possible to transform the data so that a parametric analysis can
be done.
If, instead, you are interested in testing an hypothesis about the variance
of two or more samples, you can use the Levene test, which also gives an F
ratio. Remember, however, that a significant result for the Levene test may
not mean the data are unsuitable for analysis by ANOVA, which is quite
robust to heteroscedasticity.
Levene’s original test calculates the absolute difference between each
replicate and its treatment mean and then does a one-factor ANOVA on
these differences. The absolute difference is the difference between any two
numbers expressed as a positive value. (For example, the difference between
6 and 3 is –3, while the difference between 3 and 6 is +3, but the absolute
difference in both cases is +3.)
Figures 13.6 and 13.7 are a pictorial explanation of the Levene test. Two
cases are shown, using the data on apatite abundance in sandstones that
were first described in Section 11.3.3.
174 Important assumptions of ANOVA