be equal to zero for each pair of observed and expected frequencies. Such a result would
yield a value of equal to zero, and we would be unable to reject
When there is disagreement between observed frequencies and the frequencies one
would expect given that is true, at least one of the terms in Equation 12.2.4
will be a nonzero number. In general, the poorer the agreement between the and the
the greater or the more frequent will be these nonzero values. As noted previously,
if the agreement between the and the is sufficiently poor (resulting in a sufficiently
large value,) we will be able to reject
When there is disagreement between a pair of observed and expected frequencies,
the difference may be either positive or negative, depending on which of the two frequen-
cies is the larger. Since the measure of agreement, is a sum of component quantities
whose magnitudes depend on the difference positive and negative differences
must be given equal weight. This is achieved by squaring each difference. Divid-
ing the squared differences by the appropriate expected frequency converts the quantity
to a term that is measured in original units. Adding these individual terms
yields a summary statistic that reflects the extent of the overall agreement between
observed and expected frequencies.
The Decision Rule The quantity will be small if the observed
and expected frequencies are close together and will be large if the differences are large.
The computed value of is compared with the tabulated value of with
degrees of freedom. The decision rule, then, is: Reject if is greater than or equal
to the tabulated for the chosen value of
Small Expected Frequencies Frequently in applications of the chi-square
test the expected frequency for one or more categories will be small, perhaps much less
than 1. In the literature the point is frequently made that the approximation of to
is not strictly valid when some of the expected frequencies are small. There is disagree-
ment among writers, however, over what size expected frequencies are allowable before
making some adjustment or abandoning in favor of some alternative test. Some writ-
ers, especially the earlier ones, suggest lower limits of 10, whereas others suggest that
all expected frequencies should be no less than 5. Cochran (4, 5), suggests that for good-
ness-of-fit tests of unimodal distributions (such as the normal), the minimum expected
frequency can be as low as 1. If, in practice, one encounters one or more expected fre-
quencies less than 1, adjacent categories may be combined to achieve the suggested min-
imum. Combining reduces the number of categories and, therefore, the number of degrees
of freedom. Cochran’s suggestions appear to have been followed extensively by practi-
tioners in recent years.
12.3 TESTS OF GOODNESS-OF-FIT
As we have pointed out, a goodness-of-fit test is appropriate when one wishes to decide
if an observed distribution of frequencies is incompatible with some preconceived or
hypothesized distribution.
x
2
x
2
X
2
a.x
2
X
2
H
0
k - rx
2
X
2
g31O
i
- E
i
2
2
>E
i
4
X
2
,
1O
i
- E
i
2
2
>E
i
O
i
- E
i
O
i
- E
i
,
X
2
,
H
0
.X
2
E
i
O
i
E
i
,
O
i
O
i
- E
i
H
0
H
0
.X
2
12.3 TESTS OF GOODNESS-OF-FIT
597