RELATING RANKS 225
each soil zone divided by the total number of square kilometers covered by that
zone to indicate how densely the zone was occupied, and we wish to investigate
whether more productive soil zones were more densely inhabited.
The first step in calculating Spearman’s rank correlation is to determine the rank
orderings of all the cases for each of the two variables (taken separately). These rank
orderings are also given in Table 16.1. Ties frequently occur in the soil productivity
ratings. That is, for example, soil zones H and K are ranked in the lowest produc-
tivity category (1). These two least productive soil zones should be rank ordered 1
and 2, but we have no basis for putting one above the other since they are tied in
the productivity ratings. As a consequence, we assign each a rank order of 1.5 (the
mean of 1 and 2). Soil zones C, I, and P are tied with productivity ratings of 3. These
would be soil zones 5, 6, and 7 in rank order if we could determine which to put
above the other. Since we cannot make this determination, each is assigned a rank
order of 6 (the mean of 5, 6, and 7). Such a treatment is accorded whenever there are
ties. No ties occur in the number of villages per square kilometer (which actually is
a true measurement), so the rank ordering is simpler. It begins at 1 for soil zone K
and continues through zones A, H, and so on to zone F, which ranks 17th because it
has the highest number of village sites per square kilometer.
Subtracting the rank orderings for villages per square kilometer (Y) from the
rank orderings for soil productivity (X) gives us the difference between rankings d,
which we then square and sum up to get
∑
d
2
.
The last four columns in Table
16.1 concern a correction that must be made for
ties. The value t for each soil zone is the total number of soil zones that are tied at
that ranking. For example, soil zone A has a value of t
x
= 2 because a total of two
zones (A and N) are tied at its productivity rating of 2. Since there are no ties for
number of villages per square kilometer, all the values of t
y
are 1. For each t value
for each of the two variables, a value of T is obtained as follows:
T =
t
3
−t
12
The calculation of Spearman’s rank correlation requires three sums from
Table
16.1:
∑
d
2
,
∑
T
x
,and
∑
T
y
. A sum of squares is calculated for each of the
two variables:
∑
x
2
=
n
3
−n
12
−
∑
T
x
where
∑
T
x
is from Table 16.1,andn is the number in the sample (17 in this
example). Thus, for the example in Table
16.1,
∑
x
2
=
17
3
−17
12
−17.0 = 408 −17 = 391 and
∑
x
2
=
17
3
−17
12
−0.0 = 408−0 = 408
Spearman’s rank correlation, then, is given by the equation