120 CHAPTER 9
than the number of elements in the sample (n −1). For the moment, we will just take
this notion of degrees of freedom (often abbreviated d. f .) on faith. For our sample,
n −1 = 99. There is no row corresponding exactly to 99 degrees of freedom, so we
will use the row for 120 d. f., which comes closest. We are looking for the exact
level of confidence associated with an error range of 2 standard errors, so we read
across that row looking for 2. In the fourth column we find 1.98 (which we’ll take
as close enough to 2 for the moment).
The fourth column is headed 95% confidence. This means that 95% of the pos-
sible populations (represented by the shaded area “under the normal curve” in
Fig.
9.7) that our sample could come from lie within 1.98 standard errors of the
mean of our sample. Thus, when we say that it is “very likely” that our sample came
from a population with a mean of 3.35cm±0.10cm, what we mean more precisely
is that there is about a 95% probability that our sample came from such a popula-
tion. We are 95% confident that our sample came from a population with a mean of
3.35cm±0.10cm. We are not certain that our sample came from a population with
a mean of 3.35cm ±0.10cm, but the probability that this is the case is 95%.
Since the probability that our sample came from a population with a mean
between 3.25 cm and 3.45 cm is 95%, the probability that it came from a popula-
tion with a mean less than 3.25 cm or greater than 3.45 cm is 5%. (This has to be
true since the probability that it came from one or the other of these groups is 100%.)
Since a normal shape is symmetrical, this 5% is evenly distributed in both “tails” of
the distribution. There is a 2.5% probability that our sample came from a population
with a mean less than 3.25 cm and a 2.5% probability that our sample came from
a population with a mean greater than 3.45 cm. When we provide an error range of
about 2 standard errors, then, as we have done here, we are speaking at a 95% con-
fidence level. This follows directly from the observation that a number that falls 2
standard deviations or more away from the mean in its batch is a very unusual num-
ber in terms of its batch. Specifically, only about 5% of the numbers in a normally
distributed batch fall this far from the mean.
Every error range (or confidence interval) expressed in terms of standard errors
corresponds to a specific confidence level. (The terms confidence interval and con-
fidence level are too close for comfort, considering that they refer to two rather
different concepts. Thus the term error range is used here in preference to confi-
dence interval.) An error range of ±3 standard errors, as illustrated in Fig.
9.8,cor-
responds to approximately 99.8% confidence. Reading across the row in Table
9.1
that corresponds to 120 d. f ., as we did before, and looking for 3 brings us to the
next-to-last column, where 3.160 is relatively close to 3. This column is headed
99.8% confidence. Thus when we concluded, on the basis of Fig.
9.8 that it is very
likely that our sample comes from a population with a mean of 3.35 cm ± 0.15 cm,
that “very likely” actually meant a probability of around 99.8%. There is only about
a 0.2% probability that the population our sample came from has a mean less than
3.20 cm or greater than 3.50 cm. Once again, since a normal shape is symmetrical,
that means about a 0.1% probability that the population our sample came from has a
mean less than 3.20 cm and about a 0.1% probability that it has a mean greater than
3.50 cm.