
252 7 Point and Interval Estimators
Assume that the measurements are normally distributed with mean µ and
variance
σ
2
, but both parameters are unknown. The sample mean and vari-
ances are
X = 10.098 , s
2
= 2.1702, and s =1.4732. Also, the confidence inter-
val would use an appropriate t-quantile, in this case
tinv(1-0.05/2, 51-1) =
2.0086
.
The 95% confidence interval for the population mean,
µ, is
·
10.098 −2.0086 ×
1.4732
p
51
, 10.098
+2.0086 ×
1.4732
p
51
¸
=
[9.6836,10.5124].
Thus, the unknown mean
µ belongs to the interval [9.6836,10.5124] with con-
fidence 95%. That means that if the sample is obtained many times and for
each sample the confidence interval is calculated, 95% of the intervals would
contain
µ.
To find, say, the 90% confidence interval for the population variance,
σ
2
, we
need
χ
2
quantiles, chi2inv(1-0.10/2, 51-1) = 67.5048, and chi2inv(0.10/2,
51-1) = 34.7643
. According to (7.7), the interval is
[
(51
−1) ×2.1702/67.5048, (51 −1) ×2.1702/34.7643
]
=[1.6074,3.1213].
Thus, the interval [1.6074,3.1213] covers the population variance
σ
2
with a
confidence of 90%.
Example 7.9. An alternative confidence interval for the normal variance is
possible. Since by the CLT s
2
approx
∼ N
³
σ
2
,
2σ
4
n−1
´
(Can you explain why?),
when n is not small, an approximate (1
−α)100% confidence interval for σ
2
is
"
s
2
−z
1−α/2
·
p
2 s
2
p
n −1
, s
2
+z
1−α/2
·
p
2 s
2
p
n −1
#
.
In Example 7.8, s
2
= 2.1702 and n = 51. A 90% confidence interval for the
variance was [1.6074,3.1213]. By normal approximation,
s2 = 2.1702; n=51; alpha = 0.1;
[s2 - norminv(1-alpha/2)
*
sqrt(2)
*
s2/sqrt(n-1), ...
s2 + norminv(1-alpha/2)
*
sqrt(2)
*
s2/sqrt(n-1)]
%ans = 1.4563 2.8841
The interval [1.4563, 2.8841] is shorter, compared to the standard con-
fidence interval [1.6074,3.1213] obtained using
χ
2
quantiles, as 1.4278 <
1.5139. Insisting on equal-probability tails does not lead to the shortest in-
terval since the
χ
2
distribution is asymmetric. In addition, the approximate
interval is centered at s
2
. Why, then, is this interval not used? The coverage
probability of a CLT-based interval is smaller than the nominal 1
−α, and un-
less n is large (
>100, say), this discrepancy can be significant (Exercise 7.26).