Adding demographic variables reduces the MPS whether OLS or WLS is used; the stan-
dard errors also increase by a fair amount (due to multicollinearity that is induced by adding
these additional variables). It is easy to see, using either the OLS or WLS estimates, that
none of the additional variables is individually significant. Are they jointly significant? The F
test based on the OLS estimates uses the R-squareds from columns (1) and (3). With 94 df
in the unrestricted model and four restrictions, the F statistic is F [(.0828 .0621)/(1
.0828)](94/4) ⬇ .53 and p-value .715. The F test, using the WLS estimates, uses the
R-squareds from columns (2) and (4): F ⬇ .50 and p-value .739. Thus, using either OLS
or WLS, the demographic variables are jointly insignificant. This suggests that the simple
regression model relating savings to income is sufficient.
What should we choose as our best estimate of the marginal propensity to save? In this
case, it does not matter much whether we use the OLS estimate of .147 or the WLS esti-
mate of .172. Remember, both are just estimates from a relatively small sample, and the
OLS 95% confidence interval contains the WLS estimate, and vice versa.
In practice, we rarely know how the variance depends on a particular independent
variable in a simple form. For example, in the savings equation that includes all demo-
graphic variables, how do we know that the variance of sav does not change with age
or education levels? In most applications,
we are unsure about Var(y兩x
1
,x
2
…, x
k
).
There is one case where the weights
needed for WLS arise naturally from an
underlying econometric model. This hap-
pens when, instead of using individual
level data, we only have averages of data
across some group or geographic region. For example, suppose we are interested in
determining the relationship between the amount a worker contributes to his or her
401(k) pension plan as a function of the plan generosity. Let i denote a particular firm
and let e denote an employee within the firm. A simple model is
contrib
i,e
0
1
earns
i,e
2
age
i,e
3
mrate
i
u
i,e
, (8.28)
where contrib
i,e
is the annual contribution by employee e who works for firm i, earns
i,e
is annual earnings for this person, and age
i,e
is the person’s age. The variable mrate
i
is
the amount the firm puts into an employee’s account for every dollar the employee con-
tributes.
If (8.28) satisfies the Gauss-Markov assumptions, then we could estimate it, given
a sample on individuals across various employers. Suppose, however, that we only have
average values of contributions, earnings, and age by employer. In other words, indi-
vidual-level data are not available. Thus, let denote average contribution for
people at firm i, and similarly for and . Let m
i
denote the number of employ-
ees at each firm; we assume that this is a known quantity. Then, if we average equation
(8.28) across all employees at firm i, we obtain the firm-level equation
0
1
2
3
mrate
i
, (8.29)
u
i
age
i
earns
i
contrib
i
age
i
earns
i
contrib
i
Chapter 8 Heteroskedasticity
265
QUESTION 8.3
Using the OLS residuals obtained from the OLS regression reported
in column (1) of Table 8.1, the regression of u
ˆ
2
on inc yields a t sta-
tistic on inc of .96. Is there any need to use weighted least squares
in Example 8.6?
d 7/14/99 6:18 PM Page 265