indicator for drug usage, drugs, has become positive and insignificant, whereas it was negative
(as we expect) and significant in Table 17.4. On the other hand, the work program dummy,
workprg, becomes positive but is still insignificant. The remaining coefficients maintain the
same sign, but they are all attenuated toward zero. The apparent attenuation bias of OLS for the
coefficient on black is especially severe, where the estimate changes from −.543 in the
(appropriate) censored regression estimation to −.00085 in the (inappropriate) OLS regression
using only the uncensored durations.
17.14 (i) When log(wage) is regressed on educ, exper, exper
2
, nwifeinc, age, kidslt6, and kidsge6,
the coefficient and standard error on educ are .0999 (se = .0151).
(ii) The Heckit coefficient on educ is .1187 (se = .0341), where the standard error is just the
usual OLS standard error. The estimated return to education is somewhat larger than without the
Heckit corrections, but the Heckit standard error is over twice as large.
(iii) Regressing
ˆ
on educ, exper, exper
2
, nwifeinc, age, kidslt6, and kidsge6 (using only the
selected sample of 428) produces R
2
.962, which means that there is substantial
multicollinearity among the regressors in the second stage regression. This is what leads to the
large standard errors. Without an exclusion restriction in the log(wage) equation,
ˆ
is almost a
linear function of the other explanatory variables in the sample.
17.15 (i) 185 out of 445 participated in the job training program. The longest time in the
experiment was 24 months (obtained from the variable mosinex).
(ii) The F statistic for joint significance of the explanatory variables is F(7,437) = 1.43 with
p-value = .19. Therefore, they are jointly insignificant at even the 15% level. Note that, even
though we have estimated a linear probability model, the null hypothesis we are testing is that all
slope coefficients are zero, and so there is no heteroskedasticity under H
0
. This means that the
usual F statistic is asymptotically valid.
(iii) After estimating the model P(train = 1|x) = Φ(
β
0
+
β
1
unem74 +
β
2
unem75 +
β
3
age +
β
4
educ +
β
5
black +
β
6
hisp +
β
7
married) by probit maximum likelihood, the likelihood ratio test
for joint significance is 10.18. In a
2
7
distribution this gives p-value = .18, which is very
similar to that obtained for the LPM in part (ii).
(iv) Training eligibility was randomly assigned among the participants, so it is not surprising
that train appears to be independent of other observed factors. (However, there can be a
difference between eligibility and actual participation, as men can always refuse to participate if
chosen.)
(v) The simple LPM results are
171