DUMMY VARIABLES
23
instead of k when you split the sample. Thus the numerator of the F statistic is the same for both tests.
The denominator is also the same because in both cases it is the residual sum of squares for the
subsample regressions divided by n – 2k. In the case of the Chow test, 2k degrees of freedom are used
up when fitting the separate regressions. In the case of the dummy variable group test, k degrees of
freedom are used up when estimating the original intercept and slope coefficients, and a further k
degrees of freedom are used up estimating the intercept dummy and the slope dummy coefficients.
What are the advantages and disadvantages of the two approaches? The Chow test is quick. You
just run the three regressions and compute the test statistic. But it does not tell you how the functions
differ, if they do. The dummy variable approach involves more preparation because you have to
define a dummy variable for the intercept and for each slope coefficient. However, it is more
informative because you can perform t tests on the individual dummy coefficients and they may
indicate where the functions differ, if they do.
Exercises
6.16*
Are educational attainment functions are different for males and females? Using your EAEF
data set, regress S on ASVABC, ETHBLACK, ETHHISP, SM, and SF (do not include MALE).
Repeat the regression using only the male respondents. Repeat it again using only the female
respondents. Perform a Chow test.
6.17
Are earnings functions different for males and females? Using your EAEF data set, regress
LGEARN on S, ASVABC, ETHBLACK, and ETHHISP (do not include MALE). Repeat the
regression using only the male respondents. Repeat it again using only the female respondents.
Perform a Chow test
.
6.18*
Are there differences in male and female educational attainment functions? This question has
been answered by Exercise 6.16 but nevertheless it is instructive to investigate the issue using
the dummy variable approach. Using your EAEF data set, define the following slope dummies
combining MALE with the parental education variables:
MALESM = MALE*SM
MALESF = MALE*SF
and regress S on ETHBLACK, ETHHISP, ASVABC, SM, SF, MALE, MALEBLAC, MALEHISP
(defined in Exercise 6.15), MALEASVC (defined in Exercise 6.12), MALESM, and MALESF.
Next regress S on ETHBLACK, ETHHISP, ASVABC, SM, and SF only. Perform an F test of the
joint explanatory power of MALE and the slope dummy variables as a group (verify that the F
statistic is the same as in Exercise 6.16) and perform t tests on the coefficients of the slope
dummy variables in the first regression.
6.19
Where are the differences in male and female earnings functions? Using your EAEF data set,
regress LGEARN on S, ASVABC, ETHBLACK, ETHHISP, MALE, MALES, MALEASVC,
MALEBLAC, and MALEHISP. Next regress LGEARN on S, ASVABC, ETHBLACK, and
ETHHISP only. Calculate the correlation matrix for MALE and the slope dummies. Perform an
F test of the joint explanatory power of MALE and the slope dummies (verify that the F statistic
is the same as in Exercise 6.17) and perform t tests on the coefficients of the dummy variables.