increase in complexity would be worthwhile. Hence, I choose the quadratic model as
exhibiting maximum parsimony while capturing most of the nonlinearity in the data.
Additionally, the quadratic model is most consistent with theoretical expectation.
Centering. For models 2 and 3 in Table 5.1, I employ the centered version of age
decile and then form the quadratic term by squaring this centered variable. This has
two advantages. First, it renders the main effect of age decile interpretable in the
quadratic model. The partial slope for age decile in model (5.5), which captures the
“effect” of age decile on sexual frequency, is β 2γ age decile. The main effect of
age decile, or β, is the effect when age decile is zero, since in this case the “2γ age
decile” term disappears. If age decile is uncentered, this effect has no meaning, since
age decile, which begins at the value 1, cannot possibly take on the value zero.
However, if age decile is centered, it is zero whenever age decile is at its mean. Thus,
β is the effect of age decile at its mean. Moreover, the test of significance of b, the
sample estimate of β, is a test for whether the impact of increasing age (in deciles)
is significant at its mean value.
The second advantage of centering has to do with collinearity. Recall from
Chapter 3 that it was important to center the continuous variables involved in cross-
product terms in order to reduce potential collinearity problems. For the same rea-
son, we want to center X before creating higher-order powers of X (e.g., X
2
, X
3
) to
include in the model. As an example, without centering, age and age-squared are
correlated .9737, producing VIF’s (not shown) of 19.26 for each coefficient in the
quadratic model. After centering, the correlation is reduced to .1, and the VIF’s for
each coefficient are only 1.01. At the least, collinearity inflates the sampling variance
of one’s estimators, which tends to reduce the power of tests for the coefficients. In
this case, it has relatively little effect on the estimates, however, and both are quite
significant in the uncentered version of the model as well. In this model (not shown),
the coefficient for age decile, or b, is .422, while the coefficient for (age decile)
2
,or
g,is.065. Although σ
ˆ
g
is no different than for the model using the centered vari-
ables, σ
ˆ
b
is about four times larger in the uncentered, versus the centered, model.
The quadratic effect, .065, is the same in both models. The main effects in the cen-
tered and uncentered models are not directly comparable, since the value of .422 in
the uncentered model is the effect when age decile is zero. To make them compara-
ble, consider the effect of age decile at its mean for the uncentered model. As the
mean of age decile is 5.279, the effect is . 422 2(.065) (5.279) .264, com-
pared to .26 in the centered model. Both models apparently give rise to compara-
ble estimates of age decile’s effect on sexual frequency.
Interpreting Quadratic Models
The use of age decile in place of the continuous variable, age, was necessary for test-
ing various alternatives to the unconstrained model, since it allowed for the creation
of nested models. However, once the quadratic model was chosen, it was reestimated
using continuous age in place of age decile. Again, age was centered prior to taking
its square. The results are shown as model 1 in Table 5.2. The effects of both age and
COMMON NONLINEAR FUNCTIONS OF X 175