170 NONIGNORABLE MISSINGNESS
model was studied in a simple but very informative empirical example by Ken-
ward (1998). He considered a bivariate full-data response with Y
2
potentially
missing and missing data mechanism given in (8.2), and assessed the sensi-
tivity of inference about ψ to distributional assumptions about p(y
2
| y
1
). If
this distribution was assumed to be normal, the estimates and standard errors
of ψ implied MNAR. However, if this distribution was assumed to follow a
t-distribution with only a few degrees of freedom, the estimates and standard
errors of ψ implied MAR. So by just changing the tails of the distribution
of the full-data response, inference concerning the missing data mechanism
changedsignificantly. This and the previous hypothetical examples illustrate
the prominent role of modeling assumptions in parametric selection models:
widely differing conclusions can be drawn based on unverifiable modeling as-
sumptions about the full-data response.
The preceding discussion has focused entirely on the full-data response
model in identifying potential sensitivity parameters in the missing data mech-
anism. However, we have focusedonthesimple(andcommon) situation where
the response y is entered into the MDM linearly. This is a strong assumption
and is not always appropriate. For example, in the schizophrenia clinical trial
(Section 1.2), it is conceivable that participants may be more likely to drop out
when they are doing much better or much worse, suggesting thatthemissing
data mechanism should be quadratic in y.Revisiting the previous example
with cross-sectional data, assume p(y
obs
)follows the histogram in Figure 8.1,
p(y)isanormaldistribution, and the MDM follows
logit{P (R =1| y)} = ψ
0
+ ψ
1
y + ψ
2
y
2
. (8.3)
This scenario is consistent with an MNAR mechanism having ψ
1
< 0and
ψ
2
=0,because the right tail needs to be filled in for the full-data responses
to be normally distributed (the left tail is already consistent with a normal
distribution). Alternatively, suppose that p(y
obs
)resembled the histogram
in Figure 8.2. The quadratic MDM (8.3) is now consistent with ψ
2
> 0,
which is MNAR. On the other hand, with the same observed data response
distribution, an MDM that is linear in y (ψ
2
=0)isconsistent with ψ
1
=0,
which is MAR.
These simpleexamplesdemonstrate that for a fully specified parametric
selection model, all parameters are identified. Asaconsequence, there are no
obvious sensitivity parameters. By contrast, in an ideal sensitivity analysis, the
distribution of Y
mis
given Y
obs
and R is governed by parameters that affect
the full-data distribution but not the observed-data distribution, so that per-
turbations of these parameter values do not affect fit of the full-data model
to observables. By this criterion, parametric selection models are not well-
suited to sensitivity analysis or to incorporation of prior information about
p(y
mis
| y
obs
, r). A detailed discussion of this point follows in Sections 8.3.3
through 8.3.6; in Section 8.3.7 we introduce semiparametric selection mod-