
FULL-DATA MODELS UNDER MNAR 109
in y,theparameter ψ
2
indexing association between R and the partially
observed Y
2
is identifiable from observed data. This feature of the model
places considerable importance on assumptions that cannot be verified, and
has the potential to make sensitivity analysis problematic.
5.9.2 Mixture models
Mixture models represent another way to factor the full-data model so that
R depends on Y
mis
.Incontrast to selection models, the MM approach uses
the factorization
p(y, r | x, ω)=p(y | r, x, ω) p(r | x, ω). (5.18)
As with the selection model, it is sometimes useful to partition the parameter
space as ω =(α, φ), where α and φ,respectively, index the two factors in
(5.18). (Recall that the decomposition ω =(θ, φ)required for ignorability
implicitly refers to a selection model factorization of the full-data model, so
the partition ω =(α, φ)isnotequivalent.) The full-data response model is a
mixture
p(y | x, α, φ)=
r∈R
p(y | r, x, α) p(r | x, φ).
The missing data mechanism can be derived using Bayes’ rule,
p(r | y, x, α, φ)=
p(y | r, x, α) p(r | x, φ)
p(y | x, α, φ)
. (5.19)
Despite its seeming intractability, in some circumstances the missing data
mechanism implied by a mixture model actually takes a closed form, as in the
mixture of normals model in Example 5.9.
One criticism of mixture models, notreadily apparent when studying the
case of bivariate data with missingness in Y
2
only, is thatthehazard of miss-
ingness at time t
j
can depend not only on the potentially missing observation
Y
j
, but also on future measurements Y
j+1
,...,Y
J
.Inonesense this is a reason-
able criticism because in a stochastic process it does not seem sensible for the
future to predict the past. However, the construction of mixture models asks
for specification of the full-data distribution conditional on different dropout
times or patterns. It seems entirely sensible, in consideringtheassociation
between dropout and response, to specify whether and how the distribution
of (say) the vector (Y
1
,Y
2
,Y
3
)
T
differs between those who drop out after one
measurement and those who record complete follow-up. If desired, mixture
models can be constrained such that, conditionally on past responses, hazard
of dropout at time t
j
depends only on the past and current (but possibly
missing) response, and conditionally on those, does not depend on future re-
sponses (Kenward et al., 2003). Further discussion of this point is provided in
Section 8.4.2, and an illustrative comparison of mixture models where hazard