182 NONIGNORABLE MISSINGNESS
Fitzmauriceand Laird (2000) developed moment-based approaches based on
mixtures of generalized linear models. Roy (2003) and Roy and Daniels (2007)
addressed the issue of having a large number of dropout categories by using
mixtures over latent classes. Hogan et al. (2004a) developed approaches for
continuous dropout times based on mixtures of varying coefficient models.
Reviews of model-based approaches can be found in Little (1995), Hogan and
Laird (1997b), Kenward and Molenberghs (1999), Fitzmaurice (2003), and
Hogan et al. (2004b).
Another key thread of research concerns model identification. Here, Molen-
berghs and colleagues have developed an important body of work for the case
of discrete-time dropout (cf. Molenberghs et al., 1998; Kenward et al., 2003),
much of which isdescribed in this section.
The mixture model approach factors the full-data model as
p(y, r | x, ω)=p(y | r, x, ω) p(r | x, ω). (8.16)
The full-data response distribution is obtained by averaging (8.16) over the
distribution of r,
p(y | x, ω)=
r∈R
p(y | r, x, ω) p(r | x, ω),
where R is the sample space of R (see also Section 5.9.2). In this section,
we describe some specific formulations that give a broad representation of
the settings where the models can be used, and describe formal methods for
identifying them. In general, when R is discrete, the component distributions
comprise the set {p(y | r, x):r ∈ R}.Whenmissingness is caused by dropout
at some time U ,thenthe component distributions p(y | u, x)mayalsobea
discrete set of distributions, or may be specified in terms of a continuous u.
To maintain focus on key ideas related to specification and identification, we
defer discussion of covariates to Section 8.4.7.
Mixture models for either discrete or continuous dropout time are under-
identified and require specific constraints for fitting to data. There are various
approaches to identifying the models; in this section, we focus on strategies
that divide the full-data distribution into identified and nonidentified compo-
nents (see the extrapolation factorization (8.1)).
In the case of discrete-time dropout, the MAR assumption is used as a
basis for identifying the full-data distribution and for building a larger class of
models that accommodate MNAR. For continuous-time dropout (or discrete-
time dropout with large number of support points), models can be identified
by making assumptions about the mean of the full-data response as a function
of time; e.g., assuming E{Y (t) | U = u} is linear in t,withintercept and
slope depending on U.Usually one assumes that given dropout at U = u,
mean response prior toandafterU — i.e., E{Y (t) | U = u, t < u} and
E{Y (t) | U = u, t ≥ u} —areeither equivalent or related through some