
202 NONIGNORABLE MISSINGNESS
individuals that drop out for reasons deemed MNAR, S is observed.Foreach
individual, the data augmentation step draws y
∗
i,mis
from the distribution
p(y
mis
| y
i,obs
,s
i
, α), which is simply a conditional normal distribution in
pattern S = s
i
for the pattern mixture model given above.
For individuals that discontinue follow up at time k for reasons deemed
MAR, data augmentation proceeds in two steps. First, draw s
∗
i
from a multi-
nomial distribution having probabilities (φ
∗
i1
,...,φ
∗
i6
), where
φ
∗
ij
= p(S = j | y
i,obs
,S >k,α, φ)
=
φ
j
p(y
i,obs
| S = j, α) I(j>k)
6
j=k+1
φ
j
p(y
i,obs
| S = j, α)
.
Note that φ
∗
ij
=0forj ≤ k.Next,wedrawanewvalueofy
∗
i,mis
as above,
this time conditioning on s
∗
i
and using the distribution p(y
mis
| y
i,obs
,s
∗
i
, α).
Foracomplete analysis of the schizophrenia data, see Hogan and Laird
(1997a); in that paper, an EM algorithm is used, where the E step is very
similar to the data augmentation step described here.
8.4.6 Mixture models or selection models?
With binary (or categorical) data, when the measurement occasions are dis-
crete and dropout is the sole cause of missingness, the duality between mix-
ture and selection models holds for any J, but of course the dimensionality
increases exponentially: for J measurement occasions with J possible dropout
times, the number of unique parameters is J2
J
− 1, and any realistic analysis
must rely on simplifying assumptions to reduce the dimension of the param-
eter space. This raises an obvious question of which factorization to use for
the full-data model p(y, r | ω).
With a selection model, the simplifications must be made in terms of the
full-data response distribution p(y)andtheselection mechanism p(r | y).
Possible strategies include limiting the association structure in the full-data
response distribution (e.g., to include only two-way interactions), or limiting
the selection mechanism such that dropout at t
j
depends only on a small
part of the observed history (say Y
j
and Y
j−1
). For purposes ofsensitivity
analysis, however, this can become problematic because unless p(y)isspecified
nonparametrically, the full-data model parameter ω can be identified from
the observed data, and the choice of an appropriate sensitivity parameter is
often not possible. An alternative is a semiparametric formulation for the full-
data response model Scharfstein et al. (2003), but this approach can present
nontrivial technical complications.
By contrast, model simplifications inmixturemodels may be more feasi-
ble, despite the proliferation of nonidentified parameters. Sensible simplifying
assumptions can be imposed on the observed data,while keeping the distribu-
tion of the missing data indexed by one or more nonidentified parameters. Our