112 MISSING DATA MECHANISMS
inference, information about this distribution will derive at least in part from
parameters that are informed solely by prior distributions.
To summarize, mixture models have the advantage that in many cases, one
can find full-data parameters indexing the distribution of missing responses
that are not identified by observed data. In one sense, this makes inferences
about the full-data more transparent in that the source of information about
the posterior is clear. A potential downside of the models relates to practical
implementation: in the previous example, we confine attention to a bivari-
ate distribution with no covariates, and even in this simple case there are
three unidentified parameters. As the dimension of Y increases, so will the
dimension of the unidentified parameter. In our case studies in Chapter 10,
we address this concern by illustrating sensible ways to reduce the dimension
of nonidentified parameters indexing the extrapolation distribution.
5.9.3 Shared parameter models
Athirdapproach to specifying the full data distribution is to use an explicitly
multilevel formulation, frequently called a shared parameter model,whereran-
dom effects b are modeled jointly with Y and R.Insomecases the random
effects can be used to explain the association or to capture multiple sources
of variation. Key papers tracing the development of these models include Wu
and Carroll (1988); Follmann and Wu (1995); Pulkstenis et al. (1998), and
Henderson et al. (2000). De Gruttola and Tu (1994), Wulfsohn and Tsiatis
(1997), and Faucett and Thomas (1996) use similar formulations but with the
objective of modeling a survival process as a function of stochastic longitudinal
covariates.
The general form of the full-data model using a shared parameter ap-
proach is
p(y, r | x, ω)=
p(y, r, b | x, ω) db. (5.22)
Specific shared parameter models are formulated by making assumptions
aboutthe joint distribution under the integral sign. Notice in (5.22) that
the full-data parameter ω also includes parameters indexing the distribution
of the random effects.
Advantages to the SPM model include simplified specification for the re-
sponse and missingness components. When Y is measured with error, SPMs
provide a useful way for missingness to depend on the error-free version of
Y ,represented in terms of the random effects (this is a type of ‘random-
coefficient-dependent missingness’; see Wu and Carroll, 1988, and Little, 1995).
Another potential advantage is that through the use of random effects, SPMs
can be used to handle high-dimensional or multilevel response data. A disad-
vantage is that except in simple settings, the underlying missing data mecha-