220 INFORMATIVE PRIORS AND SENSITIVITY ANALYSIS
The prior for ξ
S
may be informed by a credible source of external infor-
mation, such as expert opinion, historical data, or some combination of the
two. The Bayesiansetupisideal for this because we can, at least in principle,
quantify uncertainty about assumptions through these priors.
Acommon practice for assessing sensitivity to missing data assumptions
is to compare inferences about a parameter of interest under several different
full-data models that are not compatible in their fit to the observed data. For
example, one might compare inferences under a selection model, a pattern
mixture model, and a shared parameter model. It is also common to com-
pare nested models that have different assumptions about the missing data
mechanism, but also have different observed data likelihoods (for example by
fitting model (9.2) and comparing it to the nested model with ψ
2
=0).By
contrast, principles 1.(b) and 1.(c) emphasize that model parameterizations
should allow for exploration of sensitivity to missing data assumptions along
acontinuum of model specifications that have similar or identical fits to the
observed data.
9.3 Parameterizing the full-data model
Recall that any full-data model has an associated extrapolation factorization
(9.1). The component p(y
mis
| y
obs
, r, ω
E
)istheextrapolation distribution,
or the distribution of the missing data given the observed data; without para-
metric assumptions, it cannot be identified by observed data. The component
p(y
obs
, r | ω
O
)istheobserved data distribution and is proportional in ω to
the observed data likelihood.
We seek a parameterization ξ(ω)=(ξ
S
, ξ
M
)suchthat ξ
M
is identified
by observed data and ξ
S
is a sensitivity parameter. Sensitivity analysis and
elicitation of informative priors is then be based on functions of ξ
S
alone
(although this may not be possible in all full-data models).
To make inferences under MNAR, we construct a prior for ξ
S
and draw
inference about the full-data distribution under this prior (Rubin, 1977). In
practice we will frequently introduce a redundant parameter (or vector of
parameters) ∆ that captures explicitly the departures from MAR. The prior
on ξ
S
can then be expressed in terms of ∆ and ξ
M
.Thefollowing example
provides a concrete illustration.
Example 9.1. Mixture model parameterization in terms of (ξ
S
, ξ
M
).
Consider a mixture model for univariate full-data response Y such that
Y | R =1 ∼ N(µ
(1)
,σ
2
)
Y | R =0 ∼ N(µ
(0)
,σ
2
)
R ∼ Ber(φ),
where, as usual, R =1indicates that Y is observed and R =0indicates