FURTHER READING 231
Example 9.7. Sensitivity analysis with semiparametric selection models for
cross-sectional continuous response.
Denote the response as Y and the missing indicator as R. Suppose we specify
the following semiparametric selection model using a mixture of Dirichlet
processes model (cf. Section 3.6):
logit{P (R
i
=1| y
i
, ψ)} = ψ
0
+ ψ
1
y
i
Y
i
| µ
i
,σ
2
∼ N(µ
i
,σ
2
)
µ
i
∼ G
G ∼ DP(G
0
,α), (9.13)
where θ =(G, σ
2
)and(G
0
,α)arefixedhyperparameters. In Scharfstein et al.
(2003), a similar model was specified with a prior of the form (9.10). Clearly,
if no distributional assumptions are made about the full-data response y,the
data will provide no information on themissing data mechanism parameter
ψ
1
,andtheMDPonthe distribution of the full-data response allows ψ
1
to be
essentially unidentified; by contrast, it is identified when using a parametric
model for the full-data response. Here, ψ
1
is a sensitivity parameter. 2
Semiparametric selection models can provide a viable alternative to mixture
models for sensitivity analyses. The underlying factorization into the full-data
response model and the missing data mechanism facilitates elicitation of priors
for parameters indexing the missing data mechanism (e.g., ψ
1
in (9.13)). See
Scharfstein et al. (2003) and Scharfstein et al. (2006) for examples.
The main stumbling block to implementation of these models in general
is the feasibility of their specification for longitudinal data with many time
points, especially if the responses are continuous (cf. Chapter 8). The com-
putational challenges also are nontrivial. For complex longitudinal settings,
the full-data response model can be specified semiparametrically (Daniels and
Scharfstein, 2007). Construction of sensitivity analysis and informative priors
will still be valid as long as (9.12) holds.
9.7 Further reading
Model uncertainty and incomplete data
Copasand Eguchi (2005) also emphasize the importance of characterizing
uncertainty in incomplete data problems, but more from the perspective of
mis-specifying the full-data model. To account for model uncertainty, they
suggest rules for adjusting standard errors and confidence intervals for pa-
rameters of interest. Work by Forster and Smith (1998) is closely related, and
deals with categorical data. Recent work by Gustafson (2006) describes model
expansion and model contraction for handling full-data models that are only
partially identified by observed data. Illustrations related to measurement er-