20 REGRESSION MODELS
the linear predictor
η
i
= η(x
i
, β)=x
i
β,
where β =(β
1
,...,β
p
)
T
is a vector of regression coefficients. Define µ
i
=
µ(x
i
, β)=E(Y | x
i
, β). A smooth, monotone function g links the mean µ
i
to the linear predictor η
i
via
g(µ
i
)=η
i
= x
i
β. (2.1)
In many exponential family distributions, it is possible to identify a link
function g such that X
T
Y is the sufficient statistic for β (here, X is the
n × p design matrix and Y =(Y
1
,...,Y
n
)
T
is the n × 1vector of responses).
In this case, the canonical parameter is θ = η.Examplesarewell-known and
widespread: for the Poisson distribution, the canonical parameter is log(µ);
for binomial distribution, it is the log odds (logit), log{µ/(1 − µ)}.
Although canonical links are sometimes convenient, their use is not neces-
sary to form a GLM. In general, only the specification of a mean and variance
function, conditionally on covariates, is required. The mean follows (2.1), and
the variance is given by
v(µ
i
,φ)=φh(µ
i
),
where h(·)issomefunction of the mean and φ>0isascalefactor.Certain
choices of g and h will yield likelihood score equations for common parametric
regression models based on exponential family distributions. For example,
setting g(µ)=log{µ/(1 − µ)}, h(µ)=µ(1 − µ)andφ =1yieldslogistic
regression under a Bernoulli distribution. Similarly, Poisson regression can be
specified by setting g(µ)=log(µ), h(µ)=µ,andφ =1.
2.4 Conditionally specified models
This section focuses on models that specify the mean of Y given X con-
ditionally on random effects; these models also are known by a variety of
names, including ‘mixed effects models’,‘multilevel models’, ‘random effects
models’, and ‘random coefficient models’. Throughout the text, we use the
terms ‘random effects’ and ‘multilevel’ models. Readers arereferred to Bres-
lowand Clayton(1993) and Daniels and Gatsonis (1999) for a more complete
accounting and list of references. Thisclassofmodels also includes regres-
sion models with factor-analytic and latent class structures (see Bartholomew
and Knott, 1999 for a full account) and Markov models (where the mean is
specified conditional on a subset of past responses).
Conditionally specified models with multiple levels, using random effects
or latent variables, provide a highly flexible class of models for handling lon-
gitudinal data. The models can be applied either to balanced or unbalanced
response profiles and can be used to capture key features of both between-
and within-subject variation using relatively few parameters.