23Amaro Forests - Chap 20 25/7/03 11:08 am Page 227
227 Logistic Regression in Modelling Mortality
related to
π
(x). The model’s statistical evaluation is done on the basis of several cri-
teria developed to guide in choosing the best model from different candidates.
Assessing the fit is usually done with the Pearson
χ
2
statistic and the deviance. The
Hosmer–Lemeshow test looks for the correctness of the model in grouping probabil-
ities in risk deciles. The classification table describes the model’s goodness of fit
with the correct classification rate. The receiver operating characteristic (ROC) curve
(Hosmer and Lemeshow, 2001) is based on sensitivity and specificity measures. A
high ROC curve describes the good prediction ability of the model. The area under
the curve is interpreted as the model’s R
2
.
Multilevel logistic regression with random effect
Multilevel logistic regression models for hierarchically structured data have become
mor
e common in recent studies of social sciences and medicine (e.g. Hox, 1995;
Goldstein and Rasbash, 1996; Carlin et al., 1999; Fielding, 2000). Forestry applica-
tions are few (Jalkanen, 2001), and published tree mortality models, for example,
have been constructed from multilevel data, but in parameter estimation the data
structure has been ignored.
Multilevel binary data can be analysed using a logistic regression model which
is composed of fixed covariates fr
om different hierarchy levels and random effects
(Hosmer and Lemeshow, 2001; McCulloch and Searle, 2001). The random effects not
related to the lowest level of hierarchy are generally assumed to follow normal dis-
tribution with mean zero and constant variance. Here the random stand effect
describes the unmeasured variance between the stands in case of mortality. The
error variance (tree-level variance) is assumed to be:
2
(2)
xl
π
x
φπ
()
(
−
()
)
where
φ
is the dispersion scale (or extra-binomial variance). If
φ
= 1, then the error
variance is purely binomial. If
φ
> 1, the situation is called over-dispersion. Unlike
under-dispersion (
φ
< 1), over-dispersion is fairly common, and may occur when
data are collected from experiments with a hierarchical design (McCullagh and
Nelder, 1989). If the dispersion scale is forced to be equal to 1, the potential over-
dispersed variance is added to the random effect variance.
The ML and restricted maximum likelihood (REML) methods are relevant
when analysing linear multilevel models with normal distribution, but logistic
r
egression models with random effects need special quasi-likelihood estimation
methods. The key point is to estimate the effect of the covariates on the binary out-
come with awareness of the random effect(s). The idea of quasi-likelihood methods
is that only the model for the mean (response and covariates) and the relationship
between the mean and the variance is necessary for the estimation. Two quasi-likeli-
hood methods have been developed for estimation: the marginal quasi-likelihood
(MQL) and the penalized quasi-likelihood (PQL) methods. Today these methods are
available in most statistical software packages. However, Rodriguez and Goldman
(1995) have pointed out that in certain circumstances the first-order MQL estimates
are badly biased, sometimes more biased than those obtained by ML without ran-
dom effects. The PQL method has also been noticed to have a major problem in
practice if the cluster size is small (McCulloch and Searle, 2001). Alternatives to the
ML approach are, for instance, GEE, generalized estimating equations (Zeger et al.,
1988) or Bayesian methods (MCMC, bootstrapping) (e.g. Goldstein, 1995; Browne
and Draper, 2002), which will not be addressed in this chapter.