M issing information principle: This principle states that the missing information in data with
missing values
is equal to the difference between the
information matrix
for complete data
and the observed information matrix. [Proceedings of the 6th Berkeley Symposium on
Mathematical Statistics and Probability, 1972, 1, 697–715.]
Missing values : Observations missing from a set of data for some reason. In
longitudinal studies
, for
example, they may occur because subjects drop out of the study completely or do not appear
for one or other of the scheduled visits or because of equipment failure. Common causes of
subjects prematurely ceasing to participate include recovery, lack of improvement,
unwanted signs or symptoms that may be related to the investigational treatment, unpleasant
study procedures and intercurrent health problems. Such values greatly complicate many
methods of analysis and simply using those individuals for whom the data are complete can
be unsatisfactory in many situations. A distinction can be made between values missing
completely at random (MCAR), missing at random (MAR) and non-ignorable (or informa-
tive). The MCAR variety arise when individuals drop out of the study in a process which is
independent of both the observed measurements and those that would have been available
had they not been missing; here the observed values effectively constitute a
simple random
sample
of the values for all study subjects. Random drop-out (MAR) occurs when the drop-
out process depends on the outcomes that have been observed in the past, but given this
information is conditionally independent of all future (unrecorded) values of the outcome
variable following drop-out. Finally, in the case of informative drop-out, the drop-out
process depends on the unobserved values of the outcome variable. It is the latter which
cause most problems for the analysis of data containing missing values. See also last
observation carried forward, attrition, imputation, multiple imputation and Diggle–
Kenward model for drop-outs.[Analysis of Longitudinal Data, 2nd edition, 2002,
P. J. Diggle, P. J. Heagerty, K.-Y. Liang and S. Zeger, Oxford Science Publications, Oxford.]
Misspecification: A term applied to describe assumed statistical models which are incorrect for one
of a variety of reasons, for example, using the wrong probability distribution, omitting
important covariates, or using the wrong
link function
. Such errors can produce inconsistent
or inefficient estimates of parameters. See also White’s information matrix test and
Hausman misspecification test.[Biometrika, 1986, 73, 363–9.]
Mitofsky^Waksberg scheme: See telephone interview surveys.
M itscherlich curve: A curve which may be used to model a
hazard function
that increases or
decreases with time in the short term and then becomes constant. Its formula is
hðtÞ¼ βe
γt
where all three parameters, ; β and γ, are greater than zero. [Australian Journal of
Experimental Agriculture, 2001, 41, 655–61.]
Mixed data: Data containing a mixture of continuous variables, ordinal variables and categorical
variables.
Mixed-effects logistic regression: A generalization of standard
logistic regression
in which
the intercept terms, α
i
are allowed to vary between subjects according to some probability
distribution, f ðαÞ. In essence these terms are used to model the possible different
frailties
of
the subjects. For a single covariate x, the model often called a random intercept model, is
logit½Pðy
ij
jα
i
; x
ij
Þ ¼ α
i
þ βx
ij
where y
ij
is the binary response variable for the jth measurement on subject i, and x
ij
is the
corresponding covariate value. Here β measures the change in the conditional logit of the
280