
Multinomial distribution: A generalization of the
binomial distribution
to situations in which r
outcomes can occur on each of n trials, where r>2. Specifically the distribution is given by
Pðn
1
; n
2
; ...; n
r
Þ¼
n!
n
1
!n
2
! n
r
!
p
n
1
1
p
n
2
2
p
n
r
r
where n
i
is the number of trials with outcome i, and p
i
is the probability of outcome i
occurring on a particular trial. The expected value of n
i
is np
i
and its variance is np
i
ð1 p
i
Þ.
The covariance of n
i
and n
j
is np
i
p
j
. [STD Chapter 26.]
Multinomial logit model: See multinomial logistic regression.
Multinomial logistic regression: A form of
logistic regression
for use when the categorical
response variable has more than two unordered categories. If we let k be the number of
categories of the response variable, Y, then the model used (sometimes called the multi-
nomial logit model) is the following;
p
r
ðxÞ¼
expðx
0
β
r
Þ
P
k
s¼1
expðx
0
β
r
Þ
; r ¼ 1; ...; k
where p
r
ðxÞ¼PrðY ¼ r jxÞ; r ¼ 1; ...; k, x is a vector of explanatory variables and β
r
is a
vector of regression coefficients for category r. Because it is only possible to investigate the
effect of x upon the ‘preference’ of a response category compared to other categories, not all
the parameter vectors are identifiable so it is necessary to choose a reference category,
commonly category k and to take β
k
¼ 0. See also proportional odds model.[Applied
Logistic Regression, 2nd edn, 2000, D. W. Hosmer and S. Lemeshow, Wiley, New York.]
Multinormal distribution: Synonym for multivariate normal distribution.
Multiphasic screening: A process in which tests in
screening studies
may be performed in
combination. For example, in cancer screening, two or more anatomic sites may be screened
for cancer by tests applied to an individual during a single screening session. [American
Journal of Public Health, 1964, 54, 741–50.]
Mu lt ipl e compa r iso n tests: Procedures for detailed examination of the differences between a set
of means, usually after a general hypothesis that they are all equal has been rejected. No
single technique is best in all situations and a major distinction between techniques is how
they control the possible inflation of the type I error. See also Bonferroni correction,
Duncan’s multiple range test, Scheffé’s test and Dunnett’s test.[Biostatistics: A
Methodology for the Health Sciences , 2nd edition, 2004, G. Van Belle, L. D. Fisher,
P. J. Heagerty and T. S. Lumley, Wiley, New York.]
Multiplecorrelationcoefficient: The correlation between the observed values of the dependent
variable in a
multiple regression
, and the values predicted by the estimated regression
equation. Often used as an indicator of how useful the explanatory variables are in predicting
the response. The square of the multiple correlation coefficient gives the proportion of
variance of the response variable that is accounted for by the explanatory variables. [SMR
Chapter 12.]
Multipleendpoints: A term used to describe the variety of outcome measures used in many
clinical
trials
. Typically there are multiple ways to measure treatment success, for example, length of
patient survival, percentage of patients surviving for two years, or percentage of patients
experiencing tumour regression. The aim in using a variety of such measures is to gain better
overall knowledge of the differences between the treatments being compared. The danger
290