Discrete uniform distribution: A probability distribution for a discrete random variable that
takes on k distinct values x
1
; x
2
; ...; x
k
with equal probabilities where k is a positive integer.
See also lattice distributions.[A Primer on Statistical Distributions, 2003, N. Balakrishnan
and V.B. Neizorow, Wiley, New York.]
Discrete variabl es: Variables having only integer values, for example, number of births, number of
pregnancies, number of teeth extracted, etc. [SMR Chapter 2.]
Discrete wavelet transfo rm (DWT ): The calculation of the coefficients of the
wavelet series
approximation
for a discrete signal f
1
; f
2
; ...; f
n
of finite extent. Essentially maps the vector
f
0
¼½f
1
; f
2
; ...; f
n
to a vector of n
wavelet transform coefficients
.[IEEE Transactions on
Pattern Analysis and Machine Intelligence, 1989, 11, 674–93.]
Discriminant analysis: A term that covers a large number of techniques for the analysis of multi-
variate data that have in common the aim to assess whether or not a set of variables
distinguish or discriminate between two (or more) groups of individuals. In medicine, for
example, such methods are generally applied to the problem of using optimally the results
from a number of tests or the observations of a number of symptoms to make a diagnosis that
can only be confirmed perhaps by post-mortem examination. In the two group case the most
commonly used method is
Fisher’s linear discriminant function
, in which a linear function of
the variables giving maximal separation between the groups is determined. This results in a
classification rule
(often also known as an
allocation rule
) that may be used to assign a new
patient to one of the two groups. The derivation of this linear function assumes that the
variance–covariance matrices
of the two groups are the same. If they are not then a
quadratic
discriminant function
may be necessary to distinguish between the groups. Such a function
contains powers and cross-products of variables. The sample of observations from which the
discriminant function is derived is often known as the
training set
. When more than two
groups are involved (all with the same variance–covariance matrix) then it is possible to
determine several linear functions of the variables for separating them. In general the number
of such functions that can be derived is the smaller of q and g 1 where q is the number of
variables and g the number of groups. The collection of linear functions for discrimination
are known as
canonical discriminant functions
or often simply as
canonical variates
. See also
error rate estimation and regularised discriminant analysis. [MV2 Chapter 9.]
Discrimination information: Synonymous with Kullback-Leibler information.
Disease clusters: An unusual aggregation of health events, real or perceived. The events may be
grouped in a particular area or in some short period of time, or they may occur among a
certain group of people, for example, those having a particular occupation. The significance
of studying such clusters as a means of determining the origins of public health problems has
long been recognized. In 1850, for example, the Broad Street pump in London was identified
as a major source of cholera by plotting cases on a map and noting the cluster around the
well. More recently, recognition of clusters of relatively rare kinds of pneumonia and
tumours among young homosexual men led to the identification of acquired immunodefi-
ciency syndrome (AIDS) and eventually to the discovery of the human immunodeficiency
virus (HIV). See also scan statistic.[Statistics in Medicine, 1995, 14, 799–810.]
Diseasemapping: The process of displaying the geographical variability of disease on maps using
different colours, shading, etc. The idea is not new, but the advent of computers and
computer graphics has made it simpler to apply and it is now widely used in descriptive
epidemiology
, for example, to display morbidity or mortality information for an area.
Figure 55 shows an example. Such mapping may involve absolute rates, relative rates,
etc., and often the viewers impression of geographical variation in the data may vary quite
138