Option-3 scheme: A scheme of measurement used in situations investigating possible changes
over time in
longitudinal data
. The scheme is designed to prevent measurement
outliers
causing an unexpected increase in falsely claiming that a change in the data has occurred.
Two measures are taken initially and, if they are closer than a specified threshold, the
average of the two is considered to be an estimate of the true mean; otherwise a third
measurement is made, and the mean of the closest ‘pair’ is considered to be the estimate.
[Statistics in Medicine, 1998, 17, 2607–2615.]
Oracle property: A name given to methods for estimating the regression parameters in models
fitted to
high-dimensional data
that have the property that they can correctly select the
nonzero coefficients with probability converging to one and that the estimators of the
nonzero coefficients are asymptotically normal with the same means and covariances that
they would have if the zero coefficients were known in advance, i.e., the estimators are
asymptotically as efficient as the ideal estimation assisted by an ‘oracle’ who knows which
coefficients are nonzero. [Annals of Statistics, 2005, 32, 928–961.]
Or dered alternative hypothesis: A hypothesis that specifies an order for a set of parameters
of interest as an alternative to their equality, rather than simply that they are not all equal.
For example, in an evaluation of the treatment effect of a drug at several different doses,
it might be thought reasonable to postulate that the response variable shows either a
monotonic increasing or monotonic decreasing effect with dose. In such a case the null
hypothesis of the equality of, say, a set of m means would be tested against
H
1
:
1
2
m
;
using some suitable test procedure such as
Jonckheere’s k
-sample test.[Biostatistics:
A Methodology for the Health Sciences, 2nd edn, 2004, G. Van Belle, L. D. Fisher,
P. J. Heagerty and T. S. Lumley, Wiley, New York.]
Ordered logistic regression: Logistic regression when the response is an
ordinal variable
. See
also proportional odds model.
Order statistics: The ordered values of a collection of random variables, i.e. if X
1
, X
2
, X
3
, ..., X
n
is a collection of random variables with ordered values, X
(1)
≤ X
(2)
≤ ... ≤ X
(n)
then their
rth order statistic is the rth smallest amongst them, X
(r)
and X
(1)
and X
(n)
are, respectively
the sample minimum and maximum. The order statistics are widely used as the basis
of estimators and assessment of fit; for example, two simple statistics based on them are
the sample median and the
alpha (α
)-
trimmed mean
.[Order Statistics, 3rd edn, 2003,
H. A. David and H. N. Nagaraja, Wiley, New York.]
Ordinal variable: A measurement that allows a sample of individuals to be ranked with respect to
some characteristic but where differences at different points of the scale are not necessarily
equivalent. For example, anxiety might be rated on a scale ‘none’, ‘mild’, ‘moderate’ and
‘severe’, with the values 0,1,2,3, being used to label the categories. A patient with anxiety
score of one could be ranked as less anxious than one given a score of three, but patients
with scores 0 and 2 do not necessarily have the same difference in anxiety as patients
with scores 1 and 3. See also categorical variable and continuous variable.
Ordinary least squares (OLS): See least squares estimation.
Ordination: The process of reducing the dimensionality (i.e. the number of variables) of multi-
variate data by deriving a small number of new variables that contain much of the
information in the original data. The reduced data set is often more useful for investigating
possible structure in the observations. See also curse of dimensionality, principal
components analysis and multidimensional scaling. [MV1 Chapter 1.]
312