Order Condition: A necessary condition for identifying
the parameters in a model with one or more endogenous
explanatory variables: the total number of exogenous
variables must be at least as great as the total number of
explanatory variables.
Ordinal Variable: A variable where the ordering of the val-
ues conveys information but the magnitude of the values
does not.
Ordinary Least Squares (OLS): A method for estimating
the parameters of a multiple linear regression model. The
ordinary least squares estimates are obtained by minimiz-
ing the sum of squared residuals.
Outliers: Observations in a data set that are substantially
different from the bulk of the data, perhaps because of
errors or because some data are generated by a different
model than most of the other data.
Out-of-Sample Criteria: Criteria used for choosing fore-
casting models that are based on a part of the sample that
was not used in obtaining parameter estimates.
Overall Significance of a Regression:A test of the joint
significance of all explanatory variables appearing in a
multiple regression equation.
Over Controlling: In a multiple regression model, includ-
ing explanatory variables that should not be held fixed
when studying the ceteris paribus effect of one or more
other explanatory variables; this can occur when vari-
ables that are themselves outcomes of an intervention or
a policy are included among the regressors.
Overdispersion: In modeling a count variable, the variance
is larger than the mean.
Overidentified Equation: In models with endogenous
explanatory variables, an equation where the number of
instrumental variables is strictly greater than the number
of endogenous explanatory variables.
Overidentifying Restrictions: The extra moment conditions
that come from having more instrumental variables than
endogenous explanatory variables in a linear model.
Overspecifying a Model: See inclusion of an irrelevant
variable.
P
p-Value: The smallest significance level at which the null
hypothesis can be rejected. Equivalently, the largest sig-
nificance level at which the null hypothesis cannot be
rejected.
Pairwise Uncorrelated Random Variables: A set of two or
more random variables where each pair is uncorrelated.
Panel Data: A data set constructed from repeated cross sec-
tions over time. With a balanced panel, the same units
appear in each time period. With an unbalanced panel,
some units do not appear in each time period, often due
to attrition.
Parameter: An unknown value that describes a population
relationship.
Parsimonious Model: A model with as few parameters as
possible for capturing any desired features.
Glossary 867
Partial Derivative: For a smooth function of more than one
variable, the slope of the function in one direction.
Partial Effect: The effect of an explanatory variable on the
dependent variable, holding other factors in the regres-
sion model fixed.
Percent Correctly Predicted: In a binary response model,
the percentage of times the prediction of zero or one
coincides with the actual outcome.
Percentage Change: The proportionate change in a vari-
able, multiplied by 100.
Percentage Point Change: The change in a variable that is
measured as a percentage.
Perfect Collinearity: In multiple regression, one indepen-
dent variable is an exact linear function of one or more
other independent variables.
Plug-In Solution to the Omitted Variables Problem: A
proxy variable is substituted for an unobserved omitted
variable in an OLS regression.
Point Forecast: The forecasted value of a future outcome.
Poisson Distribution: A probability distribution for count
variables.
Poisson Regression Model: A model for a count dependent
variable where the dependent variable, conditional on the
explanatory variables, is nominally assumed to have a
Poisson distribution.
Policy Analysis: An empirical analysis that uses economet-
ric methods to evaluate the effects of a certain policy.
Pooled Cross Section: A data configuration where indepen-
dent cross sections, usually collected at different points in
time, are combined to produce a single data set.
Pooled OLS Estimation: OLS estimation with indepen-
dently pooled cross sections, panel data, or cluster
samples, where the observations are pooled across time
(or group) as well as across the cross-sectional units.
Population: A well-defined group (of people, firms, cities,
and so on) that is the focus of a statistical or econometric
analysis.
Population Model: A model, especially a multiple linear
regression model, that describes a population.
Population R-Squared: In the population, the fraction of
the variation in the dependent variable that is explained
by the explanatory variables.
Population Regression Function: See conditional
expectation.
Positive Definite: A symmetric matrix such that all qua-
dratic forms, except the trivial one that must be zero,
are strictly positive.
Positive Semi-Definite: A symmetric matrix such that all
quadratic forms are nonnegative.
Power of a Test: The probability of rejecting the null hypothe-
sis when it is false; the power depends on the values of the
population parameters under the alternative.
Practical Significance: The practical or economic impor-
tance of an estimate, which is measured by its sign and
magnitude, as opposed to its statistical significance.
Prais-Winsten (PW) Estimation: A method of estimating
a multiple linear regression model with AR(1) errors and
strictly exogenous explanatory variables; unlike