This looks like a linear trend model with the intercept
y
0
. But the error, v
th
, while
having mean zero, has variance
2
(t h). Therefore, if we use the linear trend
y
0
(t h) to forecast y
th
at time t, the forecast error variance is
2
(t h), as com-
pared with
2
h when we use
h y
t
. The ratio of the forecast variances is (t h)/h,
which can be big for large t. The bottom line is that we should not use a linear trend to
forecast a random walk with drift. (Problem 18.17 asks you to compare forecasts from
a cubic trend line and those from the simple random walk model for the general fertil-
ity rate in the United States.)
Deterministic trends can also produce poor forecasts if the trend parameters are esti-
mated using old data and the process has a subsequent shift in the trend line.
Sometimes, exogenous shocks—such as the oil crises of the 1970s—can change the tra-
jectory of trending variables. If an old trend line is used to forecast far into the future,
the forecasts can be way off. This problem can be mitigated by using the most recent
data available to obtain the trend line parameters.
Nothing prevents us from combining trends with other models for forecasting. For
example, we can add a linear trend to an AR(1) model, which can work well for fore-
casting series with linear trends but which are also stable AR processes around the trend.
It is also straightforward to forecast processes with deterministic seasonality
(monthly or quarterly series). For example, the file BARIUM.RAW contains the
monthly production of gasoline in the United States from 1978 through 1988. This
series has no obvious trend, but it does have a strong seasonal pattern. (Gasoline pro-
duction is higher in the summer months and in December.) In the simplest model, we
would regress gas (measured in gallons) on eleven month dummies, say for February
through December. Then, the forecast for any future month is simply the intercept plus
the coefficient on the appropriate month dummy. (For January, the forecast is just the
intercept in the regression.) We can also add lags of variables and time trends to allow
for general series with seasonality.
Forecasting processes with unit roots also deserves special attention. Earlier, we
obtained the expected value of a random walk conditional on information through time
n. To forecast a random walk, with possible drift
, h periods into the future at time n,
we use f
ˆ
n,h
ˆ
h y
n
, where
ˆ
is the sample average of the y
t
up through t n. (If
there is no drift, we set
ˆ
0.) This approach imposes the unit root. An alternative
would be to estimate an AR(1) model for {y
t
} and to use the forecast formula (18.55).
This approach does not impose a unit root, but if one is present,
ˆ
converges in proba-
bility to one as n gets large. Nevertheless,
ˆ
can be substantially different than one,
especially if the sample size is not very large. The matter of which approach produces
better out-of-sample forecasts is an empirical issue. If in the AR(1) model,
is less than
one, even slightly, the AR(1) model will tend to produce better long-run forecasts.
Generally, there are two approaches to producing forecasts for I(1) processes. The
first is to impose a unit root. For a one-step-ahead forecast, we obtain a model to fore-
cast the change in y, y
t1
, given information up through time t. Then, because y
t1
y
t1
y
t
,E(y
t1
兩I
t
) E(y
t1
兩I
t
) y
t
. Therefore, our forecast of y
n1
at time n is just
f
ˆ
n
g
ˆ
n
y
n
,
where g
ˆ
n
is the forecast of y
n1
at time n. Typically, an AR model (which is necessar-
ily stable) is used for y
t
, or a vector autoregression.
Part 3 Advanced Topics
606
d 7/14/99 8:36 PM Page 606