Chapter 18 Advanced Time Series Topics 667
where
is the drift term (usually
0), and each u
tj
has zero mean given I
t
and con-
stant variance
2
. As we saw earlier, the forecast of y
th
at time t is E(y
th
I
t
)
h y
t
,
and the forecast error variance is
2
h. What happens if we use a linear trend model? Let
y
0
be the initial value of the process at time zero, which we take as nonrandom. Then, we
can also write
y
th
y
0
(t h) u
1
u
2
… u
th
y
0
(t h) v
th
.
This looks like a linear trend model with the intercept
y
0
. But the error, v
th
,while
having mean zero, has variance
2
(t h). Therefore, if we use the linear trend
y
0
(t h) to forecast y
th
at time t, the forecast error variance is
2
(t h), compared
with
2
h when we use
h y
t
. The ratio of the forecast variances is (t h)/h,which can
be big for large t. The bottom line is that we should not use a linear trend to forecast a
random walk with drift. (Computer Exercise C18.8 asks you to compare forecasts from a
cubic trend line and those from the simple random walk model for the general fertility rate
in the United States.)
Deterministic trends can also produce poor forecasts if the trend parameters are esti-
mated using old data and the process has a subsequent shift in the trend line. Sometimes,
exogenous shocks—such as the oil crises of the 1970s—can change the trajectory of trend-
ing variables. If an old trend line is used to forecast far into the future, the forecasts can
be way off. This problem can be mitigated by using the most recent data available to obtain
the trend line parameters.
Nothing prevents us from combining trends with other models for forecasting. For
example, we can add a linear trend to an AR(1) model, which can work well for forecast-
ing series with linear trends but which are also stable AR processes around the trend.
It is also straightforward to forecast processes with deterministic seasonality (monthly
or quarterly series). For example, the file BARIUM.RAW contains the monthly production
of gasoline in the United States from 1978 through 1988. This series has no obvious trend,
but it does have a strong seasonal pattern. (Gasoline production is higher in the summer
months and in December.) In the simplest model, we would regress gas (measured in gal-
lons) on 11 month dummies, say, for February through December. Then, the forecast for
any future month is simply the intercept plus the coefficient on the appropriate month
dummy. (For January, the forecast is just the intercept in the regression.) We can also add
lags of variables and time trends to allow for general series with seasonality.
Forecasting processes with unit roots also deserves special attention. Earlier, we
obtained the expected value of a random walk conditional on information through time n.
To forecast a random walk, with possible drift
, h periods into the future at time n,we
use f
ˆ
n,h
ˆ
h y
n
,where
ˆ
is the sample average of the y
t
up through t n. (If there
is no drift, we set
ˆ
0.) This approach imposes the unit root. An alternative would be
to estimate an AR(1) model for {y
t
} and to use the forecast formula (18.55). This approach
does not impose a unit root, but if one is present,
ˆ
converges in probability to one as n
gets large. Nevertheless,
ˆ
can be substantially different than one, especially if the sam-
ple size is not very large. The matter of which approach produces better out-of-sample
forecasts is an empirical issue. If in the AR(1) model,
is less than one, even slightly, the
AR(1) model will tend to produce better long-run forecasts.