Afterword 193
J. T. When you do this averaging what sort of forecast do you get?
R. H. That depends on what quantity you look at, but you ask
exactly the correct question. When we want to judge a forecast,
then we usually compare a forecast with some sort of observation.
For example, when I look at precipitation, the rain gauge collects
the precipitation at a certain site. But one kilometer away there
could be a very different amount of precipitation, because precipita-
tion can be a quite small-scale process. Or even temperature, when
we have, for example, a station close to the sea we can experience
land-sea effects, or we can have observations influenced by other
complex orographic conditions. Such stations can have very par-
ticular circumstances—that is, the atmospheric conditions can vary
quite a lot at short distances. However, the model forecast gives the
average conditions in the 16 by 16 square kilometer grid box. And
that is not necessarily always directly comparable to what is mea-
sured at a particular station. To overcome that, there are two ways.
One way is that we do not compare our forecasts directly with
station observations, but we compare it with the so-called analysis
field. Our analysis represents all of the observations at the differ-
ent stations and brings it into our model grid. That is one way
that we compare—not to the real observations but to the analysis
values, which represent the same scales as the model.
On the other hand, one can go the other way around. One
can say, “OK, I want to correct my forecast, which only repre-
sents the average conditions in the grid box, to represent the
local conditions at the observation site.” This process is called
down-scaling. We distinguish between dynamical and statistical
downscaling. Statistical downscaling, which is closely related to
the post- processing mentioned earlier, is based on error statistics
of so-called training data sets. Basically, we compare forecasts and
station observations from the past in order to find out the system-
atic errors of the model at this particular station. Then we can
correct for the particular characteristics for this station.
J. T. You’re using training data? That must mean that you are
using some sort of machine learning—what is the right terminology?