tive rows. That is, there are 5, 4, 5, 2 possible sequences of length 2 that begin
in states s
1
, s
2
, s
3
, s
4,
respectively. Similarly, the sums of the entries in the indi-
vidual columns of the matrix are equal to the number of possible sequences
of length 2 that terminate in states assigned to the respective columns. The
same results apply to sequences of lengths 3, 4, and so on, but we need to
perform, respectively,
and so on.
Determining the number of possible sequences for n Œ ⺞
10
and calculating
the average predictive nonspecificity for each n in this range by Eq. (2.36), we
obtain the following sequence of predictive nonspecificities: 1, 2, 2.95, 4.11,
5.16, 6.21, 7.27, 8.32, 9.37, 10.43. As expected, the predictive nonspecificities
increase with n. This means qualitatively that long-term predictions by a non-
deterministic system are less specific than short-term predictions by the same
system.
Assume now that only one state, s
i
, is possible at time t and we want to cal-
culate again the nonspecifity in predicting the sequence of states of length n.
In this case,
As already mentioned, the number of sequences of states of length n that begin
with state s
i
, which we need for this calculation, is obtained by adding the
entries in the respective row of the matrix resulting from the required chain
of n - 1 matrix products. For s
t
= s
1
in our example and n Œ ⺞
10
, we obtain the
following predictive nonspecificities: 1.58, 2.32, 3.46, 4.46, 5.55, 6.58, 7.65, 8.70,
9.75, 10.81. As expected from the high initial nonspecificity H(S
t+1
| {s
1
}), all
these values are above average. On the other hand, the following values for s
t
= s
4
are all below average: 0, 1, 2, 3.17, 4.17, 5.25, 6.29, 7.35, 8.40, 9.45.
EXAMPLE 2.2. Consider the same system and the same types of predictions
as in Example 2.1. However, let the focus in this example be on the predictive
informativeness of the system rather than its predictive nonspecificity. That is,
the aim of this example is to calculate the amount of information contained
in each prediction of a certain type made by the system. In each case, we need
to calculate the maximum amount of predictive nonspecificity, obtained in the
face of total ignorance, and the actual amount of predictive nonspecificity asso-
ciated with the prediction made by the system. The amount of information
provided by the system is then defined as the difference between the maximum
and actual amounts of predictive nonspecificity.
In general, the distinguishing feature of total ignorance within the classical
possibility theory is that all recognized alternatives are possible. In our
example, the recognized alternatives are transitions from states to states, each
.