112
MARKOV
PROCESSES
Note that the trivial solution 7f = 0 is not
of
interest here, as it does not correspond
to a probability distribution (its elements do not sum to
1).
However, there are al-
ways infinitely many solutions to (5.2), so proper solutions can be found by finding
a positive solution and then imposing the unit-sum constraint. In the case
of
a unique
stationary distribution (just one eigenvalue
of
P equal to 1), then there will be a one-
dimensional set
of
solutions to (5.2), and the unique stationary distribution will be
the single solution with positive elements summing to
1.
5.2.4 Convergence
Convergence
of
Markov chains is a rather technical topic, which we do not have time
to examine in detail here. This short section presents a very informal explanation
of
why Markov chains often do converge to their stationary distribution and how the ·
rate
of
convergence can
be
understood.
By
convergence to stationary distribution, we mean that irrespective
of
the start-
ing distribution, 7r(O), the distribution at time n,
1r(n),
will converge to the stationary
distribution,
1r as n tends to infinity.
If
the limit
of
1r(n) exists, it is referred to
as
the equilibrium distribution
of
the chain (sometimes referred to as the limiting dis-
tribution).
Clearly the equilibrium distribution will
be
a stationary distribution, but a
stationary distribution is not guaranteed to
be
an equilibrium distribution. t The rela-
tionship between stationary and equilibrium distributions is therefore rather subtle.
Let
1r
be
a (row) eigenvector
of
P with corresponding eigenvalue
A.
Then
7rP=A7r.
Also 1r
pn
=
An
1r. It is easy to show that for stochastic P we must have I A I
:::;
1 (see
Exercises).
We
also know that at least one eigenvector is equal to 1 (the correspond-
ing eigenvector is a stationary distribution). Let
(1r1,
AI),
(7rz,
Az),
...
,
(7rr,
Ar)
be
the full eigen-decomposition
of
P, with
IAil
in decreasing order, so that A
1
=
1,
and
1r
1
is
a (rescaled) stationary distribution.
To
keep things simple, let us now make
the assumption that the initial distribution 1r(o) can be written in the form
7r(o) = a11r1 +
az1r2
+ · · · +
ar'lrr
for appropriate choice
of
ai
(this might not always
be
possible, but making the as-
sumption keeps the mathematics simple). Then
7r(n) =
71"(0)
pn
=
(a11r1
+
az7rz
+ · · · +
ar'lrr
)Pn
=
a11r1Pn
+ az7rzPn + · · · + ar'lrrPn
=
a1Al'lr1
+
azA27rz
+ · · · +
arA~'lrr
as
n--.
oo,
t This
is
clear
if
there is more than one stationary distribution, but in fact, even in the case
of
a unique
stationary distribution, there might not exist an equilibrium distribution at all.