34 Probability distributions
applets.
3
I consider here an experimental example of CLT proof, using one such web
tool.
4
Rolling dice and adding spots
This is an experiment similar to that described earlier in Chapter 1 and illustrated in
Figs. 1.1 and 1.2. As we know, each individual die has a uniform discrete distribution
defined by p(x) = 1/6 with x
i
= 1, 2, 3, 4, 5, 6 being the event space. If we roll two
dice and sum up the spots, the event space is x
i
= 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 and the
probability distribution has a triangular (or witch’s hat) shape centered about <x> = 7,
see Fig. 1.2. Interestingly, the CLT does not apply to the two-dice case, the limiting
PDF being still the witch hat, as can be easily verified. As a simplified explanation, this
is because the event space is too limited. But when using three dice or more, the CLT
is observed to apply, as illustrated in Fig. 2.9. The figure shows results obtained with
fivedicefork = 10, 100, 1000, 10 000, and 100 000. It is seen that as k increases, the
resulting histogram envelope progressively takes a bell shape. Ultimately, the histogram
takes a symmetrical (discretized) bell shape limited to the 26 = 30–5+ 1 integer
bins of the event space. A 1000-dice experiment with an adequate number of trials k
would yield a similar histogram with 5001 = 6000 − 1000 + 1 discrete bins, which
is much closer to the idea of a smooth envelope, albeit the resulting PDF is discrete,
not continuous. Only an infinite number of dice and an infinite number of rolls could
provide a histogram match of the limiting Gaussian (normal) envelope.
To complete the illustration of the CLT, consider the school game of a pegboard matrix,
also known as a pinball machine, bean machine, quincunx, or Galton box, and whose
principle is at the root of the Japanese gambling parlors called Pachinko. At each step
of the game, a ball bounces on a peg (or a nail) to choose a left or right path randomly,
according to a uniform, two-valued distribution. The triangular arrangement of pegs or
nails makes it possible to repeat the ball’s choice as many times as there are rows (n)
in the matrix. At the bottom and after the final row, the ball rests in a single bin, which
is associated with some reward or gain. It is easily established that the probability for
the ball to be found in a given bin k follows the binomial distribution, Eq. (2.9). If the
number n of rows becomes large, and for a sufficiently large number of such trials,
the histogram distribution of balls into the bottom bins takes a Gaussian-like envelope,
which represents a nice, mechanical schoolroom illustration of the CLT.
This concludes the second chapter on probability basics. Most of the mathematical
tools that are required to approach information theory have been described in these two
chapters.
3
See, for instance:
www.stat.sc.edu/∼west/javahtml/CLT.html,
www.rand.org/statistics/applets/clt.html,
www.math.csusb.edu/faculty/stanton/m262/central
limit theorem/clt old.html,
www.ruf.rice.edu/%7Elane/stat
sim/sampling dist/index.html,
www.vias.org/simulations/simusoft
cenlimit.html.
4
I am grateful to Professor Todd Ogden for permission to reproduce the simulation results obtained from his
web tool in www.stat.sc.edu/∼west/javahtml/CLT.html.