
4.2. An Introduction to Probability 117
Suppose we flip a coin or toss a die. When we refer to the probability of
a certain outcome, such as getting a heads in the coin flip, or a 4 in the die
toss, we mean a number P = P(outcome), with 0 ≤ P ≤ 1, that indicates
the likelihood of that outcome occurring. For instance, if we flip a fair coin,
we would say the probability of the outcome “heads” is
P =
1
2
, or P(heads) =
1
2
,
because we expect to see heads in roughly 1 of every 2 tosses. This does not
mean that if we flip the coin twice we will get one head and one tail, but rather
that if we flipped it a very large number of times, we should find that in about
1
2
of the tosses each outcome occurred. For the die toss, to express the chance
of a 4 turning up, we would say that P(4) =
1
6
, since we expect roughly 1 of
every 6 in a large number of tosses to produce a 4.
We might say that a probability measures the chance of a “random” out-
come occurring. Alternately, we may believe the outcome of a die toss is not
random (it is, after all, governed by the deterministic laws of physics), but
predicting it is too complicated to be practical. With this viewpoint, we are
willing to give up trying to say exactly what will happen with any particular
toss and instead accept a description of how often outcomes are likely to
occur in the long run. More precisely, the probability P of an outcome gives
our expectation of the percentage of trials in which that outcome will occur,
assuming a very large number of trials are performed. The smaller P is, the
less likely we believe an outcome is to occur in any given trial.
Usually, a probability will not indicate exactly what will happen in any
trial. However, there are two exceptions. A probability of P = 1 means an
outcome is sure to happen – it will occur 100% of the time. Likewise, a
probability of P = 0 means the event is sure not to happen.
Do not assume that the probability of a heads in a coin flip is
1
2
just because
there are only two possible outcomes: heads and tails. For a weighted coin,
there are still only two possible outcomes, but it might be that, with such a coin,
we expect to get heads in 80% of the flips and so we have P(heads) = .8.
Such a coin is not “fair,” but it is still capable of being described through
probability. Similarly, for a fair die, the probability of any particular outcome
is
1
6
, but for a weighted die, the probabilities of some of the outcomes might
be more than
1
6
, while for others they are less than
1
6
.
Given a weighted coin, how can we determine the probability of it pro-
ducing an outcome of heads? We simply perform many trials by flipping it
repeatedly. After recording how often heads comes up in these trials, we can