3.7 Total Probability 83
Example 3.11. The four sides of a tetrahedron (regular three-sided pyramid
with four sides consisting of isosceles triangles) are denoted by 2, 3, 5, and
30, respectively. If the tetrahedron is “rolled,” the number on the bottom side
is the outcome of interest. The three events are defined as follows: A – the
number on the bottom side is even, B – the number is divisible by 3, and C
– the number is divisible by 5. The events are pairwise independent, but in
totality, they are dependent.
The algebra is simple here, but what is the intuition? The “trick” is that
events AB, AC, BC, and ABC all coincide. In other words,
P(A|BC) = 1 even
though
P(A|B) =P(A|C) =P(A).
The concept of independence/dependence is not transitive. At first glance,
it may seem incorrect. One may argue, “If A depends on B, and B depends on
C, then A should depend on C, right?” We can demonstrate that this reasoning
is not correct with a simple example.
Example 3.12. Take a standard deck of 52 playing cards and replace the Q
♣
with Q♦. The deck still has 52 cards, two Q♦ and no Q♣. From that deck
draw a card at random and consider three events: A – the card is a queen, B
– the card is red, and C – the card is a
♥. It is easy to see that A and B are
dependent since
P(AB) =3/52 does not equal P(A)·P(B) =4/52·27/52. Events B
and C are dependent as well since event C is contained in B, and
P(BC) =P(C)
does not equal
P(B) ·P(C). However, events A and C are independent since
P(AC) =P(Q♥) =
1
52
=P(A)P(C) =
13
52
·
4
52
.
3.7 Total Probability
The rule of total probability expresses the probability of an event A as the
weighted average of its conditional probabilities. The events that A is condi-
tioned upon need to be exclusive and should partition the sample space
S .
Here are the definitions.
Events H
1
, H
2
,... , H
n
form a partition of the sample space S if
(i) they are mutually exclusive (H
i
·H
j
=;, i 6= j) and
(ii) their union is the sample space
S ,
S
n
i
=1
H
i
=S .
The events H
1
,... , H
n
are usually called hypotheses. By this definition it
follows that
P(H
1
) +···+P(H
n
) =1 (=P(S )).
Let the event of interest A happen under any of the hypotheses H
i
with a
known (conditional) probability
P(A|H
i
). Assume, in addition, that the proba-