
(b)
(a)
(a)
F1 F2
F3
D
F1 F2
D2
D1
B1 B2 Bk
Dm
Fn
FIGURE 5.1
Generic BN structures for medical diagnosis: (a) naive Bayes model; (b) multiply-
connected network.
has values for each of the diseases under consideration, while the F nodes represent
“findings,” which encompass both symptoms and test results. This network reflects
two unrealistic assumptions: that the patient can have only a single disease and that
the symptoms are independent of each other given the disease.
A more realistic, but more complex, model is the multiply-connected network of
Figure 5.1(b). Here there is a Boolean node for each disease under consideration,
while the B nodes represent background information such as the age and sex of the
patient, whether they are a smoker, or have been exposed to pollution. However, in
practice, this structure is likely to be too complex, requiring us to specify probabilis-
tically the combined effect of every disease on each finding.
The network structure in Figure 5.1(b) is essentially that developed in the QMR-
DT project [256, 190], a probabilistic version of the frame-based CPCS knowledge
base for internal medicine. The QMR-DT network had this two-level structure. The
problem of complexity was ameliorated by making it a binary noisy-or model (see
7.4.2). This was done by assuming the effect of a disease on its symptoms and
test results is independent of other diseases and independent of other findings. One
version of the QMR-DT network (described in [222]) had 448 nodes and 908 arcs,
including 74 background nodes (which they called “predisposing factors”) needing
prior probabilities, while the remaining nodes required probabilities to be assessed
for each of their values. In total more than 600 probabilities were estimated, a large
but not an unreasonable number given the scope of the application. Performing ex-
act inference on networks of this size is generally not feasible. Initial work on the
application of likelihood weighting to this medical diagnosis problem is described in
[255], while other approximate methods for QMR-DT are presented in [121].
The ALARM network for monitoring patients in intensive care [17], shown in
Figure 5.2, is a sparsely connected BN consisting of 37 nodes and 42 arcs (the num-
ber of values for each node is shown next to the node name). This network is often
used as a benchmark in the BN literature. Clearly, it does not map neatly into the
generic medical diagnosis structure given above, and it does not provide a template
for building other medical monitoring or diagnostic BNs.
© 2004 by Chapman & Hall/CRC Press LLC