3.1. As is well described by Hacking [1975], the concept of numerical probability
emerged in the mid-17th century. However, its adequate formalization was
achieved only in the 20th century by Kolmogorov [1950]. This formalization is
based on classical measure theory [Halmos, 1950].
3.2. A justified way of measuring uncertainty and uncertainty-based information in
probability theory was established in a series of papers by Shannon [1948]. These
papers, which are also reprinted in the small book by Shannon and Weaver [1949],
opened a way for developing the classical probability-based information theory.
3.3. Various subsets of the axioms for a probabilistic measure of uncertainty that are
presented in Section 3.2.2 were shown to be sufficient for providing the unique-
[1970b], and others. The uniqueness proof presented as Theorem 3.1 is adopted
from a book by Ash [1965]. Excellent overviews of the various axiomatic treat-
ments of Shannon entropy can be found in books by Aczél and Daróczy [1975],
Ebanks et al. [1997], and Mathai and Rathie [1975]. All these books are based
heavily on the use of functional equations. An excellent and comprehensive
monograph on functional equations was prepared by Aczél [1966].
3.4. Several classes of functionals that subsume the Shannon entropy as a special case
have been proposed and studied. They include: