388 Discounted Dynamic Programming Under Uncertainty
set ψ
−
(F) ≡{s ∈ S : ψ(s) ∩ F = φ} belongs to the Borel sigmafield S
of S. We show that ψ
−
(F) is closed. Let s
n
∈ ψ
−
(F) and s
n
→ s. Then
ψ(s
n
) ∩ F = φ for all n. Choose some a
n
∈ ψ(s
n
) ∩ F. By compact-
ness of A, there is some subsequence (a
n
) converging to some a. Since
F is closed, a ∈ F. Since (s
n
, a
n
) converges to (s, a) and ψ is upper
semicontinuous, a ∈ ψ(s). Hence a ∈ ψ(s) ∩ F, and this implies that
s ∈ ψ
−
(F).
We shall refer to
ˆ
f as a measurable selection from the correspon-
dence ϕ.
Example 3.1 Let S = A = [0, 1], and ϕ(s) = A = [0, 1] for all s. Then
ϕ is clearly continuous. Define u(s, a) = sa.Fors = 0, ψ(s) consists of
a single element: ψ (s) = 1. For s = 0, ψ(s) = [0, 1]. Hence, ψ(s)isnot
lower semicontinuous at s = 0. For all s ∈ S, m(s) = s.
6.4 Dynamic Programming with a Compact Action Space
In this section, we deal with a more general framework under the follow-
ing assumptions on S, A, q, and u:
[A.1] S is a (nonempty) Borel subset of a Polish (complete, separable
metric) space;
[A.2] A is a compact metric space;
[A.3] u is a bounded continuous function on S × A; and
[A.4] if s
n
→ s, a
n
→ a, then q(.|s
n
, a
n
) converges weakly to q(.|s, a).
The main result asserts the existence of a stationary optimal policy ζ
∗
=
(
ˆ
f
(∞)
); moreover, it is shown that the value function V (≡ I (
ˆ
f
(∞)
) which
satisfies the functional equation is a bounded continuous function on S.
Let us first note a preliminary result which follows from [A.3].
Lemma 4.1 Let w : S →
R be a bounded continuous function. Then
g : S × A →
R defined by g(s, a) =
!
w(.) dq(.|s, a) is continuous.
Recall that C
b
(S) is the class of all bounded continuous functions on
S.Forv, v
∈ C
b
(S), we use, as before, the metric d(v, v
) =v − v
=
sup
s∈S
|v(s) −v
(s)|. The metric space (C
b
(S), d) is complete.