8.6 Minimization 223
The example shows that regularity on the horizontal level is not captured
by the congruence ≡
L
. One might be tempted to think that it is enough to
require for each label a ∈ Σ, that the set of words occurring as successor word
below the lab el a in some tree of the language is regular. Indeed, this would
exclude the above example but it is not difficult to find a non-regular language
of trees for which ≡
L
is of finite index and this additional condition is also met
(see Exercises).
At the end of Section 8.6.3 we give a refinement of ≡
L
that is sufficient to
characterize the recognizable languages.
8.6.2 Problems for Minimizing the Whole Representation
Minimization becomes a more complex task if we also want to consider the size of
the representations of the transitions. It might be that adding additional states
to the hedge automaton allows splitting transitions such that the representations
for the required horizontal languages become smaller.
If we are interested in unique representations while taking into account also
the horizontal languages, then we should certainly choose a formalism that al-
lows unique representations for the horizontal languages. Therefore we only con-
sider DFHAs with horizontal languages given by deterministic finite automata
in the following.
But even with this assumption it turns out that there are no unique minimal
DFHAs for a given recognizable language. An exact statement of this negative
result, however, would require a precise definition of the size of DFHAs. There
are various reasonable such definitions, and to prove that minimal DFHAs for
a given language are not unique in general, it would be necessary to s how this
for all these definitions.
Here, we restrict ourselves to an explanation of the main reason why DFHAs
are problematic with respect to minimization. In Section 8.6.3 we then introduce
a model that solves this problem.
Example 8.6.3. We consider the language containing the trees of the form
a(b
n
) with n mod 2 = 0 or n mod 3 = 0, i.e. trees of height 1 with a at the
root and b at all the successors, where the number of successors divides 2 or
3. A DFHA recognizing this language can us e two states q
a
and q
b
with rules
b({ε}) → q
b
and a(L) → q
a
with L = {q
n
b
| n mod 2 = 0 or n mod 3 = 0}.
The minimal deterministic automaton (over the singleton alphabet {q
b
}, which
is sufficient for the language under consideration) for L has 6 states because it
has to count modulo 6 for verifying if one of the condition holds.
It is also possible to split the second transition into two transitions: a(L
2
) →
q
a
and a(L
3
) → q
a
with L
2
= {q
n
b
| n mod 2 = 0} and L
3
= {q
n
b
| n mod 3 = 0}.
The two automata for L
2
and L
3
need only 2 and 3 states, respectively. But in
exchange the tree automaton has one transition more.
If we take as size of a DFHA the number of states and the number of tran-
sitions of the tree automaton plus the numb er of states used in the horizontal
languages, then the two DFHAs from above are both minimal for the given
language but they are clearly non-isomorphic.
TATA — November 18, 2008 —