12 Combinatorics of Compositions and Words
namely the length of the longest increasing subsequence, by considering ran-
dom words. However, the foundation of research on pattern avoidance in
words was laid in Burstein’s thesis [31], where he gave results for generating
functions for the number of k-ary words of length n that avoid a set of pat-
terns T ⊂S
3
. Various generalizations followed, to pattern avoidance in words
for multi-permutation subsequence patterns [33, 105] or pairs of more general
patterns of varying length. Mansour [141] considered the special case of words
that avoid both the pattern 132 and some arbitrary pattern, which leads to
generating functions that are expressed in terms of continued fractions and
Chebyshev polynomials of the second kind. Many of these papers generalize
methods used for permutations avoiding permutation patterns, but several
authors utilized different methods, such as automata theory [27], the ECO
method [23], and the scanning-element algorithm [65]. Another variation was
considered by Dollhopf et al. [61], who enumerated words that avoid a set of
patterns of length two that describes a reflexive, acyclic relation.
Several authors considered other types of patterns besides subsequence pat-
terns and also studied enumeration instead of avoidance. Burstein and Man-
sour [34] considered the enumeration of words containing a subword pattern
of length exactly r times, while Kitaev and Mansour [109] studied partially
ordered generalized patterns. Burstein and Mansour [35] also enumerated all
generalized patterns of length three. A somewhat related paper by Burstein
et al. [32] investigated the packing density for subword patterns, generalized
patterns and superpatterns.
The most recent papers investigated a wide range of topics. Kitaev et al.
[110] enumerate the number of rises, levels and falls in words that have a
prescribed first element, while Prodinger [166] enumerated words according
to the number of records (= sum of the positions of the rises, or in the ter-
minology of MacMahon, the lesser index). Myers and Wilf [158] investigated
left-to-right maxima in words and multiset permutations. Knopfmacher et
al. [116] and Mansour [144] looked at smooth words and smooth partitions,
respectively, and obtained connections with Chebyshev polynomials of the
second kind. Finally, Mansour enumerated words [143] and words that avoid
certain patterns [142] according to the longest alternating subsequence.
Another variation is avoidance or occurrence of substrings in words. R´egnier
and Szpankowski [170] used a combinatorial approach to study the frequency
of occurrences of strings from a given set in a random word, when overlapping
copies of the strings are counted separately (see [170, Theorem 2.1]). Various
examples on enumeration of substrings in words are included in general texts
on combinatorics (see for example [75, 182]). This research area is also treated
in the third volume of the Lothaire series [132] which focuses on applied
combinatorics of words, for example genetics, where words are used to describe
DNA fragments (see Example 1.23). The reader who is interested in the
algorithmic aspects and the statistics connected to substrings in words will
find this third volume a good starting point. We will restrict our focus to
patterns that encode relations as opposed to a specific string of letters.
© 2010 by Taylor and Francis Group, LLC