368 Handbook of Chemoinformatics Algorithms
Going back to the 1960s, Edmonds and Matula [19] showed that the maximum
common subtree (MCST, in short) between two unordered trees can be computed in
polynomial time, where a tree T is called a maximum common subtree of T
1
and T
2
if T is a subtree of both T
1
and T
2
and T has the maximum number of nodes. Using
efficient computation of bipartite graph matching, their MCST algorithm works in
O(n
2.5
) time [20]. Furthermore, some extensions of the MCST algorithm to more
general graphs have been studied [21,22]. Since it is often useful to regard glycans as
unordered trees, it is reasonable to develop algorithms for the comparison of glycans
based on the MCST algorithm.
However, in order to develop biologically meaningful algorithms, score matrices
such as PAM and BLOSUM and local alignment should also be taken into account
because these two have been used quite successfully in the analysis of the DNA and
protein sequences [23]. Thus, KCaM algorithms were developed by combining the
MCST algorithm, score matrices, and local alignment.
13.3 GLYCAN STRUCTURES
Glycans are special kinds of chemical compounds. The basic component of glycans
is the monosaccharide unit, or sugar, of which a handful are most common in higher
animal oligosaccharides (see Table 13.1). Each unit is linked to one or more other
monosaccharides by various types of linkages, depending on the anomer (i.e., α or β)
and the hydroxyl group numbers to which they are attached on the monosaccharides.
Most glycans have rooted tree structures, where nodes correspond to monosaccha-
rides and edges correspond to linkages between monosaccharides (see Figure 13.4).
Although glycans might be regarded as ordered trees, it is better to regard them as
unordered trees for providing more flexible matching methods.
There are several classes of glycan based on certain basic patterns mainly in the
core structure (i.e., a substructure near to the root), which include N-glycan, O-glycan,
glycoside, and sphingolipid [1]. Parts of these structures are recognized by various
TABLE 13.1
Common Monosaccharide Names, their
Abbreviations, and their Symbols [4]
Sugar Name Abbreviations Symbols
Glucose Glc
Galactose Gla •
Mannose Man *
N-acetyl neuraminic/sialic acid NeuNAc
N-acetylglucosamine GlcNAc
N-acetylgalactosamine GalNAc
Fucose Fuc ,
Xylose Xyl -