7.5. Application of Graph Theory to Studies of RNA Structureand Function 231
and functional similarity among existing RNAs [965], identify RNA motifs of
antibiotic-binding aptamers (found synthetically) in genomes [702–704], analyze
the structural diversity of random pools used for in vitro selection of RNAs [450],
simulate aspects of the process of in vitro selection in silico [643, 644, 646],
and analyze RNA thermodynamics landscapes to better understand riboswitch
mechanisms to ultimately enhance their design [1028].
Other applications of RAG in the community which include graph theory ex-
tensions involve classification and prediction of ncRNAs [503,633,801,901,902,
1181], various RNA structure analyses [50,78,148,170,533,534,538,982,1041],
and various applications of graph theory [82,416,470,471,538,761,1324,1431].
Examples of graph extensions include labeled dual graphs [633] and directed
tree graphs [503]. Our application of spectral theory to catalog RNA graphs
has also been extended to other biological and physical systems [82, 416, 470,
471,761,1324,1431]. See Kim’s thesis for a more detailed descriptions on these
applications [642].
RNA Structure Enumeration
Cataloging based on graph theory enumeration suggests that the RNA structure
universe is dominated (more than 90%) by pseudoknots, in agreement with avail-
able data [645], as also discussed in [78,170,982,1041]. Significantly, the existing
RNA classes represent only a small subset of possible 2D RNA motifs as enumer-
ated by graph index; some of these motifs may be natural while others may be
possible to generate in the laboratory. Still, others may not exist.
RNA-Like Motifs
The usage of clustering techniques to separate graphs that are ‘RNA-like’ from
those that do not resemble natural RNAs also led to predictions of many new
RNA-like motifs, including ten specific examples of sequences that might lead
to novel-like RNA topologies [645], as shown in Figure 7.13. Some of these
motifs predicted in 2004 have since been solved: C1 in mammalian CPEB3 ri-
bozyme [1084], C2 in a purine riboswitch [819], C3 in the tymovirus/Pomovirus
tRNA-like 3
UTR element [840], C4 in the tombusvirus 3
UTR region IV [1463],
and C7 in the flavivirus DB element [235]. Significantly, the predicted and actual
sequences have between 45 to 51% homology.
Graph theory tools are also natural for comparing RNA structures to find ex-
isting RNA motifs within large RNAs based on graph isomorphisms [965]. This
idea was applied to identify topological similarities among existing RNA classes
and to define motifs of RNA within larger RNA topologies for major RNA classes
(e.g., tRNA, tmRNA, hepatitis delta virus RNA, 5S, 16S, 23S rRNAs).
Furthermore, the representation of RNAs as graphs led to identification of RNA
motifs in genomes. Since natural aptamers exist in many bacterial genomes and
other organisms, it appeared likely that natural counterparts of synthetic motifs
exist in vivo. This led to development of an efficient search tool for identifying
small RNA motifs in genomes by exploiting many artificial motifs derived from