
(6) Phylogenetic trees can be us ed to reconstruct genealogies from molecular data, not
just for genes, but also for species of organisms. A phylogenetic tree, or dendrogram,
is an acyclic two-dimensional graph of the evolutionary relationship among three or
more genes or organisms. A phylogenetic tree is composed of nodes and branches
connecting them.
9.5 Problems
9.1. Genome access and multiple sequence alignment Tumor necrosis factor (TNF)-
related apoptosis inducing ligand (TRAIL) is a naturally occurring protein that
specifically induces apoptosis in many types of cancer cells via binding to “death
receptors.” Obtain the amino acid sequences for the TRAIL protein for human,
mouse, and three other species from the Entrez cross-database search at the National
Center for Biotechnology Information (NCBI) website (www.ncbi.nlm.nih.gov/sites/
gquery). Perform a multiple sequence alignment using the ClustalW and TCoffee
algorithms; these can be accessed from many different online resources. What regions
of the TRAIL protein are most highly conserved between different species?
9.2. Dot plot comparison of two genomic sequences Compare the human TRAIL
protein sequence from Problem 9.1 to the sequence for human tumor necrosis factor
(TNF)-α.TNF-α is an important inflammatory cytokine with cytotoxic effects at
sufficient concentration. Generate a dot plot and interp ret the diagram in terms of
any common domains that may be revealed.
9.3. Phylogenetics The following six amino acid sequences are related by a common
ancestor:
EYGINEVV, EVGINAER, EVGINEYR, ENGINRNR, ENLYNEYR, ENGINRYI
Construct a rooted phylogenetic tree for these related sequences, identify the
sequence of the common ancestor, and calculate the distances between the lineages.
References
Altschul, S. F., Madden, T. L., Schaffer, A. A., Zhang, J., Zhang, Z., Miller, W., and
Lipman, D. J. (1997) Gapped BLAST and PSI-BLAST: A New Generation of Protein
Database Search Programs. Nucleic Acids Res., 25, 3389–402.
Carrillo, H. and Lipman, D. (1988) The Multiple Sequence Alignment Problem in Biology.
SIAM J. Appl. Math., 48, 1073–82.
Giasson, B. I., Murray, I. V. J., Trojanowski, J. Q., and Lee, V. M.-Y. (2001) A Hydrophobic
Stretch of 12 Amino Acid Residues in the Middle of a-Synuclein Is Essential for Filament
Assembly. J. Biol. Chem., 276, 2380–6.
Green, C. E., Pearson, D. N., Camphausen, R. T., Staunton, D. E., and Simon, S. I. (2004)
Shear-dependent Capping of L-selectin and P-selectin Glycoprotein Ligand 1 by E-
selectin Signals Activation of High-avidity Beta 2-integrin on Neutrophils. J. Immunol.,
284, C705–17.
Hughey, R. and Krogh, A. (1996) Hidden Markov Models for Sequence Analysis: Extension
and Analysis of the Basic Method. CABIOS, 12, 95–107.
Ivetic, A., Florey, O., Deka, J., Haskard, D. O., Ager, A., and Ridley, A. J. (2004) Mutagenesis
of the Ezrin-Radixin-Moesin Binding Domain of L-selectin Tail Affects Shedding,
Microvillar Positioning, and Leukocyte Tethering. J. Biol. Chem., 279, 33 263–72.
Kim, J., Pramanik, S., and Chung, M. J. (1994) Multiple Sequence Alignment Using
Simulated Annealing. Bioinformatics, 10, 419–26.
558
Basic algorithms of bioinformatics