1.6 Exercises, Analyses, and Computation
Exercise 1. For the RNA sequences E.co and B.st given in Table 4.5, use
the dynamic programming-based algorithm to compute their minimal penalty
alignment based on the Hamming penalty matrix.
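As a starting point, the dynamic programming recursion with traceback can be sketched as follows. This is a minimal sketch, not the book's reference implementation: it assumes that the Hamming penalty is 0 for identical symbols and 1 otherwise, that the gap symbol is written "-", and that a letter aligned with a gap is penalized like a mismatch. The names hamming_penalty and min_penalty_alignment are hypothetical.

```python
GAP = "-"  # assumed gap symbol (the fifth element of V_5)


def hamming_penalty(a, b):
    """Assumed Hamming penalty: 0 for identical symbols, 1 otherwise."""
    return 0 if a == b else 1


def min_penalty_alignment(s, t, penalty=hamming_penalty):
    """Return (minimal penalty, aligned s, aligned t) for sequences s and t."""
    n, m = len(s), len(t)
    # D[i][j] = minimal penalty for aligning the prefixes s[:i] and t[:j]
    D = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        D[i][0] = D[i - 1][0] + penalty(s[i - 1], GAP)
    for j in range(1, m + 1):
        D[0][j] = D[0][j - 1] + penalty(GAP, t[j - 1])
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            D[i][j] = min(
                D[i - 1][j - 1] + penalty(s[i - 1], t[j - 1]),  # pair two letters
                D[i - 1][j] + penalty(s[i - 1], GAP),           # gap in t
                D[i][j - 1] + penalty(GAP, t[j - 1]),           # gap in s
            )
    # Trace back from D[n][m] to recover one optimal alignment.
    a, b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and D[i][j] == D[i - 1][j - 1] + penalty(s[i - 1], t[j - 1]):
            a.append(s[i - 1])
            b.append(t[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and D[i][j] == D[i - 1][j] + penalty(s[i - 1], GAP):
            a.append(s[i - 1])
            b.append(GAP)
            i -= 1
        else:
            a.append(GAP)
            b.append(t[j - 1])
            j -= 1
    return D[n][m], "".join(reversed(a)), "".join(reversed(b))
```

The penalty function is passed as a parameter so that another penalty matrix can be substituted without changing the recursion. Optimal alignments are generally not unique; the traceback above returns only one of them.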
Exercise 2. For the RNA sequences Mc. vanniel and Mb. tautotr given in
Table 4.5, use the dynamic programming-based algorithm to determine the
optimal alignment sequences based on the following requirements:
1. Compute the minimal penalty alignment based on the Hamming penalty
matrix and the WT-penalty matrix, respectively.
2. If $d_H(a, b)$, $a, b \in V_5$, is the Hamming penalty matrix, then
$g_H(a, b) = 1 - d_H(a, b)$, $a, b \in V_5$, is the corresponding scoring matrix.
Compute the maximal score alignment.
3. Compare the computational results of the minimal penalty alignment and
the maximal score alignment.
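The scoring version of the recursion differs from the penalty version only in replacing the minimum by a maximum. The sketch below computes the maximal score value for the scoring matrix g_H = 1 - d_H. It assumes the Hamming penalty is 0 for identical symbols and 1 otherwise, with a letter-gap pair penalized by 1 (and therefore scored 0); the names hamming_score and max_score are hypothetical.

```python
GAP = "-"  # assumed gap symbol


def hamming_penalty(a, b):
    """Assumed Hamming penalty: 0 for identical symbols, 1 otherwise."""
    return 0 if a == b else 1


def hamming_score(a, b):
    """Scoring matrix derived from the penalty: g_H(a, b) = 1 - d_H(a, b)."""
    return 1 - hamming_penalty(a, b)


def max_score(s, t, score=hamming_score):
    """Return the maximal alignment score of s and t (value only)."""
    n, m = len(s), len(t)
    # S[i][j] = maximal score for aligning the prefixes s[:i] and t[:j]
    S = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        S[i][0] = S[i - 1][0] + score(s[i - 1], GAP)
    for j in range(1, m + 1):
        S[0][j] = S[0][j - 1] + score(GAP, t[j - 1])
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            S[i][j] = max(
                S[i - 1][j - 1] + score(s[i - 1], t[j - 1]),  # pair two letters
                S[i - 1][j] + score(s[i - 1], GAP),           # gap in t
                S[i][j - 1] + score(GAP, t[j - 1]),           # gap in s
            )
    return S[n][m]
```

Under these assumptions every column of an alignment contributes a penalty and a score that sum to 1, so the total penalty plus the total score of any alignment equals its number of columns. Because the number of columns varies between alignments, the minimal penalty alignment and the maximal score alignment need not coincide, which is worth keeping in mind for the comparison in item 3.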
Exercise 3. Continuing from Exercise 2, for RNA sequences Mc. vanniel and
Mb. tautotr, compute the optimal alignment by using the dynamic
programming-based algorithm, based on the three criteria of the Hamming penalty
matrix, the WT-penalty matrix and the Hamming scoring matrix, respectively.
Compare the corresponding results.
Exercise 4. For an arbitrary pair of sequences with different lengths, compute
the optimal alignment using the dynamic programming-based algorithm,
based on the penalty matrix and the scoring matrix, respectively.
Exercise 5. Test your dynamic programming-based algorithm for the optimal
pairwise alignment according to the following indices:
1. For data of length 1 kbp, such as the sequences Mc.vanniel and
Mb.tautotr given in Table 4.5, visually check whether the computed
alignments arrive at the target.
2. For data with length 1–100 kbp (such as sequences 1–10 given in Table
4.4), use your dynamic programming-based algorithm to test the relationship
between the length of the sequence and the CPU time required for
computation.
3. Analyze the relationship between similarity and the CPU time required
for computation.
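One possible way to set up the timing experiment is sketched below. It is only an illustration under stated assumptions: random sequences over the alphabet "acgu" stand in for the data of Table 4.4, the penalty is the assumed Hamming penalty (0 for a match, 1 for a mismatch or a gap), and the helper names min_penalty, random_rna and time_alignment are hypothetical. CPU time is measured with Python's time.process_time.

```python
import random
import time


def min_penalty(s, t):
    """Two-row DP for the minimal Hamming penalty (value only, no traceback)."""
    prev = list(range(len(t) + 1))  # row 0: t[:j] aligned against gaps only
    for i, a in enumerate(s, start=1):
        cur = [i]  # column 0: s[:i] aligned against gaps only
        for j, b in enumerate(t, start=1):
            cur.append(min(
                prev[j - 1] + (a != b),  # pair two letters
                prev[j] + 1,             # gap in t
                cur[j - 1] + 1,          # gap in s
            ))
        prev = cur
    return prev[-1]


def random_rna(n, rng):
    """Random test sequence of length n (a stand-in for real data)."""
    return "".join(rng.choice("acgu") for _ in range(n))


def time_alignment(lengths, seed=0):
    """Return a list of (length, CPU seconds) pairs, one per tested length."""
    rng = random.Random(seed)
    rows = []
    for n in lengths:
        s, t = random_rna(n, rng), random_rna(n, rng)
        start = time.process_time()
        min_penalty(s, t)
        rows.append((n, time.process_time() - start))
    return rows


if __name__ == "__main__":
    for n, secs in time_alignment([200, 400, 800]):
        print(f"{n:6d} bp  {secs:.3f} s CPU")
```

Since the table has (n + 1)(m + 1) cells, the CPU time should grow roughly quadratically with the sequence length. For the upper end of the 1–100 kbp range a pure Python loop is likely to be too slow, so a compiled or vectorized implementation may be needed. For item 3, the same harness can be reused by replacing the second random sequence with a mutated copy of the first at a chosen similarity level.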
Hint
1. Data for the two sequences Mc.vanniel and Mb.tautotr given in Table 4.5
may be downloaded from the Web site [99].