118 4 Super Pairwise Alignment
in which w(A, B;2, 0,n)=
7
20
=0.35 < 0.4 and the values of the other
functions are all greater than 0.6. Therefore, following from Step 4.2.2,
we have
ˆ
1
= −2. We then insert two virtual symbols at the beginning of
sequence B, and we find the local aligned sequences (C
1
,D
1
) as follows:
(
C
1
: ugccuggcgg ccguagcgcg guggucccac cugaccccau gccgaacuca gaagugaaa
D
1
: —ccuaguga caauagcgga gaggaaacac ccgucccauc ccgaacacgg aaguuaag
3. Aligning (C
1
,D
1
). If we put i = j = 22, n = 25 and calculate the sliding
window function, we get w = w(A, B; i, j, n)=
12
25
. Putting h = 10, we get
w
= w(A, B; i + h, j + h, n)=
16
25
. We input w =
16
25
, w
=
12
25
, n = 25,
h = 10 into (4.17), giving
ˆ
i
2
= L
1
+
h
w
− w
3
4
− w
=20+
250
4
3
4
−
12
25
∼ 20 + 17 = 37 .
4. Letting i = j =
ˆ
i
2
, n = 15, we calculate w(A, B; i + , j, n)and
w(A, B; i, j + , n). The results are
w(A, B; i, j +1,n)=
12
15
=0.8 > 0.6 ,
w(A, B; i +1,j,n)=
3
15
=0.2 < 0.3 .
Thus,
ˆ
2
= −1, and the local aligned sequences (C
2
,D
2
)aregivenas:
(
C
2
: ugccuggcgg ccguagcgcg guggucccac cugaccccau gccgaacuca gaagugaaa
D
2
: —ccuaguga caauagcgga gaggaaacac ccgucc-cau cccgaacacg gaaguuaag
5. Let i = j =
ˆ
i
2
,n= 21, and calculate w(A, B; i+1,j,n)=
5
21
=0.24 < 0.3.
We have aligned the entire sequences (C
2
,D
2
). Therefore, (C
2
,D
2
)isthe
uniform alignment of sequences (A, B).
The implications of Example 17: We have performed the alignment of
the pair of E.co and B.st using the SPA, and obtained (C
2
,D
2
), in which
both sequences are of the same length 59. The total penalty is w(C
2
,D
2
)=
21
59
∼ 0.356. This value is much less than 0.75, so we can declare that sequences
E.co and B.st are homologous.
4.2.2 Some Unsolved Problems and Discussions of SPA
Some Unsolved Problems
In the last subsection, we introduced the operation steps of the SPA. Because
of the complexity of biological data, some problems arise while running the
SPA. Therefore, the SPA does not represent the final word, as there are still
unsolved problems in the sense of both theoretical analysis and the design of
the program. For example: