300 11 Alignment of Primary and Three-Dimensional Structures of Proteins
2. Grammar and word-building analysis for protein sequence structures. If
protein sequences are disassembled, many small protein sequence segments
result. These segments are called characteristic chains in biology. If peptide
chains are considered to be the basic building blocks for protein structure
and function, they would correspond to words and phrases of a book. In
order to get an overview of the structure and functions of proteins, one
approach is to study the grammar and syntax. That is, to set up special
databases (databases of grammar and syntax) of the peptide chains whose
structure and function is known, and then search and compare the general
structures of proteins in these databases, so that the general structures
and functions of these proteins can be determined. This problem is still
related to sequence alignment, where the segments are usually short in
length and large in number.
Research on Protein Three-Dimensional Structures
and Their Functions
Protein three-dimensional structure includes secondary structure and other
higher level structures. From biological research, we know that the activity and
function of a protein is related to its three-dimensional structure. If a protein
is heated, it will be denatured and thus lose its activity. It is found biologically
that, for the proteins whose primary structures are different, if they have the
same three-dimensional structures, their functions may be similar. Therefore,
the research on protein three-dimensional structures is closely related to that
of their functions.
The topic of protein three-dimensional structures covers many aspects.
They are:
1. Research on protein secondary structures. In biology, peptide chains with
special structural characteristics are called secondary structures, such as
α helices and β sheets. These structures are common units of protein
general three-dimensional structures, and their combinations determine
the functions of the proteins they make up. Therefore, the research on
protein secondary structures is essential to that of the three-dimensional
structures. The analysis of protein secondary structures focuses on their
prediction, which is to confirm protein secondary structure characteristics
according to their primary sequences.
Many mathematical tools and methods have been used in the research on
protein secondary structure prediction, while so far, the accuracy of the
predictions is far from satisfactory. In the past ten years, the accuracy
of the predictions of α helices and β sheets remained at around 70%. For
some fixed proteins whose types are known, the accuracy of the prediction
has improved. However, because the types are fixed, their definition of the
prediction accuracy is not as good.
2. The main unresolved difficulty present in the analysis of protein three-
dimensional structures is the long distance folding of the three-dimensional