PHYSICAL BASIS OF
SECONDARY STRUCTURE
An impressive number of secondary structure prediction
methods can be found in the literature and on the web.
Surprisingly, almost all are based on empirical like-
lihoods or neural nets; few are based on physico-
chemical theory.
In one such theory, secondary structure propensities
are predominantly a consequence of two competing
local effects – one favoring hydrogen bond formation in
helices and turns, and the other opposing the attendant
reduction in sidechain conformational entropy upon
helix and turn formation. These sequence-specific biases
are densely dispersed throughout the unfolded polypep-
tide chain, where they serve to pre-organize the folding
process and largely, but imperfectly, anticipate the native
secondary structure.
WHY AREN’T SECONDARY STRUCTURE
PREDICTIONS BETTER?
Currently, the best methods for predicting helix and
sheet are correct about three-quarters of the time. Can
greater success be achieved?
Several measures to assess predictive accuracy are in
common use, of which the Q3 score is the most
widespread. The Q3 score gives the percentage of
correctly predicted residues in three categories: helix,
strand, and coil (i.e., everything else):
Q3 ¼
number of correctly predicted residues
total number of residues
£ 100
where the “correct” answer is given by a program
to identify secondary structure from coordinates,
e.g., DSSP. At this writing, (Position-Specific
PREDiction algorithm) PSIPRED has an overall Q3
score of 78%.
Is greater prediction accuracy possible? It has
been argued that prediction methods fail to achieve a
higher rate of success because some amino acid
sequences are inherently ambiguous. That is, these
“conformational chameleons” will adopt a helical
conformation in one protein, but the identical sequence
will adopt a strand conformation in another protein.
Only time will tell whether current efforts have encoun-
tered an inherent limit.
SEE ALSO THE FOLLOWING ARTICLES
Amino Acid Metabolism † Multiple Sequence Align-
ment and Phylogenetic Trees † Protein Data Resources †
X-Ray Determination of 3-D Structure in Proteins
GLOSSARY
a
-helix The best-known element of secondary structure in which the
polypeptide chain adopts a right-handed helical twist with 3.6
residues per turn and an i ! i 2 4 hydrogen bond between
successive amide hydrogens and carbonyl oxygens.
b
-strand An element of secondary structure in which the chain
adopts an extended conformation. A
b
-sheet results when two or
more aligned
b
-strands form inter-strand hydrogen bonds.
Chou – Fasman Among the earliest attempts to predict protein
secondary structure from the amino acid sequence. The method,
which uses a database of known structures, is based on the
empirically observed likelihood of finding the 20 different amino
acids in helix, sheet or turns.
DSSP The most widely used method to parse x; y; z-coordinates for a
protein structure into elements of secondary structure.
hydrophobicity A measure of the degree to which solutes, like amino
acids, partition spontaneously between a polar environment (like
the outside of a protein) and an organic environment (like the inside
of a protein).
hydrophobicity profile A method to predict the location of peptide
chain turns from the amino acid sequence by plotting averaged
hydrophobicity against residue number. The method does not
require a database of known structure.
neural network A pattern recognition method – adapted from
artificial intelligence – that has been highly successful in predicting
protein secondary structure when used in conjunction with an
extensive database of known structures.
peptide chain turn A site at which the protein changes its overall
direction. The frequent occurrence of turns is responsible for
the globular morphology of globular (i.e., sphere-like) proteins.
secondary structure The backbone structure of the protein, with
particular emphasis on hydrogen bonded motifs.
tertiary structure The three-dimensional structure of the protein.
FURTHER READING
Berg, J. M., Tymoczko, J. L., and Stryer, L. (2002). Biochemistry, 5th
edition. W.H. Freeman and Company, New York.
Holm, L., and Sander, C. (1996). Mapping the protein universe.
Science 273, 595–603.
Hovmo
¨
ller, S., Zhou, T., and Ohlson, T. (2002). Conformation of
amino acids in proteins. Acta Cryst. D58, 768–776.
Jones, D. T. (1999). Protein secondary structure based on position-
specific scoring matrices. J. Mol. Biol. 292, 195– 202.
Mathews, C., van Holde, K. E., and Ahern, K. G. (2000). Biochemi-
stry, 3rd edition. Pearson Benjamin Cummings, Menlo Park, CA.
Richardson, J. S. (1981). The anatomy and taxonomy of protein
structure. Adv. Prot. Chem. 34, 168–340.
Rose, G. D., Gierasch, L. M., and Smith, J. A. (1985). Turns in peptides
and proteins. Adv. Prot. Chem. 37, 1–109.
Voet, D., and Voet, J. G. (1996). Biochemistry, 2nd edition. Wiley,
New York.
BIOGRAPHY
George Rose is Professor of Biophysics and Director of the Institute for
Biophysical Research at Johns Hopkins University. He holds a Ph.D.
from Oregon State University. His principal research interest is in
protein folding, and he has written many articles on this topic. He
serves as the consulting editor of Proteins: Structure, Function and
Genetics and as a member of the editorial advisory board of Protein
Science. Recently, he was a Fellow of the John Simon Guggenheim
Memorial Foundation.
6 SECONDARY STRUCTURE IN PROTEIN ANALYSIS