and fi nally,
RS
RNH
2
RCOO
RO
Using these relationships, it is obvious that the strongest nucleophile in protein molecules
is the sulfhydryl group of cysteine, particularly in the ionized, thiolate form. Next in line are
the amine groups in their uncharged, unprotonated forms, including the -amines at the
N-terminals, the -amines of lysine side chains, the secondary amines of histidine imidazolyl
groups and tryptophan indol rings, and the guanidino amines of arginine residues. Finally, the
least potent nucleophiles are the oxygen containing ionizable groups including the -carboxy-
late at the C-terminal, the -carboxyl of aspartic acid, the -carboxyl of glutamic acid, and the
phenolate of tyrosine residues.
According to the theoretical pK
a
values for the ionizable side chains of amino acids, nucle-
ophilic substitution reactions involving primary amines or sulfhydryl groups on proteins should
not be effi cient below a pH of about 8.5 ( Table 1.1 ). In practice, however, reactions can be done
with these groups in high yield at pH values not much higher than neutrality. This discrepancy
relates to the changes in pK
a
due to microenvironmental effects experienced by the residues
within the three-dimensional structure of the protein molecule. In reality, the -amine groups on
lysine side chains within proteins, having theoretical pK
a
s of over 10, nonetheless exist in suf-
fi cient quantity in an unprotonated form even at a pH of 7.2 that modifi cation easily occurs.
One important point should be noted, however. The changes that occur in the pK
a
of ioniz-
able groups in protein molecules due to microenvironmental effects sometimes make it diffi cult
to select certain residues for modifi cation simply by careful modulation of reaction pH. For
instance, at least in theory, overlap of the pK
a
range for sulfhydryls and amine-containing resi-
dues would eliminate any chance of directing a reaction toward SH groups solely by adjust-
ing the pH of the reaction medium. However, because of the microenvironmental changes that
occur in complex biomolecules, pH sometimes can be used along with the right reactive group
to target thiols without amine modifi cation. Thus, in practice, to effectively site-direct a modifi -
cation reaction, the proper choice of reactive group and reaction conditions can result in highly
discrete conjugation to certain sites within proteins.
Secondary, Tertiary, and Quaternary Structure
Amino acids are linked through peptide bonds to form long polypeptide chains. The primary
structure of protein molecules is simply the linear sequence of each residue along the -chain.
Each amino acid in the chain interacts with surrounding groups through various weak, non-
covalent interactions and through its unique side chain functionalities. Noncovalent forces such
as hydrogen bonding and ionic and hydrophobic interactions combine to create each protein ’s
unique organization.
It is the sequence and types of amino acids and the way that they are folded that provides
protein molecules with specifi c structure, activity, and function. Ionic charge, hydrogen bond-
ing capability, and hydrophobicity are the major determinants for the resultant three-dimen-
sional structure of protein molecules. The -chain is twisted, folded, and formed into globular
structures, -helicies, and -sheets based upon the side-chain amino acid sequence and weak
intramolecular interactions such as hydrogen bonding between different parts of the peptide
1. Modifi cation of Amino acids, Peptides, and Proteins 15