
454.3 What Do DNAs Do?: Protein Synthesis
Well it can distinguish 4 × 4 = 16 names. It is still not enough. It turned out that a
set (called “code”) of three nucleotides (of A, C, G, and T) specifies an amino
acid. Code is like a three-letter word. Sixty-four different codes or words can be
obtained by using three letters out of the four letters; this is more than enough.
There is some redundancy; that is, several different sets specify a same amino
acid. Questions such as “how has the genetic code developed” or “is there any
chemical basis for the genetic code” are very interesting, but have not yet been
answered.
To talk about the genetic code in detail, we need to delve into further details of
protein formation process. DNA does not directly dictate the sequence of amino
acids in a protein. The information (i.e., sequence) in DNA first needs to be tran-
scribed into a nucleic acid of another type, that is, RNA (ribonucleic acid). The
DNA is a permanent copy, and the corresponding RNA is a sort of temporary copy
from which the cell makes a protein. Chemically, RNA is very similar to DNA, but
it has more varied functions than DNA. The details are not important here. The dif-
ferences between DNA and RNA are twofold: (1) RNAs use ribose instead of
deoxyribose (see Fig. 4.2); and (2) the bases used are A, C, G, and U. That is, U,
uracil, is used instead of thymine. Uracil has a very similar structure to thymine, and
it combines with A, adenine, like in DNA.
A portion of DNA (usually only a very small portion of a very large (long) DNA)
specifies a protein. When that protein is needed, the portion of DNA will be copied
onto an RNA molecule (called messenger RNA, m-RNA). This portion of a DNA is
called the gene for the protein. Therefore, a large DNA molecule has on it a number
of genes for different proteins.
Like the replication process of DNA, the A, G, C, and T sequence of that portion
of DNA is then duplicated using A, C, G, and U (of ribonucleotides) this time, and
the connection (polymerization of A, C, G, and U) is made by an enzyme called RNA
polymerase. This process is called “transcription.” The next step in making a protein
is to use this copy of an m-RNA and to “translate” the information on the m-RNA
into the sequence of amino acids in a protein. This is done in the following manner.
A set of three ribonucleotides out of A, C, G, and U specifies an amino acid of a
protein. Such a set is called a “codon.” Therefore, the sequence of m-RNA can now
specify the sequence of amino acids in a protein. Examples are as follows. UUU and
UUC are codons for phenylalanine; CUU, CUA, CUG, and CUC (all CUX) for
leucine; CAU and CAC for histidine. Glycine is specified by GGX (X = A, C, G, and
U). GAU and GAC specify aspartic acid. Interestingly, some codes UAA, UAG, and
UGA are used to indicate where to stop. The AUG code is used to specify where to
start, but it specifies an amino acid methionine, if it occurs in the middle of a gene.
Now how are these codes translated into amino acids? An amino acid binds to a
special kind of RNA called transfer-RNA (t-RNA for short); there is a specific
t-RNA for each and every amino acid. A t-RNA for an amino acid has in it a set of
three consecutive nucleotides called anticodon that binds specifically to the codon
for that amino acid on m-RNA. The binding between a codon on m-RNA and the
corresponding anticodon on t-RNA is again due to the specific hydrogen bonding
similar to that in the formation of double helix of DNA.