Alternative splicing occurs in all metazoa and is espe-
cially prevalent in vertebrates. In fact, microarray-based
comparisons of the cDNAs obtained from various tissues
indicate that ⬃95% of human structural genes are subject
to at least one alternative splicing event. This rationalizes
the discrepancy between the ⬃23,000 genes identified in
the human genome (Section 7-2Bc) and earlier estimates
that it contains 50,000 to 140,000 structural genes.
The variation in mRNA sequence can take several dif-
ferent forms: Exons can be retained in an mRNA or they
can be skipped; introns may be excised or retained; and the
positions of 5¿ and 3¿ splice sites can be shifted to make ex-
ons shorter or longer. Alterations in the transcriptional
start site and/or the polyadenylation site can further con-
tribute to the diversity of the mRNAs that are transcribed
from a single gene. In a particularly striking example, the
Drosophila protein Dscam (for Down syndrome cell-adhe-
sion molecule), which functions in neuronal development,
is encoded by 24 exons of which there are 12 mutually ex-
clusive variants of exon 4, 48 of exon 6, 33 of exon 9, and 2
of exon 17 (which are therefore known as cassette exons)
for total of 38,016 possible isoforms of this protein (com-
pared to ⬃14,000 identified genes in the Drosophila
genome).Although it is unknown if all possible Dscam iso-
forms are produced, experimental evidence suggests that
the Dscam gene expresses many thousands of them.
[Dscam is a membrane-anchored cell-surface protein of
the immunoglobulin superfamily. The specific isoform ex-
pressed in a given neuron binds to itself but rarely to other
isoforms. This permits the neuron to distinguish its own
processes (axons and dendrites) from those of other neu-
rons and thereby plays an essential role in neural pattern-
ing. However, the precise identity of a given isoform ap-
pears to be unimportant.] Clearly, the number of genes in
an organism’s genome does not by itself provide an ade-
quate assessment of its protein diversity. Indeed, it has
been estimated that, on average, each human structural
gene encodes three different proteins.
The types of changes that alternative splicing confers on
expressed proteins spans the entire spectrum of protein
properties and functions. Entire functional domains or
even single amino acid residues may be inserted into or
deleted from a protein, and the insertion of a stop codon
may truncate the expressed polypeptide. Splice variations
may, for example, control whether a protein is soluble or
membrane bound, whether it is phosphorylated by a spe-
cific kinase, the subcellular location to which it is targeted,
whether an enzyme binds a particular allosteric effector,
and the affinity with which a receptor binds a ligand.
Changes in an mRNA, particularly in its noncoding re-
gions, may also influence the rate at which it is transcribed
and its susceptibility to degradation. Since the selection of
alternative splice sites is both tissue- and developmental
stage-specific,splice site choice must be tightly regulated in
both space and time. In fact, it is estimated that from ⬃15%
to 50% of human genetic diseases are caused by point muta-
tions that result in pre-mRNA splicing defects. Some of these
mutations delete functional splice sites, thereby activating
nearby pre-existing cryptic splice sites. Others generate
new splice sites that are used instead of the normal ones
and yet others are in the genes encoding components of
the splicing machinery. In addition, tumor progression is
correlated with changes in levels of proteins implicated in
alternative splice site selection.
How are alternative splice sites selected? Well-understood
examples of such processes occur in the pathway responsible
for sex determination in Drosophila, two of which we dis-
cuss here:
1. Exon 2 of transformer (tra) pre-mRNA contains two
alternative 3¿ splice sites (which succeed the excised in-
tron),with the proximal (close; to exon 1) site used in males
and the distal (far) site used in females (Fig. 31-66a). The
region between these two sites contains a Stop codon
(UAG). In males, the splicing factor U2AF binds to the
proximal 3¿ splice site to yield an mRNA containing this
premature stop codon, which thereby directs the synthesis
of truncated and hence nonfunctional TRA protein. In fe-
males, however, the proximal 3¿ splice site is bound by the
female-specific SXL protein, the product of the sex-lethal
(sxl) gene (which is only expressed in females), so as to
block the binding of U2AF, which then binds to the distal 3¿
splice site, thereby excising the UAG and inducing the ex-
pression of functional TRA protein (U2AF and TRA both
contain RS domains but not RRMs so that neither is an SR
protein).
2. In doublesex (dsx) pre-mRNA, the first three exons
are constitutively spliced in both males and females. How-
ever, the branch site immediately upstream of exon 4 has a
suboptimal pyrimidine tract to which U2AF does not bind
(Fig. 31-66b). Hence in males, exon 4 is not included in dsx
mRNA, leading to the synthesis of male-specific DSX-M
protein that functions as a repressor of female-specific
genes. However, in females, TRA protein promotes the co-
operative binding of the SR protein RBP1 and the SR-like
protein
TRA2 [the product of the transformer 2 (tra-2)
gene] to six copies of an exonic splice enhancer (ESE)
within exon 4. This heterotrimeric complex recruits the
splicing machinery to the upstream 3¿ splice site of exon 4,
leading to its inclusion in dsx mRNA.The resulting female-
specific DSX-F protein is a repressor of male-specific
genes.
Thus, the synthesis of functional TRA protein involves the
repression of a splice site, whereas the synthesis of female-
specific DSX-F protein involves the activation of a splice
site. Similar mechanisms of alternative splice site selection
have been identified in vertebrates.
In general, the decision as to whether an alternative
exon is kept or eliminated is determined by the activities
and concentrations of its various regulators, many of which
are SR proteins and hnRNPs. Hence the tissue-specific ex-
pression of these regulators and the phosphorylation state
of the SR proteins are important contributors to the com-
plex regulation of mRNA splicing. Moreover, extensive
analysis of the sequences of numerous alternative splice
sites has revealed the existence of a “splicing code” that
uses combinations of over 200 RNA features that are
1318 Chapter 31. Transcription
JWCL281_c31_1260-1337.qxd 8/11/10 9:48 PM Page 1318