Eukaryotes encode long precursor RNA transcripts, which are spliced, polyadenylated and capped to produce a messenger RNA that can be translated.
| Event | Where? | Overview |
|---|---|---|
| Capping | Nucleus | Guanosine is added to the 5′ end of the mRNA for stability, transport and translation initiation. |
| Polyadenylation | Nucleus | The 3’ end of the RNA is cleaved and a Poly-A tail is added by the polyadenylation complex. |
| Splicing | Cytoplasm | During splicing, introns are excised and exons are ligated together by the very large spliceosome. |
The spliceosome is a hugely massive complex that catalyzes excision of introns and ligation of exons. In addition to many non-snRNP subunits, the spliceosome consists of 5 small nuclear ribonucleoproteins (snRNPs) that assemble stepwise onto each intron. The snRNPs are small and are composed of: a small nuclear RNA (snRNA) with a specific sequence and secondary structure; and some common and some specific proteins.
The five snRNPs of the spliceosome: U1, U2, U4 and U5 are transcribed by RNAP II and have a TriMethyl G Cap; U6 is transcribed by RNAP III and has an unusual Cap. In the U4/U6 Di-snRNP and the U4/U5/U6 Tri-snRNP, the U4 and U6 snRNAs are base-paired to each other. This interaction is later disrupted in the formation of the active spliceosome.
Remember that the splice sites do not always perfectly match the consensus sequences. Thus, the base-pairing interactions between the snRNAs and the pre-mRNA are not always the same. The interactions of U1 with the 5’ splice site and U2 with the branchpoint were proven by creating mutant splice sites that disrupted pairing with the snRNA and inhibited splicing. Compensating mutations in the snRNA that restored complementarity (base-pairing) with the splice site restored splicing.
| Step | Overview |
|---|---|
| U1 | U1 is the first snRNP to bind the pre-mRNA; uses its ssRNA to base-pair to the 5’ splice site. |
| U2AF | U2 Auxilliary Factor (U2AF) binds the 3′ splice site’s polypyrimidine tract and Ag, then helps U2 bind to the branchpoint. |
| U2 | Completing the ATP-dependent pre-spliceosomal A Complex, U2 binds to the branchpoint via RNA/RNA base-pairs to create a bulged A residue. |
| U4, U5 & U6 | The pre-spliceosome becomes a full spliceosome via addition of the U4/U5/U6 Tri-snRNP. This step also requires ATP hydrolysis. U6 base-pairs to U2 snRNA now, instead of to the U4 snRNP. |
| Detachment | In an ATP-dependent process, U1 and U4 detach and U2, U5 and U6 are rearranged. |
| All Done! | In the mature catalytically active spliceosome, the U1 and U4 snRNPs are absent, and the U2 and U6 snRNPs are basepaired to each other. The U2, U5 and U6 snRNAs, and the large spliceosomal protein Prp8 all make very specific contacts with the splice sites. |
Most metazoan genes have multiple exons that must be carefully excised (from the pre-mRNA) and then ligated (to form the mRNA). This RNA splicing occurs in the nucleus, and upon its completion the mRNA is exported to the cytoplasm for translation. Exon definition complexes and spliceosomes begin to assemble during transcription. Some of these complexes interact with RNAP II’s CTD, and some introns are thus excised before transcription has even terminated. There is no apparent order in which introns get spliced). Introns do no excise in any particular order, and active transcription (or its termination) is not needed for splicing to occur. However, the rate of transcription elongation through an intron can strongly affect what splice sites are chosen and thus indirectly couple transcription and splicing. Exon definition complexes and spliceosomes begin assembling during RNA synthesis; interaction with RNAP II’s CTD leads to excision of some introns before transcription has even terminated).
Whether a gene has one or many introns, each is excised via the same two isonergetic transesterification reactions:
1st Transesterification
Within the intron, the 2′ hydroxyl of the branchpoint A attacks the 5′-end phosphate, releasing the 5′ exon and forming an intron lariat. The new phosphodiester bond of the A nucleotide makes it a branched nucleotide bound to its attack-ee and two adjacent bases.
2nd Transesterification
The 3′ hydroxyl of the detached 5′ exon attacks the phosphate at the 3′ end of the intron. This causes release of the intron lariat and ligation of the exons.

Introns have sequences that directs the splicing apparatus during RNA splicing, part of RNA processing:
| Introns begin and end with splice sites that conform to consensus sequences. |
| Introns always begin with a GU encompassed within a larger 5’ splice site consensus. |
| Introns always end with the branch point sequence, several pyrimidines and an AG. |
Some mitochondrial, protist and archaeal genes have autocatalytic self-splicing introns (as opposed to nuclear spliceosome-dependent introns). These introns fold into specific secondary and tertiary structures that catalyze their own excision, without proteins. There are two families of these introns — Group I and and Group II — and Group II autocatalytic introns are thought to operate with a similar mechanism to the spliceosome catalytic center.
Spinal Muscular Atrophy is the most common genetic cause of infant death, occurring in 1 in 6,000-10,000 births. This is an autosomal recessive disorder caused by mutations in the SMN1 gene, encoding a protein involved in snRNP maturation. Humans have undergone an SMN gene duplication generating the SMN2 gene. Even though it encodes an identical protein, the presence of SMN2 cannot compensate for the loss of SMN1 in spinal muscular atrophy, because its splicing is abnormal and it does not produce functional protein.
Splicing targeted therapies are being tested for Spinal Muscular Atrophy. A single C to T mutation in an exonic splicing enhancer of SMN2 exon 7 causes exon skipping and production of a truncated protein from 80% of the SMN2 mRNA. If drugs or other therapies could be found that increase exon 7 splicing of SMN2, one could potentially treat the disease even without correcting the SMN1 defect.
Nearly all disease genes have mutant alleles which affect splicing of the mRNA, rather than protein coding. Exon skipping can arise via mutations in the 5′ splice site, the 3′ splice site, and/or exonic or intronic splicing enhancers. Activation of cryptic splice sites (thus shortening the exon) can arise via mutated exonic splicing enhancers. Examples of diseases due to mRNA splicing errors include Familial Isolated Growth Hormone Deficiency (mutations in the GH-1 gene), Cystic Fibrosis (mutations in the CFTR gene), Spinal Muscular Atrophy and some thalassemias (mutations in the β-globin gene that activate a cryptic 3′ splice site).
| Humans | 3×109 base pairs | ~25,000 genes |
|---|---|---|
| Fruit fly | 1.2×108 base pairs | ~13,600 genes |
| Round Worm | 9.7×107 base pairs | ~19,100 genes |
| Bakers Yeast | 1.2×107 base pairs | ~6000 genes |
Does gene number correlate with complexity? As shown to the rightt, it obviously does not. Rather, alternative splicing correlates with complexity. 95% of Human genes are known to exhibit alternative splicing. Complex Transcription Units use alternative splicing to produce more than one type of mRNA. Patterns of RNA spliciing can combine to produce a dizzying array of alternatively spliced isoforms: the Slo or Bk channel has over 500 isoforms; the neurexin protein has 2,346 isoforms; the para-sodium channel has 1,536 isoforms; the Drosophila DSCAM Receptor can potentially be made in 38,016 different spliced isoforms. Somatic sexual development in Drosophila is controlled by a cascade of splicing factors each regulating the splicing of genes downstream in the pathway. And cochlear hair cells are tuned to respond to different frequencies via alternative RNA splicing.

There are many examples of regulation in mammals where important changes in gene activity are regulated by alternative splicing. These systems use pre-mRNA binding proteins to enhance or repress particular splicing choices. These proteins have different types, numbers, and arrangements of RNA binding domains, and other domains that can be involved in protein/protein interactions. The RNA binding domains target the proteins to specific short sequence elements adjacent to sites of regulation. Alternative splicing is especially common in the mammalian nervous system, where it is used to diversify many proteins important for neuronal development and function. Small cassette exons create short peptide inserts that determine precise changes in ligand binding, electrophysiology, or subcellular targeting.

| Example | Overview |
|---|---|
| BK Channels | Cochlear cells are tuned use alternative splicing of α subunit exons. |
| Sxl Protein | Drosophila use alternative splicing for sex differentiation via Sxl protein. |
|
|