Exon shuffling can lead to a common domain being found in a variety of proteins.

If a LINE has a weak poly(A) signal, then sometimes transcription will continue and include an adjacent 3′ gene (eventually terminating at that gene’s strong poly(A) signal). ORF2 then reverse-transcribes the RNA transcript of the LINE and gene, eventually inserting the gene at a new location along with the SINE in a phenomenon known as exon shuffling.

Exon shuffling can occur double crossover between interspersed repeats.


The most common type of repetitive DNA are interspersed repeats or moderately repeated sequences. These are present as a single copy at very many different loci and can move or jump to new locations.Interspersed repeats account for almost half of human DNA. These do not occur in tandem arrays. Individual copies of the same, or nearly the same sequence, ~100 bp to ~10 kb long, are found at tens of thousands to more than 1 million different positions dispersed all over the genome. This dispersion is the result of repeated insertions of transposons into new sites during the evolution. The interspersed elements are either transposons themselves or are derived from other genomic sequences acted on by transposon enzymes.
| Class | Length | Copy # | Genome | Overview | ||
| DNA Transposons | 2 – 3 kbp | ~300,000 | 3% | Transposons either move by: direct excision and reinsertion of one DNA element; or insertion of a reverse-transcribed RNA product. DNA transposons can jump during S phase from a daughter strand into unreplicated DNA, thus increasing its copy number. A DNA transposon integrates with a staggered cleavage of the target DNA followed by ligation of the target 5’ ends to the transposon and filling the gaps. | ||
![]() |
||||||
| LTR Retrotransposons | 6 – 11 kbp | ~440,000 | 8% |
Retrotransposons are transcribed normally, then reverse-transcribed to form a DNA copy that is inserted into a new site. LTR retrotransposons are retroviruses that have lost the ability to exit and reinfect a cell. The upstream LTR acts as a promoter and the downstream LTR contains a poly-A site to produce transcripts from the integrated element. These both encode the proteins needed for transposition and serve as a template for making the DNA copy. Between them is a coding region that encodes proteins for transposition and also acts as a copy template.
The retroviral genomic RNA is copied into DNA via priming, extension, jumping and repriming steps:
The double-stranded DNA of an LTR retrotransposon integrates by the same mechanism as a DNA transposon, with a staggered cleavage of the target DNA followed by ligation of the target 5’ ends to the transposon and filling the gaps. revise_new_w.png |
||
![]() |
||||||
| Non-LTR Retrotr’sons | ||||||
| LINEs | 6 – 8 kbp | ~860,000 | 21% | Non-LTR retrotransposons also encode a reverse transcriptase, but use a different mechanism for insertion. The two proteins encoded by LINEs are: ORF1, an RNA binding protein; and ORF2, a reverse transcriptase and endonuclease. ORF2 protein makes a nick in an A/T rich region of the target DNA to allow priming on the Poly-A tail of the LINE RNA. Reverse Transcriptase uses the 3’ end of the nicked target to extend into the LINE RNA, making a DNA copy. RT reaches the end of the LINE RNA and continues into the target at the staggered cleavage. Insertion is completed by cellular enzymes that copy the second strand, degrade the RNA and ligate the fragments together. Most LINE elements are truncated due to incomplete copying of element during insertion. This makes them inactive for transposition, but they can still be mutagenic upon insertion and can still induce aberrant recombination events. LINEs can result in exon shuffling. | ||
![]() |
||||||
| SINEs | 100 – 300 bp | ~1,600,000 | 13% |
Short Interspersed Elements (SINE’s) do not encode proteins but transpose by the same mechanism as LINE’s, presumably using the LINE proteins. SINEs carry within them a promoter for RNA Pol III allowing new RNA copies to be made. The most common SINE is the Alu Element named for an Alu restriction site it contains. Alu elements originally derived from the 7SL RNA, a cytoplasmic small RNA involved in protein secretion.Single Alu Elements can evolve into new exons. Alu elements can result in exon shuffling. |
||
| Processed Pseudogenes | Variable | 1 – ~100 | ~0.4% | |||
Also, there is unclassified spacer DNA that accounts for ~25% of the genome.
| Class | Length | Copy # | Genome | Overview | ||
| Solitary Genes | Variable | 1 | ~15% | |||
| Gene Families | Variable | 2 – ~1,000 | ~15% |
A significant percentage of human genes are members of gene families. In some cases, the multiple copies allow increased production of identical gene products – rRNA. In other cases, the different family members have different but related functions – beta Globin. About half of all human genes are solitary genes, like the SUR2 gene. This means that there is only one gene of similar sequence and function in the haploid genome. About half of all human protein coding genes are duplicated, or members of a gene family with >2 closely related genes. For example, the globin genes are members of a gene family. The β-globin genes on chromosome 11 have exons that are >90% identical. They are also >80% identical to the β-globin genes on another chromosome. The different beta Globins have evolved different oxygen affinities and transport properties and are adapted to use in different situations. For example, e globin is expressed in the developing fetus for absorbing oxygen from the maternal Hemoglobin in the placenta. How did the duplicated genes arise? Gene duplication by unequal crossing-over between homologous repeats during homologous recombination in meiosis. Duplicated genes do not necessarily remain linked at the same chromosomal locus. Later events can move them to other locations in the genome. |
||
Also, there is unclassified spacer DNA that accounts for ~25% of the genome.
| Class | Length | Copy # | Genome | Overview | ||
| Tandem Repeats | Variable | 20 – 300 | 0.3% | Encode rRNAs, tRNAs, snRNAs and histones. In some cases, the multiple copies allow increased production of identical gene products – rRNA and histones. In these cases, the genes usually exist as tandem arrays. Allows production of millions of copies of the gene product per cell division – needed for ribosome, snRNPs and histones. | ||
| Simple-Sequence DNA | 1 – 500 bp | Variable | 3% |
Commonly called “satellite DNA” — Sheared DNA has buoyant density dependent on base content. Total DNA gives rise to a main band of average base content. Certain overrepresented simple sequence repeats give rise to satellite bands due to skewed base content.
Microsatellite DNA is defined as having very short repeat units of 1-15 nt, such as CAGCAGCAG etc, repeated 50 or more times. Many human diseases are caused by triplet (especially CAG) repeat expansion mutations. These are thought to accumulate during rare mistakes in DNA synthesis when the nascent daughter strand slips backward along the template strand to insert additional bases into the daughter strand. |
||
![]() |
||||||
| Minisatellite DNA has a longer repeat unit length of 15-100 nt or so With tandem array lengths of 500 bp to 20 kb. Differences between individuals in the number of repeats of a minisatellite sequence arise through unequal crossing over between chromosomes during meiosis. Some minisatellite sequences are highly variable in repeat number between individuals. This is the basis of the DNA fingerprinting. | ||||||
![]() |
||||||
| Most simple sequence DNA is comprised of 14-500 bp units tandemly repeated in long stretches of 20-100 kb. Most of these very long simple sequence DNAs are either at the centromeres of chromosomes where they may affect chromosome segregation, or serve as telomeres at the chromosome ends. | ||||||
Also, there is unclassified spacer DNA that accounts for ~25% of the genome.
Introns have sequences that directs the splicing apparatus during RNA splicing, part of RNA processing:
| Introns begin and end with splice sites that conform to consensus sequences. |
| Introns always begin with a GU encompassed within a larger 5’ splice site consensus. |
| Introns always end with the branch point sequence, several pyrimidines and an AG. |
Transposable elements were first discovered by Barbara McClintock in kernels of corn, where certain mutations caused loss and reinstatement of purple pigment (due to gain and loss of an insertion element that activated pigment genes). The human genome contains ~300,000 DNA transposons, which have extensively accelerated evolution due to the modularity of exons and regulatory regions. Transposition is one avenue for exon shuffling to occur, whereby an exon and two flanking transposons are all excised and reinserted elsewhere as a single element (potentially adding a new exon to a gene). There are conservative (cut + paste) and replicative (copy + paste) mechanisms for transposition. However, transposition is potentially mutagenic and over-transposition is very deleterious; thus, it remains a rare event that occurs in about 1:105 or 1:107 cells per generation. Transposable elements occur in both eukaryotes (transposons; retrotransposons) and prokaryotes (insertion elements).
| Next Steps | Transposons and retrotransposons are discussed in the article about eukaryotic chromosomes. |
|---|
A gene is the nucleic acid sequence needed to synthesize a particular gene product. A gene includes more than just the coding region that encodes an RNA transcript; there are also control regions controlling synthesis, processing and translation of the RNA transcript. In prokaryotes, the entire coding region encodes a continuous polypeptide sequence. In eukaryotes, coding regions contain exons (50-250 nucleotides) that encode polypeptide sequences and introns (500-50,000 nucletides, removed during RNA processing) that do not. Higher eukaryotes not only have introns within genes, but large intergenic regions. For example, a ~80 kb region in Saccharomyces cerevisiae (baker’s yeast) contains 40 genes; the ~80 kb region encompassing the human β-globin cluster contains only 5 genes. This extra DNA comes from multiple repeats described here. Exons often encode modular units that are included or excluded via RNA processing. Exons are usually highly conserved while introns are barely conserved. For example, SUR2 exons are 90% identical between mice and humans while SUR2 introns are less than 10% identical between mice and humans. A lack of inter-species sequence conservation indicates a lack of function.

| Monocistronic | Most eukaryotic genes are monocistronic, meaning their mRNAs encode a single protein. Often, a eukaryotic primary transcript forms a single mRNA that encodes a single protein. Most eukaryotic mRNAs have a 5′ cap structure that directs ribosome binding, with translation beginning only at the closest AUG codon. |
| Polycistronic | Prokaryotic genes are mostly polycistronic, with one mRNA encoding multiple proteins involved in a biological process. Along the mRNA, there is a ribosome binding site near each coding region’s start site. Translation can initiate at any of these sites, allowing production of different proteins from one mRNA. |
A transcription unit is a region of DNA that is transcribed under the control of a particular promoter. While a gene and a transcription unit (like the LAC operon) are distinguishable in prokaryotes, the two terms are used interchangeable in eukaryotes. There are simple and complex eukaryotic transcription units. A simple transcription unit RNA transcript is processed to yield a single mRNA encoding a single protein. Complex transcription units, which are more common, encode an RNA transcript that is processed to form different monocistronic mRNAs each encoding a different protein. A single transcript can undergo different mRNA pathways via:
| Alternative Splicing | mRNAs have the same 5′ and 3′ exons but different internal exons. |
| Alternative Poly(A) Sites | mRNAs have the same 5′ exons but different 3′ exons. |
| Alternative Promoters | mRNAs have different 5′ exons but share 3′ exons. |
| Next Steps | Study about the eukaryotic chromosome. |
|---|
The total mass of histones associated with DNA in chromatin is about equal to that of the DNA. Interphase chromatin and metaphase chromsomes also contain small amounts of a complex set of other proteins. For instance, a growing list of DNA-binding transcription factors have been identified associated with interphase chromatin. The structure and function of hese critical nonhistone proteins, which help regulate transcritpion, are examined in Chapter 11. Other low-abundance nonhistone proteins associated with chromatin regulate DNA replication during the eukaryotic cell cycle.
A few other nonhistone DNA-binding proteins are present in much larger amounts than the transcription or replication factors. Some exhbit high mobility during electrophoretic separation and have thus been designated high-mobility group (HMG) proteins.W hen genes encoding the most abundant HMG proteins are deleted from yeast cells, normal transcription is disturbed in most other genes examined. Some HMG proteins have been found to bind DNA cooperatively with transcription facors binding to specific DNA sequences to stabilize multiproteins complexes regulating transcription of a neighboring gene.
A mammalian chromosome is massive, with tens to hundreds of megabases of DNA. The entire haploid human genome contains about 3 billion base pairs of DNA. Only about 5% of this encodes functional RNAs or Proteins or controls their production. Eukaryotic chromosomes are bound to structural proteins to form chromatin. During metaphase, chromatin is highly condensed into the recognizable structure seen at left. During interphase, chromatin is highly decondensed so that regulatory proteins can access the DNA.
| Cell Stage | Status of Chromosome |
| Interphase | Chromosomes are highly decondensed in most regions, allowing access of regulatory proteins for transcription and replication. Within the nucleus, individual chromosomes are found within diffuse but non-overlapping domains. |
|---|---|
| M Phase | Duplicated chromosomes condense into defined sister chromatids to allow their segregation at cytokinesis. After chromosome condensation, the nuclear envelope breaks down in a process controlled by the nuclear lamina so that the chromosomes can segregate to opposite ends. At metaphase, chromosomes are aligned along the metaphase plate and sister chromatids are split at the centromere to segregate to opposite poles of the dividing cell. |
During evolution large rearrangements can occur in the size and number of chromosomes. A syntenic region contains genes that are found in the same order in different species, although not always on the same chromosome. For example, the Indian Muntjac has three large chromosomes and a tiny X chromosome; the very similar Reeves Muntjac has just as much DNA — and often in the same sequence — but divided among 23 chromosomes. These chromosomal rearrangements are rare, but are extremely important for speciation because they make productive mating impossible. The number, sizes and shapes of metaphase chromosomes constitute the karyotype (distinctive for each species). During metaphase, chromosomes are distringuished by banding patterns and chromosome painting.
The region of the chromosome where the sister chromatids are held together is called the centromere. This assembles a structure called the kinetochore that is required for attachment to microtubules during alignment at the metaphase plate, splitting of the sister chromatids, and movement to the spindle poles. Because of the nature of DNA replication, a linear chromosome requires special sequences at the ends called Telomeres. DNA replication requires an RNA primer to initiate synthesis, which is degraded after priming. The loss of these primers on the lagging strand of the chromosome ends will result in a loss of information with each round of replication. Telomerase is a special enzyme that uses its own RNA template to add telomeric repetitive DNA to chromosome ends. Chromosomes require replication origins (ARS), centromeres, and telomeres for proper replication, mitotic segregation, and maintenance. These sequences were first identified in yeast. Adding telomeric DNA to a DNA containing an ARS and Centromere allows its maintenance as a linear chromosome. Yeast artificial chromosomes containing ARS, Cen, and Tel elements allowed the cloning of large fragments of human chromosomes.
| Progeny of Transfected Cell | ||||||
| Plasmid | Recipient | Growth | Mitotic Segregation | Observation | ||
|---|---|---|---|---|---|---|
| LEU+ Circular | LEU- Yeast | None | Transfection with a LEU+ plasmid does not alone restore LEU to a LEU- cell. | |||
| LEU+ ARS+ Circular | LEU- Yeast | Some | Poor | Replication occurs, but poor segregation means only ~10% of progeny carry the plasmid. | ||
| LEU+ ARS+ CEN+ Circular | LEU- Yeast | Yes | Good | A centromeric (CEN) genome fragment is needed for strong segregation. | ||
| LEU+ ARS+ CEN+ Linear | LEU- Yeast | None | Linearization (via restriction enzymes) of a TEL- circular plasmid makes it unstable. | |||
| LEU+ ARS+ CEN+ TEL+ Linear | LEU- Yeast | Yes | Good | Linear plasmids must carry the telomeric (TEL) gene fragment at each each end to remain stable in progeny cells. | ||
Some specialized eukaryotic cells increase cell volume via endomitosis, where DNA synthesis is repeated without cell division and a normal chromosome develops into a giant polytene chromosome results (first observed in Drosophila melanogaster larval salivary glands). Centromeres and telomeres endoreplicate poorly, leading to a bundle of duplicate chromatids (as many as 1,000) stuck at the chromocenter; thus, the ploidy of the cell remains constant.

Chromosome puffs are diffuse uncoiled regions of the polytene chromosomes where RNA transcription occurs; a large chromosome puff is a Balbiani ring. In addition to increased nucleic and cellular volume, polytene cells have metabolic advantages since multiple gene copies facilitate high levels of gene expression (which would be particularly useful in larval cells). Polytene chromosomes have very distinctive banding patterns that are useful for mapping the location of genes and observing their transcriptional activation.
| 0.34 nm | = | Distance between base pairs in B-form DNA. |
| 3 x 109 bp | = | Human Haploid Genome. |
| ~1 m | = | Length of Haploid Genome. |
| 3-10 x 10-6 m | = | Length of nucleus. |
| ~2 m | = | DNA per nucleus. |
To pack DNA into the tiny nucleus (the DNA packing problem), DNA is tightly wound around special proteins to form a nucleoprotein complex called chromatin. Chromatin proteins are predominantly histones, a family (H1, H2A, H2B, H3 and H4) of small proteins that are conserved in eukaryotes and contain many positively-charged basic amino acids that interact with negatively-charged DNA phosphate groups.
| Structure | Overview |
|---|---|
| Primary | Two each of each histone (except H1) interlock to form a disc-shaped structure ~10 nm in diameter. In all eukaryotes, 147 bp of DNA wraps almost twice around the histone octamer to form the nucleosome. Nucleosome “beads” are each spaced by 15-55 bp (depending on species) “strings” of linker DNA. The basic and positively-charged histones bind tightly to DNA, protecting it from proteins. Careful nuclease treatment will digest linker DNA and release individual nucleosomes with their DNA still intact. However, linker DNA is somewhat protected by bound H1 and by inter-histone interactions. Newly replicated DNA in vivo assembles into nucleosomes shortly after the replication fork passes, nucleosomes do not spontaneously form in vitro at physiological salt concentration when histones are added to DNA. However, nuclear proteins have been characterized that bind histones and assemble them in vitro with DNA. These are thought to assemble new DNA into histones in vivo as well. |
| Secondary | Nucleosomes (the 1° chromatin structural unit) will stack on top of each other at physiological conditions (~0.15 M KCl), and the stacks will then intertwine into an irregular spiral (solenoid arrangement) that is ~30 nm and contains ~6 nucleosomes per turn. H1, the 5th major histone, is bound to DNA on inside of solenoid with one H1 molecule associated with each nucleosome. The 30-nm solenoid is less uniform than a perfect solenoid; condensed chromatin may actually be quite dynamic, with regions occasionally partially unfolding and then refolding into a solenoid structure. |
| Tertiary | At special scaffold attachment regions (SARs), the 30 nm fibers attach to a flexible protein scaffold; the unattached regions form chromatin loops. DNA can be released from the protein scaffold by treatment with detergent. In addition to this general structure, thousands of low-abundance regulatory proteins associate with specific DNA sequences. During meiosis, chromatin further folds and compacts into visible metaphase chromosomes. |
Actively transcribed chromatin is in 10 nm form (beads-on-a-string). This can revert to 30 nm when genes are repressed. Highly inactive chromatin, such as that containing repetitive DNA, is still further condensed around the chromosome scaffold. Histones have unstructured tails (not seen in crystal structure) that are specifically modified (including acetylation, methylation, and phosphorylation) to mediate the regulated condensation and decondensation of the chromatin.
| Chromatin | Overview | |
|---|---|---|
| Heterochromatin | Heterochromatin is made up of dense, tightly packed, portions of the chromosome that are mostly inactive and often contain repetitive simple sequence DNA. This appears very dark in the electron microscope.Heterochromatin is highly condensed but can be converted to euchromatin by transcriptional activators targeted to that chromosomal region. | |
| Euchromatin | Gene rich regions of the chromosome are much less densely packed and make up what is called Euchromatin. The decondensation of chromatin upon transcriptional activation can also be observed through its sensitivity to DNAse. For example, globin genes expressed in erythrocytes (in these cells, DNAse sensitive) but not expressed in other cells (DNAse resistant). | |
Nonhistone proteins provide a structural scaffold for long chromatin loops. Although histones are the predominant proteins in chromosomes, nonhistone proteins are also involved in organizing chromosome structure. Electron micrographs of histone-depleted metaphase chromosomes from HeLa cells reveal long loops of DNA anchored to a chromosome scaffold composed of nonhistone proteins. This scaffold is shaped like the metaphase chromosome and persists evn when DNA is digested by nucleases. Loops of 30-nm chromatin fiber a few megabases long associate with a flexible chromsome scaffold, yielding an extended form characteristic of chromosomes during interphase. Folding of scaffold produces highly condensed structure characteristic of metaphase chromosomes. But the geometry of scaffold folding in metaphase chromosomes has not yet been determined.
In situ hybridization with different fluorescent-labeled probes to DNA in human interphase cells support loop model shown. Some probe sequences mllion of base pairs apart in linear DNA appeared reproducibly very clse in interphase nuclei from different cells. hese are postulated to lie close to specific sequences in DNA called scaffold-associated regions (SARs) or matrix-attachment regions (MARs) bound to chromosome scaffold. SARs have been mapped by digesting histone-depleted chromsoemes with restriction enzymes and recovering fragment bound to scaffold proteins. In general, SARs are found between transcription units. Genes are located primarily within chromatin loops attached at bases to chromosome scaffold. Experiments with transgenic mice indicate that in some cases, SARs are required for transcription of neigboring genes.
Individual interphase chromosmes are less condensed than metaphase chromosomes and cannot be resolved by standard microscopy or electron microscopy. Nonetheless, it is associate with extended scaffolds and is further organized into specific domains. This can be demonstrated by in site hybridization of interphase nuclei with a large mixture of fluorescent-labeled probes specific for sequences along the length of a particular chromosome. Little overlap between chromosomes in interphase nuclei. Precicse positions are not reproducible between cells.

The pre-mRNA binds many proteins during transcription. Although collectively called hnRNP proteins, they in fact belong to many different structural and functional families. Like transcription factors, RNA binding proteins can be modular and contain and RNA-binding domain and a protein-protein interacting domain.
RNA-binding domains often recognize short sequences (like a base triplet) and thus have less sequence specificity than DNA-binding domains. To increase specificity, many proteins have multiple RNA-binding domains; ssRNA is flexible enough that different domains can bind the same strand. Common RNA-binding domains are shown below, in order of decreasing frequency.
| Domain | Overview |
| RRM/RNP | The RNA Recognition Motif (RRM, aka RNP domain) has an αβ structure whose β domain binds the single-stranded pre-mRNA via many backbone and base-specific interactions. |
|---|---|
| K Homology | K Homology Domains (KH Domains) are also αβ structures, although the binding surface is along the edge between α and β. Multiple KH Domains are often found in one protein. |
| Next Steps | Read about DNA Binding Proteins. |
|---|
III. Biosynthetic Pathways
In the 40’s Beadle and Tatum suggested the “one-gene-one-enzyme†hypothesis which explained how genes can code for a biosynthetic pathway ultimately producing a useful compound.
Rule of Thumb
• The compound that rescues most number of mutants is the last one in the pathway.
• The mutant that is rescued by most supplements is the first one in the pathway.
A B C D
Mut 1 - + + +
Mut 2 - - + +
Mut 3 - - - +
| °’s of Freedom | Probability | ||||
| .9 | .5 | .1 | .05 | .01 | |
| 1 | 0.02 | 0.46 | 2.71 | 3.84 | 6.64 |
| 2 | 0.21 | 1.39 | 4.61 | 5.99 | 9.21 |
| 3 | 0.58 | 2.37 | 6.25 | 7.82 | 11.35 |
| 4 | 1.06 | 3.36 | 7.78 | 9.49 | 13.28 |
| 5 | 1.61 | 4.35 | 9.24 | 11.07 | 15.09 |
Click Χ2 for information on how to use the Χ2 table (Chi-Squared Table).
A mutation is any change in the nucleotide sequence or arrangement of DNA; mutations are one of only four evolutionary agents. There are three categories of mutations: genome mutations, which arise from chromosome missegregation and change how many intact chromosomes are in a cell, leading to aneuploidy; chromosome mutations, which arise from chromosome rearrangement and restructure an individual chromosome; and gene mutations, which are base pair mutations affecting an individual gene. An inherited mutation is a mutation passed on from ancestors; a de novo mutation is a new and non-inherited mutation. Disease mutations interfere with protein synthesis at one of the steps in the list to the bottom right.
Somatic mutations are not passed on to progeny and only affect some cells of the body; a tumor is a common example of a somatic mutation. Germline mutation affect germ cells (cells which differentiate into eggs or sperm) and are passed onto progeny. Many germline mutations occur either in the fertilized egg (the zygote), leading to both germ and somatic cells containing the mutation. Dynamic mutations worsen during gametogenesis, leading to a worsened phenotype in each generation. Examples of dynamic mutations include Huntington Disease and Fragile X Syndrome, which involve repeat sequences extending during gametogenesis.
Some mutations are more likely to be maternal and other mutations are more likely to be paternal. Primary oocytes develop fetally and ovulate years and decades later; the older an oocyte is, the more likely it is to undergo nondisjunction. This nondisjunction leads to trisomy (which can be viable) and monosomy (which is almost never viable). For this reason, aneuploidy is maternal in at least 80% of all cases and occurs more often with maternal age. On the other hand, spermatogenesis occurs throughout a man’s entire life. With mutations accumulating during each round of replication, point mutations are almost always paternal and increase with paternal age.
Point mutations are substitutions of one base pair for another. A transition is when one pyrimidine is swapped for another — such as C for T, or T for C — or one purine is swapped for another — such as A for G, or G for A. A transversion is when a pyrimidine and a purine swap. Transitions are more frequent because when cytosines methylate to form 5-methylcytosine, they can spontaneously deaminate to thymidine.
| Missense Mutation | A missense mutation is the changing of a single base-pair. Within a coding region, this usually leads to a change of a single amino acid in the protein product. However, if this mutation is in the 5′ or 3′ untranslated region then a missense can lead to underexpression of the protein product. |
|---|---|
| Nonsense Mutation | A nonsense mutation (aka chain termination mutation) is the replacement of a codon encoding an amino acid with a stop codon. This stop codon ends transcription of that gene prematurely, with an incomplete and unstable mRNA formed. Most of these mRNAs just fall apart; however, the few that are translated result in truncated proteins that quickly disintegrate. |
| RNA Processing Mutation | An RNA processing mutation interferes with splicing of mRNA. If the point mutation alters a splice donor or splice acceptor site, then RNA splicing is interfered with or abolished at that location. If the point mutation creates a new splice donor or splice acceptor site, then this new site competes with normal splicing and the processed mRNA might still contain introns. |
There are deletions (removal of DNA), insertions (addition of DNA), inversions (reversal of the orientation of a DNA segment), translocations (moving of a DNA segment) and combinations thereof. Some deletions and insertions are small changes which are detectable only via PCR or genome sequencing; these usually shift the reading frame (a frameshift mutation) and lead to truncation of the mRNA by an early stop codon in the new reading frame. Some larger mutations can be detected via Southern blotting. For a mutation to be detectable by chromosome banding, it must involve at least 2 million base pairs.
Deletions, insertions, inversions and translocations often arise via faulty recombination. For example, unequal crossing over is crossing over without proper exchange of genetic information, leading to insertions in one chromosomes and deletions in another. Another form of faulty recombination is when mispaired chromosomes or sister chromatids exchange genetic information.
| Mutation | Example | Overview |
| Missense | Phe15Tyr | A missense mutation is described by the wild-type amino acid, its residue and the resulting mutant amino acid. The example shows a mutation where a phenylalanine is converted to tyrosene at residue 15 of a gene. |
|---|---|---|
| Nonsense | Ser25X | A nonsense mutation is described by the wild-type amino acid, its residue and then an X to represent the mutant stop codon. The example shows a mutation where serine is replaced with a stop codon at residue 25 of a gene. |
| Nucleotide Change | g.3000G>A c.1000g>a |
If the full genomic sequence is known, a nucleotide change is denoted by a prefix (g for genomic and c for cDNA), followed by the number of that nucleotide, the original nucleotide, a ‘>’ symbol, and finally the mutant nucleotide. Mutations identified in genomic DNA are denoted with capitalized nucleotides; mutations identified in non-genomic DNA are denoted with lower-case nucleotides. The first example shows a genomic mutation at position 3,000 where a G transitions to an A; and the second example shows the same mutation at position 1,000 on cDNA. |
| g.IVS25+2G>A g.IVS25-2T>A g.IVS25-2A>T |
If the full genomic sequence is not known, then nucleotides are counted either: up from the 5′ splice donor site, with +1 being the invariant G of the GT at the 5′ splice donor site; or down from the 3′ splice acceptor site, with -1 being the invariant G of the AG at the 3′ splice acceptor site. The first example shows a transversion at the T of the 5′ splice donor site; the second example shows a transversion at the A of the 3′ splice acceptor site. | |
| Deletions | c.1000_10003delGCAT | Small deletions begin with a prefix (g or c for genomic or cDNA), followed by the locations of the deleted nucleotides, then a del for deletion, and finally the nucleotides deleted. The example shows a four-nucleotide deletion of a G,C, A and T at respective locations 1000, 1001, 1002 and 1003. |
| Insertions | c.1000_10001insATGC | Small insertions begin with a prefix (g or c for genomic or cDNA), followed by the nucleotides flanking the insertion, then an ins for insertion, and finally the nucleotides inserted. The example shows an insertion of A,T,G and C between nucleotides 1000 and 1001. |
| Nomenclature table derived from Nussbaum, McInnes & Willard: Genetics in Medicine, 7th ed. Philadelphia, Saunders, 2007. (pg 181) | ||
| Effect | Example | Overview |
| Loss of Function | α-thalassemia β-thalassemia Turner Syndrome Retinoblastoma |
Loss of function mutations leads to a lower gene dosage. Examples of loss of function diseases are the α-thalassemias (where the entire α-globin gene is deleted), β-thalassemias (premature stop codon or coding missense), Turner Syndrome (a monosomy where a chromosome is lost) and retinoblastoma (where a somatic mutation lead to loss of function of tumor-suppressor genes). Many loss-of-functional diseases are less severe in heterozygotes; oftentimes, a single functional allele is enough for a healthy or mild-diseases phenotype. |
|---|---|---|
| Gain of Function | Trisomy 21 Charcot-Marie-Tooth 1A Hemoglobin Kempsey Achondroplasia |
Gain of function mutations lead to increased activity of a certain protein in tissues which normally express it (as opposed to heterocrhonic or ectopic expression). This increased activity can be due to a higher gene dosage, as in Down Syndrome (a third copy of part or all of Chromosome 21) or Charcot-Marie Tooth Type 1A (duplication of a particular gene) or even progression of certain cancers. Alternatively, one function of a protein might be detrimentally hyperactive. Examples include: hemoglobin Kempsey, a mutant hemoglobin with such high oxygen affinity that it does not release oxygen to tissues; and achondroplasia, where an over-strong signal (from exceptional binding of a growth factor receptor) leads to growth retardation. |
| Novel Property | Sickle Cell | Novel property mutations give encoded proteins new properties. For example, the hemoglobin chains of sickle cells aggregate into long fibers which deform the cell and impair its function. Some novel property mutations are also loss of function mutations; mutants with novel glycosylation sites are rendered inactive by glycosylation. |
| Heterochronic & Ectopic Expression |
Some Cancers | Certain mutations are simply due to gene expression at the wrong time (heterochronic) or in the wrong tissue (ectopic). For example, constitutive expression of proliferation genes (known as oncogenes) can lead to tumor formation. |
| Term/Abbreviation | Overview |
| Pedigree | A graphical family tree using standardized symbols. |
|---|---|
| Germ Cell | Constituting the germline, germ cells differentiate into sperm or eggs and pass genetic information to progeny. |
| Somatic Cell | Any cell that is not a germ cell; genetic information within a somatic cell will not pass on to progeny. |
| Gene | a section of DNA (xsm) that codes for a product. |
| Locus | The location of a gene on a chromosome. |
| Allele | A version of a gene. For example, there is a hair color gene. An allele of that gene might encode brown hair. |
| Wild-Type | The most common form(s) of an allele in the overall population. |
| Mutation | Any change in the nucleotide sequence or arrangement of DNA. |
| Mutant | Arising from mutation, a mutant (aka variant) allele is any other than the wild-type allele(s). |
| Polymorphism | If there is more than one common allele for a gene — such as with hair color — then that gene is polymorphic. |
| Rare Variant | If an allele is present in less than 1% of the population, it is a rare variant (as opposed to a polymorphic allele). |
| Haplotype | The different alleles possible at a given locus. |
| Centromere | A centromere (abbreviated cen is the center of a chromosome, where sister chromatids meet. |
| Genotype | An individual’s set of alleles. |
| Phenotype | The phenotype (aka trait) is the manifestation of a genotype, ranging from retardation to hair color. Genetic variation (allelic heterogeneity, locus heterogeneity and gene modifiers) muddles the genotype-phenotype correlation. |
| Qualitative Trait | A trait that is either present or not, such as trisomy 21. |
| Quantitative Trait | A trait that is measured, such as height, body mass index or intelligence. |
| Single-Gene Trait | A trait that is mostly determined by the alleles at a single locus. |
| Polygenic Trait | Aka multigenic, any trait (or disease) that is controlled by several genes. |
| Allelic Heterogeneity | Allelic heterogeneity is different mutations in the same gene resulting in the same disease, even if of different severity. An example is cystic fibrosis. ther diseases, like sickle cell disease, show little or no allelic heterogeneity (since a single mutation causes it). |
| Locus Heterogeneity | Different genetic diseases causing the same phenotype. An example is retinitis pigmentosa. |
| Phenotypic Heterogneity | Different mutations in the same gene can cause very different diseases. A deletion in the RET gene causes Hirchsprung Disease, characterized by severe constipation; other mutations in the RET gene result in thyroid and adrenal cancer; another set of mutations causes both Hirschsprung disease and cancer. |
| Inbreeding Coefficient (F) | |
| Homozygous | Having the same alleles on each homologous chromosomes. |
| Heterozygous | Having different alleles on each homologous chromosome. A compound heterozygote carries no normal alleles (only mutant alleles) for a particular gene. |
| Hemizygous | A gene with only one copy (and thus only one allele) normally present; for example, a Y gene is hemizygous). |
| Dominant Allele | In a heterozygote, only the dominant allele is expressed. |
| Recessive | In a heterozygote, the recessive allele is not expressed. |
| Multifactorial | Aka complex, any disease caused by complex genetic (gene-gene or polygenic) and environmental (gene-environment) interactions, and not following a Mendellian pattern. |
| Autosomal | Not pertaining to a sex chromosome (an autosome being any chromosome not a sex chromosome). |
| Penetrance | The likelihood that a disease genotype leads to disease phenotype. |
| Expressivity | Either constant or variable, expressivity describes disease severity among individuals with the same genotype. |
| Concordance | When two family members share a disease, the two individuals are concordant for the disease. |
| Discordance | When two family members do not share a disease, the two individuals are disconcordant for the disease. |
| Genocopy | A concordant relative has a genocopy (aka phenocopy) if they express the disease for different genetic reasons. |
| FISH | Fluorescent in situ hybridization (aka ish). |
| p | Short arm of a chromosome |
| q | Long arm of a chromosome |
| mar | Marker chromosome |
| r | Ring chromosome |
| i | Isochromosome |
| der | Derivative chromosome |
| dic | Dicentric chromosome |
| cen | Centromere |
| ter | Telomere, or at the end |
| del | Deletion |
| dup | Duplication |
| fra | Fragile site |
| ins | Insertion |
| inv | Inversion |
| t | Translocation |
| rcp | Reciprocal translocation |
| rob | Robertsonian translocation |
| + | Gain or addition |
| - | Loss or omission |
| : | Breakage |
| :: | Breakage and joining |
| / | Mosaic |
| arr cgh | Array competitive genome hybridization |
| Abbreviations derived from Nussbaum, McInnes & Willard: Genetics in Medicine, 7th ed. Philadelphia, Saunders, 2007. (pg 66) | |
Probability can be sued to predict the types fo progeny that will result from a monohybrid or dihybrid cross. The Punnett square is a graphical representation of these possible outcome. Phenotypes are the result of the genotype of an organism, more than one genotype may result in the same phenotype. Distinct segregation patterns result from monohybrid, dihybrid, and test-crosses.
Probability (expected frequency): probability of an outcome…# of times event is expected to happen/# of opportunities (trials)
The sum of all the probabilties of all possible events = 1
General rules:
Step 1: For outcome “A,†what events must happen? Multiply the probability of each event.
Step 2: Are there different ways to get outcome “A� Calculate the probabilities of each different way and add them up.
Multiplication Rule (Product Rule). The probability of two independent events, A and B, being realized simutaneously is given by the product of their separate probability
Prob{A & B} = Prob{A}â‹…Prob{B}
The probability of indepdent events occuring together is the product of the possibilties of the individual events: p(A and B) = p(A)p(B)
If you roll two dice, what is the chance of getting two 5′s? A 5 on the 1st die and a 5 on the 2nd die? 1/6*1/6
Addition Rule (Sum Rule). The probability of the realization of one or the other of two mutually exclusive events, A and B, is the sum of their separate probability. The probability of either of two mutually exclusive events occurring is the sum of their individual probabilites.
Prob{A or B} = Prob{A}+Prob{B}
If you roll two dice, what is the probabiltiy of two 5′s or two 6′s? Probability of a 5 on 1st die and a 5 on 2nd die or a 6 on 1st die and a 6 on 2nd die.
1/36+1/36=1/18
The Punnett Square is a way of depicting the product rule. Using Mendel’s Law of segregation, we know that both alleles are equaly likely to occur. So for a cross:
1/2R 1/2 r
1/2R 1/4RR 1/4 Rr
1/2r 1/4 rr 1/4 Rr
1/4 RR 1/4 Rr 1/2 Rr
t T
t tt tT
T Tt TT
| 60 red gum bals nd 40 green gum balls. If you buy one gum ball, what is the probability of getting a red one? |
|---|
| # of red gum balls/total # of gum balls = 60/100 = .6 (.06 x 100% = 60%) |
| How many (ratio) progeny will have the same phenotypes of one of their parents from such a cross: AaBBCcDdee x aaBbccDdEE? |
| They must be either A_B_C_D_ee, or aaB_ccD_E_ … (½)(1)(½)(¾)(0) + (½)(1) (½)(¾)(1) = 3/16 |
| What is the probability of one green and one red gum ball if we have two quarters? Green first then red or red first then green. When not specifying order, we must figure out each way of getting the outcome. |
| p(green, then red) = .4*.6 and p(red, then green) = .6*.4 .24+.24=.48 |
| What fraction of the progeny will have large, smooth, purple fruit? SsPpLl x ssPPLl Texture: S-smooth and s-rough; P-purple and p-pink; L-large and l-small. |
| large: p(LL or Ll) = 1/4 + 2/4 = 3/4 smooth: p(Ss) = 1/2 purple: p(Pp or PP) = all = 1 large, smooth, purple p(L_S_P_) = 1/2 * 1 * 3/4 = 3/8 |
| What fraction of the progeny will not have large, smooth, purple fruit? SsPpLlxssPPLl |
| large, smooth, purple = p(L_P_S_) = 3/8. Not large, smooth, purple = 1-3/8 = 5/8 |
These are extension to the 3:1 Ratio, the ratio of monohybrid crosses (1 gene):
Definition: The phenotype of the heterzygote is different from the phenotype of either of the homozygote, and the heterzygote’s phenotype lies somewhere in the range between the phenotypes of the homozygotes.
Example: Four O’Clock flower
Red x White ⇒ Pink
Pink x Pink ⇒ 1:2:1 Red:Pink:White
Instead of seeing 3:1 ratio, we observe a 1:2:1 phenotypic ratio.
(Also note that the phenotypic ratio is the same as the genotypic ratio)
Definition: Codominance is the condition under which the heterozygote manifests phenotypically the features of two different alleles, rather than an intermediate phenotype between the homozygous dominant and homozygous recessive phenotypes.
Although there is 1:2:1 phenotypic ratio, as in incomplete dominance, the heterozygote does not represent an intermediate phenotype, but rather a phenotype in which the traits from the two different alleles coexist.
Example 1: ABO blood types. IA, IB, and IO are the alleles for a gene that produces blood antigen:
IA and IB alleles encode for antigen A and B, respectively.
IO alleles does not encode for any antigen.
An AB individual has both A and B antigen. So allele IA and IB are codominant.
Example 2: The L and M antigens are similar to the ABO system. The ability to produce M and N antigens is determined by alleles of single gene.
Homozygotes for M allele LMLM produce only M antigens.
Homozygotes for N allele LNLN produce only N antigens.
Heterozygotes LMLN produce both M and N antigens.
Question: If two AB blood type individuals mate, what type of blood will the offspring have?
Answer: There will be a 1 :2: 1 ratio, in which 1/4 of the offspring will be type A, Y2 will be type AB, and 1/4 will be type B.
AB x AB ⇒ 1A : 2AB : 1B
They are often multiple forms of the same gene present in a population. As a diploid organism, only two of the alleles are expressed in the organism. Many genes have more than two alleles: for example: blood groups (discussed above) have 3 (A, B, O).
Example: Multiple alleles determine hair color in rabbits.
Genotype Phenotype
cc
Albino -white hair allover the body
Ch Ch Himalayan- Black hairs on the extremities and white hair on the body
Cch Cch Chinchilla- White hair with black tips over the entire body CC colored hair(wild type)
Cch and Ch show codominance: the genotype Cch Ch has white hair with black tips on the body but black hair at the extremities.
Cch and c show incomplete dominance: Crossing a chinchilla with an albino produces a rabbit with white hair with gray tips (light chinchilla). It’s genotype is Cch c.
Some alleles turn out to be lethal if carried by individuals. Most of these lethal alleles are only lethal when homozygous. Such alleles may distort the expected 3:1 ratio a 2:1 ratio may be observed because the individuals that are homozygous for the lethal allele die.
Example: coat color in mice.
Yellow coat color (Y) is dominant over brown coat color (y). When two heterozygous (Yy) mice mate, 2/3 of the offspring are yellow (Yy) and 1/3 are brown (yy). The 2:1 ratio results because the homozygous yellow (YY) mice die.
Yy x Yy ⇒ 1 YY (dead!) : 2 Yy (yellow) : 1 yy (brown)
Question: What happens if the lethal allele is sex-linked?
Answer: The offspring will skew toward having more females than males.
| Cross | Resulting Genotypes |
| AaBb x AaBb | 9: A_ B_ 3: A_ bb 3: aa B_ 1: aa bb |
The 3:1 ratio deals with one gene. The 9:3:3:1 ratio deals with two genes. However, there are many extensions to this ratio. The classical 9:3:3:1 ratio is produced when we cross two double heterozygote individuals exhibiting traits with complete dominance. The table below explains patterns which result via extensions of the 9:3:3:1 ratio.
| Ratio | Term | Meaning | ||||||||||||||||||||||
| 9:3:3:1 | No Effect | The expression of gene A has no effect on the expression of gene B. Interaction between gene A and B will lead to the variation of this ratio. There are 9 possible variations to this ratio. The most common ones result from epistasis, duplication, complementation, and suppression, and are listed below. | ||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9:7 | Complementation |
Complementation implicates different genes needed for a particular trait. If two recessive mutations are alleles of the same gene, then the phenotype of an organism containing both mutations is mutant; if they are alleles of different genes, then the phenotype of an organism containing both mutations is wild type (non mutant). The flower color in sweet peas. If you cross two mutant white flowers and end up with purple wildtype F1 progeny, complementation occurs.
At each pair of loci in the F1 heterozygous progeny, there is at least one dominant allele present. Therefore, all of these flowers will be purple. Would you expect to see complementation if the mutations were in the same genes? If the mutations are on the same gene, crossing two mutations will not restore the wildtype phenotype. In the other words, of the two mutations don’t complement. (Complementation is a commonly used test in genetics to examine whether the mutations are on the same gene). |
||||||||||||||||||||||
| 12:3:1 | Dominant epistasis | |||||||||||||||||||||||
| 9:4:3 | Recessive Epistasis |
Recessive epistasis, a condition where gene A masks the effect of gene B when A is homozygous aa. For example, the production of pigment (i.e. Albinism) in a species of snake. We are looking at two genes (P and G) which affect coloration in these snakes:
Cross two heterozygotes (green snakes): PpGg x PpGg
The phenotypic ratio observed here is 9:3:4. (9 green: 3 orange: 4 albino). In this case, the P gene determines whether pigment will be produced (P) or not produced (p). The G gene determines if the color produced will be green or orange. If the genotype is pp, then no pigment is produced, and the allele of the color (G) allele is irrelevant. |
||||||||||||||||||||||
| 9:7 | Suppression |
Suppression, where One genotype suppresses the expression of a second genotype. This usually results in only two phenotypes observed. Sometimes one gene suppresses the effect of an abnormal (mutant) allele of another gene, resulting in the normal phenotype. The wild type phenotype is therefore restored. |
||||||||||||||||||||||
| 13:3 | ||||||||||||||||||||||||
| 15:1 | Duplicate Genes |
Duplicate genes, where only one dominant allele of either gene is needed in order to produce a given phenotype (two genes share the same phenotype). This results in a 15: 1 ratio, where only the homozygous recessive are different from the rest of the offspring. Question: The plant shepard’s purse has 2 different fruits (heart shaped or narrow). A cross between 2 heart-shaped heterozygotes produces offspring with heart-shaped fruits and narrow fruits in the ratio of 15: 1. How might you explain this? Using an example, discuss why duplication is an important precursor to evolution. Answer: The observation that there are only two phenotypes might make you think that there is only 1 gene involved, which isn’t the case since you should see a 3:1 ratio in a monohybrid cross. However, if that gene becomes duplicated and you cross two individuals that are heterozygous for both copies of the gene, only l out of 16 will not have a single dominant allele. The dominant phenotype appears if a dominant allele is present for either copy of the gene. This is true because the genes have identical actions. Al a1 A2 a2 x Al al A2 a2 If there is a dominant allele for either Al or A2 a heart shaped fruit is produced. If a mutation occurs in one of the copies and causes production of an enzyme slightly different from the original, its carrier has not lost the gene’s function. Moreover, if the duplicated gene mutates again, causing it to produce a novel gene product that is valuable the individual may actually be at an advantage relative to other individuals. Duplication is an important precursor to evolution because with two identical genes one is free to evolve into something better. |
||||||||||||||||||||||
| 9:6:1 | ||||||||||||||||||||||||
The following questions refer to offspring ratios after the mating of individuals heterozygous for the suppressor (gene A) and heterozygous for the gene it suppresses (gene B), e.g. AaBb x AaBb
Question 1: What ratio of offspring phenotypes would you expect if suppression occurs when “gene A” possesses a dominant allele and the target gene produces a mutant phenotype when “gene B” is homozygous recessive?
A = suppressor gene: suppresses mutant pigment when genotype is AA or Aa
B = pigment gene: produces mutant pigment when genotype is bb
9 A_B_ normal
3 A_bb normal (mutant pigment is suppressed)
3 aaB_ normal
1 aabb mutant (not suppressed)
Ratio is 15 : 1 (normal : mutant).
Question 2: What ratio of offspring phenotypes would you expect if suppression occurs when “gene A” is homozygous recessive and the target gene produces a mutant phenotype when “gene B” possesses a dominant allele?
A= suppressor gene: suppresses the mutant pigment when genotype is aa
B= pigment gene: produces a mutant pigment when genotype is BB or Bb
9 A_B_ Mutant (not suppressed)
3 A_bb normal
3 aaB_ normal (mutant pigment is suppressed)
1 aabb normal
Ratio is 7 : 9 (normal : mutant).
Linkage is the tendency of genes, which are closely located on the same chromosome, to segregate together. The physical distance two genes on the chromosome is proportional to how frequently they cross over together during homologous recombination. Therefore, frequency of recombination is a tool for mapping chromosomes. Linkage analysis follows DNA regions, allowing not only mapping of a chromosome but also identification of disease genes. Association analysis is a similar approach for identifying disease genes, but instead just analyzes which alleles are present in diseased individuals throughout a population.
Key Concepts of Linkage
Loci cross over together in proportion to how near each other they are. Adjacent loci almost always cross over together; far-apart loci rarely cross over together. The proportion of recombinants to nonrecombinants is the recombination frequency, measured in Morgans and labeled θ (theta). The smaller θ gets, the closer the two loci. Recombination frequency can be used to draft a chromosome map.
A low recombination frequency (θ≈0) means two loci are tightly linked (close together and almost always assort together), while a high recombination frequency (θ≈0.5) means two loci are unlinked and could even be on different chromosomes. θ can be used to calculate the LOD score (determining whether two loci are linked) and the map distance (how far apart are two loci).
To determine recombination frequency, recombination events must be deduced. This requires that: at least one parent be heterozygous (informative); and the phase of the alleles must be known. The phase of an allele is simply whether it is on the maternal or paternal homologue. Two alleles on the same homologue are in coupling (they are cis) and two alleles on opposite homologues are in repulsion (they are trans). In a DdMm individual, D could have the same phase as either M or m; more information is needed to deduce which homologue bears which allele.
Imagine that genes HAIR and BRAIN are two genes present on Chromosome 13, and that HAIR and BRAIN each have two alleles. Alleles HAIRbrown and HAIRblonde are equally prevalent throughout the population; thus, any Chromosome 13 is equally likely to bear HAIRbrown or HAIRblonde. However, alleles BRAINsmart and BRAINdumb are not equally prevalent throughout the population. In fact, BRAINdumb is four times as prevalent as BRAINsmart; thus, any given Chromosome 13 has a 20% chance of bearing BRAINsmart and a 80% chance of bearing BRAINdumb.
When two alleles are in linkage equilibrium, then 20% of Chromosome 13′s bearing HAIRblonde or HAIRbrown would also bear HAIRsmart and 80% of Chromosome 13′s bearing HAIRblonde or HAIRbrown would also bear HAIRdumb. When an allele is in linkage disequilibrium, then it is more likely to associate with a certain allele of a different gene. For example, although BRAINsmart and BRAINdumb are present on 20% and 80% of Chromosome 13′s, a Chromosome 13 with HAIRblonde might also bear only BRAINdumb alleles while HAIRbrown bears the remaining 30% BRAINdumb and all the 20% BRAINsmart alleles. In this case, HAIRblonde and BRAINdumb are in strong linkage disequilibrium.
| Inheritance | Genomes | Outcome |
|---|---|---|
| Biparental | Maternal and paternal genomes. | Normal development. |
| Gynogenetic | Two maternal genomes. | Small placenta causes halt of embryonic development. |
| Androgenetic | Two paternal genomes. | Embryonic growth retarded. |
For most diploid species, sex is determined by one of the homologous pair of sex chromosomes. In humans and fruit flies, the sex chromosomes are X and Y (XX=female ; XY=male). In birds, female is WZ and male is ZZ. There are many ways of determining sex:
Since in humans, Y chromosome carries very little genetic information, sex-linked genes usually refer to X-linked genes (genes on X chromosome). To determine if a gene is sex-linked, you perform pedigree analysis, use Drosophila, or perform a reciprocal cross.
Also known as the testes-determining gene or sex-determining region on the Y chromosome, SRY is a gene located on the short arm of the Y chromosome and encodes a DNA-binding transcription factor known as testes-determining factor (TDF) early (and briefly) in development. Although SRY does not determine sex in all cases — it is not present in 10% of unambiguous XX males — it is a key instrument in the concert of sex determination.
The LOD score (aka Z) gives an estimation of how closely two loci are linked (for example, a marker locus and a disease locus) and quantitates sample size (data reliability). A LOD score less than 2.0 means the two loci are not linked; a LOD score between 2.0 and 3.0 is inconclusive; and a LOD score greater than 3.0 strongly indicates linkage. Lod scores are always reported in association with the recombination frequency θ (theta), measured in Morgans, which describes linkage without taking into account sample size.
| Step | Description |
|---|---|
| Step 1 | Look at nothing more than affected/unaffected individuals and determine the mode of inheritance. |
| Step 2 |
Cross out individuals who cannot be identified as recombinants or nonrecombinants (uninformative individuals).
|
| Step 3 |
Determine whether each informative individual is recombinant or nonrecombinant. Considering an informative individual’s haplotype, is it consistent with their parent’s haplotype and the mode of inheritance?
|
| Step 4 | Count how many recombinants and non-recombinants there are. |
| Step 5 |
|
![]() |
| Next Steps | If these concepts are unclear, please review core concepts of linkage analysis and try a few problems. |
|---|
| Topic | Overview | ||||||||||||||
| Cancer |
Once you have a tumor cells there is just a massive accumulation of more mutations. Tumors replicate more, and faster — each division is an opportunity for mutation. Sporadic cnacers are more common — you inherit susceptibility but most important mutations are somatic during your lifetime. There can be strong, almost mendellian predisposition. Mutations in tumor suppressor, tumor promoter (proto-oncogenes, which mutate to become oncogens) and DNA mismatch repair genes. Cancer is any tumor which can spread — a tumor develops in the following steps:
|
||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Familial vs Sporadic Cancers |
|
||||||||||||||
| Cancer Genes |
Chromosome abnormalities (like translocations), loss of heterozygosity, positional cloning of familial cancer genes and functional cloning all tip off which cancer genes are involved in a cancer.
|
||||||||||||||
| Loss of Heterozygosity |
Loss of heterozygosity is characterized by normal tissues showing two bands when probed — one for each allele — and tumor tissues only showing one band. This is an important mechanism of cancer, whereby a healthy cell with one faulty allele and one healthy allele suddenly has two faulty alleles. Loss of heterozygosity can occur via:
Recombination events are what usually lead to loss of heterozygosity. This happens via unequal crossing-over (improper recombiation) whereby part of one chromosome replaces the homologous region on the other chromosome, which leads to both chromosomes carrying the mutant region. |
||||||||||||||
| p53 |
Unlike other tumor suppressor genes, just one defective p53 allele is detrimental because it is a tetramer. Just one mutant p53 allele means that only 1/16 of all p53 tetramers will be function. Most common detrimental mutations involve p53 core domain that interacts with DNA. Germ-line mutation of p53 (Li-Fraumeni) leads to oncogenesis amongst almost all relatives, and of different tissues (breast/epithelial/etc). p53 inducers and mtuant phenotypes are:
|
||||||||||||||
| Proto-Oncogenes |
Normal cell require stimulatory signals to proliferate; tumor cells achieve autonomy where they proliferate without stimultion. Known proto-oncogenes include growth factors and their receptors (PDGF, FGF, EGF, CSF), signal transducers (ras, src, abl) and signal effectors (myc, fos, jun). Oncogenes arise via:
Oncogenes are isolated via the DNA transformation assay:
|
||||||||||||||
| Balanced Translocation | Not always detrimental. Same information, just different places. But if the break causes a fusion protein (Philadelphia chromosome) then it is inappropriately activated). | ||||||||||||||
| DNA Mismatch Repair | DNA Mismatch Repair Genes are needed for repairing DNA. Familial Adenomatous Polyposis (FAP)—germline mutation in APC tumor suppressor gene (Chr 5), followed by accumulation of additional mutations. Hereditary Non-polyposis Colorectal Cancer (HNPCC)—linked to multiple loci other than Chr 5. Not associated with LOH at these loci, but rather instability of repeated sequences. | ||||||||||||||
| Telomerase | Somatic cells have a finite replicative potential, after which point they reach a crisis and senesce. With each cell division, chromosome ends (telomeres) are shortened due to inability to replicate ends. 90% of tumors reactivate telomerase. It is required for genome stability, but the genome grows increasingly unstable once senescence is reached and genomes are usually unstable in the oft-oncogenic Telomerase-activated somatic cells (usually only activated for gametogenesis). | ||||||||||||||
| Deletion Mapping | Deletions at various points along gene; locate which deletions leads to phenotype. Critical region will be consistent in all deletions which lead to phenotype. | ||||||||||||||
| Microdeletions | Small chromosomal deletions which require FISH for visualization. An example of a microdeletion is the family of 22q microdeletions, leading to Velocardiofacial and DiGeorge syndromes characterized by CATCH 22 (cardiac defect, abnormal facies, thymic hypoplasia, cleft palate, hypocalcemia, 22q11 deletions). | ||||||||||||||
| William’s Syndrome | Deletion on chromosomal 7q11.3, involving the elastin locus and other genes; phenotype can reflect variety of severity. | ||||||||||||||
| Neurofibromatosis | An autosomal dominant disease involving the NF1 disease — a disease with 100% penetrance | ||||||||||||||
| Prenatal Diagnostics | CVS @ 11 weeks; early amniocentesis @ 12-15 weeks; routine amniocentesis @ 16-20 weeks. | ||||||||||||||
| Linkage Analysis | Frequently not possible for complex traits, constructs a transmission model to explain inheritance of a disease in a pedigree. | ||||||||||||||
| Allele-Sharing | Allele sharing methods involve testing whether affected relatives inherit a region identical-by-descent more often than expected under random Mendellian segregation, and makes no assumptions about inheritance mode. Just determines if certain alleles are more present in affected individuals. An exampel is sib-pair analysis. | ||||||||||||||
| Association Study | Is a given allele more present in affected than non-affected individuals? Involves whole populations, not just cosegregation in a family. For example, certain HLA alleles are very present in certain disease phenotypes. | ||||||||||||||
| GWA | Genome-wide association studies testing at thousands or millions of alleles for disease association. | ||||||||||||||
| SNPs | SNPs involed with disease are identified via tables of various sequences and locating which regions are consistently changed in diseased individuals but not wild-type individuals. | ||||||||||||||
| Rare Variants | Massive parallel sequencing allows identification of rare variations contributing to rare and common disorders. Randomly fragment DNA and ligate adapters to both ends. Bind single-stranded fragments randomly to the inside surface of flow cell surface. DNA fragments will form “U-shaped” bridges, with both adapters bound to the well surface. Add unlabeled nucleotides and enzymes to initiate amplification of these regions. Fragments become double-stranded, with the new strand being slightly shorter at both ends than the bridge. Repeating this will generate severa million dense clusters of dsDNA that is no longer bridged in each channel of the flow cell, then determine first base of all clusters using nucleotides which are each labeled differently so you can distinguish G, C, T and A — result is an image with lots fo differently-colored dots. Then remove that nucleotide, and repeat until you have sequenced the fragments. | ||||||||||||||
| Variation | Genotyping is identification of genetic difference, best way to locate rare mutation is sequencing. Genetic variations present in populations arise as a result of mutations. Dominant lethal variations are rare. Polymorphisms are common variations (1% or more) and probaly arose early in evolution. Single nucleotide polymorphisms (SNPs). Polymorphisms provie genetic markers that identify chromosomal regions segregating (linkage analysis). Restriction fragmnet length polymorphisms (RFLPs) identify DNA variations also. Tandem repeat sequence polymorphisms arise by replication slippage and are useful for this. PCR amplifies short tandem repeats and is most useful for linkage analysis. Genome wide association uses tousnads/millions of SNPs to identify common variations contributing to complex genetic traits such as genes for heart disease, diabetes or schizophrenia. Polymorphisms in the human population tend to be old.Polymorphism useful for identity, paternity and forensics testing. Lots o diffrent polymorphisms for high specificity. As well as migrations since migrated subpopulation within same species exlcusively have some alleles, and for history of genes (lactase persistience arose indepdently in African and Europe). Polymorphisms/mutations farm from exonic sequence an influence gene expression. Polymrphisms are RFLP, VNTR, STR, SNP. | ||||||||||||||
| DNA Fingerprinting | Paternity testing, establishing twin zygosity, determining bone marrow transplant engraftment, identifying mislabeled pathology species (major!), pedigree analysis of animals and animal products, and establishing identity of criminals. CODIX is an FBI-administered DNA index. | ||||||||||||||
| Mutations | are either deletions (Cri Du Chat), duplications, insertions or translocations. Biochemical genetics looks at one mutated gene at a time to establish roles . | ||||||||||||||
| Quinacrine Banding | Bright q-bands are AT rch, late replication and relatively gene poor | ||||||||||||||
| FISH | Labeling, denaturing, hybridiziation and visualization. There are gene-specific, centromeric, telomeric and chromosome-painting probes. Interphase FISH can locate extra centromeres, leading to easy identification of trisomies. Interphase gene-specific FISH would be good for diagnosing diseases, as it would just tell you if a disease allele is present. Metaphase FISH is useful for identifying small chromosomal deletions and translocations. | ||||||||||||||
| Competitive Genome Hybridization | Comparative genomic hybridization (CGH), aka Chromosomal Microarray Analysis (CMA), is a cytogenetic method for analyzing small mutations (like deletion) and for copy number changes (duplications or losses of a gene). It is frequently used for tumors. | ||||||||||||||
| SKY | Spectral karyotyping is a molecular cytogenetic technique used to simultaneously visualize all the pairs of chromosomes in an organism in different colors. Fluorescently-labeled probes for each chromosome are made by labeling chromosome-specific DNA with different fluorophores. Because there are a limited number of spectrally-distinct fluorophores, a combinatorial labeling method is used to generate many different colors. Spectral differences generated by combinatorial labeling are captured and analyzed by using an interferometer attached to a fluorescence microscope. Image processing software then assigns a pseudo color to each spectrally different combination, allowing the visualization of the individually colored chromosomes. This technique is used to identify structural chromosome aberrations in cancer cells and other disease conditions when Giemsa banding or other techniques are not accurate enough. | ||||||||||||||
| Chormosome abnormalities | Chormosome abnormalities are somatic/acquired, numerical/structural, balanced/unbalanced. | ||||||||||||||
| Structural Rearrangements | Pericentric or pracentric inversion; robertsonian or reciprocal translcoations. When balanced, leads to normal phenotype but problems in meiosis. Translocation, two chromosomes exchange parts. Reciprocal translocation is when two non-homologus chromosomes exchange DNA, requires two breaks. Robertsonian can be balanced or unbalanced. | ||||||||||||||
| Philadelphia Chromosome | |||||||||||||||
| Contiguous Gene Syndromes | |||||||||||||||
| Polyploidy | Typically via non-disjunction, where meiosis leads to two diploid cells and then four haploid cells. Howver, non-disjunction occurs in the second meiotic division and causes on cell to lack a chromosome and the other to be diploid still, with two normal haploid cells. Example is kilnefelter, 47,xxy. Monosomy (like Turner Syndrome, 45,x) is | ||||||||||||||
| Copy Number Variants | CNVs are larger insertions/deletions (Charcot Marie Tooth syndrome is an example). | ||||||||||||||
| Non-coding RNA | Most of human genome encodes RNA, but only 2% encodes protein. Non-protein-coding RNA genes may account for transfer RNA (tRNA), ribosomal RNA (rRNA), small nucleolar RNA (snoRNA) for RNA modification and processing, small nuclear RNA (snRNA) for mRNA splicing, miRNA that regulations mRNA transcription levels), antisenseRNA that inhibits translation of complementary mRNA. | ||||||||||||||
| Human vs Others |
Protein coding changes alone are unlikely to determine human-chimp differences. Sequence differences in non-coding DNA that influences expression levels are likely to be critical.Which species comparison is most useful? • Human-mouse comparisons (80 Myr) |
||||||||||||||
| Allele Frequency | For two-allele genes, AA=p2, AB=2pq and BB=q2. For genes with multiple alleles, just keep expanding… AA=p2, BB=q2, CC=r2, AB=2pq, AC=2pr, BC = 2qr. | ||||||||||||||
| Some alleles are deleterious in a homozygous but provide a heterozygous advantage, like sickle cell anemia. Dominant alleles with fitness <1 disappear after several generations; recessive allles with fitness <1 remain, just less frequently than other alleles. | |||||||||||||||
| Microsatellites | Microsatellites are small repeats like GAGAGA which expands or contracts due to DNAP slippage. Lots of bands is indicative of poor DNA matching or mismatch repair. | ||||||||||||||
| People with familial adenomatous polyposis have mutantions in AC tumor suppressor gene… lots of polyps by age 15, but then if one polyp mutates then TUMOR! hereditoary non… what? | |||||||||||||||
| Genomic Imprinting |
DNA methylation may mediate transcriptional repression by histone deacetylation and histone methylation. The methylation imprint established in early development is maintained by a “maintenance methylase” that recognizes hemimethylated DNA and methylates the CpG on the other strand. Methylation occurs most often in occurs most often in cytosines of CpG islands; these islands are relatively rich in CpG dinucleotides and are often associated with genes. Imprinted genes have CpG islands which are methylated differently on the maternal and paternal alleles. During gametogenesis, the old imprint must be erased and a new sex-specific imprint must be established. Different genes can be active or silenced on the same parental homologue. Beckwith-Wiedeman syndrome arises when a patient has both Chromosome 11′s from the same parent — this leads to bilallelic expression of IGF2 (growth factor gene) and biallelic silencing of H19 (tumor suppressor gene). Hypermethylation leads to chromatin condensation and gene silencing. Not all imprinted genes are hypermethylated at their promoters; some are hypermethylated at intergenic control regions called imprinting centers that control imprinted gene cluster (called enhancer blocking). Identical deletions of Chr 15 cause PWS when maternal UPD (silencing of matneral genes) inherited or AS when paternal UPD (paternal alleles silenced). So far all PWS cases are new due to sterility of patients; some AS cases are familial, with unaffected fathers passing a mutant gene to half their offspring of tiether sex. If a son passes the mutant on it will not show. If a daughter does then half her kids will have AS, leading to its appearance in multiple progeny over mutliple genrations. |
||||||||||||||
| Euchromatic Genes | Active -regulated- Inactive (silenced) –Normal gene regulation @ development –Abnormal (cancers) –Normal parent specific germ line silencing (imprinting) |
||||||||||||||
| Aneuoploidy | |||||||||||||||
| Uniparental Disomy | Both chromosomes are from one parent — isodisomy if they are identical, heterodisomy if not. Examples are Prader-Willi, Angelman, Beckwith-Wiedemann, . | ||||||||||||||
| Hemoglobin | Oyxgen carrier in vertebrate red blood cells. Tetramer, with 2 identical α chains and 2 identical β chain. Each globin chain covalently linked to a heme group, an iron-containing pigment binding oxygen The common adult hemoglobin is HbA, written α2β2. Classic model for proteins and mutations. Mutations resulting in amino acid substitution lead to a difference in the charge of the protein, are co-dominant as electrophoreti phenotypes and easily seen in heterozygotes. Patients without HbA are anemic. | ||||||||||||||
| Electrophoretic Mobility | Uses amino acid changes leading to protein charge differences to identify mutants. Different Hbs can be separated from each other using electrophoretic mobiliity. | ||||||||||||||
| Sickel Cell | Autosomal recessive. Heterozygote (only!) advantge resistance to malaria. HbS is due to a single amino acid substitution in the β globin chain. Fiber polymerization is new protein property of HbS | ||||||||||||||
| Hb Hammersmith | Reduced O2 binding by heme group. | ||||||||||||||
| Hb Kempsey | Binds but fails to release O2 | ||||||||||||||
| Compound heterozygotes | Different mutations in same gene in a patient have disease similar to homozygosity for mutant allele. For example, BetaS/BetaHammersmith is affected. | ||||||||||||||
| X Chrsm Inactivation |
Explains why monozygotic twins might have different phenotypes, with one child having an X-linked disorder and the other child not. In a normal female, only one of the two X chromosomes present is genetically active, the other being inactivated. X-inactivation occurs early in development. The inactive X can be either maternal or paternal in origin; the choice is random. X inactivation is irreversible in somatic cells, such that the inactive X in a particular cell remains inactive in all descendants of that cell. In most mammals, one X in females is inactivated. Alternatives are for one X in male hypertranscribes, two X’s in female hypotranscribe or the apternal X chromosome is inactive. In the early zygote in/activation, all descendant cells have inactivation or in alte blastocyst it is random and their is mosaicism. Properties of the inactive X chromosome:
Epigenetic regulation is stable transmission of gene expression to daughter cells in absence of change in DNA content or sequence. This is due to chromatin structure, DNA mod or both. Methylation imprint established in early dvelopment is maintined by maintenance methylase that recognizes hemi(one strand)methylated DNA and methylates CpG on other strand. CpG dinucleotides are targets for DNA methylation, and methlyated CpG are targets for specific binding by proteins like MeCP2. MeCP2 recruits histone deacetylases that remove acetyl groups from histone tail. X inactivation involves a recognition step (referred to as counting counting) in which the number of X chr in a cells is counted relative to cell ploidy so that only a single X chr is functional per diploid adult cell.
|
||||||||||||||
| der(a)/der(x) | Abnormal X chromosomes are usually inactivated, due to selection since the abnormal cells are fault. If there is an X;autosome translocation, the normal X chromosome is preferentially inactivated so that there is not ianctivation of the autosome. However, the progeny are unbalanced and carry the X inactivation center leading to ivnariable inactivation of athe chromosome with the normal X always active. Se page 103. | ||||||||||||||
| Xic | An 80 kb region of the X-chromosome that is required for X-inactivation to occur. It is responsible for initiating X-inactivation in cis: An X-chromosome that carries Xic can become inactivated, whereas one in which Xic is missing cannot. The Xic is also responsible for “counting”, whereby a single X is kept active and all other Xic carrying chromosomes are inactivated. First gene identified in Xic recgion was X inactive specific transcript (Xist) — this gene is expressed exclsuively from inactive X chromosomes, procuing a spliced noncodiing trancript that is the primary signal for spreading inactive state along chromosome. Xist recruits histone deacetlyases and a unique histone called macroH2A which causes chromatin condensation and gene silencing. A second Xic gene is Tsix (reverse of Xist) which encoded a noncoding RNA that controls Xist in cis and is complementary to Xist. How might Tsix inhibit Xist expression??? | ||||||||||||||
| Xic Crosstalk |
The choice of Xa and Xi always occurs in a mutually exclusive manner. What mechanism mediates the crosstalk?Hypothesis: Choice is mediated by a physical interaction of X-chromosomes (chromosome kissing). Experiment: Measure distance between Xic regions during ES cell differentiation. During XI establishment the X chromosomes become very near and then separate. This can be done by formaldehyde induced crosslinks of near DNAs, followed by digestion, ligation of the two DNA segments and then detect ligation product by quantitative PCR. |
||||||||||||||
| Barr Body | The number of Barr bodies is thus one less than the number of X-chromosomes. The condensed, single X-chromosome, appearing as a densely staining mass, that is found in the nuclei of somatic cells of female mammals. Is derived from one of the two X-chromosomes which becomes inactivated. Barr bodies are commonly referred to as sex chromatin. The human abnormalities called Kleinefelter’s syndrome and Turner’s syndrome both result from an unnatural presence or absence of a Barr body. In the case of the former, the male possesses a Barr body that it would normally not have, and in the latter case the Barr body is absent. There is logically lots of Xist RNA in in individuals with inactivated X chromosomes (none in normal 46,XY males). | ||||||||||||||
| Turner Syndrome | Why do 45,X females (only one X chromosome) have such severe phenotypes? | ||||||||||||||
| Werner Syndrome | Very rare autosomal recessive progeria. | ||||||||||||||
| Alzheimer’s | Amyloid Precursor Protein (APP) leads to amyloid plaques and memory deficits — also, presenilin 1 and apolipoprotein E4 enhance the Alzheimer’s phenotype. | ||||||||||||||
| Angelman/PWS | Genetic info only from father in Angelman; only from mother in PWS. Involves Chromosome 15. 15q11-q13 is a tipoff of these disorders. | ||||||||||||||
| Mouse Genetics | Inbred strains: unlimited identical twins with a unique set of alleles useful for research purposes. Spontaneous mutations, however, influence gene expression, structure or presence. Transgenics is the random insertion of engineered genes to produce enhanced expression, and can be inducible. Targeted mutations alter endogenous genes in a controlled manner to create null or mutant alleles. Induced mutations are random chemically induced point mutations; gene-trap insertions inactivate genes. | ||||||||||||||
| Transgenic Mice | This adds a new gene to the normal genes. DNA construct is microinjected into pronuclei of fertilized eggs; these eggs are transferred to pseudopregnant females. 30% of surviving progeny will have the microinjected DNA heritably integrated into their genome. Transgenic animals have variability transgene expression due to construct, promoter, site of integration and number of copies inserted. This is usually measured using mRNA expression. | ||||||||||||||
| Regulated Transgene | Transgenes can be expressed in a reversible or inducible manner. There is a bacterial tetracycline repressor protein which binds strongly to tetracyclne operator sequences, but is displaced in presence of tetracycline, its derivativees or its analogs. The repressor protein is then converted into an activator by fusion with the transcriptional activation domain from a viral protein. | ||||||||||||||
| Gene-Targeting | Gene-targeting allows creation of knockouts and insertion of new genes. | ||||||||||||||
| ES Cells | ES cells are usually strain 129 and recipient blastocyst is C57BL/6, leading to possible observed phenotype due to the targeted gene being expressed on different genetic backgrounds. | ||||||||||||||
| Cre/Lox | The Cre/Lox system is used for making conditional knockouts. The essential exon is flanked by loxP sites (locus of X-ing over) which are 34 base pair sites where Cre (causes recombination) takes effect — Floxed regions (flanked by loxP sites) are excised. However, mating two engineered mice — one who carries a Floxed region, and a transgenic mouse expressing Cre in specific tissues — allows genes to be knocked out in certain tissues only. Also, Cre can be attached to a promoter activated by specific chemicals that can be added at desired times to the animal’s diet. | ||||||||||||||
| Knock-In | A targeting vector can be gene-targeting to recombine and insert a new gene and a Floxed neomycin gene into ES cells. These ES cells are then transfected with CMV expressing a Cre gene. This leads to a targeted locus without neomycin, and these ES cells are injected into blastocysts. | ||||||||||||||
| Author | Question | Answer | |
| Fagundes et al | Question? | Are Native American haplogroups part of a founding population (explaining whether North America was settled by a single or multiple migrations) and what was the path and era of the migration(s) across the Americas? | |
|---|---|---|---|
| Importance? | Human history. | ||
| Strategy? | Analysis of haplogroups within mitochondrial genomes, correlated to geological events (glacier melting), weather changes (ice age) and archaeologial finds. | ||
| Results? | North America was settled by a single founding population which migrated along a Pacific coastal route from Asia and throgh | ||
| Interpretation? | |||
| Next Step? | |||
| Hanna et al | Question? | Can induced pluripotent stem cells stimulated into hematopoietic progenitors and transgenic for a healthy β chain allele, resolve a sickle cell anemia mouse model? | |
| Importance? | Possible future stem cell treatments of patients with sickle cell anemia. | ||
| Strategy? | Induce epithelial cells into pluripotency, gene target to replace an hβs allele, stimulate into hematopoietic progenitor cells and transplant into irradiated mice. Then examined red blood cells via direct microscopy (for evidence of sickling), PCR of hβs and hβa DNA and electrophoresis of hemoglobin proteins to determine if hemoglobin A is being expressed. | ||
| Results? | Via every manner of examination, the mice apparently had functional red blood cells with hemoglobin A and not only hemoglobin S post-treatment. | ||
| Interpretation? | The presence of hemoglobin A — and the lack of sickling, the direct phenotype — indicates a successful experiment. | ||
| Next Step? | Developing similar iPS treatments for recessive thalassemias, and possibly even for other kinds of genetic diseases. | ||
| Kagami et al | Question? | What mutations in imprinted Chromosome 14 regions can lead to phenotypes resembling maternal or paternal upd(14)? | |
| Importance? | Identifying genes and their products involved in the maternal and paternal upd(14) phenotypes opens up potential for gene therapies. | ||
| Strategy? | Identify individuals with phenotypes resembling paternal and maternal upd(14) but who are not actually uniparentally disomic for Chromosome 14. Examine the imprinted regions within Chromosome 14 for deletions, variations, epimutations (methylation analysis) and FISH analysis. | ||
| Results? | Found mutations of maternally expressed genes (MEGs) MEG3, RTL1as and MEG8 for most patients with a phenotype resembling paternal upd(14). Found mutations of paternally expressed genes (PEGs) DLK1 alone, or DLK1 and RTL1 for patients with a phenotype resembling maternal upd(14). | ||
| Interpretation? | Overexpression or underexpression of genes in imprinted regions of Chromosome 14 lead to a phenotype resembling maternal or paternal upd(14). | ||
| Next Step? | Knocking out and knocking in of DLK1 and RTL1, and MEG3, RTL1as and MEG3, to simulate overexpression and underexpression of the genes potentially leading to maternal and paternal upd(14) phenotypes. | ||
| DMR = Differentially Methylated Region. Epimutations = Epimutations are stable (affect progency cells) changes in gene expression not involving DNA sequence changes. |
|||
| Moller et al | Question? | Does DYRK1A play a role in Down Syndrome? | |
| Importance? | Global genetic disease. | ||
| Strategy? | Examine non-Down-Syndrome patients with Down Syndrome similar phenotype. | ||
| Results? | Patients had truncation of DYRK1A gene. | ||
| Interpretation? | Down Syndrome patients have extra DYRK1A gene. Patients in this study had truncated DYRK1A gene. DYRK1A gene expression must be tightly regulated or microcephaly, low weight, low height and possibly phalangeal defects can occur at varying intensities. Overexpression or underexpression of DYRK1A gene both lead to disease phenotype. | ||
| Next Step? | Create mouse models of overexpression (transgenic for constitutive DYRK1A promoter) or undexpression (knockout of DYRK1A) to examine if DYRK1A gene alone is responsible for the disease phenotype. | ||
A genetically normal human has a maternal copy and a paternal copy of 23 different chromosomes. With 23 pairs of each chromosome, a genetically normal human has 46 chromosomes total. There are exceptions — for example, sperm cells are haploid and contain only one copy of each chromosome (23 chromosomes total). The number of chromosomes within a haploid cell is referred to as n. Thus, a cell with 2 copies of each chromosome (a diploid cell) has 2n chromosomes; a cell with 3 copies of each chromosome (a triploid cell) has 3n chromosomes; and a cell with 4 copies of each chromosome (a tetraploid cell) has 4n chromosomes.
Not including reproductive cells, if an individual has any other than 2n chromosomes then they are heteroploid. If a heteroploid has an exact multiple of n chromosomes (such as 69 or 92, in humans) then that individual is euploid. If a heteroploid does not have an exact multiple of n chromosomes (such as 45, 47 or any value ≠ 46) then that individual is aneuploid. Although rare, triploidy and tetraploidy are forms of euploidy. Paternal chromosome triploids (individuals with an extra copy of every paternal chromosome) have abnormal placentas and are known as partial hydatiform moles; however, maternal chromosome triploids are spontaneously aborted early during gestation.
Aneuploidy is more common than euploidy. Most aneuploidy is trisomy (three copies of an individual chromosome, instead of a pair) or monosomy (one copy of an individual chromosome). The most common trisomy is trisomy 21, which occurs in 95% of Down Syndrome patients. Chromosomes 21, 13 and 18 contain the fewest genes of any other chromosome — trisomy of any chromosome other than these three deadly. Monosomy of any chromosome except Chromosome X (a condition called Turner Syndrome) is lethal. Aneuploidy is usually caused by meiotic nondisjunction, meaning that (usually during meiosis I) chromosomes fail to disjoin properly and daughter cells either have too many or too few chromosomes.
Chromosome structure abnormalities can be either unbalanced rearrangements or balanced rearrangements. The various kinds of unbalanced rearrangements are detailed below. Unbalanced rearrangements involve deletion, duplication or both. Deletion of a chromosome segment leads to partial monosomy of that segment. Duplication of a chromosome segments leads to partial trisomy of that segment. Any mutation leading to an imbalance like monosomy, trisomy or other can result in an abnormal phenotype.
| Kind | Overview |
|---|---|
| Deletions |
A deletion is loss of a chromosome segment. There are two kinds of deletions: terminal, at the end of a chromosome; and interstitial, along a chromosome arm. Deletions can occur if a chromosome simply breaks and part of it floats away. Also, deletions can occur of misaligned chromatids cross over unequally. In addition, deletions can occur if balanced translocations or inversions segregate abnormally. A carrier individual — with one chromosome carrying the deletion, and another chromosome lacking the deletion — is monosomic for the affected genes. Monosomy usually leads to haploinsufficiency, which arises when two copies of a gene are necessary and only one copy is insufficient. Monosomy of a hormone-encoding gene is usually haploinsufficient, since one gene copy cannot encode for enough production of that hormone. |
| Duplications | Duplications, like deletions, can arise by unequal crossing over or abnormal segregation. Phenotypic abnormalities can arise if the duplication results in an extra gene copy (partial trisomy), leading to over-expression. Also, duplications within a gene can destroy its function, thus leading to haploinsufficiency. |
| Markers & Rings |
Marker chromosomes (aka supernumary chromosomes or extra structurally abnormal chromosomes) are very small and unnamed chromosomes which usually exist in addition to a normal diploid. Marker chromosomes usually consist of just centromeric heterochromatin. Larger marker chromosomes, though, contain some material from one or both chromosome arms; this disruption can lead to phenotypic abnormalities. Neocentromeres are marker chromosomes stable during mitosis which lack centromeric DNA, but which somehow have centromere activity. When a chromosome undergoes two breaks, chromosomal DNA can separate and form a ring structure. This form of marker is known as a ring chromosome. Although quite rare, ring chromosomes containing centromeric DNA are mitotically stable. However, sometime sister ring chromatids can tangle while trying to separate during anaphase. This tangling can lead to breakage, resulting in demise of the ring chromosome or even causing new ring chromosomes to form of unequal size. Thus, ring chromosomes are rarely found in all of an organism’s cells. High-resolution banding is ineffective at detecting marker chromosomes because most makers are too small to even discern the bands. However, FISH using probes for centromeric heterochromatin can usually visualize marker chromosomes. |
| Isochromosomes | Isochromosomes arise when one arm of a chromosome goes missing, and the remaining arm is duplicated to form a mirror image. In a diploid cell, this leads to partial trisomy of the genetic material from one arm (the duplicate arms, and arm on the homologous chromosome) and partial monosomy of the genetic material from the other arm. |
| Dicentrics | Although very rare, a dicentric occurs when centromere-containing fragments fuse together to form a new chromosome with two centromeres. Dicentrics can be mitotically stable if one centromere is inactivated, or if the chromosome is pseudodicentric. A pseuodicentric chromosome has centromeres which move to only one pole or the other during anaphase. |
High-resolution banding can detect deletions and duplications on the magnitude of several thousand base-pairs, and FISH can detect very small or uncertain deletions.
Chromosome structure abnormalities can be either unbalanced rearrangements or balanced rearrangements. Balanced rearrangements usually do not lead to phenotypic abnormalities; all genetic information is present in the right amount, just in a different order or in the wrong location.
| Kind | Overview |
|---|---|
| Inversions |
An inversion occurs in two steps: a chromosome breaks in two locations; the chromosome refuses with the segment between the breaks inverted. When both breaks are in the same arm, the inversion is paracentric; when there is one break in each arm, the inversion is pericentric. Paracentric inversions do not usually affect phenotype, and are hard to detect using banding or FISH. However, offspring of an inversion carrier are at risk because a loop forms when paternal and maternal chromosomes cross over; aberrant recombinations can occur within this inversion loop. Pericentric inversions, though, can lead to unbalanced gametes with duplication and deficiency of chromosome segments. |
| Translocations |
Translocation is the exchange of chromosome segments, usually between nonhomologous chromosomes. There are two primary types of translocations: reciprocal translocations and Robertsonian translocations. A reciprocal translocation occurs when segments from nonhomologous chromosomes break off, with segment attaching to the other chromosome and vice-versa. A Robertsonian translocation occurs when two acrocentric chromosomes (chromosomes with tiny short arms, meaning the centromere is near one end) fuse at the centromere and lose their short arms. Reciprocal translocations usually involve only two chromosomes, meaning the total chromosome number is unchanged. Reciprocal translocation are typically harmless, despite being more common in individuals so retarded that they require institutional care. There are three kinds of reciprocal translocation: alternate; adjacent-1; and adjacent-2. Robertsonian translocations lead to a balanced karyotype with only 45 chromosomes. Because acrocentric short arms contain just multiple copies of ribosomal RNA genes, loss of acrocentric short arms is not phenotypically important. A third type of translocation — an insertion — occurs when a segment from one chromosome inserts into a different chromosome (either in its original orientation, or inverted). Inversion translocations are rare because they require three chromosome breaks. Abnormal segretation in insertion carriers can lead to offspring with deletions or duplications of the inserted segment. Alternatively, offspring can be normal or balanced carriers. |
There are numerous diseases which arise from expansions in (CAG)n repeats, which encode glutamine. In a coding region, a (CAG)n repeat expansion can either inactivate a protein or exhibit a dominant phenotype due to poly-glutamine’s toxic effects; examples include Huntington Disease and Spinocerebellar Ataxia (SCA) type I. In a non-coding region, a (CAG)n repeat expansion can interfere with chromatin structure and disrupt transcriptional regulation, and/or exhibit a dominant phenotype by distracting splicing factors with the triplet repeat; examples include Myotonic Dystrophy, Fragile X Syndrome and other SCA forms.
Patients with Chronic Myelogenous Leukemia have undergone a translocation between chromosomes 9 and 22 to form a Philadelphia chromosome (so named due to where it was discovered) containing the tip of chromosome 9 and a portion of chromosome 22, and with an activated oncogene.
Thalassemias are a heterogeneous group of hemoglobinopathies whereby a reduction in synthesis/stability of α or β globin chain causes α- or β-thalassemia, respectively. The clinical problem is excess β peptides (in α-thalassemia) or excess α peptides (in β-thalassemia); this imbalanced α:β ratio leads to early cell destruction and iron overload. The chain produced at a normal rate becomes relatively overabundant, eventually precipitating in the cell, damaging the membrane and causing premature hemolysis.
The heterozygote advantage is that thalassemias weaken red blood cells and confer some resistance to malaria. Upon infection, the cell just dies and prevents spread. Individuals that are α-α-/α-α- die in utero, although this does not include heterozygous mutations in different globin genes. Expression of the globin gene locus is managed by a locus control region (LCR).
| Treatment | Overview |
|---|---|
| Drug Therapy | Possible drug therapy to influence LCR to prolong γ-globin (fetal) expression. |
| Gene Therapy | Using a lentivirus vector to add back normal β-globin gene or normal γ-globin (fetal) genes. Attempts without a new LCR failed; additions require the LCR for consistent, strong globin expressionAdditions require LCR in the construct for consistent, strong globin expression. Early attempts failed without the LCR. |
β-globin deficiency. Mostly due to single base-pair substitutions in the β-globin genes. Much more heterogenous than α thalassemias, even in high frequency populations. Complex β thalassemias involve β globin plus one or more of the genes from the β globin cluster LCR. β globin cluster is totally dependent on its LCR for expression.
Due to α-globin deficiency. Excess γ-globin chain form g4 tetramer (Hb Bart’s). Excess β-globin chains form b4 tetramer ( Hb H). These are ineffective oxygen carriers.
Nearly all disease genes have mutant alleles which affect splicing of the mRNA, rather than protein coding. Exon skipping can arise via mutations in the 5′ splice site, the 3′ splice site, and/or exonic or intronic splicing enhancers. Activation of cryptic splice sites (thus shortening the exon) can arise via mutated exonic splicing enhancers. Examples of diseases due to mRNA splicing errors include Familial Isolated Growth Hormone Deficiency (mutations in the GH-1 gene), Cystic Fibrosis (mutations in the CFTR gene), Spinal Muscular Atrophy and some thalassemias (mutations in the β-globin gene that activate a cryptic 3′ splice site).
Spinal Muscular Atrophy is the most common genetic cause of infant death, occurring in 1 in 6,000-10,000 births. This is an autosomal recessive disorder caused by mutations in the SMN1 gene, encoding a protein involved in snRNP maturation. Humans have undergone an SMN gene duplication generating the SMN2 gene. Even though it encodes an identical protein, the presence of SMN2 cannot compensate for the loss of SMN1 in spinal muscular atrophy, because its splicing is abnormal and it does not produce functional protein.
Splicing targeted therapies are being tested for Spinal Muscular Atrophy. A single C to T mutation in an exonic splicing enhancer of SMN2 exon 7 causes exon skipping and production of a truncated protein from 80% of the SMN2 mRNA. If drugs or other therapies could be found that increase exon 7 splicing of SMN2, one could potentially treat the disease even without correcting the SMN1 defect.
A hemoglobinopathy is any disorder of hemoglobin, the red blood cell protein which binds oxygen. Oxyhemoglobin has a reduced orion (does not carry oxygen); methemoglobin has an oxidized iron (carries oxygen). Methemoglobin reductase reverses the oxidation, thus unload the oxygen. Some mutant hemoglobin is resistant to methemoglobin reductase, and the heme iron thus remains permanently oxidized. Heterozygotes accumulate methemoglobin, leading to asymptomatic cyanosis (blue skin). Homozygotes are completely unable to unload oxygen and have thus not been found. Separate from reductase-resistant methemoglobin are diseases where hemoglobin has altered oxygen affinity. For example, Hb Kempsey binds oxygen very strongly and cannot unload oxygen to tissues. Encoded by an autosomal dominant, Hb Kempsey heterozygotes over-proliferate RBCs in an attempt to resolve low oxygen levels in tissues; HB Kempsey is lethal for homozygotes.
Clinically significant structural variants of normal hemoglobin are synthesized in the proper amount, but have structural defects which can cause or interfere any of the mechanisms described below. Structural variants are typically caused by point mutations, and are responsible for Sickle Cell Disease. Recall that Amino1#Amino2 means ‘Amino1′ replaced ‘Amino2′ at position ‘#’ on the polypeptide.
| Hb | Impact | Usual Mutation | Overview |
| HbS | Hemolytic Anemia | β Chain: Glu6Val | HbS is responsible for the automosal recessive Sickle Disease Disease. |
|---|---|---|---|
| HbC | Hemolytic Anemia | β Chain: Glu6Lys | Oxygenated HbC crystalizes, causing mild hemolysis and circulatory obstruction. |
| HbH | Hemolytic Anemia | β Chain: Phe42Ser | Hb Hammersmith (β4) is unstable, precipatory, hemolytic and autosomal dominant. |
| HbM | Oxygen Transport | β Chain: His92Tyr | Methemoglobin cannot unbind oxygen, causing cyanosis. Autosomal dominant. |
| HbK | Oxygen Transport | β Chain: Asp99Asn | Hb Kempsey cannot unbind oxygen, causing autosomal dominant polycythemia. |
| HbE | Thalassemia | β Chain: Glu26Lys | Autosomal recessive thalassemia due to mutant structure and reduced synthesis. |
| Clinical Significance | Overview |
| Hemolytic Anemia | Hemolytic anemia (lysis of red blood cells) is typically caused by an unstable hemoglobin. Notable exceptions are HbS (sickle cell hemoglobin) and HbC, which are very stable but still detrimental to RBCs. |
|---|---|
| Oxygen Transport | Altered oxygen trasnport is caused by mutant hemoglobin with increased or decrease oxygen affinity. An example is methemoglobin, which is incapable of releasing bound oxygen (as opposed to oxyhemoglobin, which can reversibly oxygenate). |
| Thalassemia | When one hemoglobin chain is unavailable, abnormal hemoglobin form which do not function properly. This condition — thalassemia — causes anemia. Also, hemolysis results from extremely unstable abnormal hemoglobin which may form. |
A hydatid cyst (also known as a mole) is a mass of placenta-derived cells resembling a grape cluster. Complete hydatidiform moles contain only paternal chromosomes but have a 46,XX karyotype. Complete hydatidiform moles (or complete moles, for short) develop into a hydatid cyst. Complete moles contain no normal placenta and no fetus. Ovarian tetranomas are cases with only maternal chromosomes which exist only as benign tumors.
Partial hydatidiform moles (or partial moles, for short) are triploids containing either an extra maternal chromosome set or an extra paternal chromosome set. Cases with an extra set of maternal chromosomes are severely retarded with small, fribrotic placentas. Cases with an extra set of paternal chromosomes have abundant trophoblast (“pre-placental”) growth and poor embryonic development.
X-Linked Severe Combined Immunodeficiency (SCID) stems from a mutation in either common γ chain gene or IL-7 receptor-α gene. A mutation in the common γ chain gene leads to an inability to bind (and hence respond) to cytokines IL-2,4,7,9,15,21. Without activity of those cytokines, the individual cannot produce T cells, B cells nor NK cells; the patient has bubble boy disease and almost no ability to fend off infection. A mutation in the IL-7 receptor-α gene leads to a lack of T cells but present B cells and NK cells. Based on this — and the paragraph above — it is clear that IL-7 is critical for T cell formation, but not B cell nor NK cell formation.
It is worth noting that mice require IL-7 for both T cell and B cell formation. SCID mice are extremely useful research tools. Lacking B and T cells, they are unable to mount an adaptive immune response. Thus, they do not reject transplanted tissues and are also useful for studying how to restore hematopoiesis. Also, SCID has been successfully treated via gene therapy techniques to introduce a functioning common γ chain gene into hematopoietic stem cells.
Down Syndrome (aka trisomy 21) is one of only three thoroughly-studied viable non-mosaic autosomal disorders (the others being trisomies 13 and 18), all of which result in stunted growth, mental retardation and multiple congenital deformities. The phenotype of Down Syndrome involves: hypotonia (low muscle tone) at birth; dysmorphic facial features; folded, low-set ears; flat nasal bridge; mental retardation, with an average adult IQ or 30-60; and, in about a third of all cases, congenital heart disease.
Down Syndrome is the most frequent and studied chromosomal disorder, and can arise in several different ways. All of these avenues somehow involve trisomy of Chromosome 21, and are detailed below.
| Genotype | Overview |
| Trisomy 21 | Approximately 95% of Down Syndrome cases arise from meiotic nondisjunction of the chromosome 21 pair, leading to trisomy 21. This nondisjunction is maternal in 90% of cases, usually occurring during meiosis I. In the paternal 10% of cases, the nondisjunction usually occurs in meiosis II. |
| Robertsonian Translocation | Approximately 4% of Down Syndrome patients have Robertsonian translocations. These patients arise when the long arm of Chromosome 21 (Chromosome 21q) fuses with the long arm of another chromosome, and the short arms break off (but are not considered their own chromosome). If this occurs during meiosis, then cells arise that are trisomic for Chromosome 21q, with: 2 normal Chromosome 21′s; and a Robertsonian chromosome containing Chromsome 21q and the long arm of another chromosome. Usually a translocation with Chromosome 14, this Robertsonian chromosome is denoted der(14;21). |
| 21q21q Translocation | Actually an isochromosome, this situation usually occurs postzygotically when one Chromosome 21 breaks apart into a short arm and a long arm; the long arm duplicates, forming a chromosome composed of two Chromosome 21 long arm. |
| Mosaic Down Syndrome | Accounting for approximately 2% of patients, mosaic Down Syndrome can range from very mild to very severe, depending on the number of affected cells. This variability possibly reflects what proportion of embryonic cells carried the disease genotype. |
| Partial Trisomy 21 | There are a few rare patients who have trisomy of only a portion of Chromosome 21; these patients give tremendous insight into which genes are responsible for disease when triplicate. |
Trisomy 18 is one of only three thoroughly-studied viable non-mosaic autosomal disorders (the others being trisomies 18 and 21), all of which result in stunted growth, mental retardation and multiple congenital deformities. Trisomy 18 frequently arises via translocation of all or most of Chromosome 18. Trisomy 18′s phenotype is mental retardation in all cases, with most cases also involving: severe heart malformation; a receded jaw (overbite); low-set malformed ears; rocker-like feet; and hypertonia, usually leading to unique clenched fists characteristic of trisomy 18.
Trisomy 13 — occurring in 1 in ∼20,000 births — is one of only three thoroughly-studied viable non-mosaic autosomal disorders (the others being trisomies 18 and 21), all of which result in stunted growth, mental retardation and multiple congenital deformities. Trisomy 13 (aka Patau Syndrome) leads to severe growth and mental retardation, usually accompanied by: a sloped forehead; malformed ears; clenched fists and rocker-bottom feet (as in trisomy 18); congenital heart defects; urogenital defects; and polycystic kidneys. Most trisomy 13 individuals gain their extra chromosome via maternal nondisjunction in meiosis I.
Many thanks to ThereseAnn, the commenter who provided incredible links to learn more about trisomy 13. I urge StudentReader.com visitors to peruse LivingWithTrisomy13.org, a site about fellow human beings living with Trisomy 13.
Sickle cell hemoglobin (HbS) results from a single nucleotide substitution which changes the codon of the sixth β-globin amino acid from glutamic acid to valine (GAG→GTG: Glu6Val) and is the causative agent of sickle cell disease when homozygous. Homozygotes have α2Aβ2S hemoglobin. Heterozygotes (said to have sickle cell trait) have α2Aβ2S (HbS), α2Aβ2A (HbA) and α2Aβ2Sβ2A (HbS/HbA hybrid) hemoglobin, and are clinically normal despite mild symptoms under low oxygen pressure. In Sickle Cell Disease, the HbS β chains form chains, aggregating into long fibers which deform the cell, impair its function and result in hemolysis for homozygotes.
Hyperphenylalaninemias are enzymopathies (enzyme inactivity) and aminoacidopathies (inappropriate amino acid processing) that lead to an increase in phenylalanine blood levels due to improper pheylalanine catabolism. They are all autosomal recessive and exemplified by phenylketonuria and phenylalanine hydroxylase deficiency. All hyperphenylalaninemias stem from loss of function in phenylalanine hydroxylase (PAH or PheH, a liver enzyme which catabolizes phenylalanine to tryosine) and/or genes required to synthesize its cofactor, tetrahydrobiopterin (BH4).
Classic phenylketonuria (PKU) stems from an autosomal recessive PAH mutation that renders it inactive. Unable to degrade phenylalanine, PKU patients accumulate phenylalanine in their body fluids and the ensuing hyperphenylalaninemia damages the developing nervous system and mature brain. It is treated by complete diet restriction of phenylalanine; if no phenylalanine is ingested, then it cannot accumulate in the body. Phenylalanine hydroxylase variants are frequently benign, with many people being heterozygous at the gene encoding PAH. The few clinically significant PAH alleles can lead to near-complete PAH activity (classic PKU) or variant PKU (aka non-PKU hyperphenylalaninemia). Variant PKU occurs when PAH carries enough residual activity to lessen the disease phenotype, or prevent the disease despite elevated phenylalanine blood levels.
Rarely, a patient with hyperphenylalaninemia will have normal PAH but be unable to properly synthesize or recycle its cofactor, tetrahydrobiopterin (BH4). Since BH4 is necessary for the activity of other enzymes, phenylalanine restriction alone will not fully treat the patient. Fortunately, normal BH4 can be administered in large oral doses that allow normal development and even unrestricted consumption of phenylalanine. In addition, BH4 administration can help lower phenylalanine blood levels in patients whose PAH has a weak BH4 affinity.
Hyperphenylalaninemias lead to mental retardation due to an imbalance of amino acids crossing the blood brain barrier. With too much phenylalanine in the bloodstream, other amino acids are outcompeted and their levels in the brain significantly decrease. This decrease in brain levels of other amino acids leads to disruption of brain development and, to a lesser extent, mature brain function. Patients with PKU are allowed slight flexibility in their diet during adulthood, as elevated phenylalanine blood levels will not have as detrimental an effect after development. However, elevated phenylalanine levels are detrimental for fetal life. Thus, females with hyperphenylalaninemia expecting to conceive must vigorously restrict their phenylalanine consumption to avoid severe developmental retardation in their offspring.
Lysosomes are membrane-bound organelles which contain hydrolases that degrade unneeded proteins and infectious bacteria. When these substrates accumulate without degradation inside the lysosome, the cells die lysosomal storage diseases (a subset of enzymopathies ensue. Affected tissues grow noticeably large, and mental dysfunction occurs in diseased brains. Tay-Sachs disease is a group of disorders where hexosaminidase A (hexA) cannot degrade the sphingolipid GM2 ganglioside. Since GM2 ganglioside synthesis is most prevalent in the brain, it is most severely affected by Tay-Sachs.
Normal hexA consists of α and β subunits (encoded by HEXA and HEXB) that associates with an activator protein to cleave GM2 ganglioside. Disruption of the α subunit leads to Tay-Sachs disease; disruption of the β subunit o the activator protein leads to, respectively, Sandhoff disease and activator protein deficiency. Tay-Sachs homozygotes are clinically normal at first, but mentally deteriorate by 3-6 months of age and die by 2-4 years. HEXA alleles with some residual activity can lead to late onset, motor dysfunction, psychosis or sometimes no disease at all.
Lysosomes are membrane-bound organelles which contain hydrolases that degrade unneeded proteins and infectious bacteria. When these substrates accumulate without degradation inside the lysosome, the cells die lysosomal storage diseases (a subset of enzymopathies ensue. Hurler Syndrome is a mucopolysaccharidose, a kind of disease involving accumulation of mucopolysaccharides (or glcyosaminoglycans) within lysosomes due to deficiencies in degradative enzymes. An autosomal recessive disease, Hurler Syndrome is caused by extreme impairment of the enzyme α-L-iduronidase, leading to facial dysmorphia, developmental retardation and death by ∼10 years. In Scheie Syndrome, also autosomal recessive, α-L-iduronidase has residual activity and patients sometimes have normal life spans and normal intelligence (despite facial dysmorphia).
Hunter Syndrome is a phenotypically similar mycopolysaccharidose, but is X-linked recessive and impairs iduronate sulfatase (not α-L-iduronidase). Hurler Syndrome fibroblasts synthesize normal iduronate sulfatase, and Hunter Sydrome fibroblasts synthesize normal α-L-iduronidase. When grown together in culture, they uptake secreted proteins and thereby complement each other to produce non-diseased cells. Complementation analysis is the study of genetic complementation, where mutant gene products can complement each other to neutralize disease.
Duchenne Muscular Dystrophy (DMD) is untreatable, severe, deteriorative and relatively common X-linked recessive disorder caused by a mutation of the structural protein dystrophin. DMD causes worsening muscle weakness, with wheelchair confinement by ∼12 years and death by ∼20 years. Females very rarely have DMD, since it is lethal in males and thus cannot be passed on to XX progeny; however, females with Turner Syndrome or X chromosome translocations have been known to have DMD. About 1/3 of DMD patients have new mutations, while 2/3 have maternal inheritance of the DMD allele.
At ∼2,300 kilobases (1.5% of the X chromosome), the gene encoding dystrophin is among the largest in the animal kingdom. It contains 79 exons, and its 7 tissue-specific promoters allow differential splicing into tissue-specific and developmentally regulated isoforms. Dystrophin is a massive complex with a myriad of functions that maintain muscle integrity. Most DMD patients have deletions in either the 5′ or central regions, while most other patients have randomly distributed point mutations. Due to its sheer size, the gene for dystrophin has one of the highest frequencies of mutation.
PCR analysis of carriers and fetuses can identify deletions in dystrophin, and molecular sequencing can identify point mutations. Also, linked markers are useful for prenatal diagnosis when direct analysis does not detect a mutation. If a DMD patient is the first affected family member and a mutant dystrophin allele is not found in his mother, then most of the time it is due to a new mutation. However, ∼10% of such cases are in fact due to maternal germline mosaicism, meaning only some of her cells carry the mutant allele; this is significant due to an increased risk of recurrence.
Cystic Fibrosis (CF) is caused by an autosomal recessive mutation of the cystic fibrosis transmembrane regulator gene (CFTR), which was identified via positional cloning. Cystic fibrosis is due to abnormal fluid and electrolyte transport across epithelial apical (pointed) membranes. This leads to disease of the lung, pancrea, intestine, hepatobiliary tree and male genital tract. Noticeable when kissing a baby with cystic fibrosis, loss of CFTR activity leads to chloride and sodium build-up in sweat glands. Also, the mucous layer of the lung becomes ‘sticky’ and traps pathogens, leading to chronic pulmonary infections..
The first sign of cystic fibrosis is usually high concentration of sodium and chloride in sweat. The average patient survives to ∼33 years, eventually succumbing to chronic lung and heart infections, and maldigestion due to a deficiency of pancreatic enzymes. Many males are infertile due to absence of a vas deferens; most females have near-normal fertility. Postnatal lower intestinal tract obstruction also may occur, treatable by surgery. Some individuals with mutations in the CFTR may have just one or all of these phenotypes.
Present on Chromosome 7 (7q31, to be precise), the CFTR gene is ∼190kB and encodes the large 170kD cystic fibrosis transmembrane conductance regulator (CFTR) protein. Likely an ABC transporter, it contains a chloride channel with five domains: two membrane-spanning domains, each of which contain six transmembrane sequences; two nucleotide-binding domains which bind and hydrolyze ATP to power opening and closing the channel; and a regulatory domain, mediated via phosphorylation.
Approximately 2% of white (European) populations carry a cystic fibrosis disease allele, although carriers are almost nonexistent in other populations. The most frequent CFTR defect is δF508, a missense mutation that reduces ATP binding and accounts for 70% of all CF alleles in white populations. Patients homozygous for δF508, or with premature stop codons in CFTR all have insufficient pancreatic activity, but differing pulmonary (lung) disease. Two alleles of a modifier gene encoding transforming growth factor β1 (TGFβ1) are frequent in cystic fibrosis patients with severe pulmonary disease. Also, a cystic fibrosis genocopy (aka phenocopy — an identical phenotype for different genetic reasons) is a mutation in SCNN1, which encodes an epithelial sodium channel. With ∼85% of CFTR disease alleles identified, DNA diagnosis is useful prenatally and to identify carriers amongst relatives of a patient with no familial history of cystic fibrosis. However, the only current treatments for cystic fibrosis are antibiotics, exercise and a nutritious diet.
Fragile X Syndrome is an X-linked mutation in FMR1, which encodes an abundant neuron protein named FMRP. Almost every FMR1 mutation is an expansion of the CGG-repeat sequence in the 5′ untranslated region of the gene. Normal individuals between 6 and 50 CGG repeats, while diseased patients have over 200 repeats. So many repeats does not happen within one generation; it takes several expansions (premutations) over several generations to acquire hundreds of CGG repeats (full mutation). Having over 200 CGG repeats leads to hypermethylation of the repeat region and the adjacent FMR1 promoter, leading to obliteration of FMRP expression. Mental retardation ensues, with a more severe phenotype in males than females. AGG triplets nestled between CGG repeats inhibit AGG expansion; thus, individuals with reduced AGG triplets are predisposed to become carriers of a diseased FMR1 allele.
Almost all males and ∼50% of females with a disease allele will have the Fragile X phenotype. Some patients have a mixture of premutant and full mutant cells (repeat length mosaicism) due to a worsening genotype with every round of mitosis. Patients with over 200 CGG repeats but lacking methylation in some of their cells (repeat methylation mosaicism) or none of their cells range from normal to fully affected. When analyzing a pedigree, increasing prevalence of the disease in successive generations and inheritance from unaffected mothers are both signs of fragile X syndrome.
Huntington Disease (HD) is an autosomal dominant, late-onset, progressively neurodegenerative disorder caused by mutation of the ubiquitously expressed HD gene. Normal HD alleles have 10 to 26 CAG repeats, while disease HD alleles typically contain more than 36 CAG repeats; 97% of disease alleles are inherited, while 3% of disease alleles are new mutations arising from inherited premutations of 20 to 35 repeats. The function of huntingtin — the gene product of HD — is unclear, and it is also unknown how huntingtin encoded by a disease allele leads to the Huntington Disease phenotype.
Age of onset is inversely proportional to the number of CAG repeats in the HD allele. Juvenile onset ensues from over 60 CAG repeats, adult onset ensues from 40-55 CAG repeats and reduced penetrance (possibly no disease phenotype) ensues from 36-41 CAG repeats. Alleles with more than 36 repeats expand during paternal transmission, with somewhat less expansion during maternal transmission. The phenotype includes cognitive loss, behavioral instability and even schizophrenia. As death approaches, motor skills fail such that incontinence and mutism develops; however, behavioral disturbances lessen. The only current treatment for Huntington Disease is pharmacologic treatment of behavioral instability, and lots of support.
A homeodomain is a highly conserved protein domain (first identified in Antp and Ubx proteins) containing 60 amino acids that fold into three conserved DNA-binding helics (indicating a role in transcriptional reglation). Many homeodomain proteins have been shown in vitro to bind to regulatory regions of genes known to regulate by genetic criteria. The DNA sequence encoding a homeodomain is a homeobox.
Specific and highly conserved residues in the homeodomain’s third helix (helix 3) bind the major groove of DNA, meaning all homeodomain proteins bind the same DNA sequence in vitro. Specific homeodomain proteins have different regulatory properties via interaction with different co-factors; the co-factor and the homeodomain protein bind a specific DNA sequence together, but are unable to bind it individually.
Patients with two defective retinoblastoma (Rb) alleles invariably develop retinoblatomas, leading to its discovery as a tumor suppressor. E2F is a key host cell transcription factor in activating cellular genes required for synthesis of dNTPs, DNA polymerases, and other proteins required for cells to begin S-phase. Also, E2F binds and activates transcription from the adenovirus E2 promoter. The E2 region encodes viral proteins required for adenovirus DNA replication. In uninfected cells, retinoblastoma (aka Rb) is bound to E2F and inactivates it; in infected cells, Rb is not bound to E2F — the adenovirus protein E1A binds to retinoblastoma and removes it from E2F. Since E1A deactivates retinoblastoma by binding it and displacing it from E2F, it accomplishes the equivalent of mutations in both retinoblastoma alleles: there is no functional cellular retinoblastoma. SV40 and polyomavirus Large T-antigen also binds Rb; mutations of SV40 and polyomavirus Large-T that did not bind retinoblastoma did not transform cells. p110 is another tumor suppressor that was found to be absolutely identical to retinoblastoma.
| Next Steps | Study the other tumor suppressor p53 |
|---|
The most important message of the Hardy–Weinberg equilibrium is that allele frequencies remain the same from generation to generation unless some agent acts to change them. With that in mind, the Hardy–Weinberg equilibrium allows scientists to determine whether evolutionary agents are operating and their identity (as evidenced by the pattern of deviation from the equilibrium). The equilibrium also shows the distribution of genotypes that would be expected for a population at genetic equilibrium.
No real-life population is ever at Hardy-Weinberg equilibrium, but fortunately aberrations are usually rare enough that it can be assumed the population is at Hardy-Weinberg equilibrium. The five requirements for Hard-Weinberg are:
If the conditions of the Hardy–Weinberg equilibrium are met, then the frequencies of alleles at a locus remain constant from generation to generation, and after one the genotype frequencies will not change after one generation of random mating. If p and q represent the frequencies of the dominant and recessive alleles at a locus, then p2 and q2 are the frequencies of the homozygous genotypes and 2pq (or pq+qp) is the the frequency of the heterozygous genotype. These can be related by the Hardy-Weinberg equation:
Below are examples of different types of DNA-binding proteins. The most common and best studied DNA-binding proteins are the Zinc finger proteins, the Helix-turn-helix proteins, and the Leucine zipper proteins.
| TATA Box Binding Prtn | The TATA box binding protein is a subunit of the eukaryotic transcription factor, TFIID. This protein is somewhat unusual in that its TBP-binding domain binds to the minor groove of DNA. |
|---|---|
| Zinc finger domain | This domain is common in eukaryotic DNA-binding proteins. It was first noticed in the eukaryotic transcription factor, TFIIIA. TFIIIA contains 9 repeated modules, each of which contains two Cysteine and two Histidine residues. These four residues chelate one Zn++ ion. Each finger is bound in the major groove of B-DNA. |
| Helix-turn-helix domain | This motif was first noticed as a feature of the crystal structure of the bacteriophage l Cro protein. The structure of this small regulatory protein contained two a-helices separated by 34 Ã… – the pitch of a DNA double helix. Model building studies showed that these two a-helices would fit into two successive major grooves. As the structures of a number of other bacterial regulatory proteins (the CRP protein and the bacteriophage l cI repressor) were solved, the same structural motif – called a helix-turn-helix – was observed. It consists of two a-helices separated by a short turn (it is not a b turn). One helix binds to recognition elements within the major groove of DNA; the other helps to keep the binding helix properly positioned with respect to the rest of the molecule. This motif, common in bacterial DNA-binding proteins, also occurs in the eukaryotic homeobox proteins. |
| Leucine Zipper domains | This domain is an important feature of many eukaryotic regulatory proteins. Leucine is an hydrophobic amino acid. When it occurs at every seventh position of an a-helix, the aliphatic side-chains are all oriented on the same side of the helix and they can interact with another such helix to form a coiled coil type of structure. The GCN4 transcription activator in yeast is a dimer in which the leucine zipper region helps to position the two basic regions that bind to the DNA recognition sequence. |
| Helix-Loop-Helix binding motif | A variation of the leucine zipper, the basic DNA-binding helices are connected to the dimerization helices by a short loop. |
|
|