Professional Documents
Culture Documents
KRISHNA BAKSI
DEPARTMENT OF ANATOMY AND CELL BIOLOGY
2015
Content:
DNA repair is an important mechanism because there is no single molecule whose integrity is as
vital to the cells as DNA.
Mismatch repair in eukaryotes may be similar to that in E. coli. Homologs of MutS and MutL
have been identified in yeast, mammals, and other eukaryotes. MSH1 to MSH5 are homologous
to MutS; MLH1, PMS1 and PMS2 are homologous to MutL. Mutations of MSH2, PMS1 and
PMS2 are related to colon cancer.
Mismatch Repair
So far, we have been discussing repair of DNA damage caused by mutagenic agents. What
about DNA that simply has a mismatch due to incorporation of the wrong base and failure of the
proofreading system? At ! rst, it would seem tricky to repair such a mistake because of the
apparent dif! culty in determining which strand is the newly synthesized one that has the mistake
and which is the parental one that should be left alone. At least in E. coli this is not a problem
because the parental strand has identi! cation tags that distinguish it from the progeny strand.
These tags are methylated adenines, created by a methylating enzyme that recognizes the
sequence GATC and places a methyl group on the A. Because this 4-base sequence occurs
approximately every 250 bp, one is usually not far from a newly created mismatch. Moreover,
GATC is a palindrome, so the opposite strand also reads GATC in its 59!39 direction. This
means that a newly synthesized strand across from a methylated GATC is also destined to
become methylated, but a little time elapses before that can happen. The mismatch repair
system (Figure 20.35) takes advantage of this delay; it uses the methylation on the parental
strand as a signal to leave that strand alone and correct the nearby mismatch in the
unmethylated progeny strand. This process must occur fairly soon after the mismatch is created,
or both strands will be methylated and no distinction between them will be possible. Eukaryotic
mismatch repair is not as well understood as that in E. coli. The genes encoding the mismatch
recognition and excision enzymes (MutS and MutL) are very well conserved, so the mechanisms
that depend on these enzymes are likely to be similar in eukaryotes and bacteria. However, the
gene encoding the strand recognition protein (MutH) is not found in eukaryotes, so eukaryotes
appear not to use the methylation recognition trick. It is not clear yet how eukaryotic cells
distinguish the progeny strand from the parental strand at a mismatch.
Mismatch repair
To repair mismatched bases, the system has to know which base is the correct one. In E. coli,
this is achieved by a special methylase called the "Dam methylase", which can methylate all
adenines that occur within (5')GATC sequences. Immediately after DNA replication, the
template strand has been methylated, but the newly synthesized strand is not methylated yet.
Thus, the template strand and the new strand can be distinguished.
Figure 7-G-3. Mismatch repair.
The repairing process begins with the protein MutS which binds to mismatched base pairs.
Then, MutL is recruited to the complex and activates MutH which binds to GATC sequences.
Activation of MutH cleaves the unmethylated strand at the GATC site. Subsequently, the
segment from the cleavage site to the mismatch is removed by exonuclease (with assistance
from helicase II and SSB proteins). If the cleavage occurs on the 3' side of the mismatch, this
step is carried out by exonuclease I (which degrades a single strand only in the 3' to 5'
direction). If the cleavage occurs on the 5' side of the mismatch, exonuclease VII or RecJ is
used to degrade the single stranded DNA. The gap is filled by DNA polymerase III and DNA
ligase.
The distance between the GATC site and the mismatch could be as long as 1,000 base pairs.
Therefore, mismatch repair is very expensive and inefficient.
Mismatch repair in eukaryotes may be similar to that in E. coli. Homologs of MutS and MutL
have been identified in yeast, mammals, and other eukaryotes. MSH1 to MSH5 are homologous
to MutS; MLH1, PMS1 and PMS2 are homologous to MutL. Mutations of MSH2, PMS1 and
PMS2 are related to colon cancer.
In eukaryotes, the mechanism to distinguish the template strand from the
new strand is still unclear. Methyl-directed mismatch repair in E. coli
The newly replicated daughter strand (red) contains a T mismatched to G in the
template strand (blue). The mismatch repair system identifies the daughter strand
because it is not yet methylated. Thus, this system must function before the newly
replicated daughter strand becomes methylated, through action of the Dam methylase
on the A residue in the GATC sequence. Note: the figure in the textbook has several
errors. First, the cleavage by mutH is shown incorrectly: the cleavage is just 5' to the
GATC sequence in the unmethylated strand. The cleavage has been corrected in this
figure. Second, the polarity of the strands in the textbook figure are not correct. The
polarity of the strands is correct in this figure.
HNPCC and mismatch repair
Hereditary non-polyposis colon cancer (HNPCC) is a form of colon cancer characterized by
early age of onset and autosomal dominant inheritance with high penetrance. It is frequently
associated with defects in the genes encoding MSH2 (about 35% of cases in which responsible
genes have been identified) and MLH1 (about 60% of cases). Note that these two genes are
essential for formation of all of the MutS and MutL homologue heterodimers that function in the
nucleus during normal cell division cycles. HNPCC is occasionally associated with defects in
other mismatch repair genes (defects in MSH6, PMS2 and PMS1 have been detected). The
rarity of defects in these MMR genes in HNPCC is probably a consequence of their functional
redundancy, which makes them individually non-essential for MMR.
HNPCC is nearly always associated with defects in DNA repair evidenced by "microsatellite
instability"--frequent variations in the number of repeat units of short tandemly repeated
sequences. In many cases of colon cancer associated with microsatellite instability (MIN) in
which no mismatch-repair gene mutation is evident, extensive methylation of the MLH1 promoter
is evident. This methylation silences the MLH1 gene, so no MLH1 protein is produced. Thus it is
likely that all cases of MIN-associated colon cancer will eventually be found to be due to defects
in the mismatch repair pathway.
SUMMARY The E. coli mismatch repair system recognizes the parental strand by its methylated
adenines in GATC sequences. Then it corrects the mismatch in the complementary (progeny)
strand. Eukaryotes use part of this repair system, but they rely on a different, uncharacterized
method for distinguishing the strands at a mismatch.
SUMMARY The failure of human mismatch repair leads to microsatellite instability, and
ultimately to cancer.
SUMMARY The Pol III core (a" or a"u) does not function processively by itself, so it can replicate
only a short stretch of DNA before falling off the template. By contrast, the core plus the b-
subunit can replicate DNA processively at a rate approaching 1000 nt/sec. The b-subunit forms a
dimer that is ring-shaped. This ring ! ts around a DNA template and interacts with the a-subunit
of the core to tetherthe whole polymerase and template together. This is why the holoenzyme
stays on its template so long and is therefore so processive. The eukaryotic processivity factor
PCNA forms a trimer with a similar
ring shape that can encircle DNA and hold DNA polymerase on the template.
SUMMARY
1. BASE EXCISION REPAIR (BER) typically acts on subtle base damage. This process
begins with a DNA glycosylase, which extrudes a base in a damaged base pair, then clips
out the damaged base, leaving an apurinic or apyrimidinic site that attracts the DNA
repair enzymes that remove the remaining deoxyribose phosphate and replace it with a
normal nucleotide. In bacteria, DNA polymerase is the enzyme that is involved in the
missing nucleotide in BER; in eukaryotes, DNA polymerase plays this role.
It is multistep enzymatic process. The enzymes involved are glycosylase, AP
endonuclease and phosphodiesterase , DNA polymerase and DNA ligase.
2. NUCLEOTIDE EXCISION REPAIR (NER) typically handles bulky damage that distorts
the DNA double helix. NER in E. coli begins when the damaged DNA is clipped by an
endonucleotides on either side of the lesion, at sites 12-13 nt apart. . This allows the
damaged DNA to be removed as part of the resulting 12-13 base oligonucleotide. The
Polymerase I fills the gap and DNA ligase seals the final nick. Eukaryotic NER follows
two pathways. In GG-NER, a complex composed of XPC and hHR23B initiates repair by
binding to a lesion anywhere in the genome and causing limited amount of DNA melting.
This protein apparently recruits XPA and RPA. TFIIH then joins the complex, and two of
its subunits (XPB and XPD) use their DNA helicase activities to expand the meltied
region. RPA binds two excinuleases (XPF and XPG) and postions them for cleavage of
the DNA strand on either side of the lesion. This releases the damaged on a fragment
tetween 24 and 321 nt long. TC-NER is very similar to GG-NER, expcept that RNA
polymerase plays the role of XPC in damage sensing and initial DNA melting. In either
kind of NER, DNA polymerase or fills in the gap left by the removal of the damaged
fragment, and DNA ligase seels the DNA.
It is a multistep enzymatic process. The enzymes involved are nuclease, DNA
helicase, and DNA polymerase and DNA ligase.
3. There are several human diseases due to the defect in the DNA repair mechanism.
MUTATION
MUTATION : It refers to any structural alteration of DNA that is present in a mutant. A mutation
is always a change in the base sequence of DNA. The most common change is a substitution,
addition rearrangement or a deletion of one or more bases.
By a base analogue one means a substance other than a standard nucleic acid base which can
be built into DNA molecule by the normal process of polymerization. Such a substance must be
able to pair with the base on the complementary strand being copied or the 3' - 5' editing function
will remove it. However, if it can tautomeraze or if it has two modes of hydrogen-bonding, it will
be mutagenic.
The substituted base 5-bromouracil (BU) is an analogue of thymine as the bromine has about
the same van der Waals radius as the methyl group of thymine. In subsequent rounds of
replication BU functions like thymine and primarily pairs with adenine. Thymine can sometimes
(bur rarely) assume an enol from that capable of pairing with guanine, and this is capable of
pairing with guanine and this conversion occasionally gives rise to mutants in the course of
replication. The mutagenic activity of 5-BU stems from a shift in the keto-enol equilibrium caused
by the bromine atom; that is, the enol form exist from a grater fraction of time for BU than for
thymine. Thus, if BU replaces a thymine, in which turn specifies cytosine, resulting in formation a
G-C to A-T. The enol is actually sufficiently prevalent so that BU is sometimes (but infrequently)
incorporated into DNA in that form. When that occurs, BU is acting as an analogue of cytosine
rather than thymine. However even though it may become part of the DNA by temporarily having
the base pairing properties of cytosine, the keto is the predominant form, so that in subsequent
rounds of replication BU will usually pair like thymine. Thus, a G-C pair, which as it shows in the
figure . Thus BU can stimulate a transition from A-T to G-C as well as from G-C to A-T.
Note that it takes two rounds of replication in order to generate a new base pair. Similarly, a
mutant progeny cell would not appear until two generations elapsed; one says that it takes two
generations for the mutation to be "expressed".
CHEMICAL MUTAGENS
A chemical mutagen is a substance that can alter a base that is already incorporate in DNA and
thereby change its hydrogen bonding specificity. Four very powerful chemical mutagen are
nitrous acid ( NHO2 ), hydroxiamine ( HA ), ethylmethane sulfonate ( EMS ), and N-methyl-N-
nitro-N-nitrosoguanidine ( NNG ).
Nitrous acid primarily converts amino groups to keto groups by oxidative deamination . Thus
cytosine, adenine, and guanine are converted to uracil ( U ), hypoxanthine ( H ), and xanthine
( X ), respectively. These bases can form the base pairs U-A, H-C and X-C. Therefore, the
changes are C-C A-T and A-T G-C as cytosine and adenine respectively are deaminated.
Since B and X both pair with C, the conversion G to X does not cause mutations to occur in this
way. These changes are summarized in Fig. 10-9. Note that these changes are all transitions
also.
MUTAGENESIS BY INTERCALATING SUBSTANCES
Acridine orange, profavine, and acriflavine (Fig. 10-11) which are substituted acridines, are
planar, three ringed molecules whose dimensions are roughly the same as those of a purine-
pyrimidine pair. In aqueous solution, these substances form stacked arrays and are also able to
stack with a base pair; this is done by insertion between two base pairs, a process called
intercalation. Since, the thickness of the acridine molecule is approximately that of a base pair
and because the two base pairs are normally in contact, the intercalation of one acridine
molecule causes adjacent base pairs to move apart by a distance equal to that of the thickness
of one base pair (Fig. 10-12). This has bizarre effects on the outcome of DNA replication, though
the mechanism of action of the mutagen is not known. When DNA containing intercalated
acridines is replicated, additional bases appear in the sequence (Fig. 10-13). The usual addition
is a single base, though occasionally two bases are added. Mutations of this sort are called
frame shift mutations. This is because the base sequence is read in groups of three bases when
it is being translated into an amino acid sequence and the addition of a base changes the
reading frame (Fig. 10-13).
Consequences of altering the nucleotide sequence: Changing a single nucleotide base on
the mRNA chain (a point mutation) can lead to any one of the three results (Fig. 32.2).
2. Manifested only in females and is lethal in utero in males. Examples include incontinenta
pigmenti, focal dermal hypoplasia, and orofaciodigital syndrome.
Trinucleotide Repeat Disorders General Facts:
Mutation that is an expansion of a repetitive sequence of 3 nucleotides.
This sequence is prone to instability - ie a dynamic mutation.
The length of the repetitive sequence continues to increase as the cells divide throughout
life - 'somatic instability'.
Produces predominantly neurological disorders.
Associated with genetic anticipation - condition severity increasing with each successive
generation.
Polyglutamine (PolyQ) Diseases:
DRPLA (Dentatorubropallidoluysian atrophy)
HD (Huntington's disease)
SBMA (Spinobulbar muscular atrophy or Kennedy disease)
SCA1 (Spinocerebellar ataxia Type 1)
SCA2 (Spinocerebellar ataxia Type 2)
SCA3 (Spinocerebellar ataxia Type 3 or Machado-Joseph disease)
SCA6 (Spinocerebellar ataxia Type 6)
SCA7 (Spinocerebellar ataxia Type 7)
SCA17 (Spinocerebellar ataxia Type 17)
Non-Polyglutamine Diseases:
DM (Myotonic dystrophy)
FRAXA (Fragile X syndrome)
FRAXE (Fragile XE mental retardation)
FRDA (Friedreich's ataxia)
FXTAS (Fragile X-associated tremor/ataxia syndrome)
SCA8 (Spinocerebellar ataxia Type 8)
SCA12 (Spinocerebellar ataxia Type 12)
Fragile X syndrome:
Read more about fragile X-associated tremor/ataxia syndrome and fragile X-associated primary
ovarian insufficiency.
1. 5 Bromouracil:
BU can Tautomerize: Therefore, BU in its Keto form is an analogue of T and base pair with
A. But in its Enol form, it is an analogue of C and a base pair with G. Hence, when it is
incorporated into DNA in its Keto form, it induces a change from A-T to G-C. Whereas,
when it is incorporated into DNA in its Enol form, it induces a change from G-C to A-T.
However, it takes two rounds of replication in order to generate a new base pair. Hence, it
takes two generations for the mutation to be Expressed.
2. Chemical Mutagen: A substance that can alter a base that is already incorporated in DNA
and thereby change its hydrogen-binding specificity.
i. Nitrous acid primarily converts amino groups to keto groups by oxidative deamination
. Thus cytosine, adenine, and guanine are converted to uracil ( U ), hypoxanthine ( H ), and
xanthine ( X ), respectively.
C A G
These bases can form the base pairs U-A, H-C and X-C.
Therefore, the changes are G-C A-T and A-T G-C as Cytosine and Adenine
respectively are deaminated.
Note: Since, G and X both pair with C, the conversion of G X does not cause
mutations to occur in this way.
The dimensions of these molecules are roughly the same as those of a Purine-
Pyrimidine pair. They cause the adjacent base pairs to move apart by a distance equal to
that of the base to that of the thickness of one base pair. Therefore, when DNA
containing intercalated Acridines is replicated, additional bases appear in the sequence,
hence changing the reading frame and called Fame-shift Mutations.
GENETIC RECOMBINATION
(i) General Recombination: This form of recombination takes place between identical or
nearly identical chromosomes (homologous), such as homologously paired eukaryotic
chromosome during meiosis.
(ii) Site-directed Recombination: DNA homology is not required but the exchange occurs
at short, specific nucleotide sequences by a variety of site-specific recombination enzymes.
Thus, it alters the relative positions of nucleotide sequences in genes.
The abundant general recombination observed in meiosis has the following characteristics:
(i) Two homologous molecules DNA molecules that were originally part of different
chromosomes cross over that is, their double helices break and the two broken ends join to
their opposite partners to re-form two intact double helices, each composed of parts of the two
initial DNA molecules (Fig. 5-54).
(ii). The site of exchange (that is, where a red double helix is joined to a green double helix in
(Fig. 5-54) can occur anywhere in the homologous nucleotide sequences of the two participating
DNA molecules.
(iii). At the site of exchange, a strand of one DNA molecule become base-paired to a strand of
the second DNA molecule to create a staggered joint (heteroduplex joint) between the two
double helices (Fig. 5-55). This heteroduplex region can be thousands of base pairs long.
DNA renaturation or hybridization occurs when a rare random collision just a positions of
complementary nucleotide sequences on two matching DNA single strands, allowing the
formation of a short stretch of double helix between them. This relatively slow helix nucleation
step is followed by a very rapid zippering step (Fig. 5-57).
Fig. 5-57 DNA hybridization. DNA double helices re-form from their separated strands in a
reaction that depends on the random collision of two complementary DNA strands. The vast
majority of such collisions are not productive, as shown left, but a few result in a short region
where complementary base pairs have formed (helix nucleation). A rapid zippering then leads to
the formation of a complete double helix. Through this trial-and-error process, a DNA strand will
find its complementary partner even in the midst of millions of no matching DNA strands. A
related, highly efficient trial-and-error recognition of a complementary partner DNA sequence
seems to initiate all general recombination events.
THE RecA PROTEIN AND ITS HOMOLOGS ENABLE A DNA SINGLE STRAND TO PAIR
WITH A HOMOLOGOUS REGION OF DNA DOUBLE HELIX
General recombination is more complex than the simple hybridization reactions, and it requires
several types of specialized proteins. In particular in E. coli RecA protein has a central role in
the recombination between chromosomes; it and its homologs in yeast, mice, and humans
make synapsis possible (Fig. 5-58).
Like a single-strand DNA-binding protein, the RecA type of protein binds tightly and in
cooperative clusters to a single-stranded DNA to form a nucleoprotein filament. Because each
RecA monomer has more that one DNA-binding site, a RecA filament can bold a single strand
and a double helix together. This allows it to catalyze multistep DNA synapsis reaction between
a DNA double helix and a homologous region of single-stranded DNA (Fig. 5-59). These sites
allow the RecA /type protein to catalyze a multiple step reaction, called synapsis between a
DNA double helix and a homologous region of a single-stranded DNA.
Fig. 5-59 DNA synapsis catalyzed by the RecA protein. In vitro experiments show that several types of complex
are formed between a DNA single strand covered with RecA protein (red), and a DNA double helix (green). First a
non-base-paired complex is formed, which is covered though transient base-flipping to a three-stranded structure as
soon as a region of homologous sequence is found. This complex is unstable because it involves an unusual form
of DNA, and it spins out a DNA heteroduplex (one strand green and other red) plus a displaced single strand from
the original helix (green). Thus the structure shown in this diagram migrates to the left, reeling in the input DNAs
while producing in the output DNAs. The net result is a DNA strand exchange identical to that diagrammed earlier
in Fig. 5-56.
Once DNA synapsis has occurred, the short heteroduplex region where the stands from two
different DNA molecules have begun to pair is enlarged through a process called branch
migration.
Meiotic Recombination is Initiated by Double-strand DNA breaks:
Extensive base-pair interactions cannot occur between
two intact DNA double helices. Thus, the DNA synapsis
that is critical for general recombination in meiosis can
begin only after a DNA strand from one DNA helix has
been exposed and its nucleotides have been made
available for pairing with another DNA helix. Hypothetical
model was proposed in 1960, and the mechanism was
finally proven in 1990 with the yeast chromosomes at
various stages of meiosis. These studies revealed that
general recombination is initiated by a special
endonuclease that simultaneously cuts both strands of
the double helix, creating a complete break in the DNA
molecule. The 5 ends at the break are then chewed
back by an exonuclease, creating protruding single-
stranded 3 ends. It is these single strands that search for
a homologous DNA helix with which to pair, leading to the
formation of joint molecule between a maternal and a
paternal chromosome (Fig. 5-56).
SITE-SPECIFIC RECOMBINATION
ENZYMES MOVE SPECIAL DNA SEQUENCES INTO AND OUT OF GENOMES:
Fig. 5-69 Three of the many types of mobile genetic elements found in bacteria. Each of these elements
contains a gene that encodes a transposase, an enzyme that conducts at least some of the DNA breakages and
joining reactions needed for the elements to move. Each mobile element also carries short DNA sequences
(indicated in red) that are recognized only by the transposase encoded by that element and are necessary for
movement of the element. In addition, two of the three mobile elements shown carry genes that encode enzymes
that inactivate the antibiotics ampicillin (ampR) and tetracycline (terR). The transposable element Tn10, shown in
the bottom diagram, is thought to have evolved from the chance landing of two short mobile elements on either side
of a tetracycline-resistant gene; the wide use of tetracycline as an antibiotic has aided the populations. The three
mobile elements shown are examples of DNA-only transposons.
Transpositional Site-specific Recombination Can Insert Mobile Genetic Elements into Any
DNA Sequence:
Transposons, also called transposable elements, are mobile genetic elements that
generally have only modest target site selectivity and can thus insert themselves into many
different DNA sites. In transposition, a specific enzyme, usually encoded by the transposon, and
called a transposase, acts on a specific DNA sequence at each end of the transposon first
disconnecting it from the flanking DNA and then inserting it into a new target DNA site. There is
no requirement for homology between the ends of the element and the insertion site. On the
basis of their structure and transposition mechanism, transposons can be grouped into three
large classes (Table 5-3).
Fig. 5-70 Cut-and-Paste
transposition. DNA-
only transposons can be
recognized in
chromosomes by the
inverted repeat DNA
sequences (red) at their
ends. Experiments show
that these sequences,
which can be a short as
20 nucleotides, are all
that is necessary for the DNA between them to be traipsed by the particular transposase enzyme
associated with the element. The cut-and-past movement of a DNA-only transposable element
from one chromosomal site to another begins when the transposase brings the two inverted DNA
sequences together, forming a DNA loop. Insertion into the target chromosome, catalyzed by
the transposase, occurs at a random site through the creation of staggered breaks in the target
chromosome (red arrowheads). As a result, the insertion site is marked by a short direct repeat
of the target DNA sequence, as shown. Although the break in the donor chromosome (green) is
resealed, the breakage-and-repair process often alters the DNA sequence, causing a mutation
at the original site of the excised transposable element (not shown).
Transposition also has a key role in the life cycle of many other viruses. Most notable are the
retroviruses, which include the AIDS virus, called HIV that infects human cells.
SUMMARY
RecA protein
Synapsis
RecBCD protein
2. Site-Specific recombination: DNA homology is not required but the exchange occurs
at short, specific nucleotide sequences by a variety of site-specific recombination enzymes.