Expanded trinucleotide repeat sequences are the cause of several inherited neurodegenerative diseases. Disease pathogenesis is correlated with several features of somatic instability of these sequences, including further large expansions in postmitotic tissues. The presence of somatic expansions in postmitotic tissues is consistent with DNA repair being a major determinant of somatic instability. Indeed, proteins in the mismatch repair (MMR) pathway are required for instability of the expanded (CAG·CTG)n sequence, likely via recognition of intrastrand hairpins by MutSβ. It is not clear if or how MMR would affect instability of disease-causing expanded trinucleotide repeat sequences that adopt secondary structures other than hairpins, such as the triplex/R-loop forming (GAA·TTC)n sequence that causes Friedreich ataxia. We analyzed somatic instability in transgenic mice that carry an expanded (GAA·TTC)n sequence in the context of the human FXN locus and lack the individual MMR proteins Msh2, Msh6 or Pms2. The absence of Msh2 or Msh6 resulted in a dramatic reduction in somatic mutations, indicating that mammalian MMR promotes instability of the (GAA·TTC)n sequence via MutSα. The absence of Pms2 resulted in increased accumulation of large expansions in the nervous system (cerebellum, cerebrum, and dorsal root ganglia) but not in non-neuronal tissues (heart and kidney), without affecting the prevalence of contractions. Pms2 suppressed large expansions specifically in tissues showing MutSα-dependent somatic instability, suggesting that they may act on the same lesion or structure associated with the expanded (GAA·TTC)n sequence. We conclude that Pms2 specifically suppresses large expansions of a pathogenic trinucleotide repeat sequence in neuronal tissues, possibly acting independently of the canonical MMR pathway.
Citation: Bourn RL, De Biase I, Pinto RM, Sandi C, Al-Mahdawi S, Pook MA, et al. (2012) Pms2 Suppresses Large Expansions of the (GAA·TTC)n Sequence in Neuronal Tissues. PLoS ONE7(10): e47085. https://doi.org/10.1371/journal.pone.0047085
Editor: David T. Kirkpatrick, University of Minnesota, United States of America
Received: March 31, 2012; Accepted: September 12, 2012; Published: October 11, 2012
Copyright: © Bourn et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: IDB was supported by a postdoctoral fellowship from the National Ataxia Foundation. RMP was supported by Ataxia UK. SA was supported by The Wellcome Trust. This research was made possible by grants from the National Institutes of Health (NIH/NINDS) and the Muscular Dystrophy Association to S.I.B. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Expanded trinucleotide repeat sequences are known to cause several inherited human neurodegenerative diseases . These sequences are subject to changes in repeat length during parental transmission, a phenomenon termed intergenerational instability. In addition, expanded trinucleotide repeats undergo further large expansions in specific somatic tissues. Although both forms of instability appear to influence disease onset and progression, it is likely that intergenerational and somatic instability occur by distinct mechanisms. This report focuses specifically on uncovering the processes that influence somatic instability.
Several observations indicate that somatic instability resulting in progressive expansion within specific tissues contributes to disease pathogenesis and progression. First, the tissue-specificity of somatic instability tends to correlate with disease pathology. For example, in Huntington disease and myotonic dystrophy the expanded (CAG·CTG)n sequence undergoes further large expansions in the striatum and muscle, respectively , . Similarly, in Friedreich ataxia (FRDA), the pathogenic expanded (GAA·TTC)n sequence in intron 1 of the FXN gene  undergoes progressive expansion in the cerebellum and dorsal root ganglia of FRDA patients  and in a transgenic mouse model . Second, somatic instability is known to precede the development of the disease phenotype in a mouse model of Huntington disease , , . Finally, the absence of somatic instability can ameliorate the disease phenotype. Genetic suppression of somatic instability in a mouse model for Huntington disease significantly delayed intracellular accumulation of mutant huntingtin protein, the onset of disease, and death . Likewise, somatic instability seems to be important in the pathology of FRDA. In transgenic mice, an expanded (GAA·TTC)n sequence within a human FXN transgene recapitulated the age-dependent and tissue-specific somatic instability observed in FRDA patients , and rescuing the embryonic-lethal Fxn-deficient mice with this transgene resulted in an FRDA-like phenotype , . However, when a similar-sized (GAA·TTC)n repeat was knocked-in into the mouse Fxn gene, the sequence did not show somatic instability, and it was not possible to replicate the FRDA phenotype . Furthermore, an expanded (GAA·TTC)n sequence in an intron of the At4g13430 gene was found to cause a growth defect in Arabidopsis thaliana, and this repeat also displayed somatic instability . Therefore, understanding the molecular mechanisms of somatic instability of disease-causing expanded trinucleotide repeats in somatic tissues is of great interest.
The post-mitotic nature of tissues affected in trinucleotide expansion disorders indicates that DNA repair may be an important determinant of somatic instability. The highly conserved mismatch repair (MMR) system prevents point mutations, insertions and deletions. In Escherichia coli, MMR is initiated when a MutS homodimer recognizes and binds a mismatch or insertion-deletion loop (IDL) . Mismatch binding stimulates ATP hydrolysis by MutS, which is required for progression of MMR. A MutL homodimer is recruited, and the MutS-MutL complex activates the latent endonuclease activity of MutH. The bases from the nick to the mispaired base(s) are excised and re-synthesized to repair the lesion. Mammalian MMR is initiated by a heterodimer of the MutS homologs Msh2, Msh3 and Msh6 . MutSα (Msh2-Msh6) preferentially recognizes base-base mispairs and IDLs of one to two bases, while MutSβ (Msh2-Msh3) preferentially recognizes IDLs of two to ten bases. The mammalian MutL homologs Mlh1, Mlh3 and Pms2 also function as heterodimers. The primary MutL heterodimer, MutLα (Mlh1-Pms2), can be recruited by either MutSα or MutSβ. MutLγ (Mlh1-Mlh3) may be recruited by MutSα, but the role of MutLγ in MMR is unclear. There is no known mammalian MutH homolog, although there is evidence to suggest that Pms2 can function as an endonuclease , .
MMR deficiency causes microsatellite instability, a form of genomic instability characterized by widespread somatic length variability in short tandem repeats, typically involving few repeat units. Consistent with this role, MMR is known to prevent small length changes of expanded trinucleotide repeats . However, proteins in the MMR pathway have been shown to promote large length changes of expanded (CAG·CTG)n sequences. In mice, the absence of Msh2 or Msh3 was shown to eliminate somatic expansions of (CAG·CTG)n , –. The absence of Msh6 did not reduce the frequency of somatic expansions in neuronal tissues, although it may affect instability in other tissues , . These results indicate that MutSβ is required to produce somatic expansions of the expanded (CAG·CTG)n sequence, and multiple lines of evidence suggest that somatic expansions occur through the MMR pathway itself. First, MutSβ promotes repair on CAG/CTG hairpin templates in vitro . In addition, in mice lacking Pms2, the rate of somatic expansions of the expanded (CAG·CTG)n sequence was reduced by ∼50% . Finally, expansions were eliminated in mice homozygous for an ATPase-deficient mutation of Msh2 . These data indicate involvement of the downstream steps of MMR and support the role of an intact, but error-prone, MMR pathway in producing somatic expansions of the (CAG·CTG)n sequence. However, it has also been proposed that MutSβ could produce expansions by acting independently of the other MMR proteins to create CAG/CTG hairpins or prevent their repair –. Indeed, many MMR proteins are known to have functions outside of the canonical MMR pathway; the mammalian MutS and MutL homologs are known to play important roles in processes such as class switch recombination, homologous recombination and DNA damage signaling –. Thus, it is possible that MMR proteins could act through various DNA modifying pathways to influence somatic instability.
The accuracy and efficiency of DNA repair, and which repair pathway(s) are activated, are in turn affected by the DNA sequence and structure –. Therefore, the effect of MMR proteins on trinucleotide repeat instability could vary depending on the sequence of the repeat motif and its secondary structure(s). While the (CAG·CTG)n sequence forms intrastrand hairpins , , the (GAA·TTC)n sequence tends to form triplex-based structures –. In yeast, MutSβ and MutLα promote fragility of expanded (GAA·TTC)n sequences, leading to large deletions within the repeat tract . However, the role of mammalian MMR proteins in tissue-specific expansions of the (GAA·TTC)n sequence remains unclear.
To determine if and how MMR proteins affect instability of the (GAA·TTC)n sequence in the context of mammalian tissues, we crossed the previously described YG8 transgenic mouse model ,  with mice lacking the individual MMR proteins Msh2, Msh6, or Pms2 – and analyzed somatic instability by small-pool PCR . Absence of Msh2 or Msh6 significantly stabilized the repeat tract. Remarkably, Pms2 deficiency increased the frequency of large expansions in the cerebellum, cerebrum, and dorsal root ganglia, but not in heart or kidney. The absence of Pms2 had no significant effect on contractions. Our results show that while mammalian MMR promotes somatic instability of the (GAA·TTC)n sequence, Pms2 plays a unique role in suppressing large somatic expansions in neuronal tissues.
Materials and Methods
Transgenic mice, tissues and DNA samples
YG8 FXN GAA repeat expansion-containing transgenic mice were crossed with Msh2, Msh6 or Pms2 heterozygous knockout mice to establish double genetically modified mice as previously described . Genomic DNA was isolated from multiple tissues of MMR-deficient and MMR-proficient YG8 mice using the DNeasy Blood & Tissue Kit (Qiagen). We previously showed that the YG8 mice developed substantial somatic instability by 10 to 12 months of age, and that instability was most marked in the cerebellum . In order to test the effect of MMR we therefore collected tissues from mice at 10 to 12 months of age. Cerebellum was analyzed for all genotypes, and other selected tissues were analyzed as follows: 10-month old YG8-Msh2+/+ and YG8-Msh2−/− (cerebellum, cerebrum, DRG, heart); 12-month old YG8-Pms2+/+ and YG8-Pms2−/− (cerebellum, cerebrum, DRG, heart, kidney); 5-month old YG8-Pms2+/+ and YG8-Pms2−/− (cerebellum); 11- or 12-month old YG8-Msh6+/+ and YG8-Msh6−/− (cerebellum). The use of animals for this research project was reviewed and approved by the Institutional Animal Care and Use Committee at the University of Oklahoma Health Sciences Center (protocol 08-129), and the Animal Ethics Committee at Brunel University.
Small-pool PCR analysis
Small-pool PCR was performed as previously described , . Briefly, serial dilutions of DNA were made to obtain “small pools” of DNA containing 1–19 amplifiable molecules per reaction for mutation load analysis, or 12–84 amplifiable molecules per reaction for analysis of large expansions. The number of molecules per reaction was calculated according to the Poisson distribution, as previously described , . The repeat tract was amplified using primers 147–For and 602-Rev . Products were detected by Southern blot analysis with an end-labeled (TTC)11 probe. The sizes of the progenitor alleles were determined by conventional PCR, and mutations were defined as length changes of >5% from the progenitor. Mutation load was calculated as 100× (total number of mutations)/(total molecules analyzed). As we have previously described, the YG8 mice carry multiple copies of the transgene, seen as multiple progenitor/constitutional allele lengths  Therefore, expansions and contractions were conservatively defined as products that measured >5% above the longest or below the shortest constitutional allele, respectively. Mutations that measured between the top and bottom progenitors were classified as intermediate changes, since they could have resulted from expansion of a shorter progenitor or contraction of a larger progenitor. The size of an expansion was calculated as the percent increase in length above the longest progenitor allele. Importantly, all comparisons examined instability in the same tissue from wild-type vs. knockout littermates of similar age, or from young vs. old mice from the same line and genotype. In particular, the different number of progenitor alleles and different progenitor allele lengths of the YG8-Pms2+/+ and YG8-Pms2−/− lines vs. other lines would confound comparisons between these and other lines. Since only the bands that were longer than the longest progenitor allele were considered expansions, the presence of more progenitor alleles would reduce the apparent expansion load. Therefore, comparisons could not be made between different lines (e.g., YG8-Msh2+/+ vs. YG8-Pms2+/+).
Deficiency of MutSα reduces somatic instability and prevalence of expansions of the expanded (GAA·TTC)n sequence in neuronal tissues
We previously described tissue-specific, age-dependent somatic instability in the YG8 transgenic mouse model carrying (GAA·TTC)190 and (GAA·TTC)82 sequences in the context of the human FXN locus . Mice deficient in individual MMR proteins were crossed onto the background of the YG8 transgenic mouse to generate models for studying the effects of MMR on instability of the expanded (GAA·TTC)n sequence. Small-pool PCR was used to analyze instability in neuronal and non-neuronal tissues in order to detect whether instability increased or decreased in the absence of individual MMR proteins. Cerebellum, cerebrum, and DRG were selected as representative neuronal tissues, and heart and kidney were selected as non-neuronal tissues. Products with repeat lengths that varied in length by more than 5% from the progenitors were considered mutations (see Materials and Methods). Multiple progenitor alleles were generated during the breeding process (e.g., note the three progenitor alleles for Msh2 and Msh6 mice, indicated by arrowheads next to each gel in Figure 1). Any mutation falling between progenitor alleles could have resulted from expansion of a shorter progenitor or contraction of a longer progenitor; such mutations could not be definitively described as either expansions or contractions. Therefore, expansions were conservatively defined as mutations that were larger than the longest progenitor allele, and contractions were similarly defined as mutations that were smaller than the shortest progenitor allele. Mutations that fell between the longest and shortest progenitors were not classified as expansions or contractions, but they were included in the calculation of the total mutation load.
Panels A, B, E, and F show representative autoradiographs from SP-PCR analysis of DNA extracted from (A) Msh2+/+, (B) Msh2−/−, (E) Msh6+/+, and (F) Msh6−/− cerebellum. Progenitor allele lengths determined by conventional PCR are 192, 130, and 89 repeats for Msh2+/+; 188, 128, and 88 repeats for Msh2−/−; 186, 133, and 98 repeats for Msh6+/+; and 187, 129, and 90 repeats for Msh6−/−, as indicated by arrowheads at the right of each panel. Results from the total analysis of 150 to 282 molecules per genotype are quantified in C and D for Msh2, and G and H for Msh6.
Consistent with our previously published results , the prevalence of contractions in all YG8 transgenic mice was much lower than the prevalence of expansions, with many tissues showing no contractions (Tables 1 and 2). The absence of Msh2, Msh6, or Pms2 did not increase the contraction load, indicating that these MMR proteins do not suppress somatic contractions of the (GAA·TTC)n sequence. We could not conclusively determine whether the absence of Msh2 might result in a further decrease in somatic contractions of the (GAA·TTC)n sequence, although any such decrease is unlikely to be biologically relevant given the very low levels of somatic contraction in all tissues analyzed. Based on these data, we conclude that MMR does not affect somatic contractions of the (GAA·TTC)n sequence. Finally, because of the generally low prevalence of contractions, it is quite possible that the unclassified mutations in our analyses are likely the result of expansions from smaller progenitor alleles rather than contractions from larger progenitor alleles.
The results of small-pool PCR analysis of the tissues from 10-month old YG8-Msh2+/+ and YG8-Msh2−/− littermates are shown in Figure 1 (A–D) and summarized in Table 1. In contrast, Msh2 deficiency resulted in a significant and substantial decrease in overall mutation load as well as expansion load in the cerebellum and a significant decrease in overall mutation load in cerebrum. In the heart and DRG, there were virtually no expansions in either genotype, and the absence of Msh2 did not further reduce the already low mutation load (P>0.05 in YG8-Msh2+/+ vs. YG8-Msh2−/−). Because Msh2 is required for initiation of MMR, these results suggest that an active MMR system may be required for somatic expansion of the (GAA·TTC)n sequence.
To further investigate the role of the MMR system in promoting instability of the expanded (GAA·TTC)n sequence, we analyzed somatic instability in cerebellum of 11- to 12-month old YG8-Msh6+/+ and YG8-Msh6−/− mice. The mutation load was also significantly lower in the YG8-Msh6−/− versus YG8-Msh6+/+ littermates (Figure 1E–H and Table 1). Together these results indicate that somatic expansion of the expanded (GAA·TTC)n sequence, especially in cerebellum and cerebrum, is dependent on MutSα.
Pms2 deficiency increases the frequency and magnitude of (GAA·TTC)n expansions in neuronal tissues
Small-pool PCR analysis of somatic instability in tissues from the 12-month old YG8-Pms2+/+ and YG8-Pms2−/− mice showed that the absence of Pms2 did not significantly change the mutation load in cerebellum, DRG, heart, or kidney, and only slightly increased the mutation load in the cerebrum. However, Pms2 deficiency significantly increased the prevalence of expansions in cerebrum (Figure 2, Table 2), cerebellum (Figure 3, Table 2) and DRG (Table 2), but not in heart or kidney (Figure 4, Table 2). It is interesting that the expansion frequency in the YG8-Pms2+/+ tissues is much lower than that in the YG8-Msh2+/+ and YG8-Msh6+/+ tissues. This observation raises the possibility that the increased expansion load observed in YG8-Pms2−/− tissues could be an artifact of the low level of expansions in the wild-type. However, we observed significant increases in expansion load in both young and old YG8-Pms2−/− mice, and these data are consistent with recently published results showing that the loss of Pms2 specifically promotes expansions of repeat sequences, including during intergenerational transmission of the (GAA·TTC)n sequence  (see Discussion). Therefore, we conclude that Pms2 may specifically prevent expansions of the (GAA·TTC)n sequence in neuronal tissues.
Analysis of instability in cerebrum of Pms2+/+ and Pms2−/− mice. Representative autoradiographs from (A) Pms2+/+ and (B) Pms2−/− cerebrum are shown. Progenitor allele lengths of 229, 194, 135, and 94 repeats for Pms2+/+ and 229, 185, 130, and 86 repeats for Pms2−/− are indicated by arrowheads at the right of each panel. Mutation load and expansion load, quantified from the analysis of ∼550 molecules per genotype, are shown in panels C and D, respectively. Panels E and F indicate the distribution of repeat lengths for all SP-PCR products of (E) Pms2+/+ and (F) Pms2−/−. These graphs are therefore an exact, combined representation of all blots analyzed for Pms2+/+ and Pms2−/− cerebrum. The number of products measured at each repeat length is on the Y-axis. Note that the actual frequency of products at the progenitor lengths may be slightly higher than the counted frequency, because a single band may represent PCR products amplified from multiple progenitor molecules. The gray bands represent the progenitor allele lengths +/−5%. Lines within the gray bands represent unchanged bands. Lines outside of these ranges represent mutations. Those to the right of the rightmost band represent expansions, those to the left of the leftmost band represent contractions, and those in the middle represent unclassified mutations. Panel G shows the cumulative number of products observed with increasing expansion size. Expansion size was calculated as the percent increase in repeat length above the largest progenitor. An incremental increase is seen in Pms2−/− versus Pms2+/+ mice at all expansion sizes, but the increase in magnitude of the difference at higher expansion sizes points to the role of Pms2 in preferentially suppressing large expansions in the cerebrum.
Analysis of instability in cerebellum of Pms2+/+ and Pms2−/− mice. Progenitor allele lengths of 232, 199, 143, and 99 repeats for (A) Pms2+/+, and 224, 195, 152, and 101 repeats for (B) Pms2−/− are indicated by arrowheads. Mutation load and expansion load, calculated from analysis of ∼400 molecules per genotype, are shown in C and D, respectively. Panels E and F indicate the distribution of repeat lengths for all SP-PCR products of (E) Pms2+/+ and (F) Pms2−/−. These graphs are therefore an exact, combined representation of all blots analyzed for Pms2+/+ and Pms2−/− cerebrum. The number of products measured at each repeat length is on the Y-axis. Note that the actual frequency of products at the progenitor lengths may be slightly higher than the counted frequency, because a single band may represent PCR products amplified from multiple progenitor molecules. The gray bars represent the progenitor allele lengths +/−5%. Lines within the gray bands represent unchanged bands. Lines outside of these ranges represent mutations. Those to the right of the rightmost band represent expansions, those to the left of the leftmost band represent contractions, and those in the middle represent unclassified mutations. Products outside of these ranges were counted as mutations. Panel G shows the cumulative number of products observed with increasing expansion size. Expansion size was calculated as the percent increase in repeat length above the largest progenitor. An incremental increase is seen in Pms2−/− versus Pms2+/+ mice at all expansion sizes, but the increase in magnitude of the difference at higher expansion sizes points to the role of Pms2 in preferentially suppressing large expansions in the cerebellum.
Panels show the distribution of repeat lengths measured in (A,B) heart and (C,D) kidney from (A,C) Pms2+/+ and (B,D) Pms2−/− littermates. The number of products measured at each repeat length is on the Y-axis. Gray bars represent the length of the progenitor alleles, +/−5%. Products that fell outside of these ranges were considered mutations.
Furthermore, Pms2 deficiency in the cerebrum significantly increased the prevalence of large expansions, defined as those >10% longer than the longest progenitor allele (Figure 2; 9.0% of amplifiable molecules in YG8-Pms2−/− vs. 3.4% in YG8-Pms2+/+; P<0.05). In addition, while Pms2 deficiency did not affect the prevalence of large expansions in other tissues, we did observe that expansions exceeding the longest progenitor allele length by >20% occurred only in the cerebellum (n = 7) and DRG (n = 1) of the Pms2 knockout, and not in the wild-type.
To further investigate the influence of Pms2 deficiency on the prevalence of large expansions in the cerebellum and DRG, we performed additional small-pool PCR experiments using ∼80 amplifiable molecules per reaction to facilitate the detection of rare large expansions. Analysis of >8,000 amplifiable molecules from the cerebellum and >1,500 from the DRG revealed a significant increase in the prevalence of large expansions in the absence of Pms2 (Table 3). The absence of Pms2 also significantly changed the size distribution of expansions in the cerebellum (Figure 5; P<0.05). In the absence of Pms2, we observed a median expansion size of 20.2% (P<0.005 compared to wild-type), including expansions that were >40% longer than the longest progenitor allele; in contrast, in the presence of Pms2, the median expansion size was 14.8%, with no expansions of >40% and very few expansions of >20%. Together, these data indicate that Pms2 suppresses large expansions of the (GAA·TTC)n sequence in neuronal tissues.
SP-PCR was performed using approximately 80 molecules per reaction to detect rare large expansions. Expansion sizes were calculated as the percent increase, of number of repeats, from the longest progenitor. Expansions of at least 10% were considered large expansions. The size distribution of large expansions was significantly different between Pms2+/+ and Pms2−/− cerebellum (P<0.05).
Expansions of the (GAA·TTC)n sequence have been shown to accumulate with age, both in humans ,  and in YG8 mice . Therefore, to determine if Pms2 influenced the age-dependent accumulation of expansions, we analyzed the prevalence of large expansions in the cerebellum of 5-month old YG8-Pms2+/+ and YG8-Pms2−/− mice (Table 3). Rare large expansions were already present at 5 months of age in the knockout, but these were not observed in the wild-type. The prevalence of large expansions was significantly higher at 12 months of age (Table 3), suggesting that Pms2 deficiency also influences the age-dependent rate of accumulation of large expansions of the (GAA·TTC)n sequence in the cerebellum.
Consistent with previous studies on somatic instability of trinucleotide repeats , –, our data also show that mammalian MutS homologs are required for further somatic expansion of the expanded (GAA·TTC)n sequence in transgenic mice. This observation is also consistent with the observation that expansion of the (GAA·TTC)n sequence in induced pluripotent stem cells (iPSCs) derived from fibroblasts of Friedreich ataxia patients was dependent on Msh2 . Our analysis did not directly assess the role of MutSβ in somatic instability, since the effects of Msh3 were not determined. Our data specifically point to a role for MutSα in somatic expansions, but this does not exclude the possibility that MutSβ participates in somatic instability of the expanded (GAA·TTC)n sequence.
Mismatch repair proteins may act on a lesion or secondary structure of the (GAA·TTC)n sequence that predominantly occurs in neuronal tissues, since instability in non-neuronal tissues was not affected by Msh2, Msh6, or Pms2. Triplexes, R-loops and/or DNA double strand breaks (DSBs) are likely candidates. The (GAA·TTC)n sequence is known to form Y·R·Y triplexes under physiologically relevant conditions . Triplex structures have been shown to promote replication stalling , which is known to promote DSB formation , and evidence suggests that recombinases may preferentially target expanded (GAA·TTC)n sequences in a Y·R·Y formation . Similarly, R-loops, in which the nascent transcript forms a duplex with the transcriptional template strand, are mutagenic structures that can result in DSBs and recombination via TAR (transcription associated recombination) . R-loops do not normally form during transcription because the nascent transcript is prevented from interacting with the template DNA. However, R-loop formation is known to occur when the (GAA·TTC)n sequence is transcribed , and multiple observations suggest that R-loops may be involved in the expansion process. First, with increasing or decreasing transcription of the (GAA·TTC)n sequence in cultured human cells, somatic instability and expansions also increase or decrease, respectively , . In addition, data from cultured primate cells implicate a direct role for the GAA-containing transcript in causing expansions of the (GAA·TTC)n sequence . Therefore, factors that regulate somatic expansion of the (GAA·TTC)n sequence could do so by influencing the formation and/or resolution of R-loop formation
Several factors promote R-loop formation, such as defective biogenesis of messenger ribonucleoprotein (mRNP) particles, deficiency of proteins responsible for splicing and/or nuclear export of transcripts, and sequence composition of the non-template strand . For instance, human cells deficient in the ASF/SF2 splicing factor are prone to R-loop formation and DSBs . In addition, both transcription and G-rich sequences, which adopt a G-quartet structure on the non-template strand, facilitate R-loop formation in the switch regions of immunoglobulin (Ig) genes, a requirement for intrachromosomal class switch recombination during antibody isotype switching in mature B cells . The R-loop structure predicted for the transcribed (GAA·TTC)n sequence involves the nascent GAA-containing transcript forming an R-loop structure with the TTC-containing template strand, which would result in a segment of single-stranded non-template strand . This GAA-containing stand would be expected to adopt one or more metastable secondary structures, as the single-stranded GAA sequence is known to do , and the very low affinity of polypurine sequences for the eukaryotic single-strand binding protein (replication protein A) , would be expected to exacerbate secondary structure formation. In this regard, the R-loops formed during transcription of the (GAA·TTC)n sequence would be similar to those promoted by the G-rich sequences in Ig class switch recombination. In addition, like the R-loops formed in Ig genes, it is possible that one or more of the abnormal structures of the non-template GAA strand could be recognized by MutS homologs, leading to error prone repair. This model is consistent with the observation made by Ku and colleagues, who demonstrated that Msh2 binds in or near the expanded (GAA·TTC)n sequence in vivo in iPSCs derived from Friedreich ataxia patients .
If the R-loop or triplex structures results in a DSB, as in Ig class switch recombination and in cells deficient in ASF/SF2 splicing factor, then expansions of the (GAA·TTC)n sequence could occur via MMR proteins contributing to error-prone DSB repair. DSB repair, via both the non-homologous end-joining and homologous recombination pathways, is known to occur in neurons and the postnatal brain , . Indeed, given the extensive complementarity across the expanded (GAA·TTC)n tract, and its propensity to adopt secondary structures, DSB repair within the expanded (GAA·TTC)n sequence is known to be highly error-prone, with misalignments contributing to repeat instability . It is also noteworthy that the DNA breaks during class switch recombination in B cells actually occur in the G1 phase and are dependent upon MMR . Thus, MutS-initiated error prone repair of secondary structures or DSBs could explain the reduced levels of somatic expansions observed in the mice deficient in MutS homologs.
As part of the MutLα complex, Pms2 links mismatch recognition by MutS homologs to the downstream steps in MMR. Therefore, if the canonical MMR pathway affects somatic instability of (GAA·TTC)n sequences, Pms2 would be expected to have an effect on repeat instability similar to that of the MutS homologs. Indeed, the absence of Pms2 reduced the rate of expansion of the (CAG·CTG)n sequence in transgenic mice by ∼50% . Paradoxically, our results indicate that the absence of Pms2 actually increases the prevalence of large expansions of the (GAA·TTC)n sequence in neuronal tissues, which contrasts with the dramatic reduction of somatic expansions caused by the absence of MutS homologs. The exact reason for this difference is unclear; however, it is noteworthy that Pms2 has also been shown to suppress expansions of other non-hairpin-forming microsatellite repeats, albeit in disparate assays that involved shorter tracts. For instance, in Pms2-null human cell lines, a plasmid-based assay showed a higher percentage of length mutations involving small expansions rather than contractions of the (TTCC·GGAA)9 sequence . Similarly, at three different mouse loci containing (CA)n repeats, the absence of Pms2 produced instability that was biased toward expansions in somatic tissues, although these expansions only consisted of small gains . It is interesting to note that the (TTCC·GGAA)n sequence forms a polypurine·polypyrimidine tract similar to the (GAA·TTC)n sequence whereas the (CA)n repeat does not. Indeed, intergenerational transmission of the expanded (GAA·TTC)n sequence by Pms2 null mice led to expansions in the offspring in the majority of transmissions, and this skewing towards expansions was not seen in mice deficient in other MMR proteins . Taken together, these data tend to support a broader role for Pms2 in the prevention of expansion mutations involving a variety of microsatellite repeats, and possibly independent of its role in the MMR pathway.
How Pms2 prevents the accumulation of large expansions in neuronal tissues remains unclear; however, it is plausible that this may occur via its ability to act independently of Msh2 in suppressing homologous recombination . It is conceivable that the deficiency of Pms2, and thereby its ability to suppress homologous recombination, could result in large expansions of the (GAA·TTC)n sequence via skewing of the repair mechanism employed for DSB repair (Figure 6). This model would be consistent with the differential effects of Pms2 and MutSα deficiency on the expanded (GAA·TTC)n sequence we observed.
(A) Repair of a double strand break within the repeat sequence begins with 5′ to 3′ resection of the free ends in the broken molecule (thin solid line). (B) The unpaired end of a strand in the broken molecule anneals to a complementary sequence in the template molecule (thick solid line). (C) In the absence of Pms2, DNA synthesis extends the invading strand (thin arrow). In the presence of Pms2, extension is inhibited. The break may be repaired by non-homologous end joining. (D) Erroneous alignment of the newly synthesized DNA (thick dashed line) during reannealing leaves a gap that is filled by another round of synthesis (thin dashed line), resulting in an expansion. The template molecule remains unchanged.
It is unclear why the accumulation of mutations varies substantially among different regions of the nervous system. Potential contributing factors include the distinct variability of cell types composing various regions of the nervous system, differential prevalence of triplexes, R-loops and DSBs, and differential activity and fidelity of DNA repair mechanisms. Indeed, DNA repair activity varies between different regions of the brain . In addition, cellular DSBs, which are usually efficiently repaired in somatic cells to permit cell survival , are known to accumulate in postmitotic neurons in an age-dependent manner . These changes could thus allow for the age-dependent accumulation of expansions in specific components of the nervous system. Furthermore, the expanded (GAA·TTC)n sequence became more unstable during intergenerational transmission , as opposed to the enhanced stability we observed in somatic tissues of Msh2 and Msh6 null mice. This, and the age-dependent accumulation of mutations in somatic tissues, suggests that the mechanism of expansion and contraction of the (GAA·TTC)n sequence are different in somatic and germ cells.
The progressive expansion of the (GAA·TTC)n sequence in the FXN gene in DRG and cerebellum of Friedreich ataxia patients may play a role in influencing the tissue-specific pathogenesis of this disease . Furthermore, the progressive expansion of relatively short alleles within the expanded range may influence the development of mild, late-onset Friedreich ataxia . Therefore, the finding that Pms2 prevents large expansions of the (GAA·TTC)n sequences in neuronal tissues is particularly exciting. Given this unique role, it is tempting to speculate that Pms2 may serve as a genetic modifier of the disease phenotype in Friedreich ataxia.
We thank Hein te Riele (The Netherlands Cancer Institute) for providing the Msh2 and Msh6 knockout mice and Michael Liskay (Oregon Health & Science University) and Darren Monckton (University of Glasgow) for providing the Pms2 knockout mice.
Conceived and designed the experiments: RLB IDB MAP SIB. Performed the experiments: RLB IDB RMP CS SAM. Analyzed the data: RLB SIB. Contributed reagents/materials/analysis tools: MAP SIB. Wrote the paper: RLB SIB.
- 1. Lopez CA, Cleary JD, Pearson CE (2010) Repeat instability as the basis for human diseases and as a potential target for therapy. Nat Rev Mol Cell Biol 11: 165–70.
- 2. Kennedy L, Evans E, Chen CM, Craven L, Detloff PJ, et al. (2003) Dramatic tissue-specific mutation length increases are an early molecular event in Huntington disease pathogenesis. Hum Mol Genet 12: 3359–3367.
- 3. Shelbourne PF, Keller-McGandy C, Bi WL, Yoon SR, Dubeau L, et al. (2007) Triplet repeat mutation length gains correlate with cell-type specific vulnerability in Huntington disease brain. Hum Mol Genet 16: 1133–1142.
- 4. Campuzano V, Montermini L, Molto MD, Pianese L, Cossee M, et al. (1996) Friedreich's ataxia: Autosomal recessive disease caused by an intronic GAA triplet repeat expansion. Science 271: 1423–1427.
- 5. De Biase I, Rasmussen A, Endres D, Al-Mahdawi S, Monticelli A, et al. (2007) Progressive GAA expansions in dorsal root ganglia of Friedreich's ataxia patients. Ann Neurol 61: 55–60.
- 6. Clark RM, De Biase I, Malykhina AP, Al-Mahdawi S, Pook M, et al. (2007) The GAA triplet-repeat is unstable in the context of the human FXN locus and displays age-dependent expansions in cerebellum and DRG in a transgenic mouse model. Hum Genet 120: 633–640.
- 7. Kovtun IV, Thornhill AR, McMurray CT (2004) Somatic deletion events occur during early embryonic development and modify the extent of CAG expansion in subsequent generations. Hum Mol Genet 13: 3057–3068.
- 8. Wheeler VC, Lebel LA, Vrbanac V, Teed A, te Riele H (2003) Mismatch repair gene Msh2 modifies the timing of early disease in Hdh(Q111) striatum. Hum Mol Genet 12: 273–281.
- 9. Al-Mahdawi S, Pinto RM, Varshney D, Lawrence L, Lowrie MB, et al. (2006) GAA repeat expansion mutation mouse models of Friedreich ataxia exhibit oxidative stress leading to progressive neuronal and cardiac pathology. Genomics 88: 580–590.
- 10. Al-Mahdawi S, Pinto RM, Ismail O, Varshney D, Lymperi S, et al. (2008) The Friedreich ataxia GAA repeat expansion mutation induces comparable epigenetic changes in human and transgenic mouse brain and heart tissues. Hum Mol Genet 17: 735–746.
- 11. Miranda CJ, Santos MM, Ohshima K, Smith J, Li L, et al. (2002) Frataxin knockin mouse. FEBS Lett 512: 291–297.
- 12. Sureshkumar S, Todesco M, Schneeberger K, Harilal R, Balasubramanian S, et al. (2009) A genetic defect caused by a triplet repeat expansion in Arabidopsis thaliana. Science 323: 1060–1063.
- 13. Jiricny J (2006) The multifaceted mismatch-repair system. Nat Rev Mol Cell Biol 7: 335–346.
- 14. Modrich P (2006) Mechanisms in eukaryotic mismatch repair. J Biol Chem 281: 30305–30309.
- 15. Kadyrov FA, Dzantiev L, Constantin N, Modrich P (2006) Endonucleolytic function of MutLalpha in human mismatch repair. Cell 126: 297–308.
- 16. Kadyrov FA, Homes SF, Arana ME, Lukianova OA, O'Donnell M, et al. (2007) Saccharomyces cerevisiae MutLa is a mismatch repair endonuclease. J Biol Chem 282: 37181–37190.
- 17. Jun SH, Kim TG, Ban C (2006) DNA mismatch repair system: Classical and fresh roles. FEBS J 273: 1609–1619.
- 18. Goellner GM, Tester D, Thibodeau S, Almqvist E, Goldberg YP, et al. (1997) Different mechanisms underlie DNA instability in Huntington disease and colorectal cancer. Am J Hum Genet 60: 879–890.
- 19. Manley K, Shirley TL, Flaherty L, Messer A (1999) Msh2 deficiency prevents in vivo somatic instability of the CAG repeat in Huntington disease transgenic mice. Nat Genet 23: 471–473.
- 20. van den Broek WJ, Nelen MR, Wansink DG, Coerwinkel MM, te Riele H, et al. (2002) Somatic expansion behaviour of the (CTG)n repeat in myotonic dystrophy knock-in mice is differentially affected by Msh3 and Msh6 mismatch-repair proteins. Hum Mol Genet 11: 191–198.
- 21. Savouret C, Brisson E, Essers J, Kanaar R, Pastink A, et al. (2003) CTG repeat instability and size variation timing in DNA repair-deficient mice. EMBO J 22: 2264–2273.
- 22. Dragileva E, Hendricks A, Teed A, Gillis T, Lopez ET, et al. (2009) Intergenerational and striatal CAG repeat instability in Huntington's disease knock-in mice involve different DNA repair genes. Neurobiol Dis 33: 37–47.
- 23. Tian L, Hou C, Tian K, Holcomb NC, Gu L, et al. (2009) Mismatch recognition protein MutSβ does not hijack (CAG)n hairpin repair in vitro. J Biol Chem 284: 20452–20456.
- 24. Gomes-Pereira M, Fortune MT, Ingram L, McAbney JP, Monckton DG (2004) Pms2 is a genetic enhancer of trinucleotide CAG·CTG repeat somatic mosaicism: Implications for the mechanism of triplet repeat expansion. Hum Mol Genet 13: 1815–1825.
- 25. Tome S, Holt I, Edelmann W, Morris GE, Munnich A, et al. (2009) Msh2 ATPase domain mutation affects CTG·CAG repeat instability in transgenic mice. PLoS Genet 5: e1000482.
- 26. Pearson CE, Ewel A, Acharya S, Fishel RA, Sinden RR (1997) Human Msh2 binds to trinucleotide repeat DNA structures associated with neurodegenerative diseases. Hum Mol Genet 6: 1117–1123.
- 27. Panigrahi GB, Lau R, Montgomery SE, Leonard MR, Pearson CE (2005) Slipped (CTG)·(CAG) repeats can be correctly repaired, escape repair or undergo error-prone repair. Nat Struct Mol Biol 12: 654–662.
- 28. Owen BA, Yang Z, Lai M, Gajek M, Badger JD, et al. (2005) (CAG)(n)-hairpin DNA binds to Msh2-Msh3 and changes properties of mismatch recognition. Nat Struct Mol Biol 12: 663–670.
- 29. Su SS, Lahue RS, Au KG, Modrich P (1988) Mispair specificity of methyl-directed DNA mismatch correction in vitro. J Biol Chem 263: 6829–6835.
- 30. Varlet I, Radman M, Brooks P (1990) DNA mismatch repair in Xenopus egg extracts: Repair efficiency and DNA repair synthesis for all single base-pair mismatches. Proc Natl Acad Sci U S A 87: 7883–7887.
- 31. Hawk JD, Stefanovic L, Boyer JC, Petes TD, Farber RA (2005) Variation in efficiency of DNA mismatch repair at different sites in the yeast genome. Proc Natl Acad Sci U S A 102: 8639–8643.
- 32. Pluciennik A, Modrich P (2007) Protein roadblocks and helix discontinuities are barriers to the initiation of mismatch repair. Proc Natl Acad Sci U S A 104: 12709–12713.
- 33. Smith GK, Jie J, Fox GE, Gao X (1995) DNA CTG triplet repeats involved in dynamic mutations of neurologically related gene sequences form stable duplexes. Nucleic Acids Res 23: 4303–4311.
- 34. Pearson CE, Sinden RR (1996) Alternative structures in duplex DNA formed within the trinucleotide repeats of the myotonic dystrophy and fragile X loci. Biochemistry 35: 5041–5053.
- 35. Bidichandani SI, Ashizawa T, Patel PI (1998) The GAA triplet repeat expansion in Friedreich ataxia interferes with transcription and may be associated with an unusual DNA structure. Am J Hum Genet 62: 111–121.
- 36. Sakamoto N, Chastain PD, Parniewski P, Ohshima K, Pandolfo M, et al. (1999) Sticky DNA: self-association properties of long GAA.TTC repeats in R.R.Y triplex structures from Friedreich's ataxia. Mol Cell 3: 465–475.
- 37. Suen IS, Rhodes JN, Christy M, McEwen B, Gray DM, et al. (1999) Structural properties of Friedreich's ataxia d(GAA) repeats. Biochim Biophys Acta 1444: 14–24.
- 38. Mariappan SV, Catasti P, Silks LA 3rd, Bradbury EM, Gupta G (1999) The high-resolution structure of the triplex formed by the GAA/TTC triplet repeat associated with Friedreich's ataxia. J Mol Biol 285: 2035–2052.
- 39. Tiner WJS, Potaman VN, Sinden RR, Lyubchenko YL (2001) The structure of intramolecular triplex DNA: Atomic force microscopy study. J Mol Biol 314: 353–357.
- 40. Potaman VN, Oussatcheva EA, Lyubchenko YL, Shlyakhtenko LS, Bidichandani SI, et al. (2004) Length-dependent structure formation in Friedreich ataxia (GAA)n·(TTC)n repeats at neutral pH. Nucleic Acids Res 32: 1224–1231.
- 41. Kim HM, Narayanan V, Mieczkowski PA, Petes TD, Krasilnikova MM, et al. (2008) Chromosome fragility at GAA tracts in yeast depends on repeat orientation and requires mismatch repair. EMBO J 27: 2896–2906.
- 42. Al-Mahdawi S, Pinto RM, Ruddle P, Carroll C, Webster Z, et al. (2004) GAA repeat instability in Friedreich ataxia YAC transgenic mice. Genomics 84: 301–310.
- 43. Baker SM, Bronner CE, Zhang L, Plug AW, Robatzek M, et al. (1995) Male mice defective in the DNA mismatch repair gene Pms2 exhibit abnormal chromosome synapsis in meiosis. Cell 82: 309–319.
- 44. de Wind N, Dekker M, Berns A, Radman M, te Riele H (1995) Inactivation of the mouse Msh2 gene results in mismatch repair deficiency, methylation tolerance, hyperrecombination, and predisposition to cancer. Cell 82: 321–330.
- 45. de Wind N, Dekker M, Claij N, Jansen L, van Klink Y, et al. (1999) HNPCC-like cancer predisposition in mice through simultaneous loss of Msh3 and Msh6 mismatch-repair protein functions. Nat Genet 23: 359–362.
- 46. Gomes-Pereira M, Bidichandani SI, Monckton DG (2004) Analysis of unstable triplet repeats using small-pool polymerase chain reaction. Methods Mol Biol 277: 61–76.
- 47. Ezzatizadeh V, Mouro Pinto R, Sandi C, Sandi M, Al-Mahdawi S, et al. (2012) The mismatch repair system protects against intergenerational GAA repeat instability in a Friedreich ataxia mouse model. Neurobiol Dis 46: 165–171.
- 48. Sharma R, Bhatti S, Gomez M, Clark RM, Murray C, et al. (2002) The GAA triplet-repeat sequence in Friedreich ataxia shows a high level of somatic instability in vivo, with a significant predilection for large contractions. Hum Mol Genet 11: 2175–2187.
- 49. De Biase I, Rasmussen A, Monticelli A, Al-Mahdawi S, Pook M, et al. (2007) Somatic instability of the expanded GAA triplet-repeat sequence in Friedreich ataxia progresses throughout life. Genomics 90: 1–5.
- 50. Ku S, Soragni E, Campau E, Thomas EA, Altun G, et al. (2010) Friedreich's ataxia induced pluripotent stem cells model intergenerational GAA⋅TTC triplet repeat instability. Cell Stem Cell 7: 631–637.
- 51. Krasilnikova MM, Mirkin SM (2004) Replication stalling at Friedreich's ataxia (GAA)n repeats in vivo. Mol. Cell. Biol 24: 2286–2295.
- 52. Michel B, Ehrlich SD, Uzest M (1997) DNA double-strand breaks caused by replication arrest. EMBO J 16: 430–438.
- 53. Mancuso M, Sammarco MC, Grabczyk E (2010) Transposon Tn7 preferentially inserts into GAA*TTC triplet repeats under conditions conducive to Y*R*Y triplex formation. PLoS One 5 (6): e11121.
- 54. Aguilera A, Gómez-González B (2008) Genome instability: a mechanistic view of its causes and consequences. Nat Rev Genet 9: 204–217.
- 55. Grabczyk E, Mancuso M, Sammarco MC (2007) A persistent RNA.DNA hybrid formed by transcription of the Friedreich ataxia triplet repeat in live bacteria, and by T7 RNAP in vitro. Nucleic Acids Res 35: 5351–5359.
- 56. Soragni E, Herman D, Dent SY, Gottesfeld JM, Wells RD, et al. (2008) Long intronic GAA*TTC repeats induce epigenetic changes and reporter gene silencing in a molecular model of Friedreich ataxia. Nucleic Acids Res 36: 6056–6065.
- 57. Ditch S, Sammarco MC, Banerjee A, Grabczyk E (2009) Progressive GAA.TTC repeat expansion in human cell lines. PLoS Genet 5: e1000704.
- 58. Rindler PM, Bidichandani SI (2010) Role of transcript and interplay between transcription and replication in triplet-repeat instability in mammalian cells. Nucleic Acids Res 39: 526–535.
- 59. Li X, Manley JL (2005) Inactivation of the SR protein splicing factor ASF/SF2 results in genomic instability. Cell 122: 365–378.
- 60. Duquette ML, Handa P, Vincent JA, Taylor AF, Maizels N (2004) Intracellular transcription of G-rich DNAs induces formation of G-loops, novel structures containing G4 DNA. Genes Dev 18: 1618–29.
- 61. LeProust EM, Pearson CE, Sinden RR, Gao X (2000) Unexpected formation of parallel duplex in GAA and TTC trinucleotide repeats of Friedreich's ataxia. J Mol Biol 302: 1063–1080.
- 62. Kim C, Snyder RO, Wold MS (1992) Binding properties of replication protein A from human and yeast cells. Mol Cell Biol 12: 3050–3059.
- 63. Merlo D, Di Stasi AM, Bonini P, Mollinari C, Cardinale A, et al. (2005) DNA repair in post- mitotic neurons: a gene-trapping strategy. Cell Death Differ 12: 307–309.
- 64. Fishel ML, Vasko MR, Kelley MR (2007) DNA repair in neurons: so if they don't divide what's to repair? Mutat Res 614: 24–36.
- 65. Pollard LM, Bourn RL, Bidichandani SI (2008) Repair of DNA double-strand breaks within the (GAA·TTC)n sequence results in frequent deletion of the triplet-repeat sequence. Nucleic Acids Res 36: 489–500.
- 66. Schrader CE, Guikema JE, Linehan EK, Selsing E, Stavnezer J (2008) Activation-induced cytidine deaminase-dependent DNA breaks in class switch recombination occur during G1 phase of the cell cycle and depend upon mismatch repair. J Immunol 179: 6064–6071.
- 67. Yao X, Buermeyer AB, Narayanan L, Tran D, Baker SM, et al. (1999) Different mutator phenotypes in Mlh1- versus Pms2-deficient mice. Proc Natl Acad Sci U S A 96: 6850–6855.
- 68. Shah SN, Eckert KA (2009) Human postmeiotic segregation 2 exhibits biased repair at tetranucleotide microsatellite sequences. Cancer Res 69: 1143–1149.
- 69. Siehler SY, Schrauder M, Gerischer U, Cantor S, Marra G, et al. (2009) Human MutL- complexes monitor homologous recombination independently of mismatch repair. DNA Repair (Amst) 8: 242–252.
- 70. Pardo B, Gómez-González B, Aguilera A (2009) DNA repair in mammalian cells: DNA double-strand break repair: how to fix a broken relationship. Cell Mol Life Sci 66: 1039–1056.
- 71. Brasnjevic I, Hof PR, Steinbusch HWM, Schmitz C (2008) Accumulation of nuclear DNA damage or neuron loss: Molecular basis for a new approach to understanding selective neuronal vulnerability in neurodegenerative diseases. DNA Repair 7: 1087–1097.
- 72. Sharma R, Gómez M, De Biase I, Ashizawa T, Bidichandani SI (2004) Friedreich ataxia in carriers of somatically unstable borderline GAA repeat alleles. Ann Neurol 56: 898–901.