Mismatch Repair Balances Leading and Lagging Strand DNA Replication Fidelity

The two DNA strands of the nuclear genome are replicated asymmetrically using three DNA polymerases, α, δ, and ε. Current evidence suggests that DNA polymerase ε (Pol ε) is the primary leading strand replicase, whereas Pols α and δ primarily perform lagging strand replication. The fact that these polymerases differ in fidelity and error specificity is interesting in light of the fact that the stability of the nuclear genome depends in part on the ability of mismatch repair (MMR) to correct different mismatches generated in different contexts during replication. Here we provide the first comparison, to our knowledge, of the efficiency of MMR of leading and lagging strand replication errors. We first use the strand-biased ribonucleotide incorporation propensity of a Pol ε mutator variant to confirm that Pol ε is the primary leading strand replicase in Saccharomyces cerevisiae. We then use polymerase-specific error signatures to show that MMR efficiency in vivo strongly depends on the polymerase, the mismatch composition, and the location of the mismatch. An extreme case of variation by location is a T-T mismatch that is refractory to MMR. This mismatch is flanked by an AT-rich triplet repeat sequence that, when interrupted, restores MMR to >95% efficiency. Thus this natural DNA sequence suppresses MMR, placing a nearby base pair at high risk of mutation due to leading strand replication infidelity. We find that, overall, MMR most efficiently corrects the most potentially deleterious errors (indels) and then the most common substitution mismatches. In combination with earlier studies, the results suggest that significant differences exist in the generation and repair of Pol α, δ, and ε replication errors, but in a generally complementary manner that results in high-fidelity replication of both DNA strands of the yeast nuclear genome.


Introduction
Three processes operate to ensure faithful replication of the eukaryotic nuclear genome [1,2]. The first is the ability of DNA polymerases a, d and e to selectively insert correct rather than incorrect nucleotides onto correctly aligned rather than misaligned primer-templates. The second is proofreading, the 39 exonucleolytic excision of errors from the primer terminus during replication. The third is mismatch repair (MMR) of errors that escape proofreading (reviewed in [3][4][5][6][7]). MMR begins when a mismatch is recognized by homologues of the bacterial MutS homodimer, either Msh2-Msh6 (MutSa) or Msh2-Msh3 (MutSb). This recognition initiates a series of steps that ultimately remove the replication error from the nascent strand and allow new DNA to be synthesized accurately.
The origin and nature of the strand discrimination signal used for MMR in vivo remains uncertain. MMR requires the presence of a discontinuity in the newly synthesized strand. At least in vitro, this discontinuity can be a nick or gap located either 39 or 59 to the mismatch, with the protein requirements for MMR differing somewhat depending on the location of the DNA ends relative to the mismatch. This provides an attractive possibility (reviewed in [3]), namely that MMR may be directed to the nascent strand by the 39 ends of growing chains at the replication fork and/or by the 59 ends of Okazaki fragments that are transiently present during lagging strand replication. That the latter could provide a higher signal density for MMR of lagging strand replication errors was suggested in an earlier study of MMR of a damaged (8-oxo-G-A) mismatch [8]. This leads to a previously unexplored question addressed by the present study, i.e., is the efficiency of MMR similar or different for mismatches generated during leading and lagging strand replication?
Investigation of this question is complicated by the fact that DNA polymerases a, d and e (Pols a, d and e, respectively) are all required to efficiently replicate the nuclear genome [9], and these polymerases have different error rates and error specificities [2,10]. Over the years, multiple models have been considered for the division of labor among these three polymerases during replication (reviewed in [9][10][11][12]). Among these models, recent evidence [2,13,14] suggests that under normal circumstances, the leading strand template is primarily replicated by Pol e, while the lagging strand template is replicated by Pol a-primase and Pol d.
Although MMR corrects errors made by all three polymerases [2,13,[15][16][17][18][19][20][21], it has only recently become possible to determine the extent to which MMR efficiency, and possibly MMR enzymology, varies depending on the replicase that made the error, the nascent strand containing the error and/or the location of the error within a DNA strand. We are investigating these variables using Saccharomyces cerevisiae strains containing mutant alleles of the POL1 (Pol a), POL2 (Pol e) and POL3 (Pol d) genes. These mutant alleles, pol1-L868M [18,19], pol2-M644G [13] and pol3-L612M ( [2] and references therein), encode enzymes with single animo acid replacements at the polymerase active site that reduce the fidelity of DNA synthesis. As a consequence, strains harboring these alleles have elevated spontaneous mutation rates, thereby allowing assignment of responsibility for most in vivo errors to a chosen mutator polymerase, rather than its wild type counterparts [2,13]. In strains containing these mutator polymerases, URA3 mutation rates and mutational spectra can be determined and used to calculate the rates for specific mutations, e.g., single base substitutions and insertions/deletions (indels) in various sequence contexts. Comparison of these rates in MMRproficient yeast strains to strains that lack MSH2-dependent MMR yields a calculation of the apparent MSH2-dependent MMR efficiency for a variety of replication errors generated during replication in vivo.
Using this approach, we recently described the efficiency of repairing lagging strand replication errors generated by L868M Pol a and L612M Pol d [21]. Here we extend the effort using yeast strains encoding M644G Pol e, allowing the comparison of MMR correction efficiencies for replication errors made by each of the three eukaryotic replicative polymerases. The results indicate that on average, MMR balances the fidelity of leading and lagging strand DNA replication, but with exceptions that place some base pairs at high risk of mutation from replication infidelity even in cells with normal MMR.

Results
The present study presents what to our knowledge is the first direct comparison of MMR efficiency for errors made by all three replicases in vivo, thereby providing insights into the contribution of MMR to leading and lagging strand replication fidelity. This comparison is a continuation of efforts to examine the possibility that MMR may be directed to the nascent strand by the 39 ends of growing chains at the replication fork [22], and/or by the 59 ends of Okazaki fragments that are transiently present during lagging strand replication [8].
Pol e preferentially incorporates rNMPs into the nascent leading strand Our previous inference that Pol e is a leading strand replicase was based on patterns of rare mutations in one gene (URA3) at one locus (AGP1) [13]. Two recent studies have made it feasible to test Pol e strand assignment using a different biomarker, ribonucleotide incorporation into nuclear DNA. The first study demonstrated that, in addition to reduced fidelity for single base mismatches, M644G Pol e also has reduced sugar discrimination, i.e., it incorporates rNTPs into DNA much more readily than does wildtype Pol e [23]. In that study, rNMPs incorporated into nascent DNA during replication by M644G Pol e were detected as alkalisensitive sites in the nuclear genome of a pol2-M644G rnh201D strain, which lacks the ability to repair rNMPs in DNA due to deletion of the RNH201 gene encoding the catalytic subunit of RNase H2. A more recent study exploited this fact to probe the genomic DNA of a homologous S. pombe pole-M630F rnh201D mutant strain by strand-specific Southern blotting [14]. When strand-specific probes flanking ARS3003/3004 were used, the results revealed that more rNMPs were incorporated into the nascent leading strand than into the nascent lagging strand. This led to the interpretation that, as in budding yeast, fission yeast Pol e is also the primary leading strand replicase [14]. Using this same strategy, we examined the strand specificity of rNMP incorporation in S. cerevisiae pol2-M644G rnh201D strains with the URA3 reporter in one of two possible orientations, using alkali treatment and subsequent probing for either the nascent leading or lagging strand with strand-specific URA3 probes ( Figure 1A). One of the two strands from each pol2-M644G rnh201D strain was preferentially sensitive to alkaline hydrolysis ( Figure 1B). In each case, this corresponded to the nascent leading strand products of replication (probe A in orientation 2 and probe B in orientation 1). These results strongly support the idea that Pol e preferentially replicates the leading strand template. Note that the distribution of ribonucleotides within the two strands across the whole genome remains to be determined and could differ.

Mutagenesis in MMR-proficient pol2-M644G strains
The strategy used here to study strand-specific MMR involves measuring spontaneous mutation rates in yeast strains with the URA3 reporter gene present in either of two orientations, both proximal to ARS306, a well-characterized, early-firing replication origin [24]. In our initial study of the role of Pol e in replication [13], we compared mutation rates in MMR proficient (MSH2 + ) strains with wild type Pol e (encoded by the POL2 gene) to rates in strains with the pol2-M644G mutation. The pol2-M644G strains had elevated mutation rates [13], an observation that is reproduced here ( Table 1). The majority of 5-FOA resistant mutants had single-base mutations in the URA3 gene. In orientation 1, these were predominantly A-T to T-A mutations at base pairs 279 and 686. These mutations were rare in orientation 2 (partial spectra in [13], complete spectra in Figure  S1A). This strong orientation bias, and the fact that the in vitro error rate for template T-dTMP mismatches by M644G Pol e is much higher than the error rate for template A-dAMP mismatches, implies that Pol e participates in leading strand DNA replication [13]. Two later studies [2,25] indicated that Pol d primarily acts as a lagging strand polymerase and has a less substantial role in leading strand replication. This further implied

Author Summary
The stability of complex and highly organized nuclear genomes partly depends on the ability of mismatch repair (MMR) to correct a variety of different mismatches generated as the leading and lagging strand templates are copied by three polymerases, each with different fidelity. Here we provide the first comparison, to our knowledge, of the efficiency of MMR of leading and lagging strand replication errors. We first confirm that Pol e is the primary leading strand replicase, complementing earlier assignment of Pols a and d as the primary lagging strand replicases. We then show that MMR efficiency in vivo strongly depends on the polymerase that generates the mismatch and on the composition and location of mismatches. In one extreme case, a flanking triplet repeat sequence eliminates MMR altogether. Overall, MMR is most efficient for mismatches generated at the highest rates and having the most deleterious potential, thereby ultimately achieving high-fidelity replication of both DNA strands.
that Pol e not only participates in leading strand DNA replication, but that it is the major leading strand replicase.

Mutation rates and specificity in pol2M644G msh2D strains
The pol2-M644G msh2D mutants have strongly elevated mutation rates relative to the MSH2 + strains ( Table 1), indicating that the vast majority of the mutations are made by M644G Pol e. In the absence of mismatch repair, most 5-FOA resistant mutants contained single base changes that were widely scattered throughout the URA3 coding sequence ( Figure S1B). As compared to MMR proficient pol2-M644G strains, base pairs 279 and 686 in pol2-M644G msh2D strains did not stand out as hotspots for A-T to T-A transversions in orientation 1, even though base substitution and single base deletion hotspots were observed at several other locations ( Figure S1B).

MMR correction factors
The data in Table 1 and Figure S1 were used to calculate rates for single base mutations in the MMR-proficient and msh2D strains (Table S2). The ratio of these rates reflects the apparent MMR correction efficiency for each type of error, and the results can be compared (see discussion) to those reported earlier [21] for replication errors made by L868M Pol a and L612M Pol d. As noted previously [21,[26][27][28][29], certain correction factors could be higher if some mismatches in the MMR proficient strains are not subject to MMR, either because they are damaged or because they are generated during DNA transactions that occur outside of replication.
Conclusions about the overall balance of repair between strands and polymerases derive from collective consideration of all single base mismatches. In the pol2-M644G strain background, the MMR correction factor for all single base mismatches is 250-fold (Table 2; Figure 2A, blue bar; Table S2), i.e., on average, 249 of 250 single base replication errors generated by M644G Pol e are corrected by MMR. This correction factor is higher than for L612M Pol d (Table 2; Figure 2A, green diamond), but lower than for L868M Pol a (Table 2; Figure 2A, red diamond). As a consequence, the mutation rates for all three variant polymerase strains are similar when MMR is operative (top line in Table 2). Average correction factors are high for each of the four classes of single base changes generated by M644G Pol e (Figure 2A), in the following order: deletions (1,500-fold), insertions (1,100-fold), transitions (440-fold) and transversions (72-fold). Correction factors vary widely between specific positions in the URA3 open reading frame. Figure 2B-2D show eight locations where it is possible to compare MMR of the same mismatch generated by M644G Pol e during leading strand replication (blue bars) or by Pol a (red diamonds) and d (green diamonds) during lagging strand replication (expanded from [21]). In order to maintain equivalent template context, leading strand errors found in one URA3 orientation in the pol2-M644G strains are always compared to lagging strand errors found in the other URA3 orientation in the A. The orientation of the URA3 reporter with respect to coding sequence is indicated as orientation 1 (OR1) or orientation 2 (OR2). DNA template strands are in black, the nascent leading strand is in blue and the nascent lagging strand is in green. B. Detection of alkali-sensitive sites in yeast genomic DNA reveals a strand bias for incorporation of ribonucleotides. Following alkaline hydrolysis and alkaline agaroseelectrophoresis, the DNA was transferred to a nylon membrane and processed for Southern analysis. The indicated region of the URA3 reporter gene was examined using strand-specific radiolabeled probes that anneal to either the nascent leading or nascent lagging strand. The sizes of DNA markers are indicated on the left. All strains harbor the pol2-M644G mutator allele. Increased DNA mobility is indicative of alkali-sensitivity due to the presence of ribonucleotides in the nascent DNA strand. doi:10.1371/journal.pgen.1003016.g001 pol1-L868M and pol3-L612M strains. For example, the correction factors in Figure 2B for deleting an A-T pair from the three longest runs of A-T pairs in the URA3 coding sequence (base pairs 174-178, 201-205 and 255-260, Figure S1) are each inferred to involve a single unpaired T. The comparative MMR correction factors and their implications are considered in the Discussion.

A-T to T-A transversions at base pair 686
In contrast to the efficient repair of most single mismatches, the rate of A-T to T-A transversions at base pair 686 in orientation 1 ( Figure S1 and Table S3) is no higher in the pol2-M644G msh2D strain than in the MMR-proficient pol2-M644G strain (Table S2). This indicates that T-T mismatches generated at base pair 686 during leading strand replication by M644G Pol e are not efficiently corrected by MMR ( Figure 3, ''T-dT, 686,'' dark blue bar). This contrasts with an average of 41-fold correction (dark blue bar on left) of the same mismatch inferred at all other A-T base pairs in URA3, i.e., A to T substitutions in orientation 1 and T to A transversions in orientation 2 ( Figure S1). Adjacent to base pair 686 is a triplet repeat sequence, 59-ATT ATT ATT gTT (designated here as ATT 3 ). For several reasons (see Discussion), we speculated that this sequence might suppress MMR at base pair 686. To test this, we constructed strains in which ATT 3 was modified to 59-ATA ATC ATA gTT (designated ATT 0 , see Figure 3), with the three (underlined) changes interrupting the repeat units without changing the amino acid sequence. We then measured spontaneous mutation rates and generated mutational spectra ( Figure S2) to determine if the flanking sequence changes allowed MMR of T-T mismatches at base pair 686. The results (Table S3)  Single base-base mismatches are repaired by MutSa (Msh2-Msh6) but not by MutSb (Msh2-Msh3) [3][4][5][6][7], implying that the ATT 3 sequence is suppressing repair of the T-T mismatch that would normally occur via MutSa. However, given evidence that MutSb can bind to a non-B-DNA structure that can form in a triplet repeat sequence and promote triplet repeat expansion (reviewed [30]), we examined whether suppression of MMR at base pair 686 might depend on MutSb. This was done by calculating the A-T to T-A mutation rate at base pair 686 in URA3 orientation 1 in a pol2-M644G msh3D rnh201D strain [31]. The calculated A-T to T-A rate is 17610 28 , which is no lower than observed here in the Msh3 + strain (6.8610 28 , Table S3). Thus suppression of MMR by ATT 3 is independent of MutSb.

Discussion
This study provides new insights into relationships between the intrinsic asymmetry of DNA replication and MMR in yeast.

Ribonucleotides are ''biomarkers'' of Pol e action in vivo
We previously inferred that Pol e participates in leading strand replication using base substitutions as biomarkers for leading strand replication. These events are rare, occurring approximately once per 10 million incorporations. The present study uses ribonucleotides as an independent and much more abundant biomarker. The preferential presence of ribonucleotides in the nascent leading strand observed here in pol2-M644G rnh201D strains (URA3 orientation 1 and orientation 2; Figure 1) strongly  strand replication as compared to mismatches of similar composition made by Pols a and d during lagging strand replication. An additional notable point here is that the sizes of the nascent leading strand fragments resulting from alkaline hydrolysis of DNA from the pol2-M644G rnh201D strains (Figure 1) indicate that approximately one ribonucleotide may be incorporated for every 1,000 deoxyribonucleotides. This density of ribonucleotide incorporation into DNA is about four orders of magnitude higher than for A-Tto T-A transversions. Thus ribonucleotides mapped by deep sequencing techniques could serve as high density, genome-wide biomarkers of Pol e action in vivo during replication and possibly during repair and recombination.

Variations in repairing M644G Pol e replication errors
The average MMR correction factors for errors made by M644G Pol e are highest for indels, intermediate for transitions and lowest for transversions (Figure 2A). This rank order is common to E. coli [28,29,32] and to errors made by yeast Pols a and d [21], suggesting that MMR has conserved the ability to most efficiently correct the most potentially deleterious errors (indels), and also the base-base mismatches made at the highest rates by both bacterial and eukaryotic replicases. This general principal is qualified by the observation that MMR efficiency varies, even for the same inferred mismatch (e.g., either an extra T, a C-dT or a G-dT mismatch, Figure 2B, 2C and 2D, respectively) made by the same polymerase (M644G Pol e) during replication of the same (leading) strand. Most sequence-dependent variations in MMR efficiency seen here are in the 2-to 10-fold range (Figure 2) depending on the comparison. That such variations are typically small is perhaps expected, since MMR is needed to preserve the stability of nuclear genomes despite their enormous sequence complexity.
Variations due to mismatch composition and location are consistent with biochemical studies showing differences in MMR in vitro [33] and with mutational studies in vivo in which the identity of the replicase that made the mismatch was unknown. Several explanations for variations in eukaryotic MMR efficiency can be explored in the future. For example, the efficiency with which E. coli repairs transversion mismatches in phage l increases with increasing G-C content in neighboring nucleotides [32], and recognition of certain mismatches by MutSa is influenced by a 6nucleotide region surrounding the mismatch [34]. Thus it may be that flanking sequences, such as those shown in Figure 2, influence eukaryotic MMR efficiency in vivo by modulating (i) mismatch binding by MutSa, which contacts several base pairs on either side of the mismatch [35], (ii) base pair stacking, since a MutSa-bound mismatched base stacks with a conserved phenylalanine in Msh6, and/or (iii) DNA flexibility, since MutSa-bound mismatched DNA is kinked, and a transition between bent and unbent DNA may be critical for limiting MMR to processing of mismatched as compared to matched base pairs [36]. Variations in MMR efficiency might also depend on proteins that operate downstream of mismatch binding, such as MutLa or exonucleases, or they may reflect other variables, such as the timing of nucleosome reloading behind the replication fork, nucleosome dynamics and/or chromatin remodeling.

A natural DNA sequence that suppresses MMR
A striking observation here is the apparent absence of MMR of the A-T to T-A transversion at base pair 686 (Figure 3), which is inferred to result from a T-T mismatch made by M644G Pol e during leading strand replication. This lack of repair contrasts sharply with efficient repair at many other locations. For example, the deletion mismatch at base pairs 255-260, which is predicted to involve a mismatch containing a single unpaired T in the template ( Figure 2B), has an approximately 6000-fold higher correction factor than for the T-T mismatch at base pair 686. Lack of repair at base pair 686 is not due to a general inability to correct A-T to T-A transversion mismatches, because the average correction factor for these events elsewhere in URA3 is 41-fold ( Figure 3). The absence of correction at position 686 led us to test whether MMR was inhibited by the adjacent 59-ATTATTATTgTT sequence. There were several reasons to suspect that this could be the case. The sequence is A-T rich and may have unusual helical parameters that could diminish MMR. For example, sequences containing larger numbers of ATT repeats can form a nonhydrogen bonded structure [37], and can be induced into hairpins by the DNA minor groove binding ligand DAPI (49,6-diamidino-2-phenylindole) [38,39]. Triplet repeat sequences can form non-B-DNA structures that bind MMR proteins (reviewed in [30]), and they are often associated with genome instability (reviewed in [40]), albeit characterized by indels rather than base substitutions. In addition, recent studies have demonstrated that nucleosomes influence the behavior of MMR proteins and visa versa (e.g., see [41][42][43][44]), and nucleosome binding to DNA is influenced by DNA sequence, with A-T-rich dinucleotides such as those present in ATT 3 having an important role in nucleosome positioning (e.g., see [45,46] and references therein).
For these reasons, we examined MMR at base pair 686 after changing the flanking sequence to eliminate the triplet repeats and decrease A-T content by one base pair. The results indicate that these changes allowed correction of 97% of the mismatches generated by M644G Pol e at base pair 686 (88% correction at the lower 95% confidence limit, Figure 3). This suggests that the ATT 3 flanking sequence is a natural cis-acting suppressor of the normal MSH2-dependent MMR machinery. Suppression does not decrease upon deletion of MSH3, and thus is MutSb independent, unlike triplet repeat expansion [30]. Collectively, position 686 and ATT 3 are an example of what has been called an ''At Risk sequence Motif'' [47], i.e., a naturally occurring DNA sequence that results in inefficient operation of a DNA transaction required for genome stability. The fact that one such sequence exists in the 804 base pair open reading frame of URA3 leads one to wonder how many natural suppressors of MMR might be present in nuclear genomes. This issue is currently being investigated using the deep sequencing approach previously used to infer that Pol d is a lagging strand replicase across the yeast genome [25]. Experiments are also planned to examine which (if any) of the possibilities mentioned in the preceding section may be relevant to inefficient MMR at base pair 686.

Correcting leading and lagging strand replication errors
We previously suggested that MMR may be directed to the nascent strand by the 39 ends of growing chains at the replication fork [22], and/or by the 59 ends of Okazaki fragments that are transiently present during lagging strand replication [8]. The 59 ends of Okazaki fragments, and perhaps the PCNA required to process these ends, could potentially provide a higher signal density for MMR of lagging strand replication errors as compared to errors generated during leading strand replication, which is thought to be more continuous. If so, then MMR might be more efficient in correcting lagging strand errors. In an initial test of this hypothesis, we found that mutagenesis due to a mismatch formed at one particular G-C base pair during replication of unrepaired 8oxo-G in ogg1-deficient yeast was lower for lagging as compared to leading strand replication, and importantly, that this bias was largely eliminated in MMR defective strains [8]. Among several possible explanations that we considered for loss of the strand bias, one was that 8-oxo-G-dA mismatches made during lagging strand replication may be more efficiently corrected than are 8-oxo-G-dA mismatches made during leading strand replication. A major goal of the present study was to test this hypothesis for multiple, natural (i.e., undamaged) mismatches generated at different locations during replication of a larger target sequence. The present study accomplishes this, and allows the first direct comparison of MMR efficiency for errors made by all three replicases, to our knowledge, thereby providing insights into the contribution of MMR to leading and lagging strand replication fidelity.
From the results in Figure 2, we conclude that in general, mismatches made by all three replicases are repaired very efficiently. This is logical given the need to preserve genetic information in both DNA strands. This conclusion is independent of various models regarding which DNA polymerase replicates which strand (reviewed in [11,12]). Other implications derive from the model wherein Pols a and d are the primary lagging strand replicases and Pol e is the primary leading strand replicase. In our earlier report [21], we pointed out that correction factors were higher for mismatches made by Pol a than for the same mismatches made by Pol d, suggesting that the 59 ends of Okazaki fragments may be strand discrimination signals and that MMR efficiency may be related to the proximity of a mismatch to that signal. This is interesting given that DNA polymerase e is highly processive, at least as processive as DNA polymerase d, and that leading strand replication is thought to be largely continuous [48,49,50]. It is of course conceivable that leading strand replication may not be as continuous as current models imply. If leading strand replication is indeed largely continuous, then the fact that MMR corrects most Pol e errors about as efficiently as it corrects errors made by Pols a and d (Figure 2) implies the existence of MMR signals other than the 59 ends of Okazaki fragments, and these can very efficiently direct MMR to the nascent leading strand. Possible signals for leading strand replication include the above-mentioned 39 ends of growing chains at the replication fork [22,51,52], nicks introduced into the nascent leading strand by nucleases, and/or asymmetrically bound PCNA [8,53]. PCNA is a particularly attractive possibility for differentially modulating the efficiency of MMR of errors made by the three replicases, because it is involved in early steps in MMR (see [3][4][5][6][7] for review]), it does not influence DNA synthesis by Pol a, and it does stimulate DNA synthesis by both Pol d and Pol e, albeit through different PCNA-polymerase interactions (see [9] and references therein).
The results in Figure 2 further suggest that, even for the same mismatch (extra T, G-dT or C-dT) in a common sequence context, MMR efficiency varies depending on which polymerase made the error. In two of three instances involving deletion of a single template T ( Figure 2B), the repair of mismatches made by Pol d is higher than for mismatches made by Pol e. This correlates with the observation that Pol d generates this mismatch in vitro at a higher rate than does Pol e [54]. Similarly, transitions and transversions ( Figure 2A) and several site-specific base substitutions ( Figure 2C and 2D) generated by Pol a are corrected more efficiently than are mismatches generated by Pol d and Pol e. Pol a lacks an intrinsic proofreading exonuclease activity and is less accurate than proofreading-proficient Pols d and e (reviewed in [10,55]). Thus the present study of mismatches generated by Pol e extends the idea that MMR has evolved to most efficiently correct the most deleterious mismatches (i.e., indel mismatches). Within classes of similar deleterious potential (base-base mismatches), evolution has produced the highest efficiency versus the most frequently generated mismatches. In a model wherein Pol e is the major leading strand replicase and Pols a and d conduct about 10% and 90% of lagging strand replication [2], respectively, the results (Table 2; Figure 2A, average repair for single base errors) further suggest that MMR balances the fidelity of replication of the two strands despite the use of replicases with substantially different fidelity and error specificity.

Strains, mutation rates, and analysis of ura3 mutants
The strains used in this study, the measurements of spontaneous mutation rates and the sequencing of URA3 mutants were as previously described [2,13,21], save that MSH2 was deleted from haploid pol2-M644G strains rather than diploid. The ATT 3 to ATT 0 conversion was made via site-directed mutagenesis and integration pop-out [56] in a strain with wild type polymerases. PCR product containing the ATT 0 URA3 allele was then transformed into msh2D backgrounds and proper insertion verified via sequencing.
Probing for alkali-sensitive sites in genomic DNA Genomic DNA was isolated from exponentially growing cultures (grown in YPDA at 30uC) using the Epicentre Yeast DNA purification kit. Five mg of DNA was treated with 0.3 M KOH for 2 h at 55uC and subjected to alkaline-agarose electrophoresis as described [23]. Following neutralization, DNA was transferred to a charged nylon membrane (Hybond N+) by capillary action and probed by Southern analysis. Strand-specific radiolabeled probes were prepared from a PCR-amplified fragment of URA3 template, using a previously described procedure and probe design [14].

Statistical analysis
See Text S1. Figure S1 Mutational spectra in pol2-M644G and pol2-M644G msh2D strains. The URA3 reporter was present in either orientation 1 (OR1) or orientation 2 (OR2) at position AGP1. The coding strand of the URA3 open reading frame is shown, with every 10 th base indicated by a dot and mutations depicted above (OR1) and below (OR2) the wild type URA3 sequence. Single letters represent base substitutions, open triangles represent single base deletions, and closed triangles represent single base additions. Indels in homonucleotide runs are shown at the 59-most position of the run. (A) Spectra in MSH2 strains [13]. (B) Spectra in msh2D strains. (TIF) Figure S2 Mutational spectra in pol2-M644G and pol2-M644G msh2D strains with ATT 0 URA3. As for Figure S1, with spectra for the pol2-M644G msh2D and pol2M644G MSH2 strains shown above and below the ATT 0 URA3 sequence, respectively. The three bases that differ between the ATT 3 and ATT 0 URA3 sequences are shown in bold (positions 690, 693, and 696). (TIF)

Table S3
Mutation Rates for pol2-M644G 6 msh2D and ATT 3 / ATT 0 URA3 Orientation 1 strains. a Expanded from [21]. b Expanded from [13]. (DOCX) Text S1 The supporting information includes methods for calculating and statistical analyses of mutation rates and correction factors. (DOC)