Abstract
We have used new generation sequencing (NGS) technologies to identify single nucleotide polymorphism (SNP) markers from three European pear (Pyrus communis L.) cultivars and subsequently developed a subset of 1096 pear SNPs into high throughput markers by combining them with the set of 7692 apple SNPs on the IRSC apple Infinium® II 8K array. We then evaluated this apple and pear Infinium® II 9K SNP array for large-scale genotyping in pear across several species, using both pear and apple SNPs. The segregating populations employed for array validation included a segregating population of European pear (‘Old Home’בLouise Bon Jersey’) and four interspecific breeding families derived from Asian (P. pyrifolia Nakai and P. bretschneideri Rehd.) and European pear pedigrees. In total, we mapped 857 polymorphic pear markers to construct the first SNP-based genetic maps for pear, comprising 78% of the total pear SNPs included in the array. In addition, 1031 SNP markers derived from apple (13% of the total apple SNPs included in the array) were polymorphic and were mapped in one or more of the pear populations. These results are the first to demonstrate SNP transferability across the genera Malus and Pyrus. Our construction of high density SNP-based and gene-based genetic maps in pear represents an important step towards the identification of chromosomal regions associated with a range of horticultural characters, such as pest and disease resistance, orchard yield and fruit quality.
Citation: Montanari S, Saeed M, Knäbel M, Kim Y, Troggio M, Malnoy M, et al. (2013) Identification of Pyrus Single Nucleotide Polymorphisms (SNPs) and Evaluation for Genetic Mapping in European Pear and Interspecific Pyrus Hybrids. PLoS ONE 8(10): e77022. https://doi.org/10.1371/journal.pone.0077022
Editor: Boris Alexander Vinatzer, Virginia Tech, United States of America
Received: June 19, 2013; Accepted: August 26, 2013; Published: October 14, 2013
Copyright: © 2013 Montanari et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: SM is funded by the FEM PhD school. MS is funded by a Massey University Doctoral Scholarship. MK is funded by the New Zealand Ministry of Science and Innovation grant “Pipfruit: a juicy future” (Contract number: 27744). The visiting scientist fellowship of YKK to PFR was funded by a National Institute of Horticultural and Herbal Science grant. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
One of the biggest challenges for plant biologists has long been to associate genetic variations with phenotypic traits. The recent technological revolution initiated by new generation sequencing (NGS) has enabled the sequencing of the entire genome of complex organisms, including the higher plants grape [1], [2], maize [3], peach [4], apple [5], potato [6], tomato [7] and most recently, Chinese pear [8]. NGS also enables the inventory of entire sets of DNA variations in genomes, through the re-sequencing of multiple accessions of the same species and alignment of these sequences to the reference genome, for the purpose of in silico detection of DNA polymorphisms [9]–[16].
Single nucleotide polymorphisms (SNPs) are single base variations in DNA sequences that are abundant in plant genomes and are useful for identifying differences within individuals or populations as well as identifying genetic loci associated with phenotypic variation. Within coding regions, SNPs may be defined as non-synonymous or synonymous (resulting in an amino acid change or not) and are also found in gene-regulating regions (e.g. in promoters, untranslated mRNA regions and introns). Once polymorphisms have been detected by NGS, the next challenge is to screen large genetic populations with multiple markers simultaneously. While re-sequencing can be used for both SNP discovery and genotyping of the entire set of polymorphisms of a species [17], high throughput SNP arrays, such as the Infinium® II assay (Illumina Inc.), are effective technologies for genotyping of large populations.
High throughput SNP arrays have been recently developed for a range of fruit tree species. In Rosaceae, an apple SNP array was developed by the International RosBREED SNP consortium (IRSC) (www.rosbreed.org) [9]. This 8K SNP array v1 contains 7867 SNPs, of which 5554 proved to be genome-wide polymorphic SNPs in apple. The International Peach SNP Consortium (IPSC) developed a 9K SNP array for peach that includes 8144 SNPs, 84.3% of which exhibit polymorphism when screened over 709 accessions of peach (comprising peach cultivars, wild related Prunus species and interspecific hybrids) [10]. IRSC also led the development of a 6K SNP array for cherry, with 1825 verified polymorphic SNPs in sweet cherry and 2058 in sour cherry [18]. In Citrus, 54 accessions and 52 interspecific hybrids between pummelo and Clementine were genotyped using a 1457 GoldenGate® SNPs assay developed from clementine BAC-end sequencing. Out of 622 SNPs showing consistent results, 80.5% were demonstrated to be transferable to the whole Citrus gene pool [19].
The genus Pyrus includes both European (Pyrus communis) and Asian pears (P. pyrifolia or Japanese pear, and P. bretschneideri, commonly known as Chinese pear). To date, only a few genetic maps have been developed for Pyrus and none of these contains SNP markers. The first map was constructed using random amplified polymorphic DNA (RAPD) markers in a P. pyrifolia cross between ‘Kinchaku’ and ‘Kosui’ [20]. Yamamoto et al. [21], [22] developed the second generation of pear maps based on amplified fragment length polymorphisms (AFLPs) and transferable apple and pear simple sequence repeats (SSRs), using an interspecific cross between ‘Bartlett’ (P. communis) and ‘Hosui’ (P. pyrifolia). As the ‘Bartlett’בHosui’ map contained SSRs derived from both pear and apple, this study enabled the assessment of genome synteny between pear and apple and suggested that these species have co-linear genomes. Apple and pear markers had also been used earlier to generate maps for the two European pear cultivars ‘Passe Crassane’ and ‘Harrow Sweet’ [23]. SSR markers developed from both apple and pear were also used by Celton et al. [24] to build an integrated map of the P. communis cultivars ‘Bartlett’ and ‘La France’, along with two apple rootstocks. Lu et al. [25] screened the interspecific pear population ‘Mishirazi’ (P. pyrifolia×P. communis)בJinhua’ (P. bretschneideri) with apple SSRs and were able to construct a genetic map. However, the number of markers used in all these studies was limited to a few hundred. Recently, NGS was used to develop a genetic map of ‘Bayuehong’ (P. bretschneideri×P. communis)בDangshansuli’ (P. bretschneideri) to anchor the Chinese pear genome; however, these SNPs were not evaluated for the screening of large segregating populations [8].
In this study, we used NGS to detect SNPs in the pear genome, to enable the design of a medium throughput SNP assay. These new pear SNPs were evaluated for genetic map construction using five segregating populations of European and Asian pear origin. Our incorporation of the new pear SNPs into the IRSC apple Infinium® II 8K array [9] enabled the study of SNP transferability not only within the genus Pyrus, but also between the genera Malus and Pyrus.
Materials and Methods
NGS Sequencing of Pear Cultivars
A SNP detection panel consisting of three European pear (P. communis) cultivars was chosen for low coverage whole-genome sequencing. The individuals were ‘Bartlett’ (a.k.a. ‘Williams Bon Chrétien’), ‘Old Home’ (OH) and ‘Louise Bon Jersey’ (LBJ). These accessions were chosen as ‘Bartlett’ is a founder of most breeding programmes worldwide, and OH and LBJ are the parents of a segregating population developed at Plant & Food Research (PFR). Each accession was sequenced using one lane of Illumina GA II with 75 cycles per read and small insert paired-end sequencing, as described in [9].
Two pear unnormalized cDNA libraries were prepared by vertis Biotechnologie AG for the European pear cultivar ‘Max Red Bartlett’ following VERTIS customized protocol (http://www.vertis-biotech.com/). One run of 454 sequencing on a Roche/454 GS FLX Sequencer was performed.
Bioinformatics Detection and Selection of SNPs for Array
A de novo assembly was performed for the ‘Bartlett’ sequencing data using ABySS 1.2.1 (k = 43). Contigs of 600 bp or larger were used as a reference genome set. The sequencing data from OH and LBJ were mapped to the reference genome set of ‘Bartlett’ using Soap2.20 (-p 8 -M 4 -v 5 -c 52 -s 12 -n 5 -r 2 -m 50 -x 600). Soap output files were split into a single file per contig and each contig file was sorted by location of the mapped reads. SoapSNP was used for SNP detection and filtering with the same parameters as described in [9]. The detected SNPs were then subjected to filtering, where calls were discarded when the quality score was less than 20; fewer than two reads per genotype were present; overall coverage depth was greater than the average coverage plus three standard deviations; the site was within 25 bases of another SNP call; or the SNPs were not located within regions associated with a set of candidate genes. The candidate gene set used for filtering consisted of 2559 transcription factor sequences from Malus×domestica [5]. Locations within pear were defined by mapping these sequences to the reference genome set of ‘Bartlett’ using gmap with command line options -K 3000 -L 50000.
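The filtering criteria above can be illustrated with a minimal Python sketch. It is not the pipeline used in this study: the `SnpCall` record and its field names are hypothetical, the depth threshold is computed over the calls passed in, and the spacing criterion is read as a minimum distance of 25 bases between neighbouring calls on the same contig, which is an interpretation.

```python
from dataclasses import dataclass
from statistics import mean, pstdev


@dataclass
class SnpCall:
    # Hypothetical record; field names are illustrative, not SoapSNP output columns.
    contig: str
    pos: int
    quality: int
    reads_per_genotype: int   # lowest read support among the called genotypes
    depth: int                # overall coverage depth at the site
    in_candidate_region: bool


def filter_calls(calls, min_quality=20, min_reads=2, min_spacing=25):
    """Return the calls that survive all five criteria."""
    depths = [c.depth for c in calls]
    max_depth = mean(depths) + 3 * pstdev(depths)

    by_contig = {}
    for c in calls:
        by_contig.setdefault(c.contig, []).append(c)

    kept = []
    for c in calls:
        # Assumption: a call is "crowded" if any other call lies closer than min_spacing.
        crowded = any(
            other is not c and abs(other.pos - c.pos) < min_spacing
            for other in by_contig[c.contig]
        )
        if (
            c.quality >= min_quality
            and c.reads_per_genotype >= min_reads
            and c.depth <= max_depth
            and not crowded
            and c.in_candidate_region
        ):
            kept.append(c)
    return kept
```

In this sketch two calls closer than 25 bases are both discarded; a real pipeline might instead keep the better-supported one.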
454 cDNA reads were assembled using CAP3 [26]. Contigs were aligned to the reference M.×domestica genome and only unique alignments were considered to avoid paralogy issues. SNPs were predicted using a customized bioinformatics pipeline and selected to be well spread over the 17 apple chromosomes.
The Illumina Infinium® assay design tool (ADT) was used on the detected SNPs with a threshold of 0.7. These pear SNPs were synthesized as probes and placed on the same array as the IRSC apple Infinium® II 8K array [9].
Plant Material for SNP Array Evaluation
Five pear segregating populations were screened using the apple and pear Infinium® II 9K SNP array. These were one P. communis intraspecific family and four interspecific (P. bretschneideri, P. communis and P. pyrifolia) pear populations: OH×LBJ, of 297 F1 individuals and both parents; P128R068T003בMoonglow’ (T003×M), of 220 F1 individuals and both parents; P019R045T042×P037R048T081 (T042×T081), of 142 F1 individuals and both parents; P202R137T052×P128R068T003 (T052×T003), of 91 F1 individuals and T003 parent only; and P202R137T052×P266R225T064 (T052×T064), of 123 F1 individuals and T064 parent only, since parent T052 has been lost. No permission was required to collect plant material and pear is not an endangered or protected species. Figure 1 shows the relationships among the interspecific populations. The interspecific hybrid populations were developed as part of the PFR pear breeding programme [27]. Half the P128R068T003בMoonglow’ population was grown at INRA, Angers (France) and genotyped at the Fondazione Edmund Mach (FEM, Italy), and the other half was grown at PFR, Motueka and genotyped at AgResearch Limited, Invermay in New Zealand, together with the other four populations. DNA extraction of OH×LBJ, T042×T081 and T052×T003 populations was performed using a CTAB extraction method [28], followed by purification with NucleoSpin® columns (Macherey-Nagel GmbH & Co. KG). DNA from the T003×M and T052×T064 populations was extracted using the QIAGEN DNeasy Plant Kit (QIAGEN GmbH, Hilden, Germany). DNA quantifications were carried out using a NanoDrop™ 2000c spectrophotometer (Thermo Fisher Scientific Inc.).
Figure 1. Relationships among the interspecific populations. A) P128R068T003בMoonglow’; B) P037R048T081×P019R045T042; and C) P202R137T052×P128R068T003 and P202R137T052×P266R225T064.
SNP Genotyping and Data Analysis
Genomic DNA was amplified and hybridized to the apple and pear Infinium® II 9K SNP array following the Infinium® HD Assay Ultra protocol (Illumina Inc., San Diego, USA) and scanned with the Illumina HiScan. Data were analyzed using Illumina’s GenomeStudio v 1.0 software Genotyping Module, setting a GenCall Threshold of 0.15. The software automatically determines the cluster positions of the AA/AB/BB genotypes for each SNP and displays them in normalized graphs (Figure 2). A systematic method was used to evaluate the SNP array data employing quality metrics from GenomeStudio (Illumina): GenTrain score ≥0.50, minor allelic frequency (MAF) ≥0.15 and call rate >80%. A Chi-square test at a significance of 0.01 was performed to determine distortion of markers from the expected segregation. SNPs that were highly distorted or which had the genotype of one or both parents missing were manually edited in GenomeStudio. The SNPs for which 25% or 50% of the individuals were not called in clusters were manually edited, since this kind of segregation may have been due to SNPs with null alleles.
Figure 2. Example of SNP genotype clusters as displayed in GenomeStudio. Parents ‘Old Home’ and ‘Louise Bon Jersey’ are indicated in yellow; the red cluster is identified as AA, the blue as BB and the purple as AB genotype. The total number of individuals analyzed here is 297 and the segregation ratio is 1:2:1.
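The quality thresholds and the segregation test described in this section can be sketched as follows. This is an illustrative example only, not the GenomeStudio workflow: the function names are hypothetical, the genotype counts in the usage note are invented, and the critical values are the standard chi-square values at a significance of 0.01.

```python
# Chi-square critical values at alpha = 0.01, indexed by degrees of freedom.
CHI2_CRIT_001 = {1: 6.635, 2: 9.210, 3: 11.345}


def passes_qc(gentrain, maf, call_rate):
    """Apply the three thresholds: GenTrain >= 0.50, MAF >= 0.15, call rate > 80%."""
    return gentrain >= 0.50 and maf >= 0.15 and call_rate > 0.80


def chi_square(observed, ratio):
    """Goodness-of-fit statistic of observed genotype counts against an expected ratio."""
    total = sum(observed)
    expected = [total * r / sum(ratio) for r in ratio]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def is_distorted(observed, ratio):
    """True if the marker deviates from the expected segregation at alpha = 0.01."""
    df = len(observed) - 1
    return chi_square(observed, ratio) > CHI2_CRIT_001[df]
```

For a hypothetical AB×AB marker scored on 297 progeny, counts of (74, 149, 74) fit the 1:2:1 expectation, whereas counts of (120, 120, 57) would be flagged as distorted.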
Simple Sequence Repeat Genotyping
The T003×M population was genotyped with apple and pear microsatellite markers as well as SNPs. Fifty-four SSRs were selected based on the ‘Bartlett’ consensus map developed by Celton et al. [24] and one SSR, Md-Exp 7, from the work of Costa et al. [29]. They were first screened for polymorphism over DNA extracted from both parents and five individuals of the progeny, and then screened over the subset of the T003×M population raised at INRA (Table S1). PCR amplifications were performed in a final volume of 12.5 µL containing 10 ng of genomic DNA, 1x buffer, 2 mM MgCl2, 0.2 mM of each dNTP, 0.4 µM of each forward and reverse primer and 0.75 U of AmpliTaq Gold® DNA polymerase (Applied Biosystems® by Life Technologies™). All SSR amplifications were performed in a Biometra T gradient Thermocycler (Biometra GmbH, Göttingen, Germany) or in a Bio-Rad C-1000 thermocycler (Bio-Rad Laboratories, Hercules, CA) at FEM (Italy) and INRA, Angers (France) under the following conditions: an initial denaturation at 95°C for 5 min, followed by 36 cycles of 95°C for 30 sec, the optimal annealing temperature (TA) of each primer pair for 30 sec and 72°C for 1 min, finishing with a final extension at 72°C for 7 min. Fragment analysis was performed with an ABI PRISM 3730 capillary sequencer (Applied Biosystems® by Life Technologies™) in a final mix of 0.5 µL of PCR product, 9.97 µL formamide and 0.03 µL of 500-LIZ dye, denatured for 3 min at 95°C. Fragment sizing was performed with GeneMapper software v. 4.0 (Applied Biosystems® by Life Technologies™).
Linkage Mapping Analysis
The genetic maps of both parents of all five populations were constructed using JoinMap v3.0 and v4.0 software [30], based on the SNP data for each individual population, except for the T003×M population, where both the SNP and SSR data were used. Linkage groups were determined with a LOD score of 5 and higher for grouping and the Kosambi function was used for map calculation. The maps were drawn and aligned using MapChart v2.2 [31].
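For reference, the Kosambi mapping function converts a recombination fraction r into a map distance and can be written in a few lines. The sketch below is a generic illustration of the function itself, not JoinMap code; the function names are chosen here for the example.

```python
import math


def kosambi_cm(r):
    """Map distance in centimorgans for a recombination fraction r (0 <= r < 0.5)."""
    if not 0 <= r < 0.5:
        raise ValueError("recombination fraction must be in [0, 0.5)")
    return 25.0 * math.log((1 + 2 * r) / (1 - 2 * r))


def kosambi_r(cm):
    """Inverse function: recombination fraction for a map distance in centimorgans."""
    return 0.5 * math.tanh(2 * cm / 100.0)
```

For small r the distance is close to 100·r cM; for example, r = 0.10 corresponds to about 10.1 cM, and the gap widens as r approaches 0.5 because the function allows for double crossovers with interference.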
Pear SNP Alignment to the Apple Genome Sequence
The pear SNPs included in the array were aligned to the apple genome assembly [5] using BLASTN analysis of the SNP flanking sequence against the ‘Golden Delicious’ (GD) genome assembly. Hits were retained when the alignment length exceeded 100 nucleotides and the e-value was below 1e-30.
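A minimal sketch of such a filter is given below. It assumes that the BLASTN results are available in the standard 12-column tabular format (`-outfmt 6` in BLAST+), which the text does not state, and it keeps the highest-scoring passing hit per SNP, which is likewise an assumption made for this example.

```python
def filter_hits(lines, min_length=100, max_evalue=1e-30):
    """Return {query id: (subject id, subject start, bit score)} for the best passing hit.

    Expects tab-separated lines with the default tabular columns:
    qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore
    """
    best = {}
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        query, subject = fields[0], fields[1]
        length = int(fields[3])
        sstart = int(fields[8])
        evalue = float(fields[10])
        bitscore = float(fields[11])
        # Cutoffs from the text: alignment length > 100 nt and e-value < 1e-30.
        if length > min_length and evalue < max_evalue:
            if query not in best or bitscore > best[query][2]:
                best[query] = (subject, sstart, bitscore)
    return best
```

The function accepts any iterable of lines, so an open file handle on the BLAST output can be passed directly.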
Results
SNP Detection and Selection for 1 K Pear Array
In total, 34,082,435, 35,687,533 and 25,167,853 paired-end reads were generated for ‘Bartlett’, OH and LBJ, respectively. The de novo assembly genome set of ‘Bartlett’ consisted of 78,748 contigs of 600 bp or greater in length containing a total of 79,067,993 bases, with a maximum contig length of 15,094 bases, N50 of 1004 bases, N90 of 658 bases, and an average contig length of 1004 bases. A total of 73,214 SNPs were predicted by SoapSNP when reads of OH and LBJ were aligned to the genome of ‘Bartlett’ using the Soap aligner, corresponding to one SNP per 1079 bases. In total, 1,456 SNPs passed the filtering criteria and were then subjected to the Illumina ADT. This yielded 1107 SNPs, of which 1064 were included in the final SNP array.
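The assembly statistics quoted above follow standard definitions, which the short sketch below makes explicit. The function name and the toy contig lengths in the usage note are illustrative; the code was not used to produce the figures in this section.

```python
def nxx(lengths, fraction=0.5):
    """Length L such that contigs of length >= L hold at least `fraction` of all bases.

    fraction=0.5 gives N50 and fraction=0.9 gives N90.
    """
    if not lengths:
        raise ValueError("no contig lengths given")
    ordered = sorted(lengths, reverse=True)
    target = fraction * sum(ordered)
    running = 0
    for length in ordered:
        running += length
        if running >= target:
            return length


def bases_per_snp(total_bases, snp_count):
    """Average spacing between SNPs, rounded down to whole bases."""
    return total_bases // snp_count
```

For a toy set of contig lengths [2, 2, 2, 3, 3, 4, 8, 8], N50 is 8 and N90 is 2. Applied to the totals reported here, `bases_per_snp(79_067_993, 73_214)` gives 1079, matching the stated density of one SNP per 1079 bases.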
A total of 144,816 high quality 454 sequence reads were generated. Total sequence output was 32,418,987 bases, with an average read length of 224 bases. Quality filtered sequences were de novo assembled using CAP3. The average depth of assembly for all samples was ∼2.5. A total of 1751 cDNA SNPs were predicted using a customized bioinformatics pipeline; 69 of these, which were experimentally validated by M. Troggio (unpublished data) and passed the Illumina ADT design, were selected for inclusion in the SNP array.
In total, 1133 pear SNPs were incorporated in the final array, making a grand total of 9000 attempted apple and pear SNPs (Table S2).
SNP Chip Evaluation
Of the 1133 attempted pear SNPs, 1096 (96.7%) were successful bead types on the IRSC Infinium® II (Illumina Inc.) array. When the 1096 pear and 7692 apple bead types were evaluated using five segregating populations, twelve and three individuals from the T003×M and T052×T003 populations, respectively, did not hybridize well to the BeadChip and were excluded from the clustering, which resulted in 873 F1 individuals that were used for evaluating the SNP array. All 1096 pear SNPs hybridized well to pear DNA and were either polymorphic or monomorphic in at least one population. Of the apple SNPs, 7562 of the 7692 bead types (98.3%) were either polymorphic or monomorphic in at least one population, while only 130 showed low quality hybridization.
In total, 1528 unique pear and apple-derived SNPs (872 pear SNPs and 656 apple SNPs) were polymorphic in at least one segregating population, with 713, 508, 437, 442 and 711 polymorphic SNPs for the OH×LBJ, T003×M, T042×T081, T052×T003 and T052×T064 populations, respectively (Table 1). For the newly developed pear SNPs, the polymorphism rate was variable and depended on the informative parent. P. communis parents had a higher polymorphism rate (from 25.9% to 35.1%, for ‘Moonglow’, OH and LBJ) than Asian×European hybrid parents (from 2.9% to 21.4%, for T003 and T064, respectively). The number of polymorphic apple SNPs per pear population ranged from 115 to 381 out of 7692 bead types (1.5 to 5.0% polymorphic SNPs per population). When the transfer rate of the new pear SNPs was evaluated in the apple ‘Royal Gala’בGranny Smith’ segregating population, it was similar to the apple SNP to pear transfer rate, with 13 (1.2%) polymorphic pear SNPs.
Identification and Genotyping of SNPs with Null Alleles
The analysis of SNP polymorphism in segregating populations highlighted the presence of SNP markers with potential null alleles. By default, the standard SNP calling algorithms of GenomeStudio clustered heterozygous A0 and B0 genotypes together with homozygous AA and BB genotypes, and called homozygous null genotypes (00) as missing genotypic calls. However, some SNPs containing null alleles do not follow the expected Mendelian segregation based on the parental genotypes. Therefore, manual editing of clusters for all the SNPs with strong deviation from Mendelian ratio or around 25% or 50% of no calls was performed and the SNPs which displayed a clear clustering and for which genotypes could be unequivocally determined as containing potential null alleles, were selected for further linkage analysis (Figures 3A, B and C). The following null allele segregation types were observed in the segregating populations: 00×A0, A0×AA, A0×A0, A0×B0, AB×A0, A0×BB and AB×00. The number of polymorphic null allele SNPs varied throughout the five populations: 115 in OH×LBJ, 108 in T003×M, 112 in T042×T081, 702 in T052×T003, and 436 in T052×T064 (Table 2). The percentage of polymorphic null allele markers from attempted bead types seemed to be similar for pear and apple SNPs: 2% and 1.2% in OH×LBJ, 2.9% and 1% in T003×M, 2.4% and 1.1% in T042×T081, 9.9% and 8.1% in T052×T003, and 4.9% and 5% in T052×T064. Of the total of 1132 unique pear and apple SNPs exhibiting null alleles, 255 were polymorphic markers without a null allele in at least one other segregating population. When the polymorphic null allele markers were mapped, the null allele markers were used to increase the density of the maps for the interspecific crosses, but were not required for the already dense OH×LBJ map (Table 3).
Figure 3. Examples of SNPs with null alleles. A) A 00×AB SNP (ss527789894), as represented in GenomeStudio. Parents P128R068T003 and ‘Moonglow’ are indicated in yellow; the red and blue clusters are identified as A0 and B0 genotypes, respectively. The total number of individuals analyzed is 143 and the segregation ratio is 1:1. B) A 00×A0 SNP (ss475879014), as represented in GenomeStudio. Parents P128R068T003 and ‘Moonglow’ are indicated in yellow; the red cluster is identified as heterozygous genotypes (A0), while genotypes with a missing call (in black) are identified as homozygous for the null allele (00). The total number of individuals analyzed is 143 and the segregation ratio is 1:1. C) An A0×B0 SNP (ss475882353), as represented in GenomeStudio. Parents P128R068T003 and ‘Moonglow’ are indicated in yellow; the red, blue and purple clusters are identified as A0, B0 and AB genotypes, respectively, while genotypes with a missing call (in black) are identified as homozygous for the null allele (00). The total number of individuals analyzed is 143 and the segregation ratio is 1:1:1:1.
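The expected progeny classes for crosses involving a null allele can be derived by enumerating the parental gametes, which also shows why about 25% or 50% of no calls points to a null allele. The sketch below is illustrative only: the function names are hypothetical, and the mapping of A0 to the AA cluster and 00 to a no call ("NC") follows the default clustering behaviour described in this section.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def offspring(parent1, parent2):
    """Expected F1 genotype frequencies; parents are two-character strings over A, B, 0."""
    counts = Counter(
        "".join(sorted(a + b, key="AB0".index)) for a, b in product(parent1, parent2)
    )
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


def called_as(genotype):
    """Cluster assigned by default calling: A0 -> AA, B0 -> BB, 00 -> NC (no call)."""
    alleles = genotype.replace("0", "")
    if not alleles:
        return "NC"
    if len(alleles) == 1:
        return alleles * 2
    return genotype


def expected_clusters(parent1, parent2):
    """Expected frequencies of the clusters that default calling would report."""
    clusters = {}
    for genotype, freq in offspring(parent1, parent2).items():
        key = called_as(genotype)
        clusters[key] = clusters.get(key, Fraction(0)) + freq
    return clusters
```

Under these assumptions, an A0×B0 cross yields AB, A0, B0 and 00 in a 1:1:1:1 ratio, a 00×A0 cross yields 50% no calls, and an A0×A0 cross yields 25% no calls with the remaining progeny falling into a single AA cluster.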
The total number of unique polymorphic markers, including both apple and pear-derived SNPs and SNPs with null alleles, was 2400 for all five populations. For the pear SNPs, 918 (83.8%) were polymorphic in at least one segregating population, and 623 (56.8%) were polymorphic in OH×LBJ, 384 (35%) in T052×T064, 337 (30.7%) in T042×T081, 337 (30.7%) in T003×M, and 295 (26.9%) in T052×T003.
Genetic Map Construction
Parental genetic maps were constructed for five segregating populations using the 2400 unique polymorphic SNPs. All maps contained 17 linkage groups except those of T003, T042 and T081 (Table S3). For the OH×LBJ population, the parental maps spanned 825 and 974 cM and consisted of 356 and 393 SNP markers for OH and LBJ, respectively. For the T003×M population, the parental maps spanned 980 and 1016 cM and consisted of 182 and 434 SNP markers for T003 and M, respectively. For the T042×T081 population, the parental maps spanned 923 and 1133 cM and consisted of 250 and 312 SNP markers for T042 and T081, respectively. For the T052×T003 population, the parental maps spanned 1018 and 1101 cM and consisted of 370 and 255 SNP markers for T052 and T003, respectively. For T052×T064 the parental maps spanned 1485 and 1580 cM and consisted of 628 and 682 SNP markers for T052 and T064, respectively. In total, 1888 unique SNPs were mapped, including null allele markers.
The markers in common among the five segregating populations enabled the alignment of parental genetic maps as shown in Figure 4 for four maps of LG9. However, the bridges among the 10 parental maps were insufficient for the construction of a unique integrated map. The common polymorphic markers (with and without null alleles) between pairs of parents of the segregating populations are shown in Table 3. For example, there are 105 common polymorphic markers (without null alleles) between the European pears ‘Moonglow’ and ‘Old Home’. In comparison, only 52 markers (without null alleles) are in common between ‘Moonglow’ and the interspecific parent T081. The parent T003 from the T003×M cross has 20 null allele markers in common with the same parent from the T052×T003 cross and only 5 with T081.
Figure 4. Alignment of four parental genetic maps of LG9. The lines between the maps each show markers in common with two other parents.
SSR Mapping
Of the 54 SSR markers derived from the published ‘Bartlett’ consensus map [24] that were screened over the T003×M population, 38 were mapped, 25 loci to T003 and 30 to ‘Moonglow’ (Table S1). This information on linkage group assignment, taken together with data on SNP markers in common, was sufficient to enable the application of the ‘Bartlett’ LG nomenclature across all the pear genetic maps in this study.
Pear SNP Alignment to the Apple Genome Sequence
A total of 1009 pear SNPs (92%) were successfully anchored to the GD genome using bioinformatics analysis. Using the OH×LBJ consensus map as an example, 433 (42.9%) of the pear SNPs were anchored to apple and enabled the comparison of this genetic map with the GD genome assembly. On average, 20 markers per LG were in common between the OH×LBJ map and the GD genome (Figure 5), with LG2 having the most markers in common (32 markers) and LG17 the least (9 markers).
Discussion
SNPs are considered to be the most efficient tools for comprehensive genetic studies [32]. In Pyrus, very few SNPs were previously available. We developed more than 1,000 SNPs from the re-sequencing of P. communis cultivars and for the first time we included them in an array, making them easily available for further studies. These SNPs were selected based on their location within candidate genes, to ensure their usefulness for marker-trait association and for future breeding programmes.
We used the apple and pear Infinium® II 9K SNP array for the genotyping of five segregating pear populations, for a grand total of 873 individuals. The clustering of the SNPs using the GenomeStudio software depends on the minor allele frequency of the SNPs: the lower the minor allele frequency, the more samples are required to achieve accurate representation of all clusters. Illumina recommends a population of 100 or more. In our case, all the populations had well over 100 individuals (except for T052×T003, with 91 progeny), and this large dataset of 873 individuals ensured an accurate clustering of array SNPs. Moreover, the threshold of 15% for the MAF is relatively high, in comparison with other studies using the same technique [33].
High Polymorphism Rate for the Newly Developed Pear SNPs
A large proportion (83.8%) of the 1096 pear SNPs used to construct the first pear genotyping array were polymorphic in at least one segregating population, and 857 of these unique polymorphic pear markers (93.4%) were demonstrated to be useful for construction of genetic maps, using five populations of a range of genetic backgrounds across P. communis, P. pyrifolia and P. bretschneideri. These maps are the first dense SNP-based genetic maps for pear of any species. The previously developed maps in Pyrus, including those of Yamamoto et al. and Celton et al. [21], [22], [24], as well as an earlier map using pear SNPs constructed in ‘Bartlett’ and ‘Hosui’ [34], are not sufficiently dense to be useful for QTL analysis. Although Wu et al. [8] reported the development of 2005 SNPs in the course of anchoring the P. bretschneideri genome sequence, these SNPs are not available as a genotyping array, as they were obtained using genotyping by sequencing. In addition to the new P. communis pear SNPs developed in this study, we found that 1482 SNP markers derived from apple (19.3% of the total apple SNPs on the IRSC array) were polymorphic in pear, and 1031 of them were positioned on the pear genetic maps. The apple SNPs considerably improved the density of all maps, in some cases, e.g. T052×T003 and T052×T064, even doubling the number of mapped markers. In fact, because of the lower polymorphism of pear SNPs in the interspecific hybrid parents compared with the P. communis parents, the apple SNPs were necessary to saturate these maps.
The higher number of polymorphic pear markers identified in the European pear cross OH×LBJ compared with the four populations with an Asian pear background is because sequence data from OH and LBJ were used to design the pear SNPs, which also validates the bioinformatic SNP detection method used. In the T003×M population, the number of polymorphic pear SNPs in the European parent (‘Moonglow’) was significantly higher than in the hybrid (T003), again because the SNPs were derived from sequencing of P. communis accessions. However, the number of pear SNPs that were polymorphic in the interspecific parents was more variable, and reflects both the number of SNPs that are conserved between European and Asian pear and those that were introgressed from the European parent into the interspecific hybrid parents. The transferability of SNPs between species of the same genus has been reported previously in a few studies. These include the plant genera Vitis [35], Citrus [19] and Eucalyptus [36], as well as the mammalian genus Bubalus [37]. It is noteworthy that the transferability of SNPs between species was as high in these studies as observed in this study in Pyrus.
SNP Transferability between Genera Pyrus and Malus
The distinguishing feature of the apple and pear Infinium® II 9K SNP array is its combination of SNPs from both Malus and Pyrus, making it the first cross-genera SNP array created. It therefore enables, for one of the first times, the assessment of SNP marker transferability between genera. Most of the numerous studies on genetic marker transferability in recent years have focused on SSR markers, including those concerning apple and pear [22], [25], [38], [39]. Previous attempts to transfer SNPs between genera involved only a few accessions of the non-targeted species, including the study of Micheletti et al. [40], who estimated the rate of transferability of the heterozygous state from M.×domestica to P. communis and P. pyrifolia using 237 apple SNPs. In the present study, we observed that 7562 apple SNPs (98.3%) were either monomorphic or polymorphic in at least one pear population, while only 130 did not hybridize well in any of them. The high percentage of hybridization of pear genomic DNA to apple SNPs and vice versa obtained in the present study is not surprising, given that Malus and Pyrus are closely related genera and might be expected to share high sequence similarity. Furthermore, both the pear and apple SNPs included in the array were selected to be located in coding genes, with the consequence that the flanking sequences are more likely to be conserved between species. Although many of the apple SNPs were monomorphic (but still hybridized to pear DNA) and were not useful for genetic mapping in the five pear populations, we were able to map 99 apple markers in the OH×LBJ population, 255 in T003×M, 199 in T042×T081, 365 in T052×T003, and 631 in T052×T064.
SNPs with Null Alleles
The existence of null or unexpected alleles has already been demonstrated in several other SNP genotyping studies. Such alleles can be explained as deletions spanning a polymorphic site, secondary polymorphisms, or tri-allelic sites at the primary polymorphism [19], [41]. Since the SNP genotyping technology we used was the Infinium® II from Illumina, any putative third allele of polymorphic SNPs was not detectable and, therefore, in our study the SNPs with null alleles can fall only into the first two categories. Null alleles are an important source of polymorphisms; however, they are challenging to detect and analyze using SNP array software. In the present study, a higher number of SNPs with null alleles was detected in the interspecific populations than in the P. communis population. This was expected, as the frequency of null alleles increases with genetic distance between the samples genotyped and the discovery panel [19], because additional SNPs in the flanking sequence used for the Infinium® array design are more likely to occur between different species (Asian versus European pear) or genera (Malus versus Pyrus). We found that the within-species frequency of null alleles was similar in apple and pear SNPs. As heterozygous null alleles are useful for genetic mapping, we used them to increase map density in interspecific populations. It must be noted, however, that null alleles are a potential source of increased false positives in marker-trait association studies [42], [43].
Pear and Apple Genome Synteny
In total, 92% of the pear SNPs included in the Infinium® II array were successfully anchored to the ‘Golden Delicious’ genome [5], and the alignment of the physical map with the OH×LBJ genetic map resulted in an average of 20 orthologous markers per LG. The apple SNPs were not always located at the same position on the pear genetic map as in the apple genome; this can be explained in part by the finding that approximately 15% of the SNPs included in the 9K array have been assigned erroneous positions on the ‘Golden Delicious’ reference sequence [33]. Even so, the number of orthologous markers between apple and pear identified in the present work (433 pear SNPs and 99 apple SNPs for OH×LBJ) is almost double the total found in previous studies (227). These studies included those by Pierantoni et al. [39], who demonstrated good genome colinearity between one apple and two pear genetic maps, using 41 and 31 mapped apple SSRs, respectively; Yamamoto et al. [38], who mapped apple and pear markers in European pear cultivars, and found that the position of 66 apple SSRs showed colinearity with the apple reference map; and Celton et al. [24], who aligned the genetic maps of two apple and pear cultivars constructed using apple and pear SSRs, and identified 90 colinear markers (53 pear and 37 apple SSRs) in common between the apple and pear genomes.
Conclusions
We have thoroughly validated the apple and pear Infinium® II 9K SNP array, and demonstrated its usefulness for high throughput genotyping in breeding populations of P. communis, as well as those of a mixed genetic background that includes P. communis, P. pyrifolia and P. bretschneideri. Furthermore, we showed that the arrayed SNPs are transferable not only across these species, but also between the two closely related genera Malus and Pyrus.
The construction of high density gene-based genetic maps using our SNP array represents an important step for the discovery of chromosomal regions associated with commercially important horticultural traits, such as pest and disease resistance, orchard productivity and fruit quality [32] in pears derived from P. communis, P. pyrifolia and P. bretschneideri. The OH×LBJ population was a repeat of a cross [44] used to develop an understanding of genetic determinants of vigour control and precocity in pear rootstocks. The 400 seedlings planted in Motueka (New Zealand) are grafted with ‘Doyenné du Comice’ (P. communis) scions for the purpose of a QTL analysis of rootstock-induced dwarfing in pear. The T003×M population was developed to study the genetic basis of resistance to pear scab (Venturia pirina), fire blight (Erwinia amylovora), pear psylla (Cacopsylla pyri) and pear sawfly (Caliroa cerasi). T003 (like most Asian pears) is not a host of V. pirina [45], [46] and is a good source of resistance to C. pyri and C. cerasi [47], while ‘Moonglow’ derives from the fire blight-resistant cultivars ‘Roi Charles Würtenberg’ and ‘Seckel’. The T042×T081 population was created to develop an understanding of the genetic control of scab resistance in pear. We are using the T052×T003 and T052×T064 populations to investigate the genetic basis of the storage-related disorder “friction discolouration”, using genetic mapping in combination with metabolomic phenotyping to identify QTLs controlling the disorder. Such examples of applications of the apple and pear Infinium® II 9K SNP array demonstrate that it will produce a range of outcomes that can be applied to pear breeding programmes worldwide.
Genomic Resources
The pear SNPs detected by sequencing, the pear SNPs chosen for the apple and pear Infinium® II 9K SNP array, and the GenomeStudio cluster file developed are deposited in the Genome Database for Rosaceae (www.rosaceae.org). SNPs are available in dbSNP (http://www.ncbi.nlm.nih.gov/projects/SNP/) under accessions ss527787751 to ss527789916.
Supporting Information
Table S1.
List of SSR markers with primer sequences. Segregation type and a comparison of mapping positions on the WBC and T003×M maps are also provided.
https://doi.org/10.1371/journal.pone.0077022.s001
(XLSX)
Table S2.
List of 1096 pear SNPs on the apple and pear Infinium® II 9K SNP array. The NCBI dbSNP accession and the location on the ‘Golden Delicious’ genome assembly are indicated.
https://doi.org/10.1371/journal.pone.0077022.s002
(XLSX)
Table S3.
Genetic linkage maps of five populations used to validate the apple and pear 9K SNP array.
https://doi.org/10.1371/journal.pone.0077022.s003
(XLSX)
Acknowledgments
We thank Dianne Hyndman and Rosemary Rickman (AgResearch Invermay, New Zealand) and Elisa Banchi (IASMA, Italy) for providing the Illumina genotyping service. DC thanks the Rosaceae genomics community for kindly enabling the inclusion of pear SNPs in the apple and pear Infinium® II 9K SNP array. We also thank the INRA Experimental Unit (UE Horti, Angers, France) for care of the T003×M population and the INRA GENTYANE platform (UMR1095, Clermont-Ferrand, France) for SSR genotyping of this progeny.
Author Contributions
Conceived and designed the experiments: DC. Performed the experiments: SM MS MK YKK. Analyzed the data: SM MS MK YKK MT PF RS RNC. Contributed reagents/materials/analysis tools: MM RV KHW CED LP RS CW VB LB SEG DC. Wrote the paper: SM MS MK DC.
References
- 1. Velasco R, Zharkikh A, Troggio M, Cartwright D a, Cestaro A, et al. (2007) A high quality draft consensus sequence of the genome of a heterozygous grapevine variety. PLoS ONE 2: e1326
- 2. Jaillon O, Aury J-M, Noel B, Policriti A, Clepet C, et al. (2007) The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 449: 463–467
- 3. Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, et al. (2009) The B73 maize genome: complexity, diversity, and dynamics. Science (New York, NY) 326: 1112–1115
- 4. Verde I, Abbott AG, Scalabrin S, Jung S, Shu S, et al. (2013) The high-quality draft genome of peach (Prunus persica) identifies unique patterns of genetic diversity, domestication and genome evolution. Nature Genetics 45: 487–494
- 5. Velasco R, Zharkikh A, Affourtit J, Dhingra A, Cestaro A, et al. (2010) The genome of the domesticated apple (Malus×domestica Borkh.). Nature Genetics 42: 833–839
- 6. Xu X, Pan S, Cheng S, Zhang B, Mu D, et al. (2011) Genome sequence and analysis of the tuber crop potato. Nature 475: 189–195
- 7. Sato S, Tabata S, Hirakawa H, Asamizu E, Shirasawa K, et al. (2012) The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485: 635–641
- 8. Wu J, Wang Z, Shi Z, Zhang S, Ming R, et al. (2013) The genome of the pear (Pyrus bretschneideri Rehd.). Genome Research 23: 396–408
- 9. Chagné D, Crowhurst RN, Troggio M, Davey MW, Gilmore B, et al. (2012) Genome-wide SNP detection, validation, and development of an 8K SNP array for apple. PLoS ONE 7: e31745
- 10. Verde I, Bassil N, Scalabrin S, Gilmore B, Lawley CT, et al. (2012) Development and evaluation of a 9K SNP array for peach by internationally coordinated SNP detection and validation in breeding germplasm. PLoS ONE 7: e35668
- 11. Xu X, Liu X, Ge S, Jensen JD, Hu F, et al. (2012) Resequencing 50 accessions of cultivated and wild rice yields markers for identifying agronomically important genes. Nature Biotechnology 30: 105–111
- 12. Hyten DL, Cannon SB, Song Q, Weeks N, Fickus EW, et al. (2010) High-throughput SNP discovery through deep resequencing of a reduced representation library to anchor and orient scaffolds in the soybean whole genome sequence. BMC Genomics 11: 38
- 13. Hand ML, Cogan NOI, Forster JW (2012) Genome-wide SNP identification in multiple morphotypes of allohexaploid tall fescue (Festuca arundinacea Schreb). BMC Genomics 13: 219
- 14. Stothard P, Choi J-W, Basu U, Sumner-Thomson JM, Meng Y, et al. (2011) Whole genome resequencing of black Angus and Holstein cattle for SNP and CNV discovery. BMC Genomics 12: 559
- 15. Li R, Li Y, Fang X, Yang H, Wang J, et al. (2009) SNP detection for massively parallel whole-genome resequencing. Genome Research 19: 1124–1132
- 16. Bentley DR (2006) Whole-genome re-sequencing. Current opinion in genetics & development 16: 545–552
- 17. Elshire RJ, Glaubitz JC, Sun Q, Poland J a, Kawamoto K, et al. (2011) A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE 6: e19379
- 18. Peace C, Bassil N, Main D, Ficklin S, Rosyara UR, et al. (2012) Development and evaluation of a genome-wide 6K SNP array for diploid sweet cherry and tetraploid sour cherry. PLoS ONE 7: e48305
- 19. Ollitrault P, Terol J, Garcia-Lor A, Bérard A, Chauveau A, et al. (2012) SNP mining in C. clementina BAC end sequences; transferability in the Citrus genus (Rutaceae), phylogenetic inferences and perspectives for genetic mapping. BMC Genomics 13: 13
- 20. Iketani H, Abe K, Yamamoto T, Kotobuki K, Sato Y, et al. (2001) Mapping of disease-related genes in Japanese pear using a molecular linkage map with RAPD markers. Breeding Science 51: 179–184.
- 21. Yamamoto T, Kimura T, Shoda M, Imai T, Saito T, et al. (2002) Genetic linkage maps constructed by using an interspecific cross between Japanese and European pears. Theoretical and Applied Genetics 106: 9–18
- 22. Yamamoto T, Kimura T, Saito T, Kotobuki K, Matsuta N, et al. (2004) Genetic Linkage Maps of Japanese and European Pears Aligned to the Apple Consensus Map. 1: 51–56.
- 23. Dondini L, Pierantoni L, Gaiotti F, Chiodini R, Tartarini S, et al. (2004) Identifying QTLs for fire-blight resistance via a European pear (Pyrus communis L.) genetic linkage map. Molecular Breeding 14: 407–418
- 24. Celton J-M, Chagné D, Tustin SD, Terakami S, Nishitani C, et al. (2009) Update on comparative genome mapping between Malus and Pyrus. BMC Research Notes 2: 182
- 25. Lu M, Tang H, Chen X, Gao J, Chen Q, et al. (2010) Comparative genome mapping between apple and pear by apple mapped SSR markers. American-Eurasian Journal of Agricultural and Environmental Science 9: 303–309.
- 26. Huang X, Madan A (1999) CAP3: A DNA sequence assembly program. Genome Research 9: 868–877.
- 27. Brewer L, Alspach P, Bus V (2005) Fruit and leaf incidence of pear scab (Venturia pirina Aderh.) in mixed European and Asian pear progenies. Acta Horticulturae 671: 595–600.
- 28. Doyle JJ, Doyle JL (1987) A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochemistry Bull 19: 11–15.
- 29. Costa F, Weg WE, Stella S, Dondini L, Pratesi D, et al. (2008) Map position and functional allelic diversity of Md-Exp7, a new putative expansin gene associated with fruit softening in apple (Malus×domestica Borkh.) and pear (Pyrus communis). Tree Genetics & Genomes 4: 575–586
- 30. Van Ooijen J (2006) JoinMap 4, Software for the calculation of genetic linkage maps in experimental populations. Kyazma B.V., Wageningen, Netherlands.
- 31. Voorrips R (2002) MapChart: software for the graphical presentation of linkage maps and QTLs. Journal of Heredity 93: 77–78.
- 32. Yamamoto T, Chevreau E (2009) Pear Genomics. In: Folta K, Gardiner S, editors. Genetics and genomics of Rosaceae. Springer, New York, NY. 163–186. doi:10.1007/978-0-387-77491-6_8.
- 33. Antanaviciute L, Fernández-Fernández F, Jansen J, Banchi E, Evans KM, et al. (2012) Development of a dense SNP-based linkage map of an apple rootstock progeny using the Malus Infinium whole genome genotyping array. BMC Genomics 13: 203
- 34. Terakami S, Nishitani C, Yamamoto T (2011) Development of SNP markers for marker-assisted selection in pear. Acta Horticulturae 976: 463–470.
- 35. Vezzulli S, Micheletti D, Riaz S, Pindo M, Viola R, et al. (2008) A SNP transferability survey within the genus Vitis. BMC Plant Biology 8: 128
- 36. Grattapaglia D, Silva-Junior OB, Kirst M, De Lima BM, Faria D a, et al. (2011) High-throughput SNP genotyping in the highly heterozygous genome of Eucalyptus: assay success, polymorphism and transferability across species. BMC Plant Biology 11: 65
- 37. Matukumalli LK, Lawley CT, Schnabel RD, Taylor JF, Allan MF, et al. (2009) Development and characterization of a high density SNP genotyping assay for cattle. PLoS ONE 4: e5350
- 38. Yamamoto T, Kimura T, Terakami S, Nishitani C, Sawamura Y, et al. (2007) Integrated reference genetic linkage maps of pear based on SSR and AFLP markers. Breeding Science 57: 321–329
- 39. Pierantoni L, Cho K-H, Shin I-S, Chiodini R, Tartarini S, et al. (2004) Characterisation and transferability of apple SSRs to two European pear F1 populations. Theoretical and Applied Genetics 109: 1519–1524
- 40. Micheletti D, Troggio M, Zharkikh A, Costa F, Malnoy M, et al. (2011) Genetic diversity of the genus Malus and implications for linkage mapping with SNPs. Tree Genetics & Genomes 7: 857–868
- 41. Carlson CS, Smith JD, Stanaway IB, Rieder MJ, Nickerson D a (2006) Direct detection of null alleles in SNP genotyping data. Human Molecular Genetics 15: 1931–1937
- 42. Rice KM, Holmans P (2003) Allowing for genotyping error in analysis of unmatched case-control studies. Annals of Human Genetics 67: 165–174.
- 43. Sawcer SJ, Maranian M, Singlehurst S, Yeo T, Compston A, et al. (2004) Enhancing linkage analysis of complex disorders: an evaluation of high-density genotyping. Human Molecular Genetics 13: 1943–1949
- 44. Jacob H (1998) Pyrodwarf, a clonal rootstock for high density pear orchards. Acta Horticulturae 475: 169–177.
- 45. Brewer L, Alspach P (2009) Resistance to scab caused by Venturia pirina in interspecific pear (Pyrus spp.) hybrids. New Zealand Journal of Crop and Horticultural Science 37: 211–218. doi:10.1080/01140670909510266.
- 46. Bus V, Brewer L, Morgan C (2013) Observations on scab resistance in interspecific pear seedling families. Acta Horticulturae 976. Vol. 2: 493–498.
- 47. Brewer L, Alspach P, White A (2002) Variation in the susceptibility of pear seedlings to damage by the larvae of the sawfly (Caliroa cerasi). Acta Horticulturae 596: 571–574.