Development of a Gene-Centered SSR Atlas as a Resource for Papaya (Carica papaya) Marker-Assisted Selection and Population Genetic Studies

Newton Medeiros Vidal; Ana Laura Grazziotin; Helaine Christine Cancela Ramos; Messias Gonzaga Pereira; Thiago Motta Venancio

doi:10.1371/journal.pone.0112654

Abstract

Carica papaya (papaya) is an economically important tropical fruit. Molecular marker-assisted selection is an inexpensive and reliable tool that has been widely used to improve fruit quality traits and resistance against diseases. In the present study we report the development and validation of an atlas of papaya simple sequence repeat (SSR) markers. We integrated gene predictions and functional annotations to provide a gene-centered perspective for marker-assisted selection studies. Our atlas comprises 160,318 SSRs, from which 21,231 were located in genic regions (i.e. inside exons, exon-intron junctions or introns). A total of 116,453 (72.6%) of all identified repeats were successfully mapped to one of the nine papaya linkage groups. Primer pairs were designed for markers from 9,594 genes (34.5% of the papaya gene complement). Using papaya-tomato orthology assessments, we assembled a list of 300 genes (comprising 785 SSRs) potentially involved in fruit ripening. We validated our atlas by screening 73 SSR markers (including 25 fruit ripening genes), achieving 100% amplification rate and uncovering 26% polymorphism rate between the parental genotypes (Sekati and JS12). The SSR atlas presented here is the first comprehensive gene-centered collection of annotated and genome positioned papaya SSRs. These features combined with thousands of high-quality primer pairs make the atlas an important resource for the papaya research community.

Citation: Vidal NM, Grazziotin AL, Ramos HCC, Pereira MG, Venancio TM (2014) Development of a Gene-Centered SSR Atlas as a Resource for Papaya (Carica papaya) Marker-Assisted Selection and Population Genetic Studies. PLoS ONE 9(11): e112654. https://doi.org/10.1371/journal.pone.0112654

Editor: Chunxian Chen, USDA/ARS, United States of America

Received: July 10, 2014; Accepted: October 8, 2014; Published: November 13, 2014

This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.

Data Availability: The authors confirm that all data underlying the findings are fully available without restriction. All data are available in supplementary files.

Funding: This work was funded by Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES), Instituto Nacional de Ciência e Tecnologia - Entomologia Molecular (INCT-EM), Universidade Estadual do Norte Fluminense Darcy Ribeiro (UENF), and Fundação de Amparo à Pesquisa do Estado do Rio de Janeiro Carlos Chagas Filho (FAPERJ) (PENSARIO E-26/110.720/2012). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Papaya (Carica papaya Linneaus) is an economically and nutritionally important fruit tree of tropical and subtropical regions. Papaya is well known for its nutritional benefits [1], medical [2] and industrial [3] applications. Due to its commercial importance, papaya production is currently ranked as the third major global production among tropical fruits [4]. Notwithstanding the increased papaya trade, a limited number of cultivars are commercially available, hampering papaya production worldwide. Further, the low genetic diversity of selected C. papaya cultivars [5]–[7] makes the species susceptible to bacterial and viral infections [8], [9]. To improve disease resistance, genetic diversity and productiveness, researchers have been using molecular marker-assisted selection (MAS), a well-established procedure employed in commercial breeding programs to enhance the gain from artificial selection.

Microsatellites, also known as simple sequence repeats (SSRs), are simple tandemly repeated DNA sequences, ranging from 2–6 base pairs per repeat unit [10]. These repeated sequences are highly variable in length, mainly due to unequal recombination events or DNA polymerase slippage. Microsatellite PCR amplification has been described as a reliable, rapid, and inexpensive technique. Combined with the highly polymorphic nature and co-dominant segregation of SSRs, PCR amplification of microsatellites is a powerful technique for plant breeding and genetic studies, such as MAS, population genetic analysis, quantitative trait locus (QTLs) mapping, DNA fingerprinting, and genome mapping [11]–[16].

Once a labor-intensive and time-consuming process, identification of new microsatellite markers became increasingly feasible with the improvement of molecular biology techniques and availability of genomic information for several plant species. Over the past decade, SSRs derived from expressed sequence tags (EST-SSRs) markers emerged as a feasible alternative in marker development for several crop species [17]. EST-SSRs are transcribed from coding sequences (CDS), which tend to be conserved between species, high interspecies transferability rates can be achieved [18], [19]. Moreover, CDS markers are more informative than intergenic ‘anonymous’ markers because they are more likely to be functional [20], [21]. However, there are also a few disadvantages of using CDS markers. High conservation may result in low polymorphism rates, which are of limited use in MAS studies. In addition, primers designed for exon-intron junctions may result in PCR amplification failures.

With the increasing number of sequenced genomes, SSRs can be computationally detected and classified according to their genomic locations in 5′ untranslated regions (UTRs), exons, introns and 3′ UTRs. By using this strategy, exon-intron junctions can be avoided during primer design and fully exonic markers can be selected. Completely sequenced genomes also allow the selection of intronic markers which are more polymorphic than exonic markers and segregate with a particular gene that may be associated with a biochemical function or phenotype of interest [22], [23].

In the present work we describe the analysis of papaya SSR markers in a genome-wide scale, integrating SSR positioning and functional annotation data. Stringent primer design criteria were used to allow better results in genetic studies. This complete catalog will be of great value for the papaya research community, especially for groups conducting MAS projects. Using this map, researchers will be able to filter and choose interesting markers according to SSR type, length, sequence, region location (exon, intron or intergenic), linkage group and gene annotation.

Materials and Methods

Genomic data and SSR annotation

Papaya genome assembly and annotation files were downloaded from Phytozome v7.0 FTP (http://www.phytozome.net/) [24]. The genome assembly 113 consists of 244.5 Mb of gapless sequences, distributed in 3,207 scaffolds and 2,693 contigs. Genomic locations of detected SSRs were integrated with gene and exon coordinates from the reference GFF file. Based on their genomic mapping, SSRs were categorized as exonic (entirely within the CDS), exon-intron (in exon-intron boundaries), intronic (within introns) and intergenic (outside of genic regions). Identifiers and annotations based on papaya-Arabidopsis thaliana homology were obtained for genic SSRs. Papaya genes were also annotated using Gene Ontology (GO) terms from the Plant Ontology Tool (http://www.arabidopsis.org/tools/bulk/po/index.jsp). In order to map scaffolds and contigs to the major nine linkage groups, all 47,483 papaya contigs [GenBank: ABIM01000001-ABIM01047483] were downloaded from the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov). Linkage group information was retrieved from GenBank files for each contig and scaffold [GenBank: DS981520-DS984726].

Simple Sequence Repeats Identification

Exact maximal repeats were detected in the papaya genome using mreps [25]. Perfect repeats with more than 12 nucleotides, motif lengths of 2–6 bp and at least 2 units of repetition were analyzed. Parameters were set as follows: -r 0 -minsize 12 -minperiod 2 -maxperiod 6 -exp 2. The mreps algorithm finds exact maximal repeats, removes redundancy by selecting the best period for each repeat, merges repeats with same period and eliminates statistically insignificant expected repeats [25].

For description of di- and trinucleotide motifs, circular permutations and complementary strand nucleotides were considered as equivalents and grouped in one class after determining the individual repeat frequencies. Thus, there are four possible di-nucleotide motifs and ten possible tri-nucleotide motifs. For example, motifs AG, GA, CT and TC are equivalent and grouped as AG/GA/CT/TC. Likewise, motifs ACG, CGA, GAC, CGT, TCG and GTC are also equivalents and represented as ACG/CGA/GAC/CGT/TCG/GTC.

Primer design

SSRs were retrieved from the genome with 250 bp upstream/downstream flanking regions and had their low complexity regions masked with DustMasker [26]. Primers were designed with the standalone version of Primer3 [27] using the following parameters: primer length between 18 and 25 nucleotides (optimal length = 20 nt), melting temperature between 57 and 63°C (optimal Tm = 60°C), PCR product size between 250 and 350 bp (optimal 300 bp), GC content of 20–60%, and PRIMER_MAX_HAIRPIN_TH = 24. All SSR sequences were also used as a repeat library (option PRIMER_MISPRIMING_LIBRARY) to avoid primer design within the SSRs. Primers with low complexity regions were discarded.

Analysis of genes involved in fruit ripening

As a proof of concept, our gene-centered SSR map were used to identify SSR markers for genes potentially involved in cell wall remodeling, transcriptional regulation and hormone signaling. Genes with differential expression in tomato ripening fruits (determined by RNA-Seq) [28] were used to identify homologous genes in papaya. Fifty-three cell wall and 222 transcription/ethylene proteins were used as BLASTP queries to search the papaya predicted proteins with the following criteria: E-value ≤1e–30, similarity of at least 50%, query and hit coverage of at least 75%. Tomato protein sequences and annotations release ITAG2.3 were downloaded from the Sol Genomics Network FTP [29] (ftp://ftp.solgenomics.net/).

SSR screening and polymorphism survey

The papaya genotypes Sekati and JS12 were used for screening polymorphic genic SSRs. Total genomic DNA was extracted from young leaves according to the CBAB method [30]. A total of 73 primer pairs comprising ∼8 genic SSR regions per chromosome were selected. PCR amplifications were performed in 15 µL reaction, containing 10 ng DNA, 10 mM Tris-HCl, pH 8.3, 50 mM KCl, 2 mM MgCl₂, 100 µM dNTPs, 0.2 µM of each primer, and 1 U Taq DNA polymerase. PCR cycling was performed in an Eppendorf thermal cycler, according to the following profile: 4 min of denaturation at 94°C, 35 amplification cycles (94°C at 30 s, 58°C at 1 min, 72°C at 3 min), followed by a final extension of 7 min at 72°C. In 4 of these cases, the primer annealing temperature was set to 65°C. Amplified products were separated in a 4% agarose gel Metaphor, stained by GelRedTM/Blue Juice mixture (1∶1) and visualized through the MiniBis Pro photodocumentation system (DNR Bio-Imaging Systems Ltd., Jerusalem, Israel).

Data Access and Retrieval

Information regarding SSR identifiers, genomic coordinates, motif sequence, period, size and exponent, genomic location (i.e. exon, intron, exon-intron, intergenic), linkage group, and gene annotations are fully available in two user-friendly spreadsheets (Table S1, Table S2).

Results and Discussion

SSR classification and genomic positioning

We identified 160,318 SSRs with a density of 656 SSR/Mb in the most recent version of the papaya genome (see methods for details). Previous studies of papaya SSRs reported densities of 1,340 SSR/Mb [31] and 746 SSR/Mb [32]. Such disparities in the number of identified microsatellites are usual among different reports, mainly due to differences in the algorithms, parameter settings, minimal repeat length and redundancy filtering [33]–[35]. Differently from other two previous studies [31], [32], we used the mreps algorithm, which was ranked as the best algorithm for repeat detection in a systematic study [35]. Specifically, mreps does not report all the overlapping repeats, but efficiently retains only the most credible overlapping ones, giving more reproducible and reliable results.

We detected 160,318 perfectly matching, non-redundant SSRs. After integrating SSRs and gene coordinates, we found that 36% of the papaya genes (9,992/27,769) have at least one SSR. A total of 21,231 SSRs were identified in genic regions, while 139,087 were intergenic (Figure 1). Because UTR annotations are not available for the papaya genome, SSRs located on these regions were not classified as such. As expected, most SSRs are intergenic (86.8%), followed by intronic (9.9%), exonic (3.3%), and only 73 (0.04%) SSRs in exon-intron boundaries (Table 1). Dinucleotide motifs were abundant in intergenic (39.4%) and intronic regions (45.3%), while tri- to hexanucleotides were uniformly distributed in such regions. On the other hand, exons and exon-intron boundaries were enriched in tri- (67.1% and 37%) and hexanucleotides (24.4% and 42.5%), which is expected due to the selective pressure against frameshift mutations in coding regions.

Download:

Figure 1. Distribution of SSR type according to genomic location.

https://doi.org/10.1371/journal.pone.0112654.g001

Download:

Table 1. Number of simple sequence repeats by genomic location.

https://doi.org/10.1371/journal.pone.0112654.t001

Sequence AT/TA was the most common dinucleotide motif (69.9%) in the papaya genome, corroborating previous results obtained from whole-genome shotgun sequences (WGS) and BAC End Sequences (BES) [7], [31]. Nevertheless, when considering genomic locations, this motif was enriched only in intronic (63.6%) and intergenic (70.9%) regions, whereas AG/GA/CT/TC motifs were predominant in exons (88.5%) and exon-intron boundaries (85.7%) (Table 2). Among trinucleotide motifs, AAT/ATA/TAA/ATT/TAT/TTA has been described as most prevalent in papaya genome [7], [31]. Although AAT/TTA sequence was also frequent (47%) in our study, it is mainly located in introns (46.6%) and intergenic (53.3%) regions. Conversely, the second most predominant trinucleotide motif, AAG/AGA/GAA/CTT/TCT/TTC, was more evenly distributed in exons (34.4%), exon-intron boundaries (25.9%), introns (27.9%), and intergenic (21.7%) locations in genomic context. Coherently to a previous study [31], dinucleotides CG/GC and trinucleotides CCG/GGC were rarely found (0.1% and 1.2% respectively) here.

Download:

Table 2. Distribution of SSRs by genomic location.

https://doi.org/10.1371/journal.pone.0112654.t002

Based on repeat length, papaya SSRs were defined as class I (≥20 nucleotides) and class II (between 12–19 nucleotides) (Figure 2). SSR lengths ranged from: 12–82 bases in exons; 12–30 bases in exon-intron boundaries; 12–201 bases in introns and 12–155 bases in intergenic regions. Most SSRs were classified as class II (79.0%–84.1%), regardless of their genomic location (Table 3). However, 24,234 primer pairs were designed for class I SSRs (Table S3). Since these longer sequences are typically hypervariable and more likely to be polymorphic, they are the preferable choice as molecular markers for diversity studies.

Download:

Figure 2. Percentage frequency of SSR according to SSR size.

https://doi.org/10.1371/journal.pone.0112654.g002

Download:

Table 3. Distribution of Class I and Class II SSRs in different genomic regions.

https://doi.org/10.1371/journal.pone.0112654.t003

All SSRs were assigned to chromosomes according to their scaffold or contig localization. A total of 116,453 SSRs (72.6%) could be mapped to one of the nine papaya linkage groups. The number of SSRs in each chromosome ranged from 10,566 (6.6%) in LG7 to 14,773 (9.2%) in LG9. The proportion of SSR types and motifs among different chromosomes was similar to the overall genomic distribution (Table S4). The proportion of SSR genomic locations in each chromosome was higher in intergenic regions, followed by introns, exons and exon-intron junctions. There was no bias for SSR types and SSR genomic locations among the chromosomes (Table S5), which is highly desirable for researchers aiming to develop a collection of polymorphic markers for genetic studies.

Design of high-quality primer pairs for papaya SSRs

Aiming to provide a comprehensive source of SSRs to be used in marker-assisted selection and population genetics studies, all 21,231 genic and 139,087 intergenic SSRs were submitted to primer design. All primer pairs were optimized for the same PCR conditions (see methods for details). In a preliminary analysis, 20,659 (97.3%) and 118,831 (85.4%) primer pairs were respectively designed for genic and intergenic SSRs using Primer3 with default parameters. Manual inspection of results revealed a significant number of primers containing repeated or low complexity sequences. Therefore, we decided to adopt more stringent criteria for primer design: 1) Primers within repetitive sequences were removed; 2) Primers in soft-masked low complexity 3′ end regions were not allowed (PRIMER_LOWERCASE_MASKING = 1); and 3) A parameter to minimize hairpin formation (PRIMER_MAX_HAIRPIN_TH = 24) was employed. By using this stringent parameterization, we obtained a much more reliable primer set, although the overall success rate of primer design dropped to 71%. A total of 18,925 primer pairs were successfully designed for 89.1% and 67.9% of all genic and intergenic SSR sequences, respectively (Table 4). Although such stringent settings prevented primer design for 8% genic and 17% intergenic SSRs, this methodology resulted in an extensive and more reliable list of primer pairs designed for distinct SSR types, genomic locations and chromosome linkage groups (Table S1, Table S2).

Download:

Table 4. Number of successfully designed primer pairs for SSR type and linkage group.

https://doi.org/10.1371/journal.pone.0112654.t004

Designed primers were uniformly distributed among SSR types (Table 4) and chromosomes (Table 5). Regarding SSR type, most of the 113,446 primer pairs were designed for dinucleotide repeats (39.7%), followed by tri- (18.2%), hexa- (15.2%), penta- (13.6%) and tetranucleotide repeats (13.3%) (Table 4). As expected, the majority of exonic SSRs with designed primers are composed of trinucleotide repeats (Table S6).

Download:

Table 5. Number of successfully designed primer pairs for each genomic context and linkage group.

https://doi.org/10.1371/journal.pone.0112654.t005

Analysis of SSRs located in genes related to fruit ripening

Next, we aimed to use our SSR atlas to find genes related to fruit ripening, a trait of high agronomical interest in papaya. To achieve this goal, tomato gene expression and tomato/papaya orthology data were integrated. Tomato is the papaya’s closest climacteric fleshy fruit with available genome-wide gene expression data [28]. Sato et al. reported significant differential expression of 53 cell wall and 222 transcription factors (TFs)/ethylene-related genes during fruit ripening [28]. Based on BLASTP searches (see methods for details), 175 cell wall and 319 transcription factor orthologous genes were found in papaya (Table S7, Table S8). Our atlas include SSRs with primer pairs for 113 cell wall-related (257 SSRs, 40 exonic and 217 intronic; 2.3 SSRs/gene) and 187 TF/ethylene-related genes (528 SSRs, 127 exonic and 400 intronic; 2.8 SSRs/gene). These two groups comprise very good candidate pulp softening and pigmentation control genes [36]. By integrating this information in our gene-centered SSR map, we provide an unprecedented list of markers for studying the genetic and functional variability of fruit ripening processes.

Fruit ripening is a developmental process characterized by remarkable changes related to flavor, sugar metabolism, color, aroma, texture, softening and nutritional content [37]. These metabolic and physical alterations are driven by genetically coordinated expression profiles in several metabolic pathways, such as cell wall disassembly, sugar hydrolysis, ethylene biosynthesis and pigmentation. Using KEGG Orthology (KO) we identified ripening-related pathways for cell wall genes and TFs harboring SSRs with primer pairs. Nine entries for cell wall genes were found in KO00050 - Starch and sucrose metabolism (5) and KO00040– Pentose and glucuronate interconversions (4). During climacteric fruit ripening, respiration rate increases and a series of enzymes degrade starch and synthesize sucrose (e.g. starch phosphorylase and sucrose synthase) [38]. The carboxylic acid glucuronate is the precursor of pectin, one of the main components of plant cell wall [39]. Pectin levels decrease during fruit maturation [39] typically due to the increased expression of pectin lyases [28]. However, since pectin lyases and methyltransferases are repressed during papaya ripening, pectin solubilization is probably catalyzed by polygalacturonases [40].

Among TFs/ethylene-related genes, 24 entries were found, mostly from the category KO04075 (Plant hormone signal transduction; 8 genes), such as auxin-responsive protein, ethylene receptor and responsive transcription factor 1 and ABA-responsive element binding factor. The phytohormone auxin plays important roles in fruit growth, regulating cell division, differentiation, lateral root formation and embryogenesis [41]. Particularly, genes involved in signaling and auxin response factors were up-regulated during papaya [40] and peach [42] ripening. In addition, auxin can stimulate ethylene biosynthesis via transcription of acetyl-coenzyme A synthetase genes [42]. In turn, the involvement of ethylene response factor in climacteric fruit ripening is well-established and this regulator determines, for example, fruit firmness reduction and defense response to pathogens after ripe [36]. By aggregating these annotations, our atlas provides a rich resource from which the scientific community can rapidly draw genes and SSRs with designed primer pairs for genetic studies.

SSR screening and polymorphism survey

A 100% PCR amplification rate was achieved for the 73 genic SSRs (16 exonic and 57 intronic), allowing the identification of 19 polymorphic alleles (26%) (Table S9). Twenty five of such genes are orthologs of tomato genes with differential expression during fruit ripening (Table S7, Table S8). Among the polymorphic alleles we found 5 cell wall and 2 transcription factors/ethylene-related genes. Polymorphisms were also detected in genes from the cellulose synthase, pectin lyase-like and ethylene response factor families/superfamilies. Taken together, these results not only validate the use of our atlas as an efficient tool in papaya breeding projects, but also stimulate additional genetic and biochemical studies to detail the functions these polymorphic genes in papaya, as they may be useful in the production of fruits with increased shelf life.

Conclusion

Non-coding SSRs near genes or inside introns can directly affect gene expression [21], [43]–[45]. On the other hand, SSRs within exons often result in amino acid changes that may affect protein function. For example, a study using genic SSRs derived from candidate genes involved in wood formation identified two SSR markers (one in coding and the other in non-coding region) explaining 13.5% of the lignin content variation in Chinese white poplar [46]. These results demonstrate the power of genic markers to identify genotype-to-phenotype associations and make these markers very useful in genetic improvement of desired characteristics.

In the present work we surveyed the papaya genome for the presence of perfect, non-redundant SSRs. We analyzed the distribution of SSR locations (exon, exon-intron, intron, intergenic) and established a comprehensive atlas of SSR markers with SSR type, motif sequence, SSR size, genomic location (exon, intron or intergenic), linkage group location and gene-centered information (gene annotation and GO assignments). The resource reported here is fully accessible through our supplementary material (Table S1, Table S2), allowing plant breeders and researchers to easily choose gene-centered markers to test their association with biological processes or phenotypes of agronomic interest. Moreover, we achieved a 100% PCR amplification rate during a genetic survey of 73 SSR markers, supporting the high quality of the predicted SSRs and designed primers. The atlas developed in this study will certainly serve as a toolbox to assist and improve the efficiency of marker-assisted selection in papaya breeding and population genetic studies.

Supporting Information

Table S1.

Catalog of gene-centered SSR markers within genic regions (exon, exon-intron, intron).

https://doi.org/10.1371/journal.pone.0112654.s001

(ZIP)

Table S2.

Catalog of SSR markers within intergenic regions.

https://doi.org/10.1371/journal.pone.0112654.s002

(ZIP)

Table S3.

Class I SSR markers.

https://doi.org/10.1371/journal.pone.0112654.s003

(ZIP)

Table S4.

Distribution of SSR motifs by linkage group.

https://doi.org/10.1371/journal.pone.0112654.s004

(XLS)

Table S5.

Distribution of SSR types by linkage group.

https://doi.org/10.1371/journal.pone.0112654.s005

(XLS)

Table S6.

Number of successfully designed primer pairs for each SSR type, genomic location and linkage group.

https://doi.org/10.1371/journal.pone.0112654.s006

(XLS)

Table S7.

SSR markers for cell wall genes.

https://doi.org/10.1371/journal.pone.0112654.s007

(XLS)

Table S8.

SSR markers for transcriptional/ethylene genes.

https://doi.org/10.1371/journal.pone.0112654.s008

(XLS)

Table S9.

Primer pairs used for polymorphism analysis. Genes related with cell wall metabolism and transcriptional regulation/ethylene signaling genes are highlighted in green and yellow, respectively.

https://doi.org/10.1371/journal.pone.0112654.s009

(XLS)

Author Contributions

Conceived and designed the experiments: NMV ALG HCCR MGP TMV. Performed the experiments: NMV HCCR TMV. Analyzed the data: NMV ALG TMV. Contributed reagents/materials/analysis tools: NMV ALG HCCR MGP TMV. Contributed to the writing of the manuscript: NMV ALG TMV.

References

1. Marotta F, Catanzaro R, Yadav H, Jain S, Tomella C, et al. (2012) Functional foods in genomic medicine: a review of fermented papaya preparation research progress. Acta Biomed 83: 21–29.
- View Article
- Google Scholar
2. Jimenez-Coello M, Guzman-Marin E, Ortega-Pacheco A, Perez-Gutierrez S, Acosta-Viana KY (2013) Assessment of the anti-protozoal activity of crude Carica papaya seed extract against Trypanosoma cruzi. Molecules 18: 12621–12632.
- View Article
- Google Scholar
3. Tu T, Meng K, Bai Y, Shi P, Luo H, et al. (2013) High-yield production of a low-temperature-active polygalacturonase for papaya juice clarification. Food Chemistry 141: 2974–2981.
- View Article
- Google Scholar
4. Evans EA, Ballen FH (2012) An overview of global papaya production, trade, and consumption. Gainesville: University of Florida. 7 p.
5. Kim MS, Moore PH, Zee F, Fitch MM, Steiger DL, et al. (2002) Genetic diversity of Carica papaya as revealed by AFLP markers. Genome 45: 503–512.
- View Article
- Google Scholar
6. Ma H, Moore PH, Liu Z, Kim MS, Yu Q, et al. (2004) High-density linkage mapping revealed suppression of recombination at the sex determination locus in papaya. Genetics 166: 419–436.
- View Article
- Google Scholar
7. Eustice M, Yu Q, Lai CW, Hou S, Thimmapuram J, et al. (2008) Development and application of microsatellite markers for genomic analysis of papaya. Tree Genetics and Genomes 4: 333–341.
- View Article
- Google Scholar
8. Davis MJ, Ying Z, Brunner BR, Pantoja A, Ferwerda FH (1998) Rickettsial relative associated with papaya bunchy top disease. Current Microbiology 36: 80–84.
- View Article
- Google Scholar
9. Gonsalves D (1998) Control of papaya ringspot virus in papaya: a case study. Annual Review of Phytopathology 36: 415–437.
- View Article
- Google Scholar
10. Tautz D, Trick M, Dover GA (1986) Cryptic simplicity in DNA is a major source of genetic variation. Nature 322: 652–656.
- View Article
- Google Scholar
11. Zhou H, Steffenson BJ, Muehlbauer G, Wanyera R, Njau P, et al. (2014) Association mapping of stem rust race TTKSK resistance in US barley breeding germplasm. Theoretical and Applied Genetics.
12. Carrillo E, Satovic Z, Aubert G, Boucherot K, Rubiales D, et al. (2014) Identification of quantitative trait loci and candidate genes for specific cellular resistance responses against Didymella pinodes in pea. Plant Cell Reports.
13. Wang Z, Kang M, Liu H, Gao J, Zhang Z, et al. (2014) High-level genetic diversity and complex population structure of Siberian apricot (Prunus sibirica L.) in China as revealed by nuclear SSR markers. PLoS One 9: e87381.
- View Article
- Google Scholar
14. Yilancioglu K, Cetiner S (2013) Rediscovery of historical Vitis vinifera varieties from the South Anatolia region by using amplified fragment length polymorphism and simple sequence repeat DNA fingerprinting methods. Genome 56: 295–302.
- View Article
- Google Scholar
15. Liu SR, Li WY, Long D, Hu CG, Zhang JZ (2013) Development and characterization of genomic and expressed SSRs in citrus by genome-wide analysis. PLoS One 8: e75149.
- View Article
- Google Scholar
16. Xiao L, Hu Y, Wang B, Wu T (2013) Genetic mapping of a novel gene for soybean aphid resistance in soybean (Glycine max [L.] Merr.) line P203 from China. Theoretical and Applied Genetics 126: 2279–2287.
- View Article
- Google Scholar
17. Kantety RV, La Rota M, Matthews DE, Sorrells ME (2002) Data mining for simple sequence repeats in expressed sequence tags from barley, maize, rice, sorghum and wheat. Plant Molecular Biology 48: 501–510.
- View Article
- Google Scholar
18. Pashley CH, Ellis JR, McCauley DE, Burke JM (2006) EST databases as a source for molecular markers: lessons from Helianthus. Journal of Heredity 97: 381–388.
- View Article
- Google Scholar
19. Victoria FC, da Maia LC, de Oliveira AC (2011) In silico comparative analysis of SSR markers in plants. BMC Plant Biology 11: 15.
- View Article
- Google Scholar
20. Coulibaly I, Gharbi K, Danzmann RG, Yao J, Rexroad CE (2005) Characterization and comparison of microsatellites derived from repeat-enriched libraries and expressed sequence tags. Animal Genetics 36: 309–315.
- View Article
- Google Scholar
21. Varshney RK, Graner A, Sorrells ME (2005) Genic microsatellite markers in plants: features and applications. Trends in Biotechnology 23: 48–55.
- View Article
- Google Scholar
22. Zhang L, Zuo K, Zhang F, Cao Y, Wang J, et al. (2006) Conservation of noncoding microsatellites in plants: implication for gene regulation. BMC Genomics 7: 323.
- View Article
- Google Scholar
23. Parida SK, Dalal V, Singh AK, Singh NK, Mohapatra T (2009) Genic non-coding microsatellites in the rice genome: characterization, marker design and use in assessing genetic and evolutionary relationships among domesticated groups. BMC Genomics 10: 140.
- View Article
- Google Scholar
24. Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, et al. (2012) Phytozome: a comparative platform for green plant genomics. Nucleic Acids Research 40: D1178–1186.
- View Article
- Google Scholar
25. Kolpakov R, Bana G, Kucherov G (2003) mreps: Efficient and flexible detection of tandem repeats in DNA. Nucleic Acids Research 31: 3672–3678.
- View Article
- Google Scholar
26. Morgulis A, Gertz EM, Schaffer AA, Agarwala R (2006) A fast and symmetric DUST implementation to mask low-complexity DNA sequences. Journal of Computational Biology 13: 1028–1040.
- View Article
- Google Scholar
27. Rozen S, Skaletsky H (2000) Primer3 on the WWW for general users and for biologist programmers. Methods in Molecular Biology 132: 365–386.
- View Article
- Google Scholar
28. Sato S, Tabata S, Hirakawa H, Asamizu E, Shirasawa K, et al. (2012) The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485: 635–641.
- View Article
- Google Scholar
29. Bombarely A, Menda N, Tecle IY, Buels RM, Strickler S, et al. (2011) The Sol Genomics Network (solgenomics.net): growing tomatoes using Perl. Nucleic Acids Research 39: D1149–1155.
- View Article
- Google Scholar
30. Doyle J, Doyle J (1990) Isolation of plant DNA from fresh tissue. Focus 12: 3.
- View Article
- Google Scholar
31. Wang J, Chen C, Na J-K, Qingyi Y, Hou S, et al. (2008) Genome-wide comparative analyses of microsatellites in papaya. Tropical Plant Biology: 14.
32. Shi J, Huang S, Fu D, Yu J, Wang X, et al. (2013) Evolutionary dynamics of microsatellite distribution in plants: insight from the comparison of sequenced brassica, Arabidopsis and other angiosperm species. PLoS One 8: e59988.
- View Article
- Google Scholar
33. Leclercq S, Rivals E, Jarne P (2007) Detecting microsatellites within genomes: significant variation among algorithms. BMC Bioinformatics 8: 125.
- View Article
- Google Scholar
34. Merkel A, Gemmell N (2008) Detecting short tandem repeats from genome data: opening the software black box. Briefings in Bioinformatics 9: 355–366.
- View Article
- Google Scholar
35. Lim KG, Kwoh CK, Hsu LY, Wirawan A (2013) Review of tandem repeat search tools: a systematic approach to evaluating algorithmic performance. Briefings in Bioinformatics 14: 67–81.
- View Article
- Google Scholar
36. Li X, Zhu X, Mao J, Zou Y, Fu D, et al. (2013) Isolation and characterization of ethylene response factor family genes during development, ethylene regulation and stress treatments in papaya fruit. Plant Physiology and Biochemistry 70: 81–92.
- View Article
- Google Scholar
37. Giovannoni JJ (2004) Genetic regulation of fruit development and ripening. Plant Cell 16: S170–S180.
- View Article
- Google Scholar
38. Hubbard NL, Pharr DM, Huber SC (1990) Role of sucrose phosphate synthase in sucrose biosynthesis in ripening bananas and its relationship to the respiratory climateric. Plant Physiology 94: 201–208.
- View Article
- Google Scholar
39. Saito K, Kasai Z (1978) Conversion of labeled substrates to sugars, cell-wall polysaccharides, and tartaric acid in grape berries. Plant Physiology 62: 215–219.
- View Article
- Google Scholar
40. Fabi JP, Seymour GB, Graham NS, Broadley MR, May ST, et al. (2012) Analysis of ripening-related gene expression in papaya using an Arabidopsis-based microarray. BMC Plant Biology 12.
41. Quint M, Gray WM (2006) Auxin signaling. Current Opinion in Plant Biology 9: 448–453.
- View Article
- Google Scholar
42. Trainotti L, Tadiello A, Casadoro G (2007) The involvement of auxin in the ripening of climacteric fruits comes of age: the hormone plays a role of its own and has an intense interplay with ethylene in ripening peaches. Journal of Experimental Botany 58: 3299–3308.
- View Article
- Google Scholar
43. Young ET, Sloan JS, Van Riper K (2000) Trinucleotide repeats are clustered in regulatory genes in Saccharomyces cerevisiae. Genetics 154: 1053–1068.
- View Article
- Google Scholar
44. Li YC, Korol AB, Fahima T, Beiles A, Nevo E (2002) Microsatellites: genomic distribution, putative functions and mutational mechanisms: a review. Molecular Ecology 11: 2453–2465.
- View Article
- Google Scholar
45. Li YC, Korol AB, Fahima T, Nevo E (2004) Microsatellites within genes: structure, function, and evolution. Molecular Biology and Evolution 21: 991–1007.
- View Article
- Google Scholar
46. Du Q, Gong C, Pan W, Zhang D (2013) Development and application of microsatellites in candidate genes related to wood properties in the Chinese white poplar (Populus tomentosa Carr.). DNA Research 20: 31–44.
- View Article
- Google Scholar

[ref1] 1. Marotta F, Catanzaro R, Yadav H, Jain S, Tomella C, et al. (2012) Functional foods in genomic medicine: a review of fermented papaya preparation research progress. Acta Biomed 83: 21–29.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Jimenez-Coello M, Guzman-Marin E, Ortega-Pacheco A, Perez-Gutierrez S, Acosta-Viana KY (2013) Assessment of the anti-protozoal activity of crude Carica papaya seed extract against Trypanosoma cruzi. Molecules 18: 12621–12632.
View Article
Google Scholar

[5] View Article

[6] Google Scholar

[ref3] 3. Tu T, Meng K, Bai Y, Shi P, Luo H, et al. (2013) High-yield production of a low-temperature-active polygalacturonase for papaya juice clarification. Food Chemistry 141: 2974–2981.
View Article
Google Scholar

[8] View Article

[9] Google Scholar

[ref4] 4. Evans EA, Ballen FH (2012) An overview of global papaya production, trade, and consumption. Gainesville: University of Florida. 7 p.

[ref5] 5. Kim MS, Moore PH, Zee F, Fitch MM, Steiger DL, et al. (2002) Genetic diversity of Carica papaya as revealed by AFLP markers. Genome 45: 503–512.
View Article
Google Scholar

[12] View Article

[13] Google Scholar

[ref6] 6. Ma H, Moore PH, Liu Z, Kim MS, Yu Q, et al. (2004) High-density linkage mapping revealed suppression of recombination at the sex determination locus in papaya. Genetics 166: 419–436.
View Article
Google Scholar

[15] View Article

[16] Google Scholar

[ref7] 7. Eustice M, Yu Q, Lai CW, Hou S, Thimmapuram J, et al. (2008) Development and application of microsatellite markers for genomic analysis of papaya. Tree Genetics and Genomes 4: 333–341.
View Article
Google Scholar

[18] View Article

[19] Google Scholar

[ref8] 8. Davis MJ, Ying Z, Brunner BR, Pantoja A, Ferwerda FH (1998) Rickettsial relative associated with papaya bunchy top disease. Current Microbiology 36: 80–84.
View Article
Google Scholar

[21] View Article

[22] Google Scholar

[ref9] 9. Gonsalves D (1998) Control of papaya ringspot virus in papaya: a case study. Annual Review of Phytopathology 36: 415–437.
View Article
Google Scholar

[24] View Article

[25] Google Scholar

[ref10] 10. Tautz D, Trick M, Dover GA (1986) Cryptic simplicity in DNA is a major source of genetic variation. Nature 322: 652–656.
View Article
Google Scholar

[27] View Article

[28] Google Scholar

[ref11] 11. Zhou H, Steffenson BJ, Muehlbauer G, Wanyera R, Njau P, et al. (2014) Association mapping of stem rust race TTKSK resistance in US barley breeding germplasm. Theoretical and Applied Genetics.

[ref12] 12. Carrillo E, Satovic Z, Aubert G, Boucherot K, Rubiales D, et al. (2014) Identification of quantitative trait loci and candidate genes for specific cellular resistance responses against Didymella pinodes in pea. Plant Cell Reports.

[ref13] 13. Wang Z, Kang M, Liu H, Gao J, Zhang Z, et al. (2014) High-level genetic diversity and complex population structure of Siberian apricot (Prunus sibirica L.) in China as revealed by nuclear SSR markers. PLoS One 9: e87381.
View Article
Google Scholar

[32] View Article

[33] Google Scholar

[ref14] 14. Yilancioglu K, Cetiner S (2013) Rediscovery of historical Vitis vinifera varieties from the South Anatolia region by using amplified fragment length polymorphism and simple sequence repeat DNA fingerprinting methods. Genome 56: 295–302.
View Article
Google Scholar

[35] View Article

[36] Google Scholar

[ref15] 15. Liu SR, Li WY, Long D, Hu CG, Zhang JZ (2013) Development and characterization of genomic and expressed SSRs in citrus by genome-wide analysis. PLoS One 8: e75149.
View Article
Google Scholar

[38] View Article

[39] Google Scholar

[ref16] 16. Xiao L, Hu Y, Wang B, Wu T (2013) Genetic mapping of a novel gene for soybean aphid resistance in soybean (Glycine max [L.] Merr.) line P203 from China. Theoretical and Applied Genetics 126: 2279–2287.
View Article
Google Scholar

[41] View Article

[42] Google Scholar

[ref17] 17. Kantety RV, La Rota M, Matthews DE, Sorrells ME (2002) Data mining for simple sequence repeats in expressed sequence tags from barley, maize, rice, sorghum and wheat. Plant Molecular Biology 48: 501–510.
View Article
Google Scholar

[44] View Article

[45] Google Scholar

[ref18] 18. Pashley CH, Ellis JR, McCauley DE, Burke JM (2006) EST databases as a source for molecular markers: lessons from Helianthus. Journal of Heredity 97: 381–388.
View Article
Google Scholar

[47] View Article

[48] Google Scholar

[ref19] 19. Victoria FC, da Maia LC, de Oliveira AC (2011) In silico comparative analysis of SSR markers in plants. BMC Plant Biology 11: 15.
View Article
Google Scholar

[50] View Article

[51] Google Scholar

[ref20] 20. Coulibaly I, Gharbi K, Danzmann RG, Yao J, Rexroad CE (2005) Characterization and comparison of microsatellites derived from repeat-enriched libraries and expressed sequence tags. Animal Genetics 36: 309–315.
View Article
Google Scholar

[53] View Article

[54] Google Scholar

[ref21] 21. Varshney RK, Graner A, Sorrells ME (2005) Genic microsatellite markers in plants: features and applications. Trends in Biotechnology 23: 48–55.
View Article
Google Scholar

[56] View Article

[57] Google Scholar

[ref22] 22. Zhang L, Zuo K, Zhang F, Cao Y, Wang J, et al. (2006) Conservation of noncoding microsatellites in plants: implication for gene regulation. BMC Genomics 7: 323.
View Article
Google Scholar

[59] View Article

[60] Google Scholar

[ref23] 23. Parida SK, Dalal V, Singh AK, Singh NK, Mohapatra T (2009) Genic non-coding microsatellites in the rice genome: characterization, marker design and use in assessing genetic and evolutionary relationships among domesticated groups. BMC Genomics 10: 140.
View Article
Google Scholar

[62] View Article

[63] Google Scholar

[ref24] 24. Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, et al. (2012) Phytozome: a comparative platform for green plant genomics. Nucleic Acids Research 40: D1178–1186.
View Article
Google Scholar

[65] View Article

[66] Google Scholar

[ref25] 25. Kolpakov R, Bana G, Kucherov G (2003) mreps: Efficient and flexible detection of tandem repeats in DNA. Nucleic Acids Research 31: 3672–3678.
View Article
Google Scholar

[68] View Article

[69] Google Scholar

[ref26] 26. Morgulis A, Gertz EM, Schaffer AA, Agarwala R (2006) A fast and symmetric DUST implementation to mask low-complexity DNA sequences. Journal of Computational Biology 13: 1028–1040.
View Article
Google Scholar

[71] View Article

[72] Google Scholar

[ref27] 27. Rozen S, Skaletsky H (2000) Primer3 on the WWW for general users and for biologist programmers. Methods in Molecular Biology 132: 365–386.
View Article
Google Scholar

[74] View Article

[75] Google Scholar

[ref28] 28. Sato S, Tabata S, Hirakawa H, Asamizu E, Shirasawa K, et al. (2012) The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485: 635–641.
View Article
Google Scholar

[77] View Article

[78] Google Scholar

[ref29] 29. Bombarely A, Menda N, Tecle IY, Buels RM, Strickler S, et al. (2011) The Sol Genomics Network (solgenomics.net): growing tomatoes using Perl. Nucleic Acids Research 39: D1149–1155.
View Article
Google Scholar

[80] View Article

[81] Google Scholar

[ref30] 30. Doyle J, Doyle J (1990) Isolation of plant DNA from fresh tissue. Focus 12: 3.
View Article
Google Scholar

[83] View Article

[84] Google Scholar

[ref31] 31. Wang J, Chen C, Na J-K, Qingyi Y, Hou S, et al. (2008) Genome-wide comparative analyses of microsatellites in papaya. Tropical Plant Biology: 14.

[ref32] 32. Shi J, Huang S, Fu D, Yu J, Wang X, et al. (2013) Evolutionary dynamics of microsatellite distribution in plants: insight from the comparison of sequenced brassica, Arabidopsis and other angiosperm species. PLoS One 8: e59988.
View Article
Google Scholar

[87] View Article

[88] Google Scholar

[ref33] 33. Leclercq S, Rivals E, Jarne P (2007) Detecting microsatellites within genomes: significant variation among algorithms. BMC Bioinformatics 8: 125.
View Article
Google Scholar

[90] View Article

[91] Google Scholar

[ref34] 34. Merkel A, Gemmell N (2008) Detecting short tandem repeats from genome data: opening the software black box. Briefings in Bioinformatics 9: 355–366.
View Article
Google Scholar

[93] View Article

[94] Google Scholar

[ref35] 35. Lim KG, Kwoh CK, Hsu LY, Wirawan A (2013) Review of tandem repeat search tools: a systematic approach to evaluating algorithmic performance. Briefings in Bioinformatics 14: 67–81.
View Article
Google Scholar

[96] View Article

[97] Google Scholar

[ref36] 36. Li X, Zhu X, Mao J, Zou Y, Fu D, et al. (2013) Isolation and characterization of ethylene response factor family genes during development, ethylene regulation and stress treatments in papaya fruit. Plant Physiology and Biochemistry 70: 81–92.
View Article
Google Scholar

[99] View Article

[100] Google Scholar

[ref37] 37. Giovannoni JJ (2004) Genetic regulation of fruit development and ripening. Plant Cell 16: S170–S180.
View Article
Google Scholar

[102] View Article

[103] Google Scholar

[ref38] 38. Hubbard NL, Pharr DM, Huber SC (1990) Role of sucrose phosphate synthase in sucrose biosynthesis in ripening bananas and its relationship to the respiratory climateric. Plant Physiology 94: 201–208.
View Article
Google Scholar

[105] View Article

[106] Google Scholar

[ref39] 39. Saito K, Kasai Z (1978) Conversion of labeled substrates to sugars, cell-wall polysaccharides, and tartaric acid in grape berries. Plant Physiology 62: 215–219.
View Article
Google Scholar

[108] View Article

[109] Google Scholar

[ref40] 40. Fabi JP, Seymour GB, Graham NS, Broadley MR, May ST, et al. (2012) Analysis of ripening-related gene expression in papaya using an Arabidopsis-based microarray. BMC Plant Biology 12.

[ref41] 41. Quint M, Gray WM (2006) Auxin signaling. Current Opinion in Plant Biology 9: 448–453.
View Article
Google Scholar

[112] View Article

[113] Google Scholar

[ref42] 42. Trainotti L, Tadiello A, Casadoro G (2007) The involvement of auxin in the ripening of climacteric fruits comes of age: the hormone plays a role of its own and has an intense interplay with ethylene in ripening peaches. Journal of Experimental Botany 58: 3299–3308.
View Article
Google Scholar

[115] View Article

[116] Google Scholar

[ref43] 43. Young ET, Sloan JS, Van Riper K (2000) Trinucleotide repeats are clustered in regulatory genes in Saccharomyces cerevisiae. Genetics 154: 1053–1068.
View Article
Google Scholar

[118] View Article

[119] Google Scholar

[ref44] 44. Li YC, Korol AB, Fahima T, Beiles A, Nevo E (2002) Microsatellites: genomic distribution, putative functions and mutational mechanisms: a review. Molecular Ecology 11: 2453–2465.
View Article
Google Scholar

[121] View Article

[122] Google Scholar

[ref45] 45. Li YC, Korol AB, Fahima T, Nevo E (2004) Microsatellites within genes: structure, function, and evolution. Molecular Biology and Evolution 21: 991–1007.
View Article
Google Scholar

[124] View Article

[125] Google Scholar

[ref46] 46. Du Q, Gong C, Pan W, Zhang D (2013) Development and application of microsatellites in candidate genes related to wood properties in the Chinese white poplar (Populus tomentosa Carr.). DNA Research 20: 31–44.
View Article
Google Scholar

[127] View Article

[128] Google Scholar

Figures

Abstract

Introduction

Materials and Methods

Genomic data and SSR annotation

Simple Sequence Repeats Identification

Primer design

Analysis of genes involved in fruit ripening

SSR screening and polymorphism survey

Data Access and Retrieval

Results and Discussion

SSR classification and genomic positioning

Design of high-quality primer pairs for papaya SSRs

Analysis of SSRs located in genes related to fruit ripening

SSR screening and polymorphism survey

Conclusion

Supporting Information

Author Contributions

References