Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Novel and Stress Relevant EST Derived SSR Markers Developed and Validated in Peanut

  • Tejas C. Bosamia,

    Affiliations Crop Improvement Division, ICAR- Directorate of Groundnut Research, Junagadh, Gujarat, 362001, India, Department of Biotechnology, College of Agriculture, Junagadh Agricultural University, Junagadh, Gujarat, 362001,India

  • Gyan P. Mishra,

    Affiliation Crop Improvement Division, ICAR- Directorate of Groundnut Research, Junagadh, Gujarat, 362001, India

  • Radhakrishnan Thankappan ,

    radhakrishnan.nrcg@gmail.com

    Affiliation Crop Improvement Division, ICAR- Directorate of Groundnut Research, Junagadh, Gujarat, 362001, India

  • Jentilal R. Dobaria

    Affiliation Crop Improvement Division, ICAR- Directorate of Groundnut Research, Junagadh, Gujarat, 362001, India

Novel and Stress Relevant EST Derived SSR Markers Developed and Validated in Peanut

  • Tejas C. Bosamia, 
  • Gyan P. Mishra, 
  • Radhakrishnan Thankappan, 
  • Jentilal R. Dobaria
PLOS
x

Correction

15 Jul 2015: The PLOS ONE Staff (2015) Correction: Novel and Stress Relevant EST Derived SSR Markers Developed and Validated in Peanut. doi: info:doi/10.1371/journal.pone.0133537 View correction

Abstract

With the aim to increase the number of functional markers in resource poor crop like cultivated peanut (Arachis hypogaea), large numbers of available expressed sequence tags (ESTs) in the public databases, were employed for the development of novel EST derived simple sequence repeat (SSR) markers. From 16424 unigenes, 2784 (16.95%) SSRs containing unigenes having 3373 SSR motifs were identified. Of these, 2027 (72.81%) sequences were annotated and 4124 gene ontology terms were assigned. Among different SSR motif-classes, tri-nucleotide repeats (33.86%) were the most abundant followed by di-nucleotide repeats (27.51%) while AG/CT (20.7%) and AAG/CTT (13.25%) were the most abundant repeat-motifs. A total of 2456 EST-SSR novel primer pairs were designed, of which 366 unigenes having relevance to various stresses and other functions, were PCR validated using a set of 11 diverse peanut genotypes. Of these, 340 (92.62%) primer pairs yielded clear and scorable PCR products and 39 (10.66%) primer pairs exhibited polymorphisms. Overall, the number of alleles per marker ranged from 1-12 with an average of 3.77 and the PIC ranged from 0.028 to 0.375 with an average of 0.325. The identified EST-SSRs not only enriched the existing molecular markers kitty, but would also facilitate the targeted research in marker-trait association for various stresses, inter-specific studies and genetic diversity analysis in peanut.

Introduction

Peanut or groundnut (Arachis hypogaea L.), is generally cultivated in low-input farming systems, between 40° N and 40° S in the semi-arid tropical and sub-tropical regions of the world [1]. It is the sixth major oil-yielding, leguminous cash crop, which is cultivated in India approximately 20–25 million ha area, with a production of 35–40 million tons of pods annually [2]. It is a self-pollinated allotetraploid (2n = 4x = 40, AABB) crop having ten basic chromosomes with DNA content of about 2813 Mbp per 1C [3] with approximately 50000–70000 genes [4]. It belongs to genus Arachis, which is grouped into nine sections and includes approximately 80 species [5]. It is believed to have originated from a few or even a single hybridization event between Arachis duranensis (AA) and Arachis ipaensis (BB) [6], followed by spontaneous chromosome duplication [7].

Despite the high nutritional and economic implications; production and productivity is constrained due to various types of stresses, in most peanut growing areas of the world [1,811]. This crop is mostly grown under rain-fed conditions, where drought is a major constraint, limiting its productivity [11]. Similarly, salinity-affected area is approximately 830 million ha globally [12], an issue that deserves major attention. Marker-assisted selection is an important tool to enhance tolerance/resistance to these stresses and has the potential to enable faster and larger gains through genetic improvement [13]. Therefore, there is a need for the development of markers having functional relevance to various stresses in peanut.

In peanut molecular breeding programme, PCR based markers, like simple sequence repeats (SSRs) are considered as the marker of choice, due to many desirable attributes like co-dominance, high-variability, reproducibility, wide genome coverage and ease in use [14]. Peanut has comparatively lower genomic resources (including transcriptome data) compared to other legumes like medicago [15], lotus [16], and chickpea [17] which adds to the necessity of developing more robust molecular markers, specifically the genic ones, as they provide the insight into the functional information. Although, quite recently in April 2014, the genome sequence of two probable progenitors of cultivated groundnut, A. duranensis and A. ipeansis was completed with information available in public domain (http://peanutbase.org/), but the genome sequence of cultivated peanut, A. hypogaea is yet to be sequenced.

Development of SSRs using conventional experimental methods like SSR-enriched or non-enriched genomic library construction is quite laborious, time consuming and expensive. However, in silico development of Expressed Sequence Tag (EST) derived SSRs through mining of publicly available database are relatively simpler and cost effective [18,19]. Such EST-SSRs finds potential use in genetic research for molecular breeding in peanut. Alternatively, with the development of peanut based EST projects, vast amount of EST data has been generated which is available in the NCBI databases and offers an opportunity to identify SSRs [20,21]. However, the available peanut high throughput transcriptome sequences are not complete; many have low N50 values, ranging from 500 to 750 bp [2225]. Because peanut has such a large number of genes, it is important to have a good representation of the transcriptome. On other hand, ESTs data of cultivated peanut is still remain unexplored for the development of SSR markers.

In peanut, work has been documented by various research groups on the development and characterization of both genomic SSRs [2632] as well as genic SSRs [20,21,24,3338]. However, nearly 10000 SSRs were available in peanut in public domain, which are grossly inadequate for genetic studies. Moreover, several peanut genetic maps have been published, but marker density is not satisfactory especially in the context of large genome-size and 20 linkage groups. Besides, the bottleneck with reference to paucity of polymorphic markers in cultivated peanut compared to the wild genotypes, makes an exclusive demand for the development of large number of markers for the effective molecular breeding in peanut [39,40]. Very recently, different success stories have started coming up, where a few SSR markers were validated and successfully used for the improvement of biotic stress resistance in peanut [10,41,42].

In the present study, large number of novel EST derived SSR markers have been developed and markers with functional relevance to different stresses have been validated, which will provide valuable resource for linkage mapping, integration of QTLs, comparative genomics and marker-assisted selection for improving peanut genotypes for various traits.

Materials and Methods

All the experiments were done in the labs and fields of the Directorate of Groundnut Research and no animals were used

Plant materials and DNA extraction

Eleven peanut genotypes, differing in their ability to various biotic and abiotic stresses, and also used as parental lines, for the development of various mapping population by different research groups, were used to screen the developed EST-SSR markers [4347] (Table 1). The seeds of these genotypes were obtained from the Genetic Resources Section of the Directorate of Groundnut Research, Gujarat (India).

thumbnail
Table 1. List of parental genotypes used in current study along with its pedigree.

http://dx.doi.org/10.1371/journal.pone.0129127.t001

Two seeds of each genotype were grown in plastic pots filled with sand, under controlled conditions. Genomic DNA was extracted from fresh leaf tissue of one week old plants by CTAB method [48]. The quality of DNA was checked on 0.8% (w/v) agarose gel with λ DNA as standard and DNA was quantified using NanoDrop ND-1000 (NanoDrop products, DE, USA). The working concentration of DNA was adjusted to 20 ng μL-1.

Data mining and processing of Arachis hypogaea ESTs sequences

From the NCBI database, 178490 EST sequences of A. hypogaea were downloaded (till 20th March, 2013). Similarity search was done using all the SSR primer pairs sequence information available in the public domain. The candidate EST sequences were removed, using in-house Perl script (S1 Script). Remaining EST sequences were then processed for the removal of low-complexity regions which included trimming of poly- A, poly- T tracts, sequence ends rich in undetermined bases and low quality sequences (<100 bp). After that, NCBI UniVec databases (http://www.ncbi.nlm.nih.gov/VecScreen/UniVec.html) were used to detect potential vector contaminations, and were removed by Cross_Match software [49]. Finally, the remaining ESTs were assembled using TGICL software [50] using the command, tgicl<fasta_file>-p 95-l 50-v 20-s 10 000-O ‘-p 95-y 20-o 50’ [51] to reduce the sequence redundancy.

Identification of microsatellites and functional annotation

For identifying the SSRs, the processed EST sequences were screened using MIcroSAtellite identification tool (MISA) [52] based on unit size or minimum number of repeats (2–6, 3–5, 4–4, 5–3 and 6–3) and the maximum number of bases interrupting two SSRs in a compound microsatellite = 100. The SSR containing sequences were separated by in-house Perl script (S2 Script) and subjected to online functional annotation tool, Blast2Go wherein BlastX of EST sequences (with e-value cut-off 10–6 or better) was carried out against the NCBI non-redundant protein sequence database (nr). Gene ontology terms were assigned to SSR containing sequences and visualized by online software WEGO [53] to understand the distribution of the gene functions.

Primer designing

SSR containing EST sequences were used to develop EST-SSR primer pairs using online software BatchPrimer3 v1.0 [54]. The criteria used for primer designing were as follows primer length 18–23 bp, with optimum value 20 bp; Tm 57–63°C, with optimum value 60°C; GC content 40–60%, with the optimum value 50%; maximum Tm difference between forward and reverse primer 1.5°C and product size range 100–300 bp optimum value 150 bp.

Screening and assessment of polymorphic EST-SSRs

The newly designed primers were further selected based on its relevance to various stresses and other functional unigenes. The selected primers were synthesized from Xcleris, India and polymerase chain reaction (PCR) was performed. The PCR mixtures (10 μl) contained 1μL template DNA (20 ng), 1 μL 10x Taq buffer + MgCl2 (15 mM), 0.8 μL dNTP (2 mM), 1.0 μL primers (10 p moles each, Forward and Reverse), 0.1 μL Taq polymerase (Promega 5U μL-1) and 5.1 μL sterile double distilled water. Amplification was performed in 0.2 mL (each tube) thin walled PCR plates (96 wells plate-1) in a thermal cycler (Eppendorf, Germany).

A touch down PCR amplification profile was programmed for 94°C for 3 min of initial denaturation, followed by first 5 cycles of 94°C for 30 s, 65°C to 60°C for 30 s and 72°C for 1 min, with 1°C decrement in annealing temperature per cycle, then 30 cycles of 94°C for 30 s with constant annealing temperature of 60°C and 72°C for 1 min followed by a final extension for 7 min at 72°C. Amplified products were analyzed using 6% non-denaturing poly-acrylamide gel at constant power 225 volts for about 2.5–3.0 h and stained with Ethidium bromide [10,55]. The gels were documented in automated gel documentation system (Fuji FLA-5000 PhosphorImager, Japan) and scored for the marker amplification.

Data analysis

The size range of the amplified fragments for each microsatellite was estimated by using 50 bp DNA ladder (Fermentas, USA). Number of alleles and polymorphism information content (PIC) value for each polymorphic EST-SSR markers were determined using PowerMarker version 3.25 [56] The PIC was calculated as formula where Pi and Pj are the frequencies of the ith and jth alleles, respectively [57].

Results and Discussion

Sequence data assembly by TGICL program

Redundancy of SSR markers developed from different research groups along with the use of non-uniform marker names have resulted in duplicate genotyping of peanut germplasm and inefficient use of resources for peanut genomics [58]. Thus, development of non-redundant novel set of markers which excludes publically available SSR markers could assist in enhancing such efforts towards generation of genomic resources. In this context, an attempt has been made to develop non-redundant markers from ESTs database available from the public databases. Analysis of all publicly available Arachis hypogaea ESTs (178490 numbers) indicated a huge variation in length of sequences which ranged from 37 to 2038 bp with an average of 571 bp.

Non-redundant sequence assembly.

In order to find novel non-redundant EST-SSR markers, all the publically available SSR primer sequences [20,21,24,26,27,29,34,35,37,5860] were subjected to sequence similarity search with available EST sequences and 23,696 (13.28%) very similar sequences were excluded from further analysis. This step contributed to the identification of novel SSR markers not yet reported in the peanut. Prior to EST assembling, remaining sequences were subjected to pre-processing viz., removal of vector contamination, low-complexity sequences of less than 100 bp and poly A/T sequences. Pre-processing reduced the overall noise in EST data and thus improved the efficacy of subsequent analysis [61].

The sequences downloaded from NCBI displayed huge variation in its length with an average length of 571 bp, which indicated considerable amount of long transcripts. Assembling the ESTs constitutes an important step to provide non-redundant and high-quality sequences for the development of SSRs [61]. The long as well as high quality transcripts, totaling 138628 were assembled using TGICL software (Table 2). The TGICL software is quite efficient in handling long reads or transcripts [50] and there have been several reports of using TGICL for clustering the EST transcripts [17,38]. Moreover, TGICL assembly was also used to reduce the redundancy present in the publically available ESTs [62]. In the present study, TGICL assembly reduced the redundancy by 82.21% and generated unigenes with an average length of 857 bp and contig N50 length of 942 bp.

thumbnail
Table 2. Summary and statistics of Arachis hypogaea ESTs assembled by TGICL program at stringency of 50 bp similarity and 95% identity.

http://dx.doi.org/10.1371/journal.pone.0129127.t002

The N50 value is a statistical measure of average length of a set of sequence, which is used especially in reference to contig length. Within an assembly, it was found considerably higher than earlier report of 823 bases [23] whereas, similar results were reported by Chen et al. [63] with an average read length of 589 bp and N50 length of 974 bp, while assembling the long transcripts. This means, pre-processed long EST transcripts were handled quite efficiently by the TGICL assembler. Higher values of N50 and average read length of clustered datasets, provides longer lengths of contigs for selection of SSR flanking sequence, thus assisting in efficient designing of SSR primers [64]. In terms of de novo transcriptome assembly, TGICL performs better than any other assembler as it was developed especially for assembling the long EST reads [50]. Corroborating with the same, Bräutigam et al. [65] also judged TGICL as one of the best assembler in terms of contig length, hybrid assemblies, redundancy reduction and error tolerance in the study of non-model C3 and C4 crops.

Identification and characterization of EST- SSR motifs

Out of 16424 unigenes subjected for SSR screening, 2784 (16.95%) SSR containing unigenes were identified harboring 3373 SSR motifs. The sequences harboring more than one SSR and compound SSRs were 17.49% (487) and 10.38% (289) respectively with an average frequency of one SSR per 4.17 Kb (Table 3). The density of SSR containing sequences was higher in the present study than previous reports in peanut viz., 13.34% [24] and 12.41% [21]. The frequency of SSRs obtained was also consistent with the frequency range of 2.65 to 10.62%, which has been reported in 49 dicot species [66]. Moreover, the frequency of EST-SSRs is known to be significantly influenced by the factors like repeat length and the criteria used for SSRs mining [67].

thumbnail
Table 3. Feature of microsatellites identified by MISA in non redundant EST sequences of Arachis hypogaea.

http://dx.doi.org/10.1371/journal.pone.0129127.t003

In earlier reports, frequency of SSR motifs in Arachis hypogaea was observed in the range of one SSR per 4.52 to 7.3 kb [21,37,38]. However, in this study, it is slightly less (one SSR per 4.17 Kb) which reflected relatively high density of SSRs in the EST sequences. The frequency of SSRs in other legumes, like chickpea and Medicago is reported as one SSR per 8.54 kb [68] and 7.47 kb [69] respectively. However, any sort of direct comparison about abundance and frequency of SSRs with other reports is very difficult, since the estimates were dependent on various factors like SSR search criteria, size of the dataset, database mining tools and the EST sequence redundancy [67,69].

Functional annotation of unigene containing SSRs

The SSR containing unigenes were subjected to functional annotation by Blast2GO software so as to find its putative function(s). A total of 2463 (88.51%) unigenes matched with BlastX search of nr proteins database with e-value cut-off 10–6 or better, of which 2027 (72.81%) SSR containing ESTs were fully annotated for functional protein-encoding sequences, whereas 756 (27.19%) were either putative or hypothetical or uncharacterized or unknown or with no considerable homology (Table 3). The uncharacterized transcripts can be attributed to the unavailability of fully annotated peanut genomic data, and also to the limited numbers of characterized transcripts in this crop. Partially, it can also be attributed to the lack of information with respect to number of protein-encoding genes and transcripts derived from alternative splicing in peanut [63]. However, the percentage of uncharacterized transcripts was very less, compared to 52.77% of sequences having no significant match in study of whole plant transcriptome of A. hypogaea Spanish botanical type [70]. Hence, it can be assured that most of the sequences harboring SSRs were annotated using Blast2GO functional annotation tool by and maximum data was utilized.

The best blast hit distribution revealed 48.2% sequences having similarity with Glycine max followed by Cicer arietinum (16.6%) and other legumes (Fig 1). On the other hand, the best blast-hits distribution with Arachis hypogaea contributed only 5.08% sequence similarity, which is mainly due to the less number of genes identified and characterized in peanut, compared to other legumes like Glycine max or Cicer arietinum [63]. The results were in agreement with the conservation of SSR loci and high level of synteny across the legumes [13,71].

thumbnail
Fig 1. Distribution of best Blast hits for Arachis hypogaea ESTs-SSR containing sequences against other species.

(*Others constitute the species having <2% similarity in Blast hit)

http://dx.doi.org/10.1371/journal.pone.0129127.g001

Assignment of Gene Ontology (GO) terms.

On the basis of gene annotations, of 2784 SSR containing sequences, 2027 (72.81%) were annotated and assigned in 4124 gene ontology (GO) terms, which are further categorized into biological ontology of cellular components (391), molecular functions (1120) and biological processes (2613). Distribution of GO term in cellular components, molecular functions, and biological processes revealed the maximum association with cell part (GO: 0044464), binding genes (GO: 0005488), and cellular process (GO: 0009987) respectively (Fig 2). Similar proportions of the biological ontology were also observed in the study of transcriptome analysis during seed development in peanut [23]. Additionally, 31.91% sequences were found to be associated with response to stimulus (GO: 0050896) encompassing biotic as well as abiotic stresses. The gene ontology categorization of EST sequences harboring SSR is represented in the Fig 2.

thumbnail
Fig 2. Distribution of most abundant Gene ontology (GO) terms assigned to 2027 annotated SSR containing sequences.

http://dx.doi.org/10.1371/journal.pone.0129127.g002

Abundance and distribution of EST-SSRs motifs

Among 3373 SSR motifs, tri-nucleotide motifs were found most pre-dominant (33.86%) followed by di-nucleotide repeats (27.51%). Tetra-, penta- and hexa- nucleotides repeats recorded 7.47%, 13.85%, 17.31% frequencies respectively (Fig 3). As reported earlier, tri-nucleotide repeats were generally most abundant in SSR markers of both dicots and monocots [67,69]. The tri-nucleotide repeat abundance in the present study also corroborated with the earlier studies in peanut [20,21,24] and other legumes like medicago [69], chickpea [68] and field-pea [72]. The abundance of tri-nucleotide repeat motifs is quite common for EST-derived SSRs, as additions or deletions within translated regions, mostly do not disturb the open reading frames (ORFs), and thus can be tolerated more over other types of repeats [72,73]. It is also very well shown by Metzgar et al. [74] that in exons, trinucleotide repeats are invariably the most abundant in all taxa. Moreover, 6 bp di-nucleotide repeats comprised highest (30.6%), among different types of di-nucleotide repeats (Table 4). Like tri-nucleotide repeats, this combination also does not alter the ORF, largely at the functional level, thus favoured and retained by the system [73]. Among the di-nucleotide repeats, AG/TC (75.2%) was the most abundant while AT/AT and AC/GT motifs accounted for 15.4% and 9.4% respectively (Table 4).

thumbnail
Table 4. Frequency distribution of di—and tri- nucleotide motif repeats in peanut.

http://dx.doi.org/10.1371/journal.pone.0129127.t004

Of the tri-nucleotide repeat motifs, AAG/CTT (39.1%) was the most abundant followed by ATC/ATG (14.3%) and AAT/ATT (14.2%) (Table 4). The abundance of AAG repeat affirms the earlier report of Zhao et al. [58]. The AT-rich tri-nucleotide motif (AAG/CTT, AAT/ATT, ACT/AGT, and ATC/ATG comprised ~70%) were more abundant than GC rich tri-nucleotide motif (ACC/GGT, ACG/CGT, AGC/CTG, AGG/CCT and CCG/CGG comprised ~30%) in peanut EST-SSRs. Such abundance affirmed with earlier reports in other legume crops like chickpea [68], and faba bean [75]. It could be noted that tri-nucleotide motifs are conserved in the genic regions among the legume plants and is also supported by the fact that certain motifs coding for structural proteins are conserved in legumes [76].

In the EST-SSR loci, each tri-nucleotide repeat motif codes a specific amino acid, which plays an important role in various cellular, biological, and metabolic processes in plants [77]. The percentage of tri-nucleotide motifs AAG/CTT, which codes for leucine and lysine was the highest (39.14%) followed by isoleucine and methionine coding repeats ATC/ATG (14.42%). Generally, CG/GC and CCG/CGG are very rare in dicotyledonous plants but common in monocots. In this investigation, out of the 3373 SSRs studied, no GC/CG repeats and only 29 CCG/CGG repeats were found, which is in agreement with the previous results [21,38,58].

Overall, among the total SSR motifs, AG/CT (20.7%) and AAG/CTT (13.25%) were the most abundant motifs followed by ATC/ATG (4.83%), AAT/ATT (4.80%) and AT/TA (4.24%) (S1 Table). However, Koilkonda et al. [21] indicated that AAG/CTT motifs were the most abundant followed by AG/CT motif. In general, AG/CT and AAG/CTT motifs were the most predominant in the earlier reports in peanut [20,78]. Similar patterns of repeat motifs were also observed in other crop like chickpea [68], castor bean [79] and Medicago [69].

The SSR motifs were also classified based on their motifs length [76,80]. Among the total SSRs, 657 (19.47%) SSR motifs were of more than 20 bp (class І) while 2716 (80.53%) were of less than 20 bp (class II) (Table 5). The class ІІ SSRs were present in more numbers than class І, which is in agreement with earlier studies in peanut [14,78]. The frequency of SSR motifs decreased with increase in the length of motifs, indicating negative correlation between frequency and length of motifs. Interestingly, in both class І and class ІІ, tri-nucleotides repeat motifs were detected in higher proportion. In class I microsatellites, the proportion of trinucleotide (266 No) was higher compared to di- (223 No), hexa- (122 No), tetra- (29 No) and penta- nucleotide (17 No). Similarly, in the class II microsatellite, the proportion of tri-nucleotides (876 No) was more than rest of the repeat motifs (Table 5).

Designing novel primer sets

A total of 2456 primer pairs were designed for 3373 SSR motifs, of which maximum are of tri- (33.92%), followed by di- (20.20%), hexa- (18.69%), penta- (12.66%), compound- (8.35%), and tetra- nucleotides (6.19%). The primers were named by adding the prefix DGR (acronym for Directorate of Groundnut Research) followed by the numbers (S2 and S3 Tables). The remaining SSR containing sequences, either fail to generate primer-pair due to unavailability of flanking site for primer designing or it did not matched the primer designing parameters. Since, EST-SSR markers are generally transferable among distantly related species; therefore, these markers could be used in other Arachis species for which little or no information is available on SSRs or ESTs. It is reported that approximately, 96% of the primers designed for Medicago truncatula produced amplicons in six other Medicago species, and 66% primers were polymorphic between medicago and ryegrass [81]. Moreover, these markers are also very good candidates for the development of conserved orthologous markers especially for genetic analysis and breeding of minor or poorly-funded crop species including wild Arachis species.

Sorting of EST-SSRs based on functional relevance and its PCR validation.

The genic microsatellites are such class of marker which can target functional polymorphisms within genes and contribute to the ‘direct allele selection’, for any target trait [67]. It could be noted that the SSR motifs are conserved in the genic regions among the legume plants and certain motifs coding for structural proteins are conserved in legumes [76]. Although, SSRs are non-randomly distributed across protein-coding regions, UTRs, and introns [82], but UTRs are predominant in microsatellites than CDS [19,83] and presence of SSRs in the 5’ UTRs are required for gene regulation. Moreover, SSRs in the 3’ UTRs are needed for transcription slippage and expanded mRNA production, which can disrupt splicing and other cellular functions [19]. Role of SSRs in functional genes are not yet properly understood, and further studies are needed to find the effect of microsatellites in gene expression of functional gene containing SSR. However, structural variation in SSR motifs, regulating the gene expression at transcript level and altering the phenotype was reported for amylose content and waxy gene expression [84]. Considering this, it is emphasized that the selection of SSR motifs on the basis of functional relevance to different stresses can be advantageous, so as to get more number of polymorphism in the relevant genotypes. Therefore, the newly designed primers were sorted based on its relevance to various stresses and other functional unigenes containing SSRs (S2 Table).

Among 366 primer pairs, 339 (92.62%) were amplified, which illustrated the precision or suitability of in silico primer designing criteria employed for primer designing (Table 6). Within the PCR validated EST-SSRs, 295 (80.6%) and 71 (19.4%) were derived from stress relevant and other functional unigenes respectively. Out of all the synthesized primer-pairs, 39 (10.66%) displayed polymorphism within 11 selected peanut genotypes, of which, 34 and 5 were derived from stress relevant and other functional unigenes respectively (Table 7 and S2 Table). Total number of detected alleles ranged from 1–12 with an average of 3.77 alleles per marker (S2 Table).

thumbnail
Table 6. PCR validation of SSRs having functional relevance to various stresses across selected peanut genotypes.

http://dx.doi.org/10.1371/journal.pone.0129127.t006

thumbnail
Table 7. List of polymorphic primer with predicted function based on sequence homology.

http://dx.doi.org/10.1371/journal.pone.0129127.t007

On the similar lines, Peng et al. [85] have also observed 89.2% primer amplification and 6.5% polymorphism in their EST derived SSR marker set. Comparatively, in the present study, on an average higher number of alleles were recorded. Pandey et al. [14] also got nearly similar results for allele numbers, which ranged from 2 to 14 with an average of 3.2 alleles in cultivated peanut. These results also corresponded well with other studies in peanut, where 2.3 [31], 2.44 [30] and 2.99 [86] alleles per marker is reported. A total of 384 AG/CT di-nucleotide motifs were utilized for primer designing, of which 327 (85%) were having functional annotations. Among these, 55 primers with functional enrichment were synthesized and validated on 11 diverse genotypes, resulted in 8 polymorphic primers (S2 Table).

Among the 39 polymorphic markers, 2–12 alleles amplified with average of 5.1 alleles per marker. The PIC values of polymorphic primers ranged from 0.028 to 0.375 with an average of 0.325 (Table 7). In general, the PIC values of less than 0.5 is also reported for the EST-SSR markers developed by other research groups in cultivated peanut [14,87], which is evident of low level of polymorphism in those genotypes. It has also been well documented that EST-SSRs are less polymorphic than genomic SSRs because of greater DNA sequence conservation in transcribed regions [14,88]. On the basis of their sequence homology with stress relevant genes, the polymorphic EST-SSR primers showing high PIC values (in parenthesis) like DGR-48 (0.375), DGR-58 (0.375), DGR-105 (0.375), DGR-172 (0.375), DGR-174 (0.370), DGR-198 (0.375), DGR-258 (0.375), DGR-289 (0.365), DGR-316 (0.365), DGR-362 (0.375) and DGR-312 (0.375) could be further validated on mapping population. However, two EST-SSR markers viz., DGR-259 (0.375) and DGR-263 (0.372) associated with the other functional genes have also shown higher degree of polymorphism (Table 7).

Conclusions

In peanut, a constant increase in the volume of sequence data generated from EST projects running in different labs across the world has facilitated the identification of a large number of genic SSRs. During recent years, a wealth of genomic data has been generated in peanut by high throughput transcriptome sequencing [25,63,85,89,90]. Besides, EST database are equally informative and consequential as high throughput transcriptome data. The present work offers complete utilization of EST database for development of SSR markers exclusively from cultivated peanut.

The information of polymorphic EST-SSRs markers not only facilitated better understanding the nature of SSRs in the peanut genome, but also provided a useful source for conducting additional genetic and genomic studies to improve this crop [58]. As demonstrated by the functional annotation, these polymorphic EST-SSR markers increased the chances of linkage to loci, contributing to stress tolerance or resistance. Because of their association to the coding regions, these polymorphic markers could be further validated on mapping population segregating for various biotic and abiotic stress-tolerance or resistance traits. As these genic EST-SSRs are more likely to be conserved between closely related species, they can also facilitate better cross genome comparisons [91]. The most noticeable feature of EST-SSRs is its transferability in related species which makes it potentially more useful for comparative mapping studies [67]. These markers could be also employed in characterizing related legume genomes, with no prior available information. There is a need to validate all the developed EST-SSRs markers, for polymorphism, so as to enhance the density of the existing genetic maps of peanut. In the longer run, development of allele-specific markers for the genes controlling various biotic and abiotic traits will be important for QTL mapping and marker-assisted selection in peanut improvement.

To sum up, this study reports the primer sequences for 2456 novel EST-SSR markers, and analysis of 366 markers, selected on the basis of stress related functions, on a set of 11 diverse genotypes, identified 39 polymorphic markers. It is hoped that the identified EST-SSR markers will not only enrich the current marker resources but also benefit the international peanut research community working on molecular breeding.

Supporting Information

S1 Fig. Representative gel photograph showing amplification of 4 DGR primers (a. DGR- 41, b. DGR- 114, c. DGR-146 and d. DGR-179) in 11 peanut genotypes.

Where M: 50 bp DNA marker, 1: GPBD-4, 2: JSP-39, 3: R-2001-3, 4: ALR-2, 5: VG-09405, 6: ICGV 86590, 7: CS-85, 8: CS-319, 9: JL-24, 10: GG-20, 11: TG 37-A

doi:10.1371/journal.pone.0129127.s001

(TIF)

S1 Table. Overall frequency distribution of 3373 motif repeats identified from A. hypogaea unigenes.

doi:10.1371/journal.pone.0129127.s002

(XLSX)

S2 Table. PCR validation of 366 EST-SSR primers, using the panel of 11 peanut genotypes.

(Where light-green and pink color codes represent stress-relevant and other functional unigenes respectively)

doi:10.1371/journal.pone.0129127.s003

(XLSX)

S3 Table. A set of 2456 novel EST-SSR primers, designed from publically available Arachis hypogaea databases.

(Excluding 366, stress relevant and other functional unigenes containing SSR markers used for the PCR validation)

doi:10.1371/journal.pone.0129127.s004

(XLSX)

S1 Script. Search for sequence similarity of primer sequence (query) from EST database (Subject).

doi:10.1371/journal.pone.0129127.s005

(PL)

S2 Script. Extraction of specific sequences based on sequences ID.

doi:10.1371/journal.pone.0129127.s006

(PL)

Acknowledgments

Authors acknowledge the help rendered by Ms. Mruduka Patel of Anand Agriculture University, Gujarat, India for developing the Perl script used in the investigation.

Author Contributions

Conceived and designed the experiments: RT. Performed the experiments: RT. Analyzed the data: GPM TCB JRD. Contributed reagents/materials/analysis tools: TCB. Wrote the paper: GPM TCB.

References

  1. 1. Sarkar T, Thankappan R, Kumar A, Mishra GP, Dobaria JR. Heterologous expression of the AtDREB1A gene in transgenic peanut-conferred tolerance to drought and salinity stresses. PLoS ONE 2014; 9: e110507. doi: 10.1371/journal.pone.0110507. pmid:25545786
  2. 2. FAO. Food and Agricultural Organization of the United Nation. FAO statistical database. Available: http://faostat.fao.org/faostat/collections?Subset=agriculture. 2013. Accessed 20 January 2015.
  3. 3. Arumuganatham E. Nuclear DNA content of some important plant species. Plant Mol Biol Rep 1991; 9: 211–215. doi: 10.1007/bf02672016
  4. 4. Temsch EM, Greilhuber J. Genome size variation in Arachis hypogaea and A. monticola reevaluated. Genome 2000; 43: 449–451. pmid:10902707 doi: 10.1139/gen-43-3-449
  5. 5. Krapovickas A, Gregory WC. Taxonomia del genero Arachis (Leguminosae). Bonplandia 1994; 8: 1–186.
  6. 6. Young ND, Weeden NF, Kochert G. Genome mapping in legumes (Family Fabaceae) In: Paterson AH (ed), Genome mapping in plants. Landes Biomedical Press, Austin, Texas; 1996; pp. 211–277.
  7. 7. Halward TM, Stalker HT, Larue EA, Kochert G. Genetic variation detectable with molecular markers among unadapted germplasm resources of cultivated peanut and related wild-species. Genome 1991; 34: 1013–1020. doi: 10.1139/g91-156
  8. 8. Singh D, Radhakrishnan T, Kumar V, Bagwan NB, Basu MS, Dobaria JR, et al. Morphological and toxigenic variability in the Aspergillus flavus isolates from peanut (Arachis hypogaea L.) production system in Gujarat (India). J Environ Biol 2015; 36:441–449. pmid:25895268
  9. 9. Dodia SM, Mishra GP, Radhakrishnan T, Dobaria JR. Molecular analysis of gene-defects among non-aflatoxigenic Aspergillus flavus isolates for aflatoxin biosynthesis gene-cluster and flanking-regions. J Pure Appl Microbiol 2014; 8: 4623–4635.
  10. 10. Gajjar KN, Mishra GP, Radhakrishnan T, Dodia SM, Rathnakumar AL, Kumar N, et al. Validation of SSR markers linked to the rust and late leaf spot diseases resistance in diverse peanut genotypes. Aust J Crop Sci 2014; 8: 927–936.
  11. 11. Bhauso TD, Radhakrishnan T, Kumar A, Mishra GP, Dobaria JR, Patel K, et al. Overexpression of bacterial mtlD gene in peanut improves drought tolerance through accumulation of mannitol. Sci World J 2014; ArticleID 125967, doi: 10.1155/2014/125967.
  12. 12. Martinez-Beltran J, Manzur CL. Overview of salinity problems in the world and FAO strategies to address the problem. In: Proceedings of the International Salinity Forum, April 2005; Riverside, California, USA. 2005; pp 311–313.
  13. 13. Pandey MK, Monyo E, Ozias-Akins P, Liang X, Guimarães P, Nigam SN, et al. Advances in Arachis genomics for peanut improvement. Biotechnol Adv 2012a; 30: 639–651. doi: 10.1016/j.biotechadv.2011.11.001. pmid:22094114
  14. 14. Pandey MK, Gautami B, Jayakumar T, Sriswathi M, Upadhyaya HD, Gowda MVC, et al. Highly informative genic and genomic SSR markers to facilitate molecular breeding in cultivated groundnut (Arachis hypogaea). Plant Breed 2012b; 131: 139–147. doi: 10.1111/j.1439-0523.2011.01911.x
  15. 15. Cheung F, Haas BJ, Goldberg SM, May GD, Xiao Y, Town CD. Sequencing Medicago truncatula expressed sequenced tags using 454 Life Sciences technology. BMC Genomics 2006; 7: 272. doi: 10.1186/1471-2164-7-272. pmid:17062153
  16. 16. Sato S, Nakamura Y, Kaneko T, Asamizu E, Kato T, Nakao M, et al. Genome structure of the legume, Lotus japonicus. DNA Res 2008; 15: 227–239. doi: 10.1093/dnares/dsn008. pmid:18511435
  17. 17. Garg R, Patel RK, Tyagi AK, Jain M. De novo assembly of chickpea transcriptome using short reads for gene discovery and marker identification. DNA Res 2011; 18: 53–63. doi: 10.1093/dnares/dsq028. pmid:21217129
  18. 18. Tang J, Baldwin SJ, Jacobs JME, Linden CG, Voorrips RV, Leunissen JAM, et al. Large-scale identification of polymorphic microsatellites using an in silico approach. BMC Bioinformatics 2008; 9: 374. doi: 10.1186/1471-2105-9-374. pmid:18793407
  19. 19. Kalia RK, Rai MK, Kalia S, Singh R, Dhawan AK. Microsatellite markers: an overview of the recent progress in plants. Euphytica 2011; 177: 309–334. doi: 10.1007/s10681-010-0286-9
  20. 20. Song GQ, Li MJ, Xiao H, Wang XJ, Tang RH, Xia H, et al. EST sequencing and SSR marker development from cultivated peanut (Arachis hypogaea L.). Electron J Biotechnol 2010; 13: 3. doi: 10.2225/vol13-issue3-fulltext-10.
  21. 21. Koilkonda P, Sato S, Tabata S, Shirasawa K, Hirakawa H, Sakai H, et al. Large-scale development of expressed sequence tag-derived simple sequence repeat markers and diversity analysis in Arachis spp. Mol Breed 2011; 30: 125–138. pmid:22707912 doi: 10.1007/s11032-011-9604-8
  22. 22. Nagy ED, Guo Y, Tang S, Bowers JE, Okashah RA, Taylor CA, et al. A high-density genetic map of Arachis duranensis, a diploid ancestor of cultivated peanut. BMC Genomics 2012; 13: 469. doi: 10.1186/1471-2164-13-469. pmid:22967170
  23. 23. Zhang J, Liang S, Duan J, Wang J, Chen S, Cheng Z, et al. De novo assembly and characterisation of the transcriptome during seed development, and generation of genic-SSR markers in peanut (Arachis hypogaea L.). BMC Genomics 2012; 13: 90. doi: 10.1186/1471-2164-13-90. pmid:22409576
  24. 24. Guimarães PM, Brasileiro ACM, Morgante CV, Martins ACQ, Pappas G, Silva OB Jr, et al. Global transcriptome analysis of two wild relatives of peanut under drought and fungi infection. BMC Genomics 2012; 13: 387. doi: 10.1186/1471-2164-13-387. pmid:22888963
  25. 25. Chopra R, Burow G, Farmer A, Mudge J, Simpson CE, Burow MD. Comparisons of de novo transcriptome assemblers in diploid and polyploid species using peanut (Arachis spp.) RNA-seq data. PLoS ONE 2014; 9(12): e115055. doi: 10.1371/journal.pone.0115055. pmid:25551607
  26. 26. He GH, Meng R, Newman M, Gao G, Pittman RN, Prakash CS. Microsatellites as DNA markers in cultivated peanut (Arachis hypogaea L.). BMC Plant Biol 2003; 3: 3. doi: 10.1186/1471-2229-3-3. pmid:12713672
  27. 27. Ferguson ME, Burow MD, Schulze SR, Bramel PJ, Paterson AH, Kresovich S, et al. Microsatellite identification and characterization in peanut (Arachis hypogaea L.). Theor Appl Genet 2004; 108: 1064–1070. pmid:15067392 doi: 10.1007/s00122-003-1535-2
  28. 28. Moretzsohn MC, Barbosa AVG, Alves-freitas DMT, Teizeira C, Leal-Bertioli SCM, Guimarães PM, et al. A linkage map for the B-genome of Arachis (Fabaceae) and its synteny to the A-genome. BMC Plant Biol 2009; 9: 40. doi: 10.1186/1471-2229-9-40. pmid:19351409
  29. 29. Gimenes MA, Hosino AA, Barbosa AVG, Palmieri DA, Lopes CR. Characterization and transferability of microsatellite markers of cultivated peanut (Arachis hypogaea). BMC Plant Biol 2007; 7: 9. doi: 10.1186/1471-2229-7-9. pmid:17326826
  30. 30. Cuc LM, Mace ES, Crouch JH, Quang VD, Long TD, Varshney RK. Isolation and characterization of novel microsatellite markers and their application for diversity assessment in cultivated groundnut (Arachis hypogaea). BMC Plant Biol 2008; 8: 55. doi: 10.1186/1471-2229-8-55. pmid:18482440
  31. 31. Gautami B, Ravi K, Narasu ML, Hoisington DA, Varshney RK. Novel set of groundnut SSR markers for germplasm analysis and inter-specific transferability. Int J Integr Biol 2009; 7: 100–106.
  32. 32. Macedo SE, Moretzsohn MC, Leal-Bertioli SCM, Alves DMT, Gouvea EG, Azevedo VCR, et al. Development and characterization of highly polymorphic long TC repeat microsatellite markers for genetic analysis of peanut. BMC Res Notes 2012; 5: 86. doi: 10.1186/1756-0500-5-86. pmid:22305491
  33. 33. He GH, Meng R, Gao H, Guo B, Gao G, Newman M, et al. Simple sequence repeat markers for botanical varieties of cultivated peanut (Arachis hypogaea L.). Euphytica 2005; 142: 131–136. doi: 10.1007/s10681-005-1043-3
  34. 34. Moretzsohn MC, Leoi L, Proite K, Guimarães PM, Leal-Bertioli SCM, Gimenes MA, et al. A microsatellite- based, gene-rich linkage map for the AA genome of Arachis (Fabaceae). Theor Appl Genet 2005; 111: 1060–1071. pmid:16088397 doi: 10.1007/s00122-005-0028-x
  35. 35. Proite K, Leal-Bertioli SC, Bertioli DJ, Moretzsohn MC, daSilva FR, Martins NF, et al. ESTs from a wild Arachis species for gene discovery and marker development. BMC Plant Biol 2007; 7: 7. doi: 10.1186/1471-2229-7-7. pmid:17302987
  36. 36. Wang CT, Yang XD, Chen DX, Yu SL, Liu GZ, Tang YY, et al. Isolation of simple sequence repeats from groundnut. Electron J Biotechnol 2007; 10: 3. doi: 10.2225/vol10-issue3-fulltext-10.
  37. 37. Guo BZ, Chen XP, Hong YB, Liang XQ, Dang P, Brenneman T, et al. Analysis of gene expression profiles in leaf tissues of cultivated peanuts and development of EST-SSR markers and gene discovery. Int J Plant Genomics 2009; doi: 10.1155/2009/715605. pmid:20182638
  38. 38. Liang X, Chen X, Hong Y, Liu H, Zhou G, Li S, et al. Utility of EST- derived SSR in cultivated peanut (Arachis hypogaea L.) and Arachis wild species. BMC Plant Biol 2009; 9: 35. doi: 10.1186/1471-2229-9-35. pmid:19309524
  39. 39. Sarvamangala C, Gowda MVC, Varshney RK. Identification of quantitative trait loci for protein content, oil content and oil quality for groundnut (Arachis hypogaea L.). Field Crops Res 2011; 122: 49–59. doi: 10.1016/j.fcr.2011.02.010
  40. 40. Gautami B, Pandey MK, Vadez V, Nigam SN, Ratnakumar P, Krishnamurthy L, et al. Quantitative trait locus analysis and construction of consensus genetic map for drought tolerance traits based on three recombinant inbred line populations in cultivated groundnut (Arachis hypogaea L.). Mol Breed 2012; 30: 757–772. pmid:22924017 doi: 10.1007/s11032-011-9660-0
  41. 41. Sukruth M, Paratwagh SA, Sujay V, Kumari V, Gowda MVC, Nadaf HL, et al. Validation of markers linked to late leaf spot and rust resistance, and selection of superior genotypes among diverse recombinant inbred lines and backcross lines in peanut (Arachis hypogaea L.). Euphytica 2015; doi: 10.1007/s10681-014-1339-2.
  42. 42. Varshney RK, Pandey MK, Janila P, Nigam SN, Sudini H, Gowda MVC, et al. Marker-assisted introgression of a QTL region to improve rust resistance in three elite and popular varieties of peanut (Arachis hypogaea L.). Theor Appl Genet 2014; 127: 1771–1781. doi: 10.1007/s00122-014-2338-3. pmid:24927821
  43. 43. Rathnakumar AL, Singh R, Parmar DL, Misra JB. Groundnut: a crop profile and compendium of notified varieties of India. Directorate of Groundnut Research, India. 2013; 118 p.
  44. 44. Rathnakumar AL. Crop improvement: Hybridization program. In: Rathnakumar AL, Dagla M, Kumar N, Parmar DL, Singh R, Eds., Annual meeting of groundnut researchers: All India Coordinated Research Project on Groundnut. ICAR-DGR, India. 2013; pp 1–19.
  45. 45. Khedikar YP, Gowda MVC, Sarvamangala C, Patgar KV, Upadhyaya HD, Varshney RK. A QTL study on late leaf spot and rust revealed one major QTL for molecular breeding for rust resistance in groundnut (Arachis hypogaea L.). Theor Appl Genet 2010; 121: 971–984. doi: 10.1007/s00122-010-1366-x. pmid:20526757
  46. 46. Sujay V, Gowda MVC, Pandey MK, Bhat RS, Khedikar YP, Nadaf HL, et al. Quantitative trait locus analysis and construction of consensus genetic map for foliar disease resistance based on two recombinant inbred line populations in cultivated groundnut (Arachis hypogaea L.). Mol Breed 2012; 30: 773–788. pmid:22924018 doi: 10.1007/s11032-011-9661-z
  47. 47. Thirumalaisamy PP, Kumar N, Radhakrishnan T, Rathnakumar AL, Bera SK, Jadon KS, et al. Phenotyping of groundnut genotypes for resistance to sclerotium stem rot. J Mycol Plant Pathol 2014; 44(4): 459–462.
  48. 48. Doyle JJ, Doyle JL. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bull 1987; 19: 11–15.
  49. 49. Ewing B, Green P. Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res 1998; 8: 186–194. pmid:9521922 doi: 10.1101/gr.8.3.175
  50. 50. Pertea G, Huang X, Liang F, Antonescu V, Sultana R, Karamycheva S, et al. TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics 2003; 19: 651–652. pmid:12651724 doi: 10.1093/bioinformatics/btg034
  51. 51. Childs KL, Hamilton JP, Zhu W, Ly E, Cheung F, Wu H, et al. The TIGR plant transcript assemblies database. Nucleic Acids Res 2007; 35: D846–D851. doi: 10.1093/nar/gkl785. pmid:17088284
  52. 52. Thiel T, Michalek W, Varshney RK, Graver A. Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor Appl Genet 2003; 106: 411–422. pmid:12589540
  53. 53. Ye J, Fang L, Zheng H, Zhang Y, Chen J, Zhang Z, et al. WEGO: a web tool for plotting GO annotations. Nucleic Acids Res 2006; 34: W293–W297. doi: 10.1093/nar/gkl031. pmid:16845012
  54. 54. You FM, Huo N, Gu YQ, Luo MC, Ma Y, Hane D, et al. BatchPrimer3: a high throughput web application for PCR and sequencing primer designi. BMC Bioinformatics 2008; 9: 253. doi: 10.1186/1471-2105-9-253. pmid:18510760
  55. 55. Benbouza H, Jacquemin JM, Baudoin JP, Mergeai G. Optimization of a reliable, fast cheap and sensitive silver staining method to detect SSR markers in polyacrylamide gels. Biotechnol Agron Soc 2006; 93: 77–81.
  56. 56. Liu K, Muse S. PowerMarker: an integrated analysis environment for genetic marker analysis. Bioinformatics 2005; 21: 2128–2129. pmid:15705655 doi: 10.1093/bioinformatics/bti282
  57. 57. Botstein D, White RL, Skolnick M, Davis RW. Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am J Hum Genet 1980; 32: 314–331. pmid:6247908
  58. 58. Zhao Y, Prakash CS, He G. Characterization and compilation of polymorphic simple sequence repeat (SSR) markers of peanut from public database. BMC Res Notes 2012; 5: 362. doi: 10.1186/1756-0500-5-362. pmid:22818284
  59. 59. Hopkins MS, Casa AM, Wang T, Mitchell SE, Dean RE, Kochert GD, et al. Discovery and characterization of polymorphic simple sequence repeats (SSRs) in peanut. Crop Sci 1999; 39: 1243–1247. doi: 10.2135/cropsci1999.0011183x003900040047x
  60. 60. Nagy ED, Chu Y, Guo YF, Khanal S, Tang S, Li Y, et al. Recombination is suppressed in an alien introgression in peanut harboring Rma, a dominant root-knot nematode resistance gene. Mol Breed 2010; 26: 357–370. doi: 10.1007/s11032-010-9430-4
  61. 61. Vijay N, Poelstra JW, Kunstner A, Wolf JBW. Challenges and strategies in transcriptome assembly and differential gene expression quantification. A comprehensive in silico assessment of RNA-seq experiments. Mol Ecol 2013; 22: 620–634. doi: 10.1111/mec.12014. pmid:22998089
  62. 62. Nakasugi K, Crowhurst RN, Bally J, Wood CC, Hellens RP, Waterhouse PM. De novo transcriptome sequence assembly and analysis of RNA silencing genes of Nicotiana benthamiana. PLoS One 2013; 8: e59534. doi: 10.1371/journal.pone.0091776. pmid:23555698
  63. 63. Chen X, Zhu W, Azam S, Li H, Zhu F, Li H, et al. Deep sequencing analysis of the transcriptomes of peanut aerial and subterranean young pods identifies candidate genes related to early embryo abortion. Plant Biotechnol J 2013; 11: 115–127. doi: 10.1111/pbi.12018. pmid:23130888
  64. 64. Jayashree B, Punna R, Prasad P, Bantte K, Hash CT, Chandra S, et al. A database of simple sequence repeats from cereal and legume expressed sequence tags mined in silico: survey and evaluation. In Silico Biol 2006; 6: 607–620. pmid:17518768
  65. 65. Bräutigam A, Mullick T, Schliesky S, Weber APM. Critical assessment of assembly strategies for non-model species mRNA-Seq data and application of next-generation sequencing to the comparison of C3 and C4 species. J Exp Bot 2011; 1–10. doi: 10.1093/jxb/err029. pmid:21926090
  66. 66. Kumpatla SP, Mukhopadhyay S. Mining and survey of simple sequence repeats in expressed sequence tags of dicotyledonous species. Genome 2005; 48: 985–998. pmid:16391668 doi: 10.1139/g05-060
  67. 67. Varshney RK, Graner A, Sorrells ME. Genic microsatellite markers in plants: features and applications. Trends Biotechnol 2005; 23: 48–55. pmid:15629858 doi: 10.1016/j.tibtech.2004.11.005
  68. 68. Agarwal G, Jhanwar S, Priya P, Singh VK, Saxena MS, Parida SK, et al. Comparative analysis of Kabuli chickpea transcriptome with desi and wild chickpea provides a rich resource for development of functional markers. PLoS ONE 2012; 7: e52443. doi: 10.1371/journal.pone.0052443. pmid:23300670
  69. 69. Wang Z, Yu GH, Shi BB, Wang XM, Qiang HP, Gao HW. Development and characterization of simple sequence repeat (SSR) markers based on RNA sequencing of Medicago sativa and in silico mapping onto the M. truncatula genome. PLoS ONE 2014; 9: e92029. doi: 10.1371/journal.pone.0092029. pmid:24642969
  70. 70. Wu N, Matand K, Wu H, Li B, Li Y, Zhang X, et al. De novo next-generation sequencing, assembling and annotation of Arachis hypogaea L. spanish botanical type whole plant transcriptome. Theor Appl Genet 2013; 126: 1145–1149. doi: 10.1007/s00122-013-2042-8. pmid:23338522
  71. 71. Leal-Bertioli SCM, José ACVF, Alves-Freitas DMT, Moretzsohn MC, Guimarães PM, Nielen S, et al. Identification of candidate genome regions controlling disease resistance in Arachis. BMC Plant Biol 2009; 9: 112. doi: 10.1186/1471-2229-9-112. pmid:19698131
  72. 72. Kaur S, Pembleton LW, Cogan NOI, Savin KW, Leonforte T, Paull J, et al. Transcriptome sequencing of field pea and faba bean for discovery and validation of SSR genetic markers. BMC Genomics 2012; 13: 104. doi: 10.1186/1471-2164-13-104. pmid:22433453
  73. 73. Asp T, Frei UK, Didion T, Nielsen KK, Lubberstedt T. Frequency, type, and distribution of EST-SSRs from three genotypes of Lolium perenne, and their conservation across orthologous sequences of Festuca arundinacea, Brachypodium distachyon and Oryza sativa. BMC Plant Biol 2007; 7: 36. doi: 10.1186/1471-2229-7-36. pmid:17626623
  74. 74. Metzgar D, Bytof J, Wills C. Selection against frameshift mutations limits microsatellite expansion in coding DNA. Genome Res 2000; 10(1): 72–80. pmid:10645952
  75. 75. Akash MW, Myers GO. The development of faba bean expressed sequence tag–simple sequence repeats (EST-SSRs) and their validity in diversity analysis. Plant Breed 2012; 131: 522–530. doi: 10.1111/j.1439-0523.2012.01969.x
  76. 76. Blair MW, Hurtado N, Sharma P. New gene-derived simple sequence repeat markers for common bean (Phaseolus vulgaris L.). Mol Ecol Resour 2012; 12: 661–668. doi: 10.1111/j.1755-0998.2012.03136.x. pmid:22540633
  77. 77. Kumari K, Muthamilarasan M, Misra G, Gupta S, Subramanian A, Parida SK, et al. Development of eSSR-markers in Setaria italica and their applicability in studying genetic diversity, cross-transferability and comparative mapping in millet and non-millet species. PLoS ONE 2013; 8(6): e67742. doi: 10.1371/journal.pone.0067742. pmid:23805325
  78. 78. Wang H, Penmetsa RV, Yuan M, Gong LM, Zhao YL, Guo BZ, et al. Development and characterization of BAC-end sequence derived SSRs, and their incorporation into a new higher density genetic map for cultivated peanut (Arachis hypogaea L.). BMC Plant Biol 2012; 12: 10. doi: 10.1186/1471-2229-12-10. pmid:22260238
  79. 79. Qiu L, Yang C, Tian B, Yang JB, Liu A. Exploiting EST databases for the development and characterization of EST-SSR markers in castor bean (Ricinus communis L.). BMC Plant Biol 2010; 10: 278. doi: 10.1186/1471-2229-10-278. pmid:21162723
  80. 80. Temnykh S, Declerck G, Lukashova A, Lipovich L, Cartinhour S, Mccouch S. Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): Frequency, length variation, transposon associations, and genetic marker potential. Genome Res 2001; 11: 1441–1452. pmid:11483586 doi: 10.1101/gr.184001
  81. 81. Saha MC, Mian MAR, Eujayl I, Zwonitzer JC, Wang L, May GD. Tall fescue EST-SSR markers with transferability across several grass species. Theor Appl Genet 2004; 109: 783–791. pmid:15205737 doi: 10.1007/s00122-004-1681-1
  82. 82. Li YC, Korol AB, Fahima T, Nevo E. Microsatellites within genes: structure, function, and evolution. Mol Biol Evol 2004; 21(6): 991–1007. pmid:14963101 doi: 10.1093/molbev/msh073
  83. 83. Morgante M, Hanafey M, Powell W. Microsatellites are preferentially associated with nonrepetitive DNA in plant genome. Nat Genet 2002; 30(2): 194–200. pmid:11799393 doi: 10.1038/ng822
  84. 84. Bao JS, Corke H, Sun M. Microsatellites in starch synthesizing genes in relation to starch physicochemical properties in waxy rice (Oryza sativa L.). Theor Appl Genet 2002; 105: 898–905. pmid:12582915 doi: 10.1007/s00122-002-1049-3
  85. 85. Peng Z, Gallo M, Rowland D, Tillman B, Wang J. Molecular marker development and germplasm evaluation for cultivated peanut (Arachis hypogaea L.). 2015; P0740, Plant and Animal Genome XXIII, San Diego, CA (January 10–14, 2015). Available: https://pag.confex.com/pag/xxiii/webprogram/Paper16067.html.
  86. 86. Ren X, Jiang H, Yan Z, Chen Y, Zhou X, Huang L, et al. Genetic diversity and population structure of the major peanut (Arachis hypogaea L.) cultivars grown in china by SSR markers. PLoS ONE 2014; 9: e88091. doi: 10.1371/journal.pone.0088091. pmid:24520347
  87. 87. Yuan M, Gong L, Meng R, Li S, Dang P, Guo B, et al. Development of trinucleotide (GGC)n SSR markers in peanut (Arachis hypogaea L.). Electron J Biotechnol 2010; 13: doi: 10.2225/vol13-issue6-fulltext-6.
  88. 88. Varshney RK, Tuberosa R. Genomic approaches and platforms: An Overview. In: Varshney RK and Tuberosa RT, Eds., Genomics Assisted Crop Improvement: Genomics Approaches and Platforms. Springer, Berlin; 2007. pp 13–29.
  89. 89. Li X, Lu J, Liu S, Liu S, Lin Y, Li L. Identification of rapidly induced genes in the response of peanut (Arachis hypogaea) to water deficit and abscisic acid. BMC Biotechnol 2014; 14: 58. doi: 10.1186/1472-6750-14-58. pmid:24970488
  90. 90. Chopra R, Burow G, Farmer E, Mudge J, Simpson CE, Wilkins TA, et al. Next-generation transcriptome sequencing, SNP discovery and validation in four market classes of peanut, Arachis hypogaea L. Mol Genet Genomics 2015; doi: 10.1007/s00438-014-0976-4.
  91. 91. Dutta S, Kumawat G, Singh BP, Gupta DK, Singh S, Dogra V, et al. Development of genic-SSR markers by deep transcriptome sequencing in pigeonpea [Cajanus cajan (L.) Millspaugh]. BMC Plant Biol 2011; 11:17, doi: 10.1186/1471-2229-11-17. pmid:21251263