Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

DArTseq-based silicoDArT and SNP markers reveal the genetic diversity and population structure of Kenyan cashew (Anacardium occidentale L.) landraces

  • Dennis Wamalabe Mukhebi,

    Roles Formal analysis, Investigation, Methodology, Writing – original draft

    Affiliation Department of Biochemistry and Biotechnology, Pwani University, Kilifi, Kenya

  • Pauline Wambui Gachanja,

    Roles Formal analysis, Investigation, Methodology, Writing – original draft

    Affiliation Department of Biochemistry and Biotechnology, Pwani University, Kilifi, Kenya

  • Diana Jepkoech Karan,

    Roles Formal analysis, Investigation, Writing – original draft

    Affiliation Pwani University Bioscience Research Center (PUBReC), Pwani University, Kilifi, Kenya

  • Brenda Muthoni Kamau,

    Roles Formal analysis, Investigation, Methodology, Writing – original draft

    Affiliation Department of Biochemistry and Biotechnology, Pwani University, Kilifi, Kenya

  • Pauline Wangeci King’ori,

    Roles Formal analysis, Investigation, Methodology, Writing – original draft

    Affiliation Department of Biochemistry and Biotechnology, Pwani University, Kilifi, Kenya

  • Bicko Steve Juma,

    Roles Formal analysis, Investigation, Methodology, Writing – original draft

    Affiliations Pwani University Bioscience Research Center (PUBReC), Pwani University, Kilifi, Kenya, Department of Agroecology, Crop Genetics and Biotechnology Section, Aarhus University, Flakkebjerg, Denmark

  • Wilton Mwema Mbinda

    Roles Conceptualization, Funding acquisition, Project administration, Resources, Supervision, Validation, Writing – review & editing

    wilton.mbinda@gmail.com, w.mbinda@pu.ac.ke

    Affiliations Department of Biochemistry and Biotechnology, Pwani University, Kilifi, Kenya, Pwani University Bioscience Research Center (PUBReC), Pwani University, Kilifi, Kenya

Abstract

Cashew (Anacardium occidentale L.) is an important tree grown worldwide for its edible fruits, nuts and other products of industrial applications. The ecologically sensitive cashew-growing region in coastal Kenya is significantly affected by rising temperatures, droughts, floods, and shifting rainfall patterns. These changes adversely impact cashew growth by altering flowering patterns, increasing pests and diseases, and causing postharvest losses, which ultimately result in reduced yields and tree mortality. This is exacerbated by the long juvenile phase, high heterozygosity, lack of trait correlations, large mature plant size, and inadequate genomic resources. For the first time, the Diversity Array Technology (DArT) technology was employed to identify DArT (silicoDArT) and single nucleotide polymorphisms (SNPs) markers for genomic understanding of cashew in Kenya. Cashew leaf samples were collected in Kwale, Kilifi and Lamu counties along coastal Kenya followed by DNA extraction. The reduced libraries were sequenced using Hiseq 2500 Illumina sequencer, and the SNPs called using DarTsoft14. A total of 27,495 silicoDArT and 17,008 SNP markers were reported, of which 1340 silicoDArT and 824 SNP markers were used for analyses after screening, with > 80% call rate, > 95% reproducibility, polymorphism information content (PIC ≥  0.25) and one ratio (>0.25). The silicoDArT and SNP markers had mean PIC values ranging from 0.02–0.50 and 0.0–0.5, with an allelic richness ranging from 1.992 to 1.994 for silicoDArT and 1.862 to 1.889 for SNP markers. The observed heterozygosity and expected values ranged from 0.50–0.55 and 0.34–0.37, and 0.56–0.57 and 0.33 for both silicoDArT and SNP markers respectively. Understanding cashew genomics through the application of SilicoDArT and SNP markers is crucial for advancing cashew genomic breeding programs aimed at improving yield and nut quality, and enhancing resistance or tolerance to biotic and abiotic stresses. Our study presents an overview of the genetic diversity of cashew landraces in Kenya and demonstrates that DArT systems are a reliable tool for advancing genomic research in cashew breeding.

Introduction

Cashew is a tropical evergreen tree from the Anacardiaceae family, which comprises approximately 75 genera and 700 species. Its origin is traced from the northeastern region of Brazil [1]. In Africa, Portuguese traders first introduced cashew in Mozambique in the 16th century for land reclamation [2]. From Mozambique, cashew was spread to East Africa [3]. Africa accounts for more than 50% of the global production of cashew nuts, making the crop an important source of income for smallholder farmers and the regional economy [4]. The evergreen perennial tropical tree is cultivated primarily for its nuts, the second most important edible tree nuts after almond nuts [5]. Other cashew products include the cashew apple (fleshy pseudo fruit), gum, wood, kernel oils, and nutshell liquid [6] which are utilized in food and other industries, making the crop an important source of income for small-scale farmers and the national economy. Cashew trees play key biodiversity roles as sources of food and shelter for wildlife. Additionally, the cashew tree deep rooting system supports the provision of ecosystem services such as soil stabilization, erosion prevention and carbon sequestration [7]. This is a potential adaptation strategy to suboptimal or adverse environmental conditions such as extreme salinity and poor drainage, suggesting that its integration into agroforestry systems makes it suitable for upscaling climate-smart technologies and building resilience to climate change among smallholder farmers.

A. occidentale is a diploid (2n = 42) self-pollinated plant with an approximate genome size of 420 Mb [8]. The intrinsic heterozygosity due to the plant’s predominant allogamy makes the cashew genome complex [9], causing a major hindrance in genomics studies and genetic improvement [5]. This challenge is also compounded by the plant’s long juvenile phase, lack of juvenile‒mature trait correlations and large size of the mature plant [10], making the plant’s conventional breeding strategies slow and unpredictable. Additionally, climate change, which is causing a rise in temperatures, extended droughts, floods, and shifts in rainfall patterns, persists in disrupting cashew farming through alterations in flowering, fruiting, pests and diseases occurrences, wilting of plants inundated farms, and postharvest losses [11]. Genomic techniques should be applied in cashew breeding programs to reduce the errors of conventional breeding approaches and enhance breeding efficiency.

Highly efficient, robust, and polymorphic molecular markers are essential for genetic analysis and marker-assisted breeding in plants. In the past, several markers, including random amplified polymorphic DNA [1214], simple sequence repeats [13,15] and amplified fragment length polymorphisms [15] have been utilized. These are classified as dominant markers and were later superseded by codominant markers such as simple sequence repeats, single nucleotide polymorphisms (SNPs) and diversity array technology (DArT), which are more informative and produce consistent results than dominant markers [16]. Until recently, cashew genomic resources were characterized using microsatellites [17] and insertion/deletion (InDel) markers [10]. The paucity of effective molecular markers and the absence of cashew genomic sequence data have hampered genetic and genomics studies, and molecular breeding.

DArT is a genotyping-by-sequencing platform that utilizes restriction enzymes to digest an organism’s genome, followed by high-throughput sequencing of the fragments to discover genome-wide markers. The choice of restriction enzymes used permits the segregation of highly informative low-copy fragments of the genome. DArTseq-based analysis generates silicoDArT markers (presence/absence of restriction fragments in representation), which are dominant, and single-nucleotide polymorphism (SNP) markers, which are codominant [18]. This platform is widely used in genomic research due to its efficiency in generating a large number of molecular markers across the entire genome in a short time. It is also easy to use and is relatively cost-effective [19]. DArT technology has been applied to several crops, including barley [20], cassava [21], maize [22], macadamia [23,24], almond [25], and cashew [26]. However, there are limited reports on the application of DArTseq markers in cashew or any genomic studies on Kenyan cashew breeding programs. In the present study, we report for the first time, the effectiveness of DArTseq-based silicoDArT and SNP markers as alternative markers to study the genetic diversity and population structure among a collection of cashew landraces along the Kenyan coast. The silicoDArT and SNP resources generated here lay a foundation for future genome-wide association studies or genomic selection in cashew.

Materials and methods

Plant material and DNA extraction

Leaf samples from field-grown cashew trees were collected from farmers in Kwale, Kilifi and Lamu Counties via a stratified random system as described by [27]. The farmers were grouped into strata that were equivalent to the sub-counties. The number of samples collected was proportional to the number of farmers in the stratum, and the density of cashew trees in each sub-county (Table 1). All the plant materials were landraces, and 92 apical leaf samples (27 from Kwale, 28 Lamu, and 37 Kilifi) were collected. The samples were transferred to Falcon tubes containing 20% DESS (a solution containing dimethyl sulfoxide, disodium EDTA, and saturated NaCl buffer [28] and transported in thermally insulated boxes to the Pwani University Bioscience Research Centre (PUBReC) following standard laboratory operating protocols. The collected samples were used for high-quality genomic DNA (gDNA) isolation using Qiagen Plant Mini kit (Hilden, Germany). The quantity and integrity of the extracted gDNA were evaluated using a Nanodrop 2000c spectrophotometer and 1% agarose gel electrophoresis, respectively. The gDNA was then shipped to SEQART AFRICA, Nairobi, Kenya, for genotyping.

thumbnail
Table 1. Cashew landrace collections across Kwale, Kilifi and Lamu counties in coastal Kenya.

https://doi.org/10.1371/journal.pone.0313850.t001

DArTseq genotyping and marker quality control

Diversity Array Technology libraries were constructed according to [29]. The DArTseq complexity reduction method was employed by digesting gDNA with two restriction enzymes (PstI and MseI) and ligating barcoded adapters, followed by PCR amplification of adapter-ligated fragments. Pooling and purification of the PCR amplicons were performed with a QIAquick PCR purification kit (QIAGEN Hilden, Germany). A single-read Illumina HiSeq 2500 platform for 77 cycles was used to sequence purified PCR products. DarTsoft14, an in-house marker-scoring program was used to score the DArTseq markers (S1S3 Tables).

Two types of DArT markers were scored: SilicoDArT and SNP markers. Both markers were scored as binary ‘1’ for the presence or ‘0’ for the absence of a marker in the genomic representation of each sample as described by [30]. Markers were tested for call rate (%), one ratio, and reproducibility (%, a proportion of technical replicate assay pairs in which the marker score is consistent) [23]. Both silicoDArT and SNP markers were filtered based on a minor call rate ( ≥80%), one ratio (>0.25) and reproducibility (≥95%).

Genetic diversity and population structure analysis

Genetic diversity analyses were conducted to determine the polymorphic information content (PIC), observed heterozygosity (Ho), expected heterozygosity (He), allelic richness, and fixation indices (pairwise and overall FST). The genetic indices were computed in R statistical software using adegenet package. A landscape and ecological associations (LEA) R package v.3.19 was used to determine the population genetic structure among the cashew samples across Kwale, Kilifi and Lamu counties. Using the snmf function of LEA, ancestry coefficients were estimated where cross-entropy criteria were used to identify the optimum number of k populations where k =  1–10 with 10 repetitions. Cashew landraces in the subpopulations of Kwale, Kilifi and Lamu counties were identified in R statistical software using Q matrix. Admixture plots of individual cashew landraces were plotted with the plot function of base R. The results of individual markers (silicoDArT and SNP markers) from cashew landraces obtained from population analysis were subjected to analysis of molecular variance (AMOVA) using the poppr package and principal coordinate analysis (PCoA) in R statistical software. Weighted neighbor‒joining dendrograms were constructed for both markers.

Results

Marker characterization

A total of 27,494 silicoDArT and 17,008 SNP markers from 92 apical leaves of cashew landraces were filtered, resulting in 1340 and 824 silicoDArt and SNP markers, respectively. All the identified markers had reproducibility ≥ 95% (Fig 1A and B) and a call rate of ≥ 80% (Fig 1C and D).

thumbnail
Fig 1. Distribution of marker data for reproducibility and call rate.

A. Reproducibility of SilicoDArT markers, B. Reproducibility of SNP markers. C. Call rates of SilicoDArT markers, D. Call rates of SNP markers.

https://doi.org/10.1371/journal.pone.0313850.g001

Genetic relationships among cashew landraces

Among the 1340 and 824 informative markers (silicoDArt and SNP), the observed PIC values varied across the markers between 0.15 and 0.30 and between 0.45 and 0.50 (Fig 2). The silicoDArT and SNP markers presented PIC range values of 0.02–0.50 and 0.0–0.50 and average of 0.29 and 0.24, respectively (Table 2). The allelic richness among the populations ranged from 1.992 to 1.994 for silicoDArT and from 1.862 to 1.889 for the SNP markers (Table 3). The observed heterozygosity and expected values for the silicoDArT markers ranged from 0.50–0.55 and 0.34–0.37, while the SNPs ranged from 0.56–0.57 and 0.33, respectively (Table 3).

thumbnail
Fig 2. Distribution of PIC values of markers used for genomic studies in 92 cashew landrace genotypes.

A. silicoDArT markers and B. SNP markers.

https://doi.org/10.1371/journal.pone.0313850.g002

thumbnail
Table 2. Polymorphic distribution for both silicoDArT and SNP markers.

https://doi.org/10.1371/journal.pone.0313850.t002

thumbnail
Table 3. Statistics of the genetic diversity indices for SilicoDArT and SNP markers from cashew landraces along the Kenyan coast.

https://doi.org/10.1371/journal.pone.0313850.t003

Population structure analysis using cross-entropy criteria revealed that individual cashew landraces could be divided into two subpopulations (K =  2) for both markers (Fig 3A and 3B). These findings were validated by visualizing the admixture bar plot of the Q matrix (Figs 4 and 5). The overall fixation indices among the subpopulations for the silicoDArT and SNP markers were 0.003 and 0.008, respectively (Table 4). The extent of genetic differentiation across the three study sites was substantially low (Table 4) as indicated by the low overall fixation indices among the subpopulations for both the silicoDArT (Fst =  0.003) and SNP (Fst =  0.008) markers (Table 4). Analysis of molecular variance revealed a 6.8% variation between the cashew landrace populations and 93.2% within them (Table 5). Further evaluation with a principal coordinate analysis (PCoA) revealed that most cashew landraces in Kilifi and Kwale counties share a genetic lineage, as compared to that of Lamu on the PCoA plot for both silicoDArT and SNP markers (Fig 6). Weighted neighbor‒joining hierarchical clustering assorted the cashew landrace population into three clusters with different numbers of landraces: cluster 1 (53), cluster 2 (22) and cluster 3 (17) for SNP markers and cluster 1 (28), cluster 2 (39) and cluster 3 (25) for silicoDArT (Fig 7A). However, some landraces for SNP markers in cluster 2, such as LM/LW/103 and KLF/KAL/69, were clustered in cluster 1. The silicoDArT markers LM/LW/106, LM/LW/88 and LM/LE/114 of cluster 2 were clustered in cluster 3, whereas the KLF/MAL/52 and KWL/MAT/32 cashew landraces of cluster 3 were in cluster 2 (Fig 7B).

thumbnail
Fig 3. Cashew landrace population structure determination based cross-entropy of Q Matrix for population admixture.

A. SilicoDArT markers, B. SNP markers.

https://doi.org/10.1371/journal.pone.0313850.g003

thumbnail
Fig 4. Population admixtures of 92 cashew landrace genotypes based on the Q matrix derived from 1,340 high-density silicoDArT markers.

https://doi.org/10.1371/journal.pone.0313850.g004

thumbnail
Fig 5. Population admixtures of 92 cashew landrace genotypes based on the Q matrix for 824 silicoDArT markers.

https://doi.org/10.1371/journal.pone.0313850.g005

thumbnail
Table 4. Population pair-wise fixation index of cashew landrace genotypes genotyped using silico DArT and SNPs markers.

https://doi.org/10.1371/journal.pone.0313850.t004

thumbnail
Table 5. Analysis of molecular variance in cashew landrace genotype collection based silicoDArT and SNP markers.

https://doi.org/10.1371/journal.pone.0313850.t005

thumbnail
Fig 6. Principal coordinate analysis of cashew landraces based on population structure via silicoDArT and SNP markers.

https://doi.org/10.1371/journal.pone.0313850.g006

thumbnail
Fig 7. Genetic clusters (red, blue and green clusters) among 92 cashew landrace genotypes based on A. silicoDArT and B. SNP markers.

https://doi.org/10.1371/journal.pone.0313850.g007

Discussion

Establishing efficient and well-characterized genetic resources is a primary goal for successful breeding programs. This approach is fundamental for crop genetic resource management, breeding programs and understanding the ancestral relationships of genotypes [31]. Crop genetic of yield, quality, and response to biotic and abiotic stress resistance/tolerance depend on the efficiency of selection. However, few genomic resources utilizing random amplified polymorphic DNA, and simple sequence repeat markers have been developed for cashew varieties to support genetic differentiation and selection. Germplasm characterization of cashew genotypes has been performed through conventional molecular markers, yet they are few and currently only 51 sequences are available in the public database. We evaluated the effectiveness of DArTseq-based silicoDArT and SNP markers as alternative markers to analyze genetic diversity and population structure.

After filtering, 1,340 high-quality silicoDArT markers and 824 SNP markers were identified from the initial 27,494 silicoDArT and 17,008 SNP markers, which are recommended for use in genomic analysis [24]. Our analysis identified SNP and silicoDArT markers with PIC values ranging between 0.0 and 0.5, indicating a moderate genetic variation. The moderate genetic divergence of individual cashew landraces in this study corroborates the findings of [23] on macadamia, who reported PIC values of silicoDArT and SNP markers with moderate levels of informativeness. This low genetic diversity among the cashew landraces is strongly attributed to the local domestication and distribution of the crop in the Kenyan coastal counties (Kwale, Kilifi and Lamu). The genetic diversity observed among the cashew landraces suggests a consistent gene flow across the populations.

The observed and expected heterozygosity were distinct, with observed heterozygosity being greater than expected. This observation could be attributed to outcrossing, reflecting a deviation from Hardy-Weinberg equilibrium in the analyzed cashew landraces. The outcrossing among the cashew landraces confers its fitness and relevance to withstand disease and enhanced environmental adaptability [32]. Cross-pollination between staminate and pistillate flowers enables cashew trees to predominantly outcross, which maintains certain alleles in the population, even if they might be detrimental in a homozygous state [33].

Both markers exhibited moderate allelic richness, ranging from 1.862 to 1.994 (Table 3), reflecting genetic drift among the individual cashew landraces. The observed moderate allelic richness among the cashew landraces explains their ability to adapt to diseases and climate changes [34,35]. The allelic richness observed in this study backtraces on the impact of outcrossing, as indicated by Ho>  He, which contributes to the resilience of cashew trees to diseases and their adaptability to climate change. Our findings concur with those of [9], who reported average allelic richness alleles per locus among cashews in East Timor. The allelic richness among cashews reflects their evolutionary potential in breeding and conservation programs in East Africa and Kenya.

The presence of both dominant and recessive alleles among cashew landraces has important implications for selection and breeding. More often, the upregulation of dominant alleles may enhance the selection of well-adapted genotypes. However, this may hinder the selection of recessive alleles [36]. The fixation index values (Table 4) suggest limited gene exchange between populations, indicating a high flow of genetic information within cashew landraces. This lack of distinct population structure is evident in both the admixture barplot and the PCoA plot. To maximize genetic gains when breeding for new cashew varieties, it is advisable to cross genetically divergent genotypes from different clusters. This approach can improve economic traits such as yield, nutrient composition, and resistance to heat, drought, and diseases.

Our study provides insight on the population structure, where two clusters (k =  2) were identified. This foundational work was essential in advancing the understanding of genetic diversity and population structure among 92 distinct cashew landraces from Kwale, Kilifi, and Lamu counties in Kenya. The admixture plot, a cross-validation of the cross-entropy value K =  2, indicates a pattern of similarity among cashew landraces from Kwale and Kilifi hence they share a genetic structure with an even distribution of two ancestral populations (red and blue clusters) for both markers. However, besides Lamu’s landraces having a different genetic structure for SNP markers, it conforms to K =  2. The groupings of these individual cashew landraces within the admixture plot reflect the historical exchange of cashew seeds through informal systems or germplasm exchange in propagative breeding programs. Both the silicoDArT and SNP markers indicated two ancestral populations. The selection of K =  2 effectively captures these regional differences in genetic structure. Therefore, it can be hypothesized that human activities, such as using seeds from the same tree to grow new seedlings, could be a source of gene flow [37]. Our findings for genetic diversity and population structure in cashew employing ultrahigh-throughput DArTseq-based silicoDArT markers is fundamental towards understanding the progressive knowledge in cashew breeding and the conservation of germplasm similar to [26] work on cashew in Tanzania. The DArT platform allows simultaneous characterization of a substantial number of markers across the cashew genome, facilitating the efficient evaluation of its genetic diversity in breeding programs. Our AMOVA supports the moderate genetic diversity observed (Table 5), as indicated by allelic richness and the influence of cross-pollination, which is linked to cashew landraces outcrossing in this study.

Individual cashew landraces were clustered into three subpopulations, which correlated with their geographical locations. This observation supports the single domestication theory [38], highlighting a significant genetic bottleneck and limited genetic variation in cashew. This observation reiterates the single domestication theory which is a key genetic bottleneck and limited genetic variation in cashew. Understanding cashew genomics through the application of SilicoDArT and SNP markers is crucial for advancing breeding programs aimed at improving yield and nut quality, and enhancing resistance or tolerance to biotic and abiotic stresses, thereby optimizing selection outcomes.

Conclusion

Our study evaluated silicoDArT and SNP markers and determined their usefulness in assessing the extent of genetic diversity and population structure among individual cashew landraces, as revealed in the cross-entropy and admixture plots. The genetic diversity among the cashew landraces for both silico DArT and SNP markers reflects a steady gene flow across the study sites which shows its adaptability to disease and climate change. However, the outcrossing effect presents an important research gap towards understanding cashew genomics in breeding programs. This information will guide genotype selection and subsequent breeding programs to develop new cashew germplasms to increase genetic gains for economic traits. Our study provides a snapshot of genetic diversity at a specific point in time. A longitudinal study tracking changes in genetic diversity over time could provide insights into the temporal dynamics of cashew populations.

Supporting information

S2 Table. Silico-DArT markers identified through the DArTseq process show the presence/absence of variation and SNP markers generated from the DArTseq process across the cashew genome.

https://doi.org/10.1371/journal.pone.0313850.s002

(CSV)

S3 Table. Information on the different files generated and their raw data is provided in an Excel file containing the metadata.

https://doi.org/10.1371/journal.pone.0313850.s003

(XLSX)

Acknowledgments

The authors are grateful to Pwani University for providing the laboratory space at the Pwani University Biosciences Research Centre (PUBReC) to perform this work.

References

  1. 1. Salehi B, Gültekin-Özgüven M, Kırkın C, Özçelik B, Morais-Braga MFB, Carneiro JNP, et al. Anacardium plants: chemical, nutritional composition and biotechnological applications. Biomolecules. 2019;9(9):465. pmid:31505888
  2. 2. Dendena B, Corsi S. Cashew, from seed to market: a review. Agron Sustain Dev. 2014;34(4):753–72.
  3. 3. Havik PJ, Monteiro F, Catarino S, Correia AM, Catarino L, Romeiras MM, et al. Agro-economic transitions in Guinea-Bissau (West Africa): historical trends and current insights. Sustainability. 2018;10(10):3408.
  4. 4. Monteiro F, Romeiras MM, Barnabé J, Catarino S, Batista D, Sebastiana M, et al. Disease-causing agents in cashew: a review in a tropical cash crop. Agronomy. 2022;12(10):2553.
  5. 5. Savadi S, Muralidhara BM, Preethi P. Advances in genomics of cashew tree: molecular tools and strategies for accelerated breeding. Tree Genet Genom. 2020;16(5).
  6. 6. Oliveira NN, Mothé CG, Mothé MG. Sustainable uses of cashew tree rejects: cashew apple bagasse and cashew gum. Biomass Conv Bioref. 2020;12(7):2623–30.
  7. 7. Monteiro F, Catarino L, Batista D, Indjai B, Duarte M, Romeiras M, et al. Cashew as a high agricultural commodity in West Africa: insights towards sustainable production in Guinea-Bissau. Sustainability. 2017;9(9):1666.
  8. 8. Aliyu OM, Awopetu JA. Assessment of genetic diversity in three populations of cashew (Anacardium occidentale L.) using protein-isoenzyme-electrophoretic analysis. Genet Resour Crop Evol. 2007;54(7):1489–97.
  9. 9. Kouakou C.K, Konan JA, Doga D, Kouadio ABR. Genetic diversity of cashew. In: Priyadarshan P, Jain SM. editors. Cash Crops. Cham: Springer; 2022. https://doi.org/10.1007/978-3-030-74926-2_14
  10. 10. Savadi S, Adiga JD, Muralidhara BM, Prasad P, Manjunatha K, Ashwitha K, et al. Discovery of genome-wide genetic variations and development of first set of InDel markers for genetics research in cashew. Scientia Horticulturae. 2023;320:112233.
  11. 11. Mohana GS, Eradasappa E, Vanitha Nayak MG. Cashew genetic resources: a global perspective. In: Rajasekharan PE, Rao VR. (editors). Fruit and nut crops. handbooks of crop diversity: conservation and use of plant genetic resources. Singapore: Springer; 2023 https://doi.org/10.1007/978-981-99-1586-6_8-1
  12. 12. Mneney E, Mantell S, Bennett M. Use of random amplified polymorphic DNA (RAPD) markers to reveal genetic diversity within and between populations of cashew (Anacardium occidentale L.). J Horticul Sci Biotechnol. 2001;76(4):375–83.
  13. 13. Archak S, Gaikwad AB, Gautam D, Rao EB, Swamy KRM, Karihaloo JL, et al. DNA fingerprinting of Indian cashew (Anacardium occidentale L.) varieties using RAPD and ISSR techniques. Euphytica. 2003;130(3):397–404.
  14. 14. Samal S, Lenka PC, Nanda RM, Nayak S, Rout GR, Das P, et al. Genetic relatedness in cashew (Anacardium occidentale L.) germplasm collections as determined by randomly amplified polymorphic DNA. Genet Resourc Crop Evol. 2004;51(2):161–6.
  15. 15. Cavalcanti JJV, Wilkinson MJ. The first genetic maps of cashew (Anacardium occidentale L.). Euphytica. 2007;157(1–2):131–43.
  16. 16. Osundare OT, Akinyele BO, Odiyi AC, Paliwal R, Oyatomi OA, Abberton MT. Genetic diversity and population structure of some Nigerian accessions of Bambara groundnut (Vigna subterranea (L.) Verdc.,) using DArT SNP markers. Genet Resour Crop Evol. 2022.
  17. 17. Savadi S, Sowmya K, Megha VS, Muralidhara BM, Mohana GS. Genetic diversity and identification of interspecific hybrids of Anacardium species using microsatellites. Braz J Bot. 2020;44(1):139–48.
  18. 18. Nantongo JS, Potts BM, Davies NW, Fitzgerald H, Rodemann T, O’Reilly-Wapstra J, et al. Variation in constitutive and induced chemistry in the needles, bark and roots of young Pinus radiata trees. Trees. 2021;36(1):341–59.
  19. 19. Deres D, Feyissa T. Concepts and applications of diversity array technology (DArT) markers for crop improvement. J Crop Improv. 2022;37(6):913–33.
  20. 20. Fiust A, Rapacz M, Wójcik-Jagła M, Tyrka M. Development of DArT-based PCR markers for selecting drought-tolerant spring barley. J Appl Genet. 2015;56(3):299–309. pmid:25716655
  21. 21. Adu BG, Akromah R, Amoah S, Nyadanu D, Yeboah A, Aboagye LM, et al. High-density DArT-based SilicoDArT and SNP markers for genetic diversity and population structure studies in cassava (Manihot esculenta Crantz). PLoS One. 2021;16(7):e0255290. pmid:34314448
  22. 22. Tomkowiak A, Nowak B, Sobiech A, Bocianowski J, Wolko Ł, Spychała J, et al. The use of DArTseq technology to identify new SNP and SilicoDArT markers related to the yield-related traits components in maize. Genes (Basel). 2022;13(5):848. pmid:35627233
  23. 23. Alam M, Neal J, O’Connor K, Kilian A, Topp B. Ultra-high-throughput DArTseq-based silicoDArT and SNP markers for genomic studies in macadamia. PLoS One. 2018;13(8):e0203465. pmid:30169500
  24. 24. O’Connor K, Kilian A, Hayes B, Hardner C, Nock C, Baten A, et al. Population structure, genetic diversity and linkage disequilibrium in a macadamia breeding population using SNP and silicoDArT markers. Tree Genet Genom. 2019;15(2).
  25. 25. Tevfik H, Karcı H, Wai J, Kafkas S. Construction of high density linkage maps in almond and synteny analysis with peach, Japanese apricot and sweet cherry genomes. Euphytica. 2023;219(8):82.
  26. 26. Mzena GP, Kusolwa PM, Rwegasira GM, Yao NK. Discovery of novel single nucleotide polymorphic (SNP) markers for genetic mapping of cashew (Anacardium occidentale L.). Int J Agric Env Biores. 2018;3:186–96.
  27. 27. Lance PM, Hattori A. Sampling and evaluation, a guide to sampling for program impact evaluation. Chapel Hill, North Carolina: Measure Evaluation, University of North Carolina; 2016.
  28. 28. Yoder M, De Ley IT, Wm King I, Mundo-Ocampo M, Mann J, Blaxter M, et al. DESS: a versatile solution for preserving morphology and extractable DNA of nematodes. Nematol. 2006;8(3):367–76.
  29. 29. Kilian A, Wenzl P, Huttner E, Carling J, Xia L, Blois H, et al. Diversity arrays technology: a generic genome profiling technology on open platforms. Methods Mol Biol. 2012;888:67–89. pmid:22665276
  30. 30. Wenzl P, Carling J, Kudrna D, Jaccoud D, Huttner E, Kleinhofs A, et al. Diversity Arrays Technology (DArT) for whole-genome profiling of barley. Proc Natl Acad Sci U S A. 2004;101(26):9915–20. pmid:15192146
  31. 31. Ketema S, Tesfaye B, Keneni G, Amsalu Fenta B, Assefa E, Greliche N, et al. DArTSeq SNP-based markers revealed high genetic diversity and structured population in Ethiopian cowpea [Vigna unguiculata (L.) Walp] germplasms. PLoS One. 2020;15(10):e0239122. pmid:33031381
  32. 32. Bayala J, Zougmoré R, Ky-Dembele C, Bationo BA, Buah S, Sanogo D, et al. Towards developing scalable climate-smart village models: approach and lessons learnt from pilot research in West Africa. ICRAF Occasional Paper No. 25. Nairobi: World Agroforestry Centre; 2016.
  33. 33. Greenbaum G, Templeton AR, Zarmi Y, Bar-David S. Allelic richness following population founding events–a stochastic modeling framework incorporating gene flow and genetic drift. PLoS One. 2014;9(12):e115203. pmid:25526062
  34. 34. Caballero A, García-Dorado A. Allelic diversity and its implications for the rate of adaptation. Genetics. 2013;195(4):1373–84. pmid:24121776
  35. 35. DeWoody JA, Harder AM, Mathur S, Willoughby JR. The long-standing significance of genetic diversity in conservation. Mol Ecol. 2021;30(17):4147–54. pmid:34191374
  36. 36. Billiard S, Castric V, Llaurens V. The integrative biology of genetic dominance. Biol Rev Camb Philos Soc. 2021;96(6):2925–42. pmid:34382317
  37. 37. Guterres L, Barnabé J, Barros A, Charrua AB, Duarte MC, Romeiras MM, et al. East Timor as an important source of cashew (Anacardium occidentale L.) genetic diversity. PeerJ. 2023;11:e14894. pmid:37123007
  38. 38. Allaby RG, Stevens CJ, Kistler L, Fuller DQ. Genetic revelations of a new paradigm of plant domestication as a landscape level process. Plant Breed Rev. 2021;321–43.