There is a growing call for inventories that evaluate geographic patterns in diversity of plant genetic resources maintained on farm and in species' natural populations in order to enhance their use and conservation. Such evaluations are relevant for useful tropical and subtropical tree species, as many of these species are still undomesticated or in incipient stages of domestication, and local populations can offer yet-unknown traits of high value to further domestication. For many outcrossing species, such as most trees, inbreeding depression can be an issue, and genetic diversity is important to sustain local production. Diversity is also crucial for species to adapt to environmental changes. This paper explores the possibilities of incorporating molecular marker data into Geographic Information Systems (GIS) to allow visualization and better understanding of spatial patterns of genetic diversity as a key input to optimize conservation and use of plant genetic resources, based on a case study of cherimoya (Annona cherimola Mill.), a Neotropical fruit tree species. We present spatial analyses to (1) improve the understanding of spatial distribution of genetic diversity of cherimoya natural stands and cultivated trees in Ecuador, Bolivia and Peru based on microsatellite molecular markers (SSRs); and (2) formulate optimal conservation strategies by revealing priority areas for in situ conservation, and identifying existing diversity gaps in ex situ collections. We found high levels of allelic richness, locally common alleles and expected heterozygosity in cherimoya's putative centre of origin, southern Ecuador and northern Peru, whereas levels of diversity in southern Peru and especially in Bolivia were significantly lower. The application of GIS on a large microsatellite dataset allows a more detailed prioritization of areas for in situ conservation and targeted collection across the Andean distribution range of cherimoya than previous studies could do, i.e. at province and department level in Ecuador and Peru, respectively.
Citation: van Zonneveld M, Scheldeman X, Escribano P, Viruel MA, Van Damme P, Garcia W, et al. (2012) Mapping Genetic Diversity of Cherimoya (Annona cherimola Mill.): Application of Spatial Analysis for Conservation and Use of Plant Genetic Resources. PLoS ONE 7(1): e29845. doi:10.1371/journal.pone.0029845
Editor: Pär K. Ingvarsson, University of Umeå, Sweden
Received: July 6, 2011; Accepted: December 6, 2011; Published: January 9, 2012
Copyright: © 2012 van Zonneveld et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This study has been carried out within the context of the CHERLA project funded by the International Cooperation with Developing Countries (INCO-DEV) Sixth Framework Programme (Contract 015100) of the European Commission. Additional financial support was provided by the Spanish Ministry of Education (Project Grants AGL2010-15140), the Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA) from Spain (RF2009-00010), Junta de Andalucía (FEDER AGR2742) and by the INIA-Spain financed project ‘Strengthening Regional Collaboration in Conservation and Sustainable Use of Forest Genetic Resources in Latin America and Sub-Saharan Africa.’ The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Many useful tropical and subtropical tree species, even those commonly cultivated, are still in incipient stages of domestication, with their genetic resources often principally or exclusively present in situ, i.e. on farm in home gardens or orchards and/or in natural populations. The local diversity of these tree species could offer yet-unknown traits of high value to further domestication. For many outcrossing species, such as most tropical tree species, this genetic diversity is important to sustain local production, as many of these species are vulnerable to inbreeding depression. Diversity is also a key factor for adaptation to environmental changes. However, tree species are increasingly vulnerable to losses of genetic diversity, referred to as genetic erosion, due to decreased population sizes resulting from land use changes and land degradation, and due to changes in local climate that may select against some genotypes. Therefore, there is a growing call to assess the conservation status of the genetic resources of tree species.
The formulation of effective and efficient conservation strategies requires a thorough understanding of spatial patterns of genetic diversity. A better knowledge of areas of high genetic diversity is also important in optimizing the use of genetic resources, as the likelihood of finding interesting materials for breeding is higher where levels of genetic diversity are maximal. Initiatives to prioritize research on global plant genetic resources, such as those led by the Food and Agriculture Organization of the United Nations (FAO), include calls for more inventories and surveys to increase understanding of variation in plant genetic resources, explicitly referring to the application of molecular tools in such assessments.
This study focuses on cherimoya (Annona cherimola Mill.), an underutilized fruit tree species that belongs to the Annonaceae, a family included within the Magnoliales in the Eumagnoliid clade among the early-divergent angiosperms. This Neotropical tree species is still in its initial stages of domestication and is considered at high risk of losing valuable genetic material from its genepool. Cherimoya fruits are widely praised for their excellent organoleptic characteristics, and the species is therefore considered to have a high potential for commercial production and income generation for both small and large-scale producers in subtropical climates. Cherimoya presents protogynous dichogamy, i.e. it has hermaphroditic flowers wherein female and male parts do not mature simultaneously, which favors outcrossing in its native range. For commercial production, hand pollination with pollen and stamens is common practice due to the lack of overlap between the female and male stages and the absence of pollinating agents outside its native range. At present, advanced commercial production is found in Spain, the world's largest cherimoya producer, with around 3000 ha of plantations, while small-scale cultivation occurs throughout the Andes, Central America and Mexico.
Most early chroniclers and scientists proposed the Andean region, and more specifically, the valleys of southern Ecuador and northern Peru, as cherimoya's centre of origin , , . The existence of natural cherimoya forest patches, which are scattered across the inter-Andean valleys in Ecuador and northern Peru, supports this hypothesis. Nonetheless the possibility that these are feral populations cannot be excluded. This phenomenon has been observed in the case of several fruit tree species, such as olives . An alternative hypothesis for the centre of origin of cherimoya is Central America , which would imply that the area of northern Peru and southern Ecuador is a secondary centre of diversity. Most relatives of cherimoya are native to Central America and southern Mexico, which is an argument in favor of this alternate hypothesis (H. Rainer, Institute of Botany, University of Vienna, 2011, pers. comm.). In any case, cherimoya fruits were consumed in the Andean region in antiquity  and the movement of germplasm across Mesoamerica, southern Mexico and the Andes probably took place in pre-Columbian times.
The conservation status of cherimoya genetic resources has improved considerably in recent years. Due to increasing commercial prices for cherimoya at local markets, Andean farmers are stimulated to conserve in situ the cherimoya trees growing in their backyards. Indeed, trees established in home gardens and orchards are common throughout the Andean region in Bolivia, Ecuador and Peru. They usually originate from planted local seeds or chance seedlings, and among them some individuals show promising traits for future breeding programs. In Peru, the local cultivar ‘Cumbe’ is already fetching retail prices significantly above the prices of unselected cherimoya fruit types. In contrast to most tropical and subtropical underutilized fruit tree species, cherimoya genetic resources are also well conserved ex situ. Several field collections have been established in Spain, Peru and Ecuador, preserving over 500 different accessions. The Spanish collection based at la Estación Experimental La Mayora in Malaga, which holds over 300 accessions (190 of them collected in the Andean region), is currently used as a source of materials for the Spanish cherimoya breeding program and has been thoroughly analyzed using isozymes and microsatellite markers.
The recent development of new molecular tools in combination with new spatial methods and increased computer capacity has created opportunities for new applications of genetic diversity analyses. Whereas neutral molecular markers are considered a sound tool to measure patterns and trends in the use and conservation of plant genetic resources, Geographic Information Systems (GIS) provide opportunities to carry out spatial analyses of genetic diversity patterns identified with these markers. GIS can be used to interpolate genetic parameters between sampled populations, to apply re-sampling of georeferenced samples within a defined buffer zone, or to develop grid-based genetic distance models. GIS are also an acknowledged tool to prioritize areas for conservation of plant genetic resources. Several studies have used spatial analysis to develop conservation strategies for plant genetic resources based on molecular marker data. Moreover, results obtained using GIS can be presented in a clear way on maps, which facilitates the incorporation of these findings into the formulation of conservation strategies and the implementation of conservation measures.
In this article we further explore the possibilities of incorporating molecular marker data into GIS to better visualize and understand spatial patterns of genetic diversity, as a key input to optimize conservation and enhance use of local plant diversity, based on a case study of cherimoya. The specific objectives of this article are to (1) apply innovative spatial analysis to improve understanding of the geographic distribution of cherimoya's genetic diversity in its putative native range, based on microsatellite molecular markers (SSRs); and (2) formulate optimal conservation strategies by prioritizing areas for in situ conservation and identifying existing diversity gaps in ex situ collections. Based on the outcomes, we discuss how these spatial analyses can be used to define possible strategies that guarantee the long-term conservation of cherimoya genetic resources and how these analyses can be applied to improve conservation and use of tree and crop genetic resources in general.
A total of 1504 trees were analyzed in this study, i.e. 395 from Bolivia, 351 from Ecuador and 758 from Peru. Of those, 502 are currently conserved in ex situ collections (in Ecuador, Peru or Spain), whereas the remaining trees were sampled in situ. The molecular analysis included a core set of nine microsatellite loci, resulting in 71 different alleles. In all analyses of α-diversity and β-diversity (also referred to as divergence) we applied a circular neighborhood re-sampling technique, resulting in a total dataset of 48,128 re-sampled trees (Figure 1). This technique facilitates analysis of patterns in genetic variation across extensive distribution ranges while maintaining high-resolution grids.
This map is made with a 10-minutes grid applying a one-degree circular neighborhood.
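The re-sampling step can be illustrated with a minimal Python sketch. It assumes that a tree is counted in every grid cell whose centre lies within a given radius of the tree, and that distances are measured in planar decimal degrees. The text does not state whether the one-degree neighborhood refers to a radius or a diameter, so `radius_deg` is left as a parameter. The function name and the input layout are illustrative and are not taken from the software used in the study.

```python
import math
from collections import defaultdict


def circular_neighborhood(trees, cell_deg, radius_deg):
    """Assign each tree to every grid cell whose centre lies within radius_deg.

    trees: iterable of (tree_id, lon, lat) in decimal degrees.
    cell_deg: grid cell size in degrees (a 10-minute grid is 1/6 degree).
    radius_deg: neighborhood radius in degrees (assumed planar distance).
    Returns {(col, row): [tree_id, ...]}.
    """
    cells = defaultdict(list)
    # Number of cells to scan on each side of the tree's own cell.
    reach = math.ceil(radius_deg / cell_deg)
    for tree_id, lon, lat in trees:
        col0 = math.floor(lon / cell_deg)
        row0 = math.floor(lat / cell_deg)
        for col in range(col0 - reach, col0 + reach + 1):
            for row in range(row0 - reach, row0 + reach + 1):
                centre_lon = (col + 0.5) * cell_deg
                centre_lat = (row + 0.5) * cell_deg
                if math.hypot(centre_lon - lon, centre_lat - lat) <= radius_deg:
                    cells[(col, row)].append(tree_id)
    return dict(cells)
```

The totals reported above imply that each tree was counted in 32 grid cells on average (48,128 / 1504 = 32).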
Allelic richness is a straightforward measure of genetic diversity that is commonly used in studies based on molecular markers that aim at selecting populations for conservation , . Figure 2 presents the distribution of the average number of alleles per locus found in the study area. It clearly shows that a higher number of alleles is present in the northern part of the study area, specifically in northern Peru, around Cajamarca Department, while other areas of high diversity are located on the border zone between Ecuador (Loja Province) and Peru (Piura Department), in the northern part of Ecuador around its capital Quito and in the northern part of the Lima Department in Peru.
This map shows the average number of alleles per locus in all 10-minutes grid cells applying a one-degree circular neighborhood.
Despite the effort to implement a similar sampling density throughout the study area, some areas (often locations with a higher abundance of traditionally managed cherimoya trees and stands) have been sampled more intensively than others (Figure 1), generating a sampling bias. The rarefaction methodology corrects this sampling bias by recalculating allelic richness in each grid cell to a minimum sample size. Figure 3 shows only the grid cells where 20 or more trees were present after applying a one-degree circular neighborhood, and for which allelic richness was corrected following the rarefaction methodology to a minimum sample size of 20 trees. The Cajamarca Department in northern Peru remains the area with the highest diversity, up to an average of 5.18 different alleles per locus. After correction by rarefaction, diversity in Ecuador, especially around Quito, is reduced, and the same seems to happen in the northern part of the Lima Department in Peru, indicating the presence of a sampling bias around the capitals of both countries. The area around the Peruvian capital, Lima, an important commercial cherimoya cultivation area, shows the lowest allelic richness within Peru, probably due to the widespread cultivation of a vegetatively propagated cultivar, ‘Cumbe’. Another striking result is that allelic richness in Bolivia, already low in the uncorrected analysis, is even lower after correction of sampling bias, resulting in an even higher contrast between cherimoya genetic diversity in Bolivia and that found in Peru and Ecuador.
This map shows the average number of alleles per locus in the 10-minutes grid cells applying a one-degree circular neighborhood and a correction by rarefaction to a minimum sample size of 20 trees.
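Rarefaction to a common sample size can be sketched as follows. The function below uses the standard estimator for the expected number of distinct alleles in a random subsample drawn without replacement. Whether the study used exactly this estimator, and whether the sample size was counted in trees or in gene copies, is an assumption here (20 diploid trees would correspond to n = 40 gene copies).

```python
from math import comb


def rarefied_allelic_richness(allele_counts, n):
    """Expected number of distinct alleles in a subsample of n gene copies.

    allele_counts: observed copy number of each allele at one locus.
    n: rarefaction size in gene copies (must not exceed the total sampled).
    """
    total = sum(allele_counts)
    if n > total:
        raise ValueError("rarefaction size exceeds the number of sampled copies")
    denom = comb(total, n)
    # For each allele: 1 minus the probability that the subsample misses it.
    return sum(1 - comb(total - c, n) / denom for c in allele_counts)
```

Averaging this value over the nine loci in a grid cell would give a rarefied counterpart of the average number of alleles per locus.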
Locally common alleles
Priority for conservation should be given to populations that retain locally common alleles; these are alleles that occur in high frequency in a limited area, and can indicate the presence of genotypes adapted to specific environments . Figure 4 shows the richness of locally common alleles per locus in the study area. The high diversity levels found in the Cajamarca Department in northern Peru are reconfirmed. Besides harboring the highest number of different alleles, it also contains the highest number of locally common alleles. This makes this area a priority for in situ conservation, both of cultivated trees on farm and of natural stands. The border region between Peru and Ecuador (Piura Department and Loja Province) is another area where a high concentration of locally common alleles has been observed and may, therefore, be a second area to prioritize in situ conservation efforts. To a lesser extent, the area around Quito in Ecuador and the northern part of the Lima Department in Peru also present locally common alleles.
This map shows the average number of alleles per locus in a 10-minutes grid cell that are relatively common (occurring with a frequency higher than 5%) in a limited area (in 25% or less of the grid cells) applying a one-degree circular neighborhood.
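The two thresholds in this definition can be made concrete with a short Python sketch. It assumes that "limited area" means the allele is present in at most 25% of all grid cells, and that "common" is judged by the allele's frequency within the cell being scored. The function name and the input layout are illustrative.

```python
def locally_common_alleles(cell_freqs, min_freq=0.05, max_cell_share=0.25):
    """Return, per grid cell, the alleles that are locally common.

    cell_freqs: {cell: {allele: frequency within that cell}}.
    An allele qualifies in a cell if its frequency there exceeds min_freq
    and it is present in at most max_cell_share of all cells.
    """
    n_cells = len(cell_freqs)
    # Count the number of cells in which each allele is present.
    presence = {}
    for freqs in cell_freqs.values():
        for allele, freq in freqs.items():
            if freq > 0:
                presence[allele] = presence.get(allele, 0) + 1
    restricted = {a for a, k in presence.items() if k / n_cells <= max_cell_share}
    return {
        cell: {a for a, f in freqs.items() if f > min_freq and a in restricted}
        for cell, freqs in cell_freqs.items()
    }
```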
Expected Heterozygosity (He), Fixation Index (F) and Genetic Distance (GD)
In situ conservation should focus on viable populations, where inbreeding and subsequent loss of alleles are minimal. Parameters that allow assessment of inbreeding are expected heterozygosity (He) and the fixation index (F). The fixation index (F) was used to detect areas subjected to high inbreeding depression and, as the inverse to that, excess in heterozygosity . Figure 5 shows the values for He in the study area, again confirming Cajamarca Department in northern Peru as the area with the highest genetic diversity. High He values, however, radiate towards the south (as opposed to the higher diversity towards the north found in the allelic richness analyses) indicating higher levels of diversity in terms of heterozygosity in central Peru compared to Ecuador. Figure 6 shows the values for the fixation index, with F values close to 0 in the Cajamarca Department indicating that natural and cultivated cherimoya tree stands in this area have not experienced inbreeding. The highest values for F are observed in central Ecuador, suggesting that the level of inbreeding is highest in that part of cherimoya's Andean distribution range.
This map shows the average He value in each 10-minutes grid cell with 20 or more trees applying a one-degree circular neighborhood.
This map shows the average F value in each 10-minutes cell with 20 or more trees applying a one-degree circular neighborhood. Yellow areas indicate cherimoya stands where observed heterozygosity is as expected, red areas indicate stands where observed heterozygosity is lower than expected (indicating inbreeding) whereas observed heterozygosity is higher than expected in green areas.
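For a single locus, the three quantities discussed here are related by He = 1 − Σ p_i² and F = 1 − Ho/He, where p_i is the frequency of allele i and Ho is the observed proportion of heterozygous trees. The Python sketch below applies these textbook definitions to the diploid genotypes in one grid cell. Whether the study applied a small-sample correction to He is not stated, so the uncorrected form is an assumption.

```python
from collections import Counter


def expected_heterozygosity(genotypes):
    """He = 1 - sum(p_i^2) from allele frequencies at one locus."""
    counts = Counter(allele for genotype in genotypes for allele in genotype)
    total = sum(counts.values())
    return 1.0 - sum((c / total) ** 2 for c in counts.values())


def observed_heterozygosity(genotypes):
    """Ho = proportion of individuals carrying two different alleles."""
    return sum(a != b for a, b in genotypes) / len(genotypes)


def fixation_index(genotypes):
    """F = 1 - Ho/He; positive suggests inbreeding, negative an excess of heterozygotes."""
    he = expected_heterozygosity(genotypes)
    if he == 0:
        return float("nan")  # undefined when the locus is monomorphic
    return 1.0 - observed_heterozygosity(genotypes) / he
```

With this sign convention, F close to 0 corresponds to the yellow areas of the map, positive F to the red areas and negative F to the green areas.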
The most important Peruvian commercial cherimoya cultivation area, located near the capital, Lima, stands out for its negative F values, i.e. an excess of heterozygosity. Most of the cherimoyas cultivated in this area are vegetatively propagated clones of the cultivar ‘Cumbe’, which is highly heterozygous according to the molecular analysis: the ‘Cumbe’ accession conserved in the Spanish genebank is heterozygous for eight of the nine microsatellite loci analyzed in this study (Ho value of 0.89). An analysis of the average genetic distance between the ‘Cumbe’ accession and the genotypes in each grid cell with 20 or more re-sampled trees in the study area clearly shows the lowest genetic distance values near the Peruvian capital, Lima, indicating that the cherimoya trees in this area are very similar to the cultivar ‘Cumbe’ (Figure 7). This area clearly differs from the rest of the cherimoya distribution area in our study, which is more likely to be a product of natural gene flow patterns.
This map shows the average genetic distance (GD) to the cultivar ‘Cumbe’ in each 10-minutes cell with 20 or more trees applying a one-degree circular neighborhood. The ‘Cumbe’ accession from the collection la Mayora, Malaga, Spain, was used as the reference for the cultivar.
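The distance measure behind this map is not named in the text. As a hedged illustration only, the Python sketch below uses a proportion-of-shared-alleles distance, which is a swapped-in assumption and not necessarily the measure used in the study. It also shows how an individual heterozygosity such as the 0.89 reported for ‘Cumbe’ follows from eight heterozygous loci out of nine.

```python
from collections import Counter


def shared_allele_distance(g1, g2):
    """1 minus the proportion of alleles shared between two diploid genotypes.

    g1, g2: lists of (allele_a, allele_b) tuples, one per locus, same locus order.
    Assumed measure for illustration; the study does not name its distance.
    """
    shared = sum(
        sum((Counter(x) & Counter(y)).values())  # 0, 1 or 2 shared copies per locus
        for x, y in zip(g1, g2)
    )
    return 1.0 - shared / (2 * len(g1))


def individual_heterozygosity(genotype):
    """Proportion of loci at which an individual carries two different alleles."""
    return sum(a != b for a, b in genotype) / len(genotype)
```

Averaging `shared_allele_distance(reference, tree)` over the re-sampled trees in a grid cell would give one value per cell, analogous to the quantity mapped here.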
Besides α-diversity parameters, aimed at identifying those areas with highest allelic richness and balanced allele frequencies, in situ conservation also needs to take into account allelic composition (β-diversity or divergence), as it is possible that populations with low allelic richness possess unique allele compositions, different from those of populations in other areas of the range, which would warrant their in situ conservation. Applying the Structure software and using the statistic parameter ΔK to define the number of clusters with genetically similar trees present in the study area, we differentiated two main populations. Figure 8 shows the differentiation of the populations among distribution areas in cluster A and B, respectively. Cluster A has the highest presence in the areas previously identified as those with the highest allelic richness (Cajamarca Department in northern Peru, border zone between Ecuador and Peru and the area around Quito in Ecuador), whereas cluster B is mainly confined to southern Peru and Bolivia. Bolivian cherimoya trees are almost exclusively assigned to cluster B. Particular areas that did not show a strong linkage to either of the two clusters included the surroundings of the city of Lima and Loja Province in southern Ecuador.
This map shows in each 10-minutes cell with 20 or more trees applying a one-degree circular neighborhood, the average probability of finding a cherimoya tree belonging to cluster A or B. Dark blue areas show a higher probability of finding trees belonging to cluster A whereas dark green areas show a higher probability of finding trees belonging to cluster B. Light blue colored areas are not clearly assigned to any of the two clusters.
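The ΔK statistic is commonly computed from the log-likelihoods that Structure reports for replicate runs at each candidate number of clusters K. The Python sketch below follows the usual form, the absolute second-order difference of the mean log-likelihood divided by the standard deviation at K. The input layout is an assumption, and the run settings and replicate numbers used in the study are not given in this section.

```python
from statistics import mean, stdev


def delta_k(log_likelihoods):
    """Compute Delta K for every K that has both neighbours K-1 and K+1.

    log_likelihoods: {K: [ln P(D|K) for each replicate run]}, at least two
    replicates per K so that a standard deviation can be computed.
    """
    result = {}
    for k in sorted(log_likelihoods):
        if k - 1 not in log_likelihoods or k + 1 not in log_likelihoods:
            continue
        sd = stdev(log_likelihoods[k])
        if sd == 0:
            continue  # Delta K is undefined without variation among replicates
        second_diff = (
            mean(log_likelihoods[k + 1])
            - 2 * mean(log_likelihoods[k])
            + mean(log_likelihoods[k - 1])
        )
        result[k] = abs(second_diff) / sd
    return result
```

The K with the largest ΔK is then taken as the most likely number of clusters, which in this study was two.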
Ex situ conservation status
Of the 1504 trees included in this study, 502 genotypes are currently conserved in ex situ collections (either in Ecuador, Peru or Spain). Only eight alleles, corresponding to 11% of the total of 71 alleles that have been found in the study area, are not represented in any accession of these collections. Figure 9 shows the distribution of the missing alleles. There is only a small area with a significant portion of missing alleles (3 in total), i.e. in southern Ecuador (Azuay Province). Natural cherimoya forest patches and areas of traditional cherimoya cultivation in this province should be prioritized for future cherimoya collection missions. With almost 90% of alleles found to be present in ex situ collections, it can be concluded that, in general, cherimoya diversity from the countries analyzed is fairly well conserved ex situ.
Richness analysis of alleles (eight alleles out of the total of 71 observed alleles) that are not found in any ex situ collection based on 10-minutes grid with a one-degree circular neighborhood.
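The gap analysis amounts to a set difference between the alleles observed across all trees and those carried by at least one ex situ accession. A minimal Python sketch, with illustrative names and an assumed input layout, is:

```python
def ex_situ_gaps(alleles_by_tree, ex_situ_ids):
    """Find alleles that are absent from every ex situ accession.

    alleles_by_tree: {tree_id: set of (locus, allele) pairs carried by the tree}.
    ex_situ_ids: ids of the trees conserved in ex situ collections.
    Returns (set of missing alleles, share of all observed alleles that is missing).
    """
    all_alleles = set().union(*alleles_by_tree.values())
    conserved = set().union(*(alleles_by_tree[t] for t in ex_situ_ids))
    missing = all_alleles - conserved
    share = len(missing) / len(all_alleles) if all_alleles else 0.0
    return missing, share
```

With the figures reported above, 8 missing alleles out of 71 give a share of about 0.11. Counting the missing alleles present in each grid cell then yields a richness map of the kind shown in Figure 9.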
Distribution range of cherimoya in the Andes
The above results and subsequent conclusions are obviously only of practical use if the sampling performed was indeed representative for the distribution of cherimoya in the study area. Maxent species distribution modeling software was applied to model cherimoya's distribution range in Ecuador, Peru and Bolivia based on the climatic niche in which the 1504 sampled trees of our study were located. The modeled distribution was then compared with the sampled areas in these countries.
Cross-validation, used to evaluate the quality of the distribution model, returned an Area Under Curve (AUC) value of 0.9, which indicates good model performance. AUC is a commonly used parameter in the validation of distribution models. Another validation measure, Kappa, returned a value of 0.799, indicating excellent model performance.
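Both validation measures can be computed from model scores at test locations. The Python sketch below gives the rank-based definition of AUC and Cohen's Kappa from a confusion matrix. It is a generic illustration: the comparison points (absences or background points) and the threshold used to turn scores into presence or absence for Kappa are not specified in this section, so they are left to the caller.

```python
def auc(presence_scores, comparison_scores):
    """Probability that a presence site scores higher than a comparison site.

    Ties count as half; this equals the area under the ROC curve.
    """
    wins = 0.0
    for p in presence_scores:
        for c in comparison_scores:
            if p > c:
                wins += 1.0
            elif p == c:
                wins += 0.5
    return wins / (len(presence_scores) * len(comparison_scores))


def cohen_kappa(tp, fp, fn, tn):
    """Agreement between predicted and observed classes, corrected for chance."""
    total = tp + fp + fn + tn
    observed = (tp + tn) / total
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / total ** 2
    return (observed - expected) / (1 - expected)
```

An AUC of 0.5 and a Kappa of 0 both correspond to a model that performs no better than chance, which puts the reported values of 0.9 and 0.799 in context.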
In general, sampling covered most of the modeled cherimoya distribution (Figure 10); 46% of the modeled distribution area is covered by grid cells with 20 or more re-sampled trees (Figure 10, dark blue areas). In 24.5% of the potential area of cherimoya occurrence fewer than 20 trees were re-sampled (light blue areas), whereas 29.5% of the modeled range was not sampled (red areas) and is considered a sample gap. The largest sample gaps are located in northern Peru in the transition zone between the Peruvian Andes and the Amazon (in the Departments of San Martin and Amazonas) and in southern Peru (in the Departments of Junín, Pasco, Huancavelica, Ayacucho and Puno). The Andean-Amazon transition zone should be a priority for future complementary cherimoya collection trips because it is adjacent to an area where high levels of diversity have already been found, i.e. Cajamarca Department in northern Peru.
Areas of the modeled distribution in dark blue are covered by the 10-minutes grid cells with 20 or more trees applying circular neighborhood. Light blue areas of modeled distribution coincide with grid cells that contain fewer than 20 trees after re-sampling. Red areas indicate potential areas for cherimoya occurrence and cultivation that have not been sampled.
Cherimoya was predicted absent by the distribution model in a significant area of southern Peru, indicating that the environmental conditions in substantial parts of that region are not suitable for cherimoya cultivation (Figure 10). This explains why no trees have been sampled in that area.
Areas of high diversity in the cherimoya centre of origin
Our results are in line with a previous genetic study of the Spanish cherimoya collection that also distinguished populations in Ecuador and northern Peru from those in southern Peru, and corroborate results from isozyme markers that showed high genetic variation present in Peru and Ecuador. However, our study is based on a much higher number of samples and, therefore, provides much more detail for prioritizing areas for in situ conservation and germplasm collection.
At the allele level, our analysis confirms that, within our study area, the highest allelic richness as well as the highest number of locally common alleles are found in the area of southern Ecuador and northern Peru, i.e. the putative centre of origin of cherimoya. Northern Peru, and more specifically the Cajamarca Department, shows the highest levels of genetic diversity.
The highest values of the fixation index, which is an indication of inbreeding, were found in Ecuador. Inbreeding may occur because of reduction and fragmentation of natural stands and cultivated areas, increasing the risk of allele loss, which eventually leads to genetic erosion . Our results do not allow us to determine how much genetic erosion has taken place in Ecuador in comparison to Peru and Bolivia, but the high inbreeding values in Ecuador could explain why currently allelic richness is lower in this country than in northern Peru.
At the population level, significant differences can be observed between the cherimoya germplasm present in the area with highest diversity (where genotypes belonging to cluster A are predominant) and genotypes found in areas with lower diversity, i.e. in southern Peru and Bolivia (represented by cluster B). Cluster A seems likely to represent material that is genetically closer to the “wild” cherimoya type. No natural cherimoya stands have been observed in Bolivia, and this probably explains why no genotypes pertaining to cluster A have been recorded there. Cluster B probably corresponds to a genepool that is genetically different from most of the wild or semi-domesticated cherimoya found in northern Peru and Ecuador and that could have formed the basis for Bolivian cherimoya cultivation. Looking at the areas of high cluster B dominance, Bolivian germplasm probably originates from southern Peru.
Although most early chroniclers and scientists proposed southern Ecuador and northern Peru to be cherimoya's centre of origin, the possibility of that area being a secondary centre of origin cannot be discarded. A diversity study similar to the one described in this study, but including cherimoya genotypes from Central America and Mexico, would shed light on the genetic variation across the complete pre-Columbian distribution range of cherimoya and provide additional clues on the primary centre of origin and diversification of this species.
Ex situ and in situ conservation of cherimoya genetic resources in the Andean region
Most alleles identified in our study are represented in one or more of the existing ex situ collections in Ecuador, Peru and Spain. The results obtained suggest that the highest priority for further collection should be the Azuay Province in Ecuador, since cherimoya stands in this area harbor most alleles not yet included in genebanks. It is also one of the areas with the highest risk of allele loss because of the high observed levels of inbreeding, compared to other parts of the study area. An additional priority area for germplasm collection is the transition zone from the Andes to the Amazon in Peru (in the higher elevation areas of the Departments of San Martin and Amazonas), which was not sampled in this study. According to the distribution model there is a high probability of finding cherimoya stands in this region, which probably is also high in genetic diversity, because it is adjacent to the area with the highest diversity found in this study, i.e. the Cajamarca Department in northern Peru.
A priority for in situ conservation should be the Cajamarca Department, the area with the highest levels of genetic diversity. A second area of priority should be the Loja Province in southern Ecuador, an area with a high number of locally common alleles. Both areas are assigned mostly to cluster A. Since trees predominantly assigned to cluster B have a particular allelic composition in comparison to trees predominantly grouped in cluster A, genotypes of cluster B should also be considered in conservation activities. The part of Lima Department north of the Peruvian capital, which is assigned mostly to cluster B, could be prioritized for in situ conservation of genotypes from this cluster. In contrast to the low levels of allelic richness around Lima city in the southern part of the Lima Department, the northern part of this Department contains a fair number of locally common alleles.
The long-term conservation of cherimoya genetic resources is far from guaranteed. As commercial prices for fruits can fluctuate, short-term incentives for farmers to maintain cherimoya as a profitable crop may be reduced, and a decline in commercial interest may lead to the replacement of cherimoya trees by other crops, increasing the risks of genetic erosion. Around Quito, for example, most of the traditional cherimoya cultivation is being replaced by avocado plantations, which are commercially more attractive (X. Scheldeman, pers. obs.). Conversely, an increase in commercial prices for cherimoya products will not necessarily promote the conservation of the existing genetic diversity. Indeed, in our study we found low levels of genetic diversity around the Peruvian capital, Lima, where the clonally propagated cultivar ‘Cumbe’ is widely cultivated because it fetches higher prices in the market.
A promising strategy to enhance in situ conservation on farm is through the promotion of seed or bud-for-grafting exchange between farmers . During the CHERLA project, cherimoya fairs, which facilitate exchange of plant material, were organized in different areas of this study, including the Cajamarca and Piura Departments in Peru, Loja Province in Ecuador and various departments in Bolivia. Seed and bud exchange can also be a way to conserve local races from unfavorable alterations in the local environment due to climate change, by re-distributing them in new areas with suitable climate conditions . Another way to ensure conservation of genetic resources of tree species while their use is stimulated could be the establishment of local clonal seed orchards if and when adequate propagation techniques, to enable the multiplication of clones, are made available as well , . This is the case for cherimoya, as demonstrated by the successful clonal propagation of the cultivar ‘Cumbe’ around the city of Lima.
Ideally, each area targeted for in situ conservation, where the existing cherimoya stands and forest patches can evolve within the local environment, should be backed up by ex situ conservation of germplasm (which currently is the case for cherimoya), and be monitored periodically to assess the dynamics in diversity use and risks of genetic erosion. Ex situ collections of fruit tree species, such as the cherimoya collections, often consist of living trees. This allows conservation of superior combinations of alleles that can be propagated vegetatively through grafting. Additional reasons include the following: (1) many tropical and subtropical trees (including cherimoya) have seeds with recalcitrant or intermediate behavior, which cannot be stored for long-term conservation; and (2) pollen, fruits and seeds can be collected continuously for characterization, evaluation and genetic improvement once trees have reached the reproductive stage. Nevertheless, the high costs for research institutions to maintain field genebanks of woody perennial species can be a reason to minimize ex situ collections and focus especially on in situ conservation. In that case, it is important to screen the existing accessions through morphological, biochemical and/or molecular characterization to maximize the conservation of genetic diversity and potentially interesting functional attributes in a reduced collection. This approach has already been used successfully in the cherimoya collection la Mayora, Malaga, Spain. Ex situ conservation may be particularly important for areas that suffer from inbreeding (an indicator of high rates of allelic loss and genetic erosion), such as central Ecuador in the case of cherimoya, whereas in situ conservation may be most successful in areas of high diversity where rates of inbreeding are still low, such as the cherimoya stands of northern Peru.
Use of GIS and molecular markers to enhance conservation and use of plant genetic resources
Despite advances in computational applications and the use of molecular tools, spatial analyses are still underutilized in efforts to conserve plant diversity. With respect to targeting collection sites and prioritizing the conservation of plant genetic resources, spatial analyses of diversity have been carried out mainly at the species level for crop genepools. Only a few studies have mapped intraspecific diversity to enhance the conservation of genetic resources of specific crops and trees. Kiambi et al. grouped samples using a grid to compare diversity between geographic areas of similar size, whereas Lowe et al. applied re-sampling to enable the calculation of diversity estimates with high degrees of confidence. However, these studies were carried out with fewer than 100 individuals per species, which limits the type of spatial analysis that can be carried out over the geographic distribution range of a species. Our analysis combines both techniques on a large dataset (1504 trees), which can be conceptualized as a continuous distribution of plant individuals in which each individual is connected to its neighboring trees because they share the same seed system and/or interbreed. Based on this concept, trees were sampled in this study following a scattered distribution in order to calculate, across the Andean distribution range of cherimoya, several diversity estimates that are important for prioritizing areas for conservation, including two recommended parameters: allelic richness and the number of locally common alleles. Since molecular tools are becoming cheaper, intraspecific diversity studies with large datasets will probably become more common in the near future, allowing studies of this sort on other tree species and annual crops.
The size of the grid cells and the width of the circular neighborhood for this type of spatial analysis depend on how many plant individuals have been collected across the landscape and on the minimum number of individuals considered sufficient to make confident estimates of genetic parameters per grid cell. Applying a circular neighborhood provides an effective way to decrease grid cell size, which facilitates the detection of spatial patterns in genetic variation across an extensive distribution range. By re-sampling the trees in the landscape, the technique generates a high number of grid cells with enough trees to calculate genetic parameters with confidence. It also makes the analyses less sensitive to the definition of the grid origin and allows isolated trees to be included in the calculation of the genetic parameters, i.e. together with their closest neighboring trees.
Ideally, the sampling strategy for this type of analysis should be based on a pre-defined grid, aiming to sample the same number of trees per grid cell. However, due to logistical constraints and because a species may simply be more abundant in some areas than in others, sampling will in practice always be sub-optimal to some degree. Of all the genetic parameters, allelic richness is the most sensitive to uneven sampling and, accordingly, we corrected for sample size by rarefaction. Repeated subsampling of a minimum number of trees per grid cell is another way to correct for sampling bias. This technique could also be used to correct genetic parameters other than allelic richness, such as expected heterozygosity, although these are less sensitive to uneven sampling.
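The rarefaction correction for allelic richness can be illustrated with a short sketch. It is not the HP-RARE implementation; it only applies the standard rarefaction formula, in which an allele with count N_i among N sampled gene copies is missing from a random subsample of g gene copies with probability C(N − N_i, g)/C(N, g). The function name `rarefied_richness` and the allele counts are hypothetical, and treating 20 diploid trees as g = 40 gene copies is an assumption made for illustration.

```python
from math import comb

def rarefied_richness(allele_counts, g):
    """Expected number of distinct alleles at one locus in a random
    subsample of g gene copies, drawn without replacement."""
    n = sum(allele_counts)
    if g > n:
        raise ValueError("subsample size g exceeds the number of gene copies")
    total = comb(n, g)
    # comb(n - n_i, g) is 0 when fewer than g copies lack allele i,
    # in which case that allele is certain to appear in the subsample.
    return sum(1 - comb(n - n_i, g) / total for n_i in allele_counts)

# Hypothetical allele counts for two grid cells with unequal sample sizes,
# both standardized to g = 40 gene copies (assumed: 20 diploid trees).
cell_a = [50, 30, 15, 4, 1]   # 100 gene copies, 5 alleles observed
cell_b = [25, 10, 5]          # 40 gene copies, 3 alleles observed
richness_a = rarefied_richness(cell_a, 40)
richness_b = rarefied_richness(cell_b, 40)
```

In this toy case the larger sample loses part of its raw richness after standardization, because its rarest alleles are unlikely to appear in a subsample of 40 gene copies, whereas the smaller sample keeps all three alleles.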
Given the sampling distribution in our study area and the fact that we maintained a minimum of 20 re-sampled trees per grid cell for the calculation of most genetic parameters, we defined a cell size of 10 minutes and a circular neighborhood with a diameter of one degree, which enabled us to detect spatial patterns of genetic variation at administrative level one in Ecuador, Peru and Bolivia (provinces and departments). For studies of plant species in which individuals are sampled in a more clumped distribution than our scattered sampling distribution and/or at lower densities across the landscape, larger grid cells and/or a wider circular neighborhood could be applied, always ensuring a sufficient number of trees per grid cell. The overall resolution of such a study will then be lower.
Following Frankel et al., we hypothesized that areas with high diversity as measured by neutral molecular markers (like our microsatellite loci) have a high probability of containing genetic material that also shows diversity in functional traits, including traits of agronomic interest. Molecular markers are considered an appropriate indicator to quantify patterns and trends in the use and conservation of plant genetic resources. However, while neutral molecular marker surveys are suitable for diversity studies, direct measurement of traits in field trials may be more desirable to evaluate the genetic health and adaptive capacity of tree populations. Nevertheless, molecular marker studies representative of the whole genome provide a less expensive and scientifically sound alternative for assessing the genetic resource status of tree species, for which field trials are particularly expensive in comparison to annual crops because of the long generation times. Markers of DNA sequences related to phenotypic traits, including expressed sequence tag (EST) markers and markers in specific genes, could be of interest to include in spatial analyses of patterns and trends in plant genetic resources. More and more such markers are becoming available, especially for important crops for which sequencing programs have been performed or will be carried out in the near future. An example relevant to cherimoya is a recently described gene involved in seedlessness in a sister species, Annona squamosa. However, these markers are less polymorphic than neutral ones, such as those used in our study, so neutral markers are still necessary to study spatial patterns of genetic diversity.
It is difficult to compare our results with those of Lowe et al. and Kiambi et al. because of the differences in the methodologies used. To allow comparison of molecular marker studies on the same species, minimum standard sets of markers have already been suggested. Standardization of methodologies in studies on different species would improve the comparability of results and would also facilitate meta-analyses, for example to better understand how well the genetic diversity of tropical and subtropical tree species is protected on farm and in protected areas.
In our study, we examined spatial patterns of genetic variation only, without relating them to other spatial attributes. GIS can also be used to link genetic data to available spatial information relevant to the conservation of plant genetic resources, for instance to reveal short-term threats such as accessibility and long-term threats such as climate change. With this type of analysis, hotspots of diversity under threat could be identified following Myers et al., but at the intraspecific level instead of the species level, to ensure the conservation of priority populations of specific crops and useful tree species. Spatial information on the patterns and characteristics of human societies can be used to understand the drivers behind threats. In a study on changes in cassava diversity in the Peruvian Amazon, GIS was used to correlate cassava diversity data with biotic and socio-economic spatial data in order to identify possible drivers behind diversity and genetic erosion. This can be useful information for developing adequate policies and measures to promote in situ conservation of plant genetic resources on farms and in natural populations.
Materials and Methods
Sampling and SSR analysis: A total of 1504 cherimoya accessions were analyzed in this study: 395 from Bolivia, 351 from Ecuador and 758 from Peru. DNA was extracted from young leaves following a previously described protocol. Based on polymorphism, a set of nine SSRs was selected from those previously developed in cherimoya. A 15 µL reaction solution containing 16 mM (NH4)2SO4, 67 mM Tris-HCl pH 8.8, 0.01% Tween20, 2 mM MgCl2, 0.1 mM each dNTP, 0.4 µM each primer, 25 ng genomic DNA and 0.5 units of BioTaq™ DNA polymerase (Bioline, London, UK) was used for amplification on an I-cycler (Bio-Rad Laboratories, Hercules, CA, USA) thermocycler with the following temperature profile: an initial step of 1 min at 94°C; 35 cycles of 30 s at 94°C, 30 s at 45°C–55°C and 1 min at 72°C; and a final step of 5 min at 72°C. Forward primers were labeled with a fluorescent dye on the 5′ end. The PCR products were analyzed by capillary electrophoresis in a CEQ™ 8000 capillary DNA analysis system (Beckman Coulter, Fullerton, CA, USA). Samples were denatured at 90°C for 120 s, injected at 2.0 kV for 30 s and separated at 6.0 kV for 35 min. Each reaction was repeated twice, and the Spanish cultivar Fino de Jete was used as a control in each run to ensure size accuracy and to minimize run-to-run variation.
Data cleaning: The coordinates of the tree locations were checked in DIVA-GIS (www.diva-gis.org) for erroneous points, based on passport data at administrative level one (e.g. departments, provinces) with a buffer of 20 minutes (approx. 30 km), and for outliers, based on climate data derived from the Worldclim data set (outliers in two or more of the 19 bioclim variables according to the reverse jackknife method). Based on these analyses, two points were excluded. The cleaned dataset thus included microsatellite data for 1504 georeferenced trees. Given that nine SSR markers were analyzed and each tree carries two alleles per locus, this results in a total of 27,072 georeferenced alleles.
Spatial analysis – Circular neighborhood: Grids for all genetic parameters were generated in DIVA-GIS, based on a grid with a cell size of 10 minutes (approximately 18 km in the study area) and a circular neighborhood with a diameter of one degree (approximately 111 km) constructed in Excel. The circular neighborhood is used to re-sample the allelic composition of a single tree to all surrounding grid cells within a diameter of one degree around its location, in this case 32 cells of 10 minutes. In this way, the allelic composition of each sampled tree is considered representative of the area within the defined buffer zone. Applying the circular neighborhood re-sampling technique resulted in a total dataset of 48,128 trees and 866,304 alleles.
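The re-sampling step can be sketched in a few lines of Python; this is an illustration, not the DIVA-GIS and Excel workflow used in the study. The count of 32 cells is reproduced if the one-degree circle is centered on a grid-cell corner and a cell is included when its center lies within 0.5 degree of that corner. This layout is an assumption, as are the function names and the tree coordinates below.

```python
import math
from collections import defaultdict

CELL_DEG = 10 / 60      # 10 arc-minute cells, expressed in degrees
CELLS_PER_RADIUS = 3    # 0.5 degree radius divided by 10-minute cells

def neighborhood_offsets(r=CELLS_PER_RADIUS):
    """Cell offsets (in cell units) whose centers fall inside a circle of
    radius r cells centered on a grid-cell corner (assumed layout)."""
    return [(i, j)
            for i in range(-r, r)
            for j in range(-r, r)
            if math.hypot(i + 0.5, j + 0.5) <= r]

def resample(trees, offsets):
    """Copy every tree into all cells of its circular neighborhood.
    `trees` maps a tree id to (lon, lat) in decimal degrees."""
    cells = defaultdict(list)
    for tree_id, (lon, lat) in trees.items():
        # Snap the tree to the nearest grid corner (assumption).
        ci, cj = round(lon / CELL_DEG), round(lat / CELL_DEG)
        for di, dj in offsets:
            cells[(ci + di, cj + dj)].append(tree_id)
    return cells

offsets = neighborhood_offsets()
# Hypothetical coordinates, for illustration only.
trees = {"t1": (-79.20, -4.00), "t2": (-79.25, -4.05), "t3": (-68.10, -16.50)}
cells = resample(trees, offsets)
n_records = sum(len(ids) for ids in cells.values())
```

With 32 cells per tree, 1504 trees yield 1504 × 32 = 48,128 re-sampled trees, and with nine loci and two alleles per locus this gives 48,128 × 18 = 866,304 alleles, matching the totals reported above.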
Spatial analysis – α-diversity: After applying the circular neighborhood to all trees, genetic parameters were calculated in GenAlEx per 10-minute grid cell for all trees present in each cell after re-sampling. The genetic parameters included the average number of alleles per locus (Na), the number of locally common alleles per locus (alleles occurring with a frequency higher than 5% in 25% or less of the grid cells), the average expected heterozygosity per locus (He), the fixation index (F) and genetic distance (GD). Na and the number of locally common alleles per locus were presented for all grid cells with trees included. Na was additionally corrected by rarefaction to a minimum sample size of 20 trees per cell with the HP-RARE software; consequently, the rarefied parameter was only calculated for grid cells with 20 or more re-sampled trees. This minimum sample size was also used as the threshold number of trees per grid cell required to obtain interpretable results for He, F and GD. GD, which was used to calculate the distance in allelic composition between each cherimoya genotype and the commercial variety ‘Cumbe’, was calculated in GenAlEx using the GD option for codominant markers. The final GD value per grid cell was the average GD of all re-sampled trees present in each cell. The reference tree was the accession ‘Cumbe’ from the Spanish cherimoya genebank in Malaga.
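To make the per-cell parameters concrete, the following sketch computes Na, observed and expected heterozygosity and the fixation index for one locus, and flags locally common alleles across cells. It is not GenAlEx code: He is taken as 1 − Σp² and F as (He − Ho)/He, and the reading of "locally common" (frequency above 5% in a cell, reached in at most 25% of all cells) is an interpretation of the definition above. Function names, allele sizes and cell labels are hypothetical.

```python
from collections import Counter

def locus_stats(genotypes):
    """Na, Ho, He and F for one locus in one grid cell.
    `genotypes` is a list of (allele_1, allele_2) tuples for diploid trees."""
    n = len(genotypes)
    counts = Counter(a for g in genotypes for a in g)
    freqs = {a: c / (2 * n) for a, c in counts.items()}
    ho = sum(a != b for a, b in genotypes) / n       # observed heterozygosity
    he = 1 - sum(p * p for p in freqs.values())      # expected heterozygosity
    f = (he - ho) / he if he > 0 else float("nan")   # fixation index
    return {"Na": len(counts), "Ho": ho, "He": he, "F": f, "freqs": freqs}

def locally_common(cell_freqs, min_freq=0.05, max_share=0.25):
    """Per cell, alleles with frequency > min_freq that reach this
    frequency in at most max_share of all cells."""
    n_cells = len(cell_freqs)
    cells_with = Counter()
    for freqs in cell_freqs.values():
        cells_with.update(a for a, p in freqs.items() if p > min_freq)
    local = {a for a, k in cells_with.items() if k / n_cells <= max_share}
    return {cell: sorted(a for a, p in freqs.items()
                         if p > min_freq and a in local)
            for cell, freqs in cell_freqs.items()}

# Hypothetical genotypes (allele sizes in bp) for two cells.
random_mating = locus_stats([(150, 150), (150, 154), (154, 154), (150, 154)])
fully_inbred = locus_stats([(150, 150), (154, 154)])

# Hypothetical allele frequencies at one locus in four cells.
cell_freqs = {"A": {150: 0.90, 160: 0.10}, "B": {150: 1.0},
              "C": {150: 1.0}, "D": {150: 0.97, 160: 0.03}}
common = locally_common(cell_freqs)
```

In the toy data, allele 160 exceeds 5% only in cell A, that is in one of four cells, so it counts as locally common there, while allele 150 is common everywhere and is not flagged.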
Spatial analysis – β-diversity: Population structure was defined by running the software Structure on all 1504 samples, applying a burn-in period of 10,000, 10,000 Markov chain Monte Carlo (MCMC) repetitions after burn-in, and 20 iterations. Optimal K was selected by running Structure for K values between 1 and 10 and taking as the final number of clusters the K at which the value of ΔK was highest. This was at K = 2; hence a map was developed for these two clusters, which we named A and B. We used the probabilities of each tree belonging to clusters A and B to visualize the clusters on a map. Probabilities were mapped as the average value of all trees per 10-minute cell, for those grid cells with 20 or more re-sampled trees after applying the one-degree circular neighborhood.
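The ΔK criterion can be sketched as follows. The sketch assumes that ΔK is the absolute second-order difference of the log-likelihood L(K) divided by the standard deviation of L(K) across replicate runs, and it takes the second difference of the replicate means, which is an assumption about how replicates are combined. The function name `delta_k` and the log-likelihood values are hypothetical and are not output from the study.

```python
from statistics import mean, stdev

def delta_k(lnp):
    """Delta K for each K that has both neighbors K-1 and K+1.
    `lnp` maps K to a list of log-likelihoods, one per replicate run."""
    avg = {k: mean(runs) for k, runs in lnp.items()}
    out = {}
    for k in sorted(lnp):
        if k - 1 not in lnp or k + 1 not in lnp:
            continue  # Delta K is undefined at the ends of the K range
        second_diff = abs(avg[k + 1] - 2 * avg[k] + avg[k - 1])
        out[k] = second_diff / stdev(lnp[k])
    return out

# Hypothetical replicate log-likelihoods for K = 1..4.
lnp = {
    1: [-1000, -1002, -998],
    2: [-800, -801, -799],
    3: [-790, -792, -788],
    4: [-785, -789, -781],
}
dk = delta_k(lnp)
best_k = max(dk, key=dk.get)
```

Because ΔK needs both neighboring values of K, a run over K = 1 to 10 yields ΔK only for K = 2 to 9.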
Spatial analysis – Ex situ conservation status: The private alleles function in GenAlEx (PAS) was used to identify the alleles found exclusively in trees that were sampled in situ. To visualize spatial patterns in these alleles, which are not included in any genebank, a point-to-grid richness analysis using a 10-minute grid was carried out in DIVA-GIS, based on the tree grid re-sampled with the one-degree circular neighborhood.
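The logic of this gap analysis amounts to a set difference per locus followed by a count per grid cell. The sketch below is a plain Python illustration of that idea and does not reproduce the GenAlEx or DIVA-GIS functions; the function names, locus names, allele sizes and cell indices are all hypothetical.

```python
def alleles_missing_ex_situ(in_situ, ex_situ):
    """Per locus, alleles observed in in situ trees but in no genebank
    accession. Both arguments map a locus name to a list of
    (allele_1, allele_2) genotypes."""
    missing = {}
    for locus, genotypes in in_situ.items():
        field = {a for g in genotypes for a in g}
        bank = {a for g in ex_situ.get(locus, []) for a in g}
        missing[locus] = sorted(field - bank)
    return missing

def gap_richness(cell_alleles, missing):
    """Number of distinct missing (locus, allele) pairs present per cell.
    `cell_alleles` maps a cell to the (locus, allele) pairs found in it."""
    wanted = {(locus, a) for locus, alleles in missing.items() for a in alleles}
    return {cell: len(set(pairs) & wanted)
            for cell, pairs in cell_alleles.items()}

# Hypothetical genotypes at two loci.
in_situ = {"locus_1": [(150, 154), (154, 160)], "locus_2": [(200, 200)]}
ex_situ = {"locus_1": [(150, 154)], "locus_2": [(200, 204)]}
missing = alleles_missing_ex_situ(in_situ, ex_situ)

# Hypothetical allele content of two grid cells after re-sampling.
cell_alleles = {(0, 0): [("locus_1", 160), ("locus_1", 154)],
                (0, 1): [("locus_1", 150)]}
richness = gap_richness(cell_alleles, missing)
```

Cells with a high count in `richness` would correspond to areas where additional collecting could add alleles that are absent from the ex situ collections.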
Spatial analysis – Distribution modeling: To identify how well the sampling covered the Andean distribution range of cherimoya, and thus to identify potential collection gaps, we modeled the distribution (presence only) of cherimoya in the study area using the distribution modeling program Maxent. With this technique, potential distribution areas are identified as those areas where environmental conditions are similar to those at the sites where the species has already been observed. The data required to identify these areas include species presence points as well as layers of environmental variables covering the study area. Maxent is a species distribution modeling tool whose algorithm has been evaluated as performing very well in comparison to other ecological niche modeling software. Therefore, it was selected for the distribution modeling analysis in this study. The coordinates in the passport data of the sampled trees were used as presence points. As environmental layers, we used the 10-minute grids of 19 bioclimatic variables derived from the Worldclim dataset. The modeled distribution area was restricted using the 10 percentile training presence threshold, which indicates the probability value at which 10% of the presence points fall outside the potential area. The modeled distribution was generated in Maxent with 80% of the points (training data) and was cross-validated in DIVA-GIS with the remaining 20% of the tree observations (test data). Besides 20% of the presence points, the test data included randomly generated points in 0.1× the bounding box of the presence points as a proxy for absence points (5 times the number of presence points). Based on the cross-validation, the area under the curve (AUC) and the Kappa value were calculated in DIVA-GIS as measures of model performance.
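The threshold and the two performance measures can be illustrated with a small, self-contained sketch. It does not reproduce the Maxent or DIVA-GIS calculations: the threshold uses one simple rank-based convention, AUC is computed as the share of presence and pseudo-absence pairs that are ranked correctly, and Kappa is Cohen's kappa of the thresholded prediction. All function names and suitability scores are hypothetical.

```python
def percentile_threshold(training_scores, percent=10):
    """Score below which `percent` % of the training presences fall
    (one simple rank-based convention)."""
    ranked = sorted(training_scores)
    return ranked[(len(ranked) * percent) // 100]

def auc(presence_scores, absence_scores):
    """Probability that a random presence outranks a random pseudo-absence
    (ties count one half)."""
    wins = sum((p > a) + 0.5 * (p == a)
               for p in presence_scores for a in absence_scores)
    return wins / (len(presence_scores) * len(absence_scores))

def kappa(presence_scores, absence_scores, threshold):
    """Cohen's kappa of the thresholded prediction against the test labels."""
    tp = sum(s >= threshold for s in presence_scores)
    fn = len(presence_scores) - tp
    fp = sum(s >= threshold for s in absence_scores)
    tn = len(absence_scores) - fp
    n = tp + fn + fp + tn
    observed = (tp + tn) / n
    expected = ((tp + fn) * (tp + fp) + (tn + fp) * (tn + fn)) / n ** 2
    return (observed - expected) / (1 - expected)

# Hypothetical suitability scores at training and test points.
training = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0]
threshold = percentile_threshold(training)   # one of ten presences falls below

test_presence = [0.9, 0.8, 0.7, 0.6, 0.2]
pseudo_absence = [0.1, 0.3, 0.4, 0.5, 0.05]
auc_value = auc(test_presence, pseudo_absence)
kappa_value = kappa(test_presence, pseudo_absence, threshold=0.55)
```

AUC does not depend on a threshold, whereas Kappa does, so in practice Kappa would be reported for the same threshold that delimits the modeled distribution area.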
All maps were edited in ArcMap.
Acknowledgments
We thank Jorge Rojas and his team from PROINPA for DNA extraction and Bernardo Guzmán from PROINPA for field prospection and sampling in Bolivia. We also thank the personnel from INIA for the DNA extraction and field prospection in Peru, and from INIAP Ecuador: Doris Chalampuente, Fernando Paredes, Marcelo Tacán, Eddie Zambrano and Edwin Naranjo. Laura Snook and Evert Thomas from Bioversity and an anonymous reviewer provided useful comments on an early version of the manuscript.
Author Contributions
Conceived and designed the experiments: XS JIH WG CT JR MS MAV. Performed the experiments: PE MV JIH. Analyzed the data: MvZ XS. Contributed reagents/materials/analysis tools: PE MAV JIH WG CT JR MS MvZ XS. Wrote the paper: MvZ XS PVD JIH.
References
- 1. Ræbild A, Larsen AS, Jensen JS, Ouedraogo M, De Groote S, et al. (2011) Advances in domestication of indigenous fruit trees in the West African Sahel. New Forests 41: 297–315.
- 2. Dawson IK, Lengkeek A, Weber JC, Jamnadass R (2009) Managing genetic variation in tropical trees: linking knowledge with action in agroforestry ecosystems for improved conservation and enhanced livelihoods. Biodiversity and Conservation 18: 969–986.
- 3. Dawson IK, Vinceti B, Weber JC, Neufeldt H, Russell J, et al. (2011) Climate change and tree genetic resource management: maintaining and enhancing the productivity and value of smallholder tropical agroforestry landscapes. A review. Agroforestry Systems 81: 67–78.
- 4. Palmberg-Lerche C (2008) Thoughts on the conservation of forest biological diversity and forest tree and shrub genetic resources. Journal of Tropical Forest Science 20: 300–312.
- 5. Petit RJ, El Mousadik A, Pons O (1998) Identifying populations for conservation on the basis of genetic markers. Conservation Biology 12: 844–855.
- 6. Frankel OH, Brown AHD, Burdon J (1995) The conservation of cultivated plants. The conservation of plant biodiversity. pp. 79–117. Cambridge University Press, UK. First edition.
- 7. Tanksley SD, McCouch SR (1997) Seed banks and molecular maps: unlocking genetic potential from the wild. Science 277: 1063–1066.
- 8. FAO (2010) The second report on the state of the world's plant genetic resources for food and agriculture. Rome.
- 9. FAO (2011) Draft updated global plan of action for the conservation and sustainable utilization of plant genetic resources for food and agriculture. Fifth session of the Intergovernmental Technical Working Group on Plant Genetic Resources for Food and Agriculture, Rome, 27–29 April 2011.
- 10. Bremer B, Bremer K, Chase MW, Fay MF, Reveal JL, et al. (2009) An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG III. Botanical Journal of the Linnean Society 161: 105–121.
- 11. Escribano P, Viruel MA, Hormaza JI (2007) Molecular analysis of genetic diversity and geographic origin within an ex situ germplasm collection of cherimoya by using SSRs. Journal of the American Society for Horticultural Science 132: 357–367.
- 12. Popenoe H, King SR, León J, Kalinowski LS, Vietmeyer ND, et al. (1989) Cherimoya. Lost crops of the Incas: Little-known plants of the Andes with promise for worldwide cultivation. pp. 228–239. National Academy Press, Washington, D.C.
- 13. Van Damme P, Scheldeman X (1999) Promoting cultivation of cherimoya in Latin America. Unasylva 198: 43–47.
- 14. Lora J, Hormaza JI, Herrero M (2010) The progamic phase of an early-divergent angiosperm, Annona cherimola (Annonaceae). Annals of Botany 105: 221–231.
- 15. Popenoe W (1921) The native home of the cherimoya. Journal of Heredity 12: 331–336.
- 16. Guzman VL (1951) Informe del viaje de exploración sobre la cherimoya y otros frutales tropicales. 25 p. Ministerio de Agricultura, Centro Nacional de Investigación y Experimentación Agrícola La Molina, Lima, Peru.
- 17. Gepts P (2003) Crop domestication as a long-term selection experiment. In: Janick J, editor. Plant breeding reviews 24 Part 2: Long-term Selection: Crops, Animals, and Bacteria. pp. 1–44.
- 18. Pozorski T, Pozorski S (1997) Cherimoya and guanabana in the archaeological record of Peru. Journal of Ethnobiology 17: 235–248.
- 19. Scheldeman X, Van Damme P, Ureña Alvarez JV, Romero Motoche JP (2003) Horticultural potential of Andean fruit crops exploring their centre of origin. Acta Horticulturae 598: 97–102.
- 20. Vanhove W, Van Damme P (2009) Marketing of cherimoya in the Andes for the benefit of the rural poor and as a tool for agrobiodiversity conservation. Acta Horticulturae 806: 497–504.
- 21. CHERLA (2008) Inventory of current ex situ germplasm collections. Deliverable 7, Project no. 015100, INCO sixth framework programme.
- 22. Pascual L, Perfectti F, Gutierrez M, Vargas AM (1993) Characterizing isozymes of Spanish cherimoya cultivars. HortScience 28: 845–847.
- 23. Perfectti F, Pascual L (1998) Characterization of cherimoya germplasm by isozyme markers. Fruit Varieties Journal 52: 53–62.
- 24. Perfectti F, Pascual L (2005) Genetic diversity in a worldwide collection of cherimoya cultivars. Genetic Resources and Crop Evolution 52: 959–966.
- 25. Escribano P, Viruel MA, Hormaza JI (2004) Characterization and cross-species amplification of microsatellite markers in cherimoya (Annona cherimola Mill. Annonaceae). Molecular Ecology Notes 4: 746–748.
- 26. Escribano P, Viruel MA, Hormaza JI (2008) PERMANENT GENETIC RESOURCES: Development of 52 new polymorphic SSR markers from cherimoya (Annona cherimola Mill.). Transferability to related taxa and selection of a reduced set for DNA fingerprinting and diversity studies. Molecular Ecology Resources 8: 317–321.
- 27. Escribano P, Viruel MA, Hormaza JI (2008) Comparison of different methods to construct a core germplasm collection in woody perennial species with simple sequence repeat markers. A case study in cherimoya (Annona cherimola, Annonaceae), an underutilised subtropical fruit tree species. Annals of Applied Biology 153: 25–32.
- 28. Manel S, Schwartz MK, Luikart G, Taberlet P (2003) Landscape genetics: combining landscape ecology and population genetics. Trends in Ecology and Evolution 18: 189–197.
- 29. Holderegger R, Buehler D, Gugerli F, Manel S (2010) Landscape genetics of plants. Trends in Plant Science 15: 675–683.
- 30. Scheldeman X, van Zonneveld M (2010) Training manual on spatial analysis of plant diversity and distribution. Bioversity International, Rome, Italy.
- 31. Eaton D, Windig J, Hiemstra SJ, van Veller M, Trach NX, et al. (2006) Indicators for livestock and crop biodiversity. Report 2006/05. CGN/DLO Foundation, Wageningen UR, Wageningen.
- 32. Kozak KH, Graham CH, Wiens JJ (2008) Integrating GIS-based environmental data into evolutionary biology. Trends in Ecology and Evolution 23: 141–148.
- 33. Degen B, Scholz F (1998) Spatial genetic differentiation among populations of European beech (Fagus sylvatica L.) in western Germany as identified by geostatistical analysis. Forest Genetics 5: 191–199.
- 34. Hanotte O, Bradley DG, Ochieng JW, Verjee Y, Hill EW, et al. (2002) African pastoralism: genetic imprints of origins and migrations. Science 296: 336–339.
- 35. Hoffmann MH, Glaß AS, Tomiuk J, Schmuths H, Fritsch RM, et al. (2003) Analysis of molecular data of Arabidopsis thaliana (L.) Heynh. (Brassicaceae) with Geographical Information Systems (GIS). Molecular Ecology 12: 1007–1019.
- 36. Lowe AJ, Gillies ACM, Wilson J, Dawson IK (2000) Conservation genetics of bush mango from central/west Africa: implications from random amplified polymorphic DNA analysis. Molecular Ecology 9: 831–841.
- 37. Vigouroux Y, Glaubitz JC, Matsuoka Y, Goodman MM, Sánchez GJ, et al. (2008) Population structure and genetic diversity of New World maize races assessed by DNA microsatellites. American Journal of Botany 95: 1240–1253.
- 38. McRae BH (2006) Isolation by resistance. Evolution 60: 1551–1561.
- 39. van Etten J, Hijmans RJ (2010) A geospatial modelling approach integrating archaeobotany and genetics to trace the origin and dispersal of domesticated plants. PLoS ONE 5: e12060. doi:10.1371/journal.pone.0012060.
- 40. Guarino L, Jarvis A, Hijmans RJ, Maxted N (2002) Geographic Information Systems (GIS) and the conservation and use of plant genetic resources. In: Engels JMM, Ramanatha Rao V, Brown AHD, Jackson MT, editors. Managing plant genetic diversity. pp. 387–404. International Plant Genetic Resources Institute (IPGRI), Rome, Italy.
- 41. Kiambi DK, Newbury HJ, Maxted N, Ford-Lloyd BV (2008) Molecular genetic variation in the African wild rice Oryza longistaminata A. Chev. et Roehr. and its association with environmental variables. African Journal of Biotechnology 7: 1446–1460.
- 42. Jarvis A, Touval JL, Castro Schmitz M, Sotomayor L, Hyman GG (2010) Assessment of threats to ecosystems in South America. Journal for Nature Conservation 18: 180–188.
- 43. Frankel OH, Brown AHD, Burdon J (1995) The genetic diversity of wild plants. In: Frankel OH, Brown AHD, Burdon J, editors. The conservation of plant biodiversity. pp. 10–38. Cambridge University Press, UK. First edition.
- 44. Hijmans RJ, Garrett KA, Huamán Z, Zhang DP, Schreuder M, et al. (2000) Assessing the geographic representativeness of genebank collections: the case of Bolivian wild potatoes. Conservation Biology 14: 1755–1765.
- 45. Peakall R, Smouse PE (2006) GENALEX 6: genetic analysis in Excel. Population genetic software for teaching and research. Molecular Ecology Notes 6: 288–295.
- 46. Pritchard JK, Stephens M, Donnelly P (2000) Inference of Population Structure Using Multilocus Genotype Data. Genetics 155: 945–959.
- 47. Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Molecular Ecology 14: 2611–2620.
- 48. Araújo MB, Pearson RG, Thuiller W, Erhard M (2005) Validation of species-climate impact models under climate change. Global Change Biology 11: 1504–1513.
- 49. Fielding AH, Bell JF (1997) A review of methods for the assessment of prediction errors in conservation presence/absence models. Environmental Conservation 24: 38–49.
- 50. Lowe AJ, Boshier D, Ward M, Bacles CFE, Navarro C (2005) Genetic resource impacts of habitat loss and degradation; reconciling empirical evidence and predicted theory for neotropical trees. Heredity 95: 255–273.
- 51. Bonavia D, Ochoa CM, Tovar SO, Palomino RC (2004) Archaeological evidence of cherimoya (Annona cherimolia Mill.) and guanabana (Annona muricata L.) in ancient Peru. Economic Botany 58: 509–522.
- 52. Tapia ME (2000) Mountain agrobiodiversity in Peru. Seed fairs, seed banks, and mountain-to-mountain exchange. Mountain Research and Development 20: 220–225.
- 53. Mercer KL, Perales HR (2010) Evolutionary response of landraces to climate change in centers of crop diversity. Evolutionary Applications 3: 480–493.
- 54. Cornelius JP, Clement CR, Weber JC, Sotelo-Montes C, van Leeuwen J, et al. (2006) The trade-off between genetic gain and conservation in a participatory improvement programme: the case of peach palm (Bactris gasipaes Kunth). Forests, Trees and Livelihoods 16: 17–34.
- 55. van Leeuwen J, Lleras Pérez E, Clement CR (2005) Field genebanks may impede instead of promote crop development: Lessons of failed genebanks of “promising” Brazilian palms. Agrociencia 9: 61–66.
- 56. Escudero A, Iriondo JM, Torres ME (2003) Spatial analysis of genetic diversity as a tool for plant conservation. Biological Conservation 113: 351–365.
- 57. Hijmans RJ, Spooner DM (2001) Geographic distribution of wild potato species. American Journal of Botany 88: 2101–2112.
- 58. Jarvis A, Ferguson ME, Williams DE, Guarino L, Jones PG, et al. (2003) Biogeography of wild Arachis: assessing conservation status and setting future priorities. Crop Science 43: 1100–1108.
- 59. Scheldeman X, Willemen L, Coppens D'eeckenbrugge G, Romeijn-Peeters E, Restrepo MT, et al. (2007) Distribution, diversity and environmental adaptation of highland papayas (Vasconcellea spp.) in tropical and subtropical America. Biodiversity and Conservation 16: 1867–1884.
- 60. Leberg PL (2002) Estimating allelic richness: Effects of sample size and bottlenecks. Molecular Ecology 11: 2445–2449.
- 61. Lowe A, Harris S, Ashton P (2004) Genetic diversity and differentiation. In: Lowe A, Harris S, Ashton P, editors. Ecological genetics: design, analysis, and application. pp. 50–105. Blackwell Publishing, UK. First edition.
- 62. Rajora OP, Mosseler A (2001) Challenges and opportunities for conservation of forest genetic resources. Euphytica 118: 197–212.
- 63. Lora J, Hormaza JI, Herrero M, Gasser CS (2011) Seedless fruits and the disruption of a conserved genetic pathway in angiosperm ovule development. Proceedings of the National Academy of Sciences USA 108: 5461–5465.
- 64. Van Damme V, Gómez-Paniagua H, de Vicente MC (2011) The GCP molecular marker toolkit, an instrument for use in breeding food security crops. Molecular Breeding 28: 597–610.
- 65. Myers N, Mittermeier RA, Mittermeier CG, da Fonseca GAB, Kent J (2000) Biodiversity hotspots for conservation priorities. Nature 403: 853–858.
- 66. Willemen L, Scheldeman X, Soto Cabellos V, Salazar SR, Guarino L (2007) Spatial patterns of diversity and genetic erosion of traditional cassava (Manihot esculenta Crantz) cultivation in the Peruvian Amazon: An evaluation of socio-economic and environmental indicators. Genetic Resources and Crop Evolution 54: 1599–1612.
- 67. Viruel MA, Hormaza JI (2004) Development, characterization and variability analysis of microsatellites in lychee (Litchi chinensis Sonn., Sapindaceae). Theoretical and Applied Genetics 108: 896–902.
- 68. Hijmans RJ, Cameron SE, Parra JL, Jones PG, Jarvis A (2005) Very high resolution interpolated climate surfaces for global land areas. International Journal of Climatology 25: 1965–1978.
- 69. Chapman AD (2005) Principles and methods of data cleaning – primary species and species-occurrence data, version 1.0. Report for the Global Biodiversity Information Facility, Copenhagen.
- 70. Kalinowski ST (2005) HP-RARE 1.0: a computer program for performing rarefaction on measures of allelic richness. Molecular Ecology Notes 5: 187–189.
- 71. Smouse PE, Peakall R (1999) Spatial autocorrelation analysis of individual multiallele and multilocus genetic structure. Heredity 82: 561–573.
- 72. Phillips SJ, Anderson RP, Schapire RE (2006) Maximum entropy modeling of species geographic distributions. Ecological Modelling 190: 231–259.
- 73. Elith J, Phillips SJ, Hastie T, Dudík M, Chee YE, et al. (2011) A statistical explanation of MaxEnt for ecologists. Diversity and Distributions 17: 43–57.
- 74. Elith J, Graham CH, Anderson RP, Dudík M, Ferrier S, et al. (2006) Novel methods improve prediction of species' distributions from occurrence data. Ecography 29: 129–151.
- 75. Hernandez PA, Graham CH, Master LL, Albert DL (2006) The effect of sample size and species characteristics on performance of different species distribution modeling methods. Ecography 29: 773–785.
- 76. Busby JR (1991) BIOCLIM a bioclimatic analysis and prediction system. In: Margules CR, Austin MP, editors. Nature Conservation: Cost Effective Biological Surveys and Data Analysis, CSIRO, Canberra. pp. 64–68.