Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Ultra-high-throughput DArTseq-based silicoDArT and SNP markers for genomic studies in macadamia

  • Mobashwer Alam ,

    Roles Conceptualization, Data curation, Formal analysis, Writing – original draft, Writing – review & editing

    m.alam@uq.edu.au

    Affiliation Centre for Horticultural Science, Queensland Alliance for Agriculture and Food Innovation, the University of Queensland, Nambour, Queensland, Australia

  • Jodi Neal,

    Roles Conceptualization, Writing – review & editing

    Affiliation Department of Agriculture and Forestry, Maroochy Research Facility, Nambour, Queensland, Australia

  • Katie O’Connor,

    Roles Writing – review & editing

    Affiliation Centre for Horticultural Science, Queensland Alliance for Agriculture and Food Innovation, the University of Queensland, Nambour, Queensland, Australia

  • Andrzej Kilian,

    Roles Methodology, Software, Writing – review & editing

    Affiliation Diversity Arrays Technology Pty Ltd, University of Canberra, Monana St., Canberra ACT, Australia

  • Bruce Topp

    Roles Conceptualization, Funding acquisition, Supervision, Writing – review & editing

    Affiliation Centre for Horticultural Science, Queensland Alliance for Agriculture and Food Innovation, the University of Queensland, Nambour, Queensland, Australia

Abstract

Macadamia (Macadamia integrifolia, M. tetraphylla and hybrids) is an Australian native nut crop and has a significant economic value in the food industries worldwide. Long juvenility along with traditional breeding strategies impede quick genetic improvement of this crop. The existing cultivars constitute only second to fourth generation of the wild germplasm in the rainforest. The utilisation of molecular markers for genomic selection and genome-wide association studies may accelerate genetic gains. Identification of a robust, reproducible, and cost-effective marker system is instrumental in increasing the efficiency of genomic studies. This study is the first to report the potential of two ultra-high-throughput diversity array technology (DArT) markers (silicoDArT and SNP) in macadamia. Both markers were used to identify the genetic diversity and population structure in 80 macadamia cultivars. Parentage analysis of 25 scions in a rootstock trial was conducted to confirm plant identity where recorded identities did not corroborate with phenotypic field observations. A total of 22,280 silicoDArT and 7,332 SNP markers were reported, of which 11,526 silicoDArT and 3,956 SNP markers were used for analyses after screening with quality control parameters including >95% call rate, >95% reproducibility, and >0.05 one ratio. The average polymorphic information content (PIC) values of silicoDArT and SNP markers were 0.29 and 0.21, respectively. Genetic variance among the cultivars ranged from 0.003 to 0.738 in silicoDArT and 0.004 to 0.412 in SNP markers. Four distinct population groups were identified from SNP data analysis. Most of the accessions used in this study were descended from two or more populations. Cluster analysis clearly separated genotypes of distinct origins, such as the Hawaii Agricultural Experiment Station and Hidden Valley Plantation accessions. Two wild accessions of Macadamia jansenii and M. ternifolia were found to be distantly related to the cultivars. Wild germplasm individuals and their hybrids with cv. ‘660’ formed separate clusters, suggesting that crossing between wild and cultivated genepools can extend genetic diversity. DArTseq-based SNP markers were successfully utilized to confirm the genetic identity of 25 scions in a rootstock trial. Our study suggests that DArT platforms are a robust system for the facilitation of genomic studies with regard to macadamia.

Introduction

Macadamia is a cross-pollinated tree and constitutes a nut crop industry with a high value kernel [1, 2]. Its unique flavours, multipurpose uses, and long shelf life have increased the demand for macadamia nuts worldwide. The genetic improvement of Australian macadamia cultivars is still at an early stage. Though the crop was first domesticated one and a half centuries ago [1, 3, 4], existing cultivars are only second to fourth generations from their wild progenitors [4, 5]. Until recently, worldwide macadamia improvement programs were mostly dependent on pedigree analysis and phenotypic characterization [69], subjected to the inaccuracy involved in the selection of elite accessions due to the effects of environment and genotype-environment interactions. To reduce inaccuracies in selection procedure and to accelerate breeding efficiency, genomic techniques may be used as a tool of macadamia breeding program [10].

Molecular markers have been utilized in several crop species to ascertain the genetic diversity in the gene pool [11], identify QTLs and candidate genes conferring valuable traits [12, 13], authenticate plant identity and the parentage of hybrids [14], generate data for gene expression profiling [15], and predict the genetic potentiality/performance of individual cultivars [16]. Over the last few decades, several marker technologies have been developed for macadamia, primarily for the study of genetic diversity. Some of the important markers used in macadamia are isozyme [17], randomly amplified DNA fingerprinting (RAF) [18, 19], random amplified polymorphic DNA (RAPD) [20, 21], amplified fragment length polymorphism (AFLP) [22, 23], randomly amplified microsatellite fingerprinting (RAMiFi) [24] and microsatellite [2527] markers. Limitations associated with these marker systems include low marker density, poor genome coverage, and less cost-effectiveness. In this perspective, sequence-based single nucleotide polymorphism (SNP) markers developed through automated sequencers can constitute an important choice for molecular studies due to their wide and uniform genome coverage with high-throughput and cost-effectiveness [28]. There is an increasing demand for the development of ultra-high-throughput low cost assays to facilitate the genotyping of individuals with the use of a large number of high-density markers that cover the entire genome. High-density array-based oligonucleotide markers such as single feature polymorphisms (SFPs), restriction site-associated DNA (RAD), and diversity array technology (DArT) provide a way to achieve this goal of development of cost-efficient ultra-high-throughput genotyping systems.

In 2001 [29], Diversity Array Technology Pty Ltd (DArT, Canberra, ACT, Australia) developed cost-effective sequence-independent ultra-high-throughput marker systems. DArT develops markers through a microarray hybridization method and can produce thousands of polymorphic loci in a single assay. This marker platform has been used for whole-genome scanning of a range of crop species [3036]. Over the last decade, DArT has generated two types of markers: i) silicoDArT and ii) SNP markers. SilicoDArT markers are microarray markers that are dominant and scored for the presence or absence of a single allele. DArTseq-based SNPs are co-dominant markers. Both types of markers have been successfully applied in several crop species for genetic diversity [3741], genetic mapping [36, 4244], and population structure [45, 46] studies. The present study is the first to utilize the DArT platforms in macadamia; we present the development of DArT marker platforms, and compare and analyse the usefulness of silicoDArT and SNP markers for genomic studies. DArT markers were applied to investigate the genetic diversity in cultivated and wild germplasm. DArT markers were also tested for utilization in plant identification.

Materials and methods

Plant materials

Macadamia cultivars of multiple origins including from Hawaii Agricultural Experiment Station (HAES), Hidden Valley Plantation (HVP), Israeli and Australian selections, accessions from wild germplasm, and progeny from the Australian industry macadamia breeding program (Australian elite selections, dwarves, breeding progeny and hybrids of wild germplasm) were employed for genetic diversity and population structure analysis (Table 1). Leaf samples from seven cultivars were collected from each of two different plants to use as biological replicates. All 21 cultivars were genotyped twice to use as technical replicates. Twenty-five scions from a rootstock trial at Bundaberg, Queensland, were also tested for plant identity to solve a mismatch with phenotypic characteristics observed in the field.

thumbnail
Table 1. List of macadamia cultivars used in genetic diversity study.

HAES Hawaiian Agricultural Experiment Station; HVP Hidden Valley Plantation.

https://doi.org/10.1371/journal.pone.0203465.t001

DNA extraction

Samples obtained from newly flushed young leaves were collected from arboretums and progeny trials of the Australian industry macadamia breeding program [47] in 2013 and 2014 growing seasons. Leaf samples were sealed in a zip-lock bag labelled with the corresponding tree barcode, kept cool in insulated bags with freezer blocks and sent to DArT (next day) for DNA extraction. Total genomic DNA was extracted by adhering to the modified CTAB protocol [48, 49] as described by Kilian et al., (2012) [50]. The quality and quantity of the DNA samples were evaluated through a spectrophotometric analysis in DS-11FX series spectrophotometer/fluorometer (Denovix, Wilmington, DE, USA), followed by running agarose gel electrophoresis (1.2% agarose). The DNA concentration was adjusted within the range of 50–100 ηg μl–1.

Genotyping macadamia accessions using silicoDArT and SNP markers

The high-throughput DArTseq technology was used to genotype macadamia cultivars. In this technology, the PstI-based complexity reduction method [39] was applied for the enrichment of genomic representation with single copy sequences. This method involved the digestion of DNA samples with a rare cutting enzyme PstI, paired with a set of secondary frequently cutting restriction endonucleases (RE), ligation with site-specific adapters, and amplification of adapter-ligated fragments. The secondary frequently cutting RE enzymes employed in this study were AluI, ApoI, BanII, Bsp1286I, BstNI, HaeIII, MseI, RsaI, MspI, HpaII, MseI, TaqI, and HhaI. Post digestion with a PstI-RE pair, a PstI overhang compatible oligocleotide adapter (5′-CAC GAT GGA TCC AGT GCA-3′ annealed with 5′-CTG GAT CCA TCG TGC A-3′) was ligated, and the adapter-ligated fragments were amplified in adherence to the prescribed standard procedures [39]. To develop SNP and silicoDArT markers, the DArTseq technology was optimized using two PstI-compatible adapters corresponding to two different RE overhangs. The genomic representations were generated following the procedures described by Kilian et al. [50] and PstI+HhaI was selected as the most appropriate complexity reduction method. Next-generation sequencing technology was implemented using HiSeq2000 (Illumina, USA) to detect SNPs and silicoDArT markers. The sequence data was analyzed using DarTsoft14, an automated genotypic data analysis program and DArTdb, a laboratory management system. Markers were scored ‘1’ for presence, and ‘0’ for absence and ‘-’ for failure to score. Two technical replicates of the DNA samples of each of 21 cultivars were genotyped to calculate the reproducibility of the marker data.

Quality analysis of marker data

The markers were tested for reproducibility (%), call rate (%), polymorphism information content (PIC) and one ratio. Scoring of reproducibility involved the proportion of technical replicate assay pairs for which the marker score exhibited consistency. The call rate determined the success of reading the marker sequence across the samples and was estimated from the percentage of samples for which the score was either ‘0’ or ‘1’. PIC is the degree of diversity of the marker in the population and showed the usefulness of the marker for linkage analysis. One ratio constitutes the proportion of the samples for which genotype scores equalled ‘1’.

Genetic dissimilarity analysis

Genetic dissimilarity matrices were constructed in DARwin v. 6.0.13 [51] to identify the genetic relationships among the cultivars. Weighted neighbour-joining dendrograms were constructed in both marker platforms. Clade strength in the dendrograms was tested by 20,000 bootstrap analyses. Biological and technical replicates of 28 accessions were compared to identify the reliability of the markers using the dissimilarity index between the replicates of each accession.

Population structure and genetic diversity analysis

The genetic structure of the germplasm was analyzed using STRUCTURE v.2.3.4 [52] and GenAlEx v.6.5 [53]. The number of hypothetical subpopulations (K) was estimated with the STRUCTURE software through the application of a Bayesian clustering approach for the organisation of genetically similar cultivars into the same subgroups. Ten individual Markov Chain Monte Carlo (MCMC) simulations were conducted for each K-value from 1 to 10 with a burn-in length of 50,000, followed by 100,000 iterations. The admixture model was applied without using any prior population information. The log-likelihood of the observed data for each K-value was calculated and compared across the range of K values. The best K-value was estimated based on the membership coefficient (Q) for each individual in each cluster. The Q values indicate the level of relatedness of each accession to various subgroups. The STRUCTURE results were subsequently analyzed by the STRUCTURE HARVESTER application [54] (http://taylor0.biology.ucla.edu/structureHarvester/) to identify a distinct peak in the change of likelihood (ΔK) at the true value of K. GenAlEx was used to perform Principal Coordinates Analysis (PCoA), based on the standardized covariance of genetic distances calculated for the markers under evaluation, using 999 permutations. PCoA explains the genetic distances among the accessions.

Determining plant identity

In a rootstock trial at Bundaberg we identified that the phenotypic characteristics of 25 scions did not corroborate with their recorded variety. Using SNP markers we conducted PCoA and parentage analysis of all the mismatched scions and the results were compared with phenotypic variety allocations.

To determine plant identity, initially allelic variations were observed between the possible varieties and the mismatched cultivars using PCoA in GenAlEx. Later, the analysis of plant identity was performed with Cervus v.3.0.7 [55] to detect and confirm the candidate plant ID. In cases where the plant ID remained indeterminate, parentage analysis was conducted, allowing 40 simulated parents and 10,000 simulated offspring. The natural logarithm of the likelihood ratio, logarithm of odds (LOD), scores were provided for each candidate parent and the offspring-parents trio along with the confidence of these scores at strict (95%) and relaxed (80%) levels. Based on the analysis, the parent with the highest LOD is denoted as the putative parent. A positive LOD score indicates that the parent is more likely to be the true parent in comparison to one drawn at random from the population; negative LOD scores indicate that the parent is less likely to be the parent than one drawn at random from the population [55].

Results

Marker quality analysis

Through the application of the complexity reduction method, a total of 22,280 polymorphic silicoDArT (S1 Table) markers were generated, of which 353 were aligned with the marker sequence obtained from bacteria (NCBI) and 154 from expressed sequenced tags (EST) of several plant species. We report these DArT markers for the first time, though the chromosomal location is yet to be described. Most of the markers (22,237) showed ≥95% reproducibility. All the identified silicoDArT markers had a call rate value ≥75% (Fig 1) with an average value of 97.44 (S1 Table). However, low frequency markers can affect the statistical analysis [56]. As such, 10,711 markers with extremely low one ratio (<0.05) were not considered in the analysis. In total, 11,526 silicoDArT markers cleared all the quality parameters and were selected for the study. Among the 11,526 informative markers, around 21% were observed in PIC class 0.45 to 0.50 and 13% in 0.05 to 0.10 class (Fig 2). PIC values of the remaining markers were distributed almost equally (8–10%) across the rest of the marker groups. Therefore, the median (0.28) was located extremely close to the average PIC value of 0.29 (S3 Table), and the data exhibited approximately equitable distribution on either side of the median.

thumbnail
Fig 1. Distribution of silicoDArT and SNP marker data for several quality parameters.

https://doi.org/10.1371/journal.pone.0203465.g001

thumbnail
Fig 2. Distribution of PIC values of silicoDArT and SNP markers used for genomic studies in macadamia.

https://doi.org/10.1371/journal.pone.0203465.g002

A total of 7,332 SNP markers (S2 Table) were identified, and had an average of 99% reproducibility and 76% call rate. Around 98% SNP markers had ≥95% reproducibility, of which 6,375 were found to be 100% reproducible (Fig 1). The call rate exhibited variance ranging from 38% to 100%. Around 44% of SNP markers displayed a <75% call rate (Fig 1), and were therefore not considered for this study. For the remaining markers, 2,650 showed a >90% call rate. All the identified markers had >0.5 average one ratio. Considering all the quality parameters, 3,956 SNP markers were used for subsequent analysis. These markers were determined to be highly informative with an average PIC value of 0.21, and 0.19 median (S3 Table). Around 14% of markers were in the lowest PIC value range (0–0.05) and 9% in the highest PIC value range (0.45 to 0.50) (Fig 2). The remaining PIC value groups exhibited an approximately similar marker frequency value ranging from 10 to 12% each.

Genetic relationships among cultivars

The genetic dissimilarities among the cultivars estimated through the silicoDArT markers ranged from 0.003 to 0.738 (S4 Table). The technical and biological replicates of the cultivars revealed the least amount of genetic dissimilarity (Fig 3), ranging from 0.004 to 0.02. Among the cultivars, ‘660’ displayed exceptionally close genetic similarity with ‘741’ (dissimilarity indexes: 0.004 to 0.008). This value was similar to that of the biological replicates of ‘741’. The dissimilarity indices of M. jansenii and M. ternifolia with the existing cultivars ranged from 0.590 to 0.738, suggesting that the two wild accessions are distantly related to the cultivated gene pool. The Australian industry breeding lines (TF-11-1, TF-23-4, TF-23-9, TF-34-11 and TF-37-1) developed through the hybridization of wild germplasm with commercial cultivars further exhibited wide genetic distance from Australian macadamia cultivars.

thumbnail
Fig 3. Genetic relationships among 80 macadamia accessions, and 7 biological and 21 technical replicates.

(a) Weighted neighbour-joining dendrogram based on silicoDArT markers. (b) Weighted neighbour-joining dendrogram based on SNP markers.

https://doi.org/10.1371/journal.pone.0203465.g003

The dendrogram obtained with silicoDArT markers produced several small clusters of related cultivars, and most of them contained cultivars that shared parental lines or a similar origin (Fig 3A). We identified nine major groups of clusters. Most of the HVP cultivars clustered in C1, while HAES cultivars were grouped in four other separate clusters. Cultivars ‘816’, ‘804’, ‘814’, ‘781’, ‘849’, ‘842’, and ‘333’ were grouped in C3. In C9, HAES cultivar ‘762’ clustered with ‘NG8’. Other three HAES cultivars (‘741’, ‘660’ and ‘344’) and an Australian selection, ‘NG29’, were grouped in cluster C7. HAES cultivar ‘791’ constituted the sole representative of C8. Wild M. jansenii and M. ternifolia and their hybrids (‘TF-23-4, ‘TF-15-1’, ‘TF-37-1’, ‘TF-34-11’, ‘TF-23-9’, ‘TF-47-16,’ and ‘TF-48-8’) formed a separate cluster (C6). Australian elites and selections were distributed across the clusters, which illustrates the wide range of diversity created by the Australian breeding program. The Australian selection, ‘D4’, grouped with HVP cultivars in C1. ‘Beaumont’, ‘NG18’, and ‘Own Venture’ grouped together in C5. Cultivars ‘H2’, and ‘Daddow’ formed two separate clusters, C10 and C4, respectively. We further identified that four dwarf cultivars (‘DW1’, ‘DW2’, ‘DW3’, and ‘DW4’) clustered in the same group (C9) as their common parent ‘NG8’.

SNP markers were also useful for the identification of genetic relationships among Australian cultivars. The range of genetic dissimilarities identified through SNP markers was narrower than that observed through silicoDArT markers. Among the 80 cultivars and 28 replicates, dissimilarity ranged from 0.004 to 0.412 (S5 Table). The genetic dissimilarity index between ‘741’ and ‘660’ (0.013) fell within the range of dissimilarities between technical and biological replicates (0.004 to 0.03).

Similar to silicoDArT markers, SNP markers also developed several clusters of macadamia cultivars based on their relatedness (Fig 3B). Cultivars with common origin or those that shared common parents were clustered together. HAES and HVP cultivars formed distinctly separate clusters. HAES cultivars were grouped in C4, C5, C8 and C9; whereas HVP cultivars were clustered in C1 and C3. As observed with silicoDArT markers, the wild accessions of M. ternifolia and M. jansenii were found to be distantly related with other cultivars, and grouped in C7 with their respective hybrids (Fig 3B). We also observed three separate clusters of ‘H2’ (C10), ‘791’ (C8), and ‘Daddow’ (C6) in the SNP analysis. Same as in silicoDArTs, four dwarf cultivars (DW1, DW2, DW3 and DW4) were clustered together (in C9) with their common parent ‘NG8’.

Population structure and genetic diversity analysis

The model-based Bayesian cluster analysis in STRUCTURE visualized the genetic structure of the population under examination (Fig 4). The values of ΔK, which were estimated from SNP markers peaked at K = 4 (Fig 4A); hence, four distinct groups were found to contribute significant genetic information across cultivars. The sub-populations were denoted as POP1, POP2, POP3 and POP4, and each sub-population contained 41%, 7%, 23%, and 29% of total accessions, respectively (Table 2).

thumbnail
Fig 4. Population structure of 80 macadamia accessions using SNP marker data, as estimated using the model-based Bayesian algorithm implemented in the STRUCTURE program.

a) estimation of number of groups (K), b) proportion of assignment of individuals to four population groups.

https://doi.org/10.1371/journal.pone.0203465.g004

thumbnail
Table 2. Genetic divergence among (net nucleotide distance) and within (expected heterozygosity) populations, and the proportion of membership of the population samples.

https://doi.org/10.1371/journal.pone.0203465.t002

The genetic diversity within each population was explained through the estimation of the expected heterozygosity, which varied from 0.12 (POP3) to 0.22 (POP4). The expected heterozygosity of POP1 was 0.19 and that of POP2 was 0.14. The genetic divergence among the populations revealed by Nei’s net nucleotide distance (D) indicated that POP2 was widely related to POP1 (D = 0.19), POP3 (D = 0.18), and POP4 (D = 0.18), respectively. The genetic distance observed between POP1 and POP4 (D = 0.05) was the least among the pairs of populations examined (Table 2).

The proportion of membership of individual cultivars in each population is illustrated in the bar plot of the population assignment test in structure analysis (Fig 4B). The estimated proportion of membership (Q) suggested that 12 Australian elite cultivars (‘QB-14-93G’, ‘QB-6-71’, ‘QB-6-79I’, ‘QB-6-16S’, ‘QB-10-111Q’, ‘QB-2-46E’, ‘QB-14-25P’, ‘QB-11-80C’, ‘QB-10-93J’, ‘QB-13-115R’, ‘QB-6-73N’ and ‘QB-7-74O’) were assigned entirely in POP1. M. jansenii and M. ternifolia comprised POP2. Two Hawaiian cultivars ‘741’ and ‘660’ were included in POP3. Australian selections ‘NG8’ and ‘Beaumont’ represented POP4 along with the dwarf progeny ‘DW2’, and Australian elites ‘QB-7-5’ and ‘QB-5-81’. The remaining 60 cultivars showed intermediate and/or highly mixed genetic composition and were hence determined as heterogeneous. The hybrid progeny of ‘660’ X M. jansenii (‘TF-11-1’, ‘TF-23-9’, and ‘TF-34-11’), and ‘660’ X M. ternifoila (‘TF-23-4’ and ‘TF-37-1’) were assigned in both POP2 and POP3. Most of the HAES cultivars (‘333’, 344’, ‘781’, ‘804’, ‘814’, ‘816,’ and ‘849’) comprised admixtures of POP1 and POP3. Conversely, all HVP accessions were constituted partly of POP4. Two HAES cultivars ‘791’ and ‘762’ also shared large amounts of genetic information with POP4. Cultivar ‘791’ was the only individual allocated to all four populations, and possessed 33% genetic information from POP1, 13% from POP2, 9% from POP3, and 46% from POP4.

PCoA illustrated the genetic divergence among the cultivars (Fig 5). In silicoDArT and SNP markers, the first two axes of the PCoA explained 25.64% and 37.61% of the total genetic divergence, respectively. The population distribution determined by both markers is consistent with the output of population structure analysis (Fig 4B). HAES cultivars were located in the top two quadrants, while HVP and Australian selections were mainly located in the bottom quadrants. Australian breeding lines displayed wide diversity, as they were mostly distributed throughout the three quadrants of the PCoA. Only a few accessions were in the bottom right quadrant of PCoA, which was predominantly occupied by wild germplasm.

thumbnail
Fig 5. Principal coordinate analysis (PCoA) to explain the genetic diversity across macadamia accessions.

a) PCoA based on silicoDArT markers, and b) PCoA based on SNP markers.

https://doi.org/10.1371/journal.pone.0203465.g005

Plant identification using SNP markers

PCoA of the SNP markers derived from possible cultivars and mismatched scions illustrated allelic variations among the accessions (Fig 6). Parentage analysis identified that seven scions in the rootstock trial were planted with wrong plant ID and the remaining 18 scions were the seedling progeny of rootstocks ‘Beaumont’ and ‘H2’ (S7 Table).

thumbnail
Fig 6. Principal coordinate analysis of SNP markers showing allelic variation among macadamia cultivars and mismatched scions.

https://doi.org/10.1371/journal.pone.0203465.g006

The first two components of the PCoA explained 43.08% of total variation. Pair wise comparison showed that least allelic variations were observed within the following mismatched scion and cultivar pairs: ‘8_1’ and ‘A268’, ‘268_2’ and ‘344’, ‘268_3’ and ‘842’, ‘814_1’ and ‘Beaumont’, ‘816_1’ and ‘814’, ‘842_1’ and ‘Beaumont’, and ‘842_12’ and ‘816’ (Fig 6 and S6 Table). The parentage analysis (S7 Table) confirmed these seven plant identities and suggested that scions labelled with ‘8_1’, ‘268_2’, ‘814_1’, ‘816_1’ and ‘842_1’ showed a completely congruous match with phenotypic plant IDs (‘A268’, ‘344’, ‘Beaumont’, ‘814’, and ‘Beaumont’, respectively). Plant IDs of ‘268_3’ and ‘842_12’ did not match with phenotypic IDs. Molecular analysis confirmed scion ‘842_12’ as ‘816’, whereas it was phenotypically identified as ‘H2’. Similarly, ‘268_3’ was suggested as ‘814’ in the field, while it was identified as ‘842’ from SNP marker analysis. The candidate mother ID, pair loci mismatching data, and LOD scores suggested that the remaining ‘842’ marked plants have common candidate mother ‘H2’. Trees ‘A16_2’ and ‘268_1’ were indicated to be the progeny of ‘Beaumont’ and ‘H2’, respectively.

Discussion

DArT platforms provide cost-effective ultra-high-throughput reliable markers

Our study highlights the suitability of DArT platforms that can be applied for the genomic dissection of macadamia cultivars. A total of 22,237 silicoDArT markers were developed, of which 11,526 markers provided robust information of the macadamia genome in the absence of sequence information. On the other hand, DArTseq SNPs provided 3,956 informative markers. The development of a large number of these high-throughput markers provided an opportunity to anchor the markers on a recently released macadamia reference genome, which is yet to be completed [57].

The quality parameter of both silicoDArT and SNP markers in macadamia were comparable with that of other species. The average PIC values of both markers in macadamia was similar to values identified in DArT markers of sugar beet (0.28) [58] and Asplenium fern (0.20) [59], but lower than that of sorghum (0.41) [36], cassava (0.42) [60], and wheat (0.44) [61], and higher than SNP markers in watermelon (0.13) [62] and Lesquerella (0.12) [63]. The average PIC values of silicoDArT was greater than that of SNP markers. Around 30% of silicoDArT and 20% of SNP markers showed PIC values within the range of 0.40 to 0.50. The silicoDArT markers were therefore more informative than SNP markers. The abundance of silicoDArT and SNP markers may achieve better genome coverage through the sampling of a greater number of points in the whole genome, as marker density has a high correlation with gene density [50, 64]. Previous studies in macadamia using other platforms utilized only a small number of molecular markers, for example, O’Connor et al. (2015) used only 11 microsatellite markers [25]. Therefore, both silicoDArT and SNP markers may better suit for genetic diversity studies, association/linkage mapping and/or sequence based physical mapping in macadamia. Additionally, the co-dominant inheritance pattern of SNP markers may increase the utility of DArT platforms for genetic identity and parentage analysis.

In comparison with the other existing marker technologies like microsatellite markers, DArT markers are pertinent to high-throughput work and have merits in terms of cost effectiveness and time aspect [65]. The cost of marker development of silicoDArT and SNP markers are the same. Due to the higher number of markers produced in silicoDArT, the average cost per data point is less than SNP markers. However, the effectiveness of both platforms varies depending on the type of application. For genetic diversity and linkage mapping large number of silicoDArT markers are suitable. However, for genetic identity and product quality testing, both markers can perform equally. Due to the opportunity to track alleles from parental genotypes, the co-dominant SNP markers are more suitable in plant identity and parentage analysis than silicoDArT.

DArT markers successfully evidenced the historical background and pedigree relationships of the cultivars

The genetic diversity analysis illustrated the historical development of macadamia cultivars. Macadamia cultivars have been developed by few breeding programs. The macadamia breeding program in Hawaii began in approximately 1948 and continued to 1990, producing many HAES cultivars which are still dominant in plantings around the world [66]. Australian varieties including ‘Beaumont’, ‘D4’, and ‘Daddow’ are the product of 1952-onwards seedling surveys in Australia, after observation of the Hawaiian industry success. Hidden Valley Plantation (HVP), which commenced in 1972 in Beerwah, Queensland, Australia released successful ‘A’ series cultivars from open-pollinated progeny of HAES and Australian varieties. Genetic diversity analysis clearly separated most of the accessions of these three groups. Accessions of Australian industry breeding efforts were selected from the crossing program initiated in 1992 and included HAES, HVP and Australian cultivars in the parentage, hence are scattered across all clusters.

The statistical analysis of DArT data sets showed highly consistent results obtained for genetic diversity, population structure and PCoA. The pedigree relationships obtained through genetic diversity and population structure analysis are in line with the results of previous studies [17, 18, 23]. Two HAES cultivars ‘660’ and ‘741’, which were selected from same orchard at Glaisyer (at Hawaii) showed strong genetic similarity, as also observed by Peace et al. [18] and Steiger et al. [23]. Cultivar ‘344’ was grouped with ‘741’ and ‘660’, which is similar to previous observations made with RAF [18], isozyme [17], RAMiFi [24], and AFLP [23] markers. Population structure analysis revealed that cultivar ‘791’ constituted three or four distinct populations, which was supported by the observation of Peace et al. [18]. It was identified that ‘791’ is a tri-species cultivar containing M. integrifolia, M. tetraphylla and M. ternifolia in its ancestry. Our results additionally showed that ‘791’ may also have M. jansenii in its genetic background. We identified that ‘791’ shares almost equal amounts of M. integrifolia and M. tetraphylla. In contrast, Peace et al. [18] previously reported 10% M. tetraphylla, 55% M. integrifolia and 35% M. ternifolia in the ancestry of ‘791’. Interestingly, population structure analysis identified that two wild species M. jansenii and M. ternifolia represented the same population (Fig 4A). Although no historical relationships among these two wild species were evidenced, there are some clear morphological similarities [1]. For example, both M. jansenii and M. ternifolia are relatively dwarf accessions, produce pink red leaf flushes and flowers, and have bitter nuts. Previous isozyme analysis showed that M. jansenii and M. ternifolia are closely related [67], as observed in RAF [68] and STMS (sequence-tagged microsatellite site) studies [3], whilst Waldron et al. [68] and Peace et al. [3] suggested that M. ternifolia and M. jansenii were of sister species. An investigation on large numbers of wild accessions would be instrumental to explore differentiation among the four macadamia species. Identification of markers showing the signature of domestication could be the focus of future studies.

Though Australian elite hybrids were distributed across the four clusters, the ancestral relationship determined by both silicoDArT and SNP markers was consistent. Dwarf cultivars ‘DW1’, ‘DW2’, ‘DW3’, and ‘DW4’, and selections ‘QB-35-10’, ‘QB-5-81’, and ‘QB-7-11’ were grouped with their common parent ‘NG8’. The cultivar ‘762’ was also a common parent of ‘DW1’, DW2’, ‘QB-7-11’, and ‘QB-5-81’, and hence sub-grouped. Similarly, other elites ‘TF-9-22B’, ‘TF-44-15D’, ‘QB-9-72K’, and ‘QB-7-109L’ clustered with common parent ‘Daddow’. Consistency in determining pedigree relationships suggested that both DArT platforms are highly reliable for genetic diversity study in macadamia.

DArTseq based SNP markers are useful tools for plant identity confirmation

The use of molecular markers for plant identity and parentage analysis has increased over the last two decades [69]. Although microsatellite markers were used previously in parentage analysis [25], this study was the first to use DArT markers in parentage analysis in macadamia. Using SNP markers, our study successfully predicted the plant identity and probable mother identities of plants in a rootstock trial. Most of the predictions validated field observations (S7 Table). DArTseq-based SNP markers are therefore useful to confirm plant identity through parentage analysis of macadamia progeny.

Conclusions

Our study identified that both DArT platforms are of high quality, supported by the quality parameters and close relationships of the replicates of each cultivar in the neighbourhood join dendrograms. Both DArT markers successfully reflected the parental relationships and the extent of diversity in the population. Structure analysis clearly separated the population of various origins. DArTseq-based SNPs successfully demonstrated population structure across cultivars and confirmed plant identity in a mismatched rootstock trial. We therefore suggest that both silicoDArT and SNP markers are robust and an inexpensive option for breeders for genomics studies in macadamia. Both DArT platforms developed a large number of highly polymorphic markers, and hence are useful for linkage mapping in macadamia.

Supporting information

S3 Table. Summary statistics of informative markers of both markers of DArT platform.

https://doi.org/10.1371/journal.pone.0203465.s003

(XLSX)

S4 Table. Dissimilarity index among macadamia cultivars estimated by weighted neighbour-join analysis of silicoDArT markers in DARwin software.

https://doi.org/10.1371/journal.pone.0203465.s004

(XLSX)

S5 Table. Dissimilarity index among macadamia cultivars estimated by weighted neighbour-join analysis of DArTseq-based SNP markers in DARwin software.

https://doi.org/10.1371/journal.pone.0203465.s005

(XLSX)

S6 Table. Linear genetic distances across the cultivars and mismatched scions estimated from SNP markers using GenAlEx software.

https://doi.org/10.1371/journal.pone.0203465.s006

(XLSX)

S7 Table. Dissimilarity results of plant identity and maternity analysis of phenotypically mismatched scions in a rootstock trial.

https://doi.org/10.1371/journal.pone.0203465.s007

(XLSX)

Acknowledgments

This research has been funded by Hort Innovation, Australia, using the Macadamia research and development levy and contributions from the Australian Government. Hort Innovation is the grower owned, not-for-profit research and development corporation for Australian horticulture.

References

  1. 1. Hardner CM, Peace C, Lowe AJ, Neal J, Pisanu P, Powell M, et al. Genetic Resources and Domestication of Macadamia. Horticultural Reviews. 2009;35:1–126.
  2. 2. Stephenson R. Macadamia: Domestication and commercialization. Chronica horticulture. 2005;45(2):11–5.
  3. 3. Peace C, Hardner C, Brown AHD, O'Connor K, Vithanage V, Turnbull C, et al. The diversity and origins of macadamia cultivars. WANATCA Yearbook. 2002;26:19–25.
  4. 4. Peace C, Ming R, Schmidt A, Manners J, Vithanage V. Genomics of macadamia, a recently domesticated tree nut crop. Moore PH, Ming R, editors. Heidelberg, Germany: Springer-Verlag GmbH; 2008. pp. 313–32.
  5. 5. Peace CP. Genetic characterisation of macadamia with DNA markers: The University of Queensland; 2005.
  6. 6. Shigeura GT, Ooka H. Macadamia nuts in Hawaii: history and production. Hawaii Institute of Tropical Agriculture and Human Resources, College of Tropical Agriculture and Human Resources, University of Hawaii at Manoa; 1984.
  7. 7. Gitonga L, Muigai A, Kahangi E, Ngamau K, Gichuki S. Status of macadamia production in Kenya and the potential of biotechnology in enhancing its genetic improvement. Journal of plant breeding and crop science. 2009;1(3):049–59.
  8. 8. Hardner C, Peace C, Henshall J, Manners J. Opportunities and constraints for marker-assisted selection in macadamia breeding. Acta horticulturae; 2005;694:85–90
  9. 9. McHargue LT. Macadamia production in southern California. Progress in New Crops. In: Janick J., editor. Progress in new crops. Arlington: ASHS Press; 1996. pp. 458–62.
  10. 10. O’Connor K, Hayes B, Topp B. Prospects for increasing yield in macadamia using component traits and genomics. Tree Genetics & Genomes. 2018;14(1):7.
  11. 11. Melchinger A. Genetic diversity and heterosis. In: James G. Coors SP, editors. The genetics and exploitation of heterosis in crops: ASA-CSSA; 1999. pp. 99–118.
  12. 12. Mohan M, Nair S, Bhagwat A, Krishna T, Yano M, Bhatia C, et al. Genome mapping, molecular markers and marker-assisted selection in crop plants. Molecular breeding. 1997;3(2):87–103.
  13. 13. Zeng Z- B. Precision mapping of quantitative trait loci. Genetics. 1994;136(4):1457–68. pmid:8013918
  14. 14. Agarwal M, Shrivastava N, Padh H. Advances in molecular marker techniques and their applications in plant sciences. Plant cell reports. 2008;27(4):617–31. pmid:18246355
  15. 15. Jansen RC, Nap J-P. Genetical genomics: the added value from segregation. Trends in Genetics. 2001;17(7):388–91. pmid:11418218
  16. 16. Heffner EL, Sorrells ME, Jannink J-L. Genomic selection for crop improvement. Crop Science. 2009;49(1):1–12.
  17. 17. Aradhya MK, Yee LK, Zee FT, Manshardt RM. Genetic variability in Macadamia. Genetic Resources and Crop Evolution. 1998;45(1):19–32.
  18. 18. Peace C, Allan P, Vithanage V, Turnbull C, Carroll B. Genetic relationships amongst macadamia varieties grown in South Africa as assessed by RAF markers. South African Journal of Plant and Soil. 2005;22(2):71–5.
  19. 19. Peace CP, Vithanage V, Turnbull CG, Carroll BJ. A genetic map of macadamia based on randomly amplified DNA fingerprinting (RAF) markers. Euphytica. 2003;134(1):17–26.
  20. 20. Machado Neto NB, Moryia AT. Variability in Macadamia integrifolia by RAPD markers. Crop Breeding and Applied Biotechnology. 2010;10(3):266–70.
  21. 21. Gan L-p Wang S-r. Application of RAPD in the studies on genetic diversity and physiological specialization of plant pathogenic fungi [J]. Journal of Gansu Agricultural University. 2004;1:019.
  22. 22. Gitonga L, Wanjala B, Muigai A, Ngamau K, Kahangi E, Gichuki S, et al. Genetic Characterization of Selected Macadamia Germplasm and the Implications on Breeding and Conservation in Kenya. Acta horticulturae. 2011; 918(2):877–84.
  23. 23. Steiger DL, Moore PH, Zee F, Liu Z, Ming R. Genetic relationships of macadamia cultivars and species revealed by AFLP markers. Euphytica. 2003;132(3):269–77.
  24. 24. Peace C, Vithanage V, Neal J, Turnbull C, Carroll B. A comparison of molecular markers for genetic analysis of macadamia. The Journal of Horticultural Science and Biotechnology. 2004;79(6):965–70.
  25. 25. O'Connor K, Powell M, Nock C, Shapcott A. Crop to wild gene flow and genetic diversity in a vulnerable Macadamia (Proteaceae) species in New South Wales, Australia. Biological Conservation. 2015;191:504–11.
  26. 26. Nock CJ, Elphinstone MS, Ablett G, Kawamata A, Hancock W, Hardner CM, et al. Whole genome shotgun sequences for microsatellite discovery and application in cultivated and wild Macadamia (Proteaceae). Appl Plant Sci. 2014;2(4):1300089.
  27. 27. Vithanage V, Hardner C, Anderson K, Meyers N, McConchie C, Peace C, editors. Progress made with molecular markers for genetic improvement of macadamia. International Symposium on Biotechnology of Tropical and Subtropical Species Part 2 461; 1997.
  28. 28. Gupta P, Rustgi S, Mir R. Array-based high-throughput DNA markers for crop improvement. Heredity. 2008;101(1):5–18. pmid:18461083
  29. 29. Jaccoud D, Peng KM, Feinstein D, Kilian A. Diversity Arrays: a solid state technology for sequence information independent genotyping. Nucleic Acids Research. 2001;29(4):e25. pmid:11160945
  30. 30. Sanchez-Sevilla JF, Horvath A, Botella MA, Gaston A, Folta K, Kilian A, et al. Diversity Arrays Technology (DArT) Marker Platforms for Diversity Analysis and Linkage Mapping in a Complex Crop, the Octoploid Cultivated Strawberry (Fragaria x ananassa). Plos One. 2015;10(12):22.
  31. 31. Sanewski G, Ko L, Innes D, Kilian A, Carling J, Song J. DArTseq molecular markers for resistance to Phytophthora cinnamomi in pineapple (Ananas comosus L.). Australasian Plant Pathology. 2017;46(5):499–509.
  32. 32. Vishwakarma MK, Nayak SN, Guo B, Wan L, Liao B, Varshney RK, et al. Classical and Molecular Approaches for Mapping of Genes and Quantitative Trait Loci in Peanut. In: Varshney RK PM, Puppala N, editors. The peanut genome: Springer; 2017. pp. 93–116.
  33. 33. Grzebelus D, Iorizzo M, Senalik D, Ellison S, Cavagnaro P, Macko-Podgorni A, et al. Diversity, genetic mapping, and signatures of domestication in the carrot (Daucus carota L.) genome, as revealed by Diversity Arrays Technology (DArT) markers. Molecular breeding. 2014;33(3):625–37.
  34. 34. Sansaloni CP, Petroli CD, Carling J, Hudson CJ, Steane DA, Myburg AA, et al. A high-density Diversity Arrays Technology (DArT) microarray for genome-wide genotyping in Eucalyptus. Plant Methods. 2010;6(1):16.
  35. 35. Hudson CJ, Freeman JS, Kullan AR, Petroli CD, Sansaloni CP, Kilian A, et al. A reference linkage map for Eucalyptus. BMC genomics. 2012;13(1):240.
  36. 36. Mace ES, Xia L, Jordan DR, Halloran K, Parh DK, Huttner E, et al. DArT markers: diversity analyses and mapping in Sorghum bicolor. Bmc Genomics. 2008;9(1):1.
  37. 37. Sánchez-Sevilla JF, Horvath A, Botella MA, Gaston A, Folta K, Kilian A, et al. Diversity Arrays Technology (DArT) marker platforms for diversity analysis and linkage mapping in a complex crop, the octoploid cultivated strawberry (Fragaria× ananassa). PLoS One. 2015;10(12):e0144960. pmid:26675207
  38. 38. Tang J, Daroch M, Kilian A, Jeżowski S, Pogrzeba M, Mos M. DArT-based characterisation of genetic diversity in a Miscanthus collection from Poland. Planta. 2015;242(4):985–96. pmid:26040407
  39. 39. Wenzl P, Carling J, Kudrna D, Jaccoud D, Huttner E, Kleinhofs A, et al. Diversity Arrays Technology (DArT) for whole-genome profiling of barley. Proceedings of the National Academy of Sciences of the United States of America. 2004;101(26):9915–20. pmid:15192146
  40. 40. Bolibok-Brągoszewska H, Heller-Uszyńska K, Wenzl P, Uszyński G, Kilian A, Rakoczy-Trojanowska M. DArT markers for the rye genome-genetic diversity and mapping. Bmc Genomics. 2009;10(1):1.
  41. 41. Yang S, Pang W, Ash G, Harper J, Carling J, Wenzl P, et al. Low level of genetic diversity in cultivated pigeonpea compared to its wild relatives is revealed by diversity arrays technology. Theoretical and applied genetics. 2006;113(4):585–95. pmid:16845522
  42. 42. Alam M, Mace E, Van Oosterom E, Cruickshank A, Hunt C, Hammer G, et al. QTL analysis in multiple sorghum populations facilitates the dissection of the genetic and physiological control of tillering. Theoretical and Applied Genetics. 2014;127(10):2253–66. pmid:25163934
  43. 43. Ambawat S, Senthilvel S, Hash CT, Nepolean T, Rajaram V, Eshwar K, et al. QTL mapping of pearl millet rust resistance using an integrated DArT-and SSR-based linkage map. Euphytica. 2016;209(2):461–76.
  44. 44. Schouten HJ, van de Weg WE, Carling J, Khan SA, McKay SJ, van Kaauwen MP, et al. Diversity arrays technology (DArT) markers in apple for genetic linkage maps. Molecular Breeding. 2012;29(3):645–60. pmid:22408382
  45. 45. Laidò G, Mangini G, Taranto F, Gadaleta A, Blanco A, Cattivelli L, et al. Genetic diversity and population structure of tetraploid wheats (Triticum turgidum L.) estimated by SSR, DArT and pedigree data. PLoS One. 2013;8(6):e67280. pmid:23826256
  46. 46. Matthies IE, van Hintum T, Weise S, Röder MS. Population structure revealed by different marker types (SSR or DArT) has an impact on the results of genome-wide association mapping in European barley cultivars. Molecular breeding. 2012;30(2):951–66.
  47. 47. Topp B, Hardner C, Neal J, Kelly A, Russell D, McConchie C, et al., editors. Overview of the Australian macadamia industry breeding program. Acta Horticulturae. 2016;1127:45–50.
  48. 48. Doyle JJ. Isolation of plant DNA from fresh tissue. Focus. 1990;12:13–5.
  49. 49. Baloch FS, Alsaleh A, Andeden EE, Hatipoglu R, Nachit M, Ozkan H. High levels of segregation distortion in the molecular linkage map of bread wheat representing the West Asia and North Africa region. Turkish Journal of Agriculture and Forestry. 2016;40(3):352–64.
  50. 50. Kilian A, Wenzl P, Huttner E, Carling J, Xia L, Blois H, et al. Diversity Arrays Technology: A Generic Genome Profiling Technology on Open Platforms. In: Bonin FPaA, editor. Data Production and Analysis in Population Genomics: Methods and protocol. New York Heidelberg Dordrecht London: Springer; 2012. p. 67–89.
  51. 51. Perrier X, Flori A, Bonnot F. Methods for data analysis. Genetic diversity of cultivated tropical plants Science Publishers, Inc and CIRAD, Montpellier. 2003:31–63.
  52. 52. Pritchard J, Wen X, Falush D. Documentation for structure software: Version 2.3. University of Chicago, Chicago, IL. 2010.
  53. 53. PE PRS. GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research—an update. Bioinformatics. 2012;28:2537–9. pmid:22820204
  54. 54. Earl DA, vonHoldt BM. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conservation Genetics Resources. 2012;4(2):359–61
  55. 55. Kalinowski ST, Taper ML, Marshall TC. Revising how the computer program CERVUS accommodates genotyping error increases success in paternity assignment. Molecular Ecology. 2007;16(5):1099–106. pmid:17305863
  56. 56. Tabangin ME, Woo JG, Martin LJ, editors. The effect of minor allele frequency on the likelihood of obtaining false positives. BMC proceedings. 2009; 3 (Suppl 7):S41.
  57. 57. Nock CJ, Baten A, Barkla BJ, Furtado A, Henry RJ, King GJ. Genome and transcriptome sequencing characterises the gene space of Macadamia integrifolia (Proteaceae). BMC genomics. 2016;17(1):937. pmid:27855648
  58. 58. Simko I, Eujayl I, van Hintum TJ. Empirical evaluation of DArT, SNP, and SSR marker-systems for genotyping, clustering, and assigning sugar beet hybrid varieties into populations. Plant science. 2012;184:54–62. pmid:22284710
  59. 59. James KE, Schneider H, Ansell SW, Evers M, Robba L, Uszynski G, et al. Diversity arrays technology (DArT) for pan-genomic evolutionary studies of non-model organisms. PloS one. 2008;3(2):e1682. pmid:18301759
  60. 60. Xia L, Peng K, Yang S, Wenzl P, De Vicente MC, Fregene M, et al. DArT for high-throughput genotyping of cassava (Manihot esculenta) and its wild relatives. Theoretical and Applied Genetics. 2005;110(6):1092–8. pmid:15742202
  61. 61. Akbari M, Wenzl P, Caig V, Carling J, Xia L, Yang S, et al. Diversity arrays technology (DArT) for high-throughput profiling of the hexaploid wheat genome. Theoretical and applied genetics. 2006;113(8):1409–20. pmid:17033786
  62. 62. Yang X, Ren R, Ray R, Xu J, Li P, Zhang M, et al. Genetic diversity and population structure of core watermelon (Citrullus lanatus) genotypes using DArTseq-based SNPs. Plant Genetic Resources. 2016;14(03):226–33.
  63. 63. Von Mark VC, Kilian A, Dierig DA. Development of DArT marker platforms and genetic diversity assessment of the US collection of the new oilseed crop lesquerella and related species. PLoS One. 2013;8(5):e64062. pmid:23724020
  64. 64. Dierig D, Ray DT. New crops breeding: Lesquerella. In: Vollmann J, Rajcan I, editors. Oil Crops: Springer; 2009. pp. 507–16.
  65. 65. Kilian A, Huttner E, Wenzl P, Jaccoud D, Carling J, Caig V, et al. The Fast and the Cheap: SNP and DArT-Based Whole Genome Profiling for Crop Improvement. Proceedings of the international congress in the wake of the double helix: from the green revolution to the gene revolution; 2003:443–461.
  66. 66. Nagao MA, Hirae HH. Macadamia: cultivation and physiology. Crit Rev Plant Sci. 1992;10(5):441–70.
  67. 67. Sharp D, Playford J. Native macadamia germplasm resources. Australian Macadamia Society annual conference report. Lismore, NSW, Australia: Australian Macadamia Society; 1997. p. 20.
  68. 68. Waldron J, Peace CP, Searle IR, Furtado A, Wade N, Graham MW, et al. Randomly amplified DNA fingerprinting: a culmination of DNA marker technologies based on arbitrarily-primed PCR amplification. Journal of Biomedicine and Biotechnology. 2002;2(3):141–50. pmid:12488579
  69. 69. Jones AG, Small CM, Paczolt KA, Ratterman NL. A practical guide to methods of parentage analysis. Molecular ecology resources. 2010;10(1):6–30. pmid:21564987