Fig 1.
Flow chart of the locus discovery pipeline.
Table 1.
Number of reads, percentage of reads mapped back to the Eucalyputus grandis targets and the average coverage of 47 samples (chloroplast, nuclear and total).
P-distance (p-dist) of each species compared to Eucalyputus globulus subsp. pseudoglobulus.
Fig 2.
Flow chart of the bioinformatics analysis pipeline.
Fig 3.
Number of reads per sample mapped back to the Eucalyptus grandis targets, sorted from high to low.
Fig 4.
A heat map of the average coverage across 209 loci and 47 samples.
Average reads mapped were log transformed. Samples are on the x-axis (A = Arillastrum, Cor = Corymbia, H = Heteropyxis, Mel = Melaleuca), loci are on the y-axis (chl = chloroplast loci).
Fig 5.
Linear regression of p-distance of each sample from Eucalyptus globulus subsp. pseudoglobulus versus percentage of reads mapped back to Eucalyptus grandis targets.
The percentage of reads mapped and the p-distance is significantly negatively correlated (R2 = 0.36, P<0.0001).
Fig 6.
Linear regression of average read coverage versus GC content.
GC content and the average coverage are significantly negatively correlated (R2 = 0.2070, P < 0.0001).
Fig 7.
The single maximum likelihood tree of 47 taxa found by multiple searches (see Methods).
Bootstrap values are shown and branch labels indicate higher taxonomic groups down to the rank of subgenera (sg). When bootstrap values are not shown, nodes had a bootstrap value of 100.