Identifying genetic markers for a range of phylogenetic utility–From species to family level

doi:10.1371/journal.pone.0218995

Fig 1.

Flow chart of the locus discovery pipeline.

More »

Expand

Table 1.

Number of reads, percentage of reads mapped back to the Eucalyputus grandis targets and the average coverage of 47 samples (chloroplast, nuclear and total).

P-distance (p-dist) of each species compared to Eucalyputus globulus subsp. pseudoglobulus.

More »

Expand

Fig 2.

Flow chart of the bioinformatics analysis pipeline.

More »

Expand

Fig 3.

Number of reads per sample mapped back to the Eucalyptus grandis targets, sorted from high to low.

More »

Expand

Fig 4.

A heat map of the average coverage across 209 loci and 47 samples.

Average reads mapped were log transformed. Samples are on the x-axis (A = Arillastrum, Cor = Corymbia, H = Heteropyxis, Mel = Melaleuca), loci are on the y-axis (chl = chloroplast loci).

More »

Expand

Fig 5.

Linear regression of p-distance of each sample from Eucalyptus globulus subsp. pseudoglobulus versus percentage of reads mapped back to Eucalyptus grandis targets.

The percentage of reads mapped and the p-distance is significantly negatively correlated (R² = 0.36, P<0.0001).

More »

Expand

Fig 6.

Linear regression of average read coverage versus GC content.

GC content and the average coverage are significantly negatively correlated (R² = 0.2070, P < 0.0001).

More »

Expand

Fig 7.

The single maximum likelihood tree of 47 taxa found by multiple searches (see Methods).

Bootstrap values are shown and branch labels indicate higher taxonomic groups down to the rank of subgenera (sg). When bootstrap values are not shown, nodes had a bootstrap value of 100.

More »

Expand