Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

< Back to Article

Fig 1.

Geographical map of Uruguay showing the spatial distribution of the 82 accessions conserved in the INIA germplasm bank.

The purple diamonds indicate the five accessions evaluated in this study. Black circles represent the 77 accessions pending genomic evaluation. The map includes ecoregions according to the classification framework established by Brazeiro et al. (2012) [51]. The colour scheme is delineated in the accompanying legend. The graph was generated using the “ggplot2” package in R [53].

More »

Fig 1 Expand

Fig 2.

Diagram illustrating the procedures and parameters analyzed, using one accession as an example.

(A.) Individual sequencing (ind-seq) (blue background) involved individual sample collection, unique barcoding, sequencing at 0.9 Mr depth, individual SNP calling, and analysis with dartR and Bio-R. (B.) Pooled sequencing (pool-seq) (pink background) involved two tissue replicates, two libraries per replicate, sequencing at two depths, pooled SNP calling, and analysis with Bio-R. (C.) Common SNP calling (orange background) is employed to compare allele frequencies between paired individuals and pools with the same sample size and missing data. The minimum allele frequency and sequencing depth used in the individual SNP dataset remained constant. They were compared with pooled datasets based on two MAF and three sequencing depths. See the step-by-step protocol for more details [54]. This diagram was created using Microsoft PowerPoint.

More »

Fig 2 Expand

Fig 3.

Effect of sample size (A, B) and missing data thresholds (MD) (C, D) on Representativity (%) and Concordance Correlation Coefficient (CCC).

Each data point represents the average Representativity or CCC for a single accession, with the color key provided in the legend. For each group, the black horizontal line indicates the means, while the vertical lines represent the standard error. The upper panels (A.) and (B.) illustrate the effect of different sample sizes (20, 30, 40, 50, 60) on Representativity and CCC, respectively. For the same metrics, the lower panels, (C.) and (D.), show the influence of MD thresholds (10, 20, 30, 40, 50, and 60%). Statistical differences between groups, as determined by Tukey’s test (p < 0.05), are indicated by different letters. Plots were generated using the “ggplot2” and “ggpubr” packages in R [53,70].

More »

Fig 3 Expand

Fig 4.

Effect of sequencing depth (Mr) (A, B) and minor allele frequency thresholds (MAF) (C, D) on Representativity (%) and Concordance Correlation Coefficient (CCC).

Each dot represents the average Representativity or CCC of an individual accession, with the color coding outlined in the accompanying legend. For each group, the black horizontal line indicates the mean values, while the vertical lines represent the standard deviation. Panels (A.) and (B.) show the effect of sequencing depth (1.8, 3.0, and 4.8 Mr) on Representativity and CCC, respectively. Panels (C.) and (D.) illustrate the effect of MAF thresholds (0.01 and 0.05) on Representativity and CCC, respectively. Statistical differences between groups, as determined by Tukey’s test (p < 0.05), are indicated by different letters. Plots were generated using the “ggplot2” and “ggpubr” packages in R [53,70].

More »

Fig 4 Expand

Fig 5.

Cumulative effect of optimized factors (Selected) on Representativity (%) and Concordance Correlation Coefficient (CCC).

“All” compresses the entire dataset from the S6 Table. The “Selected” dataset includes samples with a minimum size of 30 plants, a missing data (MD) threshold of 10%, a coverage depth of 4.8 Mr for pooled samples, and a MAF threshold of 0.01. Each point represents the average Representativity or CCC of a single accession, with the color coding detailed in the legend. For each group, the black horizontal line indicates the mean value, while the vertical line represents the standard deviation. (A.) Shows the effect of Selected data on Representativity. (B.) Displays the effect of Selected data on CCC. Statistical differences between groups, determined by Tukey’s test (p < 0.05), are indicated by different letters. Plots were generated using the “ggplot2” and “ggpubr” packages in R [53,70].

More »

Fig 5 Expand

Table 1.

Effect of sample size, from 20 to 60 individuals, on genetic diversity parameters calculated using individual sequencing (ind-seq) data. The parameters analyzed include the number of single nucleotide polymorphisms (SNPs), observed heterozygosity (HO), expected heterozygosity (HE), inbreeding coefficient (FIS), and allelic richness (Ae), along with their minimum and maximum (min-max) values. The “Significance” row indicates the statistical significance of these effects, as determined by Tukey’s test, where “ns” represents non-significant differences, while “**” indicates statistically significant differences at p < 0.01.

More »

Table 1 Expand

Fig 6.

Effect of Sample size on Allele Richness (Ae).

Each dot represents a single accession, with the color coding detailed in the legend. For each group, the black horizontal line indicates the mean, while the vertical lines represent the standard deviation. Statistical differences between groups, determined by Tukey’s test (p < 0.05), are indicated by different letters. The plot was generated using the “ggplot2” package in R [53].

More »

Fig 6 Expand

Fig 7.

Effect of sample size and sequencing depth on single nucleotide polymorphisms (SNP) number and the influence of sequencing depth on Expected heterozygosity (HE) with pool-seq dataset.

Panel (A.) presents the average number of SNPs (SNPs) across accessions, along with the corresponding standard deviation, stratified by sample size and sequencing depth. Panel (B.) illustrates the average expected heterozygosity and its standard deviation across all accessions at varying sequencing depths.

More »

Fig 7 Expand

Fig 8.

Boxplot illustrating influence of sequencing depth in

ΔHE: HE pool-seq – HE ind-seq. Each dot represents a single accession, with color explained in the legend. The line inside each box represents the median, while the lower and upper box edges indicate the first and third quartiles, respectively. The whiskers extend from the box to the minimum and maximum values within 1.5 times the interquartile range from the first and third quartiles, respectively. Tukey’s test (p < 0.05) is indicated by letters statistical differences between groups. Plots were generated using the package “ggplot2” in R [53].

More »

Fig 8 Expand

Table 2.

Comparative population genetics parameters for each accession using a sample size of 50 individuals, analyzed with individual sequencing (ind-seq) and pooled sequencing (pool-seq). The table includes observed data for heterozygosity (HO), expected heterozygosity (HE), inbreeding coefficient (FIS), and allelic richness (Ae) from Ind-seq, while pool-seq data includes HE and Ae. Averages across all accessions are also presented.

More »

Table 2 Expand

Table 3.

Analysis of molecular variance (AMOVA) was conducted on a sample size of 50 individuals to assess genetic variation within and between accessions using both individual sequencing (ind-seq) and pooled sequencing (pool-seq) approaches. The table presents the degrees of freedom and percentage of variation for each source, with p-value reported for variation between-accession.

More »

Table 3 Expand

Table 4.

Pairwise fixation index (FST) values calculated between accessions using individual sequencing (ind-seq) dataset, based on a sample size of 50 individuals.

More »

Table 4 Expand

Fig 9.

Multidimensional scaling (MDS) based on Roger’s distance.

(A.) MDS plot derived from individual sequencing (ind-seq) data using 50 individuals per accession and 2,124 single nucleotide polymorphisms (SNPs). (B.) MDS plot derived from pool sequencing (pool-seq) data using pools of 40, 50, and 60 plants with a sequencing depth of 4.8 Mr and 63,017 SNPs. Graphs were generated using the “plotly” library in Python [71].

More »

Fig 9 Expand

Fig 10.

Proposed workflows for analyzing genetic diversity in B. auleticus.

(A.) Ind-seq: individual sequencing approach. (B.) pool-seq: pooled sequencing workflow. The diagram outlines the number of samples, sequencing depth, R packages used, and the analyses proposed for each workflow. This diagram was created using Microsoft PowerPoint.

More »

Fig 10 Expand