Skip to main content
Advertisement

< Back to Article

Fig 1.

Geographic origin of the 620 human P. vivax isolates and two African great apes P. vivax-like isolates.

Distribution of the samples studied per region: Central America (n = 16, light blue), South America (n = 111, dark blue), Europe (n = 1), West Africa (n = 18, light green), East Africa (n = 115, dark green), Middle East (n = 62, orange), South Asia (n = 91, light red), East Asia (n = 7, dark red), Southeast Asia (n = 183, pink), and Oceania (n = 18, purple). The circle size is proportional to the sample size in log10 units. The countries included in the sampling effort are differentiated by varying shades of gray.: darker gray for countries where new samples were sequenced and lighter gray for countries where already published samples were used. Country names highlighted in bold indicate that new genomes were sequenced in this study. In brackets, the number of samples from the literature is listed first, followed by the number of newly sequenced samples. The chimpanzee pictogram indicates the two great apes P. vivax-like samples studied. PNG: Papua New Guinea. The base layer of the map was made with Natural Earth (naturalearthdata.com).

More »

Fig 1 Expand

Fig 2.

Worldwide genetic structure of P. vivax.

(a) Principal component analysis of 619 modern P. vivax strains and the Ebro ancient DNA sample from Spain, showing the first three PCs based on the genotype likelihood of 105,527 unlinked SNPs (see S2 Fig for more details). (b) Maximum likelihood (ML) tree of the 620 P. vivax individual genomes obtained with IQ-TREE [32] based on a general time reversible model of nucleotide evolution [33], as determined by ModelFinder [34]. The ML tree includes two P. vivax-like strains from African great apes indicated by an asterisk (*) used to locate the root. Note that the length of the outgroup branch was truncated. Black dots at nodes correspond to highly supported nodes (SH-aLRT ≥ 80% and UFboot ≥ 95%). The reference mark (※) highlight a Brazilian P. vivax isolate, discussed in the text. (c) Individual genetic ancestry assuming K = 4 and K = 9 distinct genetic clusters estimated using PCAngsd (see S5 Fig for the other K values). (d) Geographic distribution of the population average genetic ancestry proportions at K = 9 estimated using PCAngsd. PNG: Papua New Guinea. The base layer of the map was made with Natural Earth (naturalearthdata.com).

More »

Fig 2 Expand

Fig 3.

Population graphs describing the genetic relationships and admixture proportions between P. vivax populations.

(a) Population tree estimated using TreeMix for a subset of 18 P. vivax populations with two migration edges (orange arrows) and rooted using the two P. vivax-like genomes indicated with the asterisk (*). The scale bar shows ten times the mean standard error (s.e.) of the covariance matrix. The migration weight is indicated as a percentage in orange on the arrows. (b) Network topology of a subset of 18 P. vivax populations with the highest posterior probability obtained with AdmixtureBayes, rooted with the two P. vivax-like genomes (*). The branch length indicates the genetic divergence between populations (measured by drift), multiplied by 100. The percentages in the nodes are the posterior probability that the true graph has a node with the same descendants. For each admixture event (indicated by the dotted arrows) the percentages illustrate the admixture proportion. PNG: Papua New Guinea.

More »

Fig 3 Expand

Fig 4.

Coalescent-based inference of the demographic history of P. vivax populations.

(a) The variation in effective population (Ne) size was estimated using Relate (axes are log10 transformed). The period highlighted by the blue rectangle corresponds to the diversification of the Latin American populations. (b) Comparison between the inverse coalescence rates (ICR) and inverse cross-coalescence rates (ICCR) from Relate between Latin America (Colombia on top, French Guiana below) and Eurasia/Africa (left to right: Thailand, Papua-New-Guinea, Mauritania) provides a formal estimate of their split time [46]. The axes are log10 transformed. The periods highlighted by the colored rectangles correspond to the divergence time (Tdiv) between populations and are also specified in the panels. (c) Comparison of the ICR and ICCR between Latin America (Colombia on top, French Guiana below) and Ebro. The 95% confidence interval resulting from 100 bootstraps inferred using Colate is indicated by the envelope surrounding the estimate line. The black dot indicates Ebro sampling time. Axes are in log10 scale units. The period highlighted by the light-yellow rectangles corresponds to the divergence time (Tdiv) between the American populations and Ebro and is specified. PNG: Papua New Guinea.

More »

Fig 4 Expand

Fig 5.

Most likely P. vivax colonization history scenarios of Latin America inferred using the Approximate Bayesian Computation Random Forest approach.

(a) Twelve colonization scenarios were tested. Distinct solid colored lines correspond to populations with distinct effective size parameters (labels starting with “N”) and horizontal dashed lines represent admixture events with a contribution ra and 1-ra from each of the two contributors. Time parameters of the different events are displayed along the vertical line on the right of each scenario (labels starting with “t”, not to scale). The parameters of each scenario are characterized by probabilistic distributions detailed in S3 Table. Under each scenario is the mean percentage of RF classification votes ± standard deviation (SD) and the mean type II error ± SD. The three most supported scenarios (scenarios 6, 4 and 8, with 46.0% of the total votes) are highlighted by dotted-line rectangles. (b) The best-supported scenario (scenario 6) in the DIYABC-RF analysis, scaled to relative time-parameter estimates (converted to years assuming a generation time of 5.5 generations/year). On the left, each time parameter estimate is indicated by the mean (circle), median (diamond), and 90% confidence intervals (colored bars). Here, the results are only shown considering the Colombian population as a Latin America representative population. Results considering the French Guiana population were very similar (see S4 Table).

More »

Fig 5 Expand