Skip to main content
Advertisement

< Back to Article

Figure 1.

Graphical representation of the evolutionary model for a sample of two bacterial populations.

Strain sequences are represented as vertical bars with horizontal lines indicating the mutations that have occurred since the global ancestor. Stage-1 mutations are defined as those that occurred on local ancestors which provide candidate sites for gene flow between the populations. Mutations that occurred after the local ancestors are referred to as stage-2 mutations.

More »

Figure 1 Expand

Figure 2.

Tentative gene flow graph in six populations.

The graph topology can be succinctly termed as , where the node set and the arrow set . The actual rates of admixture associated with the arrows were randomly generated from a uniform distribution. Note the two ways of gene flow between population 2 and 3.

More »

Figure 2 Expand

Figure 3.

Testing partition accuracy for different choices of gene flow weights for a small population size (upper panel) and a large population size (lower panel).

The number of segregating sites for both settings is and the ratio of mutations at two stages is . Data were generated by assigning and randomly at the interval [0,1] with the gene flow topology fixed as in Figure 2. A brighter area corresponds to a range of and , within which the true partition has been identified by BAPS with a higher accuracy as measured by Rand Index (RI).

More »

Figure 3 Expand

Figure 4.

Testing gene flow structure accuracy for and .

Graph similarity was measured in the Hamming distance coded in a gray-scale image. Cells with the paper white color represent the scenarios where the partition and the gene flow structure in Figure 2 are both correctly identified by BAPS.

More »

Figure 4 Expand

Figure 5.

Genetic shapes of five populations relative to population 2.

The data set was generated with , , and Figure 2 as the underlying population structure. Each curve is a density estimation of (8) using (9) for one target population.

More »

Figure 5 Expand

Figure 6.

Bootstrap mixture analyses of the Neisseria data.

The figure shows the adjusted rand index between the partition based on the original data and the alternative based on a bootstrap data set by resampling in clusters. Five repetitions were made for each of the clusters.

More »

Figure 6 Expand

Figure 7.

Gene flow network identified in the N. meningitidis and N. lactamica populations.

To investigate the ancestral admixture of a certain population, one can look at all the arrows pointing at this population. A typical population contains the major sources of its own, denoted as a self-looping arrows, and small proportions of gene flow from other populations. For instance, population 29 has 73% of its own genetic makeup and 27% of the DNA introduced via gene flow from other populations. Two major sources of gene flow for population 29 are population 19 and population 11, with 3.3% and 6.9% in the contribution separately. The remaining 17.1% of genes comes from various sources while none of them contributes a proportion larger than 3% and therefore are not displayed due to the pruning.

More »

Figure 7 Expand

Table 1.

Bootstrap admixture analyses of the Neisseria data.

More »

Table 1 Expand

Figure 8.

Genetic shapes of N. lactamica populations 8, 29 and 32 as relative to N. meningitidis populations 11 and 19.

More »

Figure 8 Expand

Table 2.

The number of strains that are jointly assigned into a BAPS population and a EBURST group.

More »

Table 2 Expand

Figure 9.

The first NJ tree.

It shows a subset of the BAPS populations indicated with distinct colors.

More »

Figure 9 Expand

Figure 10.

The second NJ tree.

More »

Figure 10 Expand

Figure 11.

The third NJ tree.

More »

Figure 11 Expand

Figure 12.

The fourth NJ tree.

More »

Figure 12 Expand

Figure 13.

Color coding scheme used in the BAPS populations and the NJ trees.

For example, the first row shows the BAPS populations highlighted in the first NJ tree in Figure 9.

More »

Figure 13 Expand