Table 1.
SNP datasets for different geographic regions.
Figure 1.
Procrustes analysis of genetic and geographic coordinates of worldwide populations.
(A) Geographic coordinates of 53 populations. (B) Procrustes-transformed PCA plot of genetic variation. The Procrustes analysis is based on the Gall-Peters projected coordinates of geographic locations and PC1-PC2 coordinates of 938 individuals. The figures are plotted according to the Gall-Peters projection. PC1 and PC2 are indicated by dotted lines, crossing over the centroid of all individuals. PC1 and PC2 account for 6.22% and 4.72% of the total variance, respectively. The Procrustes similarity is (
). The rotation angle of the PCA map is
.
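As an illustration of the quantities named in this caption, the following Python sketch projects latitude-longitude pairs with the Gall-Peters formula and computes a Procrustes similarity between two paired two-dimensional point sets. It assumes the similarity is defined as sqrt(1 - D), where D is the minimized, standardized sum of squared distances over translation, scaling, rotation, and reflection. That definition and the function names `gall_peters` and `procrustes_similarity` are assumptions of this sketch, not details stated in the caption.

```python
import math


def gall_peters(lat_deg, lon_deg, radius=1.0):
    """Gall-Peters (cylindrical equal-area, 45 degree standard parallels) projection."""
    lam = math.radians(lon_deg)
    phi = math.radians(lat_deg)
    x = radius * lam / math.sqrt(2.0)
    y = radius * math.sqrt(2.0) * math.sin(phi)
    return (x, y)


def _center(points):
    """Translate a list of (x, y) points so that their centroid is the origin."""
    n = len(points)
    cx = sum(p[0] for p in points) / n
    cy = sum(p[1] for p in points) / n
    return [(p[0] - cx, p[1] - cy) for p in points]


def procrustes_similarity(geo, pcs):
    """Return sqrt(1 - D) for two paired 2-D point sets (assumed definition).

    geo and pcs are equal-length lists of (x, y) tuples, one pair per individual.
    """
    if len(geo) != len(pcs):
        raise ValueError("point sets must be paired")
    g = _center(geo)
    p = _center(pcs)
    # Entries of the 2x2 cross-product matrix M = P^T G.
    m11 = sum(a[0] * b[0] for a, b in zip(p, g))
    m12 = sum(a[0] * b[1] for a, b in zip(p, g))
    m21 = sum(a[1] * b[0] for a, b in zip(p, g))
    m22 = sum(a[1] * b[1] for a, b in zip(p, g))
    frob2 = m11 ** 2 + m12 ** 2 + m21 ** 2 + m22 ** 2
    det = m11 * m22 - m12 * m21
    # For a 2x2 matrix, (s1 + s2)^2 = ||M||_F^2 + 2|det M|, where s1 and s2
    # are the singular values; their sum is the optimum over rotations
    # and reflections.
    sv_sum = math.sqrt(frob2 + 2.0 * abs(det))
    norm_g = math.sqrt(sum(x * x + y * y for x, y in g))
    norm_p = math.sqrt(sum(x * x + y * y for x, y in p))
    if norm_g == 0.0 or norm_p == 0.0:
        raise ValueError("a point set has zero spread")
    return sv_sum / (norm_g * norm_p)
```

A value of 1 means the two maps coincide after the allowed transformations. Values closer to 0 indicate weaker agreement.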
Figure 2.
Procrustes analysis of genetic and geographic coordinates of European populations.
(A) Geographic coordinates of 37 populations. (B) Procrustes-transformed PCA plot of genetic variation. The Procrustes analysis is based on the unprojected latitude-longitude coordinates and PC1-PC2 coordinates of 1378 individuals. PC1 and PC2 are indicated by dotted lines, crossing over the centroid of all individuals. Abbreviations are as follows: AL, Albania; AT, Austria; BA, Bosnia-Herzegovina; BE, Belgium; BG, Bulgaria; CH-F, Swiss-French; CH-G, Swiss-German; CH-I, Swiss-Italian; CY, Cyprus; CZ, Czech Republic; DE, Germany; DK, Denmark; ES, Spain; FI, Finland; FR, France; GB, United Kingdom; GR, Greece; HR, Croatia; HU, Hungary; IE, Ireland; IT, Italy; KS, Kosovo; LV, Latvia; MK, Macedonia; NL, Netherlands; NO, Norway; PL, Poland; PT, Portugal; RO, Romania; RU, Russia; Sct, Scotland; SE, Sweden; SI, Slovenia; TR, Turkey; UA, Ukraine; YG, Serbia and Montenegro. Population labels follow the color scheme of Novembre et al. [9]. PC1 and PC2 account for 0.30% and 0.16% of the total variance, respectively. The Procrustes similarity is (
). The rotation angle of the PCA map is
.
Figure 3.
Procrustes analysis of genetic and geographic coordinates of Sub-Saharan African populations, excluding hunter-gatherer populations and Mbororo Fulani.
(A) Geographic coordinates of 23 populations. (B) Procrustes-transformed PCA plot of genetic variation. The Procrustes analysis is based on the unprojected latitude-longitude coordinates and PC1-PC2 coordinates of 348 individuals. PC1 and PC2 are indicated by dotted lines, crossing over the centroid of all individuals. PC1 and PC2 account for 1.34% and 0.69% of the total variance, respectively. The Procrustes similarity is (
). The rotation angle of the PCA map is
.
Figure 4.
Procrustes analysis of genetic and geographic coordinates of Asian populations.
(A) Geographic coordinates of 44 populations. (B) Procrustes-transformed PCA plot of genetic variation. The Procrustes analysis is based on the unprojected latitude-longitude coordinates and PC1-PC2 coordinates of 749 individuals. PC1 and PC2 are indicated by dotted lines, crossing over the centroid of all individuals. PC1 and PC2 account for 5.42% and 0.85% of the total variance, respectively. The Procrustes similarity is (
). The rotation angle of the PCA map is
.
Figure 5.
Procrustes analysis of genetic and geographic coordinates of East Asian populations.
(A) Geographic coordinates of 23 populations. (B) Procrustes-transformed PCA plot of genetic variation. The Procrustes analysis is based on the unprojected latitude-longitude coordinates and PC1-PC2 coordinates of 334 individuals. PC1 and PC2 are indicated by dotted lines, crossing over the centroid of all individuals. PC1 and PC2 account for 1.58% and 0.98% of the total variance, respectively. The Procrustes similarity statistic is (
). The rotation angle of the PCA map is
.
Figure 6.
Procrustes analysis of genetic and geographic coordinates of Central/South Asian populations.
(A) Geographic coordinates of 18 populations. (B) Procrustes-transformed PCA plot of genetic variation. The Procrustes analysis is based on the unprojected latitude-longitude coordinates and PC1-PC2 coordinates of 362 individuals. PC1 and PC2 are indicated by dotted lines, crossing over the centroid of all individuals. PC1 and PC2 account for 1.59% and 1.31% of the total variance, respectively. The Procrustes similarity statistic is (
). The rotation angle of the PCA map is
.
Figure 7.
Histograms of the Procrustes similarity of 100,000 permutations for analyses in Figure 1, Figure 2, Figure 3, Figure 4, Figure 5, and Figure 6.
The blue vertical lines indicate the observed value of the Procrustes similarity. (A) The worldwide dataset in Figure 1 (
,
). (B) The European dataset in Figure 2 (
,
). (C) The Sub-Saharan African dataset in Figure 3 (
,
). (D) The Asian dataset in Figure 4 (
,
). (E) The East Asian dataset in Figure 5 (
,
). (F) The Central/South Asian dataset in Figure 6 (
,
).
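The permutation histograms described in this caption can be sketched in Python as follows. The sketch shuffles the rows of one coordinate set, recomputes a similarity statistic for each shuffle, and reports the fraction of permuted values that reach the observed one. The function name `permutation_test`, the row-level shuffling scheme, and the convention for the reported fraction are assumptions made for illustration. The caption itself states only that 100,000 permutations were used.

```python
import random


def permutation_test(geo, pcs, statistic, n_perm=1000, seed=0):
    """Permutation test for a similarity statistic between paired point sets.

    statistic is any function taking (geo, pcs) and returning a number,
    where larger values mean greater similarity.
    Returns (observed value, list of permuted values, empirical fraction).
    """
    rng = random.Random(seed)
    observed = statistic(geo, pcs)
    shuffled = list(geo)  # copy, so the caller's data is left untouched
    null_values = []
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # break the pairing between the two sets
        t = statistic(shuffled, pcs)
        null_values.append(t)
        if t >= observed:
            exceed += 1
    return observed, null_values, exceed / n_perm
```

The list of permuted values is what a histogram such as those in panels A to F would display, with the observed value drawn as a vertical line.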
Table 2.
Summary of the results for datasets from different geographic regions.
Figure 8.
Procrustes analyses of genetic and geographic coordinates based on different numbers of loci.
The same sets of randomly selected markers were used to generate PCA maps of genetic variation to compare with geographic maps for different regions.
Figure 9.
Relationship between the Procrustes similarity and the proportion of genetic variation explained by the first two components of the PCA.
Both the main analyses of the paper in Table 2 and the supplementary analyses of Sub-Saharan Africa, in which certain populations excluded from the main analysis are included, are considered in obtaining the regression line. The values on the x-axis were obtained by summing the proportions of variance explained by PC1 and PC2 (columns 2 and 3 in Table 2, columns 6 and 7 in Table S7). Procrustes similarity values were estimated from the same datasets as used in the PCA (column 7 in Table 2, column 11 in Table S7). The dashed line indicates the linear least squares fit of
. The Pearson correlation is
.
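The regression and correlation mentioned in this caption can be reproduced along the following lines. This is a minimal Python sketch: the x-values are built by summing the PC1 and PC2 percentages quoted in the captions of Figures 1 to 6, while the y-values (Procrustes similarities) must be taken from Table 2 and Table S7 and are deliberately not filled in here. The function name `linear_fit_and_pearson` and the dictionary layout are assumptions for illustration.

```python
import math

# Percent of variance explained by (PC1, PC2), as quoted in the figure captions.
variance_pc = {
    "Worldwide": (6.22, 4.72),
    "Europe": (0.30, 0.16),
    "Sub-Saharan Africa": (1.34, 0.69),
    "Asia": (5.42, 0.85),
    "East Asia": (1.58, 0.98),
    "Central/South Asia": (1.59, 1.31),
}

# x-axis values: summed proportion of variance explained by PC1 and PC2.
x_values = {name: pc1 + pc2 for name, (pc1, pc2) in variance_pc.items()}


def linear_fit_and_pearson(x, y):
    """Ordinary least squares fit y = slope * x + intercept, plus Pearson r."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must be paired and contain at least two points")
    mx = sum(x) / n
    my = sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = my - slope * mx
    r = sxy / math.sqrt(sxx * syy)
    return slope, intercept, r
```

Calling `linear_fit_and_pearson(xs, ts)` with the summed variance proportions as `xs` and the corresponding tabulated similarity values as `ts` yields the slope and intercept of the dashed line together with the Pearson correlation.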