Native Americans derive from a small number of Asian founders who likely arrived to the Americas via Beringia. However, additional details about the intial colonization of the Americas remain unclear. To investigate the pioneering phase in the Americas we analyzed a total of 623 complete mtDNAs from the Americas and Asia, including 20 new complete mtDNAs from the Americas and seven from Asia. This sequence data was used to direct high-resolution genotyping from 20 American and 26 Asian populations. Here we describe more genetic diversity within the founder population than was previously reported. The newly resolved phylogenetic structure suggests that ancestors of Native Americans paused when they reached Beringia, during which time New World founder lineages differentiated from their Asian sister-clades. This pause in movement was followed by a swift migration southward that distributed the founder types all the way to South America. The data also suggest more recent bi-directional gene flow between Siberia and the North American Arctic.
Citation: Tamm E, Kivisild T, Reidla M, Metspalu M, Smith DG, Mulligan CJ, et al. (2007) Beringian Standstill and Spread of Native American Founders. PLoS ONE 2(9): e829. doi:10.1371/journal.pone.0000829
Academic Editor: Dee Carter, University of Sydney, Australia
Received: April 24, 2007; Accepted: August 10, 2007; Published: September 5, 2007
Copyright: © 2007 Tamm et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: Funding for this work is from NSF BCS#0422144 to RSM and from the University of Tartu #PBGMR06901 to TK.
Competing interests: The authors have declared that no competing interests exist.
The mitochondrial DNA haplogroup nomenclature that is widely used today in population and medical genetics, forensic science, and in other interdisciplinary studies, traces back to the analysis of Native American populations by Torroni et al. ,. The first four letters of the phylogenetic alphabet for mtDNA haplogroups - A-D - were coined to refer to just four founding haplogroups that exhibit virtually all North and South American mtDNA diversity.
Genetic studies demonstrate that Native Americans inherited their mitochondrial DNA (mtDNA) from a handful of founders who arrived from Asia via Beringia ,. No more than four major pan American and three minor North American founding mtDNA haplotypes (A2, B2, C1, D1 and X2a, D2, D3, respectively) have been convincingly established in previous studies of control region sequence, RFLP markers and 30 complete mtDNA genomes (Table 1) –. The paucity of established founding mtDNAs suggests that the number of migrants that initially peopled the Americas was relatively low. However, determining the full range of diversity surviving to the present day in the founding population requires high-resolution mtDNA sequence data. Previous estimates of mtDNA diversity are predominantly based on control region sequences representing only a minor fraction of the mtDNA genome. In addition, control region sequences experience a high frequency of recurrent mutations, potentially obscuring the identification of additional founding mtDNAs –.
Even though some additional minor founder types have been later identified in North America, such as X, the hypothesis of just four major founder types in the initial colonization of the New World remains uncontested. However, the timing of their entry remains debated. Previous studies of mtDNA data place estimates for the peopling of the New World in a broad range from 11,000 to over 40,000 years before present (ybp) [reviewed by 20], although more recent estimates range from 20,000–15,000 ybp . Recent archaeological evidence places Homo sapiens in northeastern Siberia at the Yana Rhinoceros Horn Site as early as 30,000 ybp  about twice the 15,000 ybp  date for humans at the southern end of South America. These archaeological dates suggest two likely scenarios. First, the ancestors of Native Americans peopled Beringia before the Last Glacial Maximum, but remained locally isolated (likely due to ecological barriers) until entering the Americas at 15,000 ybp (Beringian incubation model, BIM) . Second, the ancestors of Native Americans did not reach Beringia until just before 15,000 ybp, and then moved continuously on into the Americas, being recently derived from a larger parent Asian population (direct colonization model, DCM).
The DCM model hypothesizes the presence of founding mtDNA haplotypes that include members from both Northeast Asia and the Americas. It presumes a continuous movement of recently derived migrants across Beringia. In contrast, the BIM model predicts widespread, derived founding haplotypes specific to the Americas that are not found in Asia. This implies that migrants were isolated for an extended period of time before entering the Americas and that the founder haplotypes arose in situ in Beringia. Once in the Americas, these immigrants spread southward. Therefore, the phylogeographic distribution of this diversity can provide insights into the mode of the initial phase of the peopling of the Americas. A nested hierarchy of diversity from north to south in Native American founding haplogroups would reflect a gradual peopling, whereas a uniform distribution of Native American founding haplotypes both in North and South America implies a rapid occupation.
Previous studies of mtDNA variation recognized no major subclade structure within each New World haplogroup ,–,. A few studies of mtDNA variation suggested subclade structure ,,, but lacked the power to convincingly demonstrate it. In this study we identified three sub-clades - C1b, C1c and C1d - that incorporate nearly all of Native American haplogroup C mtDNAs. All three are widely distributed in the New World. They are absent in Asia, and show similar coalescence times of approximately 13,900±2,700 years ago (Figure 1). Similar coalescence times were estimated for the other major founder haplogroups - A2, B2 and D1 - suggesting the simultaneous divergence of all founder clades across North and South America. A different C1 sub-clade in Asia - C1a - likely derives from the same ancestral population as the three Native American sub-clades. Thus C1b, C1c and C1d are likely independent New World founders. In addition to C1 sub-clades, we defined two additional founders–D4h3 and C4c. These differ by several mutations from the Asian-derived ancestral clades, D4h and C4, respectively (Figure S1d,c). Haplogroup D4h3 ranges from Alaska to Tierra del Fuego and has recently been identified in Alaskan skeletal remains (10,300 ybp) . We identified haplogroup C4c in two Ijka-speakers from Colombia, but its distribution in the Americas remains poorly characterized.
Branches encompassing Native Americans and their immediate Asian ancestral and sister lineages, represented by complete sequences, are shown in black with coalescence ages indicated and geographic location identified by colours. Lineages in brown correspond to the main haplogroups, found in Eurasia and Oceania, but absent in Native Americans. For complete phylogenetic tree see Supplementary figure 1.
Our phylogeographic analysis of a new mitochondrial genome dataset allows us to draw several conclusions. First, before spreading across the Americas, the ancestral population paused in Beringia long enough for specific mutations to accumulate that separate the New World founder lineages from their Asian sister-clades (Figure 2) ,, –. Second, founding haplotypes are uniformly distributed across North and South America instead of exhibiting a nested structure from north to south (Figure 1). Thus, after the Beringian standstill, the initial North to South migration was likely a swift pioneering process, not a gradual diffusion. This scenario matches the pattern of distribution of the first archaeological sites in Northeast Asia and the Americas ,. Third, the largely autochthonous pattern of variation seen in Native American mtDNAs suggests that the swift migration was followed by long-term isolation of local populations accompanied with the development of regional haplotypes within continental founder haplogroups .
Colours of the arrows correspond to approximate timing of the events and are decoded in the coloured time-bar. The initial peopling of Berinigia (depicted in light yellow) was followed by a standstill after which the ancestors of the Native Americans spread swiftly all over the New World while some of the Beringian maternal lineages–C1a-spred westwards. More recent (shown in green) genetic exchange is manifested by back-migration of A2a into Siberia and the spread of D2a into north-eastern America that post-dated the initial peopling of the New World.
In addition to illuminating the peopling process during the pioneering phase, the new dataset allows identification of more-recent genetic exchanges around and across Beringia (Figure 2). Specifically, haplogroup D2 consists of two sister clades, one found only in Siberia (D2b) and the other found in northernmost Eskimos, Chukchi, Aleut, and Athapaskans (D2a). While sub-haplogroup D2a is shared between ethno-historically close related Beringian Aleuts and Eskimos, (Figure S1) its sister clade D2b is spread among populations from distantly related linguistic groups (Tungusic, Turkic, Mongolic) (Table S2). A close relationship of matrilineal ancestry between individuals from different linguistic groups may be due to an overlap of geographic range of their ancestors approximately at the time of the Pleistocene-Holocene boundary. Alternatively, some populations may have received the D2b variant through more recent gene flow. It is also worthwhile to note the absence of D2 in all other Native American populations, suggesting that D2 diversified in Beringia after the initial migration into the Americas had occurred. Haplogroup D3 may have also reached America through more recent genetic exchange. It is spread in Nganasans, Mansi, Evenks, Ulchi, Tuvas, Chukchi and Siberian Eskimos , and recently reported in Greenland and Canadian Inuit populations , but absent in other Native Americans. Additional investigatios of these populations may provide insight into the cause of the phylogenetic connections.
Surprisingly, we also found a Native American sub-type of haplogroup A2 among Evenks and Selkups in southern and western Siberia (Table S2). Previously, this HVS I motif is reported in one Yakut-speaking Evenk in northwestern Siberia . A novel demographic scenario of relatively recent gene flow from Beringia to deep into western Siberia (Samoyedic-speaking Selkups) is the most likely explanation for the phylogeography of haplogroup A2a, which is nested within an otherwise exclusively Native American A2 phylogeny (Figure S1).
The high-resolution sequence data analyzed in this study reveals previously hidden diversity within the Native American mtDNA gene pool. The new data suggest that the initial founders of the Americas emerged from a single source ancestral population that evolved in isolation, likely in Beringia. This scenario is consistent with the unique pattern of diversity from autosomal locus D9S1120  of a private allele in high frequency and ubiquitous in the Americas. The finding that humans were present at the Yana Rhinoceros Horn Site dated to 30,000 ybp  suggests that the isolation in Beringia might have lasted up to 15,000 years. Following this isolation, the initial founders of the Americas began rapidly populating the New World from North to South America.
Materials and Methods
The sample-set comprises 601 Native American individuals from 20 populations distributed throughout the Americas (23 Dogribs from Subarctic Canada; 20 Apaches, 20 Northern Paiutes, 11 Zunis from Southwest US; 77 Ngöbes, 34 Kunas from Panama; 39 Emberas, 57 Waunanas from Panama and Colombia; 47 Arsarios, 48 Koguis, 29 Ijkas, 42 Wayuus, 27 Coreguajes, 22 Vaupes from Colombia; 12 Secoyas-Sionas, 32 Cayapas from Ecuador; 9 Tucuman, 18 Salta, 25 Catamarca, 5 Mocovi from Argentina) and 3764 samples from 26 Asian populations (51 Eskimos; 155 Chukchi; 120 Selkups; 66 Kets; 70 Tundra Nenets; 275 Tuvas; 185 Khakas; 339 Altaians; 170 Shors; 71 Koryaks; 85 Nanais; 122 Uyghurs; 406 Kazakhs; 58 Gilyaks; 61 Oroks; 105 Kirghiz; 48 Uzbeks; 38 Tajiks; 201 Buriats; 324 Evenks; 105 Evens; 22 Yukaghirs; 423 Yakuts; 157 Dolgans; 107 Nganasans). A subset of these sequences were reported elsewhere –.
First, haplogroup affiliations of the individual samples were determined through RFLP analysis and DNA sequencing of the HVS I region, if not known earlier –. Samples that could not be assigned to haplogroups A–D or X were investigated for evidence of recent admixture, particularly among populations with well established historical accounts of co-existence of Native American and either European or African populations. Samples of European or African origin were excluded from the current study. Further, 20 Native American and 7 Asian samples were selected for complete sequencing of mtDNA genomes. Using these 27 novel and 113 published Native American and relevant Asian complete or coding region sequences ,,,–, phylogenetic trees were reconstructed based on a maximum parsimony approach (Figure S1, Text S1). From these whole mtDNA genomes coding region markers were selected for screening in the sample set through RFLP analyses or direct sequencing (Table S1). Hierarchical method was used, so, that each Native American sample was first cheked for nucleotide positions, where a polymorphism could be assumed based on the HVS I information and close ethnic, geographic or linquistic affiliation to complete sequenced sample. From Asian populations, samples, which could be relevant to Native American haplogroups, were selected based on HVS I sequence and analyzed for coding region markers (A2–12007, 8027; A2a–3330; C1a–3826, 7598; C4–11969; C4a–12672; C4b–3816; C4c–11440; D2–8703; D2a–4991, 11959; D2b–9181; D4–8414T; D4a–3206; D4e1–3316; D4h–3336, 3644; D4m–9667; D5–5301; D5a–11944, 12026).
DNA was extracted using conventional methods –. Preparation of sequencing templates was carried out following standard protocols, employing FIREPol polymerase (Solis BioDyne). Purified products were sequenced with the DYEnamic™ ET terminator cycle sequencing kit (Amersham Pharmacia Biotech) and analyzed on MegaBace1000 or ABI 3730xl sequencers. Sequences were aligned and analyzed with the Wisconsin Package (GCG) or ChromasPro 1.34.
Coalescence-age calculations and SDs were estimated based on the phylogenies of complete sequences ,. Given the global propensity of young mtDNA clades showing a significant excess of non-synonymous mutations, application of the raw molecular clock  in intra-species data sets is problematic . Therefore, for dating the coalescent times of founder haplogroups we employed only synonymous transitions between the np 577-16023, assuming the rate of 3.5×10-8 (SD 0.1×10-8)/year/position . The complete mtDNA genome data can be found in Genbank.
Complete Phylogenetic Trees
(0.10 MB XLS)
(0.71 MB XLS)
(0.02 MB XLS)
We would like to thank the late John McDonough and the late Dr. Surinder Papiha for their contributions to this research project.
Conceived and designed the experiments: TK RM. Performed the experiments: ET MR. Analyzed the data: CB MM TK RM ET MR. Contributed reagents/materials/analysis tools: CB CM RV TK RM DS OR CM EK SF MG VS MG SZ LO LD MV JD. Wrote the paper: CM RV MM TK RM ET DS.
- 1. Torroni A, Schurr TG, Cabell MF, Brown MD, Nell JV, et al. (1993) Asian affinities and contiental radiation of the four founding Native American mtDNAs. Am J Hum Genet 53: 563–590.
- 2. Schurr TG, Ballinger SW, Gan YY, Hodge JA, Merriwether DA, et al. (1990) Amerindian mitochondrial DNAs have rare Asian mutations at high frequencies, suggesting they derived from four primary maternal lineages. Am J Hum Genet 46: 613–623.
- 3. Forster P, Harding R, Torroni A, Bandelt HJ (1996) Origin and evolution of Native American mtDNA variation: a reappraisal. Am J Hum Genet 59: 935–945.
- 4. Bandelt HJ, Hernnstadt C, Yao YG, Kong QP, Kivisild T, et al. (2003) Identification of Native American founder mtDNas through the analysis of complete mtDNA sequences: some caveats. Ann Hum Genet 67: 512–524.
- 5. Brown MD, Hosseini SH, Torroni A, Bandelt HJ, Allen JC, et al. (1998) mtDNA haplgroup X: An ancient link between Europe/Western Asia and North America? Am J Hum Genet 63: 1852–1861.
- 6. Saillard J, Forster P, Lynnerup N, Bandelt HJ, Norby S (2000) mtDNA variation among Greenland Eskimos: the edge of the Beringian expansion. Am J Hum Genet 67: 718–726.
- 7. Merriwether DA, Rothhammer F, Ferell RE (1995) Distribution of the four founding lineage haplotypes in Native Americans suggests a single wave of migration for the New World. Am J Phys Anthropol 98: 411–430.
- 8. Bailliet G, Rothhammer F, Carnese FR, Bravi CM, Bianchi NO (1994) Founder mitochondrial haplotypes in Amerindian populations. Am J Hum Genet 55: 27–33.
- 9. Ward RH, Redd A, Valencia D, Frazier B, Paabo S (1993) Genetic and linguistic differentiation in the Americas. Proc Natl Acad Sci U.S.A. 90: 10663–10667.
- 10. Shields GF, Schmiechen AM, Frazier BL, Redd A, Voevoda ML, et al. (1993) mtDNA sequences suggest a recent evolutionary divergence for Beringian and northern North American populations. Am J Hum Genet 53: 549–562.
- 11. Lorenz JG, Smith DG (1996) Distribution of four founding mtDNA haplogroups among Native North Americans. Am J Phys Anthropol 101: 307–323.
- 12. Santos SE, Ribeiro-Dos-Santos AK, Meyer D, Zago MA (1996) Multiple founder haplotypes of mitochondrial DNA in Amerindians revealed by RFLP and sequencing. Ann Hum Genet 60: 305–319.
- 13. Horai S, Kondo R, Nakagawa-Hattori Y, Hayashi S, Sonoda S, et al. (1993) Peopling of the Americas, founded by four major lineages of mitochondrial DNA. Mol Biol Evol 10: 23–47.
- 14. Malhi RS, Eshleman JA, Greenberg JA, Weiss DA, Schultz Shook BA, et al. (2002) The structure of diversity within New World mitochondrial DNA haplogroups: implications for the prehistory of North America. Am J Hum Genet 70: 905–919.
- 15. Non AL, Kitchen A, Mulligan CJ (2006) Indentification of the most informative regions of the mitochondrial genome for phylogenetic and coalescent analyses. Mol Phylogenet Evol Epub.
- 16. Wakeley J (1993) Substitution rate variation among sites in hypervariable region 1 of human mitochondrial DNA. J Mol Evol 37: 613–623.
- 17. Hasegawa M, Di Rienzo A, Kocher TD, Wilson AC (1993) Toward a more accurate time scale for the human mitochondrial DNA tree. J Mol Evol 37: 347–354.
- 18. Malyarchuk BA, Rogozin IB (2004) Mutagenesis by transient misalignment in the human mitochondrial DNA control region. Ann Hum Genet 68: 324–339.
- 19. Sigurğardóttir S, Helgason A, Guicher JR, Stefansson K, Donnelly P (2000) The mutation rate in the human mtDNA control region. Am J Hum Genet 66: 1599–1609.
- 20. Eshleman JA, Malhi RS, Smith DG (2003) Mitochondrial DNA studies of Native Americans: Conceptions and misconceptions of the population prehistory of the Americas. Evo Anth 12: 7–18.
- 21. Schurr TG, Sherry ST (2004) Mitochondrial DNA and Y chromosome diversity and the peopling of the Americas: evolutionary and demographic evidence. Am J Hum Biol 16: 420–439.
- 22. Pitulko VV, Nikolsky PA, Girya EY, Basilyan AE, Tumskoy VE, et al. (2003) The Yana RHS site: humans in the Arctic before the last glacial maximum. Science 303: 52–56.
- 23. Dillehay TD (1999) The late Pleistocene cultures of South America. Evol Anth 7: 206–216.
- 24. Bonatto SL, Salzano FM (1997) A single and early migration for the peopling of the Americas supported by mitochondrial DNAn sequence data. Proc Natl Acad Sci USA 94: 1866–1871.
- 25. Merriewther DA, Ferrell RE (1996) The four founding lineage hypothesis for the New World: a critical reevaluation. Mol Phylogenet Evol 5: 241–246.
- 26. Starikovskaya EB, Sukernik RI, Derbeneva OA, Volodko NV, Ruiz-Pesini E (2005) Mitochondrial DNA diversity in indigenous populations of the southern extent of Siberia, and the origins of Native American haplogroups. Ann Hum Genet 69: 67–89.
- 27. Kemp BM, Malhi RS, McDonough J, Bolnick DA, Eshleman JA, et al. (2007) Genetic analysis of early holocene skeletal remains from Alaska and its implications for the settlement of the Americas. Am J Phy Anth 132: 605–621.
- 28. Szathmary EJ (1993) mtDNA and the peopling of the Americas. Am J Hum Genet 53: 793–799.
- 29. Crawford MH (1998) The Origins of Native Americans: Evidence from Anthropological Genetics. Cambridge University Press.
- 30. Derbeneva OA, Sukernik RI, Volodko NV, Hosseini SH, Lott MT, et al. (2002) Analysis of mitochondrial DNA diversity in the Aleuts of the Commander Islands and its implications for the genetic history of Beringia. Am J Hum Genet 71: 415–21.
- 31. Helgason A, Palsson G, Pedersen HS, Angulalik E, Gunnarsdottir ED, et al. (2006) MtDNA variation in Inuit populations of Greenland and Canada: migration history and population structure. Am J Phys Anthropol 130: 123–134.
- 32. Pakendorf B, Novgorodov IN, Osakovskija VL, Danilova AP, Protodjakonov AP, et al. (2006) Investigating the effects of prehistoric migrations in Siberia: genetic variation and the origin of Yakuts. Hum Genet 120: 334–353.
- 33. Schroeder KB, Schurr TG, Long JC, Rosenberg NA, Crawford MH, et al. (2007) A private allele ubiquitous in the Americas. Biol Lett Epub.
- 34. Kolman CJ, Bermingham E (1997) Mitochondrial and nuclear DNA diversity in the Choco and Chibcha Amerinds of Panama. Genetics 147: 1289–302.
- 35. Kolman CJ, Bermingham E, Cooke R, Ward RH, Arias TD, et al. (1995) Reduced mtDNA Diversity in the Ngöbe Amerinds of Panama. Genetics 140: 275–283.
- 36. Rickards O, Martinez-Labarga C, Lum JK, De Stefano GF, Cann RL (1999) mtDNA history of the Cayapa Amerinds of Ecuador: detection of additional founding lineages for the Native American populations. Am J Hum Genet 65: 519–30.
- 37. Kaestle FA, Smith DG (2001) Ancient mitochondrial DNA evidence for prehistoric population movement: the Numic expansion. Am J Phys Anthropol 115: 1–12.
- 38. Malhi RS, Mortensen HM, Eshleman JA, Kemp BM, Lorenz JG, et al. (2003) Native American mtDNA prehistory in the American Southwest. Am J Phys Anthropol 120: 108–24.
- 39. Fedorova SA, Bermisheva MA, Villems R, Maksimova NR, Khusnutdinova EK (2003) Analysis of mitochondrial DNA lineages in Yakuts. Mol Biol (Mosk) 37: 643–53.
- 40. Bermisheva MA, Kutuev IA, Spitsyn VA, Villems R, Batyrova AZ, et al. (2005) Analysis of mitochondrial DNA variation in the population of Oroks. Russian Journal of Genetics 41: 66–71.
- 41. Goltsova TV, Osipova L, Zhadanov S, Villems R (2005) The Effect of Marriage Migration on the Genetic Structure of the Taimyr Nganasan Population. Russian Journal of Genetics 41: 954–965.
- 42. Melton PE, Briceno I, Gomez A, Devor EJ, Bernal JE, et al. (2007) Biological Relationship Between Central and South American Chibchan Speaking Populations: Evidence From mtDNA. Am J Phys Anthropol Mar 5.
- 43. Herrnstadt C, Elson JL, Fahy E, Preston G, Turnbull DM, et al. (2002) Reduced-median-network analysis of complete mitochondrial DNA coding-region sequences for the major African, Asian, and European haplogroups. Am J Hum Genet 70: 1152–1171.
- 44. Herrnstadt C, Preston G, Howell N (2003) Errors, phantoms and otherwise, in human mtDNA sequences. Am J Hum Genet 72: 1585–6.
- 45. Ingman M, Kaessmann H, Pääbo S, Gyllensten U (2000) Mitochondrial genome variation and the origin of modern humans. Nature 408: 708–713.
- 46. Kivisild T, Shen P, Wall DP, Do B, Sung R, et al. (2006) The role of selection in the evolution of human mitochondrial genomes. Genetics 172: 373–87.
- 47. Kong Q-P, Yao Y-G, Sun C, Bandelt H-J, Zhu C-L, et al. (2003) Phylogeny of East Asian mitochondrial DNA lineages inferred from complete sequences. Am J Hum Genet 73: 671–676.
- 48. Maca-Meyer N, Gonźalez AM, Larruga JM, Flores C, Cabrera VM (2001) Major genomic mitochondrial lineages delineate early human expansions. BMC Genet 2: 13.
- 49. Mishmar D, Ruiz-Pesini E, Golik P, Macaulay V, Clark AG, et al. (2003) Natural selection shaped regional mtDNA variation in humans. Proc Natl Acad Sci USA 100: 171–176.
- 50. Tanaka M, Cabrera VM, Gonzalez AM, Larruga JM, Takeyasu T, et al. (2004) Mitochondrial genome variation in eastern Asia and the peopling of Japan. Genome Res 14: 1832–1850.