Given the paucity of archaeogenetic data available for medieval European populations in comparison to other historical periods, the genetic landscape of this age appears as a puzzle of dispersed, small, known pieces. In particular, Southeastern Europe has been scarcely investigated to date. In this paper, we report the study of mitochondrial DNA in 10th century AD human samples from Capidava necropolis, located in Dobruja (Southeastern Romania, Southeastern Europe). This geographical region is particularly interesting because of the extensive population flux following diverse migration routes, and the complex interactions between distinct population groups during the medieval period. We successfully amplified and typed the mitochondrial control region of 10 individuals. For five of them, we also reconstructed the complete mitochondrial genomes using hybridization-based DNA capture combined with Next Generation Sequencing. We have portrayed the genetic structure of the Capidava medieval population, represented by 10 individuals displaying 8 haplotypes (U5a1c2a, V1a, R0a2’3, H1, U3a, N9a9, H5e1a1, and H13a1a3). Remarkable for this site is the presence of both Central Asiatic (N9a) and common European mtDNA haplotypes, establishing Capidava as a point of convergence between East and West. The distribution of mtDNA lineages in the necropolis highlighted the existence of two groups of two individuals with close maternal relationships as they share the same haplotypes. We also sketch, using comparative statistical and population genetic analyses, the genetic relationships between the investigated dataset and other medieval and modern Eurasian populations.
Citation: Rusu I, Modi A, Vai S, Pilli E, Mircea C, Radu C, et al. (2018) Maternal DNA lineages at the gate of Europe in the 10th century AD. PLoS ONE 13(3): e0193578. https://doi.org/10.1371/journal.pone.0193578
Editor: Dan Mishmar, Ben-Gurion University of the Negev, ISRAEL
Received: September 18, 2017; Accepted: February 14, 2018; Published: March 14, 2018
Copyright: © 2018 Rusu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Sequences were submitted to NCBI GenBank (https://www.ncbi.nlm.nih.gov/genbank/) under the accession numbers MF320129-MF320142 and MF597774-MF597782. All other relevant data are within the paper and its Supporting Information files.
Funding: This work was supported by the Executive Unit for Financing Higher Education, Research, Development and Innovation (UEFISCDI), project number PN-II-PT-PCCA-2011-3.1-1153, contract number 229/2013: “Genetic Evolution: New Evidences for the Study of Interconnected Structures. A Biomolecular Journey around the Carpathians from Ancient to Medieval Times”. M.H. was supported by a Basque Government grand to the research group (IT 1138-16). I.R.’s research internship at the University of Florence was supported by the FONDUL SOCIAL EUROPEAN, Programul Operaţional Sectorial Dezvoltarea Resurselor Umane 2007-2013, “Calitate, excelență, mobilitate transnațională în cercetarea doctorală”, POSDRU/187/1.5/S/155383. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: David Caramelli is a member of the PLOS ONE editorial board, which does not alter the authors’ adherence to PLOS ONE Editorial policies and criteria.
Dobruja is a historical region of Southeastern Europe, situated between the lower Danube River and the Black Sea. Today, this area is shared by Romania and Bulgaria, but in the historical past the territory surrounding the lower section of the Danube basin was a continuous territory and under the periodic domination and influence of distinct powerful state entities (Byzantine Empire, Kievan Rus’ and Bulgarians) [1, 2], being an important migration node between Asia and several parts of Europe. Within this area, one can find tracks marking the passage of multiple populations (e.g. Greeks, Romans, Byzantines, Pechenegs, Tatars, etc.) . A witness of regional changes is the Capidava fortress, from the beginning of the 2nd century AD when the Romans settled and became aware of its strategic importance, until the 11th century when the Pecheneg invasion ended the Byzantine inhabitation of this location . Capidava acts, from this perspective, as a gate of access for migratory populations towards Western Europe and also as a trade center with an extremely complex history during the Middle Ages. Through the 10th century AD when Capidava is alternatively dominated, in the context of continuous conflicts between the Byzantine Empire, the Kievan Rus’ and the proto-Bulgarians, by each of them. Questions regarding the impact and extent of these past political and associated demographic events, that left imprints on the local genetic structure and influenced the current European gene pool remain unanswered, even though, the first elements of evidence could be inferred from the archaeological context . Additional clues on historical population movements can be provided by ‘bio-archives’ such as human skeletal remains. Despite recent advances in the study of nuclear genome from ancient samples, mitochondrial DNA (mtDNA) remains a suitable instrument for studying past population movements [5–8].
Currently, most information on the genetic landscape of Romanian populations is based on mtDNA diversity in present-day inhabitants. A few studies attempted to evaluate their genetic relationship with other Eurasians [9–11], whereas some focused on mtDNA variation in minority groups [12–14]. To shed light on possible past migratory routes, a recent study pointed that contemporary distribution pattern of mtDNA haplogroups in the historical provinces of Romania is mostly governed by genetic affinities towards the Balkans, whereas the Transylvanian population is more closely related to Central European groups . The genetic makeup of modern populations is the reflection of complex interactions between local and external genetic pools with variable dimensions and distributions. Traces of past events might have been blurred by more recent events, but direct evidence can be provided by local ancient genetic information. Ancient DNA (aDNA) studies using archaeological material from the current Romanian territory are scarce and restricted to distinct past periods such as the Early Neolithic, the Late Bronze Age  or the Upper Paleolithic which is represented by an early anatomically modern human fossil discovered in Peştera cu Oase, Southwestern Romania . Thus, the genetics of Romanian medieval period remains a blank spot on the European map.
The aim of this study was to determine the genetic architecture of maternal lineages in a medieval population from Capidava, Dobruja, Romania, and also to evaluate the relationships to other medieval and modern populations from Europe and Asia. Medieval mtDNA information can be knitted together with archaeological and historical data to provide a better understanding of the impact of local events on the genetic structure of larger subsequent populations which are the de facto result of an original genetic pool modeled by familial relationships and immigrants arriving on different migratory routes, in different historical periods.
Materials and methods
Archaeological background and sample information
In this study we processed eleven archaeological human remains sets, identified as: Cap-M1, Cap-M2, Cap-M3, Cap-M4, Cap-M5, Cap-M6, Cap-M8, Cap-M9, Cap-M11, Cap_M15, Cap_M17, retrieved from the Capidava necropolis (zone X extra muros), in the Dobruja region, Constanța county, Capidava village, during the 2010–2014 campaigns. Yearly systematic archaeological research is carried out at this site under the coordination of the National History and Archaeology Museum—Constanța and authorized by the Romanian Ministry of Culture and National Identity through the National Commission of Archaeology. Permit numbers for the 2010–2014 campaigns are: 33/2010, 105/2011, 52/2012, 69/2013 and 45/2014. Reports detailing all archaeological excavations are yearly published in the Cronica Cercetărilor Arheologice series, edited by the National Institute of Heritage, in Romanian (cronica.cimec.ro). All human remains excavated from the X extra muros zone in this time frame were transferred to a permanent collection deposited for research at the Molecular Biology Center, Interdisciplinary Research Institute on Bio-Nano Sciences, “Babeș-Bolyai” University in Cluj-Napoca and carry the above mentioned codes. The collection is open to researchers, upon request.
We analyzed the human skeletal remains of 11 individuals excavated from the archaeological site of Capidava (Fig 1), outside the walls of the Capidava fortress, in its associated necropolis (S1 Fig). According to funerary rites, grave goods, and subsequent radiocarbon dating, the investigated individuals are dated to the 10th century AD and were most probably part of a Christian population . The only exception, the subadult from grave M6, placed in the same stratigraphic layer as the others, presented a particular and different funerary context marked by the evidence of a stone cist, being, therefore better fitted in the Roman period. A direct radiocarbon dating was performed on the skeletal remains of individual M4 at Beta Analytic (Florida, USA).
The map was created using QGIS 2.18.11.
Prior to the molecular investigation, all specimens were morphologically assessed in order to infer sex, age at death, metric measurements and characterization of potential pathological conditions. For this purpose, standard guidelines were used [18, 19]. Of the total number of individuals (n = 15) recovered from the necropolis at the moment when this study was initiated, only 11 specimens were selected for the molecular analysis. This selection took into consideration the preservation status of the specimens. A summary of the data retrieved through the osteological and pathological analysis is listed in the S1 Table.
Molecular analysis of the 11 selected archaeological specimens was performed under sterile conditions in a dedicated ancient DNA facility at the Interdisciplinary Research Institute on Bio-Nano-Sciences, “Babeș-Bolyai” University, Romania, following strict guidelines and standard precautionary measures to ensure aDNA authenticity [20–23]. Laboratory rooms designated for pre-PCR and post-PCR processes are physically separated, and sterile conditions are maintained using UV-irradiation and regularly cleaning with bleach of all appliances, materials and work areas. The laboratory work was carried out while wearing suitable protective equipment that includes clean overalls, facemasks, shoe covers and disposable gloves. In order to screen for potential contamination with exogenous DNA, at least one extraction and one amplification blank for every three samples were used as negative controls. MtDNA profiles of the researchers who have been in contact with the samples were recorded and compared with the results obtained for the ancient specimens in order to track modern contamination (S10 Table).
Since teeth are less susceptible to contamination than other skeletal elements , intact ones (multi-rooted, if available) were selected as a source for aDNA isolation, for each tested individual. The external surface of the dental samples was decontaminated using bleach, washing with sterile water, followed by UV exposure (254 nm) in a cross-linker (UVILink CL 508G, UVITec, UK) for 25 minutes on each side. Biological material from human teeth was obtained by accessing the pulp cavity via the apical end of the roots using a dental micro-motor, at low speed (Marathon-3 Champion, Saeyang Microtech, Korea). The resulting powdered tissue was used for DNA extraction by means of a silica-based spin column protocol, modified after Yang et al. . Generally, 100–200 mg of dental powder was incubated at 55°C for 40 hours with 1 ml digestion buffer (0.5 M EDTA pH 8.0; 0.5% SDS; 100 μg/mL Proteinase K), then incubated further at 37°C for 4 hours. This solution was afterward centrifuged at 2,000 x g for 5 minutes and the supernatant was divided into aliquots and transferred into new 2 ml Eppendorf tubes for DNA purification using QIAquick PCR purification kit (Qiagen, Hilden, Germany) according to manufacturer’s instructions. DNA was eluted in a final volume of 100 μl and subsequently stored at −20°C until use.
For results authentication, a second extraction was performed in the case of four samples (M1, M4, M8, M17), using an extraction method, designed in our laboratory, to co-purify DNA and proteins from archaeological remains. This protocol employs overnight decalcification of dental powder in 1.5 ml of 0.5 M EDTA solution pH 8.0 under permanent stirring, followed by the use of centrifugal filter units for concentration and purification of biological solutions (Amicon® Ultra-15, MWCO 30,000 kDa, Millipore). The filtration product was separated into two equal aliquots, for specific downstream applications regarding DNA and proteins. Further purification of DNA was performed using the Animal and Fungi DNA Preparation Kit (Jena Bioscience, Jena, Germany) according to manufacturer’s specifications, resulting in a final total amount of 50 μl DNA extract.
Both hypervariable segments (HVS-I and HVS-II) of the mitochondrial genome were amplified using four overlapping fragments that span each of the HVS, with an average size of 141 bps . Additionally, three primer pairs were used to amplify haplogroup (hg) diagnostic nucleotide positions (np) in the coding region (see S2 Table). PCR reactions were set up as follows: 2–4 μl DNA extract, 1.25 U MangoTaq™ DNA Polymerase (Bioline Reagents Ltd, London, UK), 1x MangoTaq™ Colored Reaction Buffer, 2.5 mM MgCl2, 0.2 mM dNTP mix, 0.5 μM each primer, and PCR-grade water (Jena Bioscience, Jena, Germany) up to 25 μl. The applied thermocycling conditions were: initial denaturation at 95°C for 5 min, 35 amplification cycles consisting of three steps (denaturation at 95°C for 30 sec, annealing at 46–56°C depending on the primer pair, for 30 sec and extension at 72°C for 30 sec) and final elongation at 72°C for 5 min. PCR products were checked on 1.5% agarose gel and then purified using FavorPrep GEL/ PCR Purification Kit (Favorgen Biotech Corp., Pingtung, Taiwan), according to the manufacturer’s protocol. All amplified target sequences were cloned (CloneJet PCR Cloning Kit, Thermo Scientific, Waltham, USA), and a minimum of six clones per amplicon was commercially sequenced (Macrogen Europe, Amsterdam, The Netherlands) using the standard primer pJET1.2R.
MtDNA sequences were aligned using BioEdit Sequence Alignment Editor v. 126.96.36.199  in order to establish consensus sequence and to highlight post-mortem damage and/or potential contamination. HVS-I (np range 16009–16390) and HVS-II (np range 57–357) mutational motifs and coding region polymorphic positions were inferred by comparison to the revised Cambridge Reference Sequence (rCRS: NC_012920.1) . Haplogroups were assigned using the HaploGrep2 web application, based on PhyloTree built 17 [29, 30].
For results authentication, a subset of samples (M2, M5, M9, M11, M15, M17) were independently processed in the Laboratory of Molecular Anthropology and Paleogenetics, University of Florence, Italy, a facility exclusively dedicated to ancient DNA investigations. Instead of using the classical method for the retrieval of genetic information from degraded samples, Next-Generation Sequencing (NGS) approach was used and the whole mitochondrial genome of the individuals was reconstructed.
DNA extraction was carried out using a commercial silica columns protocol designed to also recover very short DNA fragments, dominant in ancient samples . Prior to DNA extraction, all dental pieces were cleaned by removing the outer surface with a micro-drill and UV-irradiation for 45 minutes in a cross-linker (UVILink CL 508G, UVITec, UK) and subsequently ground into a fine powder by a dental drill. Twenty microliters of each extract were then converted in a double-stranded and double-index Illumina library without enzymatic damage repair in order to preserve the damage patterns of the DNA fragments . All libraries were amplified to reach the plateau and human mtDNA molecules were selected by capture with homemade probes . Negative controls were set up during each experimental stage. High-throughput DNA sequencing was carried out at the Institute of Biomedical Technologies, National Research Council, in Milan. For this purpose, the Illumina MySeq platform was used and the sequencing was run as paired-end with 2 × 75 + 8 + 8 cycles.
A specific pipeline developed for the analysis of aDNA NGS data was used to retrieve the genetic information of the individuals from Capidava necropolis and to test for the authenticity of the obtained sequences. Primary data analysis was completed by means of the SeqPrep tool : adapter sequences were trimmed and paired-end reads were merged into single sequences with a minimum overlap of 10 base pairs (bp), in order to exclude all sequences longer than 140 bp. Only reads with a minimum length of 30 bp were kept. Filtered reads were mapped against the rCRS using Burrows-Wheeler-Aligner (BWA) ; reads with mapping quality below 30 were discarded and PCR duplicates were removed by rmdup in SAMtools . For all samples, a probabilistic iterative approach, Schmutzi , was used to call the endogenous consensus sequences and to estimate the contamination levels. Misincorporation patterns were tracked and quantified by mapDamage 2.0 . The latter one was also executed for the reconstruction of endogenous mitochondrial genomes. Classification of complete mtDNA sequences according to haplogroup nomenclature was achieved using HaploFind .
Molecular analyses for another subset of samples (M1, M3, M4, and M5) were replicated by another independent research group at the Department of Genetics, Physical Anthropology and Animal Physiology, University of the Basque Country UPV/EHU, Bizkaia, Spain. DNA was isolated from four teeth using the protocol based on guanidine thiocyanate and silica columns . The extraction session involved two contamination controls that were applied to the entire process, except no tooth was added. The extraction and preparation of the PCR were undertaken in a specific lab for aDNA, equipped with a positive pressure sterile chamber, located in a physically separated space from the laboratory where post-PCR processes are carried out. All the work surfaces were cleaned regularly with sodium hypochlorite and irradiated with UV light. Suitable disposable clothing was worn (lab coat, mask, gloves, and cap).
The processing of the ancient samples in the UPV/EHU laboratory involved the application of a series of strict criteria for the authentication of results, as detailed by several authors [21, 41]. Sequencing of HVS-I (np 15998–16400) was undertaken in six overlapping fragments, each with a length of approximately 100 bp . The amplification of each fragment was undertaken in independent PCRs. In the case of positive amplification and the absence of contamination, the PCR products were purified by ExoSAP-IT (USB Corporation, Cleveland, USA), with subsequent sequencing in an ABI310 automatic sequencer using chemistry based on BigDye 1.1 (Life Technology). The results obtained were edited with BioEdit software (http://www.mbio.ncsu.edu/BioEdit/bioedit.html) and the sequences were aligned manually .
Sequences corresponding to each medieval individual from Capidava were submitted to NCBI GenBank under the accession numbers MF320129-MF320142 and MF597774-MF597782.
Reference population data sets
MtDNA profiles of medieval individuals from Capidava were compared with a data set of 15368 contemporary mtDNA sequences of the HVS-I region comprising European, Near Eastern and Asian populations. Another data set consisting of 495 medieval sequences corresponding to 13 populations originating from Europe and one from Asia was compiled in order to evaluate genetic relationships between the population analyzed in this study and other medieval populations. All sequence data were assembled from available published data, as listed in the electronic supplementary material S5 and S8 Tables.
Population genetic analysis
Basic statistical methods were implemented for comparison and calculations of genetic distances between the examined population and other 14 ancient and 35 present-day populations. Diversity indices for the Romanian medieval population (consisting of 10 successfully typed individuals) were calculated in DNASP v5  using both HVS-I (np 16009–16390) and HVS-II (np 37–357) sequences. Only 8 of the 11 individuals reported in this study were included in the population genetics analysis; reasons for this decision are outlined in the observation column of S3 Table.
Principal Component Analysis (PCA) was performed based on mtDNA haplogroup frequencies of medieval and modern populations (S5 and S8 Tables). Due to discrepancies in the phylogenetic resolution of the reference population’s mtDNA (haplogroup assignment based on older Phylotree versions) the haplogroup status in the case of these populations was updated, making them comparable to the genetic data reported in this study. In the PCA of the 15 medieval populations, 26 mtDNA haplogroups were considered (A, B, C/F/G, D, H, HV, I, J, JT, K, M, N1, N9, R0, R1, T, U, U2, U3, U4, U5a, U5b, U8, V, W, and X), while in the PCA of the 35 contemporary Eurasian populations, 36 mtDNA haplogroups were considered (A, B, C, D, F, G, H, HV, HV0, I, J, K, L, M, N, N9, R, T, T1, T2, U, U1, U2, U3, U4, U5a, U5b, U6, U7, U8, V, W, X, Y, Z, and Other). All PCAs were computed using the prcomp function of the built-in R stats package, R version 3.3.2 , and plotted in a two-dimensional space, displaying the first two principal components (PCs). In the case of the PCA comparing the Capidava medieval population to the modern-day populations, another graph displaying the first and third principal components was generated.
All PCs derived from the medieval PCA were used for hierarchical clustering, using Ward’s agglomerative method  and Euclidean distance measurement. The significance of each cluster was given as an approximately unbiased (AU) p-value, as a percentage, that was calculated using pvclust function  with 10000 bootstrap replicates. The resulted arrangement of the clusters was illustrated as a dendrogram in R 3.3.2 .
Inter-population comparisons were performed using the Arlequin software, version 188.8.131.52 . Pairwise genetic distances (FST) among populations were computed based on control region haplotypes of uniform sequence length, ranging from nucleotide position 16050 to 16383. Cytosine insertions at position 16193 were excluded from statistical analyses and all comparisons. The best fitting evolutionary model and the estimate of the gamma parameter were inferred from jModeltest 2.1.10 [47, 48]. Pairwise FST values were computed for both medieval and modern data sets, assuming a Tamura & Nei substitution model , 10000 permutations and an allowed level of missing data of 0.05 (S6 and S9 Tables). The linearized Slatkin FST values and significant p-values of the ancient populations (S6 Table) were visualized on a levelplot, whereas the matrix of linearized Slatkin FST values of modern populations (S9 Table) was displayed on a multi-dimensional scaling (MDS) plot represented in a two-dimensional space using metaMDS function based on Euclidean distances implemented in the vegan library of R 3.3.2 [43, 50].
Shared haplotype analysis (SHA) was performed in order to detect and compare the mtDNA haplotypes shared between the 15 medieval populations, by counting the absolute and relative shared haplotypes (S7 Table).
Results and discussion
MtDNA sequences obtained in all three laboratories during this study (HVS-I and HVS-II in Romanian lab: M1, M2, M3, M4, M5, M6, M8, M9, M11, M15 and M17; HVS-I Spanish lab: M1, M3, M4 and M5 and Italian lab: M2, M5, M9, M11, M15 and M17), originating from the medieval Capidava necropolis are detailed in S3 Table. Only one of the individuals, M5, has a haplotype that matches the mtDNA profile of a researcher implicated in the experimental procedure, belonging to the common European J haplogroup. The only observed difference between M5 and researcher involves the absence of mutation 228A for M5, explained by the lack of amplification of segment ranging from np 169 to np 273 in the ancient sample. Even though the common mutations between M5 and researcher might be a coincidence, the fact that successful results were not obtained in either of the two additional laboratories in which M5 was processed indicates that it is prudent not to consider this sequence in the following statistical analysis. HVS-I of M1 and M3 was successfully replicated by the Spanish research group. In case of 5 individuals (M2, M9, M11, M15, and M17) the complete mitochondrial genome was reconstructed using high-throughput sequencing. All resulted sequences reach standard quality requested to guaranty the reliability of the NGS data (S4 Table). Whole mtDNA sequence data shows that all control region SNPs were correctly identified with PCR based methods. The DNA of the 3 remaining individuals (M1, M4, and M8) was extracted following two distinct protocols from different dental samples, most of the HVS-I fragments being amplified at least twice per each individual. All sequences that were replicated, along with the typed coding region positions, negative control results and the presence of characteristic post-mortem DNA damage in all cloned mtDNA inserts suggest the assigned haplotypes are authentic.
The 10 sequences that were successfully retrieved for the Capidava skeletal remains, spanning positions 37–357 and 16009–16390 belong to 8 haplotypes (haplotype diversity of 0.956). The unstable length polymorphisms associated with poly-C stretches between sites 302–315 has been ignored. Identical haplotypes were identified in two groups of two individuals derived from neighboring burials. One group of adult individuals that might be maternally related (M3 and M4) belong to hg R0a2’3, even though a first attempt to characterize their mitochondrial genetic material yielded a faulty, at that stage unverified assignment . The other group consists of two subadults that possess the same haplotype (M9 and M11) belonging to hg N9a (S1 Fig). Due to the low frequencies of these haplogroups among ancient and modern populations, and the particular archaeological context, it is very unlikely that identical haplotypes occurred together by chance, given that cross sample contamination was excluded. The high number of intra-cemetery maternal relations relative to the population size suggests that family connections were defining parameters of spatial burial distribution. One possible scenario is that mobile groups of people that reached Dobruja settled for at least a few generations in the vicinity of Capidava fortress, burying their deceased in allotted familial cemetery plots. In order to avoid overrepresentation of mtDNA lineages due to potential family relationships, duplicate sequences were removed from the subsequent statistical analysis. This resulted in a medieval population consisting of 8 sequences with distinct haplotypes. Observation of diverse mitochondrial haplogroups in small medieval populations is prone to stochastic events.
The direct radiocarbon dating performed on the M4 human bones placed the remains at 880 and 990 cal. AD (Beta Analytic, Florida, USA) which corresponds to the age estimated according to the archaeological record. The medieval Capidava population analyzed here has a predominantly Western Eurasian haplogroup composition (H, R0, U3, U5, and V), the Eastern Eurasian component being represented by hg N9a with a frequency of 12.5% (S5 Table). In contrast to the other medieval populations analyzed in this study, the N9a haplotype is distinguished solely in the gene pool of the Capidava population. Characteristic N9a HVS-I polymorphisms have been identified in three human archaeological remains from the Carpathian Basin (Hungary), initially identified as early Neolithic (then revised as Sarmatian for two of the three samples, and Hungarian Conquest period for the third one) [52, 53]. Identical HVS-I haplotype was also found in an archaeological specimen (female) from Krasnoyarsk region, Siberia, dated to the Iron Age (100–400 AD) . The presence of the N9a maternal lineage in the gene pool of medieval representatives from the Southeastern region of the extra Carpathian Basin (Dobruja) supports the idea of a “westward genetic influence of nomads from eastern Central Asia” . N9a frequencies in modern European populations are extremely low and it is mainly found in the Volga-Ural region Tatars (~1%), Czechs (0.6%) and Russians from Southwestern Russia (1.5%), all potential donor or acceptor spaces for our population [56–58]. In our dataset of modern populations (S8), the highest frequencies (0.61%-1.61%) of N9 can be identified in three populations (KAZ, KGZ, and UZB) from Central Asia. Lower frequencies (0.13%-0.28%) of this maternal lineage can be observed in Eastern Europe (CZE, UKR) and in Southern Europe (ESP). The most frequent hg detected in Capidava population is H. This is the dominant maternal lineage among modern-day Europeans, encompassing over 40% of the total mtDNA variation in most of Europe . However, its frequency observed in the analyzed medieval population is lower than it is in most contemporary European populations and in some historical Romanian provinces, other than Dobruja and Moldavia [8, 15]. It is important to note that the small sample size only allowed for rough comparisons.
Besides confirming the polymorphism pattern uncovered in the control region, the complete mitogenomic data of five individuals allows for the refinement of haplogroup assignment that was initially described by the hypervariable regions mutational motifs. According to the high-resolution mitogenomic data, M2 sample belongs to V1a maternal lineage, a typical Western Eurasian V sub-clade, that is currently distributed, based on complete modern mitochondrial data, across all over Europe and European Russia. Information from whole mtDNA sequencing reveals that this medieval sample matches two other ancient samples, one Early Bronze Age specimen from Germany (MF498721)  and a Neolithic one from Hungary (EBVO5a) . The haplotype of M2 contains all of the expected V1a defining variants as well as three personal transitions (G9139A, T11089C, and C16266T). To our knowledge, none of these three polymorphisms were observed in conjunction with any published V complete mitogenome. As such, it occupies a nodal position within V1a, similarly to the two ancient published mitogenomes, and present-day samples from Western Europe. NGS data ascertain the fact that M9 and M11 have identical haplotypes, and due to the presence of the T2287C transition, they can be further classified as N9a9. Both samples harbor two private mutations (G228T and A15799G) which can be identified along with diagnostic motifs in two modern mitogenomes: FJ147307 , belonging to an individual from the North-East Altai, the other being associated with an individual from Kyrgyzstan KX675302 . The common H control region haplotypes of M15 and M17 can be also further narrowed using the information from complete mitochondrial sequences. Sample M15 contains all defining mutations for H13a1a3, a descendant sub-lineage of the widely spread Western Eurasian H13 haplogroup. The H13a1a3 sub-branch currently includes only three modern maternally inherited genomes, two from Russia (KC911290  and JQ701842 ) and one from Poland (JQ703193 ). Of them, the KC911290 sample from Belgorod region in Russia resembles the maternal signature of M15, having a genetic profile without private mutations. The matrilineal variability observed in the complete sequence of M17 can be classified as H5e1a1 and shows the presence of private mutation A13731G. This rare polymorphism (only 21 GenBank records in MITOMAP ; 0.049% when only full-length mitochondrial sequences are considered) is not presently associated with any H5e1a1 published mitogenome. This very derivative sub-branch currently includes only three modern whole mtDNA genomes, two from the European part of Russia (JX128081 and JX128091 ) and one from Denmark (KF161669). The further classification into V1a, H13a1a3, and H5e1a1 for the three Capidava samples points out to links much more geographically restricted to Central and Eastern Europe. This shows that even though few inferences regarding the genetic relations of the analyzed samples with modern and ancient mitogenomes can be made using only the control region, precise and direct connections among closely related mitogenomes can be identified based on high-resolution mitochondrial DNA data. This fact is even more conspicuous when N9a9 samples are taken into consideration. The genetic connection to Central Asia, already shown by the control region data, is now even more straightforward as both M9 and M11 share their two private mutations (G228T and A15799G) with two modern individuals, one belonging to the Tubalar ethnic group (Altai Republic) and one belonging to the Kyrgyz ethnic group.
The PCAs of ancient and modern-day populations were computed based on mtDNA haplogroup frequencies (S5 and S8 Tables). The PCA of the 15 medieval populations considered here shows a roughly clustering between the investigated population and the Cumanians from Hungary along the PC1 and PC2 components (34.82% of the variance is displayed) (Fig 2). This assembly is fairly distantly positioned with respect to the remaining medieval European populations, being displayed at the opposite pole on PC2 from the Avars and the Hungarian conquerors. The farthest population along the PC1 component is the medieval population of Cumanians, culturally Asian steppe nomadic immigrants. Close genetic ties are shown between populations derived from same geographical areas, but this pattern is observed only for the medieval populations from Southern Europe and those from Northern Europe. Such connections were also previously noted by Csákyová et al . Medieval populations from Eastern Europe are more dispersed within the PCA plot, the Romanian population being the most distant, fact which might be explained by a different hg composition, a potential sentinel signal for intense population movement in the area. For example, the genetic architecture of 13 proto-Bulgarians (BGR-med) shows no evidence of Central Asian maternal lineages, while the Western Eurasian maternal lineages do not match, with the exception of hg U3 and hg H, the mtDNA haplogroups found in Capidava . A better understanding of the genetic links between past populations inhabiting modern Bulgarian and Romanian territories could be provided, in the future, by increasing the number of analyzed samples. Hierarchical clustering shows a very similar diagram to the PCA and reveals the clustering of the medieval Romanian population with the Cumanians from Hungary, a fact which can be explained by the interplay of all PCs in this analysis (S2).
The PCA based on mtDNA haplogroup frequencies of the 15 medieval populations shows a roughly clustering of the ancient populations from Romania (ROU-med) and medieval Cumanians from Hungary (HUN-Cum). The other medieval populations from Southern Europe are clustered together, as are those from the North of Europe. The abbreviations, references and haplogroup frequencies are presented in S5 Table.
The PCA plot of the contemporary Eurasian populations and the Capidava population shows the agglomeration of the European ones on the PC1 (23.38% variance) and PC2 (15.73% variance) components, in contrast to modern Asian populations which are scattered (Fig 3). Along PC1, the medieval Romanian population is more distant from the modern European groups than the Asiatic ones, but the contribution of the second PC shows that its connections to the latter populations are not so tight. Of the European populations, those that seem to have a higher genetic affinity with the one analyzed in this study, are mainly the Slavic populations from Eastern Europe, as well as those originating in the Northern Europe. Present-day Romanians inhabiting the historical province of Moldavia, an adjacent (to the North) region to Dobruja, displays closer ties to our medieval samples, indicating that this could have been a possible dispersal direction. Nevertheless, when PC3 is displayed (12.03% variance) the medieval Romanian population appears isolated from the rest of all modern-day populations due to dissimilarities induced by the haplogroups U5a, V, U3, N9 and R frequencies (S3).
The PCA is based on mtDNA haplogroup frequencies of the medieval population from Romania and 35 modern populations from Eurasia and shows PC1 and PC2. The haplogroup frequencies and population information are listed in S8 Table.
Pairwise genetic distances were computed for 14 medieval populations versus the investigated medieval Capidava population and, respectively, a set of present-day populations from Eurasia. When compared to other populations dating from similar historical periods, the medieval population from the current Southeastern territory of Romania showed non-significant differences from all populations (p>0.05), the smallest p-values, associated to the highest FST values, being observed in medieval population from Slovakia (SVK-med) with p = 0.08336±0.0028 and FST = 0.05047 and in medieval Italians (ITA-med) with p = 0.12999±0.0036and FST = 0.04055 (S6 Table). The linearized Slatkin population differentiation (FST) values and significant p values of ancient populations are visualized on the Fig 4 level plot.
Lower left corner: significant p values (< 0.05) are indicated in green. Upper right corner: larger Slatkin FST values indicating greater genetic distances are marked by dark red shades. The exact FST and p values and population information are indicated in S6 Table.
The medieval Romanian population showed the lowest genetic distances from present-day Slavic populations from Eastern Europe (BLR, UKR, RUS, SVN, and SVK) and also from a modern Turkish population (TUR), while the highest distances were noted relative to modern populations from Central and Southern Asia, as well as to current inhabitants of Dobruja. In either of these cases, the values were not significant (p>0.05) which is not quite surprising given the small sample size of the analyzed population and the different and very diverse haplotype composition. The exact FST values and their corresponding p-values are listed in the S9 Table. These genetic distances were visualized by displaying the linearized Slatkin FST values on a multi-dimensional scaling plot (Fig 5, S9 Table). This illustrates a similar pattern to the one reflected by the PCA, showing the aggregation of most of the modern European populations along the first two coordinates, whereas most of the Asian populations are disconnected. Of the Western Asian populations, the one from Turkey shows stronger affinities to the modern European ones and also to the Capidava population, a finding that can be somewhat explained by the prolonged interactions between the people from these regions (Turkic groups to which also the Cumanians and Pechenegs belong).
Stress value is 0.1323 and non-metric fit (R2) is 0.982, values that highlight a good fit between the two-dimensional graph and the original distance matrix. The linearized Slatkin FST values and population information are presented in the S9 Table.
The SHA performed on the medieval populations reveals a distinct result from the FST calculation, showing that the medieval population from Romania shares most of the haplotypes with medieval populations from Spain and Italy (S4 Fig, S7 Table). This high proportion of lineage sharing between these populations, probably occurred due to the presence of the high percentage of the most widely distributed mtDNA haplotypes, H lineages, in modern populations of Eastern Eurasia, being therefore phylogeographically uninformative in this instance due to small data set and low HVS-I resolution. The strength of this apparent genetic affinity to medieval pools from Southern Europe is significantly diminished when high-resolution mitochondrial data is considered, as seen in the three Capidava samples (V1a, H13a1a3, and H5e1a1) that display strong, direct links to Central and Eastern Europe.
The Capidava medieval sample analyzed here displays a very high haplogroup/haplotype diversity, which in conjunction with the very large difference when compared to the local modern population may indicate a very dynamic genetic environment: intense local population turnover. Genetic duplicates in adjacent burial plots highlight possible familial affiliations, characteristic for Christian cemeteries.
Low-resolution data obtained in this study suggest that in this particular case the haplogroups with low frequencies in modern populations (e.g. N9a) may be more informative than common genetic variants. The presence of the Central Asian N9a haplotype connected to ancient and conqueror Hungarian individuals, an ancient Siberian sample and modern Volga-Ural, Southwestern Russian, and Czech groups seem to place Capidava in a genetic landscape dominated by Turkic influences. Genetic affinities towards Central Asia are more straightforwardly illustrated by the high-resolution mitochondrial data as both M9 and M11 share their two private coding region variants with two modern individuals, one belonging to the Tubalar ethnic group (Altai Republic) and one belonging to the Kyrgyz ethnic group. The further classification of the common “European” lineages (H and V) into H5e1a1, H13a1a3, and V1a points out to genetic connections much more restricted to Central and Eastern Europe, decreasing thus the strength of the apparent link to medieval populations from Southern Europe which emerged from the analysis of low-resolution H haplotypes. Therefore, the phylogenetic and phylogeographic interpretation of the complete mitochondrial data, generated by NGS, counteracts to some extent the effects of the small sample size, emphasizing very precise and direct links among closely related old and modern mitogenomes.
Currently, different geographic regions and historical periods are characterized by imbalanced genetic datasets, be it due to number of samples/region or due to differences in resolution of genetic targets, thus limiting the establishment of strong unequivocal analogies. Given our small data set, the main importance of the present study consists in supplying a list of mitochondrial variants for a space and time completely lacking information, a situation similar to many other European medieval populations. In the age of NGS technology, a critical mass of data has to be reached in order to permit future more thorough phylogenetically, phylogeographically and demographically informative comparisons.
S1 Fig. Distribution of the graves within the Capidava necropolis.
The colors of the graves symbolize the anthropological sex, age determination, and samples selected for molecular analysis.
S2 Fig. Ward type hierarchical clustering of the medieval populations.
Percentual AU p-values are given as red numbers on the dendogram.
S3 Fig. PCA plot of the investigated medieval and present-day populations.
The PCA is based on mtDNA haplogroup frequencies of the medieval population from Romania and 35 modern-day populations from Eurasia, and shows PC1 and PC3. The haplogroup frequencies and population information are listed in the S8 Table.
S4 Fig. Levelplot of shared haplotype analysis (SHA).
The levelplot is based on the percentage values of the relative shared haplotypes, also shown in the figure. The absolute values and the population information are given in S7 Table.
S1 Table. Information regarding the osteological and pathological analysis of the samples.
S2 Table. Details of primer sets used for PCR.
S3 Table. MtDNA results obtained in three different laboratories.
S4 Table. Bioinformatics analyses output for NGS data.
S5 Table. MtDNA haplogroup frequencies of medieval populations.
S6 Table. FST values, p values and Slatkin FST matrix of the medieval populations.
S7 Table. Shared haplotype analysis (SHA) of the medieval populations.
S8 Table. MtDNA haplogroup frequencies of modern-day Eurasian populations.
S9 Table. FST values, p values and Slatkin matrix of the analyzed and modern populations.
We are grateful to Prof. Concepcion de la Rua for providing access to the ancient DNA laboratory from University of the Basque Country (UPV/EHU) in order to perform the replication of results.
- 1. Barnea I, Ştefănescu Ş. Din istoria Dobrogei: Bizantini, romani şi bulgari la Dunărea de Jos: Editura Academiei Republicii Populare Române; 1971.
- 2. Madgearu A. Byzantine military organization on the Danube, 10th-12th Centuries. Leiden: Brill; 2013.
- 3. Florescu G. Capidava în epoca migraţiilor. RIR. 1946; XVI(4): 325–343.
- 4. Pinter ZK, Dobrinescu CI, Dragotă A, Kelemen B. Preliminary Research in Capidava Medieval Necropolis (Topalu com., Constanţa County). Pontica. 2011; 44: 387–400.
- 5. Modi A, Tassi F, Susca RR, Vai S, Rizzi E, Bellis G, et al. Complete mitochondrial sequences from Mesolithic Sardinia. Sci Rep. 2017; 7: 42869. pmid:28256601.
- 6. Posth C, Renaud G, Mittnik A, Drucker DG, Rougier H, Cupillard C, et al. Pleistocene Mitochondrial Genomes Suggest a Single Major Dispersal of Non-Africans and a Late Glacial Population Turnover in Europe. Curr Biol. 2016; 26(6): 827–833. pmid:26853362.
- 7. Ning C, Gao S, Deng B, Zheng H, Wei D, Lv H, et al. Ancient mitochondrial genome reveals trace of prehistoric migration in the east Pamir by pastoralists. J Hum Genet. 2016; 61(2): 103–108. pmid:26511065.
- 8. Brotherton P, Haak W, Templeton J, Brandt G, Soubrier J, Jane Adler C, et al. Neolithic mitochondrial haplogroup H genomes and the genetic origins of Europeans. Nat Commun. 2013; 4: 1764. pmid:23612305.
- 9. Hervella M, Izagirre N, Alonso S, Ioana M, Netea MG, de-la-Rua C. The Carpathian range represents a weak genetic barrier in South-East Europe. BMC Genet. 2014; 15: 56. pmid:24885208.
- 10. Richards M, Macaulay V, Hickey E, Vega E, Sykes B, Guida V, et al. Tracing European founder lineages in the Near Eastern mtDNA pool. Am J Hum Genet. 2000; 67(5): 1251–1276. pmid:11032788.
- 11. Turchi C, Stanciu F, Paselli G, Buscemi L, Parson W, Tagliabracci A. The mitochondrial DNA makeup of Romanians: A forensic mtDNA control region database and phylogenetic characterization. Forensic Sci Int Genet. 2016; 24: 136–142. pmid:27414754.
- 12. Bosch E, Calafell F, Gonzalez-Neira A, Flaiz C, Mateu E, Scheil HG, et al. Paternal and maternal lineages in the Balkans show a homogeneous landscape over linguistic barriers, except for the isolated Aromuns. Ann Hum Genet. 2006; 70(Pt 4): 459–487. pmid:16759179.
- 13. Brandstatter A, Egyed B, Zimmermann B, Duftner N, Padar Z, Parson W. Migration rates and genetic structure of two Hungarian ethnic groups in Transylvania, Romania. Ann Hum Genet. 2007; 71(Pt 6): 791–803. pmid:17532745.
- 14. Mendizabal I, Lao O, Marigorta UM, Wollstein A, Gusmao L, Ferak V, et al. Reconstructing the population history of European Romani from genome-wide data. Curr Biol. 2012; 22(24): 2342–2349. pmid:23219723.
- 15. Cocos R, Schipor S, Hervella M, Cianga P, Popescu R, Banescu C, et al. Genetic affinities among the historical provinces of Romania and Central Europe as revealed by an mtDNA analysis. BMC Genet. 2017; 18(1): 20. pmid:28270115.
- 16. Hervella M, Rotea M, Izagirre N, Constantinescu M, Alonso S, Ioana M, et al. Ancient DNA from South-East Europe Reveals Different Events during Early and Middle Neolithic Influencing the European Genetic Heritage. PLoS One. 2015; 10(6): e0128810. pmid:26053041.
- 17. Fu Q, Hajdinjak M, Moldovan OT, Constantin S, Mallick S, Skoglund P, et al. An early modern human from Romania with a recent Neanderthal ancestor. Nature. 2015; 524(7564): 216–219. pmid:26098372.
- 18. Buikstra JE, Ubelaker DH. Standards for data collection from human skeletal remains: proceedings of a seminar at the Field Museum of Natural History: Arkansas Archeological Survey; 1994.
- 19. Steckel RH, Larsen CS, Sciulli PW, Walker PL. Data Collection Codebook for the Global History of Health Project 2011.
- 20. Gilbert MT, Bandelt HJ, Hofreiter M, Barnes I. Assessing ancient DNA studies. Trends Ecol Evol. 2005; 20(10): 541–544.
- 21. Paabo S, Poinar H, Serre D, Jaenicke-Despres V, Hebler J, Rohland N, et al. Genetic analyses from ancient DNA. Annu Rev Genet. 2004; 38: 645–679. pmid:15568989.
- 22. Poinar HN, editor The top 10 list: criteria of authenticity for DNA from ancient and forensic samples. International Congress Series; 2003: Elsevier.
- 23. Yang DY, Watt K. Contamination controls when preparing archaeological remains for ancient DNA analysis. Journal of Archaeological Science. 2005; 32(3): 331–336. WOS:000227845600002.
- 24. Pilli E, Modi A, Serpico C, Achilli A, Lancioni H, Lippi B, et al. Monitoring DNA contamination in handled vs. directly excavated ancient human skeletal remains. PLoS One. 2013; 8(1): e52524. pmid:23372650.
- 25. Yang DY, Eng B, Waye JS, Dudar JC, Saunders SR. Technical note: Improved DNA extraction from ancient bones using silica-based spin columns. Am J Phys Anthropol. 1998; 105(4): 539–543. pmid:9584894
- 26. Gabriel MN, Huffine EF, Ryan JH, Holland MM, Parsons TJ. Improved MtDNA sequence analysis of forensic remains using a "mini-primer set" amplification strategy. J Forensic Sci. 2001; 46(2): 247–253. Epub 2001/04/18. pmid:11305426.
- 27. Hall TA, editor BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic acids symposium series; 1999: [London]: Information Retrieval Ltd., c1979-c2000.
- 28. Andrews RM, Kubacka I, Chinnery PF, Lightowlers RN, Turnbull DM, Howell N. Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nat Genet. 1999; 23(2): 147. pmid:10508508.
- 29. van Oven M. PhyloTree Build 17: Growing the human mitochondrial DNA tree. Forensic Sci Int Genet Suppl Ser. 2015; 5: e392–e394.
- 30. Weissensteiner H, Pacher D, Kloss-Brandstätter A, Forer L, Specht G, Bandelt HJ, et al. HaploGrep 2: mitochondrial haplogroup classification in the era of high-throughput sequencing. Nucleic Acids Res. 2016; 44(W1): W58–63. pmid:27084951.
- 31. Dabney J, Knapp M, Glocke I, Gansauge MT, Weihmann A, Nickel B, et al. Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments. Proc Natl Acad Sci U S A. 2013; 110(39): 15758–15763. pmid:24019490.
- 32. Meyer M, Kircher M. Illumina sequencing library preparation for highly multiplexed target capture and sequencing. Cold Spring Harb Protoc. 2010; 2010(6): pdb prot5448. pmid:20516186.
- 33. Maricic T, Whitten M, Paabo S. Multiplexed DNA sequence capture of mitochondrial genomes using PCR products. PLoS One. 2010; 5(11): e14004. pmid:21103372.
- 34. John JS. SeqPrep. 2011.
- 35. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009; 25(14): 1754–1760. pmid:19451168
- 36. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009; 25(16): 2078–2079. pmid:19505943.
- 37. Renaud G, Slon V, Duggan AT, Kelso J. Schmutzi: estimation of contamination and endogenous mitochondrial consensus calling for ancient DNA. Genome Biology. 2015; 16: 224. pmid:26458810.
- 38. Jonsson H, Ginolhac A, Schubert M, Johnson PL, Orlando L. mapDamage2.0: fast approximate Bayesian estimates of ancient DNA damage parameters. Bioinformatics. 2013; 29(13): 1682–1684. pmid:23613487.
- 39. Vianello D, Sevini F, Castellani G, Lomartire L, Capri M, Franceschi C. HAPLOFIND: a new method for high-throughput mtDNA haplogroup assignment. Hum Mutat. 2013; 34(9): 1189–1194. pmid:23696374.
- 40. Maca-Meyer N, Cabrera VM, Arnay M, Flores C, Fregel R, Gonzalez AM, et al. Mitochondrial DNA diversity in 17th-18th century remains from Tenerife (Canary Islands). Am J Phys Anthropol. 2005; 127(4): 418–426. pmid:15714457.
- 41. Gilbert MT, Willerslev E. Authenticity in ancient DNA studies. Med Secoli. 2006; 18(3): 701–723. pmid:18175618.
- 42. Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009; 25(11): 1451–1452. pmid:19346325.
- 43. R Development Core Team. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2016.
- 44. Ward JH. Hierarchical grouping to optimize an objective function. J Am Stat Assoc. 1963; 58(301): 236–244.
- 45. Suzuki R, Shimodaira H. Hierarchical Clustering with P-Values via Multiscale Bootstrap Resampling. R package version 2.0–0; 2015.
- 46. Excoffier L, Lischer HE. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010; 10(3): 564–567. pmid:21565059.
- 47. Darriba D, Taboada GL, Doallo R, Posada D. jModelTest 2: more models, new heuristics and parallel computing. Nature Methods. 2012; 9: 772. pmid:22847109
- 48. Guindon S, Gascuel O. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 2003; 52(5): 696–704. pmid:14530136.
- 49. Tamura K, Nei M. Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol Biol Evol. 1993; 10(3): 512–526. pmid:8336541.
- 50. Oksanen J, Blanchet FG, Kindt R, Legendre P, Minchin PR, O’hara R, et al. Package ‘vegan’. Community ecology package, version. 2013; 2(9).
- 51. Rusu I, Radu C, Oltean A, Dobrinescu C, Kelemen B. Analysis of Kinship Using Mitochondrial DNA: A Case Study from a 10th Century Medieval Population in Capidava (Constanţa, Romania). Ann Roum Anthropol. 2014; 51: 11–19.
- 52. Guba Z, Hadadi E, Major A, Furka T, Juhasz E, Koos J, et al. HVS-I polymorphism screening of ancient human mitochondrial DNA provides evidence for N9a discontinuity and East Asian haplogroups in the Neolithic Hungary. J Hum Genet. 2011; 56(11): 784–796. pmid:21918529.
- 53. Banffy E, Brandt G, Alt KW. 'Early Neolithic' graves of the Carpathian Basin are in fact 6000 years younger-appeal for real interdisciplinarity between archaeology and ancient DNA research. J Hum Genet. 2012; 57(7): 467–469; author reply 470–461. pmid:22673687.
- 54. Keyser C, Bouakaze C, Crubezy E, Nikolaev VG, Montagnon D, Reis T, et al. Ancient DNA provides new insights into the history of south Siberian Kurgan people. Hum Genet. 2009; 126(3): 395–410. pmid:19449030.
- 55. Pilipenko AS, Cherdantsev SV, Trapezov RO, Zhuravlev AA, Babenko VN, Pozdnyakov DV, et al. Mitochondrial DNA diversity in a Transbaikalian Xiongnu population. Archaeol Anthropol Sci. 2017: 1–14.
- 56. Bermisheva M, Tambets K, Villems R, Khusnutdinova E. Diversity of mitochondrial DNA haplotypes in ethnic populations of the Volga-Ural region of Russia. Mol Biol. 2002; 36(6): 990–1001. pmid:12500536.
- 57. Malyarchuk BA, Vanecek T, Perkova MA, Derenko MV, Sip M. Mitochondrial DNA variability in the Czech population, with application to the ethnic history of Slavs. Hum Biol. 2006; 78(6): 681–696. pmid:17564247.
- 58. Pshenichnov A, Balanovsky O, Utevska O, Metspalu E, Zaporozhchenko V, Agdzhoyan A, et al. Genetic affinities of Ukrainians from the maternal perspective. Am J Phys Anthropol. 2013; 152(4): 543–550. pmid:24122717.
- 59. Roostalu U, Kutuev I, Loogvali EL, Metspalu E, Tambets K, Reidla M, et al. Origin and expansion of haplogroup H, the dominant human mitochondrial DNA lineage in West Eurasia: the Near Eastern and Caucasian perspective. Mol Biol Evol. 2007; 24(2): 436–448. pmid:17099056.
- 60. Knipper C, Mittnik A, Massy K, Kociumaka C, Kucukkalipci I, Maus M, et al. Female exogamy and gene pool diversification at the transition from the Final Neolithic to the Early Bronze Age in central Europe. Proc Natl Acad Sci U S A. 2017; 114(38): 10083–10088. pmid:28874531.
- 61. Lipson M, Szecsenyi-Nagy A, Mallick S, Posa A, Stegmar B, Keerl V, et al. Parallel palaeogenomic transects reveal complex genetic history of early European farmers. Nature. 2017; 551(7680): 368–372. pmid:29144465.
- 62. Sukernik RI, Volodko NV, Mazunin IO, Eltsov NP, Dryomov SV, Starikovskaya EB. Mitochondrial genome diversity in the Tubalar, Even, and Ulchi: contribution to prehistory of native Siberians and their affinities to Native Americans. Am J Phys Anthropol. 2012; 148(1): 123–138. pmid:22487888.
- 63. Marchi N, Hegay T, Mennecier P, Georges M, Laurent R, Whitten M, et al. Sex-specific genetic diversity is shaped by cultural factors in Inner Asian human populations. Am J Phys Anthropol. 2017; 162(4): 627–640. pmid:28158897.
- 64. Derenko M, Malyarchuk B, Bahmanimehr A, Denisova G, Perkova M, Farjadian S, et al. Complete mitochondrial DNA diversity in Iranians. PLoS One. 2013; 8(11): e80673. pmid:24244704.
- 65. Behar DM, van Oven M, Rosset S, Metspalu M, Loogvali EL, Silva NM, et al. A "Copernican" reassessment of the human mitochondrial DNA tree from its root. Am J Hum Genet. 2012; 90(4): 675–684. pmid:22482806.
- 66. Lott MT, Leipzig JN, Derbeneva O, Xie HM, Chalkia D, Sarmady M, et al. mtDNA Variation and Analysis Using Mitomap and Mitomaster. Curr Protoc Bioinformatics. 2013; 44: 1.23.21–26. pmid:25489354.
- 67. Mielnik-Sikorska M, Daca P, Malyarchuk B, Derenko M, Skonieczna K, Perkova M, et al. The history of Slavs inferred from complete mitochondrial genome sequences. PLoS One. 2013; 8(1): e54360. pmid:23342138.
- 68. Csákyová V, Szécsényi-Nagy A, Csősz A, Nagy M, Fusek G, Langó P, et al. Maternal Genetic Composition of a Medieval Population from a Hungarian-Slavic Contact Zone in Central Europe. PLoS One. 2016; 11(3): e0151206. pmid:26963389.
- 69. Nesheva DV, Karachanak-Yankova S, Lari M, Yordanov Y, Galabov A, Caramelli D, et al. Mitochondrial DNA Suggests a Western Eurasian Origin for Ancient (Proto-) Bulgarians. Hum Biol. 2015; 87(1): 19–28. pmid:26416319.