Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Genetic evidence for subspecies differentiation of the Himalayan marmot, Marmota himalayana, in the Qinghai-Tibet Plateau

  • Jingyan Yan ,

    Contributed equally to this work with: Jingyan Yan, Hongjian Chen

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliations Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, Qinghai, China, Qinghai Key Laboratory of Animal Ecological Genomics, Xining, Qinghai, China, University of Chinese Academy of Sciences, Beijing, China

  • Hongjian Chen ,

    Contributed equally to this work with: Jingyan Yan, Hongjian Chen

    Roles Conceptualization, Data curation, Funding acquisition, Project administration, Resources, Validation, Writing – review & editing

    Affiliation Qinghai Institute for Endemic Disease Prevention and Control, Xining, Qinghai, China

  • Gonghua Lin,

    Roles Conceptualization, Funding acquisition, Methodology, Validation, Writing – review & editing

    Affiliations Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, Qinghai, China, Qinghai Key Laboratory of Animal Ecological Genomics, Xining, Qinghai, China

  • Qian Li,

    Roles Investigation, Resources

    Affiliation Qinghai Institute for Endemic Disease Prevention and Control, Xining, Qinghai, China

  • Jiarui Chen,

    Roles Formal analysis, Visualization

    Affiliations Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, Qinghai, China, Qinghai Key Laboratory of Animal Ecological Genomics, Xining, Qinghai, China

  • Wen Qin,

    Roles Visualization

    Affiliations Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, Qinghai, China, Qinghai Key Laboratory of Animal Ecological Genomics, Xining, Qinghai, China

  • Jianping Su ,

    Roles Conceptualization, Funding acquisition, Methodology, Project administration, Supervision, Writing – review & editing

    jpsu@nwipb.cas.cn (JS); zhangtz@nwipb.cas.cn (TZ)

    Affiliations Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, Qinghai, China, Qinghai Key Laboratory of Animal Ecological Genomics, Xining, Qinghai, China

  • Tongzuo Zhang

    Roles Conceptualization, Funding acquisition, Project administration, Supervision, Writing – review & editing

    jpsu@nwipb.cas.cn (JS); zhangtz@nwipb.cas.cn (TZ)

    Affiliations Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, Qinghai, China, Qinghai Key Laboratory of Animal Ecological Genomics, Xining, Qinghai, China

Abstract

The primary host of plague in the Qinghai-Tibet Plateau (QTP), China, is Marmota himalayana, which plays an essential role in the maintenance, transmission, and prevalence of plague. To achieve a more clear insight into the differentiation of M. himalayana, complete cytochrome b (cyt b) gene and 11 microsatellite loci were analyzed for a total of 423 individuals from 43 localities in the northeast of the QTP. Phylogenetic analyses with maximum likelihood and Bayesian inference methods showed that all derived haplotypes diverged into two primary well-supported monophyletic lineages, I and II, which corresponded to the referential sequences of two recognized subspecies, M. h. himalayana and M. h. robusta, respectively. The divergence between the two lineages was estimated to be at about 1.03 million years ago, nearly synchronously with the divergence between M. baibacina and M. kastschenkoi and much earlier than that between M. vancouverensis and M. caligata. Genetic structure analyses based on the microsatellite dataset detected significant admixture between the two lineages in the mixed region, which verified the intraspecies level of the differentiation between the two lineages. Our results for the first time demonstrated the coexistence of M. h. himalayana and M. h. robusta, and also, determined the distribution range of the two subspecies in the northeast of QTP. We provided fundamental information for more effective plague control in the QTP.

Introduction

The plague, historically known as the “Black Death” because of the 14th century pandemic, is an infectious disease caused by the bacterium Yersinia Pestis. The massive pandemics of the plague in Eurasia resulted in the death of an estimated 75 to 200 million people and caused significant economic, cultural, and population changes worldwide [1]. It remains a lethal threat to human society nowadays, and sporadic plague outbreaks still happen in many parts of the world [2]. According to the World Health Organization, there were 3248 cases reported worldwide with 584 deaths from 2010 to 2015 (http://www.who.int/mediacentre/factsheets/fs267/en/).

Many stable enzootic plague foci on every major inhabited continent except Australia have come into being, among which China makes up a great part [2]. In China, the natural focus of Himalayan marmot plague on the Qinghai-Tibet Plateau (QTP), with an area of 630 000 km2, is the largest and most active one [3]. Human deaths caused by plague in this focus were reported even in recent years [4]. As the most important enzootic host in this focus, the Himalayan marmot (Marmota himalayana), plays a key role in the maintenance, transmission, and prevalence of plague, and is directly or indirectly responsible for most human epidemics [4]. Therefore, studies on biological features of Himalayan marmot are greatly beneficial for the prevention and control of plague. However, previous studies on this animal are mainly restricted to simple population investigations (e.g. population distribution, flea infestation, plague prevalence, etc.), while very few studies extend to other areas such as the molecular biology [5, 6].

Marmots (family Sciuridae, genus Marmota) are large squirrels and widespread in the Holarctic [7]. Four marmot species, M. baibacina (Kastschenko, 1899), M. sibirica (Radde, 1862), M. caudata (Geoffroy, 1844) and M. himalayana (Hodgson, 1841), have been found in China [8]. Among those, M. himalayana, endemic to the QTP, is the most abundant and widely distributed one. The taxonomic status of M. himalayana as an independent species closest to M. sibirica and M. camtschatica is well evidenced and widely accepted [9, 10]. However, as to the more exquisite phylogenetic relationship and more recent differentiation in this species, few literatures can offer any knowledge about it. After M. himalayana, Arctomys himalayanus (Hodgson, 1841), and M. robusta, Arctomys robustus (Milne-Edwards, 1871), were successively put forward, the relationship between them, or the taxonomic status of the latter, is controversial and obscure up to now. Based on a simple morphological comparison between specimens of the two types, De Winton and Styan (1899) [11] regarded them as identical while Jacobi (1923) [12] and Howell (1929) [13] considered the named M. robusta ‘merely a subspecies of the more western M. himalayana’ and described their diagnostic characters as follows: ‘(M. h. himalayana) apparently differs in being grayer, without the more ochraceous tints of M. h. robusta, and in having the black of the forehead less extensive’. Although Allen (1938) [14] inclined to agree with De and Styan (1899), considering the paucity of specimens available, he just retained it. After these, discussions about this issue can still only be accessed in simple checklists, most of which stood for the subspecies idea but reached few consensuses as to the distribution of two types and conveyed no information about their distinct characteristics and differentiation [15, 16].

It is evident that additional studies are needed to figure out the genetic diversity and intraspecific differentiation in M. himalayana and to clarify the relationship between M. h. himalayana and M. h. robusta. Here, we used the complete mitochondrial cytochrome b gene (cyt b), and 11 microsatellite markers to investigate the genetic diversity and differentiation of the marmots in northeastern QTP. We tried to answer the following questions: 1) are the two independent taxonomic units, the named M. h. himalayana and M. h. robusta, both frequently distributed in northeastern QTP? 2) If yes, where are they respectively distributed and what’s their taxonomic status and relationship?

Materials and methods

To enhance the reproducibility of our results, the laboratory protocol was deposited in protocols.io (http://dx.doi.org/10.17504/protocols.io.iyxcfxn). Any possible questions about the step-by-step protocol can be discussed with the authors on this site.

Sample collection and DNA extraction

423 muscle specimens were collected from 43 localities in northeastern QTP (Fig 1, Table 1 and S1 Table) during the period 2014 to 2016. For each locality, at least 4 individuals were randomly sampled. After trapped, the marmots were anaesthetized to death by 100% diethyl ether. And then, muscle tissues were removed and stored in 95% ethanol immediately. Total genomic DNA was extracted using the commercial kit (Ezup Column Animal Genomic DNA Purification Kit, Sangon Biotech).

thumbnail
Fig 1. Geographical distribution and haplotype frequencies of 43 populations of M. himalayana in this study.

The size of pie charts indicates the sample size of each population with the blue green part corresponding to the percentage of haplotypes belonging to lineage A and the purple red part the percentage of haplotypes belonging to lineage B. The dashed line divides the total area into two parts: Region A, covering populations that consist of only haplotypes in lineage I, and Region B, covering populations that include haplotypes in both lineages and several ones located among them with only haplotypes in lineage II.

https://doi.org/10.1371/journal.pone.0183375.g001

thumbnail
Table 1. Genetic diversity of 43 populations of M.himalayana.

https://doi.org/10.1371/journal.pone.0183375.t001

Ethics statement

Our study was performed in strict accordance with the recommendations in the Guide for the Care and Use of Laboratory Animals of the National Institutes of Health. The protocol was approved by the Committee on the Ethics of Animal Experiments of the Northwest Institute of Plateau Biology, Chinese Academy of Sciences (Permit Number: NWIPB-2014-023). The field work was permitted by the Wild Animals and Plants and Natural Reserves Administration, Qinghai Forestry Bureau. All samples were collected by cooperators from the Qinghai Institute for Endemic Disease Prevention and Control in their routine marmot-killing tasks in order to minimize the effect of sampling on wild marmot populations and eliminate the risk of plague infection.

Mitochondrial DNA amplification and sequencing

The complete sequence (1140 bp) of the cyt b gene and partial flanking segments (partial ND6 and tRNA-Glu for 5’-end, partial tRNA-Pro and tRNA-Thr for 3’-end) were amplified and sequenced using a pair of self-designed primers: CY-F (5’-ATCCTAAGCCTCCGTAAATAGGA-3’) and CY-R (5’-CAGGGAATAGTT TAAGTAAGAAATGTCA-3’) (referring to the M. himalayana mitochondrial genome under accession No. JX069958). PCRs were carried out in a 40 μL reaction volume containing 1 μL genomic DNA (approximately 60 ng), 5.0 μL of 10×PCR buffer (with Mg2+), 2.5 μL of dNTPs (2.5 mM), 1 μL of each primer (10 μM), 2.5 units Taq DNA polymerase (5 U/μL) and 29 μL sterilized double-distilled water. The PCR conditions used were as follows: an initial denaturation at 95°C for 5 minutes followed by 30 cycles of denaturation at 95°C for 45 seconds, annealing at 57°C for 45 seconds, and extension at 72°C for 1 minute 45 seconds, and a final extension at 72°C for 10 minutes. PCR products were then sequenced in both directions by the Shanghai Sangon Biological Engineering Technology & Services Co., Ltd (Shanghai, China).

Microsatellite amplification and genotyping

All 423 specimens were genotyped at 11 microsatellite loci, B, D, E, G, H, I, J, M, R, T, and W [17]. PCRs were conducted using fluorescently-labelled primers in a total volume of 20 μL that contained 0.5 μL genomic DNA (approximately 30 ng), 2 μL of 10×PCR buffer (with Mg2+), 1 μL of dNTPs (2.5 mM), 0.4 μL of each primer (10 μM), 1 unit Taq DNA polymerase (5 U/μL) and 15.5 μL sterilized double-distilled water. The amplification profile was: an initial denaturation at 95°C for 4 min followed by 40 cycles of denaturation at 95°C for 45 sec, annealing at Tm (annealing temperature, as listed in Table 2) for 25 sec and extension at 72°C for 20 sec, and a final extension at 72°C for 2 min. Fragments were then analyzed by the Shanghai Sangon Biological Engineering Technology & Services Co., Ltd (Shanghai, China).

thumbnail
Table 2. Characteristics of 11 microsatellite loci used here: Accession number, repeat motif, primer sequence, annealing temperature(AT, °C) and allele number (AN).

https://doi.org/10.1371/journal.pone.0183375.t002

Phylogenetic analysis

DNA sequences were checked and aligned using MEGA v7.0.18 [18]. Polymorphic sites and haplotypes were identified using DnaSP v5.10.01 [19], with which the genetic diversity indices, haplotype diversity (h) and nucleotide diversity (π), were also calculated for every population. Phylogenetic relationship of the detected haplotypes was reconstructed by maximum likelihood (ML) and Bayesian inference (BI) methods. Jmodeltest v2.1.6 [20] and the Akaike Information Criterion (AIC) were used to select a best-fit DNA substitution model. Maximum-likelihood analyses were conducted in PhyML v3.0 [21] using the BEST search method. The branch support values of the ML trees were estimated using non-parametric bootstrap (BSP) with 1000 replicates. The Bayesian analyses were performed using MrBayes v3.2.3 [22]. Two simultaneous Markov Chain Monte Carlo (MCMC) analyses were run for 10 million generations, and trees were sampled every 100 generations. The first 25% of trees were discarded as ‘burn-in’, and the remaining samples were used to generate the consensus tree and the Bayesian posterior probability (BPP). Phylogenetic trees were visualized with Figtree v1.4.3 [23]. The consistency of the two methods (BI and ML) was evaluated through the similarity of topology and comparison of support values for identical nodes in the resultant phylogenetic trees. To indicate the phylogenetic features more clearly, another two cyt b sequences with confirmed taxonomic identities from Genbank, one for M. h. himalayana (from the genome under Accession No. JX069958) and the other for M. h. robusta (Accession No. AF143928), were added to the final data set, as well as a homogenous sequence for M. sibirica (Accession No. AF143937) as an outgroup to root the phylogenetic trees.

Besides, a median-joining haplotype network was calculated based on the maximum parsimony criterion with NETWORK v5.0 [24, 25] since this approach was suggested to be more efficient than classical phylogenetic methods for representing intraspecific evolution [26]. Ambiguous connections were removed to show the most definite connections clearly under the criteria described in Posada and Crandall (2001) [26].

Genetic distance calculation

In the phylogenetic analyses, all cyt b haplotypes were clustered into two well-supported basal lineages, lineage I (the himalayana lineage) and lineage II (the robusta lineage). To examine the divergence degree of the two lineages, we calculated their ML genetic distance in MEGA v7.0.18 and compared it with those between some other species or subspecies pairs in marmot genus based on same sequences in Steppan et al. (1999) [9]. Relevant parameters were set up referring to Steppan et al. (1999) [9].

Divergence time estimation

Divergence times of the well-supported basal nodes were then estimated using a strict molecular clock method in BEAST v1.8.3 [27], with a constant-size coalescent tree prior and HKY + I substitution model. Three calibrations were used. The 1st was a fossil calibration regarding Marmota minor as the oldest marmot fossil at 10.3 mya (J. Alroy, Macquarie University, Sydney, pers. comm.; Paleobiology Database, http://www.paleodb.org, accessed 1 January 2010). Following the protocol of Steppan et al. (2011) [28], a prior normal distribution with a mean of 10.3 mya and a standard deviation (SD) of 0.9 mya was assigned to the most recent common ancestor (MRCA) of marmots and their sister group, the clade of Callospermophilus lateralis, C. saturatus, and Otospermophilus beecheyi. Another two calibrations were based on estimates of Steppan et al. (2011) [28]. One was a normal distribution with a mean of 6.0 mya and an SD of 0.8 mya for the MRCA of subgenera Petromarmota and Marmota, and another was a normal distribution with a mean of 4.9 mya and an SD of 0.6 mya for the MRCA of subgenus Marmota. For the calibrations, cyt b sequences of another 16 relative species, M. camtschatica (AF100715), M. baibacina (AF143915), M. kastschenkoi (AF143914), M. bobak (AF143917), M. menzbieri (AF143931), M. caudata (AF143925), M. marmota (AF143929), M. broweri (AF143919), M. monax (AF143934), M. olympus (JF313271), M. vancouverensis (AF143939), M. caligata (AF143920), M. flaviventris (AF143927), C. lateralis (AF157887), C. saturatus (AF157917) and O. Beecheyi (AF157919), were added to the data set.

The MCMC was run for 10 million generations with parameters sampled every 1000 generations. With the first 10% samples discarded as burn-in, the convergence of the stationary distribution and stability of estimated parameters were examined using TRACER v1.6 [29] by inspection of plotted posterior estimates and the effective sample sizes of all parameters. The maximum clade credibility tree was summarized in TreeAnnotator v1.8.3 [30]. A tree with ages for main nodes and their 95% highest posterior density intervals was displayed using FigTree v1.4.3 [23].

Test for reproductive isolation using microsatellite data

First, for each population, mean values across all loci for the observed heterozygosity (Ho) and expected heterozygosity (He) (Nei, 1978) were calculated using Arlequin v3.5 [31]. Departure from Hardy-Weinberg equilibrium of each population across all loci and linkage disequilibrium (LD) of all pairs of loci across populations were tested by Fisher’s exact tests using a Markov chain Monte Carlo approach with 100 000 steps and 1 000 iterations in GenePop v4.6 [32].

To verify the taxonomic level (interspecies or intraspecies) of the differentiation between the two basal lineages in the derived phylogenetic trees, i.e., the degree of the reproductive isolation between the two lineages, the most direct method should be laboratory crosses. But subjected to the limit of practice, here, we tried to find some hints from genetic structure analyses. It can be deduced that under the hypothesis of reproductive isolation of species level, strong genetic structure exactly corresponding to the species differentiation should be inferred from the rapidly varying nuclear microsatellite markers, even in the mixed range of two lineages, and of course, if the exploration for such a structure just fails, the hypothesis can definitely be rejected.

Apparently, the phylogenetic relationship and the haplotype distribution pattern (Fig 1) demonstrated that the studied area could be divided into two regions, A and B. The region A, a unique breeding range for the himalayana lineage, covered the populations consisting of only haplotypes in lineage I. The region B, a mixed range, covered the populations including haplotypes from both lineages and several populations located among them although with only haplotypes from lineage II. To achieve a better graphic representation and a more explicit interpretation for the differentiation between the two cyt b-based lineages in the genetic structure analyses, all individuals from region A were organized as group A and individuals from region B with haplotypes belonging to the himalayana lineage were assigned into group B-I while the rest ones with haplotypes belonging to the robusta lineage into group B-II. GenePop v4.6 was employed to calculate the Fst between three group pairs, group A vs. group B-I, group A vs. group B-II, and group B-I vs. group B-II. Then, a Bayesian model-based clustering method implemented in structure v2.3 [33] was employed to detect the genetic structure based on the microsatellite dataset. To determine the optimal number of genetic clusters (K), five independent runs were performed for each K-value ranging from 1 to 20, based on the admixture model with correlated allele frequencies. The length of the MCMCs was 500 000 steps after a burn-in period of 50 000 steps. The most likely value of K was estimated with Structure Harvester using the statistic ΔK [34, 35]. Membership coefficients were permuted using CLUMPP v1.1.2 [36]. Plots of individual assignment probabilities at the optimal value of K and several other K-values were generated with DISTRUCT v1.1 [37].

Results

Sequence variation and phylogenetic relationship

A 1345bp long DNA fragment covering the partial sequences of ND6 and tRNA-Pro and complete sequences of tRNA-Glu, cyt b and tRNA-Thr was harvested. However, only the 1140bp long sequences for the complete cyt b genes were used in following analyses. A total of 100 polymorphic sites, comprising 2 three-variant and 77 two-variant parsimony informative sites and 21 singleton variable sites, were detected in the aligned dataset. Among all 423 sampled individuals from 43 populations, 66 haplotypes were identified. The top two most widespread haplotypes were H17 and H10, which were respectively observed in 17 (39.5%) populations with a total frequency of 100 (23.6%) and 14 (32.6%) populations with a total frequency of 56 (13.2%). Aside from H2 found in 6 populations and 16 individuals, all the remaining haplotypes appeared in no more than 4 populations with a total frequency less than 15. Out of these, 40 were private ones, i. e., discovered in single populations.

The topological structures of phylogenetic trees reconstructed by ML and Bayesian methods were consistent in the main divergences. The Bayesian tree with BSPs (percentages) and BPPs (decimals) for main identical nodes in both trees is shown in Fig 2a. All derived haplotypes have diverged into two lineages at the basal node in both trees. Two referential sequences representing M. h. himalayana and M. h. robusta were separately clustered into one of the two lineages with M. h. himalayana in lineage I (the himalayana lineage) and M. h. robusta in lineage II (the robusta lineage). The support values for this node indicated by BSP and BPP were both 1. Then, both the himalayan and robustus lineages split into two branches again. The support values were 99.6% (BSP) and 1 (BPP) for the split in the himalayana lineage and 54.9% (BSP) and 0.99 (BPP) for that in the other lineage. BSPs greater than 70% and BPPs greater than 0.95 were considered as strong support [3840]. The haplotype network (Fig 2b) was greatly consistent with the Bayesian tree and confirmed again the phylogenic patterns described above.

thumbnail
Fig 2. Phylogeny of the derived 66 cyt b haplotypes in M. himalayana.

a. Bayesian phylogenetic tree with M. sibirica as an outgroup and two referential sequences representing M. h. himalayana and M. h. robusta included. BSPs (percentages) and BPPs (decimals) for main identical nodes in BI and ML trees are displayed. b. Network of the 66 cyt b haplotypes. Black dots indicate unsampled or extinct haplotypes. Circle sizes correspond to the haplotype frequencies with the blue part representing the percentage in Region A and the yellow part that in Region B. The line length is approximately proportional to the number of mutation steps between the connected haplotypes.

https://doi.org/10.1371/journal.pone.0183375.g002

Genetic distance between the himalayana and robusta lineages

Our analysis showed that the genetic distance between the himalayana and robusta lineages was estimated to be 2.7% and was larger than that of any subspecies pair (0–2.2%, mean = 1.20%) listed in Steppan et al. (1999) [9], and even than that between M. vancouverensis and M. caligata (1.2%). Although the distance between M. baibacina. baibacina and M. b. kastschenkoi (3.5%) was larger, it was removed from the comparison since M. b. kastschenkoi was elevated to an independent species in Brandler (2003) [41].

Estimates of divergence times

The BEAST analysis (Fig 3) indicated that the divergence between the himalayana and robusta lineages occurred at about 1.03 mya (95% highest posterior density, HPD: 0.72–1.34 mya). It happened at almost the same time when the divergence between M. baibacina and M. kastschenkoi (1.20 mya; 95% HPD: 0.78–1.60 mya) took place, and it was much earlier than the divergence between M. vancouverensis and M. caligata (0.45 mya; 95% HPD: 0.21–0.69 mya). After that, the two lineages both split into two branches again, respectively at 0.34 mya (95% HPD: 0.22–0.48 mya) and 0.42 mya (95% HPD: 0.26–0.62 mya), with the latter almost simultaneously with the divergence between M. vancouverensis and M. caligata.

thumbnail
Fig 3. The divergence times estimated with BEAST based on cyt b sequence data for Marmota.

Bayesian posterior probabilities for main nodes were indicated by decimal numbers around them. Mean values and 95% HPDs (in brackets) are shown behind the arrows.

https://doi.org/10.1371/journal.pone.0183375.g003

Result of reproductive isolation test using microsatellite data

All 11 microsatellite loci were polymorphic. The overall number of alleles for each locus ranged from 7 (R) to 14 (E, G, M) with a total of 124 observed alleles and an average number of 11.3 (Table 2). The average genetic diversity for each population, measured as observed heterozygosity (Ho) and expected heterozygosity (He) (averaged across loci), ranged from 0.52 and 0.54 to 0.89 and 0.80 (Table 1). Only one out of all 43 populations, the number 18 population, WLW, was detected to diverge from Hardy-Weinberg equilibrium significantly (P < 0.01). Tests of linkage disequilibrium for all pairs of loci across populations yielded no significant genotypic disequilibrium (P < 0.01).

In total, 23 populations with 259 individuals were allocated to region A, i.e. group A, while the rest 20 populations with 164 individuals were assigned to region B. In region B, 60 samples from 14 populations were divided into group B-I while the remaining 104 samples from 20 populations into group B-II. According to Structure Harvester, the optimal value for K was estimated to be 2, and then 3 and 4 (Fig 4). The bar plots of individual assignment probabilities at K = 2, K = 3 and K = 4 are shown in Fig 5. As displayed in all three situations, strong genetic differences accompanied by clear signs of admixture was detected between group A (the unique breeding range for the himalayana lineage) and the B-I and B-II groups (the mixed range of two lineages), while inside the mixed range, no genetic differences could be observed between the B-I and B-II groups at all, demonstrating the strong gene flow and little reproductive isolation between the two lineages in the mixed range. The meaning of the Fst values confirmed our results. It was estimated as 0.0677 for group A vs. group B-I and 0.0765 for group A vs. group B-II in contrast to a smaller Fst value for group B-I vs. group B-II (0.0067).

thumbnail
Fig 4. The Delta K curve generated by Structure Harvester.

K = 2 was estimated as the optimal.

https://doi.org/10.1371/journal.pone.0183375.g004

thumbnail
Fig 5. Results of Bayesian individual-based clustering in Structure with K = 2, K = 3 and K = 4.

Each individual is represented by a single vertical bar divided into K colors. The colored segment shows the estimated proportion of membership to the genetic cluster.

https://doi.org/10.1371/journal.pone.0183375.g005

Discussion

Evidence for the himalayana and robusta lineages as independent taxonomic units

In our study, cyt b sequences of all 423 specimens from the northeast of QTP were clustered into two basal well-supported and monophyletic lineages, lineage I and lineage II. The lineages corresponded to the referential sequences representing M. h. himalayana and M. h. robusta, respectively (Fig 2). The genetic distance between the himalayana and robusta lineages was estimated to be 2.7%, which was larger than that of any subspecies pair (0–2.2%, mean = 1.20%) listed in Steppan et al. (1999) [9] and even that between two independent species, M. vancouverensis and M. caligata (1.2%). Taking the subspecies taxonomic status of the himalayana and robusta lineages we are going to illustrate here into consideration, this result seems unexpected but it is consistent with the different divergence histories. As revealed in the tMRCA analysis in BEAST (Fig 3), our two lineages were evaluated to diverge at about 1.03 mya nearly synchronously with the divergence between M. baibacina and M. kastschenkoi (1.20 mya; 95% HPD: 0.78–1.60 mya) and much earlier than that between M. vancouverensis and M. caligata (0.45 mya; 95% HPD: 0.21–0.69 mya). M. vancouverensis is a special marmot species living exclusively on an island, the Vancouver Island [42]. Although it was estimated to diverge from the continental M. caligata more recently, which can be explained by the isolation of that island from the mainland after the sea level rise at the end of the last glaciation, little chance for second contact since the isolation enabled them to evolve into different species [42]. However in M. himalayana, the situation was just the opposite due to frequent and great second contacts as discovered here in the mixed range, leaving the two diverged lineages merely developing into subspecies in M. himalayana. Back to our topic, a certain distribution pattern of haplotypes in the two lineages emerged in the studied range. Only haplotypes in the himalayana lineage were detected in all 259 individuals in a pretty large area in the northeast (region A). However, all haplotypes in the robusta lineage and a few ones in the himalayana lineage, specific to this region (H01, H06, H08, H09, H12, H14, H42, H46, H47 and H49) or shared with region A (H7, H10, H11 and H17), were found in the southwest (region B) (Fig 1 and Table 1). Undeniably, all the described outcomes bear testimony to the status of the uncovered lineages as independent evolutionary units at least for a certain period in history and thus potential independent taxonomic units.

Correspondence to the subspecies definition

The conception and diagnosable criteria of subspecies as a taxonomic unit have gone through a long and impassioned debate [4346]. Patten, Winker and Haig (2010) [46] figured out four essential features for subspecies from definitions proffered over years as follows: (i) no reproductive isolation between different subspecies; (ii) defining features based on genetic or developmental facts; (iii) unique breeding ranges that separate from each other for various subspecies, and (iv) diagnosable distinctions. Three points out of the four listed characteristics are indeed met by two units studied by us. First, our units were morphologically different. M. h. himalayana ‘apparently differs in being grayer, without the more ochraceous tints of M. h. robusta, and in having the black of the forehead less extensive’ as described by Howell (1929) [13]. Second, a unique breeding range (region A) for M. h. himalayana was found in our study, and the sampling there was sufficient to strut a compelling assessment. Though an exclusive breeding range for M. h. robusta was not found in the studied area; it is believed to be located somewhere else in the whole species’ range. Third, the two units were proved as not reproductively isolated. The analyses on the genetic structure revealed strong genetic differentiation accompanied by obvious admixture between the unique breeding range for the himalayana lineage and the mixed range of two lineages (Fig 5). Furthermore, inside the mixed range, no genetic structure was observed between the two lineages, indicating the existence of significantly frequent admixture. Correspondingly, the Fst values of group A vs. group B-I and group A vs. group B-II (0.0677 and 0.0765) were significantly larger than that between group B-I and group B-II (0.0067). These results serve as a sufficient demonstration for the strong gene flow between the two lineages in the mixed range and the absence of reproductive isolation between them. Last, although no defining features based on genetic or developmental facts were verified here, sufficient time has elapsed for some “heritable geographic variation in phenotype” to be built since the lineages split at about 1.03 mya, provided that the story is not too intricate. So, although a reciprocal monophyly is considered not necessary for subspecies differentiation and cannot act as a proper excuse to reject any subspecies, it does be an important positive signal indicating the occurrence of subspecies.

Discrepancy with formerly described subspecies distribution

The subspecies rank of M. h. himalayana and M. h. robusta has been in use for years. Although no studies especially aimed at their differentiation are available, detailed descriptions of their distribution in China could be found in many taxonomic checklists, two opinions out of which will be discussed here. According to Wang (2002) [15], the nominate subspecies is distributed in Xizang, southern Xinjiang, Qinghai (Qaidam Basin) and western Gansu, while M. h. robusta in northwestern Yunnan, western Sichuan and Qinghai (except Qaidam Basin). However, in Smith and Xie (2009) [16], the former was thought to dwell only in southern Xizang, and the latter inhabited a pretty extensive range: Xinjiang, Xizang, Qinghai, Gansu, western Sichuan and Yunnan. Judging from our study, apparently, M. h. himalayana lives in a larger range than Smith and Xie (2009) [16] expected since a wide area in the northeast of Qinghai was discovered to be its unique breeding range. Also, the opinion of Wang (2002) [15] about both subspecies’ distribution in Qinghai is not so accurate because both subspecies were found outside Qaidam Basin in our study, letting alone the unique breeding range for M. h. himalayana outside Qaidam Basin.

Significance of our study

Resistance and susceptibility to Y. Pestis, which can act as important determinants for the role and contribution of a particular host population in the maintenance and transmission of plague, are recognized to depend on host species and even genetic differences among individuals and populations [4749]. Variations in plague resistance among different populations of deer mice (Peromyscus maniculatus) and California voles (Microtus californicus) were already reported, and the resistance in M. californicus was revealed to be genetic [50, 51]. Therefore, whether or not any resistance or susceptibility differences exist in subspecies level in M. himalayana is a question of future studies, which may provide constructive knowledge for the prevention and control of plague. Whatever, it is proved by our study that the genetic differentiation in the Himalaya marmot population in QTP is discrete and heterogeneous, which can serve as reliable supporting information for more precise investigations on plague epizootiology in future. Meanwhile, a unique breeding range for M. h. himalayana and a mixed range where both subspecies are distributed and interbreed fairly frequently are located in the northeast of QTP. If the subspecies contribute to the plague differently, corresponding prevention and control strategies should be developed for different ranges.

On the other hand, due to the key role of the Himalayan marmot in the maintenance, transmission, and prevalence of plague [4], the government always conducts extensive marmot-killing tasks. Undoubtedly, this is a potential threat for the biological diversity of the Himalayan marmot, although it is widespread and abundant in the QTP now. So this study provides scientifically based information on the subspecies differentiation for M. himalayana and can act as a guide to the government’s marmot-killing actions to control the marmot population and conserve its biological diversity simultaneously.

Conclusions

As discussed above, the taxonomic rank of two subspecies, M. h. himalayana and M. h. robusta, was confirmed in different aspects, i.e., the well supported reciprocal monophyly, quite a long period of divergence (1.03 mya) and large genetic distance exceeding that between almost any other subspecies pair and even some species pair in the same genus, a unique breeding range for M. h. himalayana and clear signals for admixture between them. Provided all these proofs, we proved that the subspecies differentiation of Himalayan marmot and a unique breeding range for M. h. himalayana, together with a second contact range where both subspecies are distributed and interbreed, took place in the northeast of QTP.

Supporting information

S1 File. Genotypic data for all individuals at 11 microsatellite loci.

https://doi.org/10.1371/journal.pone.0183375.s001

(XLSX)

S1 Table. Geographic information of 43 populations of M.himalayana.

https://doi.org/10.1371/journal.pone.0183375.s002

(DOCX)

Acknowledgments

We are grateful for all the hard work contributed to the sampling by Key Laboratory for Plague Prevention and Control of Qinghai Province.

References

  1. 1. Perry RD, Fetherston JD. Yersinia pestis—etiologic agent of plague. Clinical microbiology reviews. 1997;10(1):35–66. pmid:8993858
  2. 2. Stenseth NC, Atshabar BB, Begon M, Belmain SR, Bertherat E, Carniel E, et al. Plague: past, present, and future. PLoS Med. 2008;5(1):e3. pmid:18198939
  3. 3. Yu D. Plague epizootiology. Beijing: Science Press; 2009.
  4. 4. Luo X, Hai R. An overview of the epidemiological characteristics of Marmota himalayana plague on Qinghai-Tibet plateau. Zhongguo Meijie Shengwuxue ji Kongzhi Zazhi = Chinese Journal of Vector Biology and Control. 2010;21(4):394–8.
  5. 5. Xu J-h, Wang L-l, Xue H-l, Wang Y-s, Xu L-x. Genetic structure of Himalayan marmot (Marmota himalayana) population alongside the Qinghai–Tibet Railway. Acta Ecologica Sinica. 2009;29(5):314–9.
  6. 6. Chao QJ, Li YD, Geng XX, Zhang L, Dai X, Zhang X, et al. Complete mitochondrial genome sequence of Marmota himalayana (Rodentia: Sciuridae) and phylogenetic analysis within Rodentia. Genet Mol Res. 2014;13(2):2739–51. pmid:24782088.
  7. 7. Nikol’skii A, Rumiantsev VY, editors. Center of species diversity of Eurasian marmots (Marmota, Rodentia) in an epi-platformal orogeny area. Doklady Biological Sciences; 2012: Springer.
  8. 8. Chen Y, Wang Z, Zhang G, Fan W, Tao Y, He X, et al. Genetic characterization of four wild species of Chinese marmots using microsatellite markers. Biologia. 2012;67(5).
  9. 9. Steppan SJ, Akhverdyan MR, Lyapunova EA, Fraser DG, Vorontsov NN, Hoffmann RS, et al. Molecular phylogeny of the marmots (Rodentia: Sciuridae): tests of evolutionary and biogeographic hypotheses. Systematic biology. 1999;48(4):715–34. pmid:12066297
  10. 10. Brandler O, Lyapunova E. Molecular phylogenies of the genus Marmota (Rodentia Sciuridae): comparative analysis. Ethology Ecology & Evolution. 2009;21(3–4):289–98.
  11. 11. De Winton WE. On Chinese mammals, principally from Western Szechuen. With notes on Chinese squirrels by F. W. Styan. Proc. Zool. Soc. Lond. 1899: 575.
  12. 12. Jacobi A. Mammalia: I. Sӓugetiere, in: Zoologische Ergebnisse der Walter Stӧznerschen Expeditionen nach Szetschwan, Osttibet und Tschili, Abh. u. Ber. Mus. f. Tierk. Völkerk. Dresden. 1923, vol. 16, no. 1, p. 10.
  13. 13. Howell AB. Mammals from China in the collections of the United States National Museum. Proc. U. S. Nat. Mus.1929. vol. 75, art. 1, p. 36.
  14. 14. Allen GM. The mammals of China and Mongolia. In: Granger W. (editor) Natural History of Central Asia. XI. New York: Amer. Mus. Nat. Hist.; 1938. p. 713.
  15. 15. Wang Y. A complete checklist of mammal species and subspecies in China. Beijing: China Forestry Publishing House; 2002.
  16. 16. Smith AT, Xie Y. A guide to the mammals of China. Hunan: Hunan Education Publishing House; 2009.
  17. 17. Ma H. The study of Marmota himalayana migrant among different families within small scale: Chinese Center for Disease Control and Prevention; 2011.
  18. 18. Kumar S, Stecher G, Tamura K. MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets. Molecular biology and evolution. 2016:msw054.
  19. 19. Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25(11):1451–2. pmid:19346325
  20. 20. Darriba D, Taboada GL, Doallo R, Posada D. jModelTest 2: more models, new heuristics and parallel computing. Nature methods. 2012;9(8):772-.
  21. 21. Guindon S, Gascuel O. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Systematic biology. 2003;52(5):696–704. pmid:14530136
  22. 22. Ronquist F, Teslenko M, van der Mark P, Ayres DL, Darling A, Hohna S, et al. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst Biol. 2012;61(3):539–42. pmid:22357727; PMC3329765.
  23. 23. Rambaut A. FigTree version 1.4.3 2009. http://tree.bio.ed.ac.uk/software/figtree.
  24. 24. Bandelt H-J, Forster P, Röhl A. Median-joining networks for inferring intraspecific phylogenies. Molecular biology and evolution. 1999;16(1):37–48. pmid:10331250
  25. 25. Polzin T, Daneshmand SV. On Steiner trees and minimum spanning trees in hypergraphs. Operations Research Letters. 2003;31(1):12–20.
  26. 26. Posada D, Crandall KA. Intraspecific gene genealogies: trees grafting into networks. Trends in ecology & evolution. 2001;16(1):37–45.
  27. 27. Drummond AJ, Suchard MA, Xie D, Rambaut A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Molecular biology and evolution. 2012;29(8):1969–73. pmid:22367748
  28. 28. Steppan SJ, Kenagy GJ, Zawadzki C, Robles R, Lyapunova EA, Hoffmann RS. Molecular data resolve placement of the Olympic marmot and estimate dates of trans-Beringian interchange. Journal of Mammalogy. 2011;92(5):1028–37.
  29. 29. Rambaut A, Suchard MA, Xie W, Drummond AJ. Tracer version 1.6 2014. http://beast.bio.ed.ac.uk/Tracer.
  30. 30. Rambaut A, Drummond AJ. TreeAnnotator version 1.8.3 2009. http://beast.bio.ed.ac.uk/TreeAnnotator.
  31. 31. Excoffier L, Lischer HE. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Molecular ecology resources. 2010;10(3):564–7. pmid:21565059
  32. 32. Rousset F. genepop’007: a complete re—implementation of the genepop software for Windows and Linux. Molecular ecology resources. 2008;8(1):103–6. pmid:21585727
  33. 33. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155(2):945–59. pmid:10835412
  34. 34. Earl DA. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conservation genetics resources. 2012;4(2):359–61.
  35. 35. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Molecular ecology. 2005;14(8):2611–20. pmid:15969739
  36. 36. Jakobsson M, Rosenberg NA. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics. 2007;23(14):1801–6. pmid:17485429
  37. 37. Rosenberg NA. DISTRUCT: a program for the graphical display of population structure. Molecular Ecology Notes. 2004;4(1):137–8.
  38. 38. Hillis DM, Bull JJ. An empirical test of bootstrapping as a method for assessing confidence in phylogenetic analysis. Systematic biology. 1993;42(2):182–92.
  39. 39. Erixon P, Svennblad B, Britton T, Oxelman B. Reliability of Bayesian posterior probabilities and bootstrap frequencies in phylogenetics. Systematic Biology. 2003;52(5):665–73. pmid:14530133
  40. 40. Huelsenbeck J, Rannala B. Frequentist properties of Bayesian posterior probabilities of phylogenetic trees under simple and complex substitution models. Syst Biol. 2004;53(6):904–13. pmid:15764559.
  41. 41. Brandler O. On species status of the forest-steppe marmot Marmota kastschenkoi (Rodentia, Marmotinae). Zool Zh. 2003;82:1498–505.
  42. 42. Cardini A, Thorington RW, Polly PD. Evolutionary acceleration in the most endangered mammal of Canada: speciation and divergence in the Vancouver Island marmot (Rodentia, Sciuridae). Journal of evolutionary biology. 2007;20(5):1833–1846. pmid:17714301
  43. 43. Lack D. Subspecies and sympatry in Darwin's finches. Evolution. 1969:252–63. pmid:28562888
  44. 44. Patten MA, Unitt P, Sheldon F. Diagnosability versus mean differences of Sage Sparrow subspecies. The Auk. 2002;119(1):26–35.
  45. 45. Phillimore AB, Owens IP. Are subspecies useful in evolutionary and conservation biology? Proceedings of the Royal Society of London B: Biological Sciences. 2006;273(1590):1049–53.
  46. 46. Patten MA, Winker K, Haig S. Null expectations in subspecies diagnosis. Ornithological Monographs. 2010;67(1):35.
  47. 47. Rivkus Y, Ostrovsky I, Melnikov I, Dyatlov A. On sensitivity to plague in Rhombomys opimus from the northern Kizil-Kum. Problemi Osobo Opasnih Infektsii. 1973;30:48–58.
  48. 48. Kozlov M, Sultanov G. Plague. Natural focality, epizootiology, and epidemic manifestations (in Russian) Meditsina, Moscow. 1979.
  49. 49. Gage KL, Kosoy MY. Natural history of plague: perspectives from more than a century of research. Annu Rev Entomol. 2005;50:505–28. pmid:15471529
  50. 50. Quan S, Kartman L. Ecological studies of wild rodent plague in the San Francisco Bay Area of California. VIII. Susceptibility of wild rodents to experimental plague infection. Zoonoses research. 1962;1(7):121–44.
  51. 51. Hubbert WT, Goldenberg MI. Natural resistance to plague: genetic basis in the vole (Microtus californicus). The American journal of tropical medicine and hygiene. 1970;19(6):1015–9. pmid:5493047