Skip to main content
Advertisement
  • Loading metrics

Genome-informed investigation of the molecular evolution and genetic reassortment of severe fever with thrombocytopenia syndrome virus

  • Kyuyoung Lee,

    Roles Formal analysis, Investigation, Writing – original draft

    Affiliation Department of Microbiology, Institute for Viral Diseases, College of Medicine, Korea University, Seoul, Republic of Korea

  • Jong Hyeon Seok,

    Roles Formal analysis

    Affiliation Department of Microbiology, Institute for Viral Diseases, College of Medicine, Korea University, Seoul, Republic of Korea

  • Hyunbeen Kim,

    Roles Formal analysis

    Affiliation Department of Microbiology, Institute for Viral Diseases, College of Medicine, Korea University, Seoul, Republic of Korea

  • Sejik Park,

    Roles Formal analysis

    Affiliation Department of Microbiology, Institute for Viral Diseases, College of Medicine, Korea University, Seoul, Republic of Korea

  • Sohyun Lee,

    Roles Formal analysis

    Affiliation Department of Microbiology, Institute for Viral Diseases, College of Medicine, Korea University, Seoul, Republic of Korea

  • Joon-Yong Bae,

    Roles Formal analysis

    Affiliation Department of Microbiology, Institute for Viral Diseases, College of Medicine, Korea University, Seoul, Republic of Korea

  • Kyeongseok Jeon,

    Roles Formal analysis

    Affiliation Department of Microbiology and Immunology, College of Medicine, Seoul National University, Seoul, Republic of Korea

  • Jun-Gu Kang,

    Roles Formal analysis

    Affiliation Laboratory for Vector Borne Disease, Korea Zoonosis Research Institute, Jeonbuk National University, Iksan, Republic of Korea

  • Jeong Rae Yoo,

    Roles Formal analysis, Resources

    Affiliation Department of Internal Medicine, College of Medicine, Jeju National University, Jeju, Republic of Korea

  • Sang Taek Heo,

    Roles Formal analysis, Resources

    Affiliation Department of Internal Medicine, College of Medicine, Jeju National University, Jeju, Republic of Korea

  • Nam-Hyuk Cho,

    Roles Data curation, Resources, Validation

    Affiliation Department of Microbiology and Immunology, College of Medicine, Seoul National University, Seoul, Republic of Korea

  • Keun Hwa Lee,

    Roles Data curation, Resources, Validation

    Affiliation Department of Microbiology, College of Medicine, Hanyang University, Seoul, Republic of Korea

  • Kisoon Kim,

    Roles Data curation, Validation

    Affiliations Department of Microbiology, Institute for Viral Diseases, College of Medicine, Korea University, Seoul, Republic of Korea, Vaccine Innovation Center, College of Medicine, Korea University, Seoul, Republic of Korea

  • Man-Seong Park ,

    Roles Conceptualization, Funding acquisition, Supervision, Writing – review & editing

    manseong.park@gmail.com (M-SP); jinil_kim@korea.ac.kr (JIK)

    Affiliations Department of Microbiology, Institute for Viral Diseases, College of Medicine, Korea University, Seoul, Republic of Korea, Vaccine Innovation Center, College of Medicine, Korea University, Seoul, Republic of Korea, Biosafety Center, College of Medicine, Korea University, Seoul, Republic of Korea

  • Jin Il Kim

    Roles Conceptualization, Funding acquisition, Supervision, Writing – review & editing

    manseong.park@gmail.com (M-SP); jinil_kim@korea.ac.kr (JIK)

    Affiliations Department of Microbiology, Institute for Viral Diseases, College of Medicine, Korea University, Seoul, Republic of Korea, Vaccine Innovation Center, College of Medicine, Korea University, Seoul, Republic of Korea, Biosafety Center, College of Medicine, Korea University, Seoul, Republic of Korea

Abstract

Background

Severe fever with thrombocytopenia syndrome virus (SFTSV) is a viral pathogen causing significant clinical signs from mild fever with thrombocytopenia to severe hemorrhages. World Health Organization has paid special attention to the dramatic increase in human SFTS cases in China, Japan, and South Korea since the 2010s. The present study investigated the molecular evolution and genetic reassortment of SFTSVs using complete genomic sequences.

Methods/Principal finding

We collected the complete genome sequences of SFTSVs globally isolated until 2019 (L segment, n = 307; M segment, n = 326; and S segment, n = 564) and evaluated the evolutionary profiles of SFTSVs based on phylogenetic and molecular selection pressure analyses. By employing a time-scaled Bayesian inference method, we found the geographical heterogeneity of dominant SFTSV genotypes in China, Japan, and South Korea around several centuries before and locally spread by tick-born spillover with infrequent long-distance transmission. Purifying selection predominated the molecular evolution of SFTSVs with limited gene reassortment and fixed substitution, but almost all three gene segments appeared to harbor at least one amino acid residue under positive selection. Specifically, the nonstructural protein and glycoprotein (Gn/Gc) genes were preferential selective targets, and the Gn region retained the highest number of positively selected residues.

Conclusion/Significance

Here, the large-scale genomic analyses of SFTSVs improved prior knowledge of how this virus emerged and evolved in China, Japan, and South Korea. Our results highlight the importance of SFTSV surveillance in both human and non-human reservoirs at the molecular level to fight against fatal human infection with the virus.

Author summary

SFTSV causes a high mortality disease in humans, especially in China, Japan, and South Korea. For a better understanding of the evolutionary characteristics of SFTSVs, our study performed phylogenetic and genetic reassortment analyses using the largely collected genomic information of SFTSVs. It appeared that dominant endemic SFTSVs in China, Japan, and South Korea likely diverged several centuries ago. Movement of vector (e.g., tick) or non-human mammalian reservoirs possibly played a key role in human SFTSV spillover and infrequently led to international spread among three countries. Purifying selection strongly limited the generation of novel variants through either molecular mutations or genetic reassortment. However, we observed a few amino acid sites under positive selection, especially in glycoproteins which might limitedly allow the generation of variants within a clade, in which molecular evolution might not be a major cause for the current emergence of SFTSVs but the ecological or epidemiological change possibly be a more plausible factor. However, we should still carefully monitor the evolutionary dynamics of SFTSV considering the potential of host adaptation in humans.

Introduction

Severe fever with thrombocytopenia syndrome (SFTS) was first reported from hospitalized patients in China in 2009, and its causative virus (SFTSV) was identified in 2011 [1]. The SFTSV is one of the viruses belonging to the genus Phlebovirus of the family Phenuiviridae in the Order Bunyavirales [2]. It retains single-stranded, negative-sense RNA genome segments (L, M, and S) [1]. The L gene encodes an RNA-dependent RNA polymerase (RdRp) for viral genome replication and transcription [3], the M gene encodes glycoproteins N (Gn) and C (Gc) for viral attachment and entry to host cells [4], and the S gene encodes nucleocapsid (NP) and nonstructural (NS) proteins for viral replication protection and immune evasion [5,6].

Since the first identification of SFTSV, the number of human infection cases with the virus drastically increased in China [7,8], as well as in South Korea [9], Japan [10,11], and Vietnam [12], and more than 13 thousand human cases were reported globally until 2019 [7,8]. SFTSV causes clinical symptoms in humans from mild fever with thrombocytopenia and leukopenia to severe hemorrhages, encephalitis, or multiple organ failures to death [1,13]. The case fatality rate (CFR) of SFTS showed differences by epidemiological and environmental factors but remained quite high in China (2.7–8.0%), Japan (4.9–35.0%), and South Korea (11.5–47.2%) (Fig 1) [710,1417].

thumbnail
Fig 1. Summary of incidence and case fatality rate (CFR) of human SFTSV infection from 2013 to 2021.

Annual incidence (Dotted and triangle) and CFR (Line and circle) of the SFTSV in China (Green), Japan (Black), and South Korea (Red) from 2013 to 2021. The plot was reconstructed from the statistics of national surveillance reports in China [15], Japan [16], and South Korea [17].

https://doi.org/10.1371/journal.pntd.0011630.g001

Considering its clinical and epidemiological significance in public health, the World Health Organization (WHO) listed the virus as one of the most threatening pathogens with a strong potential for epidemic [13]. Current surveillance presumed that the incidence of human SFTS could increase, and the geographic distribution of human infection would be expanded further over time [8,18,19]. Tick bite, specifically by Haemaphysalis longicornis, is a well-known transmission factor of SFTS in humans considering a broad host range, geographical distribution, and virus isolation [2022]. Previous investigations supported that human SFTSV infection might be incidental spillover events from viral reproduction in the life cycle of the tick or the transmission cycle of the tick with sub-clinically infected animals [2326].

Many studies investigated the genetic diversity, molecular evolution, and transmission of SFTSVs across the three most affected countries: China, Japan, and South Korea [9,11,2729]. These studies generally collected genomic information from SFTSV isolates and performed a phylogenetic analysis to classify their SFTSVs into five genotypes and to detect key mutations [11,27,29,30]. One study reconstructed epidemiological circumstances among China, Japan, and South Korea from the genotypic relatedness of SFTSV [27]. However, many studies likely showed a potential for a sampling bias in phylogenetic inferences considering the limited number of genomic information. Here, we revisited the phylogenetic and evolutionary analyses of SFTSVs using the large-scale genomic sequence data compared to previous research. We evaluated the genetic characteristics of the three gene segments of SFTSVs using phylogenetic approaches. Furthermore, our study investigated the genetic reassortment and selection pressure profiles of the three gene segments at the molecular level to better understand the molecular evolutionary dynamics of SFTSVs.

Materials and methods

Ethics statement

The present study used the genomic sequences of SFTSV from human clinical specimens regarding the institutional review board (IRB) approved at Jeju National University Hospital (IRB file no. 2015-08-002). The formal statements of written consent were obtained from the all the participants.

Data collection

Our study collected nucleotide sequences of L (6,252 nucleotides, nts), M (3,222 nts), and S (1,744 nts) gene segments of SFTSVs from the database of the Bacterial and Viral Bioinformatics Resource Center (BV-BRC) (https://www.bv-brc.org/). Epidemiological information about collection dates, countries, and host species were obtained based on the accession number of the genome sequences. The present study additionally involved genomic sequences of SFTSVs during the outbreak in Japan from 2013 to 2014 provided by the previous phylogenetic study (L, n = 38; M, n = 35; and S, n = 56) [30]. We also used the genomic sequences of SFTSV isolated from human clinical specimens collected (L, n = 8; M, n = 8; and S, n = 34) and ticks captured in South Korea (L, n = 1; M, n = 1; and S, n = 3). Our study excluded gene sequences if a sample did not have full length of nucleotide sequences, nor provide collection dates, countries, and host species. Furthermore, we only involved one representative sequence among multiple sequences with very high nucleotide sequence identity if the sequences were collected from the same date, country, and host species. Finally, a total of 1,197 genome sequences (L, n = 307; M, n = 326; and S, n = 564) of SFTSVs were analyzed in this study (S1 Table and S1 Data). Each sequence set was aligned using Multiple Alignment using Fast Fourier Transform approach (version 7), [31] and manually adjusted (S1 Data).

Phylogenetic analysis

Bayesian Markov chain Monte Carlo (MCMC) Metropolis–Hastings algorithm was used for the phylogeny estimation of the three gene segments (overall sequence sets for the L, M, and S genes) on BEAST (version 1.10.4, [32]). Our study used jModelTest2 (version 2.1.10) to select the best-fit model for Bayesian phylogeny estimation for the three gene segments through the phylogenetic model averaging [33]. We selected General Time Reversal (GTR) + I + Γ model [34] as the nucleotide substitution model with uncorrelated relaxed, lognormal clock model and Bayesian skygrid tree prior for the analysis. Our study initiated the MCMC chain from 50 million runs with a 10% burn-in and sub-sampled in every 50 thousand iterations. If the MCMC chain showed the lack of convergence with low effective sample sizes, we performed the Bayesian estimation with higher MCMC chain runs and repeated this step until the MCMC chains reached to appropriate convergence, stationary, and mixing properties with the sufficient effective sample sizes (>100) on Tracer (version 1.6) [35]. The maximum clade credibility (MCC) trees of the three gene segments were summarized using TreeAnnotator (version 1.10.4) and visualized by FigTree (version 1.4.2). The Bayesian phylogeny was used to estimate the evolutionary rates and the time of the most recent common ancestor (tMRCA) of the three gene segments of SFTSVs in four datasets. Estimates of the evolutionary rate and tMRCA were presented as median along with the lower and upper limits of the 95% highest probability density (HPD). All statistical estimates of phylogenetic analysis were summarized and visualized using “ggplot2” package on R (version 4.1.3) [36] with R studio (version 2022.12.0) [37].

Reassortment analysis

Our study detected reassortment events in each pair of gene segments using the graph incompatibility-based reassortment finder (GiRaF) approach (version 1.2) [38]. Alignments of the L, M, and S gene segments in the full dataset were used for the reconstruction of phylogenetic relationship in MrBayes (GTR+I+Γ, 1,000,000 iterations, 10% burn-in, and sampling every 10,000 iterations) [39]. The phylogenetic uncertainty was statistically estimated by the trees of three gene segments in the GiRaF. We performed three independent GiRaF trials in each pair of gene segments and confirmed reassortment events exhibited in all three trials.

Natural selection pressure profile

Natural selection pressure profiles were measured by the functional subunit of coding regions in the three gene segments of SFTSVs (L: RdRp and C-terminus regions; M: Gn and Gc regions; and S: NP and NS regions). The selection pressure was estimated by the ratio between nonsynonymous (dN) and synonymous substitutions rates (dS) employing the best-fit REV nucleotide substitution model (cutoff p-value <0.05). The dN/dS ratios were estimated by four methods, the single likelihood ancestor counting (SLAC) [40], fast unconstrained Bayesian approximation (FUBAR) [41] (cutoff posterior probability > 0.9), mixed effects model of evolution (MEME) [42] (cutoff p-value <0.1) in the HyPhy (version 2.2) on the Datamonkey web-server (http://www.datamonkey.org, [43]) and the Renaissance counting (RNSC) method implemented in BEAST (cutoff p-value <0.1) [44].

Results

Phylogenetic relationships of the SFTSV gene segments

All phylogenetic trees of L, M, and S genes in SFTSVs showed highly homogeneous topology comprising co-circulation of multiple lineages with intra-subtype diversification (Fig 2). Following the genetic clade definition by Fu et al. [27], we observed two major clades, Clade 1 (genotypes A, D, E, and F) and Clade 2 (genotype B) of SFTSVs. The reference strain of genotype C (China/AHL/2011; GenBank accession number JQ670934) showed incongruence of classification between the L (Clade 2) and M (Clade 1) genes, and its S gene belonged to genotype D. Most Chinese SFTSVs formed multiple genotypes in Clade 1, and only a few strains belonged to genotypes B and C (or genotype B) in Clade 2. Japanese SFTSVs formed one subclade in genotype B in Clade 2. Only a few strains belonged to either other subclades in genotype B or genotype A in Clade 1. Most South Korean SFTSVs formed subclades in genotype B, and a few minor strains belonged to either Chinese or Japanese subclades in genotype B, or rarely in Clade 1 (each of A, D, or F genotype). On the other hand, the phylogenetic locations of different host SFTSVs (human, tick, and non-human mammal hosts) displayed a strong polyphyletic pattern similarly in all three gene segments (S1 Fig). The SFTSVs isolated in tick and non-human mammal hosts were reported in China and South Korea in both Clades 1 and 2 and showed genetic relatedness with human SFTSVs in all three gene segments.

thumbnail
Fig 2. Phylogenetic relationships of SFTSV.

The maximum clade credibility tree of the (A) Long (L), (B) Medium (M), (C) Short (S) gene segments was reconstructed using the Bayesian evolutionary interference method. The brackets in the x-axis display the genotype classification by Fu et al (2016). The phylogenetic branches with specific countries were color labeled black (China), red (Japan), and blue (South Korea), respectively.

https://doi.org/10.1371/journal.pntd.0011630.g002

Evolutionary profiles and genetic reassortment of the SFTSV gene segments

SFTSVs exhibited different evolutionary rates and tMRCAs by three gene segments even with similar phylogenic genealogy (Fig 3). The S gene segment showed the highest evolutionary rate (4.06 x 10−4 substitutions/site/year) among the three gene segments, which was approximately 2 to 19 times higher than that of the other two genes (Table 1). As the differences in the evolutionary rates among the three gene segments, the S gene showed the youngest tMRCA than the other two genes (Table 1).

thumbnail
Fig 3. Evolutionary rate and the time of most recent common ancestor (tMRCA) of three gene segments in SFTSV by three countries.

(A) The Evolutionary rate and (B) tMRCA of Long (L), Medium (M), Short (S) gene segments were summarized by median and 95% HPD.

https://doi.org/10.1371/journal.pntd.0011630.g003

thumbnail
Table 1. The estimates of the evolutionary rate and the time to the most recent common ancestor (tMRCA) in the phylogenetic analysis for SFTSV.

https://doi.org/10.1371/journal.pntd.0011630.t001

Our study observed 22 genetic reassortment events between the three gene segments in 18 SFTSVs (Fig 4 and S2 Table). All reassortment events were detected only in Chinese SFTSVs sampled from humans between 2011 to 2016. The reassortment events between the L and M gene segments (L-M reassortant) were exclusively observed in one subclade belonging to genotype B in Clade 2 (Fig 4A). While the 8 L-M reassortant strains switched their gene segments from the neighboring subclade, two functional subdomains in the L gene did not show any amino acid substitution. However, the Gc domain in the M gene obtained one amino acid substitution (M-Gc: Q297P). The reassortment events of the S gene with the L (L-S reassortant) or M (M-S reassortant) genes were widely observed in both Clades 1 and 2 (Fig 4B and 4C). All M-S reassortant (6/6) and 75% of the L-S reassortant SFTSVs (6/8) switched their S gene segment with SFTSVs belonging to other genotypes. We did not observe any generation of a subclade originating from either L-S or M-S reassortants.

thumbnail
Fig 4. Reassortment between pairs of three gene segments in SFTSV.

(A) Reassortment of the L and M gene, (B) the L and S gene (C) the M and S gene segments. The red line between phylogeny depicted gene reassortment between two genes. The brackets in the y-axis display the genotype classification by Fu et al (2016).

https://doi.org/10.1371/journal.pntd.0011630.g004

Natural selection profiles of the SFTSV gene segments

A large fraction of the three gene segments of SFTSVs harbored amino acid residues under purifying selection and showed significant heterogeneity of local selection pressure by genetic regions. All estimates of the global dN/dS ratios in the six protein-coding regions (L gene, RdRp, and C-terminus; M gene, Gn and Gc; and S gene, NP and NS) in the L, M, and S genes ranged from 0.03 to 0.13, which were much lower than 1 (Table 2). Accordingly, the fractions of amino acid sites under statistically significant purifying selection varied from 61.77% (NS in the S gene) to 81.1% (c-terminus of the L gene). However, at least 1 to 5 amino acid residues were found to be under positive selection pressure across the six protein-coding regions. The Gn domain in the M gene had the highest number of amino acid residues (n = 5) under positive selection. The RNSC method revealed that synonymous substitution rates were 3 to 12 times higher than their non-synonymous substitution rates in the three gene segments (Table 2). The NS in the S gene segment showed the highest non-synonymous substitution rate (1.14 x 10−4 substitutions/site/year) whereas the NP showed the highest synonymous substitution rate (3.98 x 10−4 substitutions/site/year) among all protein-coding domains.

thumbnail
Table 2. Molecular selection pressure of the SFTSV genomes.

https://doi.org/10.1371/journal.pntd.0011630.t002

Discussion

Since the early 2010s, human infection of SFTSV has been mainly reported in China, Japan, and South Korea. Our phylogenies of three gene segments uniformly depicted that endemic SFTSVs have mainly caused human infection within limited local areas. However, even with the genetic homogeneity of endemic SFTSVs by countries, we occasionally observed polyphyletic genealogy in the phylogenies likely depicting the long-distance spread of endemic SFTSVs among China, Japan, and South Korea [27,30], but not frequent as the phylogenies of influenza virus or SARS-CoV-2 with highly intensive international spread [45,46]. Chinese SFTSV genotype B was most prevalent on Zhoushan Island located in the East China Sea but also caused human infection in South Korea and Japan [27]. Furthermore, Japanese endemic genotype B and genotype A, C, D, and F were detected in human SFTS cases in South Korea. As previous studies uniformly addressed [2325], the close genetic relatedness of SFTSVs among humans, ticks, and non-human mammals in our study supported that the vector-borne spillover of SFTSVs by ticks would play a key role in human infection from livestock or wildlife reservoirs (S1 Fig). With the limited number of reports of patient-to-patient SFTSV transmission through the direct contact of blood or organ [47,48], long-distance movement of sub-clinically infected non-human reservoirs and/or vectors, such as migratory wild birds or imported ticks, might be plausible routes of the international spread of SFTSVs [27]. Consequently, non-human reservoirs and/or vectors would likely be critical sentinels to rapidly capture the introduction of nonendemic SFTSVs among the three countries.

Using the complete genomic sequences of SFTSVs, our phylogenetic analyses found the heterogeneity in the evolutionary characteristics of SFTSVs by gene segments. Longer between-genotype branches than within-genotype branches in our phylogenies likely explained that the between-genotype genetic variation would be much higher than within-genotype variation, especially in the M gene, and the accumulation of unusual within-genotype mutations would be limited compared to the between-genotype (or genotype-defining) mutations. Subsequently, we estimated the lower evolutionary rates and older divergence times of endemic SFTSVs in three countries than those in Fu et al [27], especially in the L and M genes. The ancestral SFTSV appeared to diverge into the C1 and C2 lineages around 500 years (L gene) or 900 (M gene) ago and the C2 lineage subsequently diverged into Japanese and Chinese—Korean endemic genotype B sub-lineages around 450 years ago (L and M genes) (Fig 2). However, we would carefully interpret the genealogy because highly heterogeneous evolutionary rates by gene regions (e.g., within-genotype vs between-genotype mutations) within the L and M genes might have lowered the average evolutionary rates.

Our selection profiles found that all three gene segments under intense purifying selection (dN/dS ratios = 0.03 to 0.13), which is close to the estimates of flaviviruses with one of the lowest dN/dS ratios (0.01 to 0.14) among RNA viruses [49]. It might indicate that most nonsynonymous mutations were deleterious, and the key mutations were very limitedly fixed in all three gene segments [49,50]. Furthermore, nonendemic SFTSVs generally disappeared after the introduction into other geographical regions failing to generate their own subclade, which possibly explained that the genetic turnover by the introduction of nonendemic SFTSVs would also be very limited. The genetic reassortments also appeared to be deleterious for SFTSVs. Our study observed genetic reassortment events in less than 5% of our SFTSV samples which was a similar result to the previous study (6.3%) [27]. The L-S and M-S reassortant strains widely switched their S gene segment with other genotypes, but all these strains failed to fix the reassortment trait and rapidly disappeared. The L-M reassortant strains appeared to form their own subclade, but they showed narrow reassortment compatibility because the L-M reassortant strains were limitedly observed between two genetically close subclades within the same genotype. Consequently, the epidemiological changes of human SFTS in three countries since 2010 (Fig 1) might not be followed by the emergence of novel SFTSV variants nor a substantial evolutionary change considering similar genealogy of SFTSVs by genotypes and evidence of intense purifying selection. On the other hand, the change of transmission dynamics of non-human reservoirs and/or vectors, such as tick and wildlife ecology or environmental condition through climate change [51], or the better SFTS detection, control and preventive measures might be more plausible factors to promote the current changes of human SFTS epidemiology.

Under intense purifying selection in the SFTSV evolution, our study found a few positive selection residues. The Gn region in the M gene harbored the largest number of amino acid residues under positive selection (residues 18, 151, 279, and 304) and showed the second highest non-synonymous substitution rate among six genetic regions. In addition, the Gc region in the M gene had two amino acid residues under positive selection (residues 77 and 398) and one fixed amino acid substitution through the L-M reassortment event (Q297P). Notably, the Gn and Gc regions in the M gene encode two components of the surface protein for viral adsorption and entry [52] and possibly played a key role to maintain the genetic diversity of SFTSVs. The recent study of the Gn region and human neutralizing antibodies revealed that domain III of the Gn head part is a critical region recognized by the host’s antibodies [53]. Interestingly, half (2/4) of the positively selected sites in the Gn localized in domain III (residues 249–313), probably indicating the action of selective pressure on the such immunodominant region, like other phleboviral Gn protein [54]. The NS region in the S gene had a relatively higher dN/dS ratio (0.13) and harbored fewer amino acid residues under negative selection (61.77%) than the other gene regions (Table 2). This finding agrees with the study of Lam et al. [55], suggesting that the NS region possibly be a preferential target for positive selection because the NS protein suppresses host antiviral immunity through the type I interferon (IFN) system in SFTSV’s infection [56]. Generally, infectious pathogens are forced to compete for their fitness among multiple strains [57,58]. However, SFTSV maintained genetic diversity through the cocirculation of multiple lineages over the evolutionary history with very limited extinction of genotypes [54]. Most vector-borne viral diseases, such as dengue or West Nile virus, typically promoted the cocirculation of multiple lineages through antibody-dependent enhancement (ADE) [59]. Despite limited reports of ADE by re-infection compared to other vector-borne diseases, our study suspected that ADE possibly drove to maintain the genetic diversity and allow cocirculation of multiple lineages [60].

Although we conducted a large-scale phylogenetic study to investigate the evolutionary characteristics of SFTSVs, we still acknowledge the importance of systematic data collection and the development of statistical approaches. Although our study efficiently collected more genetic sequences of SFTSV from the BV-BRC than the previous study [27], we could additionally obtain genetic sequences from the previous research publication (e.g., human SFTSVs from the outbreak in Japan from 2013 to 2014) [30]. The development of genetic resources repository systematically collecting genetic sequences from research publications as well as voluntary submission will improve the efficiency of genetic data collection and help to minimize geographical and/or temporal sampling bias in the phylogenetic estimation. In addition, the local selection pressure profiles might be inconsistently measured due to statistical sensitivity, genetic diversity of alignment, and length of gene segments [29,42,49,55,61]. Our study was also uncertain about how complex genetic mutations such as gene recombination influenced the SFTSV evolution [55,62,63]. In addition, the present study could not explore how key substitutions in the amino acid residues under positive selection functionally influenced the molecular evolution of SFTSVs. We expected that the reverse genetics incorporating serological approaches enables clarification of the implication of site-specific positive selection profiles in the immune-related genes (i.e., the NS and Gn/Gc), as demonstrated in other RNA viruses [64,65].

Our large-scale re-evaluation of SFTSV evolution facilitates step-changes in our understanding of SFTSV evolution and epidemiology through large-scale phylogenetic analyses using complete genome sequences. The origin of SFTSVs between China, Japan, and South Korea appeared to be far more ancient than previously known by maintaining their own geographical heterogeneity with infrequent international spread through non-human reservoirs and/or vectors. Furthermore, we characterized how purifying and positive selection profiles shaped the evolutionary dynamics of SFTSVs. Considering all, this study provides meaningful insight to better monitor and control the current cases of human SFTSV infection and to prepare for a potential subsequent generation of human-adaptive variants.

Supporting information

S1 Fig. Phylogenetic relationship among human, tick and non-human mammal host of SFTSV by (A) L, (B) M, and (C) S gene segments.

In the MCC trees, tick and non-human mammal SFTSVs are labelled with red stars and blue squares. Human SFTSVs have no labels.

https://doi.org/10.1371/journal.pntd.0011630.s001

(TIF)

S1 Table. The number of SFTSV sequences by the gene segment, collection country and host species.

https://doi.org/10.1371/journal.pntd.0011630.s002

(DOCX)

S2 Table. The complete list of reassortment events between each pair of Long (L), Medium (M) and Short (S) gene segment of SFTSV.

https://doi.org/10.1371/journal.pntd.0011630.s003

(DOCX)

S1 Data. The complete genome sequence sets (L, M, and S genes) used in this study.

https://doi.org/10.1371/journal.pntd.0011630.s004

(ZIP)

Acknowledgments

The authors thank all data contributors (the authors and their laboratories) responsible for obtaining the specimens and for generating the genetic sequences and sharing via the Bacterial and Viral Bioinformatics. Resource Center (BV-BRC) (https://www.bv-brc.org/), on which this research is based.

References

  1. 1. Yu X-J, Liang M-F, Zhang S-Y, Liu Y, Li J-D, Sun Y-L, et al. Fever with Thrombocytopenia Associated with a Novel Bunyavirus in China. New England Journal of Medicine. 2011;364: 1523–1532. pmid:21410387
  2. 2. Walker PJ, Siddell SG, Lefkowitz EJ, Mushegian AR, Adriaenssens EM, Alfenas-Zerbini P, et al. Changes to virus taxonomy and to the International Code of Virus Classification and Nomenclature ratified by the International Committee on Taxonomy of Viruses (2021). Arch Virol. 2021;166: 2633–2648. pmid:34231026
  3. 3. Vogel D, Thorkelsson SR, Quemin ERJ, Meier K, Kouba T, Gogrefe N, et al. Structural and functional characterization of the severe fever with thrombocytopenia syndrome virus L protein. Nucleic Acids Research. 2020;48: 5749–5765. pmid:32313945
  4. 4. Hofmann H, Li X, Zhang X, Liu W, Kühl A, Kaup F, et al. Severe Fever with Thrombocytopenia Virus Glycoproteins Are Targeted by Neutralizing Antibodies and Can Use DC-SIGN as a Receptor for pH-Dependent Entry into Human and Animal Cell Lines. Journal of Virology. 2013;87: 4384–4394. pmid:23388721
  5. 5. Zhang S, Zheng B, Wang T, Li A, Wan J, Qu J, et al. NSs protein of severe fever with thrombocytopenia syndrome virus suppresses interferon production through different mechanism than Rift Valley fever virus. Acta Virol. 2017;61: 289–298. pmid:28854793
  6. 6. Rezelj VV, Li P, Chaudhary V, Elliott RM, Jin D-Y, Brennan B. Differential Antagonism of Human Innate Immune Responses by Tick-Borne Phlebovirus Nonstructural Proteins. mSphere. 2017;2: e00234–17. pmid:28680969
  7. 7. Huang X, Li J, Li A, Wang S, Li D. Epidemiological Characteristics of Severe Fever with Thrombocytopenia Syndrome from 2010 to 2019 in Mainland China. International Journal of Environmental Research and Public Health. 2021;18: 3092. pmid:33802869
  8. 8. Miao D, Dai K, Zhao G-P, Li X-L, Shi W-Q, Zhang JS, et al. Mapping the global potential transmission hotspots for severe fever with thrombocytopenia syndrome by machine learning methods. Emerging Microbes & Infections. 2020;9: 817–826. pmid:32212956
  9. 9. Yoo JR, Heo ST, Song SW, Bae SG, Lee S, Choi S, et al. Severe Fever with Thrombocytopenia Syndrome Virus in Ticks and SFTS Incidence in Humans, South Korea. Emerg Infect Dis. 2020;26: 2292–2294. pmid:32818414
  10. 10. Kobayashi Y, Kato H, Yamagishi T, Shimada T, Matsui T, Yoshikawa T, et al. Severe Fever with Thrombocytopenia Syndrome, Japan, 2013–2017. Emerg Infect Dis. 2020;26: 692–699. pmid:32186502
  11. 11. Takahashi T, Maeda K, Suzuki T, Ishido A, Shigeoka T, Tominaga T, et al. The first identification and retrospective study of severe fever with thrombocytopenia syndrome in Japan. The Journal of infectious diseases. 2014;209: 816–827. pmid:24231186
  12. 12. Tran XC, Yun Y, Van An L, Kim S-H, Thao NTP, Man PKC, et al. Endemic Severe Fever with Thrombocytopenia Syndrome, Vietnam. Emerg Infect Dis. 2019;25: 1029–1031. pmid:31002059
  13. 13. Li H, Lu Q-B, Xing B, Zhang S-F, Liu K, Du J, et al. Epidemiological and clinical features of laboratory-diagnosed severe fever with thrombocytopenia syndrome in China, 2011–17: a prospective observational study. The Lancet Infectious Diseases. 2018;18: 1127–1137. pmid:30054190
  14. 14. Crump A, Tanimoto T. Severe fever with thrombocytopenia syndrome: Japan under threat from life-threatening emerging tick-borne disease. JMA journal. 2020;3: 295–302. pmid:33225100
  15. 15. Chen QL, Zhu MT, Chen N, Yang D, Yin WW, Mu D, et al. Epidemiological characteristics of severe fever with thtrombocytopenia syndrome in China, 2011–2021. Chin J of Epi. 2022;43: 852–859. pmid:35725341
  16. 16. National Institute of Infectious Diseases, Japan. Summary of SFTS cases reported in the surveillance of infectious diseases. 2023. https://www.niid.go.jp/niid/ja/diseases/sa/sfts.html.
  17. 17. Korea Disease Control and Prevention Agency. National infectious diseases surveillance data of South Korea—severe fever with thrombocytopenia syndrome. 14 Jun 2023. https://npt.kdca.go.kr/npt/biz/npp/ist/bass/bassDissStatsMain.do.
  18. 18. Lin T-L, Ou S-C, Maeda K, Shimoda H, Chan JP-W, Tu W-C, et al. The first discovery of severe fever with thrombocytopenia syndrome virus in Taiwan. Emerging Microbes & Infections. 2020;9: 148–151. pmid:31918622
  19. 19. McMullan LK, Folk SM, Kelly AJ, MacNeil A, Goldsmith CS, Metcalfe MG, et al. A New Phlebovirus Associated with Severe Febrile Illness in Missouri. New England Journal of Medicine. 2012;367: 834–841. pmid:22931317
  20. 20. Li Z, Bao C, Hu J, Liu W, Wang X, Zhang L, et al. Ecology of the Tick-Borne Phlebovirus Causing Severe Fever with Thrombocytopenia Syndrome in an Endemic Area of China. PLOS Neglected Tropical Diseases. 2016;10: e0004574. pmid:27035712
  21. 21. Zhuang L, Sun Y, Cui X-M, Tang F, Hu J-G, Wang L-Y, et al. Transmission of Severe Fever with Thrombocytopenia Syndrome Virus by Haemaphysalis longicornis Ticks, China. Emerg Infect Dis. 2018;24: 868–871. pmid:29664718
  22. 22. Hu Y-Y, Zhuang L, Liu K, Sun Y, Dai K, Zhang X-A, et al. Role of three tick species in the maintenance and transmission of Severe Fever with Thrombocytopenia Syndrome Virus. PLOS Neglected Tropical Diseases. 2020;14: e0008368. pmid:32520966
  23. 23. Kang J-G, Cho Y-K, Jo Y-S, Chae J-B, Oh S-S, Kim K-H, et al. Prevalence of severe fever with thrombocytopenia syndrome virus in black goats (Capra hircus coreanae) in the Republic of Korea. Ticks and Tick-borne Diseases. 2018;9: 1153–1157. pmid:29724620
  24. 24. Luo L-M, Zhao L, Wen H-L, Zhang Z-T, Liu J-W, Fang L-Z, et al. Haemaphysalis longicornis Ticks as Reservoir and Vector of Severe Fever with Thrombocytopenia Syndrome Virus in China. Emerg Infect Dis. 2015;21: 1770–1776. pmid:26402039
  25. 25. Lee S-H, Kim H-J, Byun J-W, Lee M-J, Kim N-H, Kim D-H, et al. Molecular detection and phylogenetic analysis of severe fever with thrombocytopenia syndrome virus in shelter dogs and cats in the Republic of Korea. Ticks and Tick-borne Diseases. 2017;8: 626–630. pmid:28442241
  26. 26. Lee S-H, Kim H-J, Lee M-J, Byun J-W, Kim D-Y, Kim N-H, et al. Prevalence of antibodies against severe fever with thrombocytopenia syndrome virus in shelter dogs in the Republic of Korea. Ticks Tick Borne Dis. 2018;9: 183–187. pmid:28899663
  27. 27. Fu Y, Li S, Zhang Z, Man S, Li X, Zhang W, et al. Phylogeographic analysis of severe fever with thrombocytopenia syndrome virus from Zhoushan Islands, China: implication for transmission across the ocean. Sci Rep. 2016;6: 19563. pmid:26806841
  28. 28. Li Z, Hu J, Cui L, Hong Y, Liu J, Li P, et al. Increased Prevalence of Severe Fever with Thrombocytopenia Syndrome in Eastern China Clustered with Multiple Genotypes and Reasserted Virus during 2010–2015. Sci Rep. 2017;7: 6503. pmid:28747674
  29. 29. Liu J-W, Zhao L, Luo L-M, Liu M-M, Sun Y, Su X, et al. Molecular Evolution and Spatial Transmission of Severe Fever with Thrombocytopenia Syndrome Virus Based on Complete Genome Sequences. PLOS ONE. 2016;11: e0151677. pmid:26999664
  30. 30. Yoshikawa T, Shimojima M, Fukushi S, Tani H, Fukuma A, Taniguchi S, et al. Phylogenetic and Geographic Relationships of Severe Fever With Thrombocytopenia Syndrome Virus in China, South Korea, and Japan. J Infect Dis. 2015;212: 889–898. pmid:25762790
  31. 31. Katoh K, Misawa K, Kuma K, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002;30: 3059–3066. pmid:12136088
  32. 32. Drummond AJ, Suchard MA, Xie D, Rambaut A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol Biol Evol. 2012;29: 1969–1973. pmid:22367748
  33. 33. Darriba D, Taboada GL, Doallo R, Posada D. jModelTest 2: more models, new heuristics and parallel computing. Nat Methods. 2012;9: 772–772. pmid:22847109
  34. 34. Tavaré S. Some probabilistic and statistical problems in the analysis of DNA sequences. Lectures on mathematics in the life sciences. 1986;17: 57–86.
  35. 35. Rambaut A, Drummond AJ, Xie D, Baele G, Suchard MA. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Systematic biology. 2018;67: 901–904. pmid:29718447
  36. 36. Team RC. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org/. 2013.
  37. 37. RStudio Team. RStudio: integrated development for R. Rstudio Team, PBC, Boston, MA. 2020. http://www.rstudio.com.
  38. 38. Nagarajan N, Kingsford C. GiRaF: robust, computational identification of influenza reassortments via graph mining. Nucleic acids research. 2011;39: e34–e34. pmid:21177643
  39. 39. Ronquist F, Teslenko M, Van Der Mark P, Ayres DL, Darling A, Höhna S, et al. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Systematic biology. 2012;61: 539–542. pmid:22357727
  40. 40. Pond SLK, Frost SDW. Datamonkey: rapid detection of selective pressure on individual sites of codon alignments. Bioinformatics. 2005;21: 2531–2533. pmid:15713735
  41. 41. Murrell B, Moola S, Mabona A, Weighill T, Sheward D, Kosakovsky Pond SL, et al. FUBAR: A Fast, Unconstrained Bayesian AppRoximation for Inferring Selection. Molecular Biology and Evolution. 2013;30: 1196–1205. pmid:23420840
  42. 42. Murrell B, Wertheim JO, Moola S, Weighill T, Scheffler K, K Pond SL. Detecting Individual Sites Subject to Episodic Diversifying Selection. PLOS Genetics. 2012;8: e1002764. pmid:22807683
  43. 43. Weaver S, Shank SD, Spielman SJ, Li M, Muse SV, Kosakovsky Pond SL. Datamonkey 2.0: a modern web application for characterizing selective and other evolutionary processes. Molecular biology and evolution. 2018;35: 773–777. pmid:29301006
  44. 44. Lemey P, Minin VN, Bielejec F, Kosakovsky Pond SL, Suchard MA. A counting renaissance: combining stochastic mapping and empirical Bayes to quickly detect amino acid sites under positive selection. Bioinformatics. 2012;28: 3248–3256. pmid:23064000
  45. 45. Kim JI, Lee I, Park S, Bae J-Y, Yoo K, Cheong HJ, et al. Phylogenetic relationships of the HA and NA genes between vaccine and seasonal influenza A(H3N2) strains in Korea. PLOS ONE. 2017;12: e0172059. pmid:28257427
  46. 46. Bedford T, Greninger AL, Roychoudhury P, Starita LM, Famulare M, Huang M-L, et al. Cryptic transmission of SARS-CoV-2 in Washington state. Science. 2020;370: 571–575. pmid:32913002
  47. 47. Kim WY, Choi W, Park S-W, Wang EB, Lee W-J, Jee Y, et al. Nosocomial Transmission of Severe Fever With Thrombocytopenia Syndrome in Korea. Clinical Infectious Diseases. 2015;60: 1681–1683. pmid:25694652
  48. 48. Jiang XL, Zhang S, Jiang M, Bi ZQ, Liang MF, Ding SJ, et al. A cluster of person-to-person transmission cases caused by SFTS virus in Penglai, China. Clin Microbiol Infect. 2015;21: 274–279. pmid:25687766
  49. 49. Lin J-J, Bhattacharjee MJ, Yu C-P, Tseng YY, Li W-H. Many human RNA viruses show extraordinarily stringent selective constraints on protein evolution. Proceedings of the National Academy of Sciences. 2019;116: 19009–19018. pmid:31484772
  50. 50. Hughes AL, Hughes MAK. More effective purifying selection on RNA viruses than in DNA viruses. Gene. 2007;404: 117–125. pmid:17928171
  51. 51. Gilbert L. The Impacts of Climate Change on Ticks and Tick-Borne Disease Risk. Annual Review of Entomology. 2021;66: 373–388. pmid:33417823
  52. 52. Sun Y, Qi Y, Liu C, Gao W, Chen P, Fu L, et al. Nonmuscle Myosin Heavy Chain IIA Is a Critical Factor Contributing to the Efficiency of Early Infection of Severe Fever with Thrombocytopenia Syndrome Virus. Journal of Virology. 2014;88: 237–248. pmid:24155382
  53. 53. Wu Y, Zhu Y, Gao F, Jiao Y, Oladejo BO, Chai Y, et al. Structures of phlebovirus glycoprotein Gn and identification of a neutralizing antibody epitope. Proceedings of the National Academy of Sciences. 2017;114: E7564–E7573. pmid:28827346
  54. 54. Allen ER, Krumm SA, Raghwani J, Halldorsson S, Elliott A, Graham VA, et al. A Protective Monoclonal Antibody Targets a Site of Vulnerability on the Surface of Rift Valley Fever Virus. Cell Reports. 2018;25: 3750–3758.e4. pmid:30590046
  55. 55. Lam TT-Y, Liu W, Bowden TA, Cui N, Zhuang L, Liu K, et al. Evolutionary and molecular analysis of the emergent severe fever with thrombocytopenia syndrome virus. Epidemics. 2013;5: 1–10. pmid:23438426
  56. 56. Ning Y-J, Feng K, Min Y-Q, Cao W-C, Wang M, Deng F, et al. Disruption of Type I Interferon Signaling by the Nonstructural Protein of Severe Fever with Thrombocytopenia Syndrome Virus via the Hijacking of STAT2 and STAT1 into Inclusion Bodies. J Virol. 2015;89: 4227–4236. pmid:25631085
  57. 57. Nowak MA, May RM. Superinfection and the evolution of parasite virulence. Proceedings of the Royal Society of London Series B: Biological Sciences. 1994;255: 81–89. pmid:8153140
  58. 58. Rohani P, Green CJ, Mantilla-Beniers NB, Grenfell BT. Ecological interference between fatal diseases. Nature. 2003;422: 885–888. pmid:12712203
  59. 59. Holmes EC, Twiddy SS. The origin, emergence and evolutionary genetics of dengue virus. Infection, Genetics and Evolution. 2003;3: 19–28. pmid:12797969
  60. 60. Lv S-M, Yuan C, Zhang L, Wang Y-N, Dai Z-N, Yang T, et al. Severe fever with thrombocytopenia syndrome with re-infection in China: a case report. Infectious Diseases of Poverty. 2021;10: 90. pmid:34183070
  61. 61. Chen J, Sun Y. Variation in the Analysis of Positively Selected Sites Using Nonsynonymous/Synonymous Rate Ratios: An Example Using Influenza Virus. PLOS ONE. 2011;6: e19996. pmid:21629696
  62. 62. Kim JI, Park S, Lee I, Park KS, Kwak EJ, Moon KM, et al. Genome-Wide Analysis of Human Metapneumovirus Evolution. PLOS ONE. 2016;11: e0152962. pmid:27046055
  63. 63. Rezelj VV, Mottram TJ, Hughes J, Elliott RM, Kohl A, Brennan B. M Segment-Based Minigenomes and Virus-Like Particle Assays as an Approach To Assess the Potential of Tick-Borne Phlebovirus Genome Reassortment. J Virol. 2019;93: e02068–18. pmid:30567991
  64. 64. Meng J, Lee S, Hotard AL, Moore ML. Refining the Balance of Attenuation and Immunogenicity of Respiratory Syncytial Virus by Targeted Codon Deoptimization of Virulence Genes. mBio. 2014;5: e01704–14. pmid:25249281
  65. 65. Kim JI, Lee I, Park S, Lee S, Hwang M-W, Bae J-Y, et al. Effects of a hemagglutinin D222G substitution on the pathogenicity of 2009 influenza A (H1N1) virus in mice. Arch Virol. 2014;159: 2559–2565. pmid:24824345