Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

BA9 lineage of respiratory syncytial virus from across the globe and its evolutionary dynamics

  • Md Shakir Hussain Haider,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Resources, Software, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi, India

  • Wajihul Hasan Khan,

    Roles Resources

    Affiliations Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi, India, Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi, India

  • Farah Deeba,

    Roles Resources

    Affiliation Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi, India

  • Sher Ali,

    Roles Writing – review & editing

    Affiliation Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi, India

  • Anwar Ahmed,

    Roles Formal analysis

    Affiliations Department of Biochemistry, College of Science, King Saud University, Riyadh, Saudi Arabia, Centre for Excellence in Biotechnology Research, Department of Biochemistry, College of Science, King Saud University, Riyadh, Saudi Arabia

  • Irshad H. Naqvi,

    Roles Resources

    Affiliation Dr. M. A. Ansari Health Centre, Jamia Millia Islamia, New Delhi, India

  • Ravins Dohare,

    Roles Resources

    Affiliation Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi, India

  • Hytham A. Alsenaidy,

    Roles Resources

    Affiliation College of Medicine, Shaqra University, Shaqra, Saudi Arabia

  • Abdulrahman M. Alsenaidy,

    Roles Resources

    Affiliation Department of Biochemistry, College of Science, King Saud University, Riyadh, Saudi Arabia

  • Shobha Broor,

    Roles Resources

    Affiliation Department of Microbiology, Faculty of Medicine and Health Science, Shree Guru Gobind Singh Tricentenary University, Gurgaon, Haryana, India

  • Shama Parveen

    Roles Conceptualization, Formal analysis, Funding acquisition, Investigation, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    sparveen2@jmi.ac.in, shamp25@yahoo.com

    Affiliation Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi, India

Abstract

Respiratory syncytial virus (RSV) is an important pathogen of global significance. The BA9 is one of the most predominant lineages of the BA genotype of group B RSV that has acquired a 60bp duplication in its G protein gene. We describe the local and global evolutionary dynamics of the second hyper variable region in the C- terminal of the G protein gene of the BA9 lineage. A total of 418 sequences (including 31 study and 387 GenBank strains) from 29 different countries were used for phylogenetic analysis. This analysis showed that the study strains clustered with BA (BA9 and BA8) and SAB4 genotype of group B RSV. We performed time-scaled evolutionary clock analyses using Bayesian Markov chain Monte Carlo methods. We also carried out glycosylation, selection pressure, mutational, entropy and Network analyses of the BA9 lineage. The time to the most recent common ancestor (tMRCA) of the BA genotype and BA9 lineage were estimated to be the years 1995 (95% HPD; 1987–1997) and 2000 (95% HPD; 1998–2001), respectively. The nucleotide substitution rate of the BA genotype [(4.58×10−3 (95% HPD; 3.89–5.29×10−3) substitution/site/year] was slightly faster than the BA9 lineage [4.03×10−3 (95% HPD; 4.65–5.2492×10−3)]. The BA9 lineage was categorized into 3 sub lineages (I, II and III) based on the Bayesian and Network analyses. The local transmission pattern suggested that BA9 is the predominant lineage of BA viruses that has been circulating in India since 2002 though showing fluctuations in its effective population size. The BA9 lineage established its global distribution with report from 23 different countries over the past 16 years. The present study augments our understanding of RSV infection, its epidemiological dynamics warranting steps towards its overall global surveillance.

Introduction

Respiratory Syncytial Virus (RSV) is the most frequent respiratory pathogen of severe respiratory tract infection in infants and children worldwide and is the major cause of hospitalization [1]. RSV is an enveloped virus with a non-segmented, negative-sense, single stranded RNA genome of approximately 15.2 Kb in length. The G protein is a type II integral trans-membrane glycoprotein that shows frequent divergence at the nucleotide and amino acid levels [2]. Two antigenic groups of RSV (A and B) have been identified based on the cross-reactivity of antibodies with the G protein [3, 4]. Both the groups were categorized into several genotypes on the basis of genetic divergence within the second hypervariable region (2nd HVR) of G protein gene. Group A have been categorized as GA1–GA7, SAA1-SAA2, NA1–NA4 and ON1 genotypes [511]. Group B includes, GB1–GB6, SAB1–SAB4 and BA1–BA12 genotypes [5, 7, 1217]. More than one genotypes or lineages of both the groups can co-circulate in a community, while new genotypes or lineages can dominate over the previous ones [18].

A new genotype of RSV was identified in Buenos Aires, Argentina in 1999 with a duplication of 60bp in the 2nd HVR of G protein gene [19]. This region plays an important role accommodating selective pressure during the course of evolution. Consequently, it results in the enhanced fitness of the viral population leading to its efficient global transmission including detection in India [18, 2025]. The first report of BA genotype from New Delhi, India was described by our group during the years 2001–05 [26]. Further, an investigation from Kolkata, West Bengal during the years 2005–08 suggested that all the RSV B strains belonged to the BA genotype [27]. It was found that the BA genotype dominated in Pune, Maharashtra during the years 2009–12 [23]. Consequently, the BA viruses from New Delhi attracted our attention during the years 2007–10 [28] and we continued our work on BA genotype throughout the years 2011–15. Interestingly, BA9 was the dominant lineage of the BA genotype identified in our study. Therefore, the aim of the present study was to gain insight into the local and global evolutionary phylodynamics of the BA9 lineage of the BA genotype over the past 16 years. We focused on the second hypervariable region of the G protein gene for Phylogenetic, Bayesian and Network analyses. Such large scale analysis of the global data is envisaged to be useful for undertaking appropriate control measures including possible vaccine development.

Materials and methods

Ethical clearance

This study was approved by the Institutional Ethics Committee, Jamia Millia Islamia, New Delhi, India. The study was carried out in accordance with the World Medical Association of Helsinki. Written informed consent was taken in English/Hindi from the parents/guardians of all the pediatric patients before sample collection.

Clinical samples

Nasopharyngeal/throat swabs were collected from the children less or equal to 5 years of age with acute respiratory infection (ARI) symptoms as defined by WHO [29]. A total of 500 nasopharyngeal/throat swabs were collected from the symptomatic pediatric patients attending the Out Patient Department (OPD) of Dr. M. A. Ansari Health Centre, Jamia Millia Islamia, New Delhi by the Pediatrician.

Detection of RSV by real-time PCR

All the samples were tested for RSV by real-time PCR. The RNA was extracted from 350μl of nasopharyngeal/throat swabs using RNeasy Mini Kit (Qiagen GmbH, Hilden, Germany) as per the manufacturer’s instructions. The extracted RNA was used for real-time PCR for detection and RT-PCR for characterization. The real-time PCR for the large (L) polymerase gene was carried out using one step real time polymerase chain reaction by SYBR® green chemistry using published primers and cycling conditions [30]. The PCR employed the following thermal cycler conditions: 30 min at 50°C, 15 min at 95°C, followed by 50 cycles of 30 sec at 95°C, 30 sec at 55°C and 30 sec at 72°C. This was followed by melt curve stage analysis using the conditions: 15 sec at 95°C, 1 min at 60°C and 15 sec at 95°C.

RT-PCR for the 2nd HVR of G protein gene

The cDNA was synthesized from the extracted RNA using High capacity cDNA Reverse Transcription Kits (Applied Biosystem), according to the manufacturer’s protocol. The reaction was performed in a 10μl volume containing 5μl template RNA and 5μl 2X RT master mixture. The reaction was carried out in four steps: 10 min at 25°C, 120 min at 37°C, and 5 min at 85°C followed by incubation at 4°C. The cDNA was stored at -20°C and was used for RT-PCR for the 2nd HVR of G protein gene. Positive samples were subjected to RT-PCR for sub grouping. External and semi nested PCR was done for the 2nd HVR of G protein gene of RSVB using published primers and cycling conditions [20, 26]. The amplicons size of external PCR was 607bp and 670bp for group B and BA genotype. The nested PCR amplicons were of 585/645bp for group B and BA genotype, respectively. The PCR products were run on 2% agarose gel and were visualized with Dolphin-Doc Plus Gel Image system (Wealtec corp, UK).

DNA sequencing and alignment

The PCR products were extracted from the agarose gel and purified with the Nucleo-pore Sure Extract PCR Clean-up/Gel Extraction Kit (Genetix Biotech Asia Pvt. Ltd New Delhi, India), as per manufacturer’s instructions. The amplicons were sequenced commercially (Applied Bio systems Inc., Foster City, CA, USA) using the published primer BG517 as the forward primer and F164 as the reverse primer [26]. Sequence chromatograms were analyzed with BioEdit version 7.0.9.0 [31] and manually edited in GeneDoc software version 2.7 [32] to resolve nucleotide ambiguities. The RSV sequences were further confirmed by BLAST (http://www.ncbi.nlm.nih.gov/BLAST/).

DNA sequencing and GenBank accession numbers

Partial nucleotide sequences of the 2nd HVR of the G protein gene of 31 study group B RSV were determined. The sequences were submitted to GenBank (http://www.ncbi.nlm.nih.gov/GenBank/index.html) with the following accession numbers KY078406-KY078435 and KY649190.

Phylogenetic analysis

Nucleotide sequence of the 2nd HVR of the G protein gene was used for phylogenetic analysis. A multiple sequence alignment was performed for all the available group B RSV sequences with the help of the CLUSTAL W algorithm embedded in BioEdit software. A total of 418 sequences of group B RSV (including 96 strains from India with 31 studied sequences) were used for the phylogenetic analysis (S1 Table). Phylogenetic trees were constructed using the Neighbor-Joining method implemented in the software MEGA 6.06 [33]. Robustness of tree topology was accessed with 1,000 replicates and bootstrap values greater than 70% are shown on the branches of the consensus trees. The nucleotide and amino acid distances were calculated using the Kimura-2 parameter method [34].

Mutational analysis

The deduced amino acid sequences and the mutations in the study sequence (n = 28) were predicted by comparing with the prototype strain of the BA genotype that was reported from Argentina [19] (accession number: AY333364).

Glycosylation analysis

N-linked and O-linked amino acid glycosylation in the 2nd HVR of G protein gene of study sequences (n = 28) was predicted by the NetNGlyc 1.0 (http://www.cbs.dtu.dk/services/NetNGlyc/) server and the NetOGlyc 4.0 server (http://www.cbs.dtu.dk/services/NetOGlyc/) respectively (S1 Table).The potential N-linked glycosylation sites were defined as Asn-Xaa-Ser/Thr (where Xaa is not proline). Further, serine/threonine was the potential O-linked sugar acceptors.

Entropy analysis

The variation in amino acid positions of the 2nd HVR of G protein gene of group B RSV in the alignment dataset was analyzed by Shannon entropy plot implemented in BioEdit [35]. A total of 281 sequences of BA9 lineage (including 28 BA9 studied sequences) were used for this analysis (S1 Table). The threshold value for the Shannon entropy analysis was set to 0.2. The amino acid with entropy value of <0.2 and >0.2 were considering as a conserved and variable amino acids respectively as described earlier [25].

Selection pressure analysis

The selection pressure in the 2nd HVR of G protein gene of the BA9 lineage of the BA genotype was estimated by using the Datamonkey webserver [36] (http://www.datamonkey.org/). The dataset consisted of 281 sequences from different geographical regions including 28 BA9 strains sequenced in this study (S1 Table). The synonymous (dS), non-synonymous (dN) mutations, positively and negatively selected sites at every codon position were determined using the HKY85, F81, Trn93 and REV nucleotide substitution models by four different algorithms, Single Likelihood Ancestor Counting (SLAC), Fixed Effect Likelihood (FEL) and Internal Fixed Effects Likelihood (IFEL). The selection of sites under positive (dN > dS), neutral (dN = dS) and negative (dN < dS) were determined by cut-off p-value ≤0.2. If an amino acid site met cut-off criteria by at least two methods then it was considered as a positive site. The selection of positive sites was also calculated using the FUBAR algorithm [37]. To consider a positive site, posterior probability (β>α) ≥0.9 was taken for the FUBAR method.

Phylodynamic analysis

The Markov Chain Monte Carlo (MCMC) method implemented in the software BEAST v1.8.3 [38] was used to compute the time-scaled evolutionary relationship of group B RSV with a particular focus on the BA9 lineage. The study strains and the GenBank sequences from different geographical regions were utilized for the analysis. A total of 307 unique sequences (including study sequences) of group B RSV focusing on the BA9 lineage were used (S1 Table). PAUP 4.0 was used for inferring the phylogenetic tree [39] and Modeltest3.7 program to estimate the nucleotide substitution rate [40]. The best model, favoured by Bay’s factor was calculated by path sampling and stepping stone sampling method [41, 42]. Three clock models (strict, uncorrelated exponential, and uncorrelated lognormal) and four demographic model (constant size, exponential growth, expansion growth, and Bayesian skyline) were used for best fit clock model and demographic model respectively.

All the programs were run on a server provide by CDAC (http://www.cdac.in). The GTR + Invariant + gamma substitution model was selected by Modeltest3.7. Strict clock and exponential growth coalescent tree priors were used to calculate the tMRCA. The program BEAUti v1.8.3 was used for generating the xml file from the nexus file for further use in BEASTv1.8.3. MCMC chain was run for 200 million steps and sampled every 20,000 steps to achieve convergence, which was confirmed with effective sample size (ESS) using Tracer v1.6.0 (http://tree.bio.ed.ac.uk/software/tracer/). Only parameters with an ESS value ≥ 200 were accepted after 10% burn-in [41, 43]. Tree Annotator v1.8.3 was used to construct maximum clade credibility (MCC) trees with the maximum product of posterior probabilities, after removing the first 10% of trees as burn-in. FigTree v1.4.2 (http://tree.bio.ed.ac.uk/software/figtree/) program was used for visualization of the phylogenetic tree with lineages, divergence time scale and nucleotide substitution rate.

Bayesian skyline plot (BSP) analysis

The demographic changes of a distinct genotype from a particular geographical region were estimated by Bayesian skyline plot analysis in a single population size [44] using nucleotide sequence of the 2nd HVR of G protein gene. Bayesian skyline plots (BSPs) were generated to understand the impression of effective population size of Indian BA9 lineage (n = 65) with respect to time [38]. The details of these individual sequences are given in the Supplementary Table (S1 Table). The graphical interpretation illustrated the changes in the median estimation of relative hereditary diversity (Ne τ) of the virus with time (Ne and τ represents effective population size and generation time respectively). The MCMC method was used for this study implemented in the program BEASTv1.8.3. The generalized time reversible (GTR) was used for nucleotide substitution model selected by Modeltest3.7 mutually with strict clock model and Bayesian skyline coalescent model. MCMC chains were run with 100 million generations and sampled at every 10,000 steps. The skyline plot was generated from the MCMC output using Tracer v1.6.

Median joining network analysis

The evolutionary relationship between the homologous sequences is usually represented by classical phylogenetic tree based on maximum-likelihood analysis in a trouble-free graphical view. During the evolution process all organisms have passed through some hurdles like microevolution, convergent evolution, and recombination [45]. Therefore, as a result new species or genotypes evolved that are interconnected to each other in complex manner. Keeping these events in mind, for better representation, the Network analysis was performed for a dataset of 307 unique sequences from different geographical regions (S1 Table), using median joining algorithm implemented in Network v 5.0 software [46] (http://www.fluxus-engineering.com). The significance of every pairwise correlation was calculated using a Fisher’s exact test and a Bonferroni correction in DnaSP v. 5.10.01 [47]. This analysis investigates the intricate relationship among haplotypes based on single nucleotide polymorphisms (SNPs) with a minor allele frequency ≥2% in each continental region [48].

Results

RSV prevalence in New Delhi, India (2011–15)

All the children less than or equal to 5 years of age with acute respiratory tract infection (ARI) symptoms (WHO, 2005) attending the OPD of Dr. M.A. Ansari Health Centre, Jamia Millia Islamia, New Delhi, India were enrolled for the study. The clinical specimens (nasopharyngeal/throat swabs) were collected from the symptomatic patients by the Pediatrician during the 4 year study period from November 2011 to December 2015. Samples were collected for a period of 6 months during winter season from September 2011 to February 2012 and then for next three years in winter, since RSV is a seasonal disease. Demographic and clinical information about the patients were recorded in the proformas. The clinical symptoms of the patients such as fever, cough, sore throat, breathlessness/tachypnea, chills/rigor and nasal discharge were recorded. Further, RSV infection was correlated with the clinical symptoms, gender, age and positive/negative samples using statistical analysis [49]

Ninety three (18.6%) of the 500 samples collected were positive for RSV. Sixty six samples were positive for group B and 27 samples for group A RSV. Twelve samples were sequenced for group A and 31 for group B RSV. We describe the evolutionary dynamics of the group B RSV with a focus on the BA9 lineage in the present report.

Local transmission dynamics of the BA9 lineage in India

a) Indian BA9 lineage and mutational analysis.

Phylogenetic analysis of the 31 study sequences revealed that 29 strains clustered in BA genotype (28 BA9 and 1 BA8) and two strains in SAB4 genotype. The prediction of amino acid mutations among the group B study sequences was done in comparison with the BA prototype strain. The BA study strains were predicted to encode a G protein of three different lengths, 312, 315 and 319 amino acids (Fig 1). The G proteins of SAB4 genotype were predicted to be 299 amino acids in length. The genetic distance at the nucleotide and amino acid levels among the BA9 study sequence was 7.4% and 11.7%, respectively. The distance between the study BA sequences and prototype strain at nucleotide and amino acid levels was found to be 0.7% to 7.8% and 1.1% to 11.7%, respectively.

thumbnail
Fig 1. Alignment of deduced amino acid sequences of the 2nd HVR of G protein gene of BA9 lineage with respect to the BA prototype strain (Accession No. AY333364).

The alignment corresponds to the G gene (residue positions 219 to 315) of the BA prototype. Identical amino acids are indicated by dots. Asterisks indicate stop codons. Pink shading highlights the predicted potential N-linked glycosylation sites (Asn-Xaa-Ser/Thr sequence context, where Xaa is not a proline).

https://doi.org/10.1371/journal.pone.0193525.g001

Substitution mutations were frequently observed in the 2nd HVR of G protein gene of the Indian BA lineage that consisted of 96 sequences. (S1 Fig). A total of 37 amino acid mutations were identified in the studied region (Fig 2).

thumbnail
Fig 2. The mutations of deduced amino acid sequence of the 2nd HVR of G gene of BA9 lineage of group B Indian strains including the study sequences.

The amino acid residue corresponds to the 219 to 315 of BA prototype strain. Mutations are indicated by arrows.

https://doi.org/10.1371/journal.pone.0193525.g002

All the Indian BA9 lineage sequences showed a common substitution mutation at Ser247Pro (except one study sequence DEL/RSVB/S100/11). Further, most of the Indian strains exhibited common changes at four different sites Leu223Pro, Thr270Ile, Val271Ala and His287Tyr that were reported earlier [15]. Substitution mutations within the duplicated region (Ser267Pro, Thr270Ile, and Val271Ala) and in the flanking region (Leu223Pro, Ser247Pro, Ile281Thr and His287Tyr) have been reported earlier [15, 50]. Fifty percent of the study strains exhibited substitutions at Ile281Thr and Pro291Leu. Further, five study strains showed an amino acid change at position Ser297Phe. Three substitutions (Ser267Leu, Ile281Thr and Ser297Phe) were identified to be specific to the BA9 lineage. Twenty one mutations in the studied BA9 sequences were found to be newly identified ones from India. The study strain DEL/RSVB/S45/14 showed the highest number (11) of amino acids substitutions (Fig 1).

b) N- and O-linked glycosylation sites of the study strains.

N- and O-linked glycosylation sites were investigated for the studied BA9 sequences (n = 28). The sequences had two N-glycosylation sites at 296 and 310 codon positions that were located ahead of the duplication region (Fig 1). The sites at codon positions 296 and 310 were conserved among all the studied strains and in all the BA9 sequences. The studied strain, DEL/RSVB/G170/12 had one additional N-glycosylation site at position 278 in the duplicated region due to substitution of Lys278Arg. Further, 14 to 27 serine and threonine residues were predicted to be the potential O-linked glycosylated sites for strains sequenced in this study with G score ranging from 0.5 to 0.91. One of the study sequence (DEL/RSVB/G235/13) had the maximum number of 27 potential serine and threonine residues. Based on this in silico analysis, 8 amino acids were most likely to be glycosylated having G score more than 0.9. These included six threonine residues at 228, 232, 236, 240, 246 and 257 and two serine residues at 245 and 297 amino acids. Two substitutions in most of the studied strains at positions Ser247Pro (analogous region) and Thr270Ile (duplicated region) contributed to the loss of the glycosylation sites. Additionally, another substitution at position 281 in 15 studied strains led to the gain in the glycosylation site. Six additional O-linked glycosylation sites (260, 265, 267, 269, 275 and 277) were identified in the 20 amino acid duplicated region of the strains sequenced in this study.

c) Indian BA9 lineage and Bayesian skyline plot (BSP).

BSP analysis was carried out to estimate the effective population size of the Indian BA9 lineage involving time co-ordinates (S1 Table). The results showed significant variation in relative genetic diversity in BA9 lineage including contraction and expansion throughout 14 years of duration from 2002–15 (Fig 3). The effective population of Indian BA9 lineage remained constant until 2007. The population size increased gradually until 2009 and then a constant phase was maintained from 2010 to 2015.

thumbnail
Fig 3. Bayesian skyline plot that describes the demographic history of BA9 lineage of group B RSV in the effective population size with respect to time during the course of evolution in India from 2002 to 2015.

The y-axis represents the population size (Net) whereas x-axis indicates the time period in years. The thick lines represent the mean estimate, whereas the transparent area represents the 95% highest posterior density intervals.

https://doi.org/10.1371/journal.pone.0193525.g003

BA9 lineage and global entropy

This analysis revealed that the range of entropy value of the BA9 lineage was 0.00–0.75 with a threshold value of 0.2. A total of 17 variable amino acid sites (219, 267, 270, 271, 281, 287, 291, 297, 305, 313, 314, 315, 316, 317, 318, 319 and 320) were predicted in the BA9 lineage (Fig 4). One of the variable amino acid was identified at position 219 that was present prior to the analogous region. Three sites at positions 267, 270 and 271 were present in the 20 amino acid duplicated region. Thirteen variable sites (281, 287, 291, 297, 305, 313, 314, 315, 316, 317, 318, 319 and 320) were present in the C-terminal region of the G protein gene. One amino acid at position 270 in the duplicated region and two amino acids at positions 281 and 291 had entropy value greater than 0.60. Six amino acid sites (219, 267, 270, 287, 297 and 305) were identified to be positively selected and had high entropy value reflecting variation at these positions. The Indian BA9 lineage showed 21 variable amino acid sites in the analyzed gene of which 5 sites (261, 262, 267, 270 and 271) were present in the duplicated region.

thumbnail
Fig 4. Global amino acid variability of the 2nd HVR of G gene of BA9 lineage is represented by Shannon entropy plot.

BioEdit software was used for calculation of entropy values of every amino acid at a particular position. Entropy values <0.2 were considered conserved whereas amino acids with >0.2 values are considered variable. High entropy value showed maximum variability at that particular position.

https://doi.org/10.1371/journal.pone.0193525.g004

BA9 lineage and global selection pressure

This analysis revealed a low value of mean dN/dS ratio (0.553–0.630) with a mean value of 0.553 by SLAC method suggesting purifying selection. We found that there were 13 and 7 amino acid sites under positive selection with normalized dN-dS. Thirteen positive sites were confirmed with two different methods: 219, 228, 241, 247, 264, 267, 270, 274, 276, 287, 297, 298 and 305 (Table 1). However, 7 positive sites (219, 267, 270, 276, 287, 297 and 298) were confirmed by all the three methods (Table 1). The selection pressure survival at that particular amino acid positions was due to recurrent mutations (Table 2). A total of 31 negatively selected sites were detected by IFEL method.

thumbnail
Table 1. Positively selected amino acid positions of the BA9 lineage.

https://doi.org/10.1371/journal.pone.0193525.t001

thumbnail
Table 2. Amino acid mutations identified in positively selected sites of the BA9 lineage.

https://doi.org/10.1371/journal.pone.0193525.t002

Group B RSV and global phylogenetics

A total of 418 sequences of group B RSV (including 31 studied sequences) from 29 different countries were used for the phylogenetic analysis. All the sequences of group B RSV that were available in the GenBank during the period of 1962 to 2015 were used. The region of the alignment spanned from nucleotide 655 to 945 (amino acids 219 to 315) corresponding to the prototype strain of the BA genotype. The analysis revealed that 29 study sequences (93.5%) clustered in the BA genotype. Further, 2 study sequences belonged to the SAB4 genotype (Figs 5 and S2).

thumbnail
Fig 5. Neighbor-joining phylogenetic tree of the 2nd HVR of G gene of group B RSV; the tree was constructed using Kimura-2 parameter with 1,000 bootstrapping replicates.

Only bootstrap values greater than 70% are shown at the branch nodes. The genotypes are indicated at the right by brackets. Prototype strains (M17213/USA/62) were used as an out-group. The study sequences are indicated by solid colored triangles.

https://doi.org/10.1371/journal.pone.0193525.g005

Global distribution and phylodynamics of the BA9 lineage

The geographical distribution of the BA9 lineage across the globe is depicted in Fig 6. Circulation of the BA9 lineages has been reported from 23 countries including India, Israel, Vietnam, South Korea, Cambodia, China, South Africa, Malaysia, Philippines, Argentina, Netherlands, Spain, Japan, Germany, Thailand, Kenya, Brazil, Croatia, Ireland, Iran, USA, Saudi Arabia, and Italy.

thumbnail
Fig 6. Graphical representation of worldwide distribution of BA9 lineage of group B RSV; Color code bar in left side of the figure is used to represent the countries.

The free available editable world map was downloaded from http://www.powerpointslides.net (http://www.powerpointslides.net/powerpointgraphics/powerpointmaps.html). The map was created with PowerPoint and Adobe Photoshop.

https://doi.org/10.1371/journal.pone.0193525.g006

Markov Chain Monte Carlo (MCMC) algorithm was used to compute the time-scaled evolutionary relationship to estimate the mean evolutionary and substitution rates of the group B, BA genotype and BA9 lineage (Figs 7 and S3). The analysis revealed that the group B, BA genotype and BA9 lineage deviated from their ancestors during the years 1955 (95% HPD; 1947–1962), 1995 (95% HPD; 1987–1997) and 2000 (95% HPD; 1998–2001), respectively (Table 3). The global mean evolutionary rates (substitution/site/year) were determined for the group B RSV, BA genotype and BA9 lineage. The evolutionary rate of group B RSV was 4.59×10−3 (95% HPD; 3.88–5.30×10−3) that was faster than that for group A RSV i.e 3.49×10−3 (95% HPD, 2.90–4.17×10−3). The evolutionary rate of the BA genotype was 4.58×10−3 (3.89–5.29×10−3) and that of the BA9 lineage was 4.03×10−3 (95% HPD; 4.65–5.25×10−3) (Table 3).

thumbnail
Fig 7. Bayesian MCMC tree of the 2nd HVR of G gene of group B RSV; Nucleotide substitution model (GTR + Invariant + gamma) and exponential growth coalescent tree priors under strict clock was used.

Scale bars represent time in years. Gray bars indicate the 95% HPD for the estimated year of divergence. The study sequences are indicated by red colored line. Sequences from different countries in the tree are indicated by colored line at left side of the figure.

https://doi.org/10.1371/journal.pone.0193525.g007

thumbnail
Table 3. Estimated mean evolutionary rate and tMRCA of group B RSV.

https://doi.org/10.1371/journal.pone.0193525.t003

The BA9 lineage was categorized into 3 sub-lineages (I, II and III) within about 16 years of its origin (Figs 7 and S3). Sub-lineage I (48 sequences) was reported from different continents including Asia (India, Malaysia, Japan, South Korea, Israel and Iran), Europe (Ireland, Netherland, Italy and Croatia), Africa (South Africa and Kenya) and Americas (USA, Argentina and Brazil). Eleven sequences from India clustered within this sub-lineage. Further, sub-lineage II (141 sequences) was described from Asia (India, Japan, Philippines, China, Vietnam, Thailand, Israel and Saudi Arabia), Europe (Netherlands, Spain, Ireland, Italy and Germany) Africa (South Africa) and Americas (Brazil). This sub-lineage included 22 sequences from India. Similarly sub-lineage III (91 sequences) included sequences from Asia (India, Japan, China, Malaysia, South Korea, Vietnam, Thailand, Iran and Cambodia), Europe (Netherlands, Italy, Germany and Spain) and Africa (South Africa). Thirty one sequences from India clustered in this sub-lineage. Sequences from the following countries India, Italy, Japan, Netherland, and South Africa clustered in all the sub-lineages. The studied strains clustered in all the sub-lineages I (1 sequence), II (15 sequences) and III (13 sequences).

BA9 lineage and global phylogeographics

The global phylogeographic analysis was carried out for the BA9 lineage. We obtained 190 variable sites that defined 129 haplotypes (H) with 0.9954 diversity (Hd). The frequency of ≥ 2 was used to allow a large number of haplotypes with maximum single nucleotide polymorphisms. The H1 (M17213/USA/62) was progenitor of all the strains of group B RSV, whereas all the lineages of the BA genotype (BA2-12) emerged from the precursor H13 (AY333364/Argentina/99) (Fig 8). All the RSV strains in the network were interconnected and formed 3 sub-lineages, (I-III). The distribution of the sequences in these 3 sub-lineages was similar to the categorization of the 3 sub-lineages described by the Bayesian analysis (Fig 7). There were 25 haplotypes of strains sequenced in this study clustered in all three sub-lineages. The network analysis showed that BA9 lineage evolved from its progenitors BA7 and BA8 lineages.

thumbnail
Fig 8. Median joining network of the 2nd HVR of G gene of group B depicting the relationship between the strains or lineages.

The length of line between the haplotypes does not depend on the number of mutations. Each circle represents the haplotypes and surface area of each circle reflects the frequency. Haplotypes circulating in different countries were represented by colour-codes. Coloured segments inside the circle indicate the shared haplotypes. Median vectors are indicated by red circles.

https://doi.org/10.1371/journal.pone.0193525.g008

Discussion

RSV is an important pathogen of ARI that has global distribution. The BA and ON1 genotypes of RSV have 60 and 72bp duplication in the 2nd HVR of the G protein gene [11, 14]. These two genotypes with duplications spread across the globe within a few years of their identification probably due to the immunologically naive host population [51]. A recent study described the global evolutionary dynamics of the ON1 genotype that has been categorized into 3 lineages (1.1, 1.2 and 1.3) [52]. Similarly, the BA genotype has been divided into 12 lineages; (BA1-12). We aimed to describe the local and global transmission dynamics of group B RSV with focus on the BA9 lineage. We identified RSV strains during molecular surveillance of ARI from New Delhi, India recorded consecutively for 4 years from 2011 to 2015. The global distribution and evolutionary dynamics of the NA1 genotype has been described in a recent report [49]. The local transmission dynamics of RSV in India was investigated owing to paucity of information on its molecular epidemiology. This included 96 BA strains reported from India during the past 14 years from 2002 to 2015. Significantly, BA viruses have either partly or fully replaced the previously circulating non-BA genotypes across different geographical regions [8, 10, 17, 18, 23, 24, 51, 5359]. However, some of the non-BA genotypes continued to circulate globally at moderate levels. An examination of the Bayesian skyline plot of the Indian BA9 lineage suggested contraction and expansion of the total virus population size during the years 2002–15. Further, surveillance will determine the epidemiological characteristics of this genotype in India for its continued presence or absence altogether.

The genetic variations in the G gene of RSV are caused due to point mutations, large duplications events and use of alternative stop codons. The genetic variations in the 60bp duplicated region in the BA genotype have resulted in formation of different lineages (BA1 to BA12). Further, genetic variations among these lineages may form sub-lineages. Similarly, the antigenic variations occur due to alterations of the N- and O-linked glycosylation sites. Although several amino acid mutations were identified in the BA9 lineage, 12 of these substitutions were mapped exclusively to the 60bp duplicated region suggesting evolutionary selection pressure on this part of the gene [60, 61]. Four of these mutations showed positive selection whereas two substitutions had high entropy values. The amino acid at positions 267 and 270 in the duplicated region had high entropy and showed positive selection as described below. The global selection pressure analysis of the BA9 lineage revealed low mean dN/dS ratio suggesting that codon positions are conserved. Further, 13 positively selected sites were identified in the BA9 lineage suggesting its stochastic route of evolution [61]. Interestingly, six different amino acids (219, 267, 270, 287, 297 and 305) were positively selected and had high entropy values indicating variations at these positions. The amino acid at positions 267 and 270 were reported to have high entropy value from Philippines and Saudi Arabia and 219 from Saudi Arabia alone [25, 51]. In addition, 4 positively selected amino acids (positions 228, 267, 297 and 298) identified in the studied strains also showed O-linked glycosylation. Further, six additional O-linked glycosylation sites were identified in the 20 amino acid duplicated region of the BA9 lineage. Taken together, the genetic and antigenic alterations seems to provide additional evolutionary advantages to the BA9 viruses leading to evasion of host immune response that might contribute towards their enhanced transmission and thus global dispersal.

We carried out the global phylogenetic analysis of the group B RSV sequences reported from 29 different countries. The group B sequences clustered in different genotypes including GB1-GB-6, SAB1-SAB4 and BA. BA9 is the most prevalent lineage of the BA viruses that has been described in recent publications [60]. Therefore, we assessed the global evolutionary dynamics of BA9 lineage that was circulating in 23 different countries. The global spread of the BA9 lineage might be associated with travelers [62, 63] or selective fitness advantages. The 60bp duplication in the BA9 lineage might contribute to altering the antigenic characteristics of the virus leading to immune evasion thus providing selective advantage for transmission in the human host. Phylogenetic analysis showed that studied group B strains clustered in the BA (29 sequences) and SAB4 (2 sequences) genotypes. The BA genotype included 28 sequences of the BA9 lineage whereas 1 strain clustered with the BA8 lineage. We analysed the BA9 lineage of BA genotype in detail since it was the predominant lineage. This analysis revealed global distribution of the BA9 genotype that was reflected across different geographical regions including India, Spain, Philippines, Thailand, South Africa, Japan, China, Vietnam, Ireland, Italy, Israel, Argentina, Iran, Germany, Cambodia and Malaysia [8, 10, 1315, 17, 21, 23, 24, 26, 28, 51, 55, 57, 6481]. Further, the studied SAB4 strains clustered with sequences from Cambodia [13]. The studied BA8 strain clustered with the sequence from Japan [72]

We estimated the divergent year for the tree root of group B RSV. The analysis revealed that group B, BA genotype and BA9 lineage deviated from their ancestors in the years 1955, 1995 and 2000, respectively. It seems all existing strains of group B RSV originated in 1955 [8, 18, 24, 56, 58, 59]. The mean evolutionary rate was determined for different datasets including group A, group B, BA genotype and BA9 lineage. The rate of group B RSV [4.59×10−3 (95% HPD; 3.88–5.30×10−3)] was faster than that for group A viruses [3.49×10−3 (95% HPD; 2.90–4.17×10−3)]. This suggested that group B viruses are probably evolving faster than their group A counterpart as reported earlier [12, 43, 69]. The substitution rate of group B RSV was previously reported from Japan [4.88×10−3 (3.97–5.83×10−3)] and Italy [3.03×10−3 (2.1–3.9×10−3)] [24, 58]. However, the BA9 lineage showed marginally different global evolutionary rate of 4.03×10−3 (95% HPD; 4.65–5.25 ×10−3) as compared to that of BA genotype [4.58×10−3 (3.89–5.29×10−3)]. The evolutionary rate of the BA9 estimated in the present study was within the 95% HPD reported from Japan [4.77×10−3 (3.27–6.39×10−3)] [24]. This recent report analyzed the evolutionary dynamics of the BA9 and BA10 lineages with limited GenBank sequences. We described comprehensive global evolutionary trajectory of the BA9 lineage along with its local transmission dynamics. Interestingly, the global evolutionary rate of BA9 (with 60bp duplication) was found to be similar to that of the ON1 genotype (with 72bp duplication) [4.10 ×10−3 (95% HPD; 3.1–5.0×10−3)] [52]. Taken together, the genotype of group A and lineage of group B that possess duplications (ON1 and BA9 respectively) seem to be evolving at similar rates. This similarity of duplication among ON1 and BA9 may be due to the immunological naive scenario of the human population. We estimated the evolutionary rate of the 2nd HVR of the G gene which tend to accumulate mutations affecting the immunological response. An insight into the evolutionary rates of the full length G gene and genome of BA9 lineage is envisaged to elucidate the detailed evolutionary dynamics of this rapidly evolving respiratory pathogen.

We found that BA9 lineage has emerged in the year 2000 and diverged substantially into 3 global sub-lineages. However, most of the BA9 sequences grouped into the sub-lineage II that showed frequent global spread. All the sub-lineages of BA9 showed somewhat temporal distribution. In this context, the strains described during the year 2002–06 clustered in the sub-lineage I. The recently reported strains from 2013–15 belonged to sub-lineage II and III whereas the strains reported during the year 2007–12 clustered in all the three sub-lineages. Thus, different lineages seem to have followed varying trajectory of evolution to survive efficiently in the human host.

Conclusions

The analysis of the BA9 lineage showing its global spread with higher evolutionary rate warrants community and hospital based surveillance across different geographical regions. Since the virus has three distinct global lineages, it would be of interest to monitor their individual or collective infection in the human host to uncover their survival strategies. Finally, work undertaken on host immune response in the context of different lineages will uncover the possible molecular mechanism of infection.

Supporting information

S1 Fig. Alignment of the deduced amino acid sequences of 2nd HVR of G protein gene of BA9 lineage of the BA genotype of group B RSV.

The alignment includes 96 sequences of the BA9 lineage that have been reported from India.

https://doi.org/10.1371/journal.pone.0193525.s001

(PDF)

S2 Fig. Neighbor-joining phylogenetic tree of 2nd HVR of G protein gene of group B RSV.

https://doi.org/10.1371/journal.pone.0193525.s002

(PDF)

S3 Fig. MCC tree of 2nd HVR of G protein gene of group B RSV.

https://doi.org/10.1371/journal.pone.0193525.s003

(PDF)

S1 Table. Details of the sequences used for the phylogenetic, Bayesian, Network, skyline plot, selection pressure, entropy and glycosylation analyses.

https://doi.org/10.1371/journal.pone.0193525.s004

(DOCX)

Acknowledgments

Md Shakir Hussain Haider is supported by Research fellowship by Council of Scientific and Industrial Research and Indian Council of Medical Research, Government of India. S Ali acknowledges the award of J.C. Bose National fellowship from DST-SERB, New Delhi, India). We thank Centre for Development of Advanced Computing (C-DAC) Pune, for providing Bioinformatics Resources and Applications Facility (BRAF) server. This study was supported by King Saud University, Deanship of Scientific Research, College of Science Research Centre.

References

  1. 1. Nair H, Nokes DJ, Gessner BD, Dherani M, Madhi SA, Singleton RJ, et al. Global burden of acute lower respiratory infections due to respiratory syncytial virus in young children: a systematic review and meta-analysis. Lancet. 2010;375(9725):1545–55. Epub 2010/04/20. pmid:20399493; PubMed Central PMCID: PMC2864404.
  2. 2. Wertz GW, Collins PL, Huang Y, Gruber C, Levine S, Ball LA. Nucleotide sequence of the G protein gene of human respiratory syncytial virus reveals an unusual type of viral membrane protein. Proc Natl Acad Sci U S A. 1985;82(12):4075–9. Epub 1985/06/01. pmid:3858865; PubMed Central PMCID: PMC397937.
  3. 3. Mufson MA, Orvell C, Rafnar B, Norrby E. Two distinct subtypes of human respiratory syncytial virus. J Gen Virol. 1985;66 (Pt 10):2111–24. Epub 1985/10/01. pmid:2413163.
  4. 4. Qin X, Zhang C, Zhao Y, Zhao X. Genetic variability of subgroup A and B respiratory syncytial virus strains circulating in southwestern China from 2009 to 2011. Arch Virol. 2013;158(7):1487–95. Epub 2013/02/28. pmid:23443930.
  5. 5. Peret TC, Hall CB, Schnabel KC, Golub JA, Anderson LJ. Circulation patterns of genetically distinct group A and B strains of human respiratory syncytial virus in a community. J Gen Virol. 1998;79 (Pt 9):2221–9. Epub 1998/09/25. pmid:9747732.
  6. 6. Peret TC, Hall CB, Hammond GW, Piedra PA, Storch GA, Sullender WM, et al. Circulation patterns of group A and B human respiratory syncytial virus genotypes in 5 communities in North America. J Infect Dis. 2000;181(6):1891–6. Epub 2000/06/06. pmid:10837167.
  7. 7. Venter M, Madhi SA, Tiemessen CT, Schoub BD. Genetic diversity and molecular epidemiology of respiratory syncytial virus over four consecutive seasons in South Africa: identification of new subgroup A and B genotypes. J Gen Virol. 2001;82(Pt 9):2117–24. Epub 2001/08/22. pmid:11514720.
  8. 8. Pretorius MA, van Niekerk S, Tempia S, Moyes J, Cohen C, Madhi SA, et al. Replacement and positive evolution of subtype A and B respiratory syncytial virus G-protein genotypes from 1997–2012 in South Africa. J Infect Dis. 2013;208 Suppl 3:S227–37. Epub 2013/12/07. pmid:24265482.
  9. 9. Shobugawa Y, Saito R, Sano Y, Zaraket H, Suzuki Y, Kumaki A, et al. Emerging genotypes of human respiratory syncytial virus subgroup A among patients in Japan. J Clin Microbiol. 2009;47(8):2475–82. Epub 2009/06/26. pmid:19553576; PubMed Central PMCID: PMC2725673.
  10. 10. Cui G, Qian Y, Zhu R, Deng J, Zhao L, Sun Y, et al. Emerging human respiratory syncytial virus genotype ON1 found in infants with pneumonia in Beijing, China. Emerg Microbes Infect. 2013;2(4):e22. Epub 2013/04/01. pmid:26038462; PubMed Central PMCID: PMC3639546.
  11. 11. Eshaghi A, Duvvuri VR, Lai R, Nadarajah JT, Li A, Patel SN, et al. Genetic variability of human respiratory syncytial virus A strains circulating in Ontario: a novel genotype with a 72 nucleotide G gene duplication. PloS one. 2012;7(3):e32807. Epub 2012/04/04. pmid:22470426; PubMed Central PMCID: PMC3314658.
  12. 12. Zlateva KT, Lemey P, Moes E, Vandamme AM, Van Ranst M. Genetic variability and molecular evolution of the human respiratory syncytial virus subgroup B attachment G protein. J Virol. 2005;79(14):9157–67. Epub 2005/07/05. pmid:15994810; PubMed Central PMCID: PMC1168771.
  13. 13. Arnott A, Vong S, Mardy S, Chu S, Naughtin M, Sovann L, et al. A study of the genetic variability of human respiratory syncytial virus (HRSV) in Cambodia reveals the existence of a new HRSV group B genotype. J Clin Microbiol. 2011;49(10):3504–13. Epub 2011/08/26. pmid:21865418; PubMed Central PMCID: PMC3187327.
  14. 14. Trento A, Viegas M, Galiano M, Videla C, Carballal G, Mistchenko AS, et al. Natural history of human respiratory syncytial virus inferred from phylogenetic analysis of the attachment (G) glycoprotein with a 60-nucleotide duplication. J Virol. 2006;80(2):975–84. Epub 2005/12/28. pmid:16378999; PubMed Central PMCID: PMC1346866.
  15. 15. Dapat IC, Shobugawa Y, Sano Y, Saito R, Sasaki A, Suzuki Y, et al. New genotypes within respiratory syncytial virus group B genotype BA in Niigata, Japan. J Clin Microbiol. 2010;48(9):3423–7. Epub 2010/07/09. pmid:20610675; PubMed Central PMCID: PMC2937669.
  16. 16. Baek YH, Choi EH, Song MS, Pascua PN, Kwon HI, Park SJ, et al. Prevalence and genetic characterization of respiratory syncytial virus (RSV) in hospitalized children in Korea. Arch Virol. 2012;157(6):1039–50. Epub 2012/03/10. pmid:22402914.
  17. 17. Khor CS, Sam IC, Hooi PS, Chan YF. Displacement of predominant respiratory syncytial virus genotypes in Malaysia between 1989 and 2011. Infect Genet Evol. 2013;14:357–60. Epub 2013/01/12. pmid:23305888.
  18. 18. Trento A, Casas I, Calderon A, Garcia-Garcia ML, Calvo C, Perez-Brena P, et al. Ten years of global evolution of the human respiratory syncytial virus BA genotype with a 60-nucleotide duplication in the G protein gene. J Virol. 2010;84(15):7500–12. Epub 2010/05/28. pmid:20504933; PubMed Central PMCID: PMC2897623.
  19. 19. Trento A, Galiano M, Videla C, Carballal G, Garcia-Barreno B, Melero JA, et al. Major changes in the G protein of human respiratory syncytial virus isolates introduced by a duplication of 60 nucleotides. J Gen Virol. 2003;84(Pt 11):3115–20. Epub 2003/10/24. pmid:14573817.
  20. 20. Sullender WM, Mufson MA, Anderson LJ, Wertz GW. Genetic diversity of the attachment protein of subgroup B respiratory syncytial viruses. J Virol. 1991.
  21. 21. van Niekerk S, Venter M. Replacement of previously circulating respiratory syncytial virus subtype B strains with the BA genotype in South Africa. J Virol. 2011;85(17):8789–97. Epub 2011/07/01. pmid:21715483; PubMed Central PMCID: PMC3165815.
  22. 22. Zhang ZY, Du LN, Chen X, Zhao Y, Liu EM, Yang XQ, et al. Genetic variability of respiratory syncytial viruses (RSV) prevalent in Southwestern China from 2006 to 2009: emergence of subgroup B and A RSV as dominant strains. J Clin Microbiol. 2010;48(4):1201–7. Epub 2010/02/12. pmid:20147636; PubMed Central PMCID: PMC2849604.
  23. 23. Choudhary ML, Anand SP, Wadhwa BS, Chadha MS. Genetic variability of human respiratory syncytial virus in Pune, Western India. Infect Genet Evol. 2013;20:369–77. Epub 2013/10/12. pmid:24113083.
  24. 24. Nagasawa K, Hirano E, Kobayashi M, Ryo A, Oishi K, Obuchi M, et al. Molecular evolution of the hypervariable region of the attachment glycoprotein gene in human respiratory syncytial virus subgroup B genotypes BA9 and BA10. Infect Genet Evol. 2015;36:217–23. Epub 2015/09/27. pmid:26408340.
  25. 25. Ahmed A, Haider SH, Parveen S, Arshad M, Alsenaidy HA, Baaboud AO, et al. Co-Circulation of 72bp Duplication Group A and 60bp Duplication Group B Respiratory Syncytial Virus (RSV) Strains in Riyadh, Saudi Arabia during 2014. PloS one. 2016;11(11):e0166145. Epub 2016/11/12. pmid:27835664; PubMed Central PMCID: PMC5106011.
  26. 26. Parveen S, Sullender WM, Fowler K, Lefkowitz EJ, Kapoor SK, Broor S. Genetic variability in the G protein gene of group A and B respiratory syncytial viruses from India. J Clin Microbiol. 2006;44(9):3055–64. Epub 2006/09/07. pmid:16954227; PubMed Central PMCID: PMC1594720.
  27. 27. Agrawal AS, Sarkar M, Ghosh S, Chawla-Sarkar M, Chakraborty N, Basak M, et al. Prevalence of respiratory syncytial virus group B genotype BA-IV strains among children with acute respiratory tract infection in Kolkata, Eastern India. J Clin Virol. 2009;45(4):358–61. Epub 2009/07/03. pmid:19570709.
  28. 28. Raghuram SV, Khan WH, Deeba F, Sullender W, Broor S, Parveen S. Retrospective phylogenetic analysis of circulating BA genotype of human respiratory syncytial virus with 60 bp duplication from New Delhi, India during 2007–2010. Virusdisease. 2015;26(4):276–81. Epub 2015/12/09. pmid:26645038; PubMed Central PMCID: PMC4663712.
  29. 29. Organization WH. Make any mother and child count. Geneva. 2005.
  30. 30. Templeton KE, Scheltinga SA, Beersma MF, Kroes AC, Claas EC. Rapid and sensitive method using multiplex real-time PCR for diagnosis of infections by influenza a and influenza B viruses, respiratory syncytial virus, and parainfluenza viruses 1, 2, 3, and 4. J Clin Microbiol. 2004;42(4):1564–9. Epub 2004/04/09. pmid:15071005; PubMed Central PMCID: PMC387552.
  31. 31. Hall TA, editor BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic acids symposium series; 1999: [London]: Information Retrieval Ltd., c1979–c2000.
  32. 32. Nicholas KB, Nicholas HB Jr, Deerfield II DW. embnet. news. GeneDoc: Analysis and Visualization of Genetic Variation. 1997;4:14.
  33. 33. Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Mol Biol Evol. 2013;30(12):2725–9. Epub 2013/10/18. pmid:24132122; PubMed Central PMCID: PMC3840312.
  34. 34. Kimura M. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol. 1980;16(2):111–20. Epub 1980/12/01. pmid:7463489.
  35. 35. Shannon CE. A mathematical theory of communication. ACM SIGMOBILE Mobile Computing and Communications Review. 2001;5(1):3–55.
  36. 36. Pond SL, Frost SD. Datamonkey: rapid detection of selective pressure on individual sites of codon alignments. Bioinformatics. 2005;21(10):2531–3. Epub 2005/02/17. pmid:15713735.
  37. 37. Murrell B, Moola S, Mabona A, Weighill T, Sheward D, Kosakovsky Pond SL, et al. FUBAR: a fast, unconstrained bayesian approximation for inferring selection. Mol Biol Evol. 2013;30(5):1196–205. Epub 2013/02/20. pmid:23420840; PubMed Central PMCID: PMC3670733.
  38. 38. Drummond AJ, Suchard MA, Xie D, Rambaut A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol Biol Evol. 2012;29(8):1969–73. Epub 2012/03/01. pmid:22367748; PubMed Central PMCID: PMC3408070.
  39. 39. Swofford DL. PAUP*: Phylogenetic analysis using parsimony (* and other methods), version 4.0 b8. Sinauer, Sunderland, Massachusetts. 2001.
  40. 40. Posada D, Crandall KA. MODELTEST: testing the model of DNA substitution. Bioinformatics. 1998;14(9):817–8. Epub 1999/01/27. pmid:9918953.
  41. 41. Kimura H, Saitoh M, Kobayashi M, Ishii H, Saraya T, Kurai D, et al. Molecular evolution of haemagglutinin (H) gene in measles virus. Sci Rep. 2015;5:11648. Epub 2015/07/02. pmid:26130388; PubMed Central PMCID: PMC4486977.
  42. 42. Baele G, Li WL, Drummond AJ, Suchard MA, Lemey P. Accurate model selection of relaxed molecular clocks in bayesian phylogenetics. Mol Biol Evol. 2013;30(2):239–43. Epub 2012/10/24. pmid:23090976; PubMed Central PMCID: PMC3548314.
  43. 43. Hirano E, Kobayashi M, Tsukagoshi H, Yoshida LM, Kuroda M, Noda M, et al. Molecular evolution of human respiratory syncytial virus attachment glycoprotein (G) gene of new genotype ON1 and ancestor NA1. Infect Genet Evol. 2014;28:183–91. Epub 2014/10/07. pmid:25283803.
  44. 44. Pybus OG, Rambaut A, Harvey PH. An integrated framework for the inference of viral population history from reconstructed genealogies. Genetics. 2000;155(3):1429–37. Epub 2000/07/06. pmid:10880500; PubMed Central PMCID: PMC1461136.
  45. 45. Olvera A, Busquets N, Cortey M, de Deus N, Ganges L, Nunez JI, et al. Applying phylogenetic analysis to viral livestock diseases: moving beyond molecular typing. Vet J. 2010;184(2):130–7. Epub 2009/03/31. pmid:19329341.
  46. 46. Morrison DA. Networks in phylogenetic analysis: new tools for population biology. Int J Parasitol. 2005;35(5):567–82. Epub 2005/04/14. pmid:15826648.
  47. 47. Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25(11):1451–2. Epub 2009/04/07. pmid:19346325.
  48. 48. Bandelt HJ, Forster P, Rohl A. Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol. 1999;16(1):37–48. Epub 1999/05/20. pmid:10331250.
  49. 49. Haider MSH, Deeba F, Khan WH, Naqvi IH, Ali S, Ahmed A, et al. Global distribution of NA1 genotype of respiratory syncytial virus and its evolutionary dynamics assessed from the past 11years. Infect Genet Evol. 2018. Epub 2018/02/11. pmid:29427763.
  50. 50. Houspie L, Lemey P, Keyaerts E, Reijmen E, Vergote V, Vankeerberghen A, et al. Circulation of HRSV in Belgium: from multiple genotype circulation to prolonged circulation of predominant genotypes. PLoS One. 2013;8(4):e60416. Epub 2013/04/12. pmid:23577109; PubMed Central PMCID: PMC3618235.
  51. 51. Malasao R, Okamoto M, Chaimongkol N, Imamura T, Tohma K, Dapat I, et al. Molecular Characterization of Human Respiratory Syncytial Virus in the Philippines, 2012–2013. PloS one. 2015;10(11):e0142192. Epub 2015/11/06. pmid:26540236; PubMed Central PMCID: PMC4635013.
  52. 52. Duvvuri VR, Granados A, Rosenfeld P, Bahl J, Eshaghi A, Gubbay JB. Genetic diversity and evolutionary insights of respiratory syncytial virus A ON1 genotype: global and local transmission dynamics. Sci Rep. 2015;5:14268. Epub 2015/10/01. pmid:26420660; PubMed Central PMCID: PMC4588507.
  53. 53. Zlateva KT, Vijgen L, Dekeersmaeker N, Naranjo C, Van Ranst M. Subgroup prevalence and genotype circulation patterns of human respiratory syncytial virus in Belgium during ten successive epidemic seasons. J Clin Microbiol. 2007;45(9):3022–30. Epub 2007/07/05. pmid:17609323; PubMed Central PMCID: PMC2045289.
  54. 54. Forcic D, Ivancic-Jelecki J, Mlinaric-Galinovic G, Vojnovic G, Babic-Erceg A, Tabain I. A study of the genetic variability of human respiratory syncytial virus in Croatia, 2006–2008. J Med Virol. 2012;84(12):1985–92. Epub 2012/10/20. pmid:23080507.
  55. 55. Tran DN, Pham TM, Ha MT, Tran TT, Dang TK, Yoshida LM, et al. Molecular epidemiology and disease severity of human respiratory syncytial virus in Vietnam. PloS one. 2013;8(1):e45436. Epub 2013/01/26. pmid:23349659; PubMed Central PMCID: PMC3551923.
  56. 56. Tan L, Coenjaerts FE, Houspie L, Viveen MC, van Bleek GM, Wiertz EJ, et al. The comparative genomics of human respiratory syncytial virus subgroups A and B: genetic variability and molecular evolutionary dynamics. J Virol. 2013;87(14):8213–26. Epub 2013/05/24. pmid:23698290; PubMed Central PMCID: PMC3700225.
  57. 57. Auksornkitti V, Kamprasert N, Thongkomplew S, Suwannakarn K, Theamboonlers A, Samransamruajkij R, et al. Molecular characterization of human respiratory syncytial virus, 2010–2011: identification of genotype ON1 and a new subgroup B genotype in Thailand. Arch Virol. 2014;159(3):499–507. Epub 2013/09/27. pmid:24068580.
  58. 58. Martinelli M, Frati ER, Zappa A, Ebranati E, Bianchi S, Pariani E, et al. Phylogeny and population dynamics of respiratory syncytial virus (Rsv) A and B. Virus Res. 2014;189:293–302. Epub 2014/06/24. pmid:24954788.
  59. 59. Kim YJ, Kim DW, Lee WJ, Yun MR, Lee HY, Lee HS, et al. Rapid replacement of human respiratory syncytial virus A with the ON1 genotype having 72 nucleotide duplication in G gene. Infect Genet Evol. 2014;26:103–12. Epub 2014/05/14. pmid:24820343.
  60. 60. Botosso VF, Zanotto PM, Ueda M, Arruda E, Gilio AE, Vieira SE, et al. Positive selection results in frequent reversible amino acid replacements in the G protein gene of human respiratory syncytial virus. PLoS Pathog. 2009;5(1):e1000254. Epub 2009/01/03. pmid:19119418; PubMed Central PMCID: PMC2603285.
  61. 61. Bose ME, He J, Shrivastava S, Nelson MI, Bera J, Halpin RA, et al. Sequencing and analysis of globally obtained human respiratory syncytial virus A and B genomes. PLoS One. 2015;10(3):e0120098. Epub 2015/03/21. pmid:25793751; PubMed Central PMCID: PMC4368745.
  62. 62. Jaureguiberry S, Boutolleau D, Grandsire E, Kofman T, Deback C, Ait-Arkoub Z, et al. Clinical and microbiological evaluation of travel-associated respiratory tract infections in travelers returning from countries affected by pandemic A(H1N1) 2009 influenza. J Travel Med. 2012;19(1):22–7. Epub 2012/01/10. pmid:22221808.
  63. 63. Luna LK, Panning M, Grywna K, Pfefferle S, Drosten C. Spectrum of viruses and atypical bacteria in intercontinental air travelers with symptoms of acute respiratory infection. J Infect Dis. 2007;195(5):675–9. Epub 2007/01/31. pmid:17262708.
  64. 64. Bharaj P, Sullender WM, Kabra SK, Mani K, Cherian J, Tyagi V, et al. Respiratory viral infections detected by multiplex PCR among pediatric patients with lower respiratory tract infections seen at an urban hospital in Delhi from 2005 to 2007. Virol J. 2009;6:89. Epub 2009/06/30. pmid:19558656; PubMed Central PMCID: PMC2709894.
  65. 65. Singh AK, Jain A, Jain B, Singh KP, Dangi T, Mohan M, et al. Viral aetiology of acute lower respiratory tract illness in hospitalised paediatric patients of a tertiary hospital: one year prospective study. Indian J Med Microbiol. 2014;32(1):13–8. Epub 2014/01/09. pmid:24399381.
  66. 66. Gimferrer L, Campins M, Codina MG, Martin Mdel C, Fuentes F, Esperalba J, et al. Molecular epidemiology and molecular characterization of respiratory syncytial viruses at a tertiary care university hospital in Catalonia (Spain) during the 2013–2014 season. J Clin Virol. 2015;66:27–32. Epub 2015/04/14. pmid:25866332.
  67. 67. Ohno A, Suzuki A, Lupisan S, Galang H, Sombrero L, Aniceto R, et al. Genetic characterization of human respiratory syncytial virus detected in hospitalized children in the Philippines from 2008 to 2012. J Clin Virol. 2013;57(1):59–65. Epub 2013/01/30. pmid:23357644.
  68. 68. Visser A, Delport S, Venter M. Molecular epidemiological analysis of a nosocomial outbreak of respiratory syncytial virus associated pneumonia in a kangaroo mother care unit in South Africa. J Med Virol. 2008;80(4):724–32. Epub 2008/02/26. pmid:18297695.
  69. 69. Kushibuchi I, Kobayashi M, Kusaka T, Tsukagoshi H, Ryo A, Yoshida A, et al. Molecular evolution of attachment glycoprotein (G) gene in human respiratory syncytial virus detected in Japan 2008–2011. Infect Genet Evol. 2013;18:168–73. Epub 2013/05/28. pmid:23707845.
  70. 70. Yoshida A, Kiyota N, Kobayashi M, Nishimura K, Tsutsui R, Tsukagoshi H, et al. Molecular epidemiology of the attachment glycoprotein (G) gene in respiratory syncytial virus in children with acute respiratory infection in Japan in 2009/2010. J Med Microbiol. 2012;61(Pt 6):820–9. Epub 2012/03/03. pmid:22383445.
  71. 71. Fujitsuka A, Tsukagoshi H, Arakawa M, Goto-Sugai K, Ryo A, Okayama Y, et al. A molecular epidemiological study of respiratory viruses detected in Japanese children with acute wheezing illness. BMC Infect Dis. 2011;11:168. Epub 2011/06/15. pmid:21663657; PubMed Central PMCID: PMC3123215.
  72. 72. Yamaguchi M, Sano Y, Dapat IC, Saito R, Suzuki Y, Kumaki A, et al. High frequency of repeated infections due to emerging genotypes of human respiratory syncytial viruses among children during eight successive epidemic seasons in Japan. J Clin Microbiol. 2011;49(3):1034–40. Epub 2010/12/24. pmid:21177891; PubMed Central PMCID: PMC3067727.
  73. 73. Do LA, Wilm A, Van Doorn HR, Lam HM, Sim S, Sukumaran R, et al. Direct whole-genome deep-sequencing of human respiratory syncytial virus A and B from Vietnamese children identifies distinct patterns of inter- and intra-host evolution. J Gen Virol. 2015;96(12):3470–83. Epub 2015/09/27. pmid:26407694; PubMed Central PMCID: PMC4804761.
  74. 74. Salter A, Laoi BN, Crowley B. Molecular epidemiology of human respiratory syncytial virus subgroups A and B identified in adults with hematological malignancy attending an Irish hospital between 2004 and 2009. J Med Virol. 2011;83(2):337–47. Epub 2010/12/25. pmid:21181932.
  75. 75. Esposito S, Piralla A, Zampiero A, Bianchini S, Di Pietro G, Scala A, et al. Characteristics and Their Clinical Relevance of Respiratory Syncytial Virus Types and Genotypes Circulating in Northern Italy in Five Consecutive Winter Seasons. PLoS One. 2015;10(6):e0129369. Epub 2015/06/06. pmid:26047100; PubMed Central PMCID: PMC4457818.
  76. 76. Hirsh S, Hindiyeh M, Kolet L, Regev L, Sherbany H, Yaary K, et al. Epidemiological changes of respiratory syncytial virus (RSV) infections in Israel. PLoS One. 2014;9(3):e90515. Epub 2014/03/07. pmid:24594694; PubMed Central PMCID: PMC3940902.
  77. 77. Faghihloo E, Yavarian J, Jandaghi NZ, Shadab A, Azad TM. Genotype circulation pattern of human respiratory syncytial virus in Iran. Infect Genet Evol. 2014;22:130–3. Epub 2014/01/28. pmid:24462624.
  78. 78. Faghihloo E, Rezaie F, Salimi V, Naseri M, Mamishi S, Mahmoodi M, et al. Molecular epidemiology of human respiratory syncytial virus in Iran. Acta Virol. 2011;55(1):81–3. Epub 2011/03/26. pmid:21434709.
  79. 79. Tabatabai J, Prifert C, Pfeil J, Grulich-Henn J, Schnitzler P. Novel respiratory syncytial virus (RSV) genotype ON1 predominates in Germany during winter season 2012–13. PLoS One. 2014;9(10):e109191. Epub 2014/10/08. pmid:25290155; PubMed Central PMCID: PMC4188618.
  80. 80. Geis S, Prifert C, Weissbrich B, Lehners N, Egerer G, Eisenbach C, et al. Molecular characterization of a respiratory syncytial virus outbreak in a hematology unit in Heidelberg, Germany. J Clin Microbiol. 2013;51(1):155–62. Epub 2012/10/27. pmid:23100345; PubMed Central PMCID: PMC3536189.
  81. 81. Etemadi MR, Sekawi Z, Othman N, Lye MS, Moghaddam FY. Circulation of human respiratory syncytial virus strains among hospitalized children with acute lower respiratory infection in malaysia. Evol Bioinform Online. 2013;9:151–61. Epub 2013/05/04. pmid:23641140; PubMed Central PMCID: PMC3629939.