Genetic recombination is a well-known phenomenon in enteroviruses. To investigate the genetic characteristics and potential recombination of enterovirus 71 (EV71) circulating in China, we determined the complete genome sequences of 16 EV71 strains isolated from hand, foot and mouth disease (HFMD) patients in China during large-scale outbreak and non-outbreak years since 1998. These 16 full-length genome sequences were aligned with 186 EV71 genome sequences available from GenBank, comprising 104 sequences from mainland China and 82 international sequences and covering the period 1970–2011. The oldest strain of each EV71 subgenotype and the prototype strains of HEV-A were included in the phylogenetic and Simplot analyses. Phylogenetic analysis indicated that all Chinese strains clustered into subgenotype C4 of EV71, except for HuB/CHN/2009, which clustered into genotype A, and Xiamen/CHN/2009, which clustered into subgenotype B5. Most C4 strains fell into two predominant evolutionary branches, C4a and C4b. Our comprehensive recombination analysis showed evidence of recombination in subgenotype C4 genomes (both C4a and C4b) between structural genes from genotype C EV71 and non-structural genes from the prototype strains of CVA16, CVA14 and CVA4, whereas the evidence for intratypic recombination between C4 strains and genotype B strains was not strong. These intertypic recombinant C4 viruses were first seen in 1998 and have been the predominant endemic viruses circulating in mainland China for at least 14 years. A shift between the C4b and C4a evolutionary branches of the recombinant C4 viruses was observed, and C4a viruses have been associated with large-scale nationwide HFMD outbreaks with higher morbidity and mortality since 2007.
Citation: Zhang Y, Tan X, Cui A, Mao N, Xu S, et al. (2013) Complete Genome Analysis of the C4 Subgenotype Strains of Enterovirus 71: Predominant Recombination C4 Viruses Persistently Circulating in China for 14 Years. PLoS ONE 8(2): e56341. doi:10.1371/journal.pone.0056341
Editor: Dong-Yan Jin, University of Hong Kong, Hong Kong
Received: September 14, 2012; Accepted: January 8, 2013; Published: February 18, 2013
Copyright: © 2013 Zhang et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This study was supported by National Basic Research Program of China (973 Program): (Grant No. 2011CB504902); Projects of the National Natural Science Foundation of China (Grant No. 30901259); Key Technologies R&D Program of National Ministry of Science (Grant No. 2013ZX10004-202; 2012ZX10004201-003); and Science and Technology Development Plan of Shandong Province (2009GG10002055). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
The Enterovirus genus in the family Picornaviridae includes four species with strains isolated from humans: Human enterovirus A (HEV-A), HEV-B, HEV-C and HEV-D. Human enterovirus 71 (EV71) is a member of the HEV-A species. The enterovirus genome is a single-stranded, positive-sense RNA of approximately 7400 nucleotides. It contains 5′ and 3′ untranslated regions (UTRs), which are essential for viral RNA replication. The genome is translated as a single large polyprotein that is composed of four capsid proteins, VP1 to VP4, and seven nonstructural proteins, 2A, 2B, 2C, 3A, 3B, 3C and 3D. The capsid proteins VP1 to VP4 are encoded by the P1 region. The P2 (2A, 2B, 2C) and P3 (3A, 3B, 3C and 3D) regions encode nonstructural proteins involved in polyprotein processing, RNA replication and shut-down of host-cell protein synthesis. The RNA-dependent RNA polymerase, 3D, is a major component of the viral replication complex, which also includes other viral proteins such as 2BC, 2C, 3AB and 3Cpro.
EV71 infection, which was first reported in the USA, has been a recurrent feature in the Asia-Pacific region since its first outbreak in Sarawak, Malaysia in 1997. In recent years, numerous large outbreaks of EV71-associated HFMD with high morbidity and mortality have occurred in Asian countries and regions, including Singapore, South Korea, Malaysia, Japan, Vietnam, mainland China and Taiwan. This has increased research interest in EV71 and led to extensive nucleotide sequencing and genotype description. Based on the VP1 coding region, EV71 is divided into four genotypes, A, B, C and D, and genotypes B and C are further divided into subgenotypes B0–B5 and C1–C5.
In China, a large-scale outbreak of EV71-associated HFMD with acute neurological disease occurred in 2007 in Linyi City, Shandong Province, and since then the outbreak pattern has repeated and worsened year by year, with increasing morbidity and mortality. It has been confirmed that subgenotype C4 has been the sole viral genetic lineage circulating in mainland China since 1998. The large HFMD outbreaks with fatal neurological complications that have occurred since 2007 are mainly due to subgenotype C4a of EV71.
Genomic recombination is well known to contribute to the genetic variation and evolution of enteroviruses. Complete genome analysis of the HEV-A prototypes indicated that recombination in the nonstructural region has played a role in the evolution of some of them. Phylogenetic analyses of several available EV71 sequences have shown that recombination occurred between EV71 and coxsackievirus A16 in the nonstructural region and between different subgenotypes of EV71.
Our previous studies confirmed that the large-scale HFMD outbreaks with fatal neurological complications that have occurred since 2008 are mainly due to subgenotype C4a of HEV71, which was identified as a recombinant virus carrying CVA16 sequence in the 3D region. Because outbreaks associated with these recombinant viruses may threaten public health in China, an extensive genetic analysis of the full-length genomes of C4 viruses circulating in mainland China both before and after the large-scale outbreaks is worthwhile in order to understand the recent increase in outbreak scale and in the number of severe cases.
The intensive surveillance for HEV71 circulation maintained in mainland China during and after the 2007 outbreak permitted a detailed complete-genome analysis of a large number of isolates from HFMD patients. To investigate the genetic characteristics of the complete genomes of subgenotype C4 EV71 strains and their recombination with HEV-A prototype strains, we performed a large-scale genomic sequence analysis of isolates (n = 202) collected from 17 countries worldwide over a period of four decades. We sequenced and analyzed the entire genomes of 16 subgenotype C4 EV71 strains isolated from patients with HFMD or encephalomyelitis, or from fatal cases, both before and after the large-scale EV71 outbreaks in mainland China. Phylogenetic, similarity plot and bootscan analyses were performed to examine the phylogenetic relationships and potential recombination between the C4 subgenotype EV71 strains circulating in mainland China, the oldest strain of each EV71 subgenotype and the prototype strains of other HEV-A species members.
Genetic Characterization of Full Length Genome of Chinese Mainland EV71
The full-length genome sequences of the 16 Chinese EV71 strains in the present study were aligned with 186 EV71 genome sequences available from GenBank, comprising 104 sequences from mainland China and 82 international sequences from South and East Asia (22), Australia (8), Europe (8), America (10), Japan (9), Korea (2) and Taiwan (23). The clinical specimens for these 186 sequences were collected between 1970 and 2011 (Table S1). Like other reported EV71 genomes, the 16 Chinese genomes were 7404–7406 nucleotides long. The differences in genome length resulted from insertions or deletions in the 5′ untranslated region (UTR), which was 741–743 nucleotides long. No deletions or insertions were observed in the P2, P3 and 3′ UTR genomic regions; the genome contained a single open reading frame (ORF) of 6,579 nucleotides encoding a polyprotein of 2,193 amino acids, and a 3′ UTR of 82 nucleotides preceding the poly(A) tract. Nucleotide substitutions among different EV71 strains were scattered throughout the genome. The nucleotide and deduced amino acid sequence identities among the 16 Chinese EV71 viruses were 93.5–100% and 83.7–100%, respectively. It is noteworthy that all C4b HEV71 genomes differed from all C4a HEV71 genomes at one position: an A-to-C change at nt 2503 in the VP1 coding region, which caused a Gln-to-His substitution at VP1 residue 10. Phylogenetic analysis of all the Chinese EV71 strains based on the complete genome showed that all strains clustered into subgenotype C4, except for HuB/CHN/2009, which clustered into genotype A, and Xiamen/CHN/2009, which clustered into subgenotype B5 (Fig. 1). Xiamen/CHN/2009 is the first B5 isolate found in mainland China. Most C4 strains fell into two predominant evolutionary branches, C4a and C4b. The C4a lineage was composed of the EV71 strains circulating in mainland China during 2007–2011, which indicated that the Chinese C4a strains evolved independently from other C4 viruses.
The C4b lineage included older Chinese viruses dating back to 1998 and a smaller number of international strains isolated before 2007. Taiwan strains circulating in 2004–2005 grouped with a Vietnam strain into a minor branch that was separate from the mainland Chinese strains, indicating that the Taiwan and Vietnam strains evolved independently from the viruses circulating in mainland China. Four orphan strains from mainland China (single sporadic viruses that each form a separate branch in the phylogenetic tree) diverged considerably from the other C4 viruses, suggesting that these sporadic isolates were probably imported rather than derived from the predominant evolutionary branches of mainland Chinese strains (Fig. 1). Phylogenetic analysis of individual genomic regions showed that, in the 5′UTR, P1, P2 and P3 regions, all C4 EV71 strains clustered into the same group, separate from other strains of the human enterovirus A species. In the 3′UTR, however, the C4 EV71 strains clustered closely with CVA-16, CVA-14 and CVA-4 (tree not shown), possibly because of the short sequence window. HeN09-17/HeN/CHN/2009-C4a and SH-17/SH/CHN/2002-C4b were chosen as the representative strains for C4a and C4b, respectively, and were used for the further phylogenetic and recombination analyses.
Red dots indicate sequences from mainland China and blue triangles indicate sequences downloaded from GenBank. The prototype strains of EV71 (BrCr) and CVA-16 (G-10) and the oldest sequences of the different EV71 subgenotypes (B0–B5 and C1–C5) were also included. Strain names in green indicate mild cases, strain names in purple indicate severe cases and strain names in pink indicate fatal cases.
Nucleotide and Deduced Amino Acid Sequence Identity Analysis Between C4a, C4b, the Oldest Strain of Each EV71 Subgenotype and Other Prototype Strains of the HEV-A Species
A comprehensive comparison of nucleotide and deduced amino acid sequence identities between C4a, C4b, the oldest strain of each EV71 subgenotype and the prototype strains of other HEV-A species members is shown in Table 1. In the 5′UTR, C4a and C4b viruses shared 74.9–89.5% identity with the BrCr strain, the representative strains of the other EV71 subgenotypes and the other HEV-A strains. In the capsid region, both for the overall P1 sequence and for the individual mature proteins VP4 to VP1, C4a and C4b viruses had much higher sequence identities with BrCr and other EV71 strains (nucleotide: 80.4–91.2%; amino acid: 62.4–100%) than with other HEV-A strains (nucleotide: 56.9–72.3%; amino acid: 37.9–78.7%). Interestingly, in the P2 and P3 regions, the sequence identities between C4a, C4b viruses and BrCr (nucleotide: 77.1–78.2%; amino acid: 95.4–96.1%) were lower than those between C4a, C4b viruses and CVA-16, CVA-14 and CVA-4 (nucleotide: 81.8–84.6%; amino acid: 96.5–97.7%), especially in the 2A, 2B, 2C, 3B, 3C and 3D regions. The 3′UTR sequences of the C4a and C4b viruses were similar to those of the other EV71 strains and more than 61.8% identical to each other, but their identities with the prototype strains of other HEV-A viruses varied widely (from 29.4% to 70.6%) (Table 2).
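The identity values reported above are percentages of matching positions between aligned sequences. As an illustration only, the following minimal Python sketch computes such a value for two sequences taken from an existing alignment. The function name `percent_identity`, the choice to skip gapped columns and the toy sequences are assumptions made for this example; they are not the software or settings used in the study.

```python
def percent_identity(seq_a: str, seq_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns where neither sequence has a gap.

    Assumption for this sketch: both inputs are rows of the same alignment,
    and gapped columns are excluded from the comparison.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == gap or b == gap:
            continue  # skip columns with a gap in either sequence
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable (ungapped) columns")
    return 100.0 * matches / compared


# Toy aligned fragments (hypothetical, not real EV71 sequence data).
identity = percent_identity("ATGGCTACGT", "ATGGCAACGT")  # 9 of 10 columns match
gapped_identity = percent_identity("ATG-CA", "ATGGCT")   # 4 of 5 ungapped columns match
```

Applying the same function to translated (amino acid) alignments gives the deduced amino acid identities; region-specific values such as those for P1, P2 or P3 would be obtained by slicing the alignment to the relevant coordinates first.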
Phylogenetic Analysis of the Chinese C4 EV71 Strains and Other HEV-A Genomes
To investigate the genetic relationship between the Chinese C4 EV71 strains, the oldest strains of the EV71 subgenotypes and the other prototype HEV-A strains available in GenBank, phylogenetic trees were constructed from the full-length genome and from the 5′UTR, P1, P2, P3 and 3′UTR regions of the genome (Fig. 2a–e). For the full-length genome and the P1 region, the phylogenies were similar: the Chinese C4 EV71 strains clustered closely with genotype C of EV71 and were segregated from the other prototype HEV-A strains. In the 5′UTR, all members of HEV-A were closely related to one another (Fig. 2a). In both the P2 and P3 regions, the Chinese C4 EV71 strains were phylogenetically closer to the CVA-16, CVA-14 and CVA-4 prototype strains than to the EV71 prototype strain BrCr (Fig. 2c, 2d). In the 3′UTR, C4 EV71 isolates clustered with EV71 subgenotype B3 and with CV-A4, CV-A14 and CV-A16 (95% bootstrap support).
The neighbour-joining trees were constructed from alignments of the 5′UTR (a), P1 (b), P2 (c), P3 (d) and 3′UTR (e) genomic regions. The percentages of bootstrap replicates (out of 1000 pseudoreplicate datasets) supporting each node are indicated at the nodes; for clarity, only values over 80% are shown. Branch lengths are proportional to genetic distances corrected using the Kimura two-parameter substitution model.
When the phylogenetic analysis was repeated for the individual coding regions within P2 and P3, a slightly different relationship emerged between subgenotype C4, the other EV71 subgenotypes and the HEV-A prototype strains (Fig. S1). C4a, C4b and the prototype strain of subgenotype C4, shzh98/CHN/1998, consistently clustered together in all P2 and P3 regions except 2B, where C4a and C4b segregated from shzh98/CHN/1998 and clustered with genotype B and some HEV-A strains. In the 2A region, the Chinese C4 viruses were phylogenetically closest to genotype C strains and the CVA-4 prototype strain; in the 2B region, C4a and C4b viruses were closest to CVA-4. This indicated that the 2A–2B region is a highly variable genomic region and a breakpoint for recombination. In the 2C, 3A and 3B regions, C4a and C4b viruses shared a similar phylogenetic pattern: they were closest to the B0–B5 subgenotype strains and tended to lie closer to CVA-16, CVA-14 and CVA-4. In the 3C, 3D and 3′UTR regions, C4a and C4b viruses appeared to be closer to CVA-16, CVA-14 and CVA-4 than to the genotype B strains, with the exception of subgenotype B3. Isolates of subgenotype C4 clustered with B3 isolates, CV-A4, CV-A14 and CV-A16, and support for this clustering was significant (100% bootstrap). The phylogenetic relationships of the viruses therefore differed depending on the position in the genome. The observed differences in tree topology between the capsid and noncapsid regions indicate that recombination might have occurred during the evolution of these viruses.
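For readers unfamiliar with the Kimura two-parameter (K2P) correction mentioned in the legend, the sketch below shows how the corrected distance between two aligned nucleotide sequences can be computed from the proportions of transitions (P) and transversions (Q), using the standard formula d = -0.5 ln[(1 - 2P - Q) sqrt(1 - 2Q)]. The function name and the handling of gaps and ambiguous bases (such columns are skipped) are assumptions for this illustration; the tree-building software used in the study may treat them differently.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq_a: str, seq_b: str) -> float:
    """Kimura two-parameter distance between two aligned nucleotide sequences.

    Assumption for this sketch: columns containing gaps or ambiguous bases
    are ignored; U is treated as T so RNA sequences can be used directly.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    a_norm = seq_a.upper().replace("U", "T")
    b_norm = seq_b.upper().replace("U", "T")
    sites = transitions = transversions = 0
    for a, b in zip(a_norm, b_norm):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguity codes
        sites += 1
        if a == b:
            continue
        pair = {a, b}
        if pair <= PURINES or pair <= PYRIMIDINES:
            transitions += 1    # A<->G or C<->T
        else:
            transversions += 1  # purine<->pyrimidine
    if sites == 0:
        raise ValueError("no comparable sites")
    p = transitions / sites
    q = transversions / sites
    term1 = 1.0 - 2.0 * p - q
    term2 = 1.0 - 2.0 * q
    if term1 <= 0 or term2 <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(term1 * math.sqrt(term2))


# Toy example (hypothetical sequences): one transition in ten sites.
d_transition = k2p_distance("AAAAAAAAAA", "GAAAAAAAAA")
```

Because the correction accounts for multiple substitutions at the same site, the corrected distance is always at least as large as the raw proportion of differing sites.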
Recombination Analysis of the Chinese C4 EV71 Strains and Other HEV-A Genomes
The genome sequences of the Chinese C4 subgenotype viruses and all available HEV-A prototype strains were analyzed with Simplot software, using the representative strain of each lineage in turn as the query sequence. Similarity plot analyses demonstrated that C4a and C4b viruses showed the highest similarity to genotype C of EV71 in the capsid region, but that in the non-capsid region they all contained an unidentified sequence in the P2 and P3 coding regions that was apparently not related to those of other EV71 strains (Fig. 3). Comparison of the P2 and P3 coding sequences of the C4a and C4b EV71 strains with those of selected prototype strains of HEV-A, B, C and D revealed no sequence match above 85.8%, and showed higher similarity to HEV-A than to HEV-B, C and D. In addition, the deduced amino acid sequences of the recombinant noncapsid regions of the C4a and C4b EV71 strains showed high identity with HEV-A, especially with prototype CVA16 (>97.4%), prototype CVA14 (>97.0%) and prototype CVA4 (>96.5%).
Each analysis used the representative virus of one of the two lineages as the query sequence. A sliding window of 1000 nucleotides moving in 20-nucleotide steps was used. (a) C4b virus: SH-17/SH/CHN/2002; (b) C4a virus: HeN09-17/HeN/CHN/2009.
Subsequent bootscan analyses indicated possible recombination events. The C4a and C4b EV71 strains were most closely related to genotype C of EV71 in the 5′ half of the genome, which is consistent with the Simplot results. Downstream of the VP1/2A junction, however, the bootscan graph showed a well-supported phylogenetic relationship between the C4a and C4b EV71 strains and CVA-16, CVA-14 and CVA-4.
Intertypic recombination involving the three HEV-A prototype strains and EV71 was supported by one major breakpoint at nucleotides 3700–3826, within the 2A gene (Fig. 3), where the C4a and C4b genome sequences switched to resemble the non-structural sequences of CVA-16, CVA-14 and CVA-4. Although this breakpoint also appeared between genotype C and genotype B of EV71, the genotype B strains maintained consistent similarity to the C4a and C4b query strains. This analysis therefore provides no strong evidence for intratypic recombination between genotypes C and B of EV71.
Four orphan strains of subgenotype C4, FJ194964-EV71/GDFS/3/2008, GQ994989-CQ/CHN/2009, FJ607337-SHZH/CHN/2008Fatal and HQ423143-km186/yYN/CHN/2009, were also analyzed with Simplot. The strain from the fatal case, FJ607337-SHZH/CHN/2008Fatal, shared similar identity (>94%) with the other three orphan strains from non-fatal cases in the 5′UTR, P1 and P2 regions of the genome, but the identity decreased gradually across the P3 region, down to 85% in the 3D region (not shown). It has been reported that sequences in the viral RNA-dependent RNA polymerase 3D (3Dpol) gene are important in determining the neurovirulence of polioviruses. Similarly, mutation of the EV71 standard strain BrCr in the 3Dpol region, which catalyzes both (−)-strand and (+)-strand RNA synthesis, resulted in attenuated neurovirulence in the cynomolgus monkey model. Whether this variation in the 3D region played an important role in the death of the fatal case needs to be determined by further study.
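The sliding-window idea behind a similarity plot can be sketched in a few lines of Python. The example below is a simplified illustration, not a reimplementation of Simplot: it reports plain percent identity per window (Simplot can also apply distance corrections), and the function name, the gap handling and the toy sequences are assumptions. The defaults mirror the window of 1000 nucleotides and step of 20 nucleotides stated in the legend.

```python
def sliding_similarity(query: str, reference: str, window: int = 1000, step: int = 20):
    """Return (window midpoint, percent identity) pairs along an alignment.

    Assumption for this sketch: both sequences come from the same alignment
    and columns with a gap in either sequence are ignored within a window.
    """
    if len(query) != len(reference):
        raise ValueError("sequences must be aligned to the same length")
    if window > len(query):
        raise ValueError("window is longer than the alignment")
    points = []
    for start in range(0, len(query) - window + 1, step):
        q = query[start:start + window]
        r = reference[start:start + window]
        pairs = [(a, b) for a, b in zip(q, r) if a != "-" and b != "-"]
        if not pairs:
            continue  # window contains only gapped columns
        identity = 100.0 * sum(a == b for a, b in pairs) / len(pairs)
        points.append((start + window // 2, identity))
    return points


# Toy alignment (hypothetical): identical first half, different second half,
# scanned with a small window so the drop in similarity is visible.
toy_query = "A" * 40
toy_reference = "A" * 20 + "C" * 20
toy_points = sliding_similarity(toy_query, toy_reference, window=10, step=5)
```

Plotting such curves for several reference strains against one query, and looking for the position where the most similar reference changes, is the basic way a candidate recombination breakpoint is read off a similarity plot.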
Transmission and Evolution of C4 Recombination Virus in China Since 1998
The recombinant C4 virus was first isolated in Shenzhen, Guangdong Province, China in 1998. Our present study indicated that recombinant C4 viruses have been the predominant endemic viruses circulating in mainland China for at least 14 years since 1998. During this period they evolved into two major evolutionary branches, C4a and C4b. Our present and previous data indicated that C4b viruses circulated in mainland China for 5 years from 1998 (C4b was prevalent from 1998 to 2003, after which its transmission was interrupted) and rarely caused severe disease or death in HFMD patients. They were replaced by C4a viruses, which have caused nationwide outbreaks with higher morbidity and mortality since 2007, indicating a shift between the two evolutionary branches within the recombinant subgenotype C4 viruses. A total of 5,034,764 HFMD cases (range: 489,082–1,619,148 per year), including 61,582 severe cases (range: 1,164–27,891 per year) and 1,894 fatal cases (range: 126–904 per year), were reported to NNDRS during 2008–2011 in mainland China, with the numbers of severe and fatal cases increasing year by year. The recombinant C4a EV71 viruses have been associated with more than 80% of the severe cases and 92% of the fatalities since 2008.
Regarding the geographical transmission of the recombinant C4 viruses, we found that C4b viruses circulated in southern provinces and municipalities of China, such as Guangdong, Chongqing, Shanghai and Guangxi, whereas C4a viruses were transmitted extensively throughout mainland China.
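To put the reported totals in proportion, the short calculation below derives the share of severe and fatal cases from the 2008–2011 figures quoted above. The variable names are illustrative, and only the three totals taken from the text are used as inputs.

```python
# Totals reported to NNDRS for 2008-2011, as quoted in the text.
total_cases = 5_034_764
severe_cases = 61_582
fatal_cases = 1_894

# Proportions derived from those totals.
severe_pct = 100.0 * severe_cases / total_cases          # severe cases per 100 HFMD cases
fatal_pct = 100.0 * fatal_cases / total_cases            # deaths per 100 HFMD cases
fatal_among_severe_pct = 100.0 * fatal_cases / severe_cases

print(f"severe: {severe_pct:.2f}% of reported cases")
print(f"fatal: {fatal_pct:.3f}% of reported cases")
print(f"fatal among severe: {fatal_among_severe_pct:.1f}%")
```

On these totals, roughly 1.2% of reported cases were severe, about 0.04% were fatal, and about 3% of severe cases ended in death.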
The phylogenetic analysis of complete EV71 genomes from mainland China in the present study showed that all Chinese strains clustered into subgenotype C4, except for HuB/CHN/2009, which clustered into genotype A, and Xiamen/CHN/2009, which clustered into subgenotype B5 (Fig. 1). Our previous studies on EV71 epidemiology showed that reported EV71 infections in mainland China had been associated with a single predominant subgenotype, C4, for more than 10 years, apart from two orphan genotype C viruses found in 1987 and 1997, respectively. Subgenotype C4 HEV71 has been the predominant endemic virus circulating in mainland China since 1998. In other areas, such as Taiwan, Australia and Malaysia, however, the circulating HEV71 subgenotypes have shifted repeatedly. The patterns of HEV71 prevalence thus varied among different areas. Possibly because of the large, high-density population of mainland China, a large newborn cohort joins the susceptible population every year, so the endemic viruses can circulate persistently for many years. In areas with small, low-density populations, by contrast, circulation of the endemic viruses may be interrupted after a period of time as population immunity increases, after which other genotypes or subgenotypes of HEV71 can be imported and become the endemic strains. It is also interesting that the molecular epidemiological pattern of HEV71 in mainland China appears to be similar to those of measles virus and CVA-16, for which a single predominant endemic virus has circulated in mainland China for a long time, but quite different from that of rubella virus, for which different genotypes have co-circulated in China.
In this study, one B5 subgenotype virus was found in Xiamen (located in southern China) in 2009; it is the first B5 isolate found in mainland China. B5 subgenotype viruses were first found in Japan and Malaysia in 2003 and later circulated in other countries and regions, such as Vietnam and Taiwan. Genotype A consists of the EV71 strain BrCr-CA-70, identified in 1970 in the USA, and was not detected again until 2008. Yu et al. reported the emergence of five isolates closely related to genotype A in central China. This was the first report of contemporary genotype A EV71. The sequence identities between the HuB/CHN/2009 genotype A strain and the prototype BrCr were 98.9% and 98.0% for nucleotides and amino acids, respectively, over the full-length genome. Based on the reported evolutionary rate for HEV71, 3.18×10−3 nucleotide substitutions/site/year, the divergence between BrCr, isolated in 1970, and modern Chinese strains found in 2008–2009 would be expected to be at least 18%. The occurrence of both HuB/CHN/2009 and Luan/CHN/2008 therefore appeared anomalous. These genotype A viruses might be laboratory-adapted strains or the result of laboratory contamination. Although subgenotype C4 has been the predominant strain circulating in mainland China for more than 10 years, the occurrence of orphan viruses of other genotypes or subgenotypes (C, B5 and the anomalous A) indicates that extensive surveillance of EV71 in mainland China should be strengthened.
Based on our analysis of complete genomes, no severe or fatal cases caused by C4b viruses were reported before the large-scale outbreaks of HFMD in mainland China, during 1993–2003. During the large-scale outbreaks, however, the increasing neurovirulence associated with C4a viruses has been a major public health concern in mainland China. Our complete genome analysis showed that C4a viruses caused disease phenotypes ranging from mild to fatal. No specific lineages were associated with severe, fatal or mild cases. This indicated that the viruses isolated from patients with different phenotypes derived from a common ancestor and evolved into different lineages through gradual mutation. We speculate that both viral and host factors contributed to the disease phenotype.
Genomic recombination is well known to contribute to the genetic variation and evolution of enteroviruses. Different research groups have reported that enteroviruses of various serotypes or genotypes co-circulate in populations at certain times. During such co-circulation, recombination between parts of the genomes of viruses of different serotypes or genotypes may occur when different viruses infect and replicate in the same cell. This recombination process allows enteroviruses to create and maintain their genetic diversity and fitness. Several reports have presented evidence for EV71 recombination, including intratypic recombination between subgenotypes of EV71 and intertypic recombination with other prototype strains of HEV-A. However, these studies reported intratypic recombination between genotypes or subgenotypes of EV71 based on consensus sequences or representative strains of each genotype or subgenotype, not on the oldest strain within each group, which is very important for recombination analysis. In this study, we downloaded all the complete genome sequences of EV71 from GenBank and used the published papers or the sequence submission information to identify the oldest strain within each EV71 subgenotype; the C4a and C4b viruses were then aligned with the oldest strain of each EV71 subgenotype and the prototype strains of HEV-A.
Our comprehensive recombination analysis showed evidence of recombination in subgenotype C4 genomes (both C4a and C4b) between structural genes of genotype C EV71 and non-structural genes derived from the prototype strains of CVA16, CVA14 and CVA4. The evidence for intratypic recombination between C4 strains and genotype B strains was not strong: the genotype B strains showed consistently high identity (80–86%) with the C4a and C4b strains in the 5′ part of the genome (5′UTR to P2), whereas in the P3 region higher identity was found between the C4a and C4b strains and the prototype strains of CVA16, CVA14 and CVA4 than with genotype B EV71. In summary, these analyses showed evidence of genomic recombination in C4a and C4b between structural genes of genotype C EV71 and non-structural genes derived from CVA16, CVA14 and CVA4, not from genotype B EV71. This finding was inconsistent with previous studies (Table S3).
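The selection step described above, keeping the oldest strain of each subgenotype as the reference, can be expressed as a simple grouping operation. The Python sketch below assumes that each sequence record has already been annotated with a subgenotype and an isolation year; the `Strain` type, the function name and the short record list are illustrative only (the strain names and years are taken from this article, but the list is a tiny subset, not the study dataset).

```python
from typing import Dict, Iterable, NamedTuple


class Strain(NamedTuple):
    name: str
    subgenotype: str
    year: int  # year of isolation


def oldest_per_subgenotype(strains: Iterable[Strain]) -> Dict[str, Strain]:
    """Keep, for each subgenotype, the strain with the earliest isolation year."""
    oldest: Dict[str, Strain] = {}
    for strain in strains:
        current = oldest.get(strain.subgenotype)
        if current is None or strain.year < current.year:
            oldest[strain.subgenotype] = strain
    return oldest


# Illustrative records only; a real run would use every annotated GenBank genome.
records = [
    Strain("HeN09-17/HeN/CHN/2009", "C4", 2009),
    Strain("SH-17/SH/CHN/2002", "C4", 2002),
    Strain("shzh98/CHN/1998", "C4", 1998),
    Strain("Xiamen/CHN/2009", "B5", 2009),
]
reference_set = oldest_per_subgenotype(records)
```

With these records the C4 reference is shzh98/CHN/1998, matching the prototype C4 strain named in the text; with the full dataset, the B5 reference would be an earlier isolate than the single B5 record shown here.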
Interestingly, the EV-71 isolates of subgenotype B3 shared a similar recombination pattern with C4. These isolates had high sequence similarity to EV-71 genotype B, CV-A4, CV-A14 and CV-A16/G10 in the P2 genomic region (≥81%) and high sequence similarity to CV-A4, CV-A14 and CV-A16 in the P3 genomic region (≥83%; Figure 2). In the P3 region, the sequence similarity of subgenotype B3 and C4 isolates to the rest of the EV-71 genotypes was only 75–79%. In the phylogenetic trees, the B3 strains consistently clustered with B1, B2, B4 and B5 across the genome except in the P3 region, where B3 segregated from the other B subgenotypes and clustered more closely with CV-A4, CV-A14 and CV-A16. This indicates that B3 shares a similar evolutionary pattern with the C4 subgenotype viruses: they likely "trap" sequences from other HEV-A viruses during natural multiplication of HEV71 strains, thereby producing new viruses that differ from the parental strains.
The clustering of subgenotype C4 isolates with B3 and CVA4, CVA14 and CVA16 in the 3′UTR was consistent with the clustering in the P2 and P3 genomic regions. No significant segregation (<30% bootstrap support), however, was observed for the remaining isolates, perhaps because of the short length of the 3′UTR (~83 nt). Based on these results, it appears that the genes in the 3′ half of the EV-71 genome contributed to the multiple and diverse EV71 subgenotypes, and that these genes showed high similarity to different HEV-A viruses.
The incongruent phylogenies and Simplot similarity analyses imply that recombination has played an evident role in the evolution of C4 EV71 viruses. C4a and C4b clearly contained sequences in the non-capsid region that are also present in CVA4, CVA14 and CVA16, suggesting that these three HEV-A strains and the C4 subgenotype viruses of EV71 have a shared evolutionary history despite their lack of similarity in the capsid region. However, the exact HEV-A recombination partner could not be identified because there are insufficient data on the P2 and P3 sequences of HEV-A viruses in China or elsewhere in the world; it may be assumed that genetic exchange occurred when the HEV71 strain co-circulated with other HEV-A viruses in China during that period. In this study, we performed nucleotide and deduced amino acid sequence identity analysis and phylogenetic analysis based on the 5′UTR, P1, P2, P3 and 3′UTR regions of the genome. Both analyses indicated that C4a and C4b viruses had much higher sequence identities with EV71 in the P1 region, whereas in the P2 and P3 regions they had much higher sequence identities with CVA-16, CVA-14 and CVA-4. Although we combined the Simplot, identity and phylogenetic analyses to support the recombination of C4 EV71 with the prototype strains of CVA16, CVA14 and CVA4, this conclusion remains a hypothesis.
The species human enterovirus A (HEV-A), which includes 11 members of the coxsackievirus A (CV-A) group (CVA2–8, CVA10, CVA12, CVA14 and CVA16) and human enterovirus 71 (EV-71), is associated with several human diseases. CVA4 causes herpangina; EV71, CVA14 and CVA16 are highly associated with HFMD; and EV71 and CVA7 are occasionally associated with neurological diseases. Yamayoshi et al. reported that EV71, CVA7, CVA14 and CVA16 use the same cellular receptor, SCARB2, a critical receptor common to all EV71 strains, for infection. Given that these SCARB2-dependent viruses sometimes co-circulate during an HFMD epidemic, they might have a high potential to undergo intertypic recombination through co-infection of a SCARB2-expressing cell in vivo. In this study, we provided evidence for intertypic recombination in subgenotype C4 between EV71 and CVA16, CVA14 and CVA4. Based on the published co-infection and cellular receptor studies, we propose that C4a and C4b viruses are intertypic recombinants between EV71 and other HEV-A strains derived from CVA16 or CVA14, but not CVA4, since CVA4 infects cells via a different cellular receptor pathway and is associated mainly with a different clinical outcome, herpangina. The use of the same cellular receptor by EV71 and CVA7 also makes recombination between these two enteroviruses possible; such a recombinant might appear as an emerging infectious pathogen and could have unexpectedly high virulence, since both viruses are associated with neurological diseases. Careful and continuous surveillance of these viruses for potential recombination is important for public health.
In this study, we provided evidence that these recombinant C4 viruses have been present in China since 1998, have circulated persistently there for more than 14 years, and evolved into 2 major evolutionary lineages, C4a and C4b, over this period. An increasing number of severe neurological diseases and fatal cases have been caused by the intertypic recombinant C4a viruses throughout mainland China since 2007. A total of 5,034,764 HFMD cases, including 61,582 severe and 1,894 fatal cases, were reported to NNDRS during 2008–2011 in mainland China, with the numbers of severe and fatal cases increasing year by year. The recombinant C4a EV71 viruses have been associated with more than 80% of the severe cases and 92% of the fatalities since 2008. The cause of the large-scale HFMD outbreaks with increasing morbidity and mortality has been one of the most important issues in biomedical research in recent years. In the present study, apart from the surveillance gaps in mainland China during 1999–2000 and 2004–2006, we obtained a full picture of the EV71 epidemic based on complete genome analysis. Our present and previous data indicated that C4b viruses circulated in mainland China for 5 years from 1998 (C4b was prevalent from 1998 to 2003 but has now disappeared from mainland China) and rarely caused severe disease or death in HFMD patients; they were replaced by C4a viruses, which have caused nationwide outbreaks with higher morbidity and mortality, including many severe and fatal HFMD cases, since 2007. The acquisition of a segment of genes from CVA16 or CVA14 by EV71 C4b viruses could have made the virus better able to adapt to a new environment, especially host immunity. C4b viruses then continued to adapt to their hosts via nucleotide mutations; such mutations often lead to changes in the pathogenicity, patterns of prevalence and clinical manifestations of EV71 infection, and complicate clinical diagnosis.
This could be the reason why C4b was rapidly replaced by C4a viruses, which have remained highly prevalent, with increasing neurovirulence and transmissibility, in mainland China since 2007. Additionally, it is possible that the C4a EV71 strains currently circulating in mainland China, which are associated with higher morbidity and mortality, acquired one or more unidentified neurovirulence determinants in the viral genome and became more neurovirulent than those that circulated previously, because it has been reported that an increase in neurovirulence can be caused by point mutations or by genetic recombination between avirulent poliovirus vaccine strains and non-polio enteroviruses. Other research groups have identified genetic differences between EV71 isolates from patients with severe and mild clinical disease and are trying to test the virulence phenotype of these recombinant viruses in a mouse model of infection via a reverse genetics approach. Our research team is currently using reverse genetics methods to identify the key nucleotide and amino acid differences between evolutionary branches C4a and C4b in order to discover the important determinants of neurovirulence and the possible reasons for the repeated outbreaks of HFMD in China in recent years. Our comprehensive study of the complete genomes of C4 subgenotype EV71 is significant for the prevention and control of EV71, and for EV71 vaccine development, worldwide. However, genetic studies alone are not sufficient for vaccine development; it is critical to study antigenic variation when selecting vaccine strains.
Materials and Methods
Viruses and Sequence
This study did not involve human participants or human experimentation; the only human materials used were stool samples, throat swab samples, and vesicles collected from HFMD patients at the instigation of the Ministry of Health P. R. of China for public health purposes, and written informed consent for the use of their clinical samples was obtained from all patients involved in this study. This study was approved by the second session of the Ethics Review Committee of the Chinese Center for Disease Control and Prevention. The EV71 strains used in this study were isolated in 2002–2003 and 2007–2009 from stool, throat swabs, or vesicles from HFMD patients from different geographical locations in the Shanghai, Chongqing, Anhui, Shandong and Henan provinces of China (Table S1). Viruses were isolated from original clinical specimens by propagation in RD (human rhabdomyosarcoma) cells by conventional methods and then sequenced. To investigate the full genetic characterization of the complete genome of EV71 in mainland China, 186 complete EV71 genome sequences available from GenBank were downloaded and aligned with the 16 full-length EV71 sequences determined in the present study. Then, 65 sequences from GenBank (selected for genetic diversity; sequences that were identical or differed by <2% were removed) and the 16 sequences from this study were chosen for further phylogenetic and recombination analysis (Table S1).
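The redundancy filtering described above (dropping sequences that are identical or differ by less than 2%) can be illustrated with a minimal sketch. It assumes pre-aligned sequences and a simple greedy rule; the function names `p_distance` and `filter_redundant` are hypothetical, and the actual selection procedure used in the study is not specified beyond the 2% criterion.

```python
from typing import Dict, List


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap ('-') are skipped (an assumption).
    """
    pairs = [(x, y) for x, y in zip(a.upper(), b.upper()) if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)


def filter_redundant(seqs: Dict[str, str], threshold: float = 0.02) -> List[str]:
    """Greedy filter: keep a sequence only if it differs from every
    already-kept sequence by at least `threshold` (2% by default)."""
    kept: List[str] = []
    for name, seq in seqs.items():
        if all(p_distance(seq, seqs[k]) >= threshold for k in kept):
            kept.append(name)
    return kept
```

With a greedy rule of this kind, the result depends on input order, so the order in which sequences are supplied (for example, oldest strain first) is itself a design choice.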
Determination of the Full-length Genome Sequences of EV71
Viral RNA was extracted from the viral isolates using a QIAamp Viral RNA Mini Kit (Qiagen, Valencia, CA, USA) and stored at −80°C until further use. The full-length genomes of 16 EV71 strains from the HFMD patients were amplified and sequenced. The viral RNA was converted to cDNA by a random priming strategy. The cDNA was amplified using primers designed from multiple alignments of EV71 genomes available in the GenBank database (Table S2). The PCR products were purified using the QIAquick Gel Extraction Kit (Qiagen). The amplicons were then sequenced bidirectionally using an ABI PRISM 3100 genetic analyzer (Applied Biosystems). The 5′-segment sequences were determined using a 5′-rapid amplification of cDNA ends (RACE) core set (Takara Biomedicals), according to the manufacturer’s instructions.
Phylogenetic and Bioinformatics Analysis
For sequence assembly, overlapping DNA sequences with at least 85% sequence homology and a minimum overlap of 20 were assembled into contigs to generate consensus sequences using Sequencher version 4.0.5 (Gene Codes Corporation, USA). The consensus sequences were aligned against other EV71 complete genome sequences retrieved from GenBank (Table S1) using MEGA 5.05. The 65 completely sequenced EV71 isolates available in GenBank and the 16 isolates obtained in the present study were used to construct phylogenetic trees. Genetic distance was determined by pairwise estimation of percent sequence divergence. Positions with gaps were included, the transition/transversion ratio was fixed at 10, and corrections for multiple substitutions were made. These options were chosen to take into account the ambiguous parts of the alignments and to correct for multiple substitutions at the same site, which would otherwise lead to underestimation of the actual genetic distances. Phylogenetic trees presented here were constructed using MEGA 5.05 with the neighbour-joining method. The robustness of the phylogenetic trees was estimated by bootstrap analysis with 1000 pseudoreplicates. A bootstrap value of ≥80% was taken to indicate strong support for the tree topology. Amino acid sequences were examined after stripping the 5′ UTR and 3′ UTR sequences, and consensus sequences for each EV71 genotype were established.
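To illustrate what a correction for multiple substitutions does, the sketch below computes a pairwise distance under the Kimura two-parameter model, which the supplementary figure caption names as the distance correction. This is the standard closed-form estimate, not MEGA's implementation; the function name `k2p_distance` is hypothetical, and, unlike the settings described above, the sketch simply skips sites that contain gaps or ambiguous bases.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(a: str, b: str) -> float:
    """Kimura two-parameter distance between two aligned nucleotide sequences.

    d = -1/2 * ln((1 - 2P - Q) * sqrt(1 - 2Q)),
    where P and Q are the proportions of transitional and transversional
    differences among the compared sites.
    """
    transitions = transversions = compared = 0
    for x, y in zip(a.upper(), b.upper()):
        if x not in "ACGT" or y not in "ACGT":
            continue  # skip gaps and ambiguity codes (an assumption)
        compared += 1
        if x == y:
            continue
        if {x, y} <= PURINES or {x, y} <= PYRIMIDINES:
            transitions += 1  # A<->G or C<->T
        else:
            transversions += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    inner = 1 - 2 * p - q
    if inner <= 0 or 1 - 2 * q <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(inner * math.sqrt(1 - 2 * q))
```

The corrected distance is always at least as large as the raw proportion of differing sites, which is why uncorrected distances tend to underestimate divergence between distantly related genomes.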
Two nucleotide alignments were generated using MEGA 5.05. The first alignment contained the genome sequences of a C4b evolutionary branch EV71 strain (SH-17/SH/CHN/2002), the oldest strains of each subgenotype of EV71 and the HEV-A prototype strains (CVA2, 3, 4, 5, 6, 7, 8, 10, 12, 14 and 16, and EV71, 76, 89, 90, 91 and 92); the second contained the genome sequences of a C4a evolutionary branch EV71 strain (HeN09-17/HeN/CHN/2009), the oldest strains of each subgenotype of EV71 and the HEV-A prototype strains. Once the sequences were aligned, similarity plot and bootscan analyses were performed using the Simplot program (version 3.5.1; Stuart Ray, Johns Hopkins University, Baltimore, Maryland, USA). To present the recombination analysis clearly, 13 viruses were selected on the basis of identity analysis for Figure 3.
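The idea behind a similarity plot can be sketched as a sliding-window percent identity between a query genome and one reference at a time. The code below is only an illustration of that principle, not the Simplot program: the function name `window_similarity` is hypothetical, the window and step sizes are assumed values because the text does not state the settings used, and no distance correction or bootscanning is applied.

```python
from typing import List, Tuple


def window_similarity(
    query: str, reference: str, window: int = 200, step: int = 20
) -> List[Tuple[int, float]]:
    """Percent identity between two aligned sequences in sliding windows.

    Returns (window midpoint, percent identity) pairs. Sites where either
    sequence has a gap are ignored within each window.
    """
    if len(query) != len(reference):
        raise ValueError("sequences must come from the same alignment")
    points: List[Tuple[int, float]] = []
    for start in range(0, len(query) - window + 1, step):
        q = query[start:start + window].upper()
        r = reference[start:start + window].upper()
        pairs = [(x, y) for x, y in zip(q, r) if x != "-" and y != "-"]
        if pairs:
            identity = 100.0 * sum(x == y for x, y in pairs) / len(pairs)
            points.append((start + window // 2, identity))
    return points
```

Running the function for the same query against several references (for example, an EV71 genotype C strain and a CVA16 prototype) and plotting the curves together shows where the most similar reference changes along the genome, which is the pattern interpreted as a candidate recombination breakpoint.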
Nucleotide Sequence Accession Numbers
The 16 sequences reported in this study were deposited in the GenBank sequence database under accession numbers EU703812–EU703814 and JX678874–JX678886.
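As a quick consistency check, the two accession ranges can be expanded and counted; they should yield the 16 sequences reported. The helper `expand_accessions` is a hypothetical name, and the sketch assumes accessions made of a letter prefix followed by digits.

```python
import re
from typing import List


def expand_accessions(first: str, last: str) -> List[str]:
    """Expand an inclusive accession range such as EU703812 to EU703814."""
    m1 = re.fullmatch(r"([A-Z]+)(\d+)", first)
    m2 = re.fullmatch(r"([A-Z]+)(\d+)", last)
    if not m1 or not m2 or m1.group(1) != m2.group(1):
        raise ValueError("accessions must share the same letter prefix")
    prefix, width = m1.group(1), len(m1.group(2))
    lo, hi = int(m1.group(2)), int(m2.group(2))
    if hi < lo:
        raise ValueError("range end precedes range start")
    return [f"{prefix}{n:0{width}d}" for n in range(lo, hi + 1)]


accessions = expand_accessions("EU703812", "EU703814") + expand_accessions(
    "JX678874", "JX678886"
)
print(len(accessions))  # 3 + 13 accessions
```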
List of 186 HEV71 strains.
Primers for RT-PCR, Sequencing and RACE.
Differences between this study and other studies (refs. 19, 20 and 27) in the break points and parental strains identified for EV71 C4a and C4b recombinants.
Phylogenetic trees showing the relationships among HEV-A isolates in different genomic regions. The neighbour-joining trees were constructed from alignments of the 2A (a), 2B (b), 2C3A3B (c) and 3C3D (d) genomic regions, respectively. The percentages of bootstrap replicates (out of 1000 pseudoreplicate datasets) supporting the trees are indicated at the nodes; for clarity, only values over 80% are shown. The branch lengths are proportional to the genetic distances corrected using the Kimura two-parameter substitution model.
We would also like to acknowledge all of the laboratories that isolated the viruses used in this study. We thank the anonymous reviewers for comments that improved the manuscript.
Conceived and designed the experiments: Yan Zhang WBX. Performed the experiments: Yan Zhang XJT ALC NYM STX ZZ JHZ YPZ XJW XYH SLZ Yong Zhang WT HL. Analyzed the data: Yan Zhang JS. Contributed reagents/materials/analysis tools: Yan Zhang YPZ XJW XYH WBX. Wrote the paper: Yan Zhang WBX.
- 1. Stanway G, Brown F, Christian P, Hovi T, Hyypia T, et al. (2005) Family Picornaviridae. In: “Virus Taxonomy Eighth Report of the International Committee on Taxonomy of Viruses”. London: Elsevier/Academic Press. 757–778.
- 2. Melnick JL (1996) Enteroviruses: polioviruses, coxsackieviruses, echoviruses and newer enteroviruses. Fields Virology 3rd edition. Philadelphia: Lippincott-Raven Publishers. 655–711.
- 3. Palmenberg AC (1990) Proteolytic processing of picornaviral polyprotein. Annu Rev Microbiol 44: 603–623. doi: 10.1146/annurev.mi.44.100190.003131
- 4. AbuBakar S, Chee HY, Al-Kobaisi MF, Xiaoshan J, Chua KB, et al. (1999) Identification of enterovirus 71 isolates from an outbreak of hand, foot and mouth disease (HFMD) with fatal cases of encephalomyelitis in Malaysia. Virus Res 61: 1–9. doi: 10.1016/s0168-1702(99)00019-2
- 5. Wu Y, Yeo A, Phoon MC, Tan EL, Poh CL, et al. (2010) The largest outbreak of hand, foot and mouth disease in Singapore in 2008: the role of enterovirus 71 and coxsackievirus A strains. Int J Infect Dis 14: e1076–1081.
- 6. Ryu WS, Kang B, Hong J, Hwang S, Kim J, Cheon DS, et al. (2010) Clinical and etiological characteristics of enterovirus 71-related diseases during a recent 2-year period in Korea. J Clin Microbiol 48: 2490–2494.
- 6. Ooi MH, Wong SC, Podin Y, Akin W, del Sel S, et al. (2007) Human enterovirus 71 disease in Sarawak, Malaysia: a prospective clinical, virological, and molecular epidemiological study. Clin Infect Dis 44: 646–656. doi: 10.1086/511073
- 7. Hosoya M, Kawasaki Y, Sato M, Honzumi K, Kato A, et al. (2006) Genetic diversity of enterovirus 71 associated with hand, foot and mouth disease epidemics in Japan from 1983 to 2003. Pediatr Infect Dis J 25: 691–694. doi: 10.1097/01.inf.0000227959.89339.c3
- 8. Tu PV, Thao NT, Perera D, Huu TK, Tien NT, et al. (2007) Epidemiologic and virologic investigation of hand, foot, and mouth disease, southern Vietnam, 2005. Emerg Infect Dis 13: 1733–1741. doi: 10.3201/eid1311.070632
- 9. Zhang Y, Tan XJ, Wang HY, Yan DM, Zhu SL, et al. (2009) An outbreak of hand, foot, and mouth disease associated with subgenotype C4 of human enterovirus 71 in Shandong, China. J Clin Virol 44: 262–267. doi: 10.1016/j.jcv.2009.02.002
- 10. Zhang Y, Zhu Z, Yang W, Ren J, Tan X, et al. (2010) An emerging recombinant human enterovirus 71 responsible for the 2008 outbreak of hand foot and mouth disease in Fuyang city of China. Virol J 7: 94. doi: 10.1186/1743-422x-7-94
- 11. Ho M, Chen ER, Hsu KH, Twu SJ, Chen KT, et al. (1999) An epidemic of enterovirus 71 infection in Taiwan. Taiwan Enterovirus Epidemic Working Group. N Engl J Med 341: 929–935. doi: 10.1056/nejm199909233411301
- 12. Brown BA, Oberste MS, Alexander JP Jr, Kennett ML, Pallansch MA (1999) Molecular epidemiology and evolution of enterovirus 71 strains isolated from 1970 to 1998. J Virol 73: 9969–9975.
- 13. Shimizu H, Utama A, Onnimala N, Li C, Li-Bi Z, et al. (2004) Molecular epidemiology of enterovirus 71 infection in the Western Pacific Region. Pediatr Int 46: 231–235. doi: 10.1046/j.1442-200x.2004.01868.x
- 14. Deshpande JM, Nadkarni SS, Francis PP (2003) Enterovirus 71 isolated from a case of acute flaccid paralysis in India represents a new genotype. Curr Sci 84: 1350–1353.
- 15. Huang YP, Lin TL, Kuo CY, Lin MW, Yao CY, et al. (2008) The circulation of subgenogroups B5 and C5 of enterovirus 71 in Taiwan from 2006 to 2007. Virus Res 137: 206–212. doi: 10.1016/j.virusres.2008.07.015
- 16. Tan X, Huang X, Zhu S, Chen H, Yu Q, et al. (2011) The Persistent Circulation of Enterovirus 71 in People’s Republic of China: Causing Emerging Nationwide Epidemics Since 2008. PLoS ONE 6(9): e25662. doi: 10.1371/journal.pone.0025662
- 17. Oberste MS, Penaranda S, Maher K, Pallansch MA (2004) Complete genome sequences of all members of the species Human enterovirus A. J Gen Virol 85: 1597–1607. doi: 10.1099/vir.0.79789-0
- 18. Chan YF, AbuBakar S (2004) Recombinant human enterovirus 71 in hand, foot and mouth disease patients. Emerg Infect Dis 10: 1468–1470. doi: 10.3201/eid1008.040059
- 19. Yoke-Fun C, AbuBakar S (2006) Phylogenetic evidence for inter-typic recombination in the emergence of human enterovirus 71 subgenotypes. BMC Microbiol 6: 74.
- 20. Huang SC, Hsu YW, Wang HC, Huang SW, Kiang D, et al. (2008) Appearance of intratypic recombination of enterovirus 71 in Taiwan from 2002 to 2005. Virus Res 131: 250–259. doi: 10.1016/j.virusres.2007.10.002
- 21. Christodoulou C, Colbere-Garapin F, Macadam A, Taffs LF, Marsden S, et al. (1990) Mapping of mutations associated with neurovirulence in monkeys infected with Sabin 1 poliovirus revertants selected at high temperature. J Virol 64: 4922–4929.
- 22. McGoldrick A, Macadam AJ, Dunn G, Rowe A, Burlison J, et al. (1995) Role of mutations G-480 and C-6203 in the attenuation phenotype of Sabin type 1 poliovirus. J Virol 69: 7601–7605.
- 23. Arita M, Zhu SL, Yoshida H, Yoneyama T, Miyamura T, et al. (2005) A Sabin 3-derived poliovirus recombinant contained a sequence homologous with indigenous human enterovirus species C in the viral polymerase coding region. J Virol 79: 12650–12657. doi: 10.1128/jvi.79.20.12650-12657.2005
- 24. Yang F, Jin Q, He Y, Li L, Hou Y (2001) The complete genome of Enterovirus 71 China strain. Sci China C Life Sci 44: 178–183. doi: 10.1007/bf02879323
- 25. Zhang Y, Xu S, Wang H, Zhu Z, Ji Y, et al. (2012) Single endemic genotype of measles virus continuously circulating in China for at least 16 years. PLoS One 7: e34401. doi: 10.1371/journal.pone.0034401
- 26. Zhang Y, Wang D, Yan D, Zhu S, Liu J, et al. (2010) Molecular evidence of persistent epidemic and evolution of subgenotype B1 coxsackievirus A16-associated hand, foot, and mouth disease in China. J Clin Microbiol 48: 619–622. doi: 10.1128/jcm.02338-09
- 27. Zhu Z, Abernathy E, Cui A, Zhang Y, Zhou S, et al. (2010) Rubella virus genotypes in the People's Republic of China between 1979 and 2007: a shift in endemic viruses during the 2001 Rubella Epidemic. J Clin Microbiol 48: 1775–1781. doi: 10.1128/jcm.02055-09
- 28. Mizuta K, Abiko C, Murata T, Matsuzaki Y, Itagaki T, et al. (2005) Frequent importation of enterovirus 71 from surrounding countries into the local community of Yamagata, Japan, between 1998 and 2003. J Clin Microbiol 43: 6171–6175. doi: 10.1128/jcm.43.12.6171-6175.2005
- 29. Yu H CW, Chang H, Tang R, Zhao J, Gan L, et al. (2010) Genetic analysis of the VP1 region of enterovirus 71 reveals the emergence of genotype A in central China in 2008. Virus Genes 41: 1–4. doi: 10.1007/s11262-010-0472-9
- 30. Huang SW, Kiang D, Smith DJ, Wang JR (2011) Evolution of re-emergent virus and its impact on enterovirus 71 epidemics. Exp Biol Med (Maywood) 236: 899–908. doi: 10.1258/ebm.2010.010233
- 31. Yip CC, Lau SK, Zhou B, Zhang MX, Tsoi HW, et al. (2010) Emergence of enterovirus 71 "double-recombinant" strains belonging to a novel genotype D originating from southern China: first evidence for combination of intratypic and intertypic recombination events in EV71. Arch Virol 155: 1413–1424.
- 32. Yamayoshi S, Iizuka S, Yamashita T, Minagawa H, Mizuta K, et al. (2012) Human SCARB2-dependent infection by coxsackievirus A7, A14, and A16 and enterovirus 71. J Virol 86: 5686–5696. doi: 10.1128/jvi.00020-12
- 33. Ang LW, Koh BK, Chan KP, Chua LT, James L, et al. (2009) Epidemiology and control of hand, foot and mouth disease in Singapore, 2001–2007. Ann Acad Med Singapore 38: 106–112.
- 34. De W, Changwen K, Wei L, Monagin C, Jin Y, et al. (2011) A large outbreak of hand, foot, and mouth disease caused by EV71 and CAV16 in Guangdong, China, 2009. Arch Virol 156: 945–953. doi: 10.1007/s00705-011-0929-8
- 35. Lee MS, Tseng FC, Wang JR, Chi CY, Chong P, Su IJ (2012) Challenges to licensure of enterovirus 71 vaccines. PLoS Negl Trop Dis 6: e1737. doi: 10.1371/journal.pntd.0001737
- 36. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S (2011) MEGA5: Molecular Evolutionary Genetics Analysis using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony Methods. Molecular Biology and Evolution 28: 2731–2739. doi: 10.1093/molbev/msr121
- 37. Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG (1997) The CLUSTAL X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nuc Acids Res 25: 4876–4882. doi: 10.1093/nar/25.24.4876
- 38. Lole KS, Bollinger RC, Paranjape RS, Gadkari D, Kulkarni SS, et al. (1999) Full-length human immunodeficiency virus type 1 genomes from subtype C-infected seroconverters in India, with evidence of intersubtype recombination. J Virol 73: 152–160.