Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Simplified Large-Scale Sanger Genome Sequencing for Influenza A/H3N2 Virus

  • Hong Kai Lee,

    Affiliations Department of Pathology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Molecular Diagnosis Centre, Department of Laboratory Medicine, National University Hospital, National University Health System, Singapore

  • Julian Wei-Tze Tang,

    Affiliations Alberta Provincial Laboratory for Public Health, University of Alberta Hospital, Edmonton, Alberta, Canada, Department of Medical Microbiology and Immunology, University of Alberta, Edmonton, Alberta, Canada

  • Debra Han-Lin Kong,

    Affiliation Molecular Diagnosis Centre, Department of Laboratory Medicine, National University Hospital, National University Health System, Singapore

  • Evelyn Siew-Chuan Koay

    evelyn_koay@nuhs.edu.sg

    Affiliations Department of Pathology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Molecular Diagnosis Centre, Department of Laboratory Medicine, National University Hospital, National University Health System, Singapore

Abstract

Background

The advent of next-generation sequencing technologies and the resultant lower costs of sequencing have enabled production of massive amounts of data, including the generation of full genome sequences of pathogens. However, the small genome size of the influenza virus arguably justifies the use of the more conventional Sanger sequencing technology which is still currently more readily available in most diagnostic laboratories.

Results

We present a simplified Sanger-based genome sequencing method for sequencing the influenza A/H3N2 virus in a large-scale format. The entire genome sequencing was completed with 19 reverse transcription-polymerase chain reactions (RT-PCRs) and 39 sequencing reactions. This method was tested on 15 native clinical samples and 15 culture isolates, respectively, collected between 2009 and 2011. The 15 native clinical samples registered quantification cycle values ranging from 21.0 to 30.56, which were equivalent to 2.4×103–1.4×106 viral copies/µL of RNA extract. All the PCR-amplified products were sequenced directly without PCR product purification. Notably, high quality sequencing data up to 700 bp were generated for all the samples tested. The completed sequence covered 408,810 nucleotides in total, with 13,627 nucleotides per genome, attaining 100% coding completeness. Of all the bases produced, an average of 89.49% were Phred quality value 40 (QV40) bases (representing an accuracy of circa one miscall for every 10,000 bases) or higher, and an average of 93.46% were QV30 bases (one miscall every 1000 bases) or higher.

Conclusions

This sequencing protocol has been shown to be cost-effective and less labor-intensive in obtaining full influenza genomes. The constant high quality of sequences generated imparts confidence in extending the application of this non-purified amplicon sequencing approach to other gene sequencing assays, with appropriate use of suitably designed primers.

Introduction

In recent years, advances in sequencing techniques have enabled an increasing number of research studies based on the genome-wide sequences of the influenza viruses [1][6], rather than relying solely on an individual gene that may preclude more comprehensive gene signatures [7], [8]. Since the large number of influenza genome sequences deposited by Ghedin et al. [4] and the initiation of the Influenza Genome Sequencing Project in 2005 [9], the deposition of complete human influenza A virus genomes by other groups has increased exponentially.

The genome of the influenza A virus (family Orthomyxoviridae) consists of eight segmented, negative-stranded RNAs, ranging from 890 to 2,341 nucleotides (nt), constituting 13,627 nt per genome. The eight RNA segments encode for (in the order of the segment numbers one to eight): viral RNA polymerase basic 2 (PB2, 2341 nt), polymerase basic 1 (PB1, 2341 nt), polymerase acidic (PA, 2233 nt), hemagglutinin (HA, 1762 nt), nucleoprotein (NP, 1567 nt), neuraminidase (NA, 1466 nt), matrix (M1, 1027 nt), and nonstructural (NS1, 890 nt) protein. Apart from these proteins, alternatively spliced mRNAs of the seventh segment (M1) and the eighth segment (NS1) allow translation of two additional proteins, namely, the ion channel matrix protein (M2) and nuclear export/nonstructural protein (NEP/NS2), respectively. Also, PB1-F2 proteins are alternatively translated from PB1 gene segments of some influenza A viruses [10].

The introduction of next-generation sequencing (NGS), which delivers high throughput readings [11] compared to the traditional Sanger dideoxy chain-termination method [12], has provided a remarkable cost reduction for microbial genome sequencing. However, a higher error rate due to homopolymeric miscalling and other systematic base-calling biases have been observed in NGS techniques, compared with the Sanger methods [13][16]. The average error rate of the former is considerably higher, with a value of 10−2–10−4 versus that of the latter at 10−4–10−5 [13], [14]. A recent report on 12 influenza genomes comparing 2 NGS platforms from 454 Life Sciences and Illumina revealed error rates up to 10−3 and 10−5 at the homopolymeric region, respectively [17]. Besides, the cost of the initial NGS capital equipment outlay, together with the additional bioinformatics manpower support for the storage and analysis of the huge amount of data generated through the NGS system [18] may not be cost-effective for many smaller research laboratories for the sequencing of influenza viruses which have a relatively small genome size (∼14 kb).

The Sanger technique is regarded to be low throughput and more tedious, due to the requirement of multiple purification or plasmid cloning steps [4], [8], [19][23]. Here, we describe a whole genome sequencing method for seasonal influenza A/H3N2, with modifications of the normal sequencing protocol that reduces the number of processing steps, but still constantly produces a high quality sequence read of up to 700 bp. This protocol, when applied systematically, should hasten the routine genome sequencing work for local influenza surveillance studies. It was also demonstrated that this protocol is highly applicable for both clinical samples and Madin-Darby canine kidney- (MDCK-) cultured samples.

Results

Clinical Specimens and Culture Isolates

A total of 30 archived influenza A/H3N2 clinical samples collected from different patients between 2 May 2009-1 Aug 2011 were selected randomly for this study. All samples were received for diagnostic testing at the National University Hospital (NUH) in Singapore and were confirmed positive using two clinically validated, in-house, real-time influenza A/B screening [24] and subtyping assays [25], [26]. The samples included nasal/nasopharyngeal or throat swabs collected in universal transport medium, endotracheal tube aspirates, or sputum samples. Fifteen of the 30 were sequenced from cultured isolates of the original clinical sample using a MDCK.2 (ATCC; CRL-2936) cell line; the other 15 sequences were obtained directly from the clinical samples with no preliminary culture step.

Primer Design

To ensure the utility of the assay for the sequencing of older as well as future circulating strains, two reference gene sequences were randomly chosen per month from depositions from different countries and dates of collection (2007 to 2011) available at the NCBI Influenza Virus Resource. Primer target regions for RT-PCRs for the different gene segments were selected from the conserved regions of the respective aligned gene sequences. Large gene segments (1 to 3) were amplified as three fragments. Small segments (4 to 8) were amplified as two fragments. To achieve tolerance for accurate sequence assembly, the PCR products for each of these segments overlapped with preceding and follow-up segments for at least 39 bp. The 5′ and 3′ ends of each segment were amplified using modified published forward (MBTuni-12) and reverse (MBTuni-13) primers [21], [27]. Sequencing primers were designed within the internal regions of the PCR products. All the sequencing and RT-PCR primers are listed in Tables 1 and 2, respectively.

thumbnail
Table 1. Summary of sequencing primers employed in this study and their respective performance.

https://doi.org/10.1371/journal.pone.0064785.t001

thumbnail
Table 2. PCR primers and second annealing temperatures (TaS) used to amplify the influenza A/H3N2 genome.

https://doi.org/10.1371/journal.pone.0064785.t002

PCR Sensitivity

The 15 RNA samples extracted directly from the clinical samples were of quantification cycle values ranging from 21.0 to 30.56 (equivalent to 2.4×103–1.4×106 viral RNA copies/µL of RNA extract) [24]. All of the gene segments from both the clinical and MDCK-cultured samples collected from 2009–2011 were successfully amplified and appeared as specific and discernible bands on the agarose gel. It was noticed that some gene amplifications additionally produced minor non-specific bands in clinical samples with low viral titers.

Sequencing

All the eight segments from the respective 15 clinical and MDCK-cultured samples were successfully sequenced with high Phred quality value (QV) [28], and sequencing length up to 700 bp (Table 1). Length of read (LOR) for all sequence contigs had base calls of QV20 (representing an accuracy of circa one miscall for every 100 bases) and above for at least 20 continuous bases, which was in accordance to the analyzer machine’s default setting. Sequences with a mixture of nucleotides that contained only a single coverage depth was confirmed with reverse sequencing using PCR primers from the purified amplicon method briefly described in Figure 1. In total, the completed sequences obtained from the 15 cultured isolates and directly from the 15 clinical samples covered 408,810 nucleotides, with 13,627 nucleotides per genome, attaining 100% coding completeness. The entire sequencing protocol produced an average of 1.57 sequencing reads covering each nucleotide. Of all the bases in the assembly, an average of 89.49% were QV40 bases (representing an accuracy of circa one miscall for every 10,000 bases) or higher, and an average of 93.46% were QV30 bases (one miscall every 1000 bases) or higher (Table 1). All the sequences were successfully assembled into their respective segments. The use of the non-purified amplicon method resulted in a very high-quality genome assembly, including samples that had Ct values up to 30. The total sequencing raw data obtained per genome was less than 5 megabytes of data storage. The sequence analyses and assembly for each genome was completed within 15–30 minutes. The sequencing chromatograms generated were uploaded into Trace Archive [trace identifier number: 2333373621–2333374798] to allow visual inspection of the traces and quality scores underlying every nucleotide in each of the thirty genomes [29], [30]. All assembled sequences obtained in this study were uploaded onto NCBI GenBank [accession number: JX437693-JX437932].

thumbnail
Figure 1. Processing times and steps required for plasmid cloning, purified amplicon, and non-purified amplicon methods.

Representative sequencing chromatograms generated from each method are shown. The quality of the raw data obtained from the non-purified amplicon method was comparable with that of the plasmid cloning method. In contrast, the purified amplicon method generated lower quality data in the later portions of the sequence. * Please refer to appropriate references (under the References section).

https://doi.org/10.1371/journal.pone.0064785.g001

Further Testing of Assay Protocol on other Clinical Samples

The genome sequencing and assembling protocols were further tested on 125 additional H3N2 primary clinical samples with Ct values of 30.56 and below. All the 125 samples were collected in NUH as diagnostic samples from 1 May 2009–15 Dec 2012. Of the 125 additional primary clinical samples, 118 were sequenced and assembled completely. In total, 134 out of 140 (96%) primary clinical samples were sequenced successfully in this study with similar Phred quality.

There were seven samples that could not be sequenced completely. More specifically: full PB2, PB1, PA, HA, NP, and NS sequences were not obtainable from 2, 3, 3, 2, 1, 2 of these seven samples, respectively. Of these 13 failures, nine were from two samples with Ct values of 28.72 and 29.04, respectively. The PB1 and PA genes encountered the highest failure rate relative to the others.

Discussion

Traditionally, Sanger sequencing is performed on purified PCR amplicons to prevent background noise generated during sequencing analyses. Here, it was found possible to employ a non-purified amplicon approach for direct sequencing, which minimized processing time and effort for large-scale viral genome sequencing that produced consistently high quality sequencing data. Figure 1 summarizes the comparisons of the steps and amount of time required to perform sequencing using existing methods (plasmid cloning and purified amplicon approaches) and the non-purified amplicon method employed in this study. Direct sequencing on non-purified amplicons using target-specific sequencing primers not only significantly reduced the workload and cost for the entire genome sequencing, but also produced high quality sequencing peaks that were comparable to those generated by the plasmid cloning method (Figure 1). In addition, it will provide a more economical approach to detect viral mixture or quasispecies because unlike the plasmid cloning method [22], it does not require a minimum critical mass in clone selection for sequencing to obtain representative results. In comparison with the purified amplicon method, this non-purified amplicon method produced much higher quality raw data, according to the data produced from this study (Figure 1).

One possible explanation for the success of this simplified approach may be due to minimum loss of the PCR products as a result of the omission of the purification step, in combination with the use of target-specific sequencing primers that were designed discretely from the PCR primers. Unlike the commonly used M13-flanked PCR primers that allow the use of the M13 primer to sequence the PCR product in a more effective way [4], [31], the independent sequencing primers allowed distinctive sequencing amplification of the specific region of the PCR product, without interference from non-specific products and primer-dimers generated during PCR. To minimize the undesirable effects of residual PCR primers during the sequencing reaction, the forward and reverse primers for each PCR were prepared in equimolar amounts, and PCR conditions of up to 50 total PCR cycles were used, to avoid background noise during sequencing analysis. The 4% (v/v) dimethyl sulfoxide (DMSO) used in the sequencing reaction suppressed background noise encountered by sequencing primer NS373R23 during sequencing analysis [32].

Culturing of clinical samples prior to sequencing is a common practice to obtain sufficient viral genetic material for PCR amplification, as well as to avoid contaminants that may inhibit the PCR. However, it is well-recognized that the passaging of viruses in different hosts may induce excessive host-mediated mutations [33], [34] that can inadvertently lead to biased conclusions. Use of the proposed modified protocol allowed successful complete genome sequencing of human influenza A/H3N2 from clinical and MDCK-cultured samples, from samples with viral loads as low as 2,400 viral RNA copies/µL RNA sample. Assay primer designs based on reference sequences collected from different geographical regions from different periods from 2007–2011, and a 96% success rate of the sequencing of 140 clinical samples collected between 2009–2012 showed that this protocol would be widely applicable to a wide range of viruses. However, further testing on A/H3N2 viruses collected prior to 2009 should be performed to check the sensitivity of this full-genome sequencing assay for these earlier viruses.

The two samples that encountered most failures for individual gene segment sequencing could be possibly due to sample degradation or gene reassortment events within these regions. The H3N2 subtyping results were obtained for the purposes of clinical diagnosis earlier, based on specific real-time RT-PCRs targeting HA and MP genes only. The other five samples that had single incomplete gene sequences may possess single point mutation(s) that affected the capability of the assay to amplify those respective gene targets at either the PCR amplification or sequencing stage.

The entire genomic sequencing for the influenza A/H3N2 virus can be completed with a data storage size of approximately 5 megabytes per genome, permitting convenient data handling by biologists or non-bioinformatics expertise for large-scale sequencing for local surveillance purposes. The sequencing cost per genome of the entire protocol from RNA extraction to sequence analysis was calculated to be less than SGD 350 (∼ USD 290), compared to the conventional purified-amplicon method at around SGD 410 (∼USD 340) and plasmid cloning approach at roughly SGD 1360 (∼USD 1120). The high quality data obtained from multiple sequencing reactions targeting different genes (Table 1) suggested the applicability of this technique for other viral (i.e. small genome) gene sequencing work.

Influenza surveillance will continue on a worldwide basis for the foreseeable future, and molecular surveillance for influenza using partial or full-genome sequencing is now becoming routine in many diagnostic laboratories – especially in those which are not set up to perform the traditional serological surveillance for influenza (hemagglutination inhibition and viral micro-neutralization testing). Among the different seasonal human influenza viruses, influenza A/H3N2 has circulated in the human population since its emergence during the 1968 ‘Hong Kong’ pandemic, and has persisted successfully, despite the emergence of the 2009 A/H1N1pdm virus and its subsequent almost virtual replacement of the previously circulating seasonal influenza A/H1N1 [35], [36]. Ongoing antigenic changes in circulating seasonal A/H3N2 viruses continue to trigger new recommendations for seasonal influenza vaccine composition, to optimize vaccine-induced immunity in both the community and healthcare worker populations [37][39]. Thus, ever more efficient and economical methods are required to keep down the costs of molecular surveillance, allowing more laboratories to perform such sequencing routinely, thereby enhancing the quality, temporal and geographical resolution of the local influenza surveillance data available, to keep vaccine manufacturers and public health teams informed [40]. Towards this goal, the simplified sequencing protocol described here has been shown to be effective in obtaining full influenza A/H3N2 genomes at a reasonable price with equipment already available in many diagnostic and research laboratories, suggesting potential use of a similar strategy for studying human influenza A/H1N1pdm viruses.

Methods

Ethics Statement

All research studies involving the use of these clinical samples were reviewed and approved by the local institutional ethics review board (National Healthcare Group: B/09/360 and E/09/341).

Viral RNA Extraction

Viral RNAs were extracted from 200 µL of clinical or cultured samples with either the Qiagen EZ1 Virus mini kit v2.0 or the QIAsymphony Virus/Bacteria mini kit, using their respective proprietary Bio Robot EZ1 and QIAsymphony automated platforms (Qiagen, Valencia, CA), according to the manufacturer’s instructions. All extracted RNAs were eluted into a final volume of 60 µL of elution buffer.

Reverse Transcription Polymerase Chain Reaction

RT-PCRs were performed with a Superscript III one-step RT-PCR system with Platinum Taq high-fidelity polymerase (Invitrogen, Carlsbad, CA). Nineteen RT-PCRs were set up for whole genome amplification. All RT-PCRs were prepared manually in 10 µL of reaction volume, consisting of 5 µL of 2× Reaction Mix, equimolar amounts of forward and reverse primers (0.3 µmol/L each), 0.25 µL of enzyme mix, and 2.5 µL of extracted RNA sample. The remaining volume was topped up with RNase-free water. All RT-PCRs were performed using either the ABI 9700 thermal cycler (Applied Biosystems, CA, USA) or the Biometra T3000 thermocycler (Biometra GmbH, Goettingen, Germany). The cycling conditions were 30 min at 42°C (RT); 2.5 min at 95°C (inactivation of RT enzyme and activation of Taq enzyme); 5 cycles of 30 s at 95°C (denaturation), 30 s at 47°C (annealing), and 1.25 min at 68°C (extension); 45 cycles of 30 s at 95°C, 30 s at the respective second annealing temperature (Ta), and 1.25 min at 68°C; followed by a hold for 10 min at 68°C (final extension). The second Ta for each RT-PCR is summarized in Table 2.

Sequencing

Sequencing reactions were performed directly on non-purified amplicons, using BigDye Terminator v3.1 chemistry (Applied Biosystems). The 10 µL sequencing reaction is composed of 1.5 µL of 5× Buffer, 0.5 µmol/L of respective sequencing primer (Table 1), 1 µL of BigDye enzyme mix, and 1.25 µL of template amplicons. One microliter of 4% DMSO was added into the sequencing reaction together with primer NS373R23 [29]. Large-scale sequencing reactions were carried out on a 96-well plate and purified directly using the BigDyeXTerminator purification kit (Applied Biosystems). Individual sequencing reactions were performed in PCR tubes and purified using the DyeEx 2.0 spin kit (Qiagen). Purified sequencing products were analyzed on the ABI 3130×l genetic analyzer (Applied Biosystems) using the BDx_stdSeq50_POP7_1 run module. Sequencing peak heights were adjusted with the sample injection time ranging from 3–5 seconds.

Contig Assembly

All sequences were assembled and verified using the ATF software, version 1.0.2.41 (Connexio Genomics, Perth, Australia), using the reference sequence influenza A/Nanjing/1/2009(H3N2) for all segments (GenBank accession: GU907114-GU907117 and GU907119-GU907121), except for the PB1 segment which used influenza A/Sendai-H/F193/2007(H3N2) (GenBank accession: AB441948) as the reference sequence. The primer sequences were subtracted from the data during contig assembly. The multiple A’s observed at the 3′end of the NA, NP, and PA genes were checked carefully by visualization of the sequencing chromatograms.

Acknowledgments

We thank Donald Chiang from Microbiology Division, Department of Laboratory Medicine, National University Hospital, Singapore for helping with the MDCK-culture work.

Author Contributions

Conceived and designed the experiments: HKL JWT ESK. Performed the experiments: HKL DHK. Analyzed the data: HKL DHK. Contributed reagents/materials/analysis tools: ESK. Wrote the paper: HKL JWT DHK ESK.

References

  1. 1. Baillie GJ, Galiano M, Agapow PM, Myers R, Chiam R, et al. (2012) Evolutionary dynamics of local pandemic H1N1/2009 influenza virus lineages revealed by whole-genome analysis. J Virol 86: 11–18.
  2. 2. Chen R, Holmes EC (2008) The evolutionary dynamics of human influenza B virus. J Mol Evol 66: 655–663.
  3. 3. Fan X, Zhu H, Zhou B, Smith DK, Chen X, et al. (2012) Emergence and dissemination of a swine H3N2 reassortant influenza virus with 2009 pandemic H1N1 genes in pigs in China. J Virol 86: 2375–2378.
  4. 4. Ghedin E, Sengamalay NA, Shumway M, Zaborsky J, Feldblyum T, et al. (2005) Large-scale sequencing of human influenza reveals the dynamic nature of viral genome evolution. Nature 437: 1162–1166.
  5. 5. Seyer R, Hrincius ER, Ritzel D, Abt M, Mellmann A, et al. (2012) Synergistic adaptive mutations in the hemagglutinin and polymerase acidic protein lead to increased virulence of pandemic 2009 H1N1 influenza A virus in mice. J Infect Dis 205: 262–271.
  6. 6. Simonsen L, Viboud C, Grenfell BT, Dushoff J, Jennings L, et al. (2007) The genesis and spread of reassortment human influenza A/H3N2 viruses conferring adamantane resistance. Mol Biol Evol 24: 1811–1820.
  7. 7. Narasaraju T, Sim MK, Ng HH, Phoon MC, Shanker N, et al. (2009) Adaptation of human influenza H3N2 virus in a mouse pneumonitis model: insights into viral virulence, tissue tropism and host pathogenesis. Microbes Infect 11: 2–11.
  8. 8. Piralla A, Pariani E, Rovida F, Campanini G, Muzzi A, et al. (2011) Segregation of virulent influenza A(H1N1) variants in the lower respiratory tract of critically ill patients during the 2010–2011 seasonal epidemic. PLoS One 6: e28332.
  9. 9. Tsai KN, Chen GW (2011) Influenza genome diversity and evolution. Microbes Infect 13: 479–488.
  10. 10. Chen GW, Yang CC, Tsao KC, Huang CG, Lee LA, et al. (2004) Influenza A virus PB1-F2 gene in recent Taiwanese isolates. Emerg Infect Dis 10: 630–636.
  11. 11. Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, et al. (2005) Genome sequencing in microfabricated high-density picolitre reactors. Nature 437: 376–380.
  12. 12. Sanger F, Nicklen S, Coulson AR (1977) DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A 74: 5463–5467.
  13. 13. Kircher M, Kelso J (2010) High-throughput DNA sequencing–concepts and limitations. Bioessays 32: 524–536.
  14. 14. Loman NJ, Misra RV, Dallman TJ, Constantinidou C, Gharbia SE, et al. (2012) Performance comparison of benchtop high-throughput sequencing platforms. Nat Biotechnol 30: 434–439.
  15. 15. Luo C, Tsementzi D, Kyrpides N, Read T, Konstantinidis KT (2012) Direct comparisons of Illumina vs. Roche 454 sequencing technologies on the same microbial community DNA sample. PLoS One 7: e30087.
  16. 16. Suzuki S, Ono N, Furusawa C, Ying BW, Yomo T (2011) Comparison of sequence reads obtained from three next-generation sequencing platforms. PLoS One 6: e19534.
  17. 17. Archer J, Baillie G, Watson SJ, Kellam P, Rambaut A, et al. (2012) Analysis of high-depth sequence data for studying viral diversity: a comparison of next generation sequencing platforms using Segminator II. BMC bioinformatics 13: 47.
  18. 18. Barzon L, Lavezzo E, Militello V, Toppo S, Palu G (2011) Applications of next-generation sequencing technologies to diagnostic virology. Int J Mol Sci 12: 7861–7884.
  19. 19. Bell J (2008) A simple way to treat PCR products prior to sequencing using ExoSAP-IT. BioTechniques 44: 834.
  20. 20. Dugan KA, Lawrence HS, Hares DR, Fisher CL, Budowle B (2002) An improved method for post-PCR purification for mtDNA sequence analysis. J Forensic Sci 47: 811–818.
  21. 21. Chan CH, Lin KL, Chan Y, Wang YL, Chi YT, et al. (2006) Amplification of the entire genome of influenza A virus H1N1 and H3N2 subtypes by reverse-transcription polymerase chain reaction. J Virol Methods 136: 38–43.
  22. 22. Djikeng A, Halpin R, Kuzmickas R, Depasse J, Feldblyum J, et al. (2008) Viral genome sequencing by random priming methods. BMC Genomics 9: 5.
  23. 23. Roy T, Agrawal AS, Mukherjee A, Mishra AC, Chadha MS, et al. (2011) Surveillance and molecular characterization of human influenza B viruses during 2006–2010 revealed co-circulation of Yamagata-like and Victoria-like strains in eastern India. Infect Genet Evol 11: 1595–1601.
  24. 24. Lee HK, Loh TP, Lee CK, Tang JW, Chiu L, et al. (2012) A universal influenza A and B duplex real-time RT-PCR assay. J Med Virol 84: 1646–1651.
  25. 25. Krafft AE, Russell KL, Hawksworth AW, McCall S, Irvine M, et al. (2005) Evaluation of PCR testing of ethanol-fixed nasal swab specimens as an augmented surveillance strategy for influenza virus and adenovirus identification. J Clin Microbiol 43: 1768–1775.
  26. 26. Lee HK, Lee CK, Loh TP, Tang JW, Chiu L, et al. (2010) Diagnostic testing for pandemic influenza in Singapore: a novel dual-gene quantitative real-time RT-PCR for the detection of influenza A/H1N1/2009. J Mol Diagn 12: 636–643.
  27. 27. Zhou B, Donnelly ME, Scholes DT, St George K, Hatta M, et al. (2009) Single-reaction genomic amplification accelerates sequencing and vaccine production for classical and Swine origin human influenza a viruses. J Virol 83: 10309–10313.
  28. 28. Ewing B, Green P (1998) Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res 8: 186–194.
  29. 29. Salzberg SL, Church D, DiCuccio M, Yaschenko E, Ostell J (2004) The genome Assembly Archive: a new public resource. PLoS Biol 2: E285.
  30. 30. Sayers EW, Barrett T, Benson DA, Bolton E, Bryant SH, et al. (2012) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 40: D13–25.
  31. 31. Alvarez AC, Brunck ME, Boyd V, Lai R, Virtue E, et al. (2008) A broad spectrum, one-step reverse-transcription PCR amplification of the neuraminidase gene from multiple subtypes of influenza A virus. Virol J 5: 77.
  32. 32. Choi JS, Kim JS, Joe CO, Kim S, Ha KS, et al. (1999) Improved cycle sequencing of GC-rich DNA template. Exp Mol Med 31: 20–24.
  33. 33. Bush RM, Smith CB, Cox NJ, Fitch WM (2000) Effects of passage history and sampling bias on phylogenetic reconstruction of human influenza A evolution. Proc Natl Acad Sci U S A 97: 6974–6980.
  34. 34. Deom CM, Caton AJ, Schulze IT (1986) Host cell-mediated selection of a mutant influenza A virus that has lost a complex oligosaccharide from the tip of the hemagglutinin. Proc Natl Acad Sci U S A 83: 3771–3775.
  35. 35. Palese P, Wang TT (2011) Why do influenza virus subtypes die out? A hypothesis. MBio 2.
  36. 36. Pica N, Hai R, Krammer F, Wang TT, Maamary J, et al. (2012) Hemagglutinin stalk antibodies elicited by the 2009 pandemic influenza virus as a mechanism for the extinction of seasonal H1N1 viruses. Proc Natl Acad Sci U S A 109: 2573–2578.
  37. 37. WHO (2007) Recommended composition of influenza virus vaccines for use in the 2008 influenza season. Wkly Epidemiol Rec 82: 351–356.
  38. 38. WHO (2009) Recommended composition of influenza virus vaccines for use in the 2010 influenza season (southern hemisphere winter). Wkly Epidemiol Rec 84: 421–431.
  39. 39. WHO (2012) Recommended composition of influenza virus vaccines for use in the 2012–2013 northern hemisphere influenza season. Wkly Epidemiol Rec 87: 83–95.
  40. 40. McHardy AC, Adams B (2009) The role of genomics in tracking the evolution of influenza A virus. PLoS Pathog 5: e1000566.