Deep Sequencing for Evaluation of Genetic Stability of Influenza A/California/07/2009 (H1N1) Vaccine Viruses

Virus growth during influenza vaccine manufacture can lead to mutations that alter antigenic properties of the virus, and thus may affect protective potency of the vaccine. Different reassortants of pandemic "swine" H1N1 influenza A vaccine (121XP, X-179A and X-181) viruses as well as wild type A/California/07/2009(H1N1) and A/PR/8/34 strains were propagated in embryonated eggs and used for DNA/RNA Illumina HiSeq and MiSeq sequencing. The RNA sequences of these viruses published in NCBI were used as references for alignment of the sequencing reads generated in this study. Consensus sequences of these viruses differed from the NCBI-deposited sequences at several nucleotides. 121XP stock derived by reverse genetics was more heterogeneous than X-179A and X-181 stocks prepared by conventional reassortant technology. Passaged 121XP virus contained four non-synonymous mutations in the HA gene. One of these mutations (Lys226Glu) was located in the Ca antigenic site of HA (present in 18% of the population). Two non-synonymous mutations were present in HA of viruses derived from X-179A: Pro314Gln (18%) and Asn146Asp (78%). The latter mutation located in the Sa antigenic site was also detected at a low level (11%) in the wild-type A/California/07/2009(H1N1) virus, and was present as a complete substitution in X-181 viruses derived from X-179A virus. In the passaged X-181 viruses, two mutations emerged in HA: a silent mutation A1398G (31%) in one batch and G756T (Glu252Asp, 47%) in another batch. The latter mutation was located in the conservative region of the antigenic site Ca. The protocol for RNA sequencing was found to be robust, reproducible, and suitable for monitoring genetic consistency of influenza vaccine seed stocks.


Introduction
The influenza A/California/07/2009 (H1N1) virus emerged in Mexico, and was identified in the United States at the beginning of 2009. It caused outbreaks of respiratory illness in Mexico and in the United States, [1] and was shown to be genetically related to swine influenza viruses monitor genetic stability of Oral Polio Vaccines (OPV) [24], and could replace the WHO-recommended MAPREC assay for lot release of OPV. In this study, Illumina sequencing technology was used for screening mutations in viruses of A/California/07/2009 (H1N1) vaccine strains passaged in eggs. We found that this approach could be used to identify and to accurately quantify small amounts of mutant viruses present in preparations of inactivated influenza vaccines. We also found that sequence heterogeneity of seed viruses depended on the methods used for their preparation. The ability to quantify potentially undesirable mutations in vaccine batches makes this method suitable for quality control to ensure manufacture of safe and effective vaccines.

Materials and Methods
Viruses and reference sequences Viruses X-179A, X-181 and 121XP were passaged in eggs under similar conditions to those of the manufacturers [25,26], and A/Puerto Rico/08/34 (H1N1) reference and A/California/ 07/2009 (H1N1) pandemic virus stocks (Table 1), obtained from the strain collection at the Center for Biologics Evaluation and Research, US Food and Drug Administration were grown in eggs. All these viruses (Table 1) were propagated in 10 days old embryonated hen's eggs by inoculation of 0.2 ml of diluted virus stock containing about 10 4 pfu at 33°C. Allantoic fluid was harvested at 72 hours post infection and clarified by spinning at 4000 rpm for 10 min at 4°C. The clarified viruses were saved at -80°C.

Amplification of the entire influenza virus genome
Total RNA was isolated from 140 μl of virus suspension using QIAamp Viral RNA Mini Kit (Qiagen, Valencia, CA) according to the manufacturer's protocol. The RNA was eluted in a  final volume of 60 μl of sterile RNase-free water. cDNA was synthesized using Superscript III Reverse Transcriptase (SSIII, Invitrogen, Carlsbad, CA) with Universal_F primer: 5' GACTAA TACGACTCACTATAGGGAGCAAAAGCAGG 3'. cDNA synthesis was performed in a reaction containing 10 μl of isolated viral RNA, 1.6 μM of Universal_F primer, 600 units of SSIII, 0.5 mM dNTPs, 50 mM DTT, and first strand buffer in a total volume of 50 μl. The reaction mixture was incubated at 55°C for two hours, followed by SSIII inactivation at 70°C for 20 min. Two different PCR protocols were used to simultaneously amplify all genome segments of influenza virus.
Phusion protocol: Four microliters of viral cDNA was used as a template for PCR amplification in a total reaction volume of 50 μl, containing 0.2 μM of each universal primers: Univer-sal_F (see above) and Universal_R: 5'GACATTTAGGTGACACTATAGAAGTAGAAACAAGG 3', 200 μM of dNTP, 3 mM MgCl 2 , and 1 unit of Phusion DNA polymerase (New England BioLabs, Ipswich, MA). PCR cycling conditions were as follows: an initial denaturation at 98°C for 30 sec followed by 35 cycles of 98°C for 10 sec, 55°C for 30 sec and 72°C for 5 min, and the final elongation at 72°C for 10 min.
Extensor protocol: Four microliters of viral cDNA was used as a template for PCR amplification in a total reaction volume of 50 μl containing 0.4 μM of each universal primer (Univer-sal_F and Universal_R), 500 μM of dNTP, 5 μl of 10x Extensor Buffer 2, and 2.5 units of Extensor PCR Enzyme Mix (5U/μl) (Dharmacon, Lafayette, CO). PCR cycling conditions were as follows: an initial denaturation at 94°C for 2 min followed by 35 cycles of 94°C for 10 sec, 60°C for 30 sec and 68°C for 5 min, and the final elongation at 68°C for 10 min.
The resulting eight DNA amplicons representing all segments of viral genome were separated by electrophoresis in 1% agarose gels with ethidium bromide (Lonza, Rockland, ME) and visualized using the Kodak Gel Logic 200 Imaging System and Kodak Molecular Imaging Software (Carestream Health, Inc. Rochester, NY).

Preparation of samples for Illumina sequencing
The PCR product was purified by QIAquick PCR Purification Kit (Qiagen, Valencia, CA) and fragmented by focused ultrasonicator (Covaris) to generate the optimal fragment sizes of 350-400 bp suitable for Illumina sequencing.
The DNA fragments were analyzed for DNA size by the 2100 Bioanalyzer system using a high-sensitivity kit (Agilent Technologies, Inc., Santa Clara, CA). Next, the fragmented DNAs were used for library preparation with NEBNext 1 DNA Sample Prep Reagent Set 1 (New England BioLabs, Ipswich, MA). The primers containing multiplexing index adapters (barcodes) to identify sequencing reads belonging to each sample were synthesized according to the specifications provided by Illumina Inc. for multiplex pair-end sequencing. Deep sequencing was performed at Macrogen (Seoul, Korea) using HiSeq2000 (Illumina) producing 101 bplong paired-end reads, or at our laboratory using MiSeq (Illumina) producing 100 to 300 bplong paired-end reads. NEBNext mRNA Sample Prep Master Mix Set 1 (New England BioLabs, Ipswich, MA) was used for preparation of Illumina sequencing libraries from total RNA contained in the sample (whole-RNA library). Briefly, 0.5 μg total RNA (extracted as mentioned above) was used for fragmentation by focused ultrasonicator (Covaris) to generate the optimal fragment sizes of 200-300 bp suitable for RNA Illumina sequencing. The fragmented RNA was used for cDNA synthesis using ProtoScript II Reverse Transcriptase and random primers. The cDNA was further converted into double stranded cDNA with end-repair procedure (Klenow fragment, T4 polynucleotide kinase and T4 polymerase), and ligated to Illumina paired end (PE) adaptors.
Size selection was performed using 2% agarose gel electrophoresis, generating cDNA libraries ranging from 275 to 325 bp in size. Finally, the libraries were expanded using 15 cycles of PCR with multiplex indexed primers and purified with magnetic beads using Agencourt Ampure Beads (Beckman Coulter).
The RNA sequences were used as references for alignment of the viral sequence reads generated in this study. First, sequencing reads with low quality (phred) scores were removed from the data set. The remaining sequences aligned with reference influenza virus sequences using custom software that is implemented in a highly integrated virtual environment (HIVE) computer cluster (https://hive.biochemistry.gwu.edu/dna.cgi?cmd=main).
To eliminate the false mutations, which could be an artifact produced during PCR amplification or sequencing procedures, the bias mutations were identified by Shannon entropy as a measure of randomness of distribution of mutations along individual sequencing reads [27]. Generally, a bias of mutation is considered significant if its entropy value is lower than one sigma (0.67). Also, only sequencing reads that have more than 75% of their expected size were aligned to the reference sequences, to eliminate nonspecific reads.
Finally, aligned sequencing reads were used to compute single-nucleotide polymorphism (SNP) profiles, read depth and the entropy for the entire viral genomes.

Robustness of simultaneous influenza A genome segment amplification
To amplify all segments in one reaction, the reverse transcriptase Superscript III was used for cDNA synthesis, and two different PCR amplification protocols were used for DNA amplification.
The agarose gel electrophoresis results showed that all eight segments were amplified efficiently from passaged A/California/07/2009 (H1N1) pandemic vaccine viruses (The used viruses are presented in Table 1) by using Phusion DNA polymerase (New England BioLabs, Ipswich, MA) ( Fig 1A). In contrast, use of Thermo Scientific Extensor Mix resulted in nonspecific bands and poor amplification of some segments (Fig 1B). To determine which of the two PCR protocols ensured unbiased amplification of all segments of influenza virus genome, the amplified samples (Fig 1A and 1B) were subjected to Illumina HiSeq sequencing. The percentage of sequencing reads obtained for each of the segments is presented in Fig 1C, along with the whole-RNA library that was run by HiSeq that was prepared as described in section 2.3 of the materials and methods paragraph. This result demonstrated that adequate distribution of sequencing reads between segments were obtained from a DNA library prepared from samples amplified by Phusion DNA polymerase (DNA library), and by whole-RNA library.

Consistency of Illumina sequencing, and sequencing analysis of A/PR/8/ 34 and A/California/07/2009 strains
Center for Biologics Evaluation and Research (CBER) stock of A/PR/8/34 was kindly provided by Dr. Peter Palese at Mount. Sinai School of Medicine and was passaged in CBER to produce CBER stock, this virus was sequenced five times by MiSeq and HiSeq using whole-RNA library and DNA library approaches. The results of sequencing the RNA library showed high levels of reproducibility. There was a slight difference between MiSeq and HiSeq results only for mutations present at low levels (around 5%, Table 2). In contrast, the five repeats of DNA library sequencing were less consistent and more heterogeneous. (S1 Table).
Most of A/PR/8/34 segments contained several complete mutations in comparison to their published sequences, and most of these mutations were silent. This virus was less heterogeneous, as few low substitutions were observed along the genome.
Another A/PR/8/34 virus stock was provided by National Institute for Biological Standards and Control (NIBSC) and was propagated 13 times in eggs and then was grown (passage 14) separately in two different eggs, and the resulting stocks were designated as A/PR/8/34-1 and A/PR/8/34-3 ( Table 1). Both of them were used for whole-RNA library preparation, followed by HiSeq sequencing. The results in Table 3 showed that these two viruses shared almost all mutations, and the percentages of mutations were slightly higher in the A/PR/8/34-1 than in the A/PR/8/34-3 virus. They were more heterogeneous than CBER A/PR/8/34 virus stock, and all three A/PR/8/34 viruses shared several complete mutations along their genomes (Tables 2  and 3). They also shared one low-level mutation C 1094 A (Thr 350 Lys) in the NP gene which was present in about 20% of the RNA sequences in all three A/PR/8/34 viruses.  A/California/07/2009 (H1N1) strain was sequenced by HiSeq using the whole-RNA library approach, and the summary of the results is presented in Table 4. No complete substitutions were observed, and the highest percentage of mutants observed was 66% for mutation A 57 G (Glu 18 Gly) in the PA segment. No mutations were observed in the NA segment. Three lowlevel non-synonymous mutations were found in HA: A 456 G (Asn 146 Asp, 11%), A 736 G (Asp 239 Gly, 16%) and G 739 A (Arg 240 Gln, 16%). The Asn 146 Asp mutation is located in the Sa antigenic site.

Deep sequencing analysis of 121XP virus passaged in eggs
The 121XP virus was generated by the reverse genetics method followed by serial passages in chicken eggs; this virus contains the HA and NA genes from A/California/07/2009 (H1N1) and all other genes from A/PR/8/34 (H1N1) [26].
To evaluate its genetic stability, the 121XP virus was passaged ten times in eggs under manufacture conditions (Table 1). Mutations revealed by deep sequencing in HA and NA genes of the virus are summarized in Table 5. Several non-synonymous mutations emerged in NA and HA, including the mutations Asn 16 Ser and Lys 226 Glu that were present at low levels (36% and 18% of mutants, respectively). The latter mutation is located in the Ca antigenic site in HA. Mutation Lys 136 Asn, located close to the antigenic site Sa, emerged and accumulated to about 78% of the population.
Ten mutations were found in the NA gene, and eight of them led to amino acid changes. Interestingly, three of them were found in codon 88. Sequencing of other genomic segments revealed several heterogeneities in most of them, with some mutations leading to amino acid changes. There was a good agreement between results obtained by sequencing of the whole-RNA library by MiSeq and HiSeq, as well as the DNA library by HiSeq. For more details, see S2 to S5 Tables.

Deep sequencing analysis of X-179A viruses passaged in eggs
The X-179A virus is a reassortant prepared by New York Medical College (NYMC) using classical reassortant methodology from A/California/7/2009 (H1N1) virus and NYMC X-157 virus, with the HA, NA and PB1 genes originating from A/California/7/2009 (H1N1), and the other internal genes from A/PR/8/34 (H1N1) [26]. This virus was passaged several times and adapted to grow in eggs. Viruses derived by passaging of X-179A virus in chicken eggs under manufacture conditions (Table 1) were similarly analyzed by deep sequencing. The results of deep sequencing of HA and NA genes of these viruses are summarized in Table 6. In sample X-179A-M5, two non-synonymous mutations emerged in HA: Pro 314 Gln (17%) and Asn 146 Asp (78%). The latter mutation located in the Sa antigenic site was present as low level mutation in A/California/07/2009 (H1N1). The confirmation of the presence of this mutation by Sanger In general, samples derived from X-179A virus were heterogeneous and contained several complete substitutions compared to its published sequence. HiSeq sequencing of DNA libraries prepared from X-179A viruses revealed more heterogeneity than HiSeq/MiSeq sequencing of RNA libraries prepared from the same samples. Mutations found in other genomic segments are presented in S5 to S7 Tables.

Deep sequencing analysis of X-181 viruses passaged in eggs
Similar to X-179A virus, X-181 virus was developed by New York Medical College using conventional reassortment technology. It has a 5:3 gene constellation with three genes (PB1, HA  Table 7. Two mutations were found in HA genes: A 1398 G (silent, 31%) in the X-181-M3 virus and G 756 T (Glu 252 Asp, 47%) in the X-181-M4 virus. The latter mutation is located in the conserved region of the antigenic site Ca. One nonsynonymous mutation, Thr 466 Ala, was present at a low level (8%) in the NA gene of X-181-M2. Interestingly, the mutation Asn 146 Asp is located in the Sa antigenic site and exists as a complete substitution in X-181 virus which is derived from X-179A virus. As mentioned above, this mutation was observed in X-179A-M5 at a high level (78%) and at a low level (11%) in the wild-type A/California/07/2009 (H1N1) virus.
Viruses derived by passaging of X-181 in chicken eggs were heterogeneous and contained several complete mutations in PB2, NP and NS segments (S8 to S10 Tables).
As for X-179A viruses, HiSeq sequencing of DNA libraries prepared from X-181 viruses revealed more heterogeneity than HiSeq/MiSeq sequencing of whole-RNA libraries prepared from the same samples. Mutations found in other genomic segments are presented in S8 to S10 Tables.

Discussion
Rapid emergence of mutations is an inherent property of RNA viruses, leading to genetic heterogeneity of viral populations [8]. This allows viruses to rapidly adapt to changing growth conditions by selecting from the preexisting variants with higher fitness [28]. For some vaccines, virus growth in cell cultures during vaccine manufacture leads to reversion of attenuated Table 6. Mutation percentages (5%) emerged in HA and NA of X-179A viruses passaged in eggs. Notes: The percentages obtained from RNA library followed by HiSeq sequencing are presented without parentheses or brackets. The percentages obtained from RNA library followed by MiSeq sequencing are presented in parentheses. The percentages obtained from DNA library followed by HiSeq sequencing are presented in brackets. These mutation percentages were calculated per comparison with the corresponding expanded progenitor. (See accession numbers in Materials and Methods.) aa: Amino-acid, nt: Nucleotide. phenotype to virulence; therefore, their content must be monitored as a part of vaccine quality control [13]. Mutations emerging during virus growth may also alter the antigenic properties, and as a consequence affect protective potency of live and inactivated vaccines. Antigenic properties of influenza A are notoriously prone to rapid changes. Hemagglutinin (HA) and neuraminidase (NA) are the two most important influenza virus envelope proteins that elicit immune response, and therefore are under immunological pressure that leads to their continuous change. The antigenic drift reflects the accumulation of amino acid substitutions in the globular domain of HA, the principal target of neutralizing antibodies, as well as in NA [29,30]. The primary function of HA is the attachment to host cell receptors and mediation of viral entry [31,32]. The NA is involved in progeny virion release and spread of the virus, and also may have a role in HA-mediated membrane fusion [33,34]. Adaptation of viruses to replication in new cell types (e.g. growth in cell substrates, such as embryonated eggs or mammalian cells used for vaccine manufacture) involves changes in HA. Since the receptor-binding region on the HA molecule overlaps with the major antigenic site, the adaptation may also be accompanied by changes in antigenic properties [33,35,36]. The larger number of eggs used during manufacture process would decrease the level (percentage) of each individual mutant, but could nevertheless increase the overall heterogeneity of virus population. Cumulatively, they could lead to a similar loss of antigenic properties as one mutation present at a higher level. Therefore, it may be important to monitor the changes that take place in major protective epitopes of the seed virus stocks to avoid losses in protective efficacy of the vaccine. Not all mutations that emerge in viruses are harmful to vaccine properties. Mutations that consistently accumulate have higher fitness and could increase virus yields and vaccine potency. Therefore, studying mutations emerging in vaccine strains could be used for both quality control, and for optimization of the genetic composition of vaccine strains.
Seed viruses used in influenza vaccine manufacture are produced either by reassortment technology that uses co-cultivation of a high-growth reference strain with the current virus isolate leading to exchange of genomic segments [37], or by reverse genetics based on transfection of susceptible cells with a combination of cDNA plasmids coding for the appropriate segments of the virus [38]. Both approaches have their strengths and weaknesses, and can potentially lead to the emergence of mutants. One of the purposes of this study was to determine which of them produces more homogeneous virus stocks with a lower number of mutants. We have used three pandemic influenza A/California/07/2009 (H1N1) virus vaccine strains and virus stocks produced from these viruses by passaging them on eggs. X-179A was prepared at the New York Medical College on eggs as a reassortant between A/California/07/2009 and A/PR/ 8/34. After reports of a low yield of production virus from X-179A a second reassortant, X-181, Table 7. Mutation percentages (5%) emerged in HA and NA of X-181 viruses passaged in eggs. Notes: The percentages obtained from RNA library followed by HiSeq sequencing are presented without parentheses or brackets. The percentages obtained from RNA library followed by MiSeq sequencing are presented in parentheses. The percentages obtained from DNA library followed by HiSeq sequencing are presented in brackets. These mutation percentages were calculated per comparison with the corresponding expanded progenitor. (See accession numbers in Materials and Methods.) aa: Amino-acid, nt: Nucleotide.  [26].
To reveal sequence heterogeneity in viral stocks we have used a deep sequencing approach that was recently shown to be an effective tool for quantifying all mutants in viral populations [24]. Mutations that were identified by deep sequencing and were present at a significant level were also confirmed by the analysis of Sanger sequencing electrophoregrams that revealed sequence heterogeneities. Amplification of the entire genome of influenza viruses presents a challenge because it consists of eight genomic segments of different lengths and sequences. Previously, we described PCR conditions that allowed us to amplify all genomic segments of influenza A virus in one reaction [27]. In this work, the PCR amplification was optimized to amplify all influenza segments simultaneously, efficiently and equally, which resulted in an adequate distribution of sequencing reads between all segments (Fig 1A and 1C). We have also used whole-RNA libraries prepared without specific amplification of viral cDNA. Illumina deep-sequencing of viral cDNA libraries and whole-RNA libraries were used to determine quantitative profiles of mutations along the entire genome of viruses of influenza A/California/ 07/2009 (H1N1) vaccines.
The deep sequencing results revealed several heterogeneities in most genomic segments, and several mutations led to amino acid changes. No big differences were observed between the results obtained by sequencing of whole-RNA libraries using MiSeq and HiSeq, and HiSeq sequencing of DNA libraries (S2 to S10 Tables). Deep sequencing of whole-RNA libraries was found to be more reproducible than sequencing of DNA libraries (Table 2). But the HiSeq sequencing of DNA libraries were less consistent and produced more heterogeneous sequences (S1 Table). This may be due to errors introduced by PCR amplification and non-specific alignment of primers.
Deep sequencing of several stocks of high-growth reference strain A/PR/8/34 revealed also some mutations and sequence heterogeneities. The CBER master stock of this strain contained several complete mutations in comparison to the published sequence, and most of these mutations were silent. There were also low levels of additional mutations. This indicates that this A/ PR/8/34 virus is genetically stable, and the use of this virus as backbone for development of influenza vaccine may improve the vaccine production.
Another A/PR/8/34 virus stock was passaged 13 times on eggs and was grown (passage 14) separately in two different embryonated chicken eggs, the harvested 2 viruses (designated as A/ PR/8/34-1 and A/PR/8/34-3) were deep-sequenced using HiSeq protocol. These two viruses shared most sequence heterogeneities, but the percentages of mutants were slightly different (Table 3). Therefore, virus growth in eggs introduced some additional sequence heterogeneity, but caused no complete nucleotide substitutions.
The parental wild type A/California/07/2009 (H1N1) virus was sequenced by HiSeq using the whole RNA library (Table 4). Its consensus matched the published nucleotide sequence. There was some sequence heterogeneity observed at select positions, reaching up to 66% (Glu 18 Gly) of the viral population, this mutation is very rare and nothing known about its biological function.
The Deep sequencing of the X-179A passaged viruses revealed several mutations in HA and NA. Two non-synonymous mutations in HA (Pro 314 Gln present at 17% and Asn 146 Asp present at 78% of the population) were identified in X-179A-M5 virus. Interestingly, the latter is located in the antigenic site Sa; it was detected at a low level (11%) in A/California/07/2009 (H1N1) strain and was present as a complete substitution in the X-181 virus stock derived from the X-179A virus (Fig 2), suggesting that it may be involved in the adaptation of X-181 virus. Virus X-179A-M2 contained Lys 328 Thr mutation at a low level (9%). Viruses derived from X-179A were heterogeneous and contained some complete nucleotide substitutions in comparison to their published sequences in PB2, PB1, NP, and in NS segments (see S5 to S7 Tables).
The X-181 virus was developed from the X-179A seed lot by another round of reassortment, and also is subjected to several passages in eggs (Table 1). Deep sequencing results showed that the G 756 T (Glu 252 Asp, present at 47%) mutation emerged in HA of the X181-M4 virus, and it is located in the conserved region of the antigenic site Ca.
Unlike the two previous viruses, 121XP was developed by reverse genetics [26]. Deep sequencing of virus derived from it showed several non-synonymous mutations in NA and HA. Mutations leading to Asn 16 Ser and Lys 226 Glu amino acid substitutions emerged at relatively low levels (36% and 18% of mutants, respectively) in HA. The latter mutation was located in the Ca antigenic site in HA, next to codon 225 that was shown to participate in the modulation of HA receptor avidity/specificity [39]. In addition, mutation of residue 225 was found to enable H3 viruses to switch from avian to human specificity [40,41]. Mutation Lys 136 Asn was present at a high level (78%). It is located close to the antigenic site Sa in close proximity within the sialic acidbinding pocket, suggesting its functional importance [42]. Interestingly, in the NA segment, three mutations occurred in codon 88, suggesting that this codon represents a hot spot. Also, the results of passages 10 showed that the overall number of mutations and sequence heterogeneities present in virus stocks derived by reverse genetics was higher than the number of mutations in virus stocks created by conventional reassortment procedure. This observation is counterintuitive, since reverse genetics is based on homogeneous cDNA plasmids coding individual viral segments. One possible explanation for increased sequence heterogeneity of cDNA-derived RNA viruses is that transcription of cDNA plasmids to rescue the virus followed by its passages to adapt to grow in chicken eggs led to higher sequence heterogeneity.
In conclusion, our deep sequencing approach based on RNA library preparation was shown to be effective and reproducible for detection of low quantities of mutants in the entire genome of influenza viruses. It revealed that the viruses derived from the three of pandemic A/Ca/07/ 2009 (H1N1) vaccine viruses have varying levels of sequence heterogeneities. It can be used for assessment of the quality of seed viruses, and for ensuring that they do not contain mutations that could affect immunogenic properties of the vaccine.
Supporting Information S1 Table. Percentage of mutations (5%) emerged in the genome of A/PR/8/34 strain (CBER stock) obtained by sequencing five times its DNA library by HiSeq. (XLS) S2 Table. Percentage of mutations (5%) emerged in 121XP-M4 virus obtained by using RNA library preparation followed by MiSeq sequencing. (XLS) S3 Table. Percentage of mutations (5%) emerged in 121XP-M4 virus obtained by using RNA library preparation followed by HiSeq sequencing. (XLS) S4 Table. Percentage of mutations (5%) emerged in 121XP-M4 virus obtained by using DNA library preparation followed by HiSeq sequencing. (XLS) S5 Table. Percentage of mutations (5%) emerged in X-179A viruses obtained by using RNA library preparation followed by MiSeq sequencing. (XLS) S6 Table. Percentage of mutations (5%) emerged in X-179A viruses generated by using RNA library preparation followed by HiSeq sequencing. (XLS) S7 Table. Percentage of mutations (5%) emerged in X-179A viruses generated by using DNA library preparation followed by HiSeq sequencing. (XLS) S8 Table. Percentage of mutations (5%) emerged in X-181 viruses generated by using RNA library preparation followed by MiSeq sequencing. (XLS) S9 Table. Percentage of mutations (5%) emerged in X-181 viruses generated by using RNA library preparation followed by HiSeq sequencing. (XLS) S10 Table. Percentage of mutations (5%) emerged in X-181 viruses generated by using DNA library preparation followed by HiSeq sequencing. (XLS)