More than ten subgenotypes of genotype C Hepatitis B virus (HBV) have been reported, including C1 to C16 and two C/D recombinant subgenotypes (CD1 and CD2). However, inconsistent designations of these subgenotypes still exist.
We performed a phylogenetic analysis of all full-length genotype C HBV genome sequences to correct the misclassifications of HBV subgenotypes and to study the influence of recombination on HBV subgenotyping. Our results showed that although inclusion of the recombinant sequences changed the topology of the phylogenetic tree, it did not affect the subgenotyping of the non-recombinant sequences, except subgenotype C2. In addition, most of the subgenotypes have been properly designated. However, several misclassifications of HBV subgenotypes have been identified and corrected. For example, C11 proposed by Utsumi and colleagues in 2011 was found to be grouped with C12 proposed by Mulyanto and colleagues. Two sequences, GQ358157 and GU721029, previously designated as C6 have been re-designated as C12 and C7, respectively. Moreover, a quasi-subgenotype C2 was proposed, which included the old C2, several previously unclassified sequences and previously designated C14. In particular, we identified a novel subgenotype, tentative C14, which was well supported by phylogenetic analysis and sequence divergence of >4%.
A number of misclassifications in the subgenotyping of genotype C HBV have been identified in this study. After correcting the misclassifications, we proposed a better classification for the subgenotyping of genotype C HBV, in which a novel quasi-subgenotype C2 and a novel subgenotype, tentative C14, were described. Based on this large-scale analysis, we propose that a novel subgenotype should only be reported after a complete comparison of all relevant sequences rather than a few representative sequences only.
Citation: Shi W, Zhu C, Zheng W, Zheng W, Ling C, Carr MJ, et al. (2012) Subgenotyping of Genotype C Hepatitis B Virus: Correcting Misclassifications and Identifying a Novel Subgenotype. PLoS ONE 7(10): e47271. https://doi.org/10.1371/journal.pone.0047271
Editor: Jianming Qiu, University of Kansas Medical Center, United States of America
Received: June 22, 2012; Accepted: September 10, 2012; Published: October 15, 2012
Copyright: © Shi et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This study was partly supported by Science Foundation Ireland (PI grant 07/IN.1/B1783). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. No additional external funding was received for this study.
Competing interests: The authors have declared that no competing interests exist.
Ten genotypes (A to J) and more than 30 subgenotypes of HBV have been identified based on the general rule that different genotypes should diverge by at least 8% and different subgenotypes should diverge by at least 4% over the entire genome. Other rules for HBV genotyping and subgenotyping include the monophyletic nature of the genotypes and subgenotypes on a phylogenetic tree and high bootstrap support.
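The divergence rule above can be written down as a small sketch. It covers only the two thresholds stated in the text (8% and 4% over the entire genome); monophyly and bootstrap support have to be assessed separately on a tree. The function name and the returned labels are illustrative choices, not part of any established tool.

```python
# Thresholds taken from the general rule described in the text (percent divergence
# over the entire genome).
GENOTYPE_THRESHOLD = 8.0
SUBGENOTYPE_THRESHOLD = 4.0


def divergence_level(percent_divergence: float) -> str:
    """Return the classification level implied by sequence divergence alone.

    This deliberately ignores the other criteria (monophyly, bootstrap support).
    """
    if percent_divergence >= GENOTYPE_THRESHOLD:
        return "different genotypes"
    if percent_divergence >= SUBGENOTYPE_THRESHOLD:
        return "different subgenotypes"
    return "same subgenotype"
```

For instance, a divergence of 5.7% falls in the subgenotype range, while 3.8% is below the 4% threshold and would not, on divergence alone, justify a separate subgenotype.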
To date, genotype C has the largest number of reported subgenotypes, with at least 16 subgenotypes identified. In early 2004, Huy et al. found that genotype C could be classified into at least two subgenotypes, C1 and C2. Also in 2004, Norder et al. divided genotype C into four subgenotypes: C1 from East Asia, C2 mostly from China and Southeast Asia, C3 from Oceania and C4 from Aborigines in Australia. Subgenotype C5 was isolated from patients from the Philippines in 2006. Subgenotype C6 was first proposed on the basis of S gene and preC-C gene sequences from Papua, Indonesia, and was later confirmed by complete genome sequences in 2009. Almost at the same time, a virus strain isolated from the Philippines was also defined as subgenotype C6. After a comparison between these two C6 subgenotypes, the one from the Philippines was renamed as C7. However, some viruses from Nusa Tenggara, Indonesia were also named as subgenotype C7 by Mulyanto and colleagues. To avoid potential confusion in the delimitation of subgenotypes, Mulyanto and colleagues renamed their C7 as C8 in 2010. In addition, they proposed a novel subgenotype C9, which they had originally reported as an unclassifiable subgenotype. Subgenotype C10 was also isolated from Indonesia, where a few novel subgenotypes, such as B7, B8 and C7 to C9, were identified. In 2011, two independent research groups each named viruses isolated from Indonesia as C11. Moreover, Mulyanto et al. reported another novel subgenotype, C12, which has the same geographical origin as both C11 designations and many other HBV subgenotypes. Recently, Mulyanto et al. further described four novel subgenotypes, C13 to C16. These four subgenotypes were also isolated from Papua, Indonesia. Finally, two more subgenotypes associated with C/D recombination, CD1 and CD2, were isolated from Tibet, China.
Different genotypes usually have distinct geographical distributions. However, both genotypes B and C are prevalent in Asia and Oceania, which creates the potential for recombination between B and C through co-infection or super-infection. Several genotype C viruses have been reported to be recombinants. For example, a C13 strain from Indonesia was identified as a C13/B3 recombinant. Also in Indonesia, a strain of subgenotype C12 was shown to be a C/G recombinant. In addition, some C/D recombinants have been isolated from China.
A number of problems in the subgenotyping of genotype C HBV have been reported. First, there was reported incongruence between the C1 and C2 designations proposed by Huy et al. and by Norder et al. Although Schaefer and colleagues suggested that the designation proposed by Huy et al. should be used, subgenotype C2 as proposed by Huy et al. was not monophyletic. Second, two C6 subgenotypes were proposed by different research groups, though the one from the Philippines was subsequently renamed as C7. Third, as mentioned above, two different subgenotypes were each named C11 in 2011. Fourth, including recombinant sequences in a phylogenetic analysis can change the topology of the tree and change (mostly increase) the estimated sequence divergences. Recombination therefore has a potential influence on HBV subgenotyping. Unfortunately, most previous studies did not take recombination into consideration when designating novel subgenotypes.
In order to determine how recombination influences HBV subgenotyping, to correct the known and potential unidentified misclassifications in the subgenotyping of genotype C HBV, and to establish a better classification, we analyzed a large number of full-length genotype C HBV sequences using a phylogenetic approach.
Materials and Methods
In our previous report, 1214 sequences were identified as genotype C, including 96 potential recombinants. All these sequences were selected to compose a new dataset for further analysis. A second dataset excluding the recombinant sequences was also composed. In addition, a sequence of genotype B (GenBank accession number: D00329) was included in the two datasets and used as an outgroup. Information on these sequences, such as subgenotype and recombination, was extracted from GenBank annotations. An extensive literature review for sequences with references available in PubMed was carried out to obtain their subgenotype and recombination information, which was then used in defining the subgenotypes.
Phylogenetic analysis of the two datasets was carried out using RAxML under the GTRCAT approximation and random starting trees. One thousand rapid bootstrap replicates were performed with all other parameters set to default. Trees were visualized and analyzed using Dendroscope. The trees are available as Figures S1 and S2.
The mean nucleotide divergence (mean ± SD) between different subgenotypes was calculated using MEGA5 with the Kimura 2-parameter model. In order to obtain consistent and reliable sequence divergence values, 500 bootstrap replicates were applied.
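As a rough illustration of the divergence calculation, the following sketch computes the Kimura 2-parameter distance, d = -(1/2) ln[(1 - 2P - Q) sqrt(1 - 2Q)], where P and Q are the proportions of transitional and transversional differences, and averages it over all pairs drawn from two groups of aligned sequences. This is not the MEGA5 implementation: the function names are hypothetical, sites with gaps or ambiguous bases are simply skipped (an assumption), and the bootstrap step used in the study to estimate the SD is omitted.

```python
import math
from itertools import product

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1: str, seq2: str) -> float:
    """Kimura 2-parameter distance between two aligned nucleotide sequences.

    Sites where either sequence has a gap or an ambiguous base are skipped
    (an assumption of this sketch).
    """
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        if a == b:
            continue
        # A<->G and C<->T are transitions; every other change is a transversion.
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    # math.log raises ValueError if the sequences are too divergent for the model.
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


def mean_between_group_distance(group_a, group_b) -> float:
    """Mean K2P distance over all pairs with one sequence from each group."""
    distances = [k2p_distance(x, y) for x, y in product(group_a, group_b)]
    return sum(distances) / len(distances)
```

Multiplying the result by 100 gives a percentage that can be compared with the 4% and 8% thresholds.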
Phylogenetic analysis of all genotype C sequences showed that four subgenotypes, CD1, CD2, C4, and C5, were inter-genotype recombinants (Figure 1). CD1 and CD2 have been proposed as recombinant subgenotypes of genotype C. They were composed of C/D recombinants. Sequence divergence between CD1 and CD2 and that between CD2 and C2 were 4.1% and 5.7%, respectively (Table 1). However, sequence divergence between CD1 and C2 was 3.8% (Table 1), less than 4% (the general rule to define a new subgenotype). C4 was associated with inter-genotype recombination between genotype C and an unknown genotype. Sequences of C5 were mostly B/C recombinants, with one A/C recombinant.
It should be noted that, although inclusion of the recombinants did not influence HBV subgenotyping greatly (in other words, the clustering of non-recombinant subgenotypes was not changed greatly), it did change the topology of the phylogenetic tree (Figures 1 and 2). For example, C2 was closer to the root of the tree than C9 in the phylogenetic tree built using all the genotype C sequences (Figure 1). However, in the tree estimated using non-recombinant sequences only, C9 was the closest subgenotype to the root of the tree (Figure 2).
Our results revealed that most of the subgenotypes were properly designated. Subgenotypes C1, C3, C6, C7, C8, C9, C10, C11 (proposed by Mulyanto et al.), C12, C13, C15 and C16 were monophyletic (Figures 1 and 2). Sequence divergence between any two of the above subgenotypes was greater than 4% (Tables 1 and 2). Therefore, these subgenotypes were properly designated and should be maintained.
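The monophyly criterion used here can be illustrated with a minimal sketch, assuming a rooted tree represented as nested tuples with leaf names as strings. A set of sequences is treated as monophyletic if some clade of the tree contains exactly those sequences and no others. The representation, function names and leaf labels below are hypothetical and chosen only for illustration; in practice the check would be run on the RAxML trees with a phylogenetics library.

```python
def leaves(tree):
    """Return the set of leaf names under a node (leaf = str, internal node = tuple)."""
    if isinstance(tree, str):
        return {tree}
    result = set()
    for child in tree:
        result |= leaves(child)
    return result


def is_monophyletic(tree, taxa):
    """True if some clade of the rooted tree contains exactly the given taxa."""
    taxa = set(taxa)
    if leaves(tree) == taxa:
        return True
    if isinstance(tree, str):
        return False
    return any(is_monophyletic(child, taxa) for child in tree)


# Hypothetical toy tree: a1 and a2 form a clade, but a3 groups with b1 and b2.
example_tree = (("a1", "a2"), (("b1", "b2"), "a3"))
a_pair_is_clade = is_monophyletic(example_tree, {"a1", "a2"})
a_all_is_clade = is_monophyletic(example_tree, {"a1", "a2", "a3"})
```

In the toy tree, the sequences labelled "a" are scattered across two parts of the tree, which is the same pattern reported for sequences previously designated as C2.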
However, there were a few misclassifications. First, at the top part of the tree constructed using all genotype C sequences, three sequences (EU939628, EU939629 and EU939631) were previously defined as genotype B (Figure 1). We have demonstrated that they were B/C recombinants but closer to genotype C, and have corrected this information in a previous report. Also at the top of the tree, there were three sequences from China, GQ377630, GQ377635 and FJ386646. Information extracted from GenBank showed that the first two sequences belonged to subgenotype C4 and the third belonged to subgenotype C2 (Figure 1). This information was not correct, because these sequences did not cluster with C4 and C2, respectively; in fact, they had already been identified as B/C recombinants in our previous analysis.
Second, C11 has been named twice, by two different research groups. Both of the trees revealed that C11 proposed by Utsumi and colleagues actually clustered with C12 proposed by Mulyanto et al., supported by a high bootstrap value (100%, Figures 1 and 2). Therefore, C11 proposed by Utsumi and colleagues should be renamed as C12.
Third, sequences of C6 fell into three parts in each of the two trees (Figures 1 and 2). The first part was composed of 16 sequences isolated from Indonesia and has been labeled as subgenotype C6. However, in the second part, one sequence, GQ358157, previously defined as subgenotype C6, fell into a cluster of subgenotype C12. In the third part, one sequence from South Korea, GU721029, clustered with a C7 sequence from the Philippines. Sequence divergences between the first C6 and the other two parts, C12 and C7, were 5.1% and 5.3%, respectively (Table 1). Therefore, the subgenotypes of sequences in the second and third parts were not properly defined. Instead, the subgenotype of GQ358157 should be C12, while that of GU721029 should be C7.
Fourth, both of the trees revealed that subgenotype C2 was not monophyletic and that sequences previously designated as subgenotype C2 were scattered into several parts of the trees (Figures 1 and 2). In addition, there was no subgenotype information for some sequences. To determine whether subgenotype C2 was properly defined and to classify the sequences without subgenotype information, we tentatively named a few suspect sequences or branches as C×1 to C×9 (Figures 1 and 2). However, sequence divergences between C2 and the tentative designations C×1, C×2, C×3, C×5, C×6, C×8 and C×9 were less than 4% (Tables 1 and 2). By comparing the topologies of the two trees, and mostly based on the phylogeny constructed using non-recombinant sequences (Figures 1 and 2), we proposed that subgenotypes C2, C×3, C×1, C×2, C×9, C×6, C×8, C×7 and C14 composed a quasi-subgenotype C2 of Asian origin. Although sequence divergences between C×5 and several subgenotypes were less than 4% and that between C×5 and C2 was the lowest (2.8%), C×5 formed a monophyly with subgenotype C1 (Figures 1 and 2). In particular, it was supported by a high bootstrap value of 91% (Figure 2). Therefore, C×5 should be classified as C1. Apart from C×5, sequence divergences between C×4 and the other subgenotypes were always greater than 4% (Tables 1 and 2). Because lineage C×4 was a monophyly with a high bootstrap value of 100%, it should be classified as a novel subgenotype. As the previously defined C14 has been classified into the quasi-subgenotype C2, we proposed that C×4 should be named as the new C14 for continuous numbering. We then calculated sequence divergences between non-recombinant subgenotypes in the novel classification (Table 3). Sequence divergences between the quasi-subgenotype C2, C1, new C14 and any of the remaining non-recombinant subgenotypes were always greater than 4% (Table 3).
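The reasoning applied to the tentative lineages can be summarised in a short sketch that combines the three criteria: divergence from every other subgenotype, monophyly, and bootstrap support. The function name is hypothetical, and the bootstrap cutoff of 70% is an assumption of this sketch, since the text only speaks of "high" bootstrap support. The divergence values in the example are illustrative, apart from the 2.8% between C×5 and C2 quoted above.

```python
def qualifies_as_novel_subgenotype(divergences, monophyletic, bootstrap,
                                   min_divergence=4.0, min_bootstrap=70.0):
    """Apply the three subgenotyping criteria to one candidate lineage.

    divergences: dict mapping each existing subgenotype to its mean percent
    divergence from the candidate. Returns (passes, closest_subgenotype).
    min_bootstrap is an assumed cutoff, not a value given in the study.
    """
    closest = min(divergences, key=divergences.get)
    passes = (
        divergences[closest] >= min_divergence
        and monophyletic
        and bootstrap >= min_bootstrap
    )
    return passes, closest


# A lineage close to C2 (2.8% is quoted in the text; 3.1% is made up).
close_lineage = qualifies_as_novel_subgenotype({"C2": 2.8, "C1": 3.1}, True, 91.0)
# A lineage distinct from everything else (both divergence values are made up).
distinct_lineage = qualifies_as_novel_subgenotype({"C2": 4.6, "C1": 5.0}, True, 100.0)
```

A lineage that fails the divergence criterion is reported together with its closest subgenotype, although, as the C×5 case shows, the final assignment also depends on where the lineage sits in the tree.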
The accurate classification of HBV genotypes and subgenotypes is important because different viral genotypes and subgenotypes have shown differences in the course of disease, responses to anti-viral treatment regimens, and clinical outcomes. For example, subgenotype B1 was related to fulminant HBV infections in Japan, whereas subgenotype B2 has been reported to be associated with hepatocellular carcinoma (HCC) or HCC recurrence in young patients in East Asia. In particular, both subgenotypes C1 and C2 have been reported to be associated with the risk of HCC; however, only C2 has been associated with an increased risk of HCC.
It is still controversial whether recombinants should be reported separately or designated as novel subgenotypes. Although inclusion of the recombinant sequences in the phylogenetic analysis did change the topology of the tree, it played a limited role in subgenotyping the non-recombinant sequences. In addition, there are so far no generally accepted rules for reporting HBV recombinants. Therefore, the designation of subgenotypes C4, C5, CD1 and CD2 remained unchanged, although all of them have been shown to be inter-genotype recombinants. The C/D recombinants have been reported to be specifically restricted to the Qinghai-Tibet Plateau in western China. However, one CD2 virus has also been isolated from Belgium, and a few CD1 strains have been isolated from Mongolia (Figure 1).
Our results showed that most of the subgenotypes were properly designated, such as C1, C3, C6 to C13, and C15 to C16. They were monophylies and sequence divergences between them were always greater than 4%. Therefore, no change has been made to these subgenotypes in the new classification.
However, a few misclassifications have been identified and corrected. For example, the subgenotype information extracted from GenBank for a few sequences isolated from China was wrong; these sequences were identified as B/C recombinants in our previous report. C11 proposed by Utsumi and colleagues has been classified into C12. Two previously designated C6 sequences have been renamed as C12 and C7, respectively.
In particular, subgenotype C2 has been associated with an increased risk of HCC. However, subgenotype C2 was not monophyletic. Furthermore, the classification of the sequences falling between C2 and C1 was problematic, and some of them have not been assigned a subgenotype (Figure 2). To correct the misclassifications in subgenotype C2, we temporarily named several lineages C×1 to C×9. Although some of them (e.g. C×3) were monophylies with high bootstrap support, sequence divergences between C2 and C×1 to C×9 were mostly smaller than 4%. Therefore, designating them as separate subgenotypes was not suitable.
Alternatively, we proposed that quasi-subgenotype C2 should be used. The term “quasi-subgenotype” has been used to correct the misclassifications in the subgenotyping of HBV genotypes A and B. The novel quasi-subgenotype C2 was composed of sequences of Asian origin and included the old C2, C×1 to C×3, and C×6 to C×9. It also included the previously classified C14. The designation of quasi-subgenotype C2 has both advantages and disadvantages. On one hand, introducing the quasi-subgenotype C2 was the simplest feasible way to provide a robust and consistent classification for genotype C HBV, instead of introducing more subgenotypes, which would make the HBV subgenotyping classification more complex and inconsistent. On the other hand, the quasi-subgenotype C2 was still not monophyletic, which contradicts the current criteria used for HBV subgenotyping.
In addition, C×4 showed more than 4% divergence from the remaining subgenotypes. It was a monophyly with a bootstrap value of 100%. Therefore, we proposed that C×4 should be classified as a novel subgenotype and named it the new C14 for continuous numbering.
Based on the above corrections, we propose a novel classification for subgenotyping genotype C HBV. In the new classification, the original C1, C3 to C10, C11 proposed by Mulyanto and colleagues, C12 to C13, C15 to C16, CD1 and CD2 remained unchanged. C11 proposed by Utsumi and colleagues is classified as C12. The original C2 has been named quasi-subgenotype C2, and it includes several previously undefined sequences as well as the previously defined C14. In addition, C×4 has been identified as a novel subgenotype and has been named the new C14 for continuous numbering. This new classification system is well supported by the sequence divergence data (Table 3).
Based on the present large-scale analysis, we suggest that extreme caution should be exercised when proposing novel HBV subgenotypes. Apart from phylogenetic analysis and sequence divergence analysis, geographical information and even ethnic information might be used to guide HBV subgenotyping, since distributions of different HBV genotypes and subgenotypes show distinct geographical and certain ethnic characteristics. In addition, most previous analyses with a few selected representative strains often showed high bootstrap support for subgenotype C2 and its monophyletic nature. However, when all genotype C sequences were analyzed together, neither the high bootstrap support for subgenotype C2 nor its monophyletic nature was retained. Therefore, we suggest that, if possible, the designation of a novel subgenotype should be based on a comparison of all available relevant sequences in public databases rather than only a few representative strains.
To sum up, we studied the influence of including recombinant sequences on HBV subgenotyping and highlighted the importance and urgency of introducing a novel nomenclature system to report HBV recombinants. In addition, we identified and corrected several misclassifications in the subgenotyping of genotype C HBV. Based on these corrections, a novel and more robust and consistent classification for the subgenotyping of genotype C HBV has been proposed, in which a novel quasi-subgenotype C2 and a novel subgenotype (new C14) were introduced.
Phylogenetic tree constructed using all genotype C HBV sequences.
Phylogenetic tree constructed using all non-recombinant genotype C HBV sequences.
We thank Dr. XY LANG, Dr. XN WANG and their colleagues in the Supercomputing Center, Computer Network Information Center of The Chinese Academy of Sciences for their help in installing and optimizing RAxML on the SCIGRID.
Conceived and designed the experiments: WS DGH ZZ. Performed the experiments: WS CZ WZ WMZ CL. Analyzed the data: WS WMZ CL. Wrote the paper: WS MJC DGH ZZ.
- 1. Huy T, Ngoc T, Abe K (2008) New complex recombinant genotype of hepatitis B virus identified in Vietnam. J Virol 82: 5657–5663.
- 2. Tatematsu K, Tanaka Y, Kurbanov F, Sugauchi F, Mano S, et al. (2009) A genetic variant of hepatitis B virus divergent from known human and ape genotypes isolated from a Japanese patient and provisionally assigned to new genotype J. J Virol 83: 10538–10547.
- 3. Kramvis A, Kew M, Francois G (2005) Hepatitis B virus genotypes. Vaccine 23: 2409–2423.
- 4. Cao GW (2009) Clinical relevance and public health significance of hepatitis B virus genomic variations. World J Gastroenterol 15: 5761–5769.
- 5. Okamoto H, Tsuda F, Sakugawa H, Sastrosoewignjo RI, Imai M, et al. (1988) Typing hepatitis B virus by homology in nucleotide sequence: comparison of surface antigen subtypes. J Gen Virol 69 (Pt 10): 2575–2583.
- 6. Kramvis A, Kew MC (2005) Relationship of genotypes of hepatitis B virus to mutations, disease progression and response to antiviral therapy. J Viral Hepat 12: 456–464.
- 7. Schaefer S, Magnius L, Norder H (2009) Under construction: classification of hepatitis B virus genotypes and subgenotypes. Intervirology 52: 323–325.
- 8. Pourkarim MR, Amini-Bavil-Olyaee S, Lemey P, Maes P, Van Ranst M (2010) Are hepatitis B virus “subgenotypes” defined accurately? J Clin Virol 47: 356–360.
- 9. Huy TT-T, Ushijima H, Quang VX, Win KM, Luengrojanakul P, et al. (2004) Genotype C of hepatitis B virus can be classified into at least two subgroups. J Gen Virol 85: 283–292.
- 10. Norder H, Courouce AM, Coursaget P, Echevarria JM, Lee SD, et al. (2004) Genetic diversity of hepatitis B virus strains derived worldwide: genotypes, subgenotypes, and HBsAg subtypes. Intervirology 47: 289–309.
- 11. Sugauchi F, Mizokami M, Orito E, Ohno T, Kato H, et al. (2001) A novel variant genotype C of hepatitis B virus identified in isolates from Australian Aborigines: complete genome sequence and phylogenetic relatedness. J Gen Virol 82: 883–892.
- 12. Sakamoto T, Tanaka Y, Orito E, Co J, Clavio J, et al. (2006) Novel subtypes (subgenotypes) of hepatitis B virus genotypes B and C among chronic liver disease patients in the Philippines. J Gen Virol 87: 1873–1882.
- 13. Lusida MI, Nugrahaputra VE, Soetjipto, Handajani R, Nagano-Fujii M, et al. (2008) Novel subgenotypes of hepatitis B virus genotypes C and D in Papua, Indonesia. J Clin Microbiol 46: 2160–2166.
- 14. Utsumi T, Lusida MI, Yano Y, Nugrahaputra VE, Amin M, et al. (2009) Complete genome sequence and phylogenetic relatedness of hepatitis B virus isolates in Papua, Indonesia. J Clin Microbiol 47: 1842–1847.
- 15. Cavinta L, Sun J, May A, Yin J, von Meltzer M, et al. (2009) A new isolate of hepatitis B virus from the Philippines possibly representing a new subgenotype C6. J Med Virol 81: 983–987.
- 16. Cavinta L, Cao GW, Schaefer S (2009) Description of a new hepatitis B virus C6 subgenotype found in the Papua province of Indonesia and suggested renaming of a tentative C6 subgenotype found in the Philippines as subgenotype C7. J Clin Microbiol 47: 3068–3069.
- 17. Mulyanto, Depamede SN, Surayah K, Tsuda F, Ichiyama K, et al. (2009) A nationwide molecular epidemiological study on hepatitis B virus in Indonesia: identification of two novel subgenotypes, B8 and C7. Arch Virol 154: 1047–1059.
- 18. Mulyanto, Depamede SN, Surayah K, Tjahyono AAH, Jirintai, et al. (2010) Identification and characterization of novel hepatitis B virus subgenotype C10 in Nusa Tenggara, Indonesia. Arch Virol 155: 705–715.
- 19. Mulyanto, Depamede SN, Wahyono A, Jirintai, Nagashima S, et al. (2011) Analysis of the full-length genomes of novel hepatitis B virus subgenotypes C11 and C12 in Papua, Indonesia. J Med Virol 83: 54–64.
- 20. Utsumi T, Nugrahaputra VE, Amin M, Hayashi Y, Hotta H, et al. (2011) Another novel subgenotype of hepatitis B virus genotype C from papuans of Highland origin. J Med Virol 83: 225–234.
- 21. Mulyanto, Pancawardani P, Depamede SN, Wahyono A, Jirintai S, et al. (2012) Identification of four novel subgenotypes (C13–C16) and two inter-genotypic recombinants (C12/G and C13/B3) of hepatitis B virus in Papua province, Indonesia. Virus Res 163: 129–140.
- 22. Cui C, Shi J, Hui L, Xi H, Zhuoma, et al. (2002) The dominant hepatitis B virus genotype identified in Tibet is a C/D hybrid. J Gen Virol 83: 2773–2777.
- 23. Wang Z, Liu Z, Zeng G, Wen S, Qi Y, et al. (2005) A new intertype recombinant between genotypes C and D of hepatitis B virus identified in China. J Gen Virol 86: 985–990.
- 24. Wang Z, Hou J, Zeng G, Wen S, Tanaka Y, et al. (2007) Distribution and characteristics of hepatitis B virus genotype C subgenotypes in China. J Viral Hepat 14: 426–434.
- 25. Sugauchi F, Orito E, Ichida T, Kato H, Sakugawa H, et al. (2002) Hepatitis B virus of genotype B with or without recombination with genotype C over the precore region plus the core gene. J Virol 76: 5985–5992.
- 26. Sakamoto T, Tanaka Y, Simonetti J, Osiowy C, Borresen ML, et al. (2007) Classification of hepatitis B virus genotype B into 2 major types based on characterization of a novel subgenotype in Arctic indigenous populations. J Infect Dis 196: 1487–1492.
- 27. Ahn SH, Yuen L, Revill P (2009) Clarification required for the definition of hepatitis B virus subgenotypes C1 and C2. Intervirology 52: 321–322.
- 28. Shi W, Carr MJ, Dunford LM, Zhu CD, Hall WW, et al. (2012) Identification of novel inter-genotypic recombinants of human hepatitis B viruses by large-scale phylogenetic analysis. Virology 427: 51–59.
- 29. Stamatakis A, Ludwig T, Meier H (2005) RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees. Bioinformatics 21: 456–463.
- 30. Stamatakis A (2006) Phylogenetic models of rate heterogeneity: a high performance computing perspective. Proceedings of 20th IEEE/ACM International Parallel and Distributed Processing Symposium (IPDPS2006), High Performance Computational Biology Workshop. Rhodos, Greece.
- 31. Huson DH, Richter DC, Rausch C, Dezulian T, Franz M, et al. (2007) Dendroscope: An interactive viewer for large phylogenetic trees. BMC Bioinformatics 8: 460.
- 32. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, et al. (2011) MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol 28: 2731–2739.
- 33. Kimura M (1980) A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol 16: 111–120.
- 34. Thedja MD, Muljono DH, Nurainy N, Sukowati CH, Verhoef J, et al. (2011) Ethnogeographical structure of hepatitis B virus genotype distribution in Indonesia and discovery of a new subgenotype, B9. Arch Virol 156: 855–868.
- 35. Chu CJ, Lok AS (2002) Clinical significance of hepatitis B virus genotypes. Hepatology 35: 1274–1276.
- 36. Schaefer S (2005) Hepatitis B virus: significance of genotypes. J Viral Hepat 12: 111–124.
- 37. Miyakawa Y, Mizokami M (2003) Classifying hepatitis B virus genotypes. Intervirology 46: 329–338.
- 38. Guettouche T, Hnatyszyn HJ (2005) Chronic hepatitis B and viral genotype: the clinical significance of determining HBV genotypes. Antivir Ther 10: 593–604.
- 39. Ganem D, Prince AM (2004) Mechanisms of disease: hepatitis B virus infection - natural history and clinical consequences. N Engl J Med 350: 1118–1129.
- 40. Ni YH, Chang MH, Wang KJ, Hsu HY, Chen HL, et al. (2004) Clinical relevance of hepatitis B virus genotype in children with chronic infection and hepatocellular carcinoma. Gastroenterology 127: 1733–1738.
- 41. Yin J, Zhang H, Li C, Gao C, He Y, et al. (2008) Role of hepatitis B virus genotype mixture, subgenotypes C2 and B2 on hepatocellular carcinoma: compared with chronic hepatitis B and asymptomatic carrier state in the same area. Carcinogenesis 29: 1685–1691.
- 42. Chan HL, Tse CH, Mo F, Koh J, Wong VW, et al. (2008) High viral load and hepatitis B virus subgenotype Ce are associated with increased risk of hepatocellular carcinoma. J Clin Oncol 26: 177–182.
- 43. Zhou B, Xiao L, Wang Z, Chang ET, Chen J, et al. (2011) Geographical and ethnic distribution of the HBV C/D recombinant on the Qinghai-Tibet Plateau. PLoS One 6: e18708.
- 44. Pourkarim MR, Amini-Bavil-Olyaee S, Verbeeck J, Lemey P, Zeller M, et al. (2010) Molecular evolutionary analysis and mutational pattern of full-length genomes of hepatitis B virus isolated from Belgian patients with different clinical manifestations. J Med Virol 82: 379–389.
- 45. Elkady A, Tanaka Y, Kurbanov F, Oynsuren T, Mizokami M (2008) Virological and clinical implication of core promoter C1752/V1753 and T1764/G1766 mutations in hepatitis B virus genotype D infection in Mongolia. J Gastroenterol Hepatol 23: 474–481.
- 46. Pourkarim MR, Amini-Bavil-Olyaee S, Lemey P, Maes P, Van Ranst M (2011) HBV subgenotype misclassification expands quasi-subgenotype A3. Clin Microbiol Infect 17: 947–949.
- 47. Shi WF, Zhu CD, Zheng W, Carr MJ, Higgins DG, et al. (2012) Subgenotype reclassification of genotype B hepatitis B virus. BMC Gastroenterol 12: 116.
- 48. Thedja MD, Muljono DH, Nurainy N, Sukowati CH, Verhoef J, et al. (2011) Ethnogeographical structure of hepatitis B virus genotype distribution in Indonesia and discovery of a new subgenotype, B9. Arch Virol 156: 855–868.