Sapovirus is a genus of caliciviruses that are known to cause enteric disease in humans and animals. There is considerable genetic diversity among the sapoviruses, which are classified into different genogroups based on phylogenetic analysis of the full-length capsid protein sequence. While several mammalian species, including humans, pigs, minks, and dogs, have been identified as animal hosts for sapoviruses, there were no reports of sapoviruses in bats in spite of their biological diversity. In this report, we present the results of a targeted surveillance study in different bat species in Hong Kong. Five of the 321 specimens from the bat species, Hipposideros pomona, were found to be positive for sapoviruses by RT-PCR. Complete or nearly full-length genome sequences of approximately 7.7 kb in length were obtained for three strains, which showed similar organization of the genome compared to other sapoviruses. Interestingly, they possess many genomic features atypical of most sapoviruses, like high G+C content and minimal CpG suppression. Phylogenetic analysis of the viral proteins suggested that the bat sapovirus descended from an ancestral sapovirus lineage and is most closely related to the porcine sapoviruses. Codon usage analysis showed that the bat sapovirus genome has greater codon usage bias relative to other sapovirus genomes. In summary, we report the discovery and genomic characterization of the first bat calicivirus, which appears to have evolved under different conditions after early divergence from other sapovirus lineages.
Citation: Tse H, Chan W-M, Li KSM, Lau SKP, Woo PCY, Yuen K-Y (2012) Discovery and Genomic Characterization of a Novel Bat Sapovirus with Unusual Genomic Features and Phylogenetic Position. PLoS ONE7(4): e34987. https://doi.org/10.1371/journal.pone.0034987
Editor: Jean-Pierre Vartanian, Institut Pasteur, France
Received: January 2, 2012; Accepted: March 8, 2012; Published: April 13, 2012
Copyright: © 2012 Tse et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work is partly supported by the Research Grant Council; University Development Fund, The University of Hong Kong; The Tung Wah Group of Hospitals Fund for Research in Infectious Diseases; the HKSAR Research Fund for the Control of Infectious Diseases of the Health, Welfare and Food Bureau; and the Shaw Foundation. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
The caliciviruses are a family of small non-enveloped viruses, and can be classified into five genera: Vesivirus, Lagovirus, Norovirus, Sapovirus and Nebovirus. They possess a non-segmented, polyadenylated, positive-sense ssRNA genome of about 7.5 to 8.5 kb in length, enclosed in an icosahedral capsid of 27 to 40 nm in diameter. Among them, noroviruses and sapoviruses (SaVs) are well known to cause enteric disease in a range of mammals, including humans, while vesiviruses and lagoviruses cause systemic diseases in specific animal hosts. Nebovirus is the most recently established genus in the family Caliciviridae , and its members are associated with enteric diseases in cattle , . A putative sixth genus, Recovirus, has been proposed for a novel calicivirus detected in stool specimens from rhesus monkeys , . Another new genus, Valovirus, has been proposed for a novel group of swine caliciviruses known as the St-Valérien-like viruses . In addition, there exist other unclassified caliciviruses, such as the recently described chicken calicivirus .
The genus Sapovirus currently contains only one recognized species, the Sapporo virus, which was discovered in 1977 in Sapporo, Japan . The SaV genome is approximately 7.1 to 7.5 kb in length, and may have two or three ORFs. ORF1 encodes a polyprotein that undergoes proteolytic cleavage to form the non-structural proteins and the major capsid protein VP1. ORF2 encodes the minor structural protein VP2. ORF3 encodes a small basic protein of unknown function , . Interestingly, it is located in an overlapping reading frame within ORF1, and is present only in SaVs from selected genogroups. At present, SaVs are classified formally into 5 genogroups based on phylogenetic analysis of the full-length VP1 sequence, though additional genogroups have been proposed to accommodate some novel SaVs discovered in recent years. Further classification of SaVs into genotypes has also been undertaken, though taxonomic assignment at the genotype level appears to be less well-defined than at the genogroup level .
As mentioned above, both noroviruses and SaVs generally cause mild to asymptomatic enteric infections in human and animal hosts . Human SaV infections are reported to be similar to or milder than human norovirus infections, but SaV infections have a shorter duration of viral shedding and are less associated with projectile vomiting –. Incidence of SaV-associated gastroenteritis infections remains less than norovirus-associated infections for both sporadic and outbreak settings, though various studies have reported increasing rates of SaV infections around the world –. The genetic diversity of SaVs is comparable to that of noroviruses, and the diversity of reported animal hosts is also similar. Noroviruses have been discovered in specimens from humans, pigs, cattle, dogs, sea lions, African lion, and mice –. In comparison, SaVs have been found in specimens from humans, pigs, dogs, minks and California sea lions , –.
Bats (order Chiroptera of class Mammalia) constitute a significant portion of biological diversity in many ecosystems and have a wide geographical distribution . We have previously discovered novel viruses in several local bat species –, and there were many similar discoveries of novel bat viruses by researchers in other parts of the world. In particular, important human viral pathogens like the SARS virus, Nipah virus and Ebola virus were found to have originated from bats and contributed to substantial human morbidity and mortality in recent outbreaks. Taken together, these discoveries hint that these small mammals are important reservoirs of diverse and undiscovered animal viruses, with significant risk of zoonotic transmission to humans .
In the present study, we investigated the presence of unknown calicivirus diversity in bats by targeted RT-PCR screening. Novel SaV sequences were amplified from several faecal samples of the bat species Hipposideros pomona, and genome sequences were obtained for three strains of the bat SaV. Sequence analysis indicated that the novel virus possesses several genomic features atypical of SaVs, and phylogenetic analysis revealed that it descended from a lineage that had diverged early from other SaV.
Surveillance and detection of novel SaVs in bats
A total of 728 anal swabs from different bat species in Hong Kong were obtained. No obvious signs of enteric disease, like anorexia and diarrhoea, were observed in the bats during the brief period of captivity needed for sampling.
RT-PCR using broadly reactive degenerate primers for a 185 nt fragment in the 3D-like RNA-dependent RNA polymerase (RdRp) region of the calicivirus ORF1 gene was positive in two specimens. Repeated screening using more sensitive specific primers revealed three additional positive specimens. Further information on the species and RT-PCR screening results are presented in Table 1, Table S1 and Figure S1. Sequence similarity search using BLASTN against the NCBI non-redundant nucleotide database did not reveal significant similarity to known SaV sequences. Another search using BLASTX against the NCBI non-redundant protein database produced hits to SaV sequences, with the most closely related sequence being the RdRp sequence of porcine SaV (GenBank accession number ACT98315) at 43% aa identity. A phylogenetic tree was constructed from the nucleotide alignment based on the length of the partial RdRp sequence obtained from bat SaV/TLC72 (GenBank accession number JQ267527) (Figure S2).
Genome sequencing and analysis of novel bat SaVs
Complete or nearly full-length genome sequences (with incomplete 5′ ends) were obtained for three positive samples using the sequencing strategy as described in the Methods section. For two of the samples that were positive only for RT-PCR screening with specific primers, only sequences for short segments of the viral genome were obtained. Additional viral genome sequencing on these samples was unsuccessful due to limited clinical materials available and possibly low viral titres. The complete genome of bat SaV strain TLC58 (Genbank accession number JN899075) is 7696 nt in length and has a genomic G+C content of 60.2 mol%. Both the length and G+C content of the bat SaV genome are significantly higher than that of other known SaVs (Table 2). Each genome is predicted to contain 3 overlapping ORFs, comparable to the genome organization of SaVs in GI, GIV and GV (Figure 1). The 5′-UTR and the 3′-UTR are 9 nt and 225 nt in length, respectively. The length of the 3′-UTR is considerably longer than other SaVs (Table 2). The two other nearly full-length bat SaV genomes were found to be highly similar to that of the complete bat SaV/TLC58 genome in nucleotide sequence and genome organization, and were not analysed separately.
The genome organization of the bat SaV TLC58/HK in comparison with the genome organization of human SaV GI strain Mancheseter, human SaV GII strain Mc10, porcine enteric calicivirus, and norovirus GII strain MD145.
The complete ORF1 is 6855 nt long, and encodes a large precursor polyprotein with an estimated molecular mass of 246.8 kDa. The polyprotein contains characteristic amino acid motifs conserved in caliciviruses: 2C-like NTPase at residue 482 (GPPGIGKT), VPg at residues 958 (KGKTK) and 972 (SEYEE), 3C-like protease at residue 1183 (GDCG), 3D-like RNA-dependent RNA polymerase at residues 1520 (GLPSG) and 1568 (YGDD), and VP1 at residue 1867 (PPG). It undergoes proteolytic processing to produce the nonstructural viral proteins and the major capsid protein VP1. Based on comparison with the ORF1 cleavage map of SaV/Mc10 , , a human SaV GII strain, we predicted the cleavage site that generates the major capsid protein to be located between residues 1740 (E) and 1741 (G). An in-frame AUG start codon is located in a favourable context for translation initiation (GUGUUUGUGAUGGA) just upstream to the cleavage site, which has also been reported in other caliciviruses , . This sequence is noted to be similar to the 5′ UTR of the genome, and it was postulated that the site might permit internal translation initiation from subgenomic RNA . The sequence identities of the bat SaV/TLC58 with other SaVs in the complete ORF1 protein sequence vary between 36.0% and 37.4% (Table 3). While comparison with caliciviruses of other genera, the ORF1 sequence identities are overall lower (15.6%–22.8%) than those between different SaVs (Table S2). For individual alignment of protease – polymerase, sequence identities with other SaVs (45.3%–48.4%) are overall higher than those with other genera (22.7%–32.1%) (Table 3 and Table S2). The VP1 is predicted to be 546 aa long, and has a molecular mass of 56.6 kDa. It shares 36.1% to 39.2% amino acid identities with VP1 of other SaVs (Table 3). Likewise analysis with caliciviruses of other genera reveals lower similarities with 14.9% to 23.4% sequence identities (Table S2).
The complete ORF2 is 615 nt long, with an overlapping region of 8 nt with the 3′ terminus of ORF1. Its reading frame is +1 relative to that of ORF1, unlike most other SaVs (Table 2). ORF2 encodes the minor structural protein VP2, which has an estimated molecular mass of 21 kDa. The mechanism of translation initiation in ORF2 of SaVs has not been fully elucidated. In the present case, a translational upstream ribosome binding site (TURBS) motif (CAUGGGACC; underline indicates region complementary to 18 S ribosomal rRNA sequence) could be identified at 24 nt upstream of the ORF2 start codon. Sequence identities for VP2 with other SaVs vary from 15.5% to 19.9% (Table 3). By comparison with caliciviruses of other genera, sequence identities are generally lower than those between SaVs. (4.8%–12.3%) (Table S2).
Phylogenetic and recombination analysis
Phylogenetic trees were constructed using the predicted amino acid sequences of the ORF1 precursor polyprotein (Figure 2), VP1 and VP2 (Figure 3). The LG+G+F model was found to be the best-fit substitution model in all cases. Phylogenetic analysis was not performed for the putative ORF3 product as no homologous sequences were available. Sequence analysis with the Recombination Analysis Tool did not reveal any potential recombination breakpoints in the bat SaV sequences.
SH-like aLRT branch support values of greater than 0.70 are shown besides major branches. Scale bar indicates the number of inferred substitutions per site.
The trees were constructed based on the full-length amino acid sequences of (a) VP1 major capsid protein, and (b) VP2 minor structural protein. SH-like aLRT branch support values of greater than 0.70 are shown besides major branches. Scale bar indicates the number of inferred substitutions per site.
There are subtle but important differences in the phylogenetic position of the bat SaV in the three phylogenetic trees. In the tree based on the full-length amino acid sequences of the ORF1 polyprotein, the bat SaVs are clustered tightly with the porcine SaVs in a monophyletic clade constituting the SaVs. However, in the VP1 tree, the bat SaVs are positioned just outside the clade of other SaVs. In the VP2 tree, the bat SaVs are located approximately equidistant from the GII noroviruses and porcine SaVs. The phylogenetic positions of the bat SaVs are supported by high Shimodaira-Hasegawa-like approximate likelihood ratio test (SH-like aLRT) branch support values as calculated by PhyML.
Although the phylogenetic positions of the novel bat virus are slightly divergent in the three trees, they generally show the bat SaV as being most closely related to the SaVs. In our opinion, there is insufficient ground for proposing a new genus for the novel virus under the current framework of taxonomic classification. The ORF1 polyprotein and VP1 capsid protein sequences of the novel bat virus showed obvious phylogenetic clustering with other SaV sequences. It should also be noted that the VP2 protein sequences are shorter and more divergent, and therefore are considered to be less useful in the phylogenetic classification of caliciviruses . Lastly, the genome organization of the bat SaV is highly similar to that of the SaVs as shown above. Hence, together with relatively high sequence identities with other SaVs rather than with calicivirues in other genera (Table S2), we propose that the novel bat virus be classified as a new member of the genus Sapovirus in the family Caliciviridae.
Codon usage and compositional bias analysis
As genomic nucleotide composition is strongly associated with codon usage bias in viruses, we examined the codon usage in the genomes of the novel bat SaV and other SaVs given their different nucleotide composition. The bat SaV genome was found to have significantly greater codon usage bias than the other SaV genomes, as measured by their effective number of codons (Nc) (Figure 4). Adjusting Nc for background nucleotide composition (Nc′) did not significantly affect the observed difference in codon usage bias.
The scratterplot of codon usage summary statistics Nc and Nc′ against the proportion of G or C nucleotides at the 3rd position of synonymous codons (GC3s), showing greater codon usage bias in the bat SaV genome relative to other SaV genomes. Unlike the porcine enteric caliciviruses, the observed difference in codon usage bias persists with adjustment of background nucleotide composition (Nc′).
Next, we examined CpG dinucleotide bias in the SaV genomes, as studies on other animal RNA viruses suggest that CpG suppression is a major factor in their genome evolution , . Odds ratio of CpG and GpC dinucleotides (ρCG and ρGC) and the CpG/GpC ratio were calculated to assess the degree of CpG suppression. Results confirm the presence of significant CpG suppression (ρCG≤0.78) in examined SaV genomes, with the only exception being the bat SaV genome (Table 4). ρGC values are similar across examined SaV genomes, suggesting that the difference in CpG suppression is specific. All SaV genomes are found to have a slightly negative GC skew, and there is no major difference between the degree of GC skew in bat SaV and the other SaVs (Table 4). This suggests that the degree of cytosine deamination is not a major factor in the altered GC composition and CpG suppression in the bat SaV genome.
Although the taxonomic classification of caliciviruses has improved with the availability of full-length gene sequences and robust phylogenetic methods , the increase in genetic diversity introduced by novel caliciviruses would necessitate further taxonomic revisions within the family. The International Committee on Taxonomy of Viruses has adopted a systematic polythetic approach towards virus taxonomy, but classification at or below the genus level may be complicated by the specific biology of diverse viruses. As a case in point, the proposed assignment of the novel bat virus to the genus Sapovirus might be opposed on the basis of an increased genomic G+C content, the different reading frame of ORF2, and the increased length of the 3′ UTR. On the other hand, the polythetic criteria for inclusion in the genus are not fully clear, and phylogenetic distances between viral gene sequences have assumed overriding importance in previous and current classifications. It should be noted that even phylogenetic analysis may be confounded by other factors such as the cleavage pattern of ORF1 polyprotein, which has not been determined experimentally for many caliciviruses.
Among the various notable genomic features and properties in the novel bat SaV, we were most intrigued by its remarkably high G+C genomic content. Most caliciviruses have a genomic G+C content of 44.2–57.4 mol%. Among them, the genomic G+C content of the SaVs lie within the relatively narrow range of 49.0–53.6 mol% in spite of their genetic diversity. Hence, the presently observed G+C genomic content of 60.2% is significantly higher than that for other SaVs or caliciviruses, and indeed would rank amongst mammalian RNA viruses with the highest G+C genomic content . Relatively little is known about the evolution of genome composition in caliciviruses. A number of factors have been postulated to exert selectional pressure on the G+C content of viral genomes, including host body temperature, immune pressure, codon and nucleotide usage patterns –. Our results suggested that the increased G+C content is associated with a decrease in CpG suppression, but does not have a direct correlation with codon usage bias. We are unaware of any previous findings indicating that genomes of bat viruses are under less CpG suppression, thus the observed reduction in CpG suppresion is unlikely to result from host-related factors. The greater codon usage bias in the bat SaV genome is another interesting genome feature, which could be associated with altered dinucleotide frequencies. The association could be tested by Markov modelling of the dinucleotide and codon frequencies in the SaV genomes, although the small genome sizes and the presently small number of complete genomes would limit the usefulness of this approach .
The novel SaV described presently is the first known member of the Caliciviridae in bats. The approach to its discovery is based on the established strategy of targeted genetic screening informed by conserved sequences of related viruses. Although this “homology-based” strategy has been successful in the discovery of numerous viruses, the advent of affordable high-density microarrays and high-thoughput sequencing has given rise to virus discovery through metagenomics. Indeed, the first canine SaVs were discovered recently by metagenome sequencing of canine diarrhoea samples on a high-throughput pyrosequencer . Important advantages of the new method include detection of novel viruses not closely related to known viruses, and the capacity to detect multiple divergent viruses in cases of co-infection. However, metagenomics sequencing can suffer from possible bias during sample preparation , and it is unlikely to detect very low titres of viruses in a specimen, such as the three bat faecal samples that were positive upon repeat screening with specific PCR primers in the present study. While we anticipate the increasing utilization of the metagenomics approach, existing methods such as viral culture, electron microscopy and targeted nucleic acid amplification would continue to serve important roles in virus discovery.
As Hong Kong is a highly urbanized city, the local roosting sites of bats are mainly man-made structures, such as water tunnels and abandoned mines. Hipposideros pomona is very common and widespread throughout Hong Kong countryside areas. It is a small-sized leaf-nosed bat with body weight ranged from 6–8 g. It possesses a small nose leaf which is simple, small, and lacking of lateral leaflets (Figures S3 and S4). This species may aggregate in small chambers or enclosures where the air flow is relatively limited. The 5 SaV-infected specimens were all captured in a place called Tai Lam – Shek Kong located next to a major country park of Hong Kong, and this roosting site shares similar ecological characteristic with other sampled roosting sites. Due to the extremely high human population density in Hong Kong, direct contact between humans and bats is relatively frequent. Fortunately, no local case of bat zoonosis has ever been reported . The relatively large genetic distance between the present bat SaV and other mammalian SaVs suggests that the zoonotic risk posed by this virus is likely to be low, though this should be confirmed with further in vitro and in vivo studies.
There are two main limitations in the current study. First and foremost, clinical information on the sampled bats is limited to the brief period of captivity needed for sample collection, which is unlikely to reflect the disease association of the virus accurately. In other words, the scope of the study is limited to surveillance of viral diversity and possible discovery of new viruses. Secondly, the number of samples for the novel virus is quite small, despite the use of specific PCR primers for screening and the relatively large number of samples collected. Thus, we were unable to draw conclusions on the seasonality of its detection or its host specificity. To address these limitations, long-term follow-up studies would be required to identify sufficient positive samples with associated clinical data. Increasing the scale of surveillance would also help, though there are practical geographical and logistic constraints in our locality.
In conclusion, we identified a novel bat SaV with several genomic features and properties that set it apart from other members of the genus Sapovirus. Phylogenetic analysis suggests that its ancestral lineage had diverged early from the other SaVs and evolved under different conditions. Further discovery and characterization of additional strains would enhance our understanding of the evolutionary history of the SaVs and other caliciviruses.
Materials and Methods
Surveillance and sample collection
The study was approved by the Department of Agriculture, Fisheries and Conservation, HKSAR; and Committee on the Use of Live Animals in Teaching and Research, The University of Hong Kong. Bats from 14 different locations in rural areas of Hong Kong, including water tunnels, closed mines, sea caves and forested areas, were captured over a 36-month period. Anal swabs were collected by an experienced veterinary surgeon, and kept in viral transport medium at 4°C before processing.
Viral RNA was extracted from the anal swabs using a QIAamp Viral RNA mini kit (Qiagen). The RNA was eluted into 50 µl RNase-free water and was used as the template for RT-PCR.
RT-PCR of the RdRp region using conserved primers, and sequencing
Screening was performed by amplifying a 185 nt fragment in the RdRp region of the ORF1 gene of caliciviruses. Conserved degenerate primers (5′-GAYTAYTCNMRRTGGGAYTC-3′ and 5′- GGCATNCCNGAKGGNAYNCC -3′) were designed from the multiple sequence alignment of the available calicivirus gene sequences in NCBI GenBank. First-strand cDNA synthesis was performed using SuperScript III kit (Invitrogen) according to manufacturer's instructions. The PCR mixture (25 µl) contained cDNA, PCR buffer (10 mM Tris/HCl pH 8.3, 50 mM KCl, 2 mM MgCl2 and 0.01% gelatin), 200 µM of each dNTP and 1.0 U AmpliTaq Gold polymerase (Applied Biosystems). PCR cycling conditions were as follows: hot start at 94°C for 7 min, followed by 50 cycles of 94°C for 1 min, 50°C for 1 min and 72°C for 1 min with a final extension at 72°C for 10 min in an automated thermal cycler (Applied Biosystems). Standard precautions were taken to avoid PCR contamination and no false-positive signal was observed in the negative controls. The PCR products were gel-purified using a QIAquick gel extraction kit (Qiagen). Both strands of the PCR products were sequenced twice with an ABI Prism 3730xl DNA Analyser (Applied Biosystems), using the two PCR primers.
RT-PCR screening of bat sapovirus using specific primers
Additional RT-PCR screening was performed on the same samples using specific primers designed from the RdRp nucleotide sequences of bat SaVs obtained from previous rounds of RT-PCR and sequencing, as RT-PCR screening with specific primers usually offers higher sensitivity than a comparable screening with consensus degenerate primers. Sequences of the specific primers are as follows: forward primer 5′- CACAATGCAGCCAGCCA-3′ and reverse primer 5′- GGTGCGCGTGGTGAACAC-3′. PCR cycling conditions were as follows: hot start at 94°C for 7 min, followed by 50 cycles of 94°C for 1 min, 52°C for 1 min and 72°C for 1 min with a final extension at 72°C for 10 min in an automated thermal cycler (Applied Biosystems). Standard precautions were taken to avoid PCR contamination and no false-positive signal was observed in the negative controls. PCR product purification and sequencing were performed as above.
Cloning of PCR product and sequencing
Purified PCR products were cloned into a pCR2.1-TOPO vector (Invitrogen) according to manufacturer's instructions. The vector was then used to transform the competent Escherichia coli strain DH5α by electroporation. Positive transformants were identified by blue–white screening, and eight colonies were selected for DNA sequencing of the construct using the M13 forward and reverse primers according to the manufacturer's instructions. Sequencing reactions were performed as described above.
Viral genome sequences were obtained using strategies we had previously used for other RNA viruses –. RNA extraction and cDNA generation were performed as described above. PCR primers were designed by targeting conserved regions, which were identified from the multiple alignment of genomes of related SaVs, as primer-binding sites. Additional primers for subsequent rounds of PCR were designed based on the results of earlier rounds of genome sequencing. The complete set of primer sequences is available from the authors upon request. The 5′ and 3′ ends of the viral genomes were sequenced following amplification of the segments by rapid amplification of cDNA ends, which was performed using the SMARTer RACE cDNA Amplification kit (Clontech) according to the manufacturer's instructions.
Phylogenetic and genome analysis
ORFs were located using the ORF Finder tool at NCBI (http://www.ncbi.nlm.nih.gov/projects/gorf/). Annotation of the predicted proteins was performed by BLAST sequence similarity search against annotations in the NCBI RefSeq database. Multiple sequence alignments were constructed using MUSCLE version 3.8.31 , and phylogenetic informative regions were extracted using BMGE . Maximum-likelihood phylogenetic trees were constructed using PhyML version 3 , under the best-fit protein evolution model as selected by ProtTest 3 . Branch support values were estimated by calculation of SH-like aLRT values . Recombination detection was performed by analysing the translated sequences of ORF1 and ORF2 separately using the Recombination Analysis Tool .
Codon usage and compositional bias analysis
The full-length ORF1 and ORF2 coding sequences were extracted from selected SaV genomes and concatenated for codon usage analysis (see Table 4 for the list of included genome sequences). Codon usage and summary statistic of codon usage bias (Nc and Nc′) were calculated using the INCA package version 2.1 , where Nc is the effective number of codons in the coding regions of the genome , and Nc′ is the effective number of codons adjusted for background nucleotide composition . For CpG dinucleotide bias analysis, odds ratio of CpG and GpC dinucleotides and the CpG/GpC ratio were calculated as described in previous studies , . Odds ratio of ≤0.78 indicates significant suppression of the dinucleotide, same as the interpretation criteria of previous studies. Symmetrized nucleotide frequencies and dinucleotide odds ratio were not considered in the present study, as SaV genomes consist of positive-sense ssRNA only. To investigate the possible effects of cytosine deamination, genomic GC skew, which is the ratio (G-C)/(G+C), was calculated for the SaV genomes. The strength of the GC skew had been suggested to correlate with the degree of cytosine deamination , , .
Geographical distribution of the bat specimens in the present study.
Neighbor-joining tree of partial RdRp nucleotide sequences. The tree was constructed based on the length of the nucleotide sequence in the RdRp region obtained from bat SaV/TLC72.
Photo showing Hipposideros pomona is in the drainage at Tai Lam – Shek Kong.
Photo showing Hipposideros pomona possesses a small nose leaf.
Epidemiology of the tested bat specimens.
We thank Director Alan Chi-Kong Wong, Siu-Fai Leung, Chik-Chuen Lay, Ping-Man So and K. F. Chan [HKSAR Department of Agriculture, Fisheries, and Conservation (AFCD)] and Hong Kong Police Force for facilitation and support; Chung-Tong Shek, Cynthia S. M. Chan and Joseph W. K. So from AFCD for their excellent technical assistance and collection of animal specimens. Photos and ecological information of the bats roosting sites are reproduced with kind permission from AFCD. Views expressed in this paper are those of the authors only, and may not represent the opinion of the AFCD or the Government of the HKSAR. We are grateful for the generous support of Mr Hui Hoy and Mr Hui Ming in the genomic sequencing platform.
Conceived and designed the experiments: HT SKPL PCYW. Performed the experiments: WMC KSML. Analyzed the data: HT WMC. Wrote the paper: HT WMC. Critical revision for important intellectual content: KYY SKPL PCYW.
- 1. Smiley JR, Chang KO, Hayes J, Vinje J, Saif LJ (2002) Characterization of an enteropathogenic bovine calicivirus representing a potentially new calicivirus genus. Journal of virology 76: 10089–10098.JR SmileyKO ChangJ. HayesJ. VinjeLJ Saif2002Characterization of an enteropathogenic bovine calicivirus representing a potentially new calicivirus genus.Journal of virology761008910098
- 2. Di Martino B, Di Profio F, Martella V, Ceci C, Marsilio F (2011) Evidence for recombination in neboviruses. Veterinary microbiology 153: 367–372.B. Di MartinoF. Di ProfioV. MartellaC. CeciF. Marsilio2011Evidence for recombination in neboviruses.Veterinary microbiology153367372
- 3. Kaplon J, Guenau E, Asdrubal P, Pothier P, Ambert-Balay K (2011) Possible novel nebovirus genotype in cattle, France. Emerging infectious diseases 17: 1120–1123.J. KaplonE. GuenauP. AsdrubalP. PothierK. Ambert-Balay2011Possible novel nebovirus genotype in cattle, France.Emerging infectious diseases1711201123
- 4. Farkas T, Sestak K, Wei C, Jiang X (2008) Characterization of a rhesus monkey calicivirus representing a new genus of Caliciviridae. Journal of virology 82: 5408–5416.T. FarkasK. SestakC. WeiX. Jiang2008Characterization of a rhesus monkey calicivirus representing a new genus of Caliciviridae.Journal of virology8254085416
- 5. Farkas T, Dufour J, Jiang X, Sestak K (2010) Detection of norovirus-, sapovirus- and rhesus enteric calicivirus-specific antibodies in captive juvenile macaques. The Journal of general virology 91: 734–738.T. FarkasJ. DufourX. JiangK. Sestak2010Detection of norovirus-, sapovirus- and rhesus enteric calicivirus-specific antibodies in captive juvenile macaques.The Journal of general virology91734738
- 6. L'Homme Y, Sansregret R, Plante-Fortier E, Lamontagne AM, Ouardani M, et al. (2009) Genomic characterization of swine caliciviruses representing a new genus of Caliciviridae. Virus genes 39: 66–75.Y. L'HommeR. SansregretE. Plante-FortierAM LamontagneM. Ouardani2009Genomic characterization of swine caliciviruses representing a new genus of Caliciviridae.Virus genes396675
- 7. Wolf S, Reetz J, Otto P (2011) Genetic characterization of a novel calicivirus from a chicken. Archives of virology 156: 1143–1150.S. WolfJ. ReetzP. Otto2011Genetic characterization of a novel calicivirus from a chicken.Archives of virology15611431150
- 8. Chiba S, Sakuma Y, Kogasaka R, Akihara M, Horino K, et al. (1979) An outbreak of gastroenteritis associated with calicivirus in an infant home. Journal of medical virology 4: 249–254.S. ChibaY. SakumaR. KogasakaM. AkiharaK. Horino1979An outbreak of gastroenteritis associated with calicivirus in an infant home.Journal of medical virology4249254
- 9. Clarke IN, Lambden PR (2000) Organization and expression of calicivirus genes. The Journal of infectious diseases 181: Suppl 2S309–316.IN ClarkePR Lambden2000Organization and expression of calicivirus genes.The Journal of infectious diseases181Suppl 2S309316
- 10. Atmar RL, Estes MK (2001) Diagnosis of noncultivatable gastroenteritis viruses, the human caliciviruses. Clinical microbiology reviews 14: 15–37.RL AtmarMK Estes2001Diagnosis of noncultivatable gastroenteritis viruses, the human caliciviruses.Clinical microbiology reviews141537
- 11. L'Homme Y, Brassard J, Ouardani M, Gagne MJ (2010) Characterization of novel porcine sapoviruses. Archives of virology 155: 839–846.Y. L'HommeJ. BrassardM. OuardaniMJ Gagne2010Characterization of novel porcine sapoviruses.Archives of virology155839846
- 12. Bank-Wolf BR, Konig M, Thiel HJ (2010) Zoonotic aspects of infections with noroviruses and sapoviruses. Veterinary microbiology 140: 204–212.BR Bank-WolfM. KonigHJ Thiel2010Zoonotic aspects of infections with noroviruses and sapoviruses.Veterinary microbiology140204212
- 13. Rockx B, De Wit M, Vennema H, Vinje J, De Bruin E, et al. (2002) Natural history of human calicivirus infection: a prospective cohort study. Clinical infectious diseases : an official publication of the Infectious Diseases Society of America 35: 246–253.B. RockxM. De WitH. VennemaJ. VinjeE. De Bruin2002Natural history of human calicivirus infection: a prospective cohort study.Clinical infectious diseases : an official publication of the Infectious Diseases Society of America35246253
- 14. Chiba S, Nakata S, Numata-Kinoshita K, Honma S (2000) Sapporo virus: history and recent findings. The Journal of infectious diseases 181: Suppl 2S303–308.S. ChibaS. NakataK. Numata-KinoshitaS. Honma2000Sapporo virus: history and recent findings.The Journal of infectious diseases181Suppl 2S303308
- 15. Percival S, Chalmers R, Embrey M, Hunter P, Sellwood J, et al. (2004) Norovirus and sapovirus. Microbiology of Waterborne Diseases. London: Academic Press. pp. 433–444.S. PercivalR. ChalmersM. EmbreyP. HunterJ. Sellwood2004Norovirus and sapovirus. Microbiology of Waterborne DiseasesLondonAcademic Press433444
- 16. Logan C, O'Sullivan N (2007) Detection of viral agents of gastroenteritis: Norovirus, Sapovirus and Astrovirus. Future Virology 3: 61–70.C. LoganN. O'Sullivan2007Detection of viral agents of gastroenteritis: Norovirus, Sapovirus and Astrovirus.Future Virology36170
- 17. Svraka S, Vennema H, van der Veer B, Hedlund KO, Thorhagen M, et al. (2010) Epidemiology and genotype analysis of emerging sapovirus-associated infections across Europe. Journal of clinical microbiology 48: 2191–2198.S. SvrakaH. VennemaB. van der VeerKO HedlundM. Thorhagen2010Epidemiology and genotype analysis of emerging sapovirus-associated infections across Europe.Journal of clinical microbiology4821912198
- 18. Pang XL, Lee BE, Tyrrell GJ, Preiksaitis JK (2009) Epidemiology and genotype analysis of sapovirus associated with gastroenteritis outbreaks in Alberta, Canada: 2004–2007. The Journal of infectious diseases 199: 547–551.XL PangBE LeeGJ TyrrellJK Preiksaitis2009Epidemiology and genotype analysis of sapovirus associated with gastroenteritis outbreaks in Alberta, Canada: 2004–2007.The Journal of infectious diseases199547551
- 19. Tam CC, Rodrigues LC, Viviani L, Dodds JP, Evans MR, et al. (2011) Longitudinal study of infectious intestinal disease in the UK (IID2 study): incidence in the community and presenting to general practice. Gut. CC TamLC RodriguesL. VivianiJP DoddsMR Evans2011Longitudinal study of infectious intestinal disease in the UK (IID2 study): incidence in the community and presenting to general practice.Gut
- 20. Khamrin P, Maneekarn N, Peerakome S, Tonusin S, Malasao R, et al. (2007) Genetic diversity of noroviruses and sapoviruses in children hospitalized with acute gastroenteritis in Chiang Mai, Thailand. Journal of medical virology 79: 1921–1926.P. KhamrinN. ManeekarnS. PeerakomeS. TonusinR. Malasao2007Genetic diversity of noroviruses and sapoviruses in children hospitalized with acute gastroenteritis in Chiang Mai, Thailand.Journal of medical virology7919211926
- 21. Monica B, Ramani S, Banerjee I, Primrose B, Iturriza-Gomara M, et al. (2007) Human caliciviruses in symptomatic and asymptomatic infections in children in Vellore, South India. Journal of medical virology 79: 544–551.B. MonicaS. RamaniI. BanerjeeB. PrimroseM. Iturriza-Gomara2007Human caliciviruses in symptomatic and asymptomatic infections in children in Vellore, South India.Journal of medical virology79544551
- 22. Martella V, Lorusso E, Decaro N, Elia G, Radogna A, et al. (2008) Detection and molecular characterization of a canine norovirus. Emerging infectious diseases 14: 1306–1308.V. MartellaE. LorussoN. DecaroG. EliaA. Radogna2008Detection and molecular characterization of a canine norovirus.Emerging infectious diseases1413061308
- 23. Li L, Pesavento PA, Shan T, Leutenegger CM, Wang C, et al. (2011) Viruses in diarrhoeic dogs include novel kobuviruses and sapoviruses. The Journal of general virology 92: 2534–2541.L. LiPA PesaventoT. ShanCM LeuteneggerC. Wang2011Viruses in diarrhoeic dogs include novel kobuviruses and sapoviruses.The Journal of general virology9225342541
- 24. Martella V, Campolo M, Lorusso E, Cavicchio P, Camero M, et al. (2007) Norovirus in captive lion cub (Panthera leo). Emerging infectious diseases 13: 1071–1073.V. MartellaM. CampoloE. LorussoP. CavicchioM. Camero2007Norovirus in captive lion cub (Panthera leo).Emerging infectious diseases1310711073
- 25. Scipioni A, Mauroy A, Vinje J, Thiry E (2008) Animal noroviruses. Veterinary journal 178: 32–45.A. ScipioniA. MauroyJ. VinjeE. Thiry2008Animal noroviruses.Veterinary journal1783245
- 26. Guo M, Evermann JF, Saif LJ (2001) Detection and molecular characterization of cultivable caliciviruses from clinically normal mink and enteric caliciviruses associated with diarrhea in mink. Archives of virology 146: 479–493.M. GuoJF EvermannLJ Saif2001Detection and molecular characterization of cultivable caliciviruses from clinically normal mink and enteric caliciviruses associated with diarrhea in mink.Archives of virology146479493
- 27. Li L, Shan T, Wang C, Cote C, Kolman J, et al. (2011) The fecal viral flora of California sea lions. Journal of virology 85: 9909–9917.L. LiT. ShanC. WangC. CoteJ. Kolman2011The fecal viral flora of California sea lions.Journal of virology8599099917
- 28. Wilson DE, Reeder DM (2005) Mammal species of the world : a taxonomic and geographic reference. Baltimore: Johns Hopkins University Press. DE WilsonDM Reeder2005Mammal species of the world : a taxonomic and geographic referenceBaltimoreJohns Hopkins University Press
- 29. Lau SK, Woo PC, Lai KK, Huang Y, Yip CC, et al. (2011) Complete genome analysis of three novel picornaviruses from diverse bat species. Journal of virology 85: 8819–8828.SK LauPC WooKK LaiY. HuangCC Yip2011Complete genome analysis of three novel picornaviruses from diverse bat species.Journal of virology8588198828
- 30. Lau SK, Poon RW, Wong BH, Wang M, Huang Y, et al. (2010) Coexistence of different genotypes in the same bat and serological characterization of Rousettus bat coronavirus HKU9 belonging to a novel Betacoronavirus subgroup. Journal of virology 84: 11385–11394.SK LauRW PoonBH WongM. WangY. Huang2010Coexistence of different genotypes in the same bat and serological characterization of Rousettus bat coronavirus HKU9 belonging to a novel Betacoronavirus subgroup.Journal of virology841138511394
- 31. Lau SK, Woo PC, Wong BH, Wong AY, Tsoi HW, et al. (2010) Identification and complete genome analysis of three novel paramyxoviruses, Tuhoko virus 1, 2 and 3, in fruit bats from China. Virology 404: 106–116.SK LauPC WooBH WongAY WongHW Tsoi2010Identification and complete genome analysis of three novel paramyxoviruses, Tuhoko virus 1, 2 and 3, in fruit bats from China.Virology404106116
- 32. Lau SK, Woo PC, Li KS, Huang Y, Wang M, et al. (2007) Complete genome sequence of bat coronavirus HKU2 from Chinese horseshoe bats revealed a much smaller spike gene with a different evolutionary lineage from the rest of the genome. Virology 367: 428–439.SK LauPC WooKS LiY. HuangM. Wang2007Complete genome sequence of bat coronavirus HKU2 from Chinese horseshoe bats revealed a much smaller spike gene with a different evolutionary lineage from the rest of the genome.Virology367428439
- 33. Woo PC, Lau SK, Li KS, Poon RW, Wong BH, et al. (2006) Molecular diversity of coronaviruses in bats. Virology 351: 180–187.PC WooSK LauKS LiRW PoonBH Wong2006Molecular diversity of coronaviruses in bats.Virology351180187
- 34. Lau SK, Li KS, Huang Y, Shek CT, Tse H, et al. (2010) Ecoepidemiology and complete genome comparison of different strains of severe acute respiratory syndrome-related Rhinolophus bat coronavirus in China reveal bats as a reservoir for acute, self-limiting infection that allows recombination events. Journal of virology 84: 2808–2819.SK LauKS LiY. HuangCT ShekH. Tse2010Ecoepidemiology and complete genome comparison of different strains of severe acute respiratory syndrome-related Rhinolophus bat coronavirus in China reveal bats as a reservoir for acute, self-limiting infection that allows recombination events.Journal of virology8428082819
- 35. Lau SK, Woo PC, Li KS, Huang Y, Tsoi HW, et al. (2005) Severe acute respiratory syndrome coronavirus-like virus in Chinese horseshoe bats. Proc Natl Acad Sci U S A 102: 14040–14045.SK LauPC WooKS LiY. HuangHW Tsoi2005Severe acute respiratory syndrome coronavirus-like virus in Chinese horseshoe bats.Proc Natl Acad Sci U S A1021404014045
- 36. Wong S, Lau S, Woo P, Yuen KY (2007) Bats as a continuing source of emerging infections in humans. Reviews in medical virology 17: 67–91.S. WongS. LauP. WooKY Yuen2007Bats as a continuing source of emerging infections in humans.Reviews in medical virology176791
- 37. Oka T, Katayama K, Ogawa S, Hansman GS, Kageyama T, et al. (2005) Proteolytic processing of sapovirus ORF1 polyprotein. Journal of virology 79: 7283–7290.T. OkaK. KatayamaS. OgawaGS HansmanT. Kageyama2005Proteolytic processing of sapovirus ORF1 polyprotein.Journal of virology7972837290
- 38. Oka T, Yamamoto M, Katayama K, Hansman GS, Ogawa S, et al. (2006) Identification of the cleavage sites of sapovirus open reading frame 1 polyprotein. The Journal of general virology 87: 3329–3338.T. OkaM. YamamotoK. KatayamaGS HansmanS. Ogawa2006Identification of the cleavage sites of sapovirus open reading frame 1 polyprotein.The Journal of general virology8733293338
- 39. Hansman GS, Oka T, Takeda N (2008) Sapovirus-like particles derived from polyprotein. Virus research 137: 261–265.GS HansmanT. OkaN. Takeda2008Sapovirus-like particles derived from polyprotein.Virus research137261265
- 40. Rima BK, McFerran NV (1997) Dinucleotide and stop codon frequencies in single-stranded RNA viruses. The Journal of general virology 78(Pt 11): 2859–2870.BK RimaNV McFerran1997Dinucleotide and stop codon frequencies in single-stranded RNA viruses.The Journal of general virology78Pt 1128592870
- 41. Karlin S, Doerfler W, Cardon LR (1994) Why is CpG suppressed in the genomes of virtually all small eukaryotic viruses but not in those of large eukaryotic viruses? Journal of virology 68: 2889–2897.S. KarlinW. DoerflerLR Cardon1994Why is CpG suppressed in the genomes of virtually all small eukaryotic viruses but not in those of large eukaryotic viruses?Journal of virology6828892897
- 42. Berke T, Matson DO (2000) Reclassification of the Caliciviridae into distinct genera and exclusion of hepatitis E virus from the family on the basis of comparative phylogenetic analysis. Archives of virology 145: 1421–1436.T. BerkeDO Matson2000Reclassification of the Caliciviridae into distinct genera and exclusion of hepatitis E virus from the family on the basis of comparative phylogenetic analysis.Archives of virology14514211436
- 43. Kapoor A, Simmonds P, Lipkin WI, Zaidi S, Delwart E (2010) Use of nucleotide composition analysis to infer hosts for three novel picorna-like viruses. Journal of virology 84: 10322–10328.A. KapoorP. SimmondsWI LipkinS. ZaidiE. Delwart2010Use of nucleotide composition analysis to infer hosts for three novel picorna-like viruses.Journal of virology841032210328
- 44. Greenbaum BD, Rabadan R, Levine AJ (2009) Patterns of oligonucleotide sequences in viral and host cell RNA identify mediators of the host innate immune system. PloS one 4: e5969.BD GreenbaumR. RabadanAJ Levine2009Patterns of oligonucleotide sequences in viral and host cell RNA identify mediators of the host innate immune system.PloS one4e5969
- 45. ElHefnawi M, Alaidi O, Mohamed N, Kamar M, El-Azab I, et al. (2011) Identification of novel conserved functional motifs across most Influenza A viral strains. Virology journal 8: 44.M. ElHefnawiO. AlaidiN. MohamedM. KamarI. El-Azab2011Identification of novel conserved functional motifs across most Influenza A viral strains.Virology journal844
- 46. Lobo FP, Mota BE, Pena SD, Azevedo V, Macedo AM, et al. (2009) Virus-host coevolution: common patterns of nucleotide motif usage in Flaviviridae and their hosts. PloS one 4: e6282.FP LoboBE MotaSD PenaV. AzevedoAM Macedo2009Virus-host coevolution: common patterns of nucleotide motif usage in Flaviviridae and their hosts.PloS one4e6282
- 47. Shackelton LA, Parrish CR, Holmes EC (2006) Evolutionary basis of codon usage and nucleotide composition bias in vertebrate DNA viruses. Journal of molecular evolution 62: 551–563.LA ShackeltonCR ParrishEC Holmes2006Evolutionary basis of codon usage and nucleotide composition bias in vertebrate DNA viruses.Journal of molecular evolution62551563
- 48. Tse H, Cai JJ, Tsoi HW, Lam EP, Yuen KY (2010) Natural selection retains overrepresented out-of-frame stop codons against frameshift peptides in prokaryotes. BMC genomics 11: 491.H. TseJJ CaiHW TsoiEP LamKY Yuen2010Natural selection retains overrepresented out-of-frame stop codons against frameshift peptides in prokaryotes.BMC genomics11491
- 49. Kim KH, Bae JW (2011) Amplification Methods Bias Metagenomic Libraries of Uncultured Single-Stranded and Double-Stranded DNA Viruses. Applied and environmental microbiology 77: 7763–7768.KH KimJW Bae2011Amplification Methods Bias Metagenomic Libraries of Uncultured Single-Stranded and Double-Stranded DNA Viruses.Applied and environmental microbiology7777637768
- 50. Woo PC, Lau SK, Lam CS, Lai KK, Huang Y, et al. (2009) Comparative analysis of complete genome sequences of three avian coronaviruses reveals a novel group 3c coronavirus. Journal of virology 83: 908–917.PC WooSK LauCS LamKK LaiY. Huang2009Comparative analysis of complete genome sequences of three avian coronaviruses reveals a novel group 3c coronavirus.Journal of virology83908917
- 51. Woo PC, Lau SK, Huang Y, Lam CS, Poon RW, et al. (2010) Comparative analysis of six genome sequences of three novel picornaviruses, turdiviruses 1, 2 and 3, in dead wild birds, and proposal of two novel genera, Orthoturdivirus and Paraturdivirus, in the family Picornaviridae. The Journal of general virology 91: 2433–2448.PC WooSK LauY. HuangCS LamRW Poon2010Comparative analysis of six genome sequences of three novel picornaviruses, turdiviruses 1, 2 and 3, in dead wild birds, and proposal of two novel genera, Orthoturdivirus and Paraturdivirus, in the family Picornaviridae.The Journal of general virology9124332448
- 52. Tse H, Chan WM, Tsoi HW, Fan RY, Lau CC, et al. (2011) Rediscovery and genomic characterization of bovine astroviruses. The Journal of general virology 92: 1888–1898.H. TseWM ChanHW TsoiRY FanCC Lau2011Rediscovery and genomic characterization of bovine astroviruses.The Journal of general virology9218881898
- 53. Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic acids research 32: 1792–1797.RC Edgar2004MUSCLE: multiple sequence alignment with high accuracy and high throughput.Nucleic acids research3217921797
- 54. Criscuolo A, Gribaldo S (2010) BMGE (Block Mapping and Gathering with Entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments. BMC evolutionary biology 10: 210.A. CriscuoloS. Gribaldo2010BMGE (Block Mapping and Gathering with Entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments.BMC evolutionary biology10210
- 55. Guindon S, Gascuel O (2003) A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Systematic biology 52: 696–704.S. GuindonO. Gascuel2003A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood.Systematic biology52696704
- 56. Darriba D, Taboada GL, Doallo R, Posada D (2011) ProtTest 3: fast selection of best-fit models of protein evolution. Bioinformatics 27: 1164–1165.D. DarribaGL TaboadaR. DoalloD. Posada2011ProtTest 3: fast selection of best-fit models of protein evolution.Bioinformatics2711641165
- 57. Anisimova M, Gascuel O (2006) Approximate likelihood-ratio test for branches: A fast, accurate, and powerful alternative. Systematic biology 55: 539–552.M. AnisimovaO. Gascuel2006Approximate likelihood-ratio test for branches: A fast, accurate, and powerful alternative.Systematic biology55539552
- 58. Etherington GJ, Dicks J, Roberts IN (2005) Recombination Analysis Tool (RAT): a program for the high-throughput detection of recombination. Bioinformatics 21: 278–281.GJ EtheringtonJ. DicksIN Roberts2005Recombination Analysis Tool (RAT): a program for the high-throughput detection of recombination.Bioinformatics21278281
- 59. Supek F, Vlahovicek K (2004) INCA: synonymous codon usage analysis and clustering by means of self-organizing map. Bioinformatics 20: 2329–2330.F. SupekK. Vlahovicek2004INCA: synonymous codon usage analysis and clustering by means of self-organizing map.Bioinformatics2023292330
- 60. Wright F (1990) The ‘effective number of codons’ used in a gene. Gene 87: 23–29.F. Wright1990The ‘effective number of codons’ used in a gene.Gene872329
- 61. Novembre JA (2002) Accounting for background nucleotide composition when measuring codon usage bias. Molecular biology and evolution 19: 1390–1394.JA Novembre2002Accounting for background nucleotide composition when measuring codon usage bias.Molecular biology and evolution1913901394
- 62. Cardon LR, Burge C, Clayton DA, Karlin S (1994) Pervasive CpG suppression in animal mitochondrial genomes. Proceedings of the National Academy of Sciences of the United States of America 91: 3799–3803.LR CardonC. BurgeDA ClaytonS. Karlin1994Pervasive CpG suppression in animal mitochondrial genomes.Proceedings of the National Academy of Sciences of the United States of America9137993803