Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Synonymous Codon Usage in TTSuV2: Analysis and Comparison with TTSuV1

  • Zhicheng Zhang ,

    Contributed equally to this work with: Zhicheng Zhang, Wei Dai

    Affiliation Department of Animal Science and Technology, Jinling Institute of Technology, Nanjing, China

  • Wei Dai ,

    Contributed equally to this work with: Zhicheng Zhang, Wei Dai

    Affiliation Key Laboratory of Zoonoses of Anhui Province, Anhui Agricultural University, Hefei, China

  • Dingzhen Dai

    Affiliation Department of Animal Science and Technology, Jinling Institute of Technology, Nanjing, China

Synonymous Codon Usage in TTSuV2: Analysis and Comparison with TTSuV1

  • Zhicheng Zhang, 
  • Wei Dai, 
  • Dingzhen Dai


Two species of the DNA virus Torque teno sus virus (TTSuV), TTSuV1 and TTSuV2, have become widely distributed in pig-farming countries in recent years. In this study, we performed a comprehensive analysis of synonymous codon usage bias in 41 available TTSuV2 coding sequences (CDS), and compared the codon usage patterns of TTSuV2 and TTSuV1. TTSuV codon usage patterns were found to be phylogenetically conserved. Values for the effective number of codons (ENC) indicated that the overall extent of codon usage bias in both TTSuV2 and TTSuV1 was not significant, the most frequently occurring codons had an A or C at the third codon position. Correspondence analysis (COA) was performed and TTSuV2 and TTSuV1 sequences were located in different quadrants of the first two major axes. A plot of the ENC revealed that compositional constraint was the major factor determining the codon usage bias for TTSuV2. In addition, hierarchical cluster analysis of 41 TTSuV2 isolates based on relative synonymous codon usage (RSCU) values suggested that there was no association between geographic distribution and codon bias of TTSuV2 sequences. Finally, the comparison of RSCU for TTSuV2, TTSuV1 and the corresponding host sequence indicated that the codon usage pattern of TTSuV2 was similar to that of TTSuV1. However the similarity was low for each virus and its host. These conclusions provide important insight into the synonymous codon usage pattern of TTSuV2, as well as better understangding of the molecular evolution of TTSuV2 genomes.


It is well known that the 64 codons of the genetic code encode the 20 standard amino acids as well as three translation termination signals (UAA, UAG, UGA). Each amino acid is encoded with at least one codon (e.g., Met and Try); however, due to the degeneracy of the genetic code, some amino acids are encoded with up to six codons (e.g, Leu, Ser and Arg). Codons encoding the same amino acid are referred to as synonymous codons. Studies have indicated that synonymous codon usage is non-random and species-specific [1]. Some synonymous codons are more frequent than others both within and between genes, and this phenomenon is termed synonymous codon usage bias [2]. In general, genome dynamics, primarily mutation pressure, facilitate the evolution of novel viruses and strains and contribute to adaption to environment and host [3]. Hence, codon usage variation is considered to be an indicator of the type of force that influences genome evolution. Investigation of codon bias and the forces that influence it provides insights into the fundamental mechanisms of viral evolution. Thus, understanding codon bias is essential to understand the interplay between a virus and its host.

It was well established that mutational pressure and natural selection [4,5] were presented as the two major factors accounting for codon usage variation in mammalian, protozoan and endosymbiotic bacterial genes [6]. In their investigate of codon usage variation, Shackelton et al (2006) found that codon usage bias was strongly correlated with overall genomic GC content, indicating that compositional constraint under mutation pressure rather than natural selection was the main factor for specific codons [7]. Naya et al (2001) examined the Chlamydomonas reinhardtii genome, which has a high GC content, and found no evidence that base constraint under mutation pressure was responsible for determining the codon usage pattern [8]. Recently, it was also reported that codon usage variation is related to gene function and length [9,10], DNA replication and selective transcription [11], protein secondary structure [12,13] and environmental factors [14].

Torque teno virus (TTV) is a small, single-stranded, negative-sense non-enveloped, circular DNA virus [15], which has been classified as a member of the recently discovered Anelloviridae family [16]. It was first identified in a Japanese patient with post-transfusion hepatitis of unknown aetiology in 1997 [17]. Subsequently, TTV has been detected in humans, chimpanzees, poultry, swine, cattle, sheep, cats and dogs [18,19]. TTV was first detected in swine in 1999 and two genetically distinct species, Torque teno sus virus 1 (TTSuV1) and 2 (TTSuV2), have been identified based on the low sequence identity between the two variants [20].

Recently, Torque teno sus virus (TTSuV) infection of pigs has become widespread in many countries, including the USA, Canada, Spain, Germany, China, Japan, Korea and Brazil [21]. Despite the fact that TTV infection in humans is not yet directly associated with any disease [22], TTSuVs have been shown to be involved in co-infection with other diseases, including the experimental induction of porcine dermatitis and nephropathy syndrome in combination with porcine reproductive and respiratory syndrome virus infection [23] and post-weaning multisystemic wasting syndrome (PMWS) in combination with porcine circovirus type 2 (PCV2) infection in a gnotobiotic pig model [24]. Moreover, Kekarainen et al. (2006) found that TTSuV2 was detected at a significantly higher rate in PMWS pigs than in healthy pigs [25]. Other research comfirmed that the replication of TTSuV2, but not of TTSuV1, was up-regulated in the pigs with PMWS [26,27]. This result was supported by Taira et al (2009), who examined animals suspected of infection with PMWS and porcine respiratory disease complex [28]. However, due to the limited number of animal species examined and the lack of information about viral cell and tissue tropism, the characteristics and evolution of TTSuV are not fully understood.

We previously investigated synonymous codon usage in TTSuV1 [29] and began to suspect that this method might be important for elucidating the molecular mechanism and evolutionary process of TTSuV. In this study, synonymous codon usage bias was analyzed in the coding sequences (CDS) from the 41 available TTSuV2 genomes, and the codon usage patterns of TTSuV2 and TTSuV1 were compared.

Materials and Methods

Sequences data

Complete genome sequences from 41 TTSuV2 isolates were downloaded from the National Center for Biotechnology Information ( Each TTSuV2 CDS was analyzed using DNAStar version 7.1 (DNAStar, Madison, WI). Table 1 summarizes relevant details about these viral sequences.

No.Accession no.NameIsolationYearLength(bp)

Table 1. 41 complete TTSuV2 genes used in this study.

Download CSV

Recombination analysis

The Recombination Analysis Tool (RAT, was used to detect recombination events in TTSuV2 and TTSuV1 sequences. Recombination is a prevailing drive that shapes genome evolution, and it is believed to influence the efficacy of natural selection on codon usage [30]. RAT uses a distance-method-based algorithm to perform pair-wise comparisons with multiple sequence alignments (DNA or protein). The RAT graph represents the genetic distance of each sequence in the alignment to a reference sequence (Y-axis) for each position in the sequence (X-axis). A putative recombination event is detected when the lines representing two sequences intersect in the graph [31].

Compositional properties measures

General nucleotide composition (A%, C%, T% and G%) and nucleotide composition at the third position of each codon (A3S%, C3S%, T3S% and G3S%) were analyzed for TTSuV2 CDSs using Molecular Evolutionary Genetics Analysis (MEGA) software version 5.0 [32]. The GC and GC3S index was used to calculate the overall G + C content in the gene sequence and at the third position of synonymous codon (excluding Met, Trp and termination codons).

Measure of synonymous codon usage

Relative synonymous codon usage (RSCU) values and effective number of codons (ENC) values were calculated using CodonW software version 1.4 ( The RSCU is defined as the ratio between the usage frequency of one codon in the gene and its expected frequency in the synonymous codon family (i.e., the observed frequency of a codon adjusted for amino acid composition). RSCU value is calculated according to the following published equation [33]:


Xij denotes the position of the codon (i) in the CDS for the corresponding amino acid (j). ni denotes the total number of synonymous codons encoding the amino acid at this position. Codons with RSCU values greater than 1.0 exhibit positive codon usage bias, while those with RSCU values less than 1.0 have negative codon usage bias. RSCU values of 1.0 indicate that the codon frequencies are equal or random.

The ENC is the most useful estimator of absolute synonymous codon usage bias [34] and can indicate the degree of synonymous codon bias in a codon family. ENC values range from 20 (only one synonymous codon occurs in the CDS) to 61 (all synonymous codons occur with equal frequency). A gene with an ENC value lower than 35 is generally considered to have significant codon usage bias.

Correspondence analysis

Correspondence analysis (COA), also known as principal component analysis, was performed with CodonW software version 1.4. COA is the most commonly used multivariate statistical analysis method [35]. In this analysis, COA was used to study the major trends in sequence variation and distribute genes along continuous axes according to these trends. Each gene was represented as a 59-dimensional vector, each dimension corresponding to the RSCU value for each sense codon (excluding Met, Trp and termination codons). Major variation trends within this dataset can be determined with the relative inertia: genes were positioned according to the major inertia to determine the major factors affecting codon usage bias in the gene.

Statistical analysis

Correlation analysis was performed to compare the relationship between nucleotide composition and synonymous codon usage pattern using Spearman’s rank correlation analysis method. A phylogenetic tree was constructed by the neighbor-joining method with a bootstrap of 1000 replicates, based on the Clustal W alignment produced with MEGA software version 5. Cluster analysis was performed using the hierarchical cluster method, and the distances between selected sequences were calculated by the Euclidean distance method. All statistical results were analyzed using Student’s t-test, SPSS software version 11.6 for Windows (p > 0.05, no difference; 0.01 < p < 0.05, non-significant difference; p < 0.01, significant difference).


Recombination analysis

Recombination is believed to influence the efficacy of natural selection on codon usage [30]. A single recombinant sequence present in an alignment can seriously influence the branch order and branch length of the trees generated using standard phylogenetic methods [36]. Therefore, it was necessary to exclude any TTSuV2 and TTSuV1 sequences found to be recombinant from further analysis. Recombination analysis of a nucleotide sequence alignment including all 41 TTSuV2 sequences and 29 TTSuV1 sequences was performed using RAT software (Figure 1). The resulting graph provided no evidence for recombination within or between TTSuV2 and TTSuV1 sequences. However, the graph indicated that the sequences diverged at nucleotide position 2282 into branches corresponding to TTSuV2 and TTSuV1.

Figure 1. Recombination analysis of TTSuV2 and TTSuV1 sequences using the RAT.

The colour of the line on the graph is the same as the colour of its sequence name on the left.

The 41 TTSuV2 sequences were further analyzed for codon usage bias and the synonymous codon usage pattern between TTSuV2 and TTSuV1 (previously analyzed) [29] was compared, as described in the following sections.

Compositional properties

The nucleotide content of the TTSuV2 genomes is provided in Table 2. In the CDSs from the 41 genomes, A and G occurred more frequently than C and T. A occurred most frequently at the third codon position (average A3S% = 41.77%) and T occurred the least frequently (average T3S% = 27.67%). The overall nucleotide composition and the composition at the third codon position in TTSuV2 genomes suggest that compositional constraint might be influencing the codon usage pattern of this genome. The GC% of TTSuV2 genomes (42.9% to 46.7%, average 45.1%) is lower than for other vertebrate DNA viruses. The GC3S% ranged from 43.2% to 48.2% with a mean value of 46.2%. Due to this compositional constraint, it was expected that A would occur most frequently at the third codon position in TTSuV2 genomes.


Table 2. Nucleotide content of 41 TTSuV2 genomes (%).

Download CSV

The ENC values of these TTSuV2 genomes were much higher than genomes of other DNA viruses, varying from 55.20 to 58.18 with a mean value of 56.21. This result indicates that codon usage bias is not remarkable in TTSuV2 genomes and is apparently maintained at a stable level.

Codon usage in TTSuV2

The overall RSCU values for the 59 codons in all 41 TTSuV2 genomes indicated that A and C occurred most frequently at the third codon position (i.e., GUA for Val, GCA for Ala, CAA for Gln and AAC for Asn) as shown in Table 3. In addition, the CCU, ACU and UAU codons, encoding Pro, Thr and Tyr, respectively, occurred more frequently than the other synonymous codons for these amino acids. Two codons encoding Arg, CGA and CGC, also occurred more frequently than their synonymous codons. These results support the hypothesis that compositional constraint is a major contributing factor in codon usage pattern in TTSuV2 genomes.


Table 3. RSCU values of codons in TTSuV2, TTSuV1 and swine.a

aThe preferred codons for each amino acid is displayed in bold.
bAA is the abbreviation of Amino Acid.
cSUS is swine.
Download CSV

For TTSuV2 sequences, ENC was plotted against both the GC content at the third synonymous codon position (GC3S%) and the expected ENC values, as determined by CodonW analysis (Figure 2). All actual codon usage indices were lower than expected, although differences were small. In addition, a positive correlation (r = 0.316, 0.01 < p < 0.05) between GC3S and ENC values was found. These results taken together support the conclusion that factors other than compositional constraint under mutation pressure (the major factor accounting for codon usage bias) have influenced TTSuV2 evolution.

Figure 2. Distribution of the ENC values and GC content at synonymous codon third position (GC3S).

The curve indicates the expected codon usage if compositional constraint alone account for codon usage bias.

COA of codon usage

To investigate RSCU variation, COA was performed using the 41 TTSuV2 genomes as a single dataset. As described in the "Materials and methods" section, the distribution of genes on the COA axis was used to identify the source of the variation among a set of multivariate data points. A major trend in the first axis (f1’) accounted for 16.91% of total synonymous codon usage variation, and the second major trend in the second axis (f2’) accounted for 13.72% of the total variation (data not shown).

COA was performed for TTSuV1 and TTSuV2 genomes separately and the first two axes of the plots are shown in Figure 3. Although TTSuV1 and TTSuV2 genes occupied all four quadrants of the rectangular coordinate system, the points were generally separated from each other. This result reveals that variation in codon usage might be one of the factors driving the observed aspect of TTSuV evolution.

Figure 3. Correspondence analysis of codon usage patterns of TTSuV2 and TTSuV1.

Effect of mutational bias on codon usage variation

To explore whether the evolution of codon usage bias in TTSuV2 CDS had been driven by mutation pressure alone or whether translation selection from its host has also contributed, we first compared the correlation between general nucleotide composition (A%, T%, G%, C%, GC%) and nucleotide composition at the third codon position (A3S%, T3S%, G3S%, C3S%, GC3S%) using the Spearman’s rank correlation analysis method (Table 4). A significant positive correlation was observed between A% and A3S% (r = 0.761, p < 0.01), C% and C3S% (r = 0.392, 0.01 < p < 0.05), GC% and GC3S% (r = 0.645, p < 0.01) and significant negative correlation was observed for most of heterogeneous nucleotide comparisons. Taken alone, these results suggest that compositional constraints under mutation pressure determine the codon usage pattern for TTSuV2. However, a significant positive correlation between G% and C3S% (r = 0.434, p < 0.01), GC% and T3S% (r = 0.434, p < 0.01) and no correlation between T% and T3S% (r = 0.175, p > 0.05), G% and G3S% (r = 0.171, p > 0.05) suggest that natural selection from its host might have played an appreciable role in determining the codon usage pattern of this virus.

A3S%T(U)3S %G3S %C3S %GC3S %

Table 4. The correlation analysis between A, T, G, C, GC contents and A3S, T3S, G3S, C3S, GC3S contents in TTSuV2 CDS.a

aValue in this table is the P-value of correlation analysis.
NS, non-significant (p>0.05).
Download CSV

Furthermore, G + C content at the first and second codon positions (GC1% and GC2%) was compared with the G + C content at the third codon position (GC3%). A highly significant correlation was observed between GC1% with GC2% (r = 0.551, p < 0.01), GC3% (r = 0.699, p < 0.01), and GC2% with GC3% (r = 0.490, p < 0.01). Since the effects were present at all codon positions, the results further support the hypothesis that nucleotide constraint under mutation pressure was a main determinant for synonymous codon usage pattern in TTSuV2.

COA was also performed for the first two principle axes (f1’ and f2’) and A%, T%, G%, C%, GC%, A3S%, T3S%, G3S%, C3S%, GC3S% (Table 5). The first principle axis (f1’) exhibited a significant positive correlation with G%, C%, GC%, C3S%, GC3S% and a negative correlation with A%, A3S%. It was interesting to note that, except G3S% (r = –0.357, 0.01 < p <0.05), the second principle axis (f2’) had no correlation with any nucleotide content. These results further support the conclusion that composition constraints under mutational bias is an important factor determining synonymous codon usage pattern in TTSuV2, and but that other factors, such as natural selection, contributed.


Table 5. The correlation analysis between the first two axes and nucleotide contents in TTSuV2 CDS.a

aValue in this table is the P-value of correlation analysis.
NS, non-significant (p>0.05).
Download CSV

Relationship between TTSuV and host codon usage patterns

In the ENC plot (Figure 2), most points were near to and under the expected curve, which suggested that other factors contributed to codon usage bias in addition to mutation pressure. To examine this further, a comparative analysis of RSCU values was performed for TTSuV2, TTSuV1 and swine, the natural host for this virus. We found that the codon usage pattern of TTSuV2 was mostly coincident with that of TTSuV1 and that the similarity between the viruses and the host was low. In particular, except for CCU encoding Pro and UAU encoding Tyr, all the preferentially used codons in TTSuV2 and TTSuV1 had an A or C in the third codon position: UUA for Leu, AUA for Ile, UCA for Ser, CAC for His, GAC for Asp and UGC for Gly (Table 3). In contrast, most frequent codons in swine had a T or A at the third codon position. Although some codons frequent in swine, such as CAC for His, AAA for Lys, GAC for Asp and AAA for Glu, were also frequent in TTSuV2 and TTSuV1, the high frequency codons in swine (CUG for Leu, UCU for Ser, UGU for Cys) were generally low frequency codons in TTSuV2 and TTSuV1. It was worth noting that the similarity to swine was higher for TTSuV1 than it was for TTSuV2. The RSCU values of synonymous codons in TTSuV1 and swine, including GUG for Val, GCU for Ala, CAG for Gln, AAU for Asn, were clearly different than TTSuV2 values. This suggests that TTSuV1 might have adapted to its host under natural selection to some degree for improved translation efficiency and that selection pressure from the host had less effect on codon usage pattern of TTSuV2.

Phylogenetic and cluster analysis

A cluster tree was generated with the RSCU values from all 41 TTSuV2 genomes using a hierarchical cluster method. As shown in Figure 4, the TTSuV2 CDS were divided into three main lineages (I–III). Lineage I comprised two strains isolated from the USA, one from Germany and five from China. Twenty-two strains isolated from Brazil, Spain and China were grouped into Lineage II. Lineage III was comprised of strains isolated from China only. Some genes from different isolates were classified into the same lineage, while others genes from the same isolate were classified into different lineages; thus lineage did not correspond well with geographical distribution.

Figure 4. Cluster tree result of 41 TTSuV2 genes based on hierarchical cluster method.

The phylogenetic analysis of all 41 TTSuV2 (black dots) and 29 TTSuV1 sequences (white dots) was performed to determine the conservation and variation of codon usage pattern within TTSuV lineages (Figure 5). The two major branches of the resulting phylogenetic tree corresponded to TTSuV2 and TTSuV1, and each branch had several minor branches. Thus, phylogenetic analysis of the two viruses did not reveal correlations between sequence differences and geographical distribution.

Figure 5. Phylogenetic tree of 41 TTSuV2 sequences and 29 TTSuV1 sequences.

● represents TTSuV2 and ○ represents TTSuV1.


TTSuV is an emerging small DNA virus, widely distributed in pig-farming countries. Although reports implicate TTSuV in co-infection with other diseases, in depth studies on molecular characteristics and pathogenic mechanism are lacking [37,38]. Synonymous codon usage is a well established technique for analyzing genetic information from viral genomes. Most codon usage studies have focused on higher organisms or microorganisms with large genomes and viruses that pose a great threat to human health, such as human immunodeficiency virus, human bocavirus [39], hepatitis virus [40] and Influenza A virus [41]. Results from analyzing codon usage bias in TTSuV genomes are expected to contribute to the knowledge of the characteristics and molecular evolution of this virus. This report furthers our investigation of synonymous codon usage variation in TTSuV1 and provides the first analysis of TTSuV2.

Recombination is an important event in viral evolution and epidemiology [42]. It is interesting to note that recombinant viruses appear to be highly pathogenic, suggesting that recombination events either preserve or increase the pathogenicity of the original strains. Various studies have demonstrated that natural inter- and intra-genotypic recombination occurs frequently in viruses, as shown for highly pathogenic porcine reproductive and respiratory syndrome viruses [43], PCV2 [44], humane enterovirus 71 [45], and rabbit haemorrhagic disease virus [46]. Thus, before analyzing codon usage bias for TTSuV2, we first conducted recombination analysis of 41 TTSuV2 sequences and 29 TTSuV1 sequences, and found no evidence for recombination between the two viruses (Figure 1).

In this study, we analyzed synonymous codon usage bias in TTSuV2 CDS, as well as the relationship between codon usage patterns of TTSuV2 and TTSuV1. Most frequent codons in both TTSuV2 and TTSuV1 had A or C at the third codon position. Mean ENC values for H5N1 influenza A virus [47], severe acute respiratory syndrome [48] and human bocavirus [39], reported as 50.91, 48.99 and 44.45, respectively, are lower than the ENC values for TTSuV2 and TTSuV1 (56.21 and 56.46, respectively), indicating a relatively low codon usage bias for these two viruses. Codon usage patterns for TTSuV2 and TTSuV1 were remarkably similar. In addition, no significant relationship was found between the codon usage pattern of TTSuV2 and its host; although TTSuV1 codon usage was comparatively more similar to swine than that of TTSuV2 (Table 3). This observation might be the result of genome composition evolution and dynamic processes of mutation and selection that enabled the TTSuV1 virus to escape the antiviral cell responses and adapt its codon usage to its host environment [49].

In this study, nucleotide frequency at the third codon position of synonymous codons correlated to general composition for some codons but not for others (Table 4). The GC content was similar at all codon positions in TTSuV2 genomes, presumably as a result of mutational pressure. In addition, the general correlation between codon usage bias and composition constraint suggest that mutational pressure was an important factor determining codon usage in TTSuV2, as seen in the highly significant correlation between GC1%, GC2% and GC3% (p < 0.01), and remarkable correlation between f1’ values with respect to A%, G%, C%, GC%, A3S%, G3S%, GC3S% (p<0.01) (Table 5). Furthermore, in all ENC plots, values for TTSuV2 genomes were below the expected curve (Figure 1). Taken together, the above evidence indicates that compositional constraint under mutational pressure significantly contributed to the variation of synonymous codon usage in TTSuV2 genomes.

Natural selection has been shown to influence the synonymous codon usage pattern in viruses [50] and this conclusions is supported by this study. First, although the GC3S% for the TTSuV2 genome is lower than average (46.20%), the most frequent codons had A or C at the third codon position (Table 3). Second, a significant positive correlation existed between G% and C3S%, and GC% and T3S% (p < 0.01), whereas no correlation was detected between T% and T3S% or G% and G3S% (p > 0.05) (Table 4). Except G3S%, no correlation was found between f2’ values and A%, T%, G%, C%, GC%, A3S%, T3S%, C3S% or GC3S% (p > 0.05) in this study (Table 5). Third, most points in the ENC plot were close to the expected curve, although all were below it (Figure 2). The above evidences suggests that, in addition to mutation pressure, natural selection played an important role in determining codon usage bias for TTSuV2 genomes as well. Thus, codon bias in the TTSuV2 genome is multi-factorial. We believe that these characteristics of TTSuV2 genomes might have conferred adaptive advantage resulting in a highly efficient dissemination of this virus through different modes of transmission.

The analysis of TTSuV genome sequences identified two genetically distinct species, TTSuV1 and TTSuV2. COA was performed to detect possible codon usage variation between these two viruses. Unexpectedly, the distribution of the two viruses showed that genetically distinct species were distantly located in the plane defined by the first two axes of the analysis (Figure 3). A cluster tree analysis based on the RSCU values of TTSuV2 genomes revealed that geographic factors failed to correspond to the codon usage pattern of this virus (Figure 4). Further, the phylogenetic tree had two major branches corresponding to the two different species, and no specific geographical correlation was detected in this analysis (Figure 5). It seems likely that, given extensive international communication and various modes of transmission for this virus, geographical distance is a weak factor in the distribution of TTSuV2 in different countries.

In summary, our investigation of synonymous codon usage pattern in TTSuV2 CDS revealed that codon usage bias is not remarkable, possibly representing the interactions between compositional constraint under mutation pressure and natural selection. However, both TTSuV1 and TTSuV2 genomes exhibited significant synonymous codon usage bias favoring A or C at the third codon position, presumably determined by compositional constraint under mutation pressure. Although the analysis of synonymous codon usage does not perfectly reflect the genetic variation of TTSuV2 nor does it distinguish between TTSuV1 and TTSuV2, our results provide an insight into the codon usage variation in TTSuV2 genes that may also facilitate understanding of TTSuV evolution.

Author Contributions

Conceived and designed the experiments: ZZ WD. Performed the experiments: ZZ WD. Analyzed the data: ZZ WD. Contributed reagents/materials/analysis tools: ZZ WD DD. Wrote the manuscript: ZZ WD.


  1. 1. Gupta SK, Bhattacharyya TK, Ghosh TC (2004) Synonymous codon usage in Lactococcus lactis: mutational bias versus translational selection. J Biomol Struct Dyn 21: 527-536. doi:10.1080/07391102.2004.10506946. PubMed: 14692797.
  2. 2. Lloyd AT, Sharp PM (1992) Evolution of codon usage patterns: the extent and nature of divergence between Candida albicans and Saccharomyces cerevisiae. Nucleic Acids Res 20: 5289-5295. doi:10.1093/nar/20.20.5289. PubMed: 1437548.
  3. 3. Chen R, Holmes EC (2006) Avian influenza virus exhibits rapid evolutionary dynamics. Mol Biol Evol 23: 2336-2341. doi:10.1093/molbev/msl102. PubMed: 16945980.
  4. 4. Zhong J, Li Y, Zhao S, Liu S, Zhang Z (2007) Mutation pressure shapes codon usage in the GC-Rich genome of foot-and-mouth disease virus. Virus Genes 35: 767-776. doi:10.1007/s11262-007-0159-z. PubMed: 17768673.
  5. 5. Sau K, Sau S, Mandal SC, Ghosh TC (2005) Factors influencing the synonymous codon and amino acid usage bias in AT-rich Pseudomonas aeruginosa phage PhiKZ. Acta Biochim Biophys Sin (Shanghai) 37: 625-633. doi:10.1111/j.1745-7270.2005.00089.x. PubMed: 16143818.
  6. 6. Sharp PM, Li WH (1986) Codon usage in regulatory genes in Escherichia coli does not reflect selection for 'rare' codons. Nucleic Acids Res 14: 7737-7749. doi:10.1093/nar/14.19.7737. PubMed: 3534792.
  7. 7. Shackelton LA, Parrish CR, Holmes EC (2006) Evolutionary basis of codon usage and nucleotide composition bias in vertebrate DNA viruses. J Mol Evol 62: 551-563. doi:10.1007/s00239-005-0221-1. PubMed: 16557338.
  8. 8. Naya H, Romero H, Carels N, Zavala A, Musto H (2001) Translational selection shapes codon usage in the GC-rich genome of Chlamydomonas reinhardtii. FEBS Lett 501: 127-130. doi:10.1016/S0014-5793(01)02644-8. PubMed: 11470270.
  9. 9. Chiapello H, Lisacek F, Caboche M, Henaut A (1998) Codon usage and gene function are related in sequences of Arabidopsis thaliana. Gene 209: GC1-GC38.
  10. 10. Ma ES, Chow EY, Chan AY, Chu CM, Lin SY et al. (2002) Low affinity and unstable hemoglobin variant caused by AAC--ATC (Asn--Ile) mutation at codon 108 of the beta-globin gene. Haematologica 87: 553-554. PubMed: 12010673.
  11. 11. McInerney JO (1998) Replicational and transcriptional selection on codon usage in Borrelia burgdorferi. Proc Natl Acad Sci U S A 95: 10698-10703. doi:10.1073/pnas.95.18.10698. PubMed: 9724767.
  12. 12. Chiusano ML, Alvarez-Valin F, Di Giulio M, D'Onofrio G, Ammirato G et al. (2000) Second codon positions of genes and the secondary structures of proteins. Relationships and implications for the origin of the genetic code. Gene 261: 63-69.
  13. 13. Gupta SK, Majumdar S, Bhattacharya TK, Ghosh TC (2000) Studies on the relationships between the synonymous codon usage and protein secondary structural units. Biochem Biophys Res Commun 269: 692-696. doi:10.1006/bbrc.2000.2351. PubMed: 10720478.
  14. 14. Levin DB, Whittome B (2000) Codon usage in nucleopolyhedroviruses. J Gen Virol 81: 2313-2325. PubMed: 10950991.
  15. 15. Mushahwar IK, Erker JC, Muerhoff AS, Leary TP, Simons JN et al. (1999) Molecular and biophysical characterization of TT virus: evidence for a new virus family infecting humans. Proc Natl Acad Sci U S A 96: 3177-3182. doi:10.1073/pnas.96.6.3177. PubMed: 10077657.
  16. 16. Biagini P (2009) Classification of TTV and related viruses (anelloviruses). Curr Top Microbiol Immunol 331: 21-33. doi:10.1007/978-3-540-70972-5_2. PubMed: 19230555.
  17. 17. Nishizawa T, Okamoto H, Konishi K, Yoshizawa H, Miyakawa Y et al. (1997) A novel DNA virus (TTV) associated with elevated transaminase levels in posttransfusion hepatitis of unknown etiology. Biochem Biophys Res Commun 241: 92-97. doi:10.1006/bbrc.1997.7765. PubMed: 9405239.
  18. 18. Okamoto H, Nishizawa T, Takahashi M, Tawara A, Peng Y et al. (2001) Genomic and evolutionary characterization of TT virus (TTV) in tupaias and comparison with species-specific TTVs in humans and non-human primates. J Gen Virol 82: 2041-2050. PubMed: 11514713.
  19. 19. Okamoto H, Takahashi M, Nishizawa T, Tawara A, Fukai K et al. (2002) Genomic characterization of TT viruses (TTVs) in pigs, cats and dogs and their relatedness with species-specific TTVs in primates and tupaias. J Gen Virol 83: 1291-1297. PubMed: 12029143.
  20. 20. Niel C, Diniz-Mendes L, Devalle S (2005) Rolling-circle amplification of Torque teno virus (TTV) complete genomes from human and swine sera and identification of a novel swine TTV genogroup. J Gen Virol 86: 1343-1347. doi:10.1099/vir.0.80794-0. PubMed: 15831945.
  21. 21. McKeown NE, Fenaux M, Halbur PG, Meng XJ (2004) Molecular characterization of porcine TT virus, an orphan virus, in pigs from six different countries. Vet Microbiol 104: 113-117. doi:10.1016/j.vetmic.2004.08.013. PubMed: 15530745.
  22. 22. Jelcic I, Hotz-Wagenblatt A, Hunziker A, Zur Hausen H, de Villiers EM (2004) Isolation of multiple TT virus genotypes from spleen biopsy tissue from a Hodgkin's disease patient: genome reorganization and diversity in the hypervariable region. J Virol 78: 7498-7507. doi:10.1128/JVI.78.14.7498-7507.2004. PubMed: 15220423.
  23. 23. Kakkola L, Bondén H, Hedman L, Kivi N, Moisala S et al. (2008) Expression of all six human Torque teno virus (TTV) proteins in bacteria and in insect cells, and analysis of their IgG responses. Virology 382: 182-189. doi:10.1016/j.virol.2008.09.012. PubMed: 18947848.
  24. 24. Ellis JA, Allan G, Krakowka S (2008) Effect of coinfection with genogroup 1 porcine torque teno virus on porcine circovirus type 2-associated postweaning multisystemic wasting syndrome in gnotobiotic pigs. Am J Vet Res 69: 1608-1614. doi:10.2460/ajvr.69.12.1608. PubMed: 19046008.
  25. 25. Kekarainen T, Sibila M, Segalés J (2006) Prevalence of swine Torque teno virus in post-weaning multisystemic wasting syndrome (PMWS)-affected and non-PMWS-affected pigs in Spain. J Gen Virol 87: 833-837. doi:10.1099/vir.0.81586-0. PubMed: 16528032.
  26. 26. Aramouni M, Segalés J, Sibila M, Martin-Valls GE, Nieto D et al. (2011) Torque teno sus virus 1 and 2 viral loads in postweaning multisystemic wasting syndrome (PMWS) and porcine dermatitis and nephropathy syndrome (PDNS) affected pigs. Vet Microbiol 153: 377-381. doi:10.1016/j.vetmic.2011.05.046. PubMed: 21719215.
  27. 27. Nieto D, Aramouni M, Grau-Roma L, Segalés J, Kekarainen T (2011) Dynamics of Torque teno sus virus 1 (TTSuV1) and 2 (TTSuV2) DNA loads in serum of healthy and postweaning multisystemic wasting syndrome (PMWS) affected pigs. Vet Microbiol 152: 284-290. doi:10.1016/j.vetmic.2011.05.020. PubMed: 21680113.
  28. 28. Taira O, Ogawa H, Nagao A, Tuchiya K, Nunoya T et al. (2009) Prevalence of swine Torque teno virus genogroups 1 and 2 in Japanese swine with suspected post-weaning multisystemic wasting syndrome and porcine respiratory disease complex. Vet Microbiol 139: 347-350. doi:10.1016/j.vetmic.2009.06.010. PubMed: 19570625.
  29. 29. Zhang Z, Dai W, Wang Y, Lu C, Fan H (2013) Analysis of synonymous codon usage patterns in torque teno sus virus 1 (TTSuV1). Arch Virol, 158: 145–54. PubMed: 23011310.
  30. 30. Marais G, Mouchiroud D, Duret L (2001) Does recombination improve selection on codon usage? Lessons from nematode and fly complete genomes. Proc Natl Acad Sci U S A 98: 5688-5692. doi:10.1073/pnas.091427698. PubMed: 11320215.
  31. 31. Etherington GJ, Dicks J, Roberts IN (2005) Recombination Analysis Tool (RAT): a program for the high-throughput detection of recombination. Bioinformatics 21: 278-281. doi:10.1093/bioinformatics/bth500. PubMed: 15333462.
  32. 32. Tamura K, Peterson D, Peterson N, Stecher G, Nei M et al. (2011) MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol 28: 2731-2739. doi:10.1093/molbev/msr121. PubMed: 21546353.
  33. 33. Sharp PM, Li WH (1986) An evolutionary perspective on synonymous codon usage in unicellular organisms. J Mol Evol 24: 28-38. doi:10.1007/BF02099948. PubMed: 3104616.
  34. 34. Comeron JM, Aguadé M (1998) An evaluation of measures of synonymous codon usage bias. J Mol Evol 47: 268-274. doi:10.1007/PL00006384. PubMed: 9732453.
  35. 35. Zhou JH, Zhang J, Chen HT, Ma LN, Liu YS (2010) Analysis of synonymous codon usage in foot-and-mouth disease virus. Vet Res Commun 34: 393-404. doi:10.1007/s11259-010-9359-4. PubMed: 20425142.
  36. 36. Posada D, Crandall KA (2002) The effect of recombination on the accuracy of phylogeny estimation. J Mol Evol 54: 396-402. PubMed: 11847565.
  37. 37. Krakowka JE, MacIntosh K, Ringler SS, Rings DM,Hartunian C, Zhang Yan, Allan G (2008) Porcine genogroup 1 Torque Teno Virus (G1-TTV) Potentiates PCV2 & PRRSV infections in gnotobiotic swine. Proceedings of the International Pig Veterinary Society (Durban) 1: 99.
  38. 38. Krakowka S, Ellis JA (2008) Evaluation of the effects of porcine genogroup 1 torque teno virus in gnotobiotic swine. Am J Vet Res 69: 1623-1629. doi:10.2460/ajvr.69.12.1623. PubMed: 19046010.
  39. 39. Zhao S, Zhang Q, Liu X, Wang X, Zhang H et al. (2008) Analysis of synonymous codon usage in 11 human bocavirus isolates. Biosystems 92: 207-214. doi:10.1016/j.biosystems.2008.01.006. PubMed: 18378386.
  40. 40. Wang M, Zhang J, Zhou JH, Chen HT, Ma LN et al. (2011) Analysis of codon usage in type 1 and the new genotypes of duck hepatitis virus. Biosystems 106: 45-50. doi:10.1016/j.biosystems.2011.06.005. PubMed: 21708221.
  41. 41. Wong EH, Smith DK, Rabadan R, Peiris M, Poon LL (2010) Codon usage bias and the evolution of influenza A viruses. Codon Usage Biases of Influenza Virus. BMC Evol Biol 10: 253. doi:10.1186/1471-2148-10-253. PubMed: 20723216.
  42. 42. Posada D, Crandall KA, Holmes EC (2002) Recombination in evolutionary genomics. Annu Rev Genet 36: 75-97. doi:10.1146/annurev.genet.36.040202.111115. PubMed: 12429687.
  43. 43. Shi M, Holmes EC, Brar MS, Leung FC (2013) Recombination is Associated with an Outbreak of Novel Highly Pathogenic Porcine Reproductive and Respiratory Syndrome Viruses in China. J Virol, 87: 10904–7. PubMed: 23885071.
  44. 44. Ramos N, Mirazo S, Castro G, Arbiza J (2013) Molecular analysis of Porcine Circovirus Type 2 strains from Uruguay: Evidence for natural occurring recombination. Infect Genet Evol 19C: 23-31. PubMed: 23806516.
  45. 45. Li J, Huo X, Dai Y, Yang Z, Lei Y et al. (2012) Evidences for intertypic and intratypic recombinant events in EV71 of hand, foot and mouth disease during an epidemic in Hubei Province, China, 2011. Virus Res 169: 195-202. doi:10.1016/j.virusres.2012.07.028. PubMed: 22922556.
  46. 46. Abrantes J, Esteves PJ, van der Loo W (2008) Evidence for recombination in the major capsid gene VP60 of the rabbit haemorrhagic disease virus (RHDV). Arch Virol 153: 329-335. doi:10.1007/s00705-007-1084-0. PubMed: 18193156.
  47. 47. Zhou T, Gu W, Ma J, Sun X, Lu Z (2005) Analysis of synonymous codon usage in H5N1 virus and other influenza A viruses. Biosystems 81: 77-86. doi:10.1016/j.biosystems.2005.03.002. PubMed: 15917130.
  48. 48. Gu W, Zhou T, Ma J, Sun X, Lu Z (2004) Analysis of synonymous codon usage in SARS Coronavirus and other viruses in the Nidovirales. Virus Res 101: 155-161. doi:10.1016/j.virusres.2004.01.006. PubMed: 15041183.
  49. 49. Zhang Z, Wang Y, Fan H, Lu C (2012) Natural infection with torque teno sus virus 1 (TTSuV1) suppresses the immune response to porcine reproductive and respiratory syndrome virus (PRRSV) vaccination. Arch Virol 157: 927-933. doi:10.1007/s00705-012-1249-3. PubMed: 22327391.
  50. 50. Namouchi A, Didelot X, Schöck U, Gicquel B, Rocha EP (2012) After the bottleneck: Genome-wide diversification of the Mycobacterium tuberculosis complex by mutation, recombination, and natural selection. Genome Res 22: 721-734. doi:10.1101/gr.129544.111. PubMed: 22377718.