Based on GC content and the observed/expected CpG ratio (oCpGr), we found three major groups among the members of subfamily Parvovirinae: Group I parvoviruses with low GC content and low oCpGr values, Group II with low GC content and high oCpGr values and Group III with high GC content and high oCpGr values. Porcine parvovirus belongs to Group I and it features an ascendant CpG distribution by position in its coding regions similarly to the majority of the parvoviruses. The entire PPV genome remains hypomethylated during the viral lifecycle independently from the tissue of origin. In vitro CpG methylation of the genome has a modest inhibitory effect on PPV replication. The in vitro hypermethylation disappears from the replicating PPV genome suggesting that beside the maintenance DNMT1 the de novo DNMT3a and DNMT3b DNA methyltransferases can’t methylate replicating PPV DNA effectively either, despite that the PPV infection does not seem to influence the expression, translation or localization of the DNA methylases. SNP analysis revealed high mutability of the CpG sites in the PPV genome, while introduction of 29 extra CpG sites into the genome has no significant biological effects on PPV replication in vitro. These experiments raise the possibility that beyond natural selection mutational pressure may also significantly contribute to the low level of the CpG sites in the PPV genome.
Citation: Tóth R, Mészáros I, Stefancsik R, Bartha D, Bálint Á, Zádori Z (2013) CpG Distribution and Methylation Pattern in Porcine Parvovirus. PLoS ONE 8(12): e85986. doi:10.1371/journal.pone.0085986
Editor: Jianming Qiu, University of Kansas Medical Center, United States of America
Received: July 30, 2013; Accepted: December 3, 2013; Published: December 31, 2013
Copyright: © 2013 Tóth et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by OTKA and NKTH (Mobilitás 08-C OTKA 81187), the János Bolyai Fellowship from the Hungarian Academy of Sciences (BO/00414/10). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
DNA methylation is the prime form of epigenetic modifications of the eukaryotic genome. In vertebrate cells almost exclusively the 5th carbon atom of cytosine is methylated within CpG dinucleotides. Methylation has a significant impact on chromatin structure modulation, genomic imprinting and X chromosome inactivation. It can inhibit transcription by preventing the binding of transcription factors or by recruiting methyl-binding proteins and histone deacetylases leading to the formation of condensed chromatin structure .
In mammals approximately 60-90% of CpGs comprise methylated cytosine bases . The 5’ end of the housekeeping genes are often associated with a GC-rich stretch of DNA containing high amounts of CpG dinucleotides, so called CpG islands, which are free of methylation . DNA methyltransferases (DNMTs) are responsible for the conversion of cytosines to 5-methylcytosines. The DNMTs are divided into two groups: maintenance (DNMT1) and de novo (DNMT3a and DNMT3b) methyltransferases. The mechanism of site specific CpG methylation and regulation of DNMTs to develop specific patterns are presently not well understood .
CpGs are observed only at one-fourth to one-third of their expected frequency [5,6] in most vertebrate genomes. Several explanations have been proposed to account for this discrepancy, deamination and timine conversion of methylated cytosines, avoidance of higher stacking energy of CpG dinucleotides during replication and prevention of autoimmune reactions among others.
Unmethylated CpGs (UCpGs) as signature of invading bacterial and viral organisms are immunostimulants even on short oligonucleotides in mammals. Immune response is triggered by the UCpGs binding to TLR9, a member of the Toll-like receptor family on the surface of dendritic cells . Therefore CpG methylation is not only important in the regulation of the hosts’ life processes, but it also plays a key role in the detection of microbial and viral pathogens and inactivation of integrated foreign DNA [8,9], consequently it has major influence on the lifecycles of DNA- and retroviruses as well. The role of methylation in viral regulation is less understood than in mammals. Integrated adenoviruses and papovaviruses are generally hypermethylated, while the actively replicating viral DNA is hypomethylated with methylated sites in specific regions of the viral genome . EBV is highly methylated during latency, and becomes demethylated during active replication. It uses methylation-induced gene silencing to evade host immunity . In contrast, ranid herpesviruses are heavily methylated during replication and probably code their own DNA cytosine-5 methyltransferases . CpG dinucleotides are underrepresented in most of the small DNA viruses. This pattern is thought to be established by evolutionary pressure to avoid CpG-mediated immune responses and to decrease the direct interference of methylation on the transcription of viral RNAs and viral replication .
Parvoviruses are small single stranded DNA viruses with an approximately 4-6 Kb linear genome. Despite their small genome and their limited coding capacity parvoviruses are surprisingly successful to invade a wild variety of host organisms from insects to mammals and constitute a large, diverse virus family . Their diversity manifests not only in the large number of parvoviral species, but also in the complexity of their lifecycle. Beside lytic infection some parvoviruses are able to infect their respective host persistently [13,14]. Adeno-associated viruses are able to insert their genome into the host genome to establish latent infection and subsequently are capable to parasitize the transcription machinery of other viruses and reactivate their own replication mechanism during helper virus infection .
Members of the Parvovirinae subfamily are among the most dangerous and economically most harmful viruses of domesticated and farm animals (e.g., canine parvovirus (CPV) , goose parvovirus (GPV)  porcine parvovirus (PPV)  and are also able to cause diseases in humans (B19)  and human bocavirus [20,21]. PPV is responsible for syndrome of reproductive failure in swine, included infertility, early embryonic death, mummified fetuses and stillbirths . It frequently causes persistent infection combined with chronic shedding .
There are some well-established connections between persistent viral infection and CpG methylation in other virus families [24,25] but not much is known about the role of epigenetic modifications in parvoviruses. In this paper our aim is to expand our knowledge about the effect of CpG methylation in the life cycle of parvoviruses focusing on PPV and to reveal whether methylation has any direct influence on the evolution of the CpG poor PPV genome.
Our in silico analysis of the CpG pattern of parvoviruses revealed that parvoviral genomes are more heterogenic in their CpG contents than it was previously recognized and a group of parvoviruses exists in which CpGs are not depleted despite that their genomes are AT rich. PPV DNA was found hypomethylated independently from its tissue origin. In vitro methylation of PPV DNA or the introduction of additional CpG sites into the PPV genome had no significant effect on PPV replication in vitro. These data indicate that CpG methylation has no regulating role in PPV life cycle and together with the recently published findings that parvoviruses do not induce TLR9 activated immune response  suggest that CpG depletion in the genome of PPV and other parvoviruses is most probably the consequence of other evolutionary forces than CpG methylation.
Materials and Methods
Computing CpG distribution in coding positions and SNP analysis
Parvovirus sequences have been collected from the NCBI nucleotide databank (Table S1a). The contiguous protein coding sequences for Drosophila melanogaster were downloaded in FASTA format from the FTP site of FlyBase (release r5.24) . The core set of human coding sequences are from The Consensus CDS (CCDS) project . The protein coding regions of the human mRNAs were downloaded in FASTA format from the CCDS database . Dinucleotide frequencies as a function of coding sequence position were calculated using custom Bash scripts (available upon request). Single nucleotide polymorphism (SNP) was calculated from 68 PPV sequences containing the full coding regions or the complete NS or VP genes (Table S1b) with a custom made program. The algorithm counts the polymorphism sites (distinguishing the transition events at C, G, CpG and GC nucleotides) in a multiple alignment where a consensus base is only considered with 75% confidence or above (C++ source code available upon request).
PT (porcine testis)  and Cos 7 (African green monkey kidney)  cell lines were used for the propagation of the NADL-2 strain of PPV and its derivatives. Cells were grown in Dulbecco’s modified Eagle’s medium with high glucose (4,5 g/l) and L-glutamine (PAA, Cölbe, Germany) supplemented with 1% PenStrep (100X; penicillin: 10 000 U/ml, streptomycin: 10 mg/ml; PAA), 1% sodium-pyruvate solution (100 mM; Lonza, Basel, Switzerland) and 10% Fetal Bovine Serum Gold (PAA).
Viral DNA extraction and determination of the methylation pattern
The packaged form of the viral DNA was extracted from 1 ml tissue supernatant by the High Pure Viral Nucleic Acid Kit (Roche, Basel, Switzerland) according to the manufacturer’s recommendations. The replicative PPV DNA was purified by using modified Hirt extraction  as it is described by Molitor et al. . The methylation status of the viral DNAs was determined by bisulfite PCR, cloning and sequencing. For the bisulfite conversion the EpiTect Bisulfite Kit (Qiagen, Venlo, Netherlands) was used according to the manufacturer’s instructions. The modified CpG containing DNA fragments of the positive and negative strands were amplified by PCR primers (Table 1) which were designed using MethPrimer program . PCR reactions were executed by DreamTaq DNA Polymerase (Thermo Fisher Scientific, Waltham, MA, USA). The DNA amplifications were carried out by initial denaturation for 5 min at 95 °C, followed by 30 cycles at 95 °C for 20 s, 52 °C for 20 s, and 72 °C for 20 s. The amplified DNA fragments were cloned into pJET1.2 blunt cloning vector (Thermo Fisher Scientific) and sequenced by using BigDye Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems, Foster City, CA, USA) following the instructions of the manufacturers.
|Primer name||Sequence||Product size (bp)||CpGs in product||Primer name||Sequence||Product size (bp)||CpGs in product|
Deep sequencing was executed on an Ion Torrent sequenator using the IonXpress barcode set and the 316D chip kit, after a DNA library preparation from the equimolarly pooled bisulfite PCR fragments by the NEBNext® Fast DNA Fragmentation & Library Prep Set for Ion Torrent (New England BioLabs, Ipswich, MA, USA). For data procession the 2.2 Torrentsuite software was used. To gain quality data, reads under 15 average Phred score were omitted. CLC Genomics Workbench 5.5 was used for data analysis. High confidence of the evaluation was ensured by excluding short reads (<20 nucleotide) and setting Length fraction and Similarity fraction parameters to 0.9.
Creation of the mutant viruses
Seven mutant viruses (3 single M1, M2, M3, and three double M12, M23, M13 and one triple M123) were rescued containing extra CpGs. To introduce CpG mutations into the NcoI-SacI (3473-3935) fragments of NADL-2 strain by joining PCR, three pairs of overlapping mutational primers (M1-3F and M1-3R), two external (mut_exF, mut_exR) and two internal (mut_intF, mut_intR) primers were designed (Table 2). For the upstream fragments of the joining PCRs mut_exF and M1-3R primers were utilized, for the downstream fragments PCR mut_exR and M1-3F primers. For single mutants the pN2 infectious clone of the NADL-2 strain , for double mutants the clone of the M1 and M2 mutant viruses, while for the triple mutant the clone of M12 served as templates. The PCR reactions included 5 µl of 5X HF Buffer, 0,5 µl of 10 mM dNTP, 1 µl of each overlapping and external primers (15 pmol/µl) of each different mutants, 0,2 µl of 2U/µl Phusion Hot Start II DNA Polymerase (Thermo Fisher Scientific), 0,05 ng template DNA and distilled water to a final volume of 25 µl. Amplifications started by initial denaturation for 5 min at 95 °C and followed by 30 cycles at 98 °C for 10 s, 65 °C for 15 s, and 72 °C for 20 s. Joining PCRs were carried out by mut_intF and mut_intR primers using Hot Start II DNA Polymerase and 2 ng of the appropriate mutant downstream and upstream fragments. DNA amplifications started by initial denaturation for 5 min at 95 °C, followed by 30 cycles at 98 °C for 10 s, 60 °C for 15 s, and 72 °C for 20 s. The amplified 1005 nucleotide-long products were digested with NcoI and SacI (New England BioLabs) restriction enzymes and cloned into the same sites of the pN2 infectious clone. Mutant sequences were deposited into GenBank under the accession numbers: KF913345-KF913351.
|Primer name||Sequence||Number of extra CpGs|
Transfection, viral stocks titration and quantification
To rescue the mutant viruses the infectious clones were transfected into PT cells by Turbofect (Thermo Fisher Scientific) reagent according to the supplier’s recommendations. After 48 hours 50 µl culture media was transferred from the transfected cells to a 24-well plate seeded with PT cells and the viruses were let to multiply for 96 hours. This was followed by the inoculation of semi confluent PT cells with 0.5 ml of viral supernatants on 75 cm2 plates. After 96 hours the supernatants were collected and titrated by three parallel, independent dilutions with immunofluorescence detection technique on PT and Cos7 cells as described previously . Shortly: virus samples were serially diluted (10x) and cells on a 96-well plate were infected with 10 µl of the viral dilutions. PT and Cos7 cells were fixed after 20 and 48 hours respectively (to exclude the detection of progeny viruses emerging from the cells used for titering) with 3% formaldehyde and permeabilized with 1% Triton X-100. The 3C9 (CRL-17; ATCC) anti-PPV antibody and Alexa Flour® 488 donkey anti-mouse IgG (Life Technologies Carlsbad, CA, USA) as secondary antibody were used for visualization of the infected nuclei (IN). Titer was calculated by multiplying the number of IN by the dilution factors and values are given in fluorescent nuclei count (FNC)/ml.
For qPCR quantification of the viral production initiated by differently methylated DNAs the transfected cells were washed three times to ensure minimal contamination of the viral stocks by plasmid originated viral DNA.
To monitor sequence stability viral stocks were passaged 10 times on a 24-well plate containing PT cells transferring 5 µl of the virus containing supernatant after 48 hours to freshly seeded cells covered by 2 ml medium. After the last passages new stocks were prepared on 75 cm2 plates as it is described above. Following DNA extraction, the mutated regions were amplified by mut_intF, mut_intR primers (Table 2) and sequenced.
For DNA quantification qPCR were performed using the earlier described VPS2 (5’-CAATACTGCACCTGTATTTCCAAATGG-3’) and VPAS2 (5’-AAAATTTTATTGTTTTTTGGGGATAATTGG-3’) primers .
The reactions included 2,5 µl of 10X DreamTaq Buffer, 0,5 µl of dNTP mix, 1 µl of each 15 pmol/µl primers, 0,2 µl of DreamTaq DNA Polymerase, 1,25 µl of 20X EvaGreenTM (Biotium, Hayward, CA, USA), 1 µl DNA template and sterile distilled water to a final volume of 25 µl. The DNA amplifications were carried out by initial denaturation for 5 min at 95 °C, followed by 40 cycles at 95 °C for 20 s, 55 °C for 20 s, and 72 °C for 50 s.
Preparation of bacterially methylated and non-methylated DNA for transfection
To get unmethylated viral genome, a PCR was performed using pN2 as template and N2F (5’-GGGTTATTGTCTCATGAGCGGATACATA-3’) and N2R (5’-CAATTTCACACAGG AAACAGCTATGACC-3’) primers. The PCR reaction included 20 µl of 5X GC Buffer, 2 µl of 10 mM dNTP, 6 µl DMSO 0.6 µl of each 100 pmol/µl primers, 1 µl of 2U/µl Phusion Hot Start II DNA Polymerase (Thermo Fisher Scientific) and distilled water to a final volume of 100 µl. The DNA amplification started by initial denaturation for 3 min at 98 °C, followed by 25 cycles at 98 °C for 15 s, 66 °C for 20 s, and 72 °C for 4 min 30 s. To obtain bacterially DAM and DCM methylated viral genome the pN2 was digested by KpnI and BamHI restriction enzymes (Thermo Fisher Scientific) which cut the NADL-2 virus from the vector. In each case the viral genomes were isolated from 0.7% agarose gel by Nucleospin Extract II kit (Macherey-Nagel, Dueren, Germany).
In vitro CpG methylation
In a reaction 3 µg viral DNA was methylated by CpG methylase (Zymo Research, Irvine, CA, USA) according to the manufacturer’s instructions. The reaction was stopped by ethanol precipitation, washed by 70% ethanol, dried and resolved in 20 µl distilled water. The success of hipermethylation was inspected by digestion of an aliquot with methylation sensitive SsiI (Thermo Fisher Scientific) restriction enzyme in 20 µl final volume.
Determination of expression levels of DNA methyltransferases in infected and non-infected tissues
The expression levels of porcine DNMT1, DNMT3a, DNMT3b were defined by real-time PCR, and normalized with those of the GADPH gene. RNAs were purified from PPV infected (MOI 3) and mock infected PT cells grown on 75 cm2 flask 24 hours postinfection (HPI) by RiboZolTM RNA Extraction Reagent (Amresco, Solon, OH, USA) according to the manufacturers’ instructions. The RNA was dissolved in 15 µl DEPC-treated water. The first strand of cDNA and the amplification were created using One-Step RT-PCR Kit (Qiagen). The reactions included 5 µl of 5X QIAGEN OneStep RT-PCR Buffer, 1 µl of dNTP mix, 1 µl of each 15 pmol/µl primers (Table 3), 1 µl of QIAGEN OneStep RT-PCR Enzyme mix, 1,25 µl of 20X EvaGreenTM, 1 µl RNA template and RNase-free water to a final volume of 25 µl. The reaction started with reverse transcription for 30 min at 50 °C and initial PCR activation for 15 min at 95 °C, followed by 35 cycles at 95 °C for 30 sec, 55 °C for 30 sec, and 72 °C for 40 sec, and 72 °C for 5 min.
|Primer name||Sequence||Product size (bp)||Accession number|
Immunofluorescence and Western blot detection of DNMTs
Western blot detection of the DNMT proteins were carried out by using DNMT1 (H-300) and DNMT3a/b (C-15) primary antibodies (Santa Cruz Biotechnology Inc., Heidelberg, Germany), horseradish peroxidase conjugated swine anti-rabbit (DAKO, Glostrup, Denmark) and rabbit anti-goat (Southern Biotech, Birmingham, AL, USA) secondary antibodies in a 50, 100, 500 and 8000-fold dilutions respectively. For loading control anti-alpha tubulin antibody (Developmental Studies Hybridoma Bank, Iowa City, IA, USA) were used together with peroxidase conjugated rabbit anti-mouse IgG (Sigma-Aldrich, St. Louis, MO, USA).
The peroxidase was revealed using the TMB cholorimetric substrate (MIKROGEN, Neuried, Germany) according to the supplier’s recommendations. Bands were quantified with the ImageJ programs .
For immunofluorescence detection DNMT1 (H-300) and CF488A labeled goat anti-rabbit were used in 50 and 300-fold dilutions respectively and the samples were examined with a Zeiss Axio Observer D1 inverse fluorescence research microscope.
GC and CpG content of Parvoviruses
To better understand common and distinguishing features of PPV genome organization among parvoviruses the GC content and CpG density of 32 parvoviruses from the Parvovirinae subfamily were calculated and compared. The GC content of parvoviral genomes scales between 35% and 63%. In general, it can be stated that self-replicating parvoviruses have AT-rich genomes (GC content < 50%), while most of the adeno-associated viruses have GC-rich genomes (GC content > 50%) (Figure 1).
Values are shown in percentage; viruses are listed by increasing GC content. Group I, Group II and Group III parvoviruses are indicated by red, violet and green colors respectively.
In self-replicating viruses the observed/expected CpG ratio (oCpGr) is variable: it can take very low values (like in the case of PPV), or high values (as it can be seen in the CaMiV genome). Dependoviruses are more uniform regarding CpG content: in each case the oCpGr values stay above 60% with the exception of MDPV (which is an autonomous member of the Dependovirus genus) and can reach more than 100%.
Considering the GC content and the oCpGr values, three major categories can be distinguished among the members of Parvovirinae subfamily: parvoviruses with low GC content (< 50%) and low oCpGr values (< 50%) (Group I), viruses with low GC content (< 50%) and high oCpGr values (> 50%) (Group II) and viruses with high GC (> 50%) content and high oCpGr (> 50%) values (Group III). A characteristic taxonomical distribution can be recognized among these three groups. All known members of Amdovirus, Erythrovirus and Parvovirus (PPV among them) genera together with several members of the Bocavirus genus (like the highly CpG depleted human bocavirus and porcine bocavirus) belong to Group I. Other members of the Bocavirus genus (e.g. the CpG-rich BPV and CaMiV) together with the unclassified chicken and turkey parvoviruses are members of Group II. Most viruses of the Dependovirus genus belong to Group III.
CpG distribution in PPV and in the coding regions of parvoviruses
Not only the number of CpGs but their distribution in the genome is different among parvoviruses. Very few CpG islands can be found in members of the first group and usually they are restricted to the terminal regions, while several potential CpG islands can be plotted in every member of the latter two groups scattered throughout the genome including both the coding and non-coding regions (Figure 2).
Red vertical bars represent CpG sites in the genomes, light blue shading indicates CpG islands (parameters: window 100, observed/expected CpG minimum 0.6, GC% minimum 50). A, PPV Kresse strain; B, Canine minute virus; C, AAV2 as representatives of Group I, Group II and Group III parvoviruses respectively. Figure was generated by MethPrimer program .
The NADL-2 strain of PPV contains 60 CpGs 13 in the left 7 in the right non coding regions and 40 in the protein coding sequences. The distribution of CpGs in the coding frames is not random. Only three CpGs can be found in the first coding position (CGX) in the Arginine codons, 13 in the second position (XCG) (in serine, proline, threonine and alanine codons) and 24 in the third position (XXC GXX) affecting two amino acid codons (Figure 3a). Similar ascendant tendency of CpG distribution can be observed by position in the majority of the parvoviruses but also in the viral hosts, for example in Homo sapiens (Figure 3b). Since the distribution is independent of the absolute number of the CpGs in the viral genomes and similar distribution can be detected in eukaryotic organisms, which do not have CpG methylation (e.g. Drosophila melanogaster) , it is more probable that the particular pattern of CpG distribution is rather the effect of coding bias and coding preference than some unknown evolutionary effect of the methylation machinery of the different organisms.
A, Total number of CpGs in the main coding ORFs (NS and VP) of parvoviral genomes. B, Number of CpGs in the different positions of the mRNAs of the human and drosophila proteomes. Numbers on the X axis indicate distance from the start codon of the proteins; Number of CGX (Arg codons) are indicated in 1,4,7,10… positions, numbers of XCG codons (Ser, Pro, Thr, Ala) are indicated in 2,5,8,11… positions and numbers of CGs in consecutive codons XXC GXX are indicated in 3,6,9,12… positions.
Biological effects of additional CpGs in the PPV genome
The small number of CpGs in the PPV genome implies an evolutionary pressure against this dinucleotide. To study the biological effects of the elevated CpG ratio, seven mutants were created in which new CpG sites were inserted into three CpG free regions of the VP2 gene by site-specific silent mutations of the pN2 infectious clone. The number of the new CpG sites in the mutants ranged between 9 and 29 (Figure 4a). The mutant clones were transfected into PT cells and the supernatants of the transfected cells were collected 48 hour post transfection and used to infect PT cells. The presence of PPV was monitored by IF using 3C9 monoclonal antibodies. The transfection of all mutant infectious clones resulted viable virus and a stock solution was prepared from each and titrated on permissive PT cells and semi-permissive Cos7 cells. There was no significant difference in the infectivity of the mutant and the wild type viral stocks indicating that the extra CpG sites have no significant biological effects (p= 0.557 at PT cells and p= 0.0727 at Cos7 cells by ANOVA) either on viral production or viral growth in vitro (Figure 4b). To monitor the stability of the mutant viruses 10 additional passages were carried out on PT cells and new stock solutions were prepared. The titers of the stocks were quantified by qPCR (Figure 4c) and again there were no significant differences (p= 0.0974 by ANOVA) in the titer among the different mutants and the wild type virus. Sequencing of the mutant viruses from the different mutant stocks confirmed that no new or back mutations are present on the viruses and the introduced mutations are stable, which indicates lack of selective disadvantage of the additional CpGs in tissue culture.
A, Schematic representation of the novel CpG sites in the mutant viruses. Red vertical bars symbolize the CpGs in the genomes. B, Titer of the mutant stocks on PT and Cos7 cells. Infected cells were detected by immunofluorescence labeling at 20 and 48 hours PI fixation, respectively. C, Titer of the mutant stocks after 10 additional passages on PT cells. Quantification was carried out by qPCR. Vertical bars indicate the standard deviation in each case.
Methylation status of PPV
To establish the methylation status of PPV genome, the bisulfite conversion based PCR protocol was used, which allows the independent analysis of the methylation of the negative and the positive strands. To cover all CpG sites on the full genome two sets of PCR primers (13 pairs for the negative strand and 11 pairs for the positive strand) were planned. First, the encapsidated negative strand was analyzed from virions originated from permissive (PT) 20 HPI semi-permissive (Cos7) cells 96 HPI and aborted pig embryos. In each case the investigated DNAs were highly hypomethylated (Table 4, Table S2). PPV encapsidates only negative strand, and positive strand can be found almost exclusively only in the replicative form PPV DNA . To exclude that the hypomethylation pattern of the viral DNA would be the result of specific encapsidation of the unmethylated DNA, the methylation pattern of the cellular PPV DNA purified from PPV infected PT cells (20 HPI) was also determined (Table 4). Similarly to the encapsidated negative strand, the positive strand proved to be hypomethylated, suggesting that PPV DNA remains hypomethylated during the entire life cycle of the virus including replication and packaging.
|Positive strand||Negative strand|
|CpG site||Position||Clones sequenced||Methylation||Position||Clones sequenced||Methylation|
1-2 percentage points of the CpG sites on the cloned bisulfite treated PPV fragments remained resistant against C to T conversion indicating a rare occurrence of methylation on the PPV DNA. To gain more accurate data about the methylation level, the bisulfite treated PCR fragments were deep sequenced. Around 168000 PCR fragments were analyzed with 0-22619 coverage of the CpG sites. Most of the CpG sites had more than 92% conversion frequency while around 10% of the CpG sites had less than 92% conversion rate indicating that low level CpG methylation occurs on replicating PPV DNA (Figure 5). Most probably there is no immediate effect of such a low level methylation on PPV replication, however, even low level of methylation can drive the purge of CpGs from the PPV genome during a long period of time, since methylated cytosines are mutational hotspots [41,42].
Vertical bars label the position of the CpGs in the PPV genome. Height of the bars represents the frequency of unmethylated CpGs. Coverage of the CpG sites having more than 8% methylation rate was between 47 and 13694.
In fact, analysis of single nucleotide polymorphism (SNPs) of the PPV genome based on the available sequences of the DNA databank support the high mutability of CpG sites in the PPV genome. On the investigated coding region, the ratio of SNPs in the CpG sites (17 mutations, 38 CpG sites) significantly exceeds the ratio of SNPs in the total number of G and C nucleotides (113 mutations, 1619 G+C nucleotides) (p< 2.2x10-16 by Pearson's Chi-square test) or in the GC sites (29 mutations, 148 GC sites) (p= 0.0014 by Pearson's Chi-square test). Even a higher difference (3,82) can be observed calculating the transition rates (p= 5.413x10-13) by Pearson's Chi-square test) (14 C→T and G→A in CpG sites versus 78 in G and C nucleotides of the genome) (Figure 6).
The frequency was calculated by dividing the rate of SNPs in the CpG and GC sites with the rate of SNPs in the total number of Gs and Cs in the genome.
These data, together with our observation that extra CpG sites in PPV do not interfere with PPV replication, raise the possibility that not only natural selection but mutational pressure might contribute significantly to the low level balance of the CpG sites in the PPV genome.
The effect of methylation on PPV replication
To investigate the effect of CpG methylation on the PPV replication, PPV genomes were in vitro methylated, transfected and their replication initiation capability was compared to that of the bacterially cloned (DAM DCM methylated) and PCR amplified (non-methylated) PPV genomes. In vitro methylation was executed by M.SssI CpG methylase and in each case almost complete CpG methylation (>95%) of the PPV genomes could be achieved, shown by the digestion of the treated DNAs with the SsiI methylation sensitive restriction endonuclease (data not shown). The CpG methylated DNAs and their non CpG methylated counterparts were transfected into PT cells and their virus replication initiation capability was monitored by IF assay at 24 hours post-transfection using 3C9 PPV specific monoclonal antibody. In each case the CpG methylated DNA induced around 62% less viral infection in PT cells than the non CpG methylated control DNAs, indicating that CpG methylation has a relatively modest inhibitory effect on PPV replication (Figure 7a-b). This result was confirmed by qPCR analysis of the supernatant of the transfected cells (Figure 7c). Interestingly, non-methylated PPV dsDNA (PCR amplified) was less effective to initiate viral replication than the bacterially DAM/DCM methylated dsDNA.
A, and B, PT cells on 96 well plate were transfected by 0.2 µg differentially methylated DNA and PPV infected cells were detected by immunofluorescence 24 hour post-transfection.
The fluorescent nuclei count (FNC) in one well is indicated by columns. C, Number of viral genomes in the supernatant of differentially methylated DNA transfected cells 24 and 48 hours post-transfection. Vertical bars indicate standard deviation in each case.
The progeny viruses from CpG methylated PPV DNA transfected cells were collected, and the methylation pattern of their DNA was examined. The in vitro hypermethylated status of the transfected PPV genome could not be detected on the investigated fragments of the genome of the progeny viruses, they even proved to be hypomethylated, similarly to the genome of the native virus (Table S2d).
To gather additional evidence on the sustainability of the hypomethylated status of the PPV genome, the clone of the M123 mutant was also transfected into PT cells and the methylation pattern of the progeny virus was also examined. The 29 newly introduced CpG sites also remained hypomethylated similarly to the “wild type” CpG sites of the native virus (Table S2e).
These experiments strongly suggest that beside the maintenance DNMT1 the de novo DNMT3a and DNMT3b methylases cannot methylate replicating PPV DNA effectively either, and hypomethylation is not restricted to the existing CpG sites: it is a generalized process which extends to the full PPV genome.
Influence of the PPV infection on the transcription, translation, and cellular distribution of DNMT proteins
Hypomethylation of PPV DNA must involve a decreased activity of the cellular DNMTs on PPV DNA and/or a weak susceptibility of the PPV DNA to their action. Absence, inhibition or elimination of DNMTs in infected cells, different compartmentalization of methylases and viral DNA, or the unability of methylases to recognize PPV DNA as substrate singularly or synergistically can lead to hypomethylation of the viral genome. To study the direct reasons behind the hypomethylated status of PPV DNA we have investigated the transcription, translation and subcellular distribution of DNMTs in infected and non-infected host cells. Quantitative measurement of the mRNAs of DNMT1 DNMT3a and DNMT3b by real-time qRT-PCR reveals that the mRNAs of all the three major form of DNMTs are present in PT cells and PPV infection does not change their levels significantly (Figure 8a). As demonstrated by Western-blot and immunofluorescence experiments, the translation level of the DNMT proteins is not affected significantly and localization of the DNMT1 is not changed either by PPV infection in PT cells (Figure 8b-c).
A, Ratio of the mRNA levels of DNMTs in PPV infected/uninfected PT cells. Level of expressions were normalized with GADPH. B, Ratio of the alpha-tubulin normalized DNMT1 and DNMT3a/b protein levels in PPV infected/uninfected PT cells. C, Localization of DNMT1 in infected and uninfected PT cells. Nuclear staining by Hoechst 33342 is shown in blue, DNMT1 and PPV capsid proteins were visualized by indirect immunofluorescence methods and they are presented in green and red, respectively.
The effect of overexpression of the DNMT3a on PPV methylation was also investigated. To ensure the co-expression of the PPV genome and the DNMT3a protein, the pN2 and the human DNMT3a expressing pcDNA3/Myc-DNMT3A [43,44] plasmid were co-transfected into PT cells. Co-transfection of the two plasmids resulted around three times less infectious foci (data not shown) than the cotransfection of the pN2 and the pDSREDmonomer-N1 plasmid which expressed the dsRED protein as a control, indicating an inhibitory effect of DNMT3a on PPV replication. Overexpression of DNMTs has been shown to change the pattern of gene expression in cells and influence cell cycle [45,46]. So it cannot be excluded that the viral inhibition of the overexpressed DNMT3a is rather due to an indirect effect on the host cell regulation than to direct methylase activity on the viral DNA. Especially because bisulfite sequencing of the rescued viruses, emerged from pcDNA3/Myc-DNMT3A and pN2 co-transfection, revealed a hypomethylated status of the viral DNA (Table S2f), demonstrating that DNMT3a, even present in excess, cannot methylate effectively replicating PPV DNA.
Investigation of the GC and CpG contents of 32 parvoviral genomes revealed three distinct groups. Dependoviruses represent a group with high GC and high oCpGr values. As earlier recognized, the relative CpG-rich sequence of dependoviruses makes them an out-group among similarly sized DNA viruses [9,47]. Just like large DNA viruses, they are much less biased toward CpG dinucleotids than the small DNA viruses.
The majority of the known autonomously replicating parvoviruses belong to a different group with opposing characteristics manifesting low GC and low oCpGr values. However, based on our findings, several recently described autonomous parvoviruses seem to differ significantly from this group and feature relatively high oCpGr values combined with low GC content.
This finding somewhat extenuates the conclusion which emerged from the earlier analysis of viral sequences that small DNA viruses – parvoviruses among them – are extremely biased against CpG dinucleotide, and only adeno-associated viruses are exceptions from this rule [9,47].
The methylation patterns of a few DNA viruses have been studied in details. The hypomethylated state of the replicating PPV genome fits very well with what was described about replicating adeno [48,49] papilloma and polyoma viruses [50,51,52,53] and reinforces the emerging view that the genome of small DNA viruses remains hypomethylated during replication. One possible explanation for this phenomenon is the association of the viral DNA with host and viral proteins during rapid viral replication and encapsidation which may simply prevent the interactions with DNMTs. However, the lack of (so far unknown) cis signals for de novo methylation or the presence of chromatin insulator sequences [54,55,56] might also contribute to the sustainment of hypomethylation in the PPV genome. This notion is supported by the fact that while some episomal adenoviral constructs become rapidly de novo methylated , others, like episomal recombinant AAV and even some bacterial plasmid constructs remain hypomethylated for a long time after introduction into the mammalian cells, despite being replication incompetent [58,59]. The inverted terminal repeat (ITR) of AAV is suspected to be an insulator  and most probably has a role in keeping AAV recombinant constructs hypomethylated and transcriptionally active . However, whether the ITRs of PPV or any other parvovirus functions similarly remains to be investigated.
Interestingly, complete in vitro CpG methylation of PPV genome has a moderate inhibitory effect (around 62% decrease) on PPV replication initiation. The decrease was found very reproducible in several experiments and independent of the original methylation status of the treated DNA. These experiments indicate a moderate sensitivity of the PPV genome to methylation. This result is somewhat contradictory to human parvovirus B19 studies, since B19 promoter and replication seem to be very sensitive to CpG methylation . However, B19 has a CpG island and 42 CpGs in its terminal 520 bp promoter/enhancer region overlapping with Sp1, Sp3 and viral NS protein binding elements, while PPV has no CpG island and has only 12 CpGs in its 190 bp long P4 promoter/enhancer region and no CpGs at all in its 140-bp-long P40 promoter region. Recent investigations revealed that the relationship between the methylation status and transcriptional activity of a gene is more complicated than it was originally thought [4,63,64] and different promoters react differently to methylation [63,65]. The different sensitivity of the two parvoviruses to methylation can be explained by the different CpG content of their promoters since the methylation of CpG-poor promoters frequently does not preclude transcription  while the methylation of CpG island promoters usually suppresses their activity [4,65].
In contrast to viruses (e.g hepatitis B virus, Marek’s disease virus, Kaposi's sarcoma-associated herpesvirus and EBV) from other viral families [66,67,68,69], PPV infection does not influence the mRNA or the protein level of the DNA methylases and does not seem to change the localization of the DNMT3a. So it looks like that replicating PPV DNA is a weak substrate of the host’s DNA methylases and even overexpressed DNMT3a cannot raise PPV methylation level significantly.
The genome of the majority of the autonomous parvoviruses – PPV among them – similarly to their hosts are highly GC and CpG depleted. It is widely assumed that the main reason behind CpG depletion in small DNA viruses and RNA viruses is natural selection coming from replicative advantage and/or immune escape [9,47]. However, our findings that the introduction of additional CpGs into the PPV genome has no measurable biological effect (no disadvantage),or in vitro hypermethylation does not significantly inhibit replication initiation of PPV argue against the replicative advantage of CpG depletion. A recent publication about the failure of members of the Parvoviridae family to elicit TLR 9 activated interferon response in plasmacytoid dendritic cells  questions the existence of immunological pressure against CpGs in parvoviral genomes. The ascendant distribution of CpGs by position does not support the presence of immunological pressure against CpGs in parvoviruses either because it would be expected that under such pressure the number of first position CpGs would exceed the number of second or third position CpGs (the degenerate code makes it easy to change the C and G of the CpGs in third and second positions without amino acid changes while point mutations of first position CpGs inevitably come with amino acid changes which could be harmful to the virus).
These data together with our observation that CpG sites are more mutable than GC or C and G sites in the PPV genome suggest that mutational pressure could have a more significant role in the formation of PPV genome than selective forces.
In vertebrate genomes CpG is the most mutable dinucleotide and its occurrence is about 20 % of the expected frequency [5,6]. The high mutability of CpGs and its frequent conversion to TG and CA largely attributed to mutational pressure caused by the spontaneous deamination of 5-methylcytosine. The low level methylation of the PPV genome combined with the high mutation rate of parvoviruses [70,71] and the single stranded DNA genome, which is very susceptible for deamination [72,73] may explain the higher mutability of CpGs and their loss from the PPV genome through countless generations. Reinforcing this argument we have to mention that not only the transition but the transversion rates of the CpG sites are a few times higher in mammals [74,75,76] and non-methylated CpGs still have an approximately three times higher overall mutation rate in the human genome than non CpG nucleotides . These observations suggest a deamination-independent intrinsic mutability of CpGs in the mammalian genome, and raise the possibility that mutational pressure originating from the host replicative machinery may have a significant influence to CpG suppression, at least in small DNA viruses which lack their own DNA polymerases.
Similarly to other organisms, mutational pressure, genetic drift and natural selection together shape the genome of parvoviruses. Our in vitro investigations of different CpG mutants of PPV were not able to reveal any significant evolutionary advantage of CpG depletion in viral replication at cellular level. Future in vivo experiments and monitoring our extra CpG mutants at organism level may help us clarify how much of the loss of CpGs from the PPV genome is the consequence of selective or other evolutionary forces.
Accession numbers of the viral sequences. 1a, Accession numbers and abbreviations of the investigated parvoviral sequences.
1b, Accession numbers of the PPV sequences.
Methylation level of the CpG sites in the PPV genome. 2a, Methylation of the CpG sites in PT cells 8 hours post infection. 2b, Methylation of the CpG sites in Cos 7 cells 96 hours post infection. 2c, Methylation of the CpG sites in PPV originated from aborted pig embryos. 2d, Methylation of the CpG sites in the progeny viruses of the in vitro methylated PPV genome. 2e, Methylation of the new CpG sites in the M123 mutant PPV. 2f, Methylation of the progeny viruses after overexpressing of DNMT3a in PT cells.
Conceived and designed the experiments: RT ZZ AB. Performed the experiments: IM RS DB. Analyzed the data: IM RS DB RT ZZ AB. Contributed reagents/materials/analysis tools: IM RS DB RT ZZ AB. Wrote the manuscript: RT ZZ AB RS DB.
- 1. Ballestar E (2011) An introduction to epigenetics. Adv Exp Med Biol 711: 1–11. doi:10.1007/978-1-4419-8216-2_1. PubMed: 21627038.
- 2. Curradi M, Izzo A, Badaracco G, Landsberger N (2002) Molecular mechanisms of gene silencing mediated by DNA methylation. Mol Cell Biol 22: 3157–3173. doi:10.1128/MCB.22.9.3157-3173.2002. PubMed: 11940673.
- 3. Caiafa P, Zampieri M (2005) DNA methylation and chromatin structure: the puzzling CpG islands. J Cell Biochem 94: 257–265. doi:10.1002/jcb.20325. PubMed: 15546139.
- 4. Jones PA (2012) Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nat Rev Genet 13: 484–492. doi:10.1038/nrg3230. PubMed: 22641018.
- 5. Bird AP (1980) DNA methylation and the frequency of CpG in animal. DNA - Nucleic Acids Res 8: 1499–1504. doi:10.1093/nar/8.7.1499.
- 6. Schorderet DF, Gartler SM (1992) Analysis of CpG suppression in methylated and nonmethylated species. Proc Natl Acad Sci U S A 89: 957–961. doi:10.1073/pnas.89.3.957. PubMed: 1736311.
- 7. Krug A, Towarowski A, Britsch S, Rothenfusser S, Hornung V et al. (2001) Toll-like receptor expression reveals CpG DNA as a unique microbial stimulus for plasmacytoid dendritic cells which synergizes with CD40 ligand to induce high amounts of IL-12. Eur J Immunol 31: 3026–3037. doi:10.1002/1521-4141(2001010)31:10. PubMed: 11592079.
- 8. Rothenfusser S, Hornung V, Krug A, Towarowski A, Krieg AM et al. (2001) Distinct CpG oligonucleotide sequences activate human gamma delta T cells via interferon-alpha/-beta. Eur J Immunol 31: 3525–3534. doi:10.1002/1521-4141(200112)31:12. PubMed: 11745372.
- 9. Hoelzer K, Shackelton LA, Parrish CR (2008) Presence and role of cytosine methylation in DNA viruses of animals. Nucleic Acids Res 36: 2825–2837. doi:10.1093/nar/gkn121. PubMed: 18367473.
- 10. Ambinder RF, Robertson KD, Tao Q (1999) DNA methylation and the Epstein-Barr virus. Semin Cancer Biol 9: 369–375. doi:10.1006/scbi.1999.0137. PubMed: 10547345.
- 11. Davison AJ, Cunningham C, Sauerbier W, McKinnell RG (2006) Genome sequences of two frog herpesviruses. J Gen Virol 87: 3509–3514. doi:10.1099/vir.0.82291-0. PubMed: 17098965.
- 12. Tattersall P (2006) The evolution of parvovirus taxonomy. In: JR KerrSF CotmoreME BloomRM LindenCR Parrish. Parvoviruses. London: Hodder Arnold. pp. 5–14.
- 13. Simmons R, Sharp C, Sims S, Kloverpris H, Goulder P et al. (2011) High frequency, sustained T cell responses to PARV4 suggest viral persistence in vivo. J Infect Dis 203: 1378–1387. doi:10.1093/infdis/jir036. PubMed: 21502079.
- 14. Filipovska-Naumovska E, Thompson MJ, Hopwood D, Pass DA, Wilcox GE (2010) Strain-and age-associated variation in viral persistence and antibody response to mouse parvovirus 1 in experimentally infected mice. J Am Assoc Lab Anim Sci 49: 443–447. PubMed: 20819390.
- 15. Linden RM, Ward P, Giraud C, Winocour E, Berns KI (1996) Site-specific integration by adeno-associated virus. Proc Natl Acad Sci U S A 93: 11288–11294. doi:10.1073/pnas.93.21.11288. PubMed: 8876128.
- 16. Hoelzer K, Parrish CR (2010) The emergence of parvoviruses of carnivores. Vet Res 41: 39. doi:10.1051/vetres/2010011. PubMed: 20152105.
- 17. Kisary J, Derzsy D, Meszaros J (1978) Attenuation of the goose parvovirus strain B. Laboratory and field trials of the attenuated mutant for vaccination against Derzsy's disease. Avian Pathol 7: 397–406. doi:10.1080/03079457808418293. PubMed: 18770393.
- 18. van Leengoed LA, Vos J, Gruys E, Rondhuis P, Brand A (1983) Porcine Parvovirus infection: review and diagnosis in a sow herd with reproductive failure. Vet Q 5: 131–141. Review. doi:10.1080/01652176.1983.9693887. PubMed: 6314634.
- 19. Chorba T, Coccia P, Holman RC, Tattersall P, Anderson LJ et al. (1986) The role of parvovirus B19 in aplastic crisis and erythema infectiosum (fifth disease). J Infect Dis 154: 383–393. doi:10.1093/infdis/154.3.383. PubMed: 3016109.
- 20. Foulongne V, Rodière M, Segondy M (2006) Human Bocavirus in children. Emerg Infect Dis 12: 862–863. doi:10.3201/eid1205.051523. PubMed: 16710957.
- 21. Schildgen V, Malecki M, Tillmann RL, Brockmann M, Schildgen O (2013) The Human Bocavirus is associated with some lung and colorectal cancers and persists in solid tumors. PLOS ONE 8: e68020. doi:10.1371/journal.pone.0068020. PubMed: 23826357.
- 22. Dunne HW, Gobble JL, Hokanson JF, Kradel DC, Bubash GR (1965) Porcine reproductive failure associated with a newly identified "SMEDI" group of picorna viruses. Am J Vet Res 26: 1284–1297. PubMed: 5894163.
- 23. Gradil CM, Joo HS, Molitor TW (1990) Persistence of porcine parvovirus in swine infected in utero and followed through maturity. Zentralbl Veterinarmed B 37: 309–316. PubMed: 2166412.
- 24. Tao Q, Robertson KD (2003) Stealth technology: how Epstein-Barr virus utilizes DNA methylation to cloak itself from immune detection. Clin Immunol 109: 53–63. doi:10.1016/S1521-6616(03)00198-0. PubMed: 14585276.
- 25. Krieg AM, Wu T, Weeratna R, Efler SM, Love-Homan L, Yang L, Yi AK, Short D, Davis HL (1998) Sequence motifs in adenoviral DNA block immune activation by stimulatory CpG motifs. Proc Natl Acad Sci U S A 95: 12631–12636. doi:10.1073/pnas.95.21.12631. PubMed: 9770537.
- 26. Mattei LM, Cotmore SF, Li L, Tattersall P, Iwasaki A (2013) Toll-like receptor 9 in plasmacytoid dendritic cells fails to detect parvovirus. J Virol 87: 3605–3608. doi:10.1128/JVI.03155-12. PubMed: 23302877.
- 27. Marygold SJ, Leyland PC, Seal RL, Goodman JL, Thurmond J, et al. (2013) FlyBase consortium. FlyBase: improvements to the bibliography. Nucleic Acids Res 41 (Database issue): D751–757. doi: 10.1093/nar/gks1024
- 28. Pruitt KD, Harrow J, Harte RA, Wallin C, Diekhans M et al. (2009) The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes. Genome Res 19: 1316–1323. doi:10.1101/gr.080531.108. PubMed: 19498102.
- 29. Harte RA, Farrell CM, Loveland JE, Suner MM, Wilming L et al. (2012) Tracking and coordinating an international curation effort for the CCDS Project. Database (Oxford). p. bas008. PubMed : 22434842.
- 30. Bergeron J, Menezes J, Tijssen P (1993) Genomic organization and mapping of transcription and translation products of the NADL-2 strain of porcine parvovirus. Virology 197: 86–98. doi:10.1006/viro.1993.1569. PubMed: 8212598.
- 31. Gluzman Y (1981) SV40-transformed simian cells support the replication of early SV40 mutants. Cell 23: 175–182. doi:10.1016/0092-8674(81)90282-8. PubMed: 6260373.
- 32. Hirt B (1967) Selective extraction of polyoma DNA from infected mouse cell cultures. J Mol Biol 26: 365–369. doi:10.1016/0022-2836(67)90307-5. PubMed: 4291934.
- 33. Molitor TW, Joo HS, Collett MS (1984) Porcine parvovirus DNA: characterization of the genomic and replicative form DNA of two virus isolates. Virology 137: 241–254. doi:10.1016/0042-6822(84)90216-2. PubMed: 6091327.
- 34. Li LC, Dahiya R (2002) MethPrimer: designing primers for methylation PCRs. Bioinformatics 11: 1427–1431. PubMed: 12424112.
- 35. Zádori Z, Szelei J, Lacoste MC, Li Y, Gariépy S et al. (2001) A viral phospholipase A2 is required for parvovirus infectivity. Dev Cell 1: 291–302. doi:10.1016/S1534-5807(01)00031-4. PubMed: 11702787.
- 36. Zádori Z, Szelei J, Tijssen P (2005) SAT: a late NS protein of porcine parvovirus. J Virol 79: 13129–13138. doi:10.1128/JVI.79.20.13129-13138.2005. PubMed: 16189014.
- 37. Gagnon CA, Tremblay D, Tijssen P, Venne MH, Houde A et al. (2007) The emergence of porcine circovirus 2b genotype (PCV-2b) in swine in Canada. Can Vet J 48: 811–819. PubMed: 17824323.
- 38. Schneider CA, Rasband WS, Eliceiri KW (2012) NIH Image to ImageJ: 25 years of image analysis. Nat Methods 9: 671–675. doi:10.1038/nmeth.2089. PubMed: 22930834.
- 39. Lyko F (2001) DNA methylation learns to fly. Trends Genet 17: 169–172. doi:10.1016/S0168-9525(01)02234-X. PubMed: 11275305.
- 40. Bergeron J, Menezes J, Tijssen P (1993) Genomic organization and mapping of transcription and translation products of the NADL-2 strain of porcine parvovirus. Virology 197: 86–98. doi:10.1006/viro.1993.1569. PubMed: 8212598.
- 41. Holliday R, Grigg GW (1993) DNA methylation and mutation. Mutat Res 285: 61–67. doi:10.1016/0027-5107(93)90052-H. PubMed: 7678134.
- 42. Cooper DN, Gerber-Huber S (1985) DNA methylation and CpG suppression. Cell Differ 17: 199–205. doi:10.1016/0045-6039(85)90488-9. PubMed: 3902251.
- 43. Li H, Rauch T, Chen ZX, Szabó PE, Riggs AD, Pfeifer GP (2006) The histone methyltransferase SETDB1 and the DNA methyltransferase DNMT3A interact directly and localize to promoters silenced is cancer cells. J Biol Chem 281: 19489–19500. doi:10.1074/jbc.M513249200. PubMed: 16682412.
- 44. Chen ZX, Mann JR, Hsieh CL, Riggs AD, Chédin F (2005) Physical and functional interactions between the human DNMT3L protein and members of the de novo methyltransferase family. J Cell Biochem 95: 902–917. doi:10.1002/jcb.20447. PubMed: 15861382.
- 45. Shao G, Zhang R, Zhang S, Jiang S, Liu Y et al. (2013) Splice variants DNMT3B4 and DNMT3B7 overexpression inhibit cell proliferation in 293A cell line. In Vitro Cell Dev Biol Anim 49: 386–394. doi:10.1007/s11626-013-9619-z. PubMed: 23636939.
- 46. Takahashi M, Kamei Y, Ehara T, Yuan X, Suganami T et al. (2013) Analysis of DNA methylation change induced by Dnmt3b in mouse hepatocytes. Biochem Biophys Res Commun 434: 873–878. doi:10.1016/j.bbrc.2013.04.041. PubMed: 23611774.
- 47. Shackelton LA, Parrish CR, Holmes EC (2006) Evolutionary basis of codon usage and nucleotide composition bias in vertebrate DNA viruses. J Mol Evol 62: 551–563. doi:10.1007/s00239-005-0221-1. PubMed: 16557338.
- 48. Gunthert U, Schweiger M, Stupp M, Doerfler W (1976) DNA methylation in adenovirus, adenovirus-transformed cells, and host cells. Proc Natl Acad Sci U S A 73: 3923–3927. doi:10.1073/pnas.73.11.3923. PubMed: 1069277.
- 49. Wienhues U, Doerfler W (1985) Lack of evidence for methylation of parental and newly synthesized adenovirus type 2 DNA in productive infections. J Virol 56: 320–324. PubMed: 2993662.
- 50. Winocour E, Kaye AM, Stollar V (1965) Synthesis and transmethylation of DNA in polyoma-infected cultures. Virology 27: 156–169. doi:10.1016/0042-6822(65)90155-8. PubMed: 4284654.
- 51. Danos O, Katinka M, Yaniv M (1980) Molecular cloning, refined physical map and heterogeneity of methylation sites of papilloma virus type 1a DNA. Eur J Biochem 109: 457–461. doi:10.1111/j.1432-1033.1980.tb04815.x. PubMed: 6250842.
- 52. Burnett TS, Sleeman JP (1984) Uneven distribution of methylation sites within the human papillomavirus la genome: possible relevance to viral gene expression. Nucleic Acids Res 12: 8847–8860. doi:10.1093/nar/12.23.8847. PubMed: 6096809.
- 53. Ford JP, Coca-Prados M, Hsu MT (1980) Enzymatic analysis of 5-methylcytosine content in eukaryotic DNA. Study of intracellular Simian Virus 40. DNA - J Biol Chem 255: 7544–7547.
- 54. Mukhopadhyay R, Yu W, Whitehead J, Xu J, Lezcano M et al. (2004) The binding sites for the chromatin insulator protein CTCF map to DNA methylation-free domains genome-wide. Genome Res 14: 1594–1602. doi:10.1101/gr.2408304. PubMed: 15256511.
- 55. Butcher DT, Mancini-DiNardo DN, Archer TK, Rodenhiser DI (2004) DNA binding sites for putative methylation boundaries in the unmethylated region of the BRCA1 promoter. Int J Cancer 111: 669–678. doi:10.1002/ijc.20324. PubMed: 15252835.
- 56. Filippova GN, Cheng MK, Moore JM, Truong JP, Hu YJ et al. (2005) Boundaries between chromosomal domains of X inactivation and escape bind CTCF and lack CpG methylation during early development. Dev Cell 8: 31–42. doi:10.1016/j.devcel.2004.10.018. PubMed: 15669143.
- 57. Brooks AR, Harkins RN, Wang P, Qian HS, Liu P et al. (2004) Transcriptional silencing is associated with extensive methylation of the CMV promoter following adenoviral gene delivery to muscle. J Gene Med 6: 395–404. doi:10.1002/jgm.516. PubMed: 15079814.
- 58. Pringle IA, Raman S, Sharp WW, Cheng SH, Hyde SC et al. (2005) Detection of plasmid DNA vectors following gene transfer to the murine airways. Gene Ther 12: 1206–1214. doi:10.1038/sj.gt.3302518. PubMed: 15800657.
- 59. Léger A, Le Guiner C, Nickerson ML, McGee Im K, Ferry N et al. (2011) Adeno-associated viral vector-mediated transgene expression is independent of DNA methylation in primate liver and skeletal muscle. PLOS ONE 6: e20881. doi:10.1371/journal.pone.0020881. PubMed: 21687632.
- 60. Daya S, Berns KI (2008) Gene therapy using adeno-associated virus vectors. Clin Microbiol Rev 21: 583–593. doi:10.1128/CMR.00008-08. PubMed: 18854481.
- 61. Penaud-Budloo M, Le Guiner C, Nowrouzi A, Toromanoff A, Chérel Y et al. (2008) Adeno-associated virus vector genomes persist as episomal chromatin in primate muscle. J Virol 16: 7875–7885. PubMed: 18524821.
- 62. Bonvicini F, Manaresi E, Di Furio F, De Falco L, Gallinella G (2012) Parvovirus b19 DNA CpG dinucleotide methylation and epigenetic regulation of viral expression. PLOS ONE 7: e33316. doi:10.1371/journal.pone.0033316. PubMed: 22413013.
- 63. Weber M, Hellmann I, Stadler MB, Ramos L, Pääbo S et al. (2007) Distribution, silencing potential and evolutionary impact of promoter DNA methylation in the human genome. Nat Genet 39: 457–466. doi:10.1038/ng1990. PubMed: 17334365.
- 64. Meissner A, Mikkelsen TS, Gu H, Wernig M, Hanna J et al. (2008) Genome-scale DNA methylation maps of pluripotent and differentiated cells. Nature 454: 766–770. PubMed: 18600261.
- 65. Zeisberg EM, Zeisberg M (2013) The role of promoter hypermethylation in fibroblast activation and fibrogenesis. J Pathol 229: 264–273. doi:10.1002/path.4120. PubMed: 23097091.
- 66. Lu F, Stedman W, Yousef M, Renne R, Lieberman PM (2010) Epigenetic regulation of Kaposi's sarcoma-associated herpesvirus latency by virus-encoded microRNAs that target Rta and the cellular Rbl2-DNMT pathway. J Virol 84: 2697–2706. doi:10.1128/JVI.01997-09. PubMed: 20071580.
- 67. Leonard S, Wei W, Anderton J, Vockerodt M, Rowe M et al. (2011) Epigenetic and transcriptional changes which follow Epstein-Barr virus infection of germinal center B cells and their relevance to the pathogenesis of Hodgkin's lymphoma. J Virol 85: 9568–9577. doi:10.1128/JVI.00468-11. PubMed: 21752916.
- 68. Luo J, Yu Y, Chang S, Tian F, Zhang H et al. (2012) DNA methylation fluctuation induced by virus infection differs between MD-resistant and –susceptible chickens. Front Genet 3: 20. PubMed: 22363343.
- 69. Jung JK, Arora P, Pagano JS, Jang KL (2007) Expression of DNA methyltransferase 1 is activated by hepatitis B virus X protein via a regulatory circuit involving the p16INK4a-cyclin D1-CDK 4/6-pRb-E2F1 pathway. Cancer Res 67: 5771–5778. doi:10.1158/0008-5472.CAN-07-0529. PubMed: 17575144.
- 70. Duffy S, Shackelton LA, Holmes EC (2008) Rates of evolutionary change in viruses: patterns and determinants. Nat Rev Genet 9: 267–276. doi:10.1038/nrm2380. PubMed: 18319742.
- 71. Stucker KM, Pagan I, Cifuente JO, Kaelber JT, Lillie TD et al. (2012) The role of evolutionary intermediates in the host adaptation of canine parvovirus. J Virol 86: 1514–1521. doi:10.1128/JVI.06222-11. PubMed: 22114336.
- 72. Ehrlich M, Norris KF, Wang RY, Kuo KC, Gehrke CW (1986) DNA cytosine methylation and heat-induced deamination. Biosci Rep 6: 387–393. doi:10.1007/BF01116426. PubMed: 3527293.
- 73. Lindahl T, Nyberg B (1974) Heat-induced deamination of cytosine residues in deoxyribonucleic acid. Biochemistry 13: 3405–3410. doi:10.1021/bi00713a035. PubMed: 4601435.
- 74. Hodgkinson A, Eyre-Walker A (2011) Variation in the mutation rate across mammalian genomes. Nat Rev Genet 12: 756–766. doi:10.1038/nrg3098. PubMed: 21969038.
- 75. Hwang DG, Green P (2004) Bayesian Markov chain Monte Carlo sequence analysis reveals varying neutral substitution patterns in mammalian evolution. Proc Natl Acad Sci U S A 101: 13994–14001. doi:10.1073/pnas.0404142101. PubMed: 15292512.
- 76. Nachman MW, Crowell SL (2000) Estimate of the mutation rate per nucleotide in humans. Genetics 156: 297–304. PubMed: 10978293.
- 77. Xia J, Han L, Zhao Z (2012) Investigating the relationship of DNA methylation with mutation rate and allele frequency in the human genome. BMC Genomics 13 Suppl 8: S7. doi:10.1186/1471-2164-13-S3-S7. PubMed: 23281708.