The novel human papillomavirus type 154 (HPV154) was characterized from a wart on the crena ani of a three-year-old boy. It was previously designated as the putative HPV type FADI3 by sequencing of a subgenomic FAP amplicon. We obtained the complete genome by combined methods including rolling circle amplification (RCA), genome walking through an adapted method for detection of integrated papillomavirus sequences by ligation-mediated PCR (DIPS-PCR), long-range PCR, and finally by cloning of four overlapping amplicons. Phylogenetically, the HPV154 genome clustered together with members of the proposed species Gammapapillomavirus 11, and demonstrated the highest identity in L1 to HPV136 (68.6%). The HPV154 was detected in 3% (2/62) of forehead skin swabs from healthy children. In addition, the different detection sites of 62 gammapapillomaviruses were summarized in order to analyze their tissue tropism. Several of these HPV types have been detected from multiple sources such as skin, oral, nasal, and genital sites, suggesting that the gammapapillomaviruses are generalists with a broader tissue tropism than previously appreciated. The study expands current knowledge concerning genetic diversity and tropism among HPV types in the rapidly growing gammapapillomavirus genus.
Citation: Ure AE, Forslund O (2014) Characterization of Human Papillomavirus Type 154 and Tissue Tropism of Gammapapillomaviruses. PLoS ONE 9(2): e89342. doi:10.1371/journal.pone.0089342
Editor: Robert D. Burk, Albert Einstein College of Medicine, United States of America
Received: November 22, 2013; Accepted: January 19, 2014; Published: February 13, 2014
Copyright: © 2014 Ure, Forslund. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by the Swedish Cancer Society (http://www.cancerfonden.se/), grant CAN2009/867, and by the Skåne Regional Research Funds (http://www.skane.se), grant ALF12711. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
The papillomaviruses (PVs) are small viruses with icosahedral symmetry and a circular, double-stranded genome. These viruses are widely distributed across vertebrates, and among humans alone, complete genomes of more than 170 distinct types have so far been sequenced. The papillomaviruses are epitheliotropic and produce hyperproliferation of squamous cells known as papillomas. Human papillomaviruses (HPV) are divided into high-risk types, which are etiologically associated with cancer of the cervix uteri, and low-risk types, which produce benign warts. Regarding the taxonomy of papillomaviruses, the International Committee for the Taxonomy of Viruses (ICTV) recommends inclusion of both genotypic and phenotypic information in the definition of viral genera and species. Nevertheless, papillomaviruses have been an exception to the classical rules, and a system based mainly on sequence identity was adopted (reviewed in de Villiers, 2013). Accordingly, 70% sequence identity between L1 ORFs is used as a cut-off guide to classify viruses as belonging to the same species. Even so, there is a gray zone (67.5%–70.5% identity) where the distribution curves for inter- and intraspecies identities overlap, and consequently the classification must be curated. The papillomaviruses have in the past been classified as mucosal or cutaneous according to their tropism. The human cutaneotropic papillomaviruses are represented mainly by the genera Beta- and Gammapapillomavirus (β-PV and γ-PV). Their prevalence and natural history, for example acquisition soon after birth, reflect a commensal interaction with the immunocompetent host. Historically, several cutaneous papillomaviruses have been isolated from patients with the genetic disease epidermodysplasia verruciformis, and from immunosuppressed patients, probably due to higher viral loads among these patient groups.
As a consequence of improved methods, several PVs have been characterized, and the number of HPV types of the γ-PV genus has been expanded from 16 HPV types in 2010 to 62 representative HPV types in 2013 (retrieved from GenBank, September 2013). A review of the isolation sources of these HPV types could lead to increased knowledge of their tropism. Here, we categorized the different isolation sites of the γ-PV as genital, oral, nasal or cutaneous (including skin lesions and healthy skin). The aim of the study was to obtain the complete sequence of a novel papillomavirus, and to present a summary of the isolation sources of the rapidly growing genus Gammapapillomavirus.
HPV154 was isolated from a wart on the crena ani of a three-year-old boy. The index sample was negative for HPV by PCR using MGP primers and a Luminex system, but positive for HPV by FAP-PCR. The FAP amplicon was sequenced and showed the highest identity to the FADI3 fragment (99.3%, acc. no. FJ480954). FADI3 was originally amplified from a forehead swab of a six-year-old girl.
The FAP amplicon represented a putative novel type, as the partial L1 ORF shared less than 90% sequence identity with any other known PV type. With or without pre-amplification by rolling circle amplification (RCA), we failed to amplify the complete genome by long-range PCR or with PCRs designed to cover half the genome by a combination of L1-specific primers with degenerated E1 primers (data not shown). As a consequence, we adapted the method for detection of integrated papillomavirus sequences by ligation-mediated PCR (DIPS-PCR) to obtain sequence information outside the FAP amplicon region of the L1 ORF. In order to test the performance of the DIPS-PCR method, we verified the sequence around the integration site of HPV16 into the genome of SiHa cells (data not shown). Several DIPS-PCRs and PCR with degenerated HPV primers were used to obtain sequence data from L2 to almost the end of L1 (Figure 1). Proximal to the ends of that region, new primers were designed that were combined with degenerated primers based on related γ-PVs, and two amplicons of ∼2000 bp were cloned and sequenced. In order to obtain the complete genome, four overlapping amplicons were obtained by PCR with specific primers (Figure 1). The viral load of HPV154 was 68 genomes per human cell (Table 1).
The ORFs are indicated with light blue arrows. The primers used in the DIPS-PCR are represented with half arrows. The inner circle shows the strategies employed, and the outer circle shows the final clones. Putative binding sites for viral proteins and cellular factors are shown as follows: E2 binding site, E2BS (▾); E1 binding site, E1BS (Δ); TATA-box (▪); Polyadenylation signal (•).
The four clones were submitted to the International Reference Center for Human Papillomaviruses at the German Cancer Research Center, Heidelberg, Germany, and the compiled sequence was verified and officially designated as HPV154 (GenBank JN211193).
The genome of HPV154 comprised 7286 bp with a GC content of 37.9%. The most closely related type was HPV136, isolated from the oral cavity, with 68.6% identity at the L1 ORF, whereas the uncloned HPV isolate KN3, obtained by high throughput sequencing from healthy skin, showed 71.8% identity at L1.
HPV154 demonstrated the classical genomic organization of PVs, with seven ORFs identified (Figure 1). The putative E6 protein had two zinc-finger domains (CX2CX29/30CX2C) that were separated by 36 amino acids, which are conserved among PVs. Similarly, there was one zinc-finger domain in the inferred E7 protein, as well as the tumor suppressor (pRB) binding domain (LXCXE). In the putative E1 protein, the superfamily 3 ATP-dependent helicase domain was identified. The theoretical E2 protein had the typical C-terminal DNA-binding domain and the N-terminal trans-activation domain. The E4 ORF showed a start codon; nevertheless it was ignored as we identified the characteristic donor (AAG/GUASNR) and acceptor (GUYACYAG/YU) RNA splicing sites, and the resulting putative E1∧E4 fusion protein with six amino acids from the E1 N-terminal end.
The early polyadenylation site (AATAAA) for processing of early mRNAs was located at the 5′ end of the L2 ORF, while the late polyadenylation site was downstream of L1 within the upstream regulatory region (URR). The URR was relatively short, being 517 bp, and contained six putative E2 binding sites (E2BS, ACCN6GGT). Near the E2BS proximal to E6, several E1 binding sites (AACAAT) or related AT-rich sequences were identified and probably represent the origin of replication. A TATA box was found (pos. 6973–6977) and surrounded by two E2BS in close proximity (2 and 14 bp).
As the FADI3 amplicon and HPV154 were obtained from samples from children, we tested whether this PV type might have an increased presence in this group. Among the cutaneous swab samples, HPV154 was detected in 3% (2/62) by real-time PCR. One of the samples showed a viral load of 0.093 genomes and the other 1.3 genomes per human cell (Table 1).
The phylogenetic relationships of HPV154 were inferred based on the complete genomes of 91 HPV-sequences (Figure 2). HPV154 was positioned in a divergent group, along with the members of the currently proposed γ-PV11 species (pending ICTV approval) (Figure 2). HPV154 is below the 70% identity limit suggested for species definition compared to its closest relative type, HPV136 (68.6%). Moreover, the lowest values of identity for HPV154 within γ-PV11 (Figure 2) are with HPV140 (65.37%) and HPV169 (65.86%).
Ninety-one complete genomes were analyzed, including all gammapapillomaviruses as well as genera more closely related to them than the betapapillomaviruses. HPV154 is indicated with an arrow. The tree was obtained by the maximum likelihood approach using RAxML software and rooted with the betapapillomaviruses HPV5 and HPV9. Bootstrap support values are indicated in each branch as percentages. Genera are indicated at the rightmost side, while species of γ-PVs are to the right of brackets. The dotted key to the left of γ-PV11* shows species suggested in this work. The bar indicates the number of substitutions per site. * Proposed category and pending approval by ICTV. †Putative novel species. ‡: Uncloned HPV genome (not official type). The following species are represented by the types included in the tree: AsPV1, Apodemus sylvaticus papillomavirus 1; BPV, Bos taurus papillomavirus spp.; ChPV1, Capra hircus papillomavirus 1; CPV, Canis familiaris papillomavirus spp.; DdPV1, Delphinus delphis papillomavirus 1; HPV, Gammapapillomavirus spp.; McPV2, Mastomys coucha Papillomavirus 2; PphPV, Phocoena phocoena papillomavirus spp.; PsPV1, Phocoena spinipinnis Papillomavirus 1; TtPV, Tursiops truncatus Papillomavirus spp.
In order to increase knowledge of the biological niches of gammapapillomaviruses, we enumerated the isolation sources (complete HPV-genomes) and detection sites of all HPV types from this genus (Table 2). The γ-PV1 species contains members isolated from a wide variety of sites, including healthy tissue such as the skin, nasal cavity and male genitalia. However, they are also found in non-healthy tissue such as common warts, pigmented verrucas, squamous cell carcinoma (SCC) and actinic keratosis (AK). In a similar fashion, HPV types of the γ-PV3, γ-PV7, γ-PV8, γ-PV9, γ-PV10, γ-PV11 and γ-PV15 species have been found in several different sites, including both mucosal and cutaneous sources (Table 2). The γ-PV6 members (HPV101, 103 and 108) are the only γ-PVs restricted to genital and oral mucosal sites (six isolates). On the other hand, the γ-PV12 members have been isolated only from warts and healthy skin (eight isolates). The remaining species, which include γ-PV2, γ-PV4, γ-PV5, γ-PV13, γ-PV14, γ-PV16, γ-PV17 and the six putative novel species (Figure 2), have members with fewer than three findings, but include both mucosal and cutaneous sites (Table 2).
From a wart on the crena ani of a three-year-old boy we characterized the novel HPV154, which was added to the rapidly growing genus Gammapapillomavirus. The genome of HPV154 demonstrated the typical genomic organization of papillomaviruses and lacked the E5 ORF, as do γ-, β- and µ-HPVs. The prevalence of HPV154 was 3% in skin swabs from healthy children, which is in a similar range as other HPV types of the beta- and gammapapillomavirus genera detected among adults. A limitation of the study was that only samples from children were used in the screening of HPV154. Another limitation was that HPV154 was characterized from a swab sample, so we cannot assert that the virus caused the lesion. Our initial approach of obtaining the complete HPV genome by long-range PCR failed, which may be due to the low concentration of HPV154 (Table 1) and/or fragmentation of the genome. Instead we used PCRs with degenerated HPV primers and an adapted version of the DIPS-PCR method. It was notable that several sequence reads of the restriction enzyme site for the adapter ligation of DIPS-PCRs could not be verified in the final sequence, which might indicate star activity of the restriction enzyme prior to ligation of adapters. Thus, compiled sequences from DIPS-PCR should be verified by independent PCR. Nevertheless, we showed that DIPS-PCR can expand the length of sequence information, which in turn allows HPV primers to be designed at positions that reduce the size of the targeted long-range amplicons and would probably increase amplification efficiency. However, our approach is laborious and time-consuming compared to modern methods such as high-throughput sequencing, and would be a feasible alternative only in cases where the latter methods are not available or other classical methods such as RCA and long-range PCR have failed.
HPV154 demonstrated the typical genomic organization of papillomaviruses, but lacked the E5 ORF, as do all known members of the β-, γ- and µ-PV genera. The prevalence of HPV154 (3%, 2/62) in skin swabs from healthy children was in a similar low range as for other HPV types of the β- and γ-PV genera detected among adults.
Concerning the presented phylogenetic tree, HPV154 was placed with high support along with members of the currently proposed γ-PV11 species. Nevertheless, the percentages of L1 ORF sequence identity among different members of this group were closer to the interspecies center of the distribution (64.5%) than to the intraspecies center (72%) as shown in the histograms of L1 sequence identity from 189 PVs. Even using 68.5% of identity as a more stringent/divergent limit for inter-species distance, three different species could be delimited, as shown with a gray dotted key in Figure 2.
Accordingly, a more stringent/divergent limit of 68.5% identity allowed us to split the proposed γ-PV11 into three putative novel species. However, the proposed γ-PV11 was based on HPV types with tropism for the oral cavity, but additional HPV types within this species had been isolated from healthy skin and warts as in the case of HPV154 (Table 2). According to ICTV “A species is a monophyletic group of viruses whose properties can be distinguished from those of other species by multiple criteria” (http://www.ictvonline.org/codeOfVirusClassification.asp), but among a majority of the γ-PV species it is difficult to find distinctive characteristics besides sequence identity. Although species definition for γ-PV is currently based mainly on sequence identity, the criteria have varied over time, and hence have not been applied homogeneously to all members of the genus. For example, HPV48 (γ-PV2) shares 68.01% identity with HPV50 (γ-PV3) and 69.92% with HPV131 (proposed γ-PV14). A system with well supported phylogeny would help to define species, and it may be safer to leave new members unclassified at the species level until more isolates are sequenced.
With regard to the isolation sites enumerated in Table 2, it seems that γ-PVs can be found in a broader variety of sites than previously appreciated. Diverse sites are found for members of the same species or even type. Even though very few studies have searched for papillomavirus in the oral or nasal cavities, members belonging to nearly every species have been identified in those studies. Altogether, this supports the idea that γ-PVs are generalists with various forms of tropism. The only exceptions seem to be the members of the γ-PV6 species, HPV101, HPV103 and HPV108, as they appear to have only mucosal tropism. This group also distinctively lacks the E6 ORF and has been associated with disease.
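The identity thresholds discussed here can be made concrete with a small sketch. It is only an illustration of the cut-offs quoted in this article (90% for a novel type, 70% as the species guide, and the 67.5%–70.5% gray zone); the function name and the category labels are hypothetical, and the sketch deliberately ignores phylogeny, which the text argues should also be considered.

```python
# Illustrative thresholds taken from the text; not an official ICTV algorithm.
TYPE_CUTOFF = 90.0          # below this, a partial/complete L1 suggests a novel type
SPECIES_CUTOFF = 70.0       # guide value for "same species"
GRAY_ZONE = (67.5, 70.5)    # inter-/intraspecies distributions overlap here


def classify_l1_identity(pct: float) -> str:
    """Return a rough category for a pairwise L1 ORF identity (in percent)."""
    if not 0.0 <= pct <= 100.0:
        raise ValueError("identity must be a percentage between 0 and 100")
    if pct >= TYPE_CUTOFF:
        return "same type"
    if GRAY_ZONE[0] <= pct <= GRAY_ZONE[1]:
        return "gray zone: curate manually"
    if pct >= SPECIES_CUTOFF:
        return "same species, different type"
    return "different species"


# Pairwise L1 identities quoted in this article.
PAIRS = {
    ("HPV154", "HPV136"): 68.6,
    ("HPV154", "KN3"): 71.8,
    ("HPV154", "HPV140"): 65.37,
    ("HPV48", "HPV50"): 68.01,
    ("HPV48", "HPV131"): 69.92,
}

if __name__ == "__main__":
    for (a, b), pct in PAIRS.items():
        print(f"{a} vs {b}: {pct:.2f}% -> {classify_l1_identity(pct)}")
```

Under these assumed rules, three of the five quoted pairs fall inside the gray zone, which is consistent with the argument that identity alone does not settle species assignment in this genus.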
However, our approach to summarize isolation sites of gammapapillomaviruses described in reports provides only a suggestive mode to study the tropism of these viruses. In order to perform a rigorous evaluation of the tropism for each HPV type, a random population study with samples collected from several sites is needed.
In this study we successfully characterized the novel HPV154, thereby expanding knowledge about the diversity and tropism of HPV types in the rapidly growing γ-PV genus. In addition, we suggest that gammapapillomaviruses are generalists with broad tissue tropism. We expect that improved amplification methods will promote the discovery of additional HPV types of the γ-PV genus, and will shed light on its apparently wider tropism compared to other papillomavirus genera.
Materials and Methods
Ethical approval for this study was granted by the Ethical Committee of Lund University (LU 106-01). The samples were obtained with written informed consent from the parents of the minors.
Sample Processing: DNA Extraction
A swab suspension of 200 µl was processed using the automated MagNA Pure LC with the Total Nucleic acid kit (Roche), and eluted in 100 µl.
Standard HPV Analysis
Sample adequacy was assessed by testing 5 µL of the sample for the human β-globin gene with a real-time PCR. For simultaneous identification of 20 genital HPV types (6, 11, 16, 18, 31, 33, 35, 39, 42, 45, 51, 52, 56, 58, 59, 66, 68, 70, 73, 82), 5 µL of extracted material was added to a total volume of 25 µL for MGP-PCR and subsequent Luminex analysis. Five microliters was used for FAP-PCR targeting the L1 ORF.
The eluted total DNA was subjected to rolling-circle amplification (RCA) using the illustra TempliPhi 100 Amplification Kit (GE Healthcare), following the manufacturer’s instructions with slight modifications. As indicated elsewhere, the final concentration of each dNTP was increased to 450 µM and the reaction was incubated overnight.
Genome Walking with DIPS-PCR
The RCA product (∼0.5 µg) was digested in order to be used as input for DIPS-PCR with either TaqI, Sau3AI (Bsp143I), FatI, XbaI or HindIII enzymes (10 units, 20 µl final volume) following the manufacturer’s instructions (Fermentas). This method employs short adapters specific to each enzyme. These short oligos were synthesized to be compatible with enzymes TaqI (cgcaacgtgtaagtctg), Sau3AI (gatccaacgtgtaagtctg), FatI (catgcaacgtgtaagtctg), XbaI (ctagcaacgtgtaagtctg), and HindIII (agctcaacgtgtaagtctg); and all of them were modified to contain a phosphate group at the 5′ end and an amino group (Amino C3) at the 3′ end (MWG, Germany). The last three enzymes were added in this study in order to increase the likelihood of obtaining longer fragments and to speed up the overall process. Each of the short adapters was combined with a unique long adapter (gggccatcagtcagcagtcgtagccggatccagacttacacgttg) in equimolar quantities, to a final concentration of 25 µM. The combined oligos were incubated in a thermocycler for 4 min at 94°C, and then incubated for 5 s at 95°C. This process was repeated 18 times, reducing the temperature by 5°C in each step. The formed adapters were then aliquoted and stored at −20°C. The previously digested RCA product was combined with the appropriate (enzyme-specific) double-stranded adapter (50 pmol) and ligated by adding ATP (0.5 mM), DTT (10 mM) and 1.75 U of ligase (Fermentas) to a final volume of 27 µl. The reaction was incubated at 16°C overnight, then diluted to 40 µl with water and used as template for all subsequent PCR steps. Two microliters of the ligation products were subjected to an initial linear PCR amplification using a viral specific primer (0.2 µM) in 1x PCR buffer (Roche), 1.8 mM MgCl2, 0.2 mM each dNTP, 1.25 U of Taq polymerase (AmpliTaq Gold, Roche). The cycling conditions were 94°C 5 min; 45 cycles of 94°C 15 s, 50°C 20 s, 72°C 3 min; 72°C 5 min.
A second PCR (exponential) using 2 µl of the linear PCR as a template was made using a nested viral-specific primer (primer sequence available upon request) combined with the adapter-specific “AP1” primer (ggccatcagtcagcagtcgtag) in the same conditions as the linear PCR but with 0.5 µM of both primers. The cycling conditions were 94°C 5 min; 35 cycles of 94°C 15 s, 55°C 20 s, 72°C 3 min; 72°C 5 min. The PCR products were evaluated by electrophoresis, and the longest product (from the different restriction enzymes) was selected, cloned using a TOPO-TA cloning kit (Invitrogen) according to the manufacturer’s instructions, and sequenced at MWG (Germany). The novel sequence was used to design primers with Oligo 7 software (Molecular Biology Insights, USA) in order to carry out successive steps of linear/exponential PCR.
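The adapter design described above can be checked computationally. The following is a minimal sketch, not part of the original protocol: it assumes that each short adapter consists of an enzyme-specific 5′ overhang followed by a shared 15-nt stem that anneals to the 3′ end of the long adapter (the stem length is inferred from the sequences given, not stated in the text), and it also locates the AP1 primer within the long adapter. The function names are hypothetical.

```python
# Sequences copied from the Methods text.
LONG_ADAPTER = "gggccatcagtcagcagtcgtagccggatccagacttacacgttg"
AP1 = "ggccatcagtcagcagtcgtag"
SHORT_ADAPTERS = {
    "TaqI": "cgcaacgtgtaagtctg",
    "Sau3AI": "gatccaacgtgtaagtctg",
    "FatI": "catgcaacgtgtaagtctg",
    "XbaI": "ctagcaacgtgtaagtctg",
    "HindIII": "agctcaacgtgtaagtctg",
}
STEM_LEN = 15  # assumption: length of the region shared by all short adapters

_COMPLEMENT = str.maketrans("acgt", "tgca")


def revcomp(seq: str) -> str:
    """Reverse complement of a lowercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def split_adapter(short: str) -> tuple:
    """Split a short adapter into (5' overhang, shared stem)."""
    return short[:-STEM_LEN], short[-STEM_LEN:]


def check_adapters() -> dict:
    """Map enzyme -> (overhang, whether the stem anneals to the long adapter's 3' end)."""
    report = {}
    for enzyme, short in SHORT_ADAPTERS.items():
        overhang, stem = split_adapter(short)
        anneals = revcomp(stem) == LONG_ADAPTER[-STEM_LEN:]
        report[enzyme] = (overhang, anneals)
    return report


if __name__ == "__main__":
    for enzyme, (overhang, anneals) in check_adapters().items():
        print(f"{enzyme}: overhang={overhang} stem anneals={anneals}")
    # 0-based offset of the AP1 primer within the long adapter
    print("AP1 offset in long adapter:", LONG_ADAPTER.find(AP1))
```

Under this reading, the leading 2 to 4 nucleotides of each short adapter appear to correspond to the cohesive end left by the respective enzyme, while AP1 matches the 5′ portion of the long adapter, so that only ligated fragments can be amplified exponentially.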
PCR with Degenerated Primers and Overlapping Amplicons
Primers were designed in the L2 ORF from a multiple alignment of HPV sequences (HPV48, 50, 60, 65, 88, 95, 112 and 116) related to the known FAP region of HPV154. The degenerated primer “L2 gamma F” (tttgratwtgaaaatcccgccttt) was used in conjunction with the HPV154 specific primer “PV77 FADI3 R” (gacctgtgctaccgactccaag). The cycling conditions were 94°C for 5 min; 40 cycles of 94°C 15 s, 52°C 20 s, 72°C 1 min and a final extension of 72°C 7 min. The PCR product was purified with an Illustra Microspin S-300 HR column (GE Healthcare) and sequenced with a nested specific primer.
A second set of degenerated primers was designed for the E1 ORF. The primers ‘E1 gamma Fnew’ (gacagtggdatwkddgaagatgaa) and ‘E1 gamma Rnew’ (ttcatcttcddmwatdccactgtc) were combined with ‘FADI3 −2 nes R’ and ‘FADI3 +2 nes F’, respectively (0.3 µM final concentration). One microliter of RCA-pre-amplified template was subjected to Long Range PCR (Expand Long Template PCR System, Roche) amplification in 1x PCR buffer 2 (Roche), 0.5 mM each dNTP, 0.7 U of Enzyme mix (Expand Long Template PCR System, Roche). The cycling conditions were 94°C 2 min; 10 cycles of 94°C 15 s, 50°C 1 min, 68°C 4 min and 30 cycles of 94°C 15 s, 59°C 30 s, 68°C 4 min. The product was electrophoresed and the bands excised from the agarose gel, purified with a QIAquick Gel Extraction kit (Qiagen) and cloned using a TOPO-TA cloning kit (Invitrogen), according to the manufacturer’s instructions.
The remainder of the genome was amplified using an Expand High Fidelity PCR system (Roche, Mannheim, Germany) with 2.5 µl of the RCA amplified DNA (diluted 1∶100) as a template; 0.3 µM of each primer, 3.5 mM MgCl2 and 1.24 U of enzyme in a final volume of 50 µl. The cycling was 94°C 2 min; 40 cycles 94°C 15 s, 55°C 20 s, 72°C 2 min increasing 5 s/cycle; 72°C 10 min. The product was purified and cloned as indicated before.
Screening of Samples from Children
Forehead samples from children from three age groups, one month (23), one year (19), and four years old (20), were obtained from a previous study and used as templates for the screening. The samples were swab suspensions in 0.9% NaCl, and were used directly as PCR templates. Primers intended for real-time PCR were designed using Oligo 7 (Molecular Biology Insights, USA) and tested for possible cross-reactivity with other types using Primer-BLAST. The resulting primers were ‘HPV154 5511 F’ (accgtggtggtcctcttgg) and ‘HPV154 5647 R’ (tgggtcaaaggaaatgttttgg). The real-time PCR was carried out in an ABI 7500 Real-Time PCR Instrument (Applied Biosystems). Each test tube contained Power SYBR Green PCR Master Mix (Applied Biosystems), 0.3 µM of each primer, and 2.5 µl of template in a final volume of 25 µl. Pipetting was automated using a Qiagility device (Qiagen). The cycling conditions were 95°C 10 min and 45 cycles 95°C 15 s, 60°C 60 s, followed by melting curve analysis. For data analysis the ABI 7500 software v2.0.6 was used and the threshold for positivity was automatically calculated. The specificity of the HPV154 PCR was also analysed by gel-electrophoresis for identification of the expected 137 bp amplicon. Plasmid DNA concentration of HPV154 clone L (Figure 1) was quantified using a spectrophotometer (NanoDrop ND-1000, Nanodrop Technologies, Oxfordshire, UK). The sensitivity of the method was evaluated using serial dilutions of the clone L (Figure 1), and 10 copies were detected in a background of 1 ng of human DNA (Sigma-Aldrich, art. D 7011). The positive HPV154 control had a Tm of 75.4°C (CV: 0.2%, based on four measurements).
Viral Load of HPV154 Positive Samples
The number of viral genomes per cell was quantified by carrying out two separate quantitative real-time PCR assays to amplify a part of the HPV154 L2 gene and the human β-globin gene. For the quantification of HPV154 the primers and PCR conditions were identical to those used for screening of samples from children as described above. Quantification was extrapolated from a linear regression standard curve obtained from serial dilutions of 100,000 to 100 copies per PCR of HPV154 plasmid DNA (clone L, Figure 1) in a background of 10 ng human placenta DNA (Sigma-Aldrich, art. D 7011). The standard curve had a slope of −3.4, y intercept of 37.3 and r2 of 0.99. The PCR efficiency calculated from slope was 97.5%. Similarly, in order to calculate the number of cells analyzed per sample, the β-globin gene was amplified with PC03 and PC04 primers in a 25 µl PCR reaction containing 2.5 µL template. The standard curve was obtained from serial dilutions of 50,000 to 50 copies per PCR of the β-globin gene using human placenta DNA (Sigma, art. no D7011). The standard curve had a slope of −3.4, y intercept of 39.9 and r2 of 0.99. The PCR efficiency calculated from slope was 97.5%. To calculate the number of human cells per sample, the copy number of β-globin was divided by two. We assumed that each human cell carries two β-globin gene copies and that the diploid genome equivalent contains ∼6.6 pg DNA. No-template controls (water) were tested in both the HPV154 PCR and the human β-globin gene PCR. All samples were analyzed in triplicate. The coefficient of variation was calculated for each triplicate measurement of viral copy number per human cell. In the quantitative PCR, negative controls showed no Ct values.
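The arithmetic behind the viral load estimate can be sketched as follows. This is an illustrative reconstruction, not the authors' analysis script: it uses the standard relationships Ct = slope × log10(copies) + intercept and efficiency = 10^(−1/slope) − 1, the rounded curve parameters reported above, and hypothetical Ct values. Because both assays used the same template volume (2.5 µl), the per-reaction copy numbers are compared directly.

```python
# Rounded standard-curve parameters reported in the text: (slope, y intercept).
HPV154_CURVE = (-3.4, 37.3)
BETA_GLOBIN_CURVE = (-3.4, 39.9)
GLOBIN_COPIES_PER_CELL = 2.0  # assumption stated in the text: two copies per diploid cell


def pcr_efficiency(slope: float) -> float:
    """Amplification efficiency as a fraction (1.0 means perfect doubling per cycle)."""
    return 10 ** (-1.0 / slope) - 1.0


def copies_from_ct(ct: float, slope: float, intercept: float) -> float:
    """Invert the standard curve Ct = slope * log10(copies) + intercept."""
    return 10 ** ((ct - intercept) / slope)


def genomes_per_cell(hpv_ct: float, globin_ct: float) -> float:
    """Viral genomes per human cell from the Ct values of the two assays."""
    hpv_copies = copies_from_ct(hpv_ct, *HPV154_CURVE)
    globin_copies = copies_from_ct(globin_ct, *BETA_GLOBIN_CURVE)
    cells = globin_copies / GLOBIN_COPIES_PER_CELL
    return hpv_copies / cells


if __name__ == "__main__":
    print(f"efficiency from slope -3.4: {pcr_efficiency(-3.4):.1%}")
    # Hypothetical Ct values, chosen only to illustrate the calculation.
    print(f"genomes per cell: {genomes_per_cell(hpv_ct=30.5, globin_ct=33.1):.2f}")
```

With the rounded slope of −3.4 the formula gives an efficiency of roughly 97%; the reported 97.5% was presumably derived from the unrounded slope.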
General sequence handling and feature identification were carried out using the UGENE software v1.11.5 (http://genome.unipro.ru). Putative proteins from ORFs were generated by UGENE and they were then searched for similarities with other proteins using BLASTp (http://blast.ncbi.nlm.nih.gov). Proteins were analyzed for unique domains with ScanProsite (http://www.expasy.ch/prosite) and SMART, including searches in the Pfam database (http://pfam.sanger.ac.uk/search). A Python v3 script was made to compare pairwise identities from multiple sequence alignments (available upon request), in which all differences, including terminal gaps, were counted.
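The authors' script is available only on request, so the following is a hypothetical sketch of how pairwise identities could be computed from a multiple sequence alignment while counting every difference, including terminal gaps. The choice of denominator (all columns in which at least one of the two sequences has a residue) is an assumption; the original script may normalize differently.

```python
from itertools import combinations
from typing import Dict, Tuple


def pairwise_identity(a: str, b: str) -> float:
    """Percent identity between two aligned rows; any gap-vs-residue column is a difference."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have the same length")
    # Columns where both rows are gaps carry no information for this pair and are skipped.
    columns = [(x, y) for x, y in zip(a.upper(), b.upper()) if not (x == "-" and y == "-")]
    if not columns:
        raise ValueError("no comparable columns")
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def identity_table(alignment: Dict[str, str]) -> Dict[Tuple[str, str], float]:
    """Pairwise identities for every pair of named rows in an alignment."""
    return {
        (n1, n2): pairwise_identity(alignment[n1], alignment[n2])
        for n1, n2 in combinations(alignment, 2)
    }


if __name__ == "__main__":
    # Toy alignment; the names and sequences are invented for illustration.
    toy = {"seqA": "ATGC-A", "seqB": "ATGCTA", "seqC": "--GCTA"}
    for pair, pct in identity_table(toy).items():
        print(pair, f"{pct:.2f}%")
```

Counting terminal gaps as differences lowers the identity of sequences that are shorter than their alignment partners, which matters when partial amplicons are compared with complete ORFs.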
In order to avoid missing any gamma-related virus, all complete genomes were retrieved from GenBank (914 accessions), from which 723 unique genomes remained after dereplication using Usearch v6.0.307. A workflow made using UGENE software was used to extract the FAP region from L1 ORFs and align it using uMUSCLE, and the resulting alignment was then converted to phylip format using the ALTER website. The alignment was then used to make an ML-tree with RAxML (400 rapid bootstrap inferences and thereafter a thorough ML search). The branch containing all gamma-related viruses was selected using Dendroscope v3.1.0 (excluding beta types). From those sequences, L1 was extracted, aligned, and dereplicated again to remove identical accessions or genomic variant sequences. The complete genomes of the remaining 91 unique sequences were manually edited to have L1 at the 3' end and then they were aligned with uMUSCLE. This alignment was used to make a Maximum Likelihood phylogenetic inference using RAxML v 7.4.2, compiled under Linux as AVX and PTHREADS versions, and run using the raxmlGUI v1.3. Gaps were treated as missing data and the General Time Reversible (GTR) model under the gamma model of rate heterogeneity was selected as the nucleotide substitution model to make 20 inferences with 150 thorough bootstrap replicates (automatically determined by the program using the majority rule tree based criteria, command “-N autoMR”). The sequences of the betapapillomaviruses HPV5 (acc. no. M17463) and HPV9 (acc. no. X74464) were used to root the tree. A graphical representation of the tree was made with FigTree software v 1.3.1 and edited with Inkscape v0.48.
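The step of re-linearizing each circular genome so that L1 lies at the 3' end was done manually in this study. As a rough illustration only, the same operation can be expressed as a rotation of the sequence string; the function name and the example coordinate below are hypothetical, and the sketch assumes that the position of the last base of the L1 ORF is already known.

```python
def rotate_circular(genome: str, l1_end: int) -> str:
    """Rotate a circular genome so that 1-based position `l1_end` becomes the last base."""
    if not 1 <= l1_end <= len(genome):
        raise ValueError("l1_end must be a 1-based position within the genome")
    # Everything after the L1 stop codon moves to the front; content is unchanged.
    return genome[l1_end:] + genome[:l1_end]


if __name__ == "__main__":
    toy_genome = "AAACCCGGGTTT"  # invented sequence; pretend L1 ends at position 6
    print(rotate_circular(toy_genome, 6))
```

Bringing all genomes to a common linearization point in this way keeps homologous regions in the same order before whole-genome alignment.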
The GenBank accession numbers of the sequences in the tree are as follows: AsPV1, HQ625440; BPV11, AB543507; BPV12, JF834523; BPV3, AF486184; BPV4, X05817; BPV5, EU360723; BPV6, AJ620208; BPV7, DQ217793; BPV9, AB331650; CG2, JF966378; CG3, JF966379; ChPV1, DQ091200; CPV13, JX141478; CPV2, AY722648; CPV7, FJ492742; DdPV1, GU117620; FA69, KC108722; FD1, JF966375; FD2, JF966376; Fi864, KC311731; FS1, JF966373; HPV4, X70827; HPV48, U31789; HPV50, U31790; HPV60, U31792; HPV65, X70829; HPV88, EF467176; HPV95, AJ620210; HPV101, DQ080081; HPV103, DQ080078; HPV108, FM212639; HPV109, EU541441; HPV112, EU541442; HPV116, FJ804072; HPV119, GQ845441; HPV121, GQ845443; HPV123, GQ845445; HPV126, AB646346; HPV127, HM011570; HPV128, GU225708; HPV129, GU233853; HPV130, GU117630; HPV131, GU117631; HPV132, GU117632; HPV133, GU117633; HPV134, GU117634; HPV135, HM999987; HPV136, HM999988; HPV137, HM999989; HPV138, HM999990; HPV139, HM999991; HPV140, HM999992; HPV141, HM999993; HPV142, HM999994; HPV144, HM999996; HPV146, HM999998; HPV147, HM999999; HPV148, GU129016; HPV149, GU117629; HPV153, JN171845; HPV154, JN211193; HPV155, JF906559; HPV156, JX429973; HPV161, JX413109; HPV162, JX413108; HPV163, JX413107; HPV164, JX413106; HPV165, JX444072; HPV166, JX413104; HPV169, JX413105; HPV170, JX413110; KC5, JX444073; KN1, JF966371; KN2, JF966372; KN3, JF966374; McPV2, DQ664501; MmiPV1, DQ269468; MusPV1, GU808564; PphPV1, GU117621; PphPV2, GU117622; PsPV1, AJ238373; RnPV1, GQ180114; SD2, KC113191; SE87, KC108721; TtPV1, EU240894; TtPV2, AY956402; TtPV3, EU240895; TtPV4, JN709469; TtPV5, JN709470; TtPV6, JN709471; TtPV7, JN709472.
We thank Aline Marshall, Lena Nilsson and the laboratory staff at Medical Microbiology, Malmö, for technical assistance.
Conceived and designed the experiments: OF AEU. Performed the experiments: AEU. Analyzed the data: AEU. Contributed reagents/materials/analysis tools: AEU OF. Wrote the paper: AEU OF.
- 1. Bernard HU, Burk RD, DeVilliers EM, zur Hausen H (2012) Papillomaviridae. In: International Committee on Taxonomy of Viruses, editor. Virus Taxonomy: Ninth Report of the International Committee on Taxonomy of Viruses. Elsevier Inc., Vol. 1 pp. 235–248.
- 2. De Villiers EM (2013) Cross-roads in the classification of papillomaviruses. Virology 445: 2–10. doi: 10.1016/j.virol.2013.04.023
- 3. Zur Hausen H (2002) Papillomaviruses and cancer: from basic studies to clinical application. Nat Rev Cancer 2: 342–350. doi: 10.1038/nrc798
- 4. Lorincz AT, Reid R, Jenson AB, Greenberg MD, Lancaster W, et al. (1992) Human papillomavirus infection of the cervix: relative risk associations of 15 common anogenital types. Obstet Gynecol 79: 328–337. doi: 10.1097/00006250-199203000-00002
- 5. Van Regenmortel MHV (2000) In: van Regenmortel MHV, Fauquet CM, Bishop DHL, Carstens E, Estes MK, et al., editors. Seventh Report of the International Committee on Taxonomy of Viruses. New York, NY: Academic Press. pp. 3–16.
- 6. De Villiers EM, Fauquet C, Broker TR, Bernard HU, zur Hausen H (2004) Classification of papillomaviruses. Virology 324: 17–27. doi: 10.1016/j.virol.2004.03.033
- 7. Bernard HU, Burk RD, Chen Z, van Doorslaer K, zur Hausen H, et al. (2010) Classification of papillomaviruses (PVs) based on 189 PV types and proposal of taxonomic amendments. Virology 401: 70–79. doi: 10.1016/j.virol.2010.02.002
- 8. Antonsson A, Forslund O, Ekberg H, Sterner G, Hansson BG (2000) The ubiquity and impressive genomic diversity of human skin papillomaviruses suggest a commensalic nature of these viruses. J Virol 74: 11636–11641. doi: 10.1128/jvi.74.24.11636-11641.2000
- 9. Antonsson A, Karanfilovska S, Lindqvist PG, Hansson BG (2003) General acquisition of human papillomavirus infections of skin occurs in early infancy. J Clin Microbiol 41: 2509–2514. doi: 10.1128/jcm.41.6.2509-2514.2003
- 10. Forslund O, Antonsson A, Nordin P, Stenquist B, Hansson BG (1999) A broad range of human papillomavirus types detected with a general PCR method suitable for analysis of cutaneous tumours and normal skin. J Gen Virol 80: 2437–2443.
- 11. Jablonska S, Dabrowski J, Jakubowicz K (1972) Epidermodysplasia verruciformis as a model in studies on the role of papovaviruses in oncogenesis. Cancer Res 32: 583–589.
- 12. Jablonska S, Majewski S (1994) Epidermodysplasia verruciformis: immunological and clinical aspects. Curr Top Microbiol Immunol 186: 157–175. doi: 10.1007/978-3-642-78487-3_9
- 13. Köhler A, Gottschling M, Manning K, Lehmann MD, Schulz E, et al. (2011) Genomic characterization of ten novel cutaneous human papillomaviruses from keratotic lesions of immunosuppressed patients. J Gen Virol 92: 1585–1594. doi: 10.1099/vir.0.030593-0
- 14. Weissenborn S, Neale RE, Waterboer T, Abeni D, Bavinck JNB, et al. (2012) Beta-papillomavirus DNA loads in hair follicles of immunocompetent people and organ transplant recipients. Med Microbiol Immunol 201: 117–125. doi: 10.1007/s00430-011-0212-3
- 15. Söderlund-Strand A, Carlson J, Dillner J (2009) Modified general primer PCR system for sensitive detection of multiple types of oncogenic human papillomavirus. J Clin Microbiol 47: 541–546. doi: 10.1128/jcm.02007-08
- 16. Schmitt M, Bravo IG, Snijders PJF, Gissmann L, Pawlita M, et al. (2006) Bead-based multiplex genotyping of human papillomaviruses. J Clin Microbiol 44: 504–512. doi: 10.1128/jcm.44.2.504-512.2006
- 17. Hsu JYC, Chen ACH, Keleher A, McMillan NAJ, Antonsson A (2009) Shared and persistent asymptomatic cutaneous human papillomavirus infections in healthy skin. J Med Virol 81: 1444–1449. doi: 10.1002/jmv.21529
- 18. Luft F, Klaes R, Nees M, Durst M, Heilmann V, et al. (2001) Detection of integrated papillomavirus sequences by ligation-mediated PCR (DIPS-PCR) and molecular characterization in cervical cancer cells. Int J Cancer 92: 9–17. doi: 10.1002/1097-0215(200102)9999:9999<::aid-ijc1144>3.0.co;2-l
- 19. Bottalico D, Chen Z, Dunne A, Ostoloza J, McKinney S, et al. (2011) The oral cavity contains abundant known and novel human papillomaviruses from the Betapapillomavirus and Gammapapillomavirus genera. J Infect Dis 204: 787–792. doi: 10.1093/infdis/jir383
- 20. Foulongne V, Sauvage V, Hebert C, Dereure O, Cheval J, et al. (2012) Human skin microbiota: high diversity of DNA viruses identified on the human skin by high throughput sequencing. PLoS One 7: e38499. Available: http://dx.plos.org/10.1371/journal.pone.0038499. Accessed 2012 Nov 13.
- 21. Ullman CG, Haris PI, Galloway DA, Emery VC, Perkins SJ (1996) Predicted alpha-helix/beta-sheet secondary structures for the zinc-binding motifs of human papillomavirus E7 and E6 proteins by consensus prediction averaging and spectroscopic studies of E7. Biochem J 319: 229–239.
- 22. Dahiya A, Gavin MR, Luo RX, Dean DC (2000) Role of the LXCXE binding site in Rb function. Mol Cell Biol 20: 6799–6805. doi: 10.1128/mcb.20.18.6799-6805.2000
- 23. Radulescu RT, Bellitti MR, Ruvo M, Cassani G, Fassina G (1995) Binding of the LXCXE insulin motif to a hexapeptide derived from retinoblastoma protein. Biochem Biophys Res Commun 206: 97–102. doi: 10.1006/bbrc.1995.1014
- 24. Iyer LM, Leipe DD, Koonin EV, Aravind L (2004) Evolutionary history and higher order classification of AAA+ ATPases. J Struct Biol 146: 11–31. doi: 10.1016/j.jsb.2003.10.010
- 25. Liu X, Schuck S, Stenlund A (2010) Structure-based mutational analysis of the bovine papillomavirus E1 helicase domain identifies residues involved in the nonspecific DNA binding activity required for double trimer formation. J Virol 84: 4264–4276. doi: 10.1128/jvi.02214-09
- 26. Hegde RS (2002) The papillomavirus E2 proteins: structure, function, and biology. Annu Rev Biophys Biomol Struct 31: 343–360. doi: 10.1146/annurev.biophys.31.100901.142129
- 27. Hegde RS, Grossman SR, Laimins LA, Sigler PB (1992) Crystal structure at 1.7 A of the bovine papillomavirus-1 E2 DNA-binding domain bound to its DNA target. Nature 359: 505–512. doi: 10.1038/359505a0
- 28. Doorbar J, Myers G (1996) The E4 Protein. Human Papillomaviruses 1996 Compendium. pp. 58–80.
- 29. Chen G, Stenlund A (2001) The E1 initiator recognizes multiple overlapping sites in the papillomavirus origin of DNA replication. J Virol 75: 292–302. doi: 10.1128/jvi.75.1.292-302.2001
- 30. Ustav E, Ustav M, Szymanski P, Stenlund A (1993) The bovine papillomavirus origin of replication requires a binding site for the E2 transcriptional activator. Proc Natl Acad Sci 90: 898–902. doi: 10.1073/pnas.90.3.898
- 31. Venuti A, Paolini F, Nasir L, Corteggio A, Roperto S, et al. (2011) Papillomavirus E5: the smallest oncoprotein with many functions. Mol Cancer 10: 140. doi: 10.1186/1476-4598-10-140
- 32. Kovanda A, Kocjan BJ, Luzar B, Bravo IG, Poljak M (2011) Characterization of novel cutaneous human papillomavirus genotypes HPV-150 and HPV-151. PLoS One 6: e22529. Available: http://dx.plos.org/10.1371/journal.pone.0022529. Accessed 2013 July 27.
- 33. Vasiljević N, Hazard K, Dillner J, Forslund O (2008) Four novel human betapapillomaviruses of species 2 preferentially found in actinic keratosis. J Gen Virol 89: 2467–2474. doi: 10.1099/vir.0.2008/001925-0
- 34. Vasiljević N, Hazard K, Eliasson L, Ly H, Hunziker A, et al. (2007) Characterization of two novel cutaneous human papillomaviruses, HPV93 and HPV96. J Gen Virol 88: 1479–1483. doi: 10.1099/vir.0.82679-0
- 35. Zakrzewska K, Regalbuto E, Pierucci F, Arvia R, Mazzoli S, et al. (2012) Pattern of HPV infection in basal cell carcinoma and in perilesional skin biopsies from immunocompetent patients. Virol J 9: 309. doi: 10.1186/1743-422x-9-309
- 36. Kullander J, Handisurya A, Forslund O, Geusau A, Kirnbauer R, et al. (2008) Cutaneous human papillomavirus 88: remarkable differences in viral load. Int J Cancer 122: 477–480. doi: 10.1002/ijc.23115
- 37. Ekström J, Forslund O, Dillner J (2010) Three novel papillomaviruses (HPV109, HPV112 and HPV114) and their presence in cutaneous and mucosal samples. Virology 397: 331–336. doi: 10.1016/j.virol.2009.11.027
- 38. Bzhalava D, Johansson H, Ekström J, Faust H, Möller B, et al. (2013) Unbiased approach for virus detection in skin lesions. PLoS One 8: e65953. Available: http://dx.plos.org/10.1371/journal.pone.0065953. Accessed 2013 July 15.
- 39. Forslund O, Johansson H, Madsen KG, Kofoed K (2013) The nasal mucosa contains a large diversity of human papillomavirus from the Beta- and Gammapapillomavirus genera. J Infect Dis: In press.
- 40. Phan TG, Vo NP, Aronen M, Jartti L, Jartti T, et al. (2013) Novel human gammapapillomavirus species in a nasal swab. Genome Announc 1: e0002213. doi: 10.1128/genomea.00022-13
- 41. Bottalico D, Chen Z, Kocjan BJ, Seme K, Poljak M, et al. (2012) Characterization of human papillomavirus type 120: a novel betapapillomavirus with tropism for multiple anatomical niches. J Gen Virol 93: 1774–1779. doi: 10.1099/vir.0.041897-0
- 42. Chen Z, Schiffman M, Herrero R, Desalle R, Burk RD (2007) Human papillomavirus (HPV) types 101 and 103 isolated from cervicovaginal cells lack an E6 open reading frame (ORF) and are related to gamma-papillomaviruses. Virology 360: 447–453. doi: 10.1016/j.virol.2006.10.022
- 43. Nobre RJ, Herráez-Hernández E, Fei JW, Langbein L, Kaden S, et al. (2009) E7 oncoprotein of novel human papillomavirus type 108 lacking the E6 gene induces dysplasia in organotypic keratinocyte cultures. J Virol 83: 2907–2916. doi: 10.1128/jvi.02490-08
- 44. Sturegård E, Johansson H, Ekström J, Hansson BG, Johnsson A, et al. (2013) Human papillomavirus typing in reporting of condyloma. Sex Transm Dis 40: 123–129. doi: 10.1097/olq.0b013e31827aa9b3
- 45. Rector A, Tachezy R, Van Ranst M (2004) A sequence-independent strategy for detection and cloning of circular DNA virus genomes by using multiply primed rolling-circle amplification. J Virol 78: 4993–4998. doi: 10.1128/jvi.78.10.4993-4998.2004
- 46. Ye J, Coulouris G, Zaretskaya I, Cutcutache I, Rozen S, et al. (2012) Primer-BLAST: a tool to design target-specific primers for polymerase chain reaction. BMC Bioinformatics 13: 134. Available: http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=3412702&tool=pmcentrez&rendertype=abstract. Accessed 2013 Mar 3.
- 47. Letunic I, Doerks T, Bork P (2009) SMART 6: recent updates and new developments. Nucleic Acids Res 37: D229–32. doi: 10.1093/nar/gkn808
- 48. Finn RD, Mistry J, Tate J, Coggill P, Heger A, et al. (2010) The Pfam protein families database. Nucleic Acids Res 38: D211–22. doi: 10.1093/nar/gkp985
- 49. Edgar RC (2010) Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26: 2460–2461. doi: 10.1093/bioinformatics/btq461
- 50. Okonechnikov K, Golosova O, Fursov M (2012) Unipro UGENE: a unified bioinformatics toolkit. Bioinformatics 28: 1166–1167. doi: 10.1093/bioinformatics/bts091
- 51. Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32: 1792–1797. doi: 10.1093/nar/gkh340
- 52. Glez-Peña D, Gómez-Blanco D, Reboiro-Jato M, Fdez-Riverola F, Posada D (2010) ALTER: program-oriented conversion of DNA and protein alignments. Nucleic Acids Res 38: W14–8. Available: http://nar.oxfordjournals.org/content/38/suppl_2/W14. Accessed 2013 June 8.
- 53. Huson DH, Scornavacca C (2012) Dendroscope 3: an interactive tool for rooted phylogenetic trees and networks. Syst Biol 61: 1061–1067.
- 54. Stamatakis A (2006) RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 22: 2688–2690. doi: 10.1093/bioinformatics/btl446
- 55. Rambaut A (2009) FigTree v1.3. Available: http://beast.bio.ed.ac.uk/figtree.
- 56. Favre M, Obalek S, Jablonska S, Orth G (1989) Human papillomavirus (HPV) type 50, a type associated with epidermodysplasia verruciformis (EV) and only weakly related to other EV-specific HPVs. J Virol 63: 4910.
- 57. Sichero L, Pierce Campbell CM, Ferreira S, Sobrinho JS, Luiza Baggio M, et al. (2013) Broad HPV distribution in the genital region of men from the HPV infection in men (HIM) study. Virology: In press.
- 58. Chouhy D, Bolatti EM, Piccirilli G, Sánchez A, Fernandez Bussy R, et al. (2013) Identification of human papillomavirus type 156, the prototype of a new human gammapapillomavirus species, by a generic and highly sensitive PCR strategy for long DNA fragments. J Gen Virol 94: 524–533. doi: 10.1099/vir.0.048157-0
- 59. Li J, Cai H, Xu Z, Wang Q, Hang D, et al. (2012) Nine complete genome sequences of cutaneous human papillomavirus genotypes isolated from healthy skin of individuals living in rural He Nan province, China. J Virol 86: 11936. doi: 10.1128/jvi.01988-12
- 60. Johansson H, Bzhalava D, Ekström J, Hultin E, Dillner J, et al. (2013) Metagenomic sequencing of “HPV-negative” condylomas detects novel putative HPV types. Virology 440: 1–7. doi: 10.1016/j.virol.2013.01.023
- 61. Antonsson A, Erfurt C, Hazard K, Holmgren V, Simon M, et al. (2003) Prevalence and type spectrum of human papillomaviruses in healthy skin samples collected in three continents. J Gen Virol 84: 1881–1886. doi: 10.1099/vir.0.18836-0
- 62. Forslund O, Ly H, Reid C, Higgins G (2003) A broad spectrum of human papillomavirus types is present in the skin of Australian patients with non-melanoma skin cancers and solar keratosis. Br J Dermatol 149: 64–73. doi: 10.1046/j.1365-2133.2003.05376.x
- 63. Matsukura T, Iwasaki T, Kawashima M (1992) Molecular cloning of a novel human papillomavirus (type 60) from a plantar cyst with characteristic pathological changes. Virology 190: 561–564. doi: 10.1016/0042-6822(92)91254-r
- 64. Gissmann L, Pfister H, zur Hausen H (1977) Human papilloma viruses (HPV): characterization of four different isolates. Virology 76: 569–580. doi: 10.1016/0042-6822(77)90239-2
- 65. Forslund O, Lindelöf B, Hradil E, Nordin P, Stenquist B, et al. (2004) High prevalence of cutaneous human papillomavirus DNA on the top of skin tumors but not in “Stripped” biopsies from the same tumors. J Invest Dermatol 123: 388–394. doi: 10.1111/j.0022-202x.2004.23205.x
- 66. Nordin P, Hansson BG, Hansson C, Blohmè I, Larkö O, et al. (2007) Human papilloma virus in skin, mouth and uterine cervix in female renal transplant recipients with or without a history of cutaneous squamous cell carcinoma. Acta Derm Venereol 87: 219–222.
- 67. Hazard K, Karlsson A, Andersson K, Ekberg H, Dillner J, et al. (2007) Cutaneous human papillomaviruses persist on healthy skin. J Invest Dermatol 127: 116–119. doi: 10.1038/sj.jid.5700570
- 68. Li L, Barry P, Yeh E, Glaser C, Schnurr D, et al. (2009) Identification of a novel human gammapapillomavirus species. J Gen Virol 90: 2413–2417. doi: 10.1099/vir.0.012344-0
- 69. Egawa K, Delius H, Matsukura T, Kawashima M, de Villiers EM (1993) Two novel types of human papillomavirus, HPV 63 and HPV 65: comparisons of their clinical and histological features and DNA sequences to other HPV types. Virology 194: 789–799. doi: 10.1006/viro.1993.1320
- 70. Egawa N, Kawai K, Egawa K, Honda Y, Kanekura T, et al. (2012) Molecular cloning and characterization of a novel human papillomavirus, HPV 126, isolated from a flat wart-like lesion with intracytoplasmic inclusion bodies and a peculiar distribution of Ki-67 and p53. Virology 422: 99–104. doi: 10.1016/j.virol.2011.10.011
- 71. Egawa K, Kimmel R, de Villiers EM (2005) A novel type of human papillomavirus (HPV 95): comparison with infections of closely related human papillomavirus types. Br J Dermatol 153: 688–689. doi: 10.1111/j.1365-2133.2005.06825.x
- 72. Müller M, Kelly G, Fiedler M, Gissmann L (1989) Human papillomavirus type 48. J Virol 63: 4907–4908.
- 73. Mokili JL, Dutilh BE, Lim YW, Schneider BS, Taylor T, et al. (2013) Identification of a novel human papillomavirus by metagenomic analysis of samples from patients with febrile respiratory illness. PLoS One 8: e58404. doi: 10.1371/journal.pone.0058404