Oxyplax ochracea (Moore) is a pest that causes severe damage to a wide range of crops, forests and fruit trees. The complete genome sequence of Oxyplax ochracea nucleopolyhedrovirus (OxocNPV) was determined using a Roche 454 pyrosequencing system. OxocNPV has a double-stranded DNA (dsDNA) genome of 113,971 bp with a G+C content of 31.1%. One hundred and twenty-four putative open reading frames (ORFs) encoding proteins of >50 amino acids in length and with minimal overlapping were predicted, which covered 92% of the whole genome. Six baculoviral typical homologous regions (hrs) were identified. Phylogenetic analysis and gene parity plot analysis showed that OxocNPV belongs to clade “a” of Group I alphabaculoviruses, and it seems to be close to the most recent common ancestor of Group I alphabaculoviruses. Three unique ORFs (with no homologs in the National Center for Biotechnology Information database) were identified. Interestingly, OxocNPV lacks three auxiliary genes (lef7, ie-2 and pcna) related to viral DNA replication and RNA transcription. In addition, OxocNPV has significantly different sequences for several genes (including ie1 and odv-e66) in comparison with those of other baculoviruses. However, three dimensional structure prediction showed that OxocNPV ODV-E66 contain the conserved catalytic residues, implying that it might possess polysaccharide lyase activity as AcMNPV ODV-E66. All these unique features suggest that OxocNPV represents a novel species of the Group I alphabaculovirus lineage.
Citation: Wang J, Hou D, Wang Q, Kuang W, Zhang L, Li J, et al. (2018) Genome analysis of a novel Group I alphabaculovirus obtained from Oxyplax ochracea. PLoS ONE 13(2): e0192279. https://doi.org/10.1371/journal.pone.0192279
Editor: Olle Terenius, Swedish University of Agricultural Sciences, SWEDEN
Received: September 28, 2017; Accepted: January 18, 2018; Published: February 1, 2018
Copyright: © 2018 Wang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All sequences are available from the NCBI database (accession number MF143631).
Funding: This work was supported by the grants from the National Key R&D Program of China (Grants No. 2017YFD0200400 to FD and 2016YFE0113500 to ZH); the National Natural Science Foundation of China (Grant No. 31621061) to ZH; the Key Research Program of Frontier Sciences of Chinese Academy of Sciences (Grant No. QYZDJ-SSW-SMC021) to ZH, the Virology Key Frontier Science Program of State Key Laboratory of Virology (Grant No. klv-2016-03) to ZH and HW, and the National Natural Science Foundation of China (Grant No. 31400140) to SS. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Baculoviruses are known to infect a wide variety of insect hosts and play important roles in regulating many insect populations in nature. They have been widely used as environmentally safe agents for pest control. In addition, baculoviruses provide efficient expression systems for the production of recombinant proteins in insect cells, as well as promising vectors for gene therapy [1, 2]. These applications greatly facilitate fundamental studies of baculoviruses.
To date, more than 600 baculoviruses have been reported to be isolated from different insect species, and 87 of them have had their whole genomes sequenced . Baculovirus genomes consist of circular, double-stranded DNA ranging from 80 to 180 kb [4, 5]. Based on phylogenetic analysis, the family Baculoviridae is classified into four genera: Alphabaculovirus (lepidopteran-specific nucleopolyhedroviruses [NPVs]), Betabaculovirus (lepidopteran-specific granuloviruses [GVs]), Gammabaculovirus (hymenopteran-specific NPVs) and Deltabaculovirus (dipteran-specific NPVs) [6, 7]. The Alphabaculovirus genus can be divided into two groups, Groups I and II [8–10]. Group I alphabaculoviruses are characterized by their use of GP64 as their envelope fusion protein (EFP), while Group II alphabaculoviruses, most betabaculoviruses (except a newly discovered Diatraea saccharalis granulovirus, which contains both gp64 and f) and deltabaculoviruses exploit F protein as their EFP [11, 12]. Group I alphabaculoviruses are further divided into two clades based on phylogeny, “a” and “b” . Despite the diversity in gene content and organization of baculovirus genomes, a set of 38 core genes are conserved across their genomes, which play important roles in the viral life cycle [14, 15]. In addition, 22 genes are conserved in all sequenced lepidopteran baculoviruses (alpha- and betabaculoviruses) and 11 genes (including gp64) are only present in Group I alphabaculoviruses .
Oxyplax ochracea (Moore) (Lepidoptera: Limacodidae) causes economic losses related to orange, tea, and tea-oil trees. It is widely distributed in East, South and Central China, as well as in Indonesia, India, Thailand and Sri Lanka . In China, the insect has two generations each year . Oxyplax ochracea nucleopolyhedrovirus (OxocNPV) is a natural pathogen of O. ochracea. It was first isolated in 1989 from O. ochracea larvae that exhibited the typical symptoms of baculovirus infection in an orange field in Zhuxi, Hubei Province . Electron microscopy showed that OxocNPV is a single-nucleocapsid nucleopolyhedrovirus (SNPV), with occlusion bodies (OBs) containing singly packaged virions . In this study, the complete genome sequence of OxocNPV was determined and analyzed. Phylogenetic analysis suggested that this virus is a novel Group I alphabaculovirus and seems to be closely linked to the most recent common ancestor of these viruses.
Results and discussion
Sequencing and characterization of OxocNPV genome
The genome of OxocNPV was assembled from 71,240 high-quality Roche 454 sequencing reads with an average coverage of 249X. Uncertain regions were confirmed by PCR amplification and Sanger sequencing. The complete genome sequence and annotation information were submitted to GenBank (accession number: MF143631). In summary, the complete circular OxocNPV genome is 113,971 bp in length, with a G+C content of 31.1%. In total, 124 putative open reading frames (ORFs) that potentially encode proteins of >50 amino acids (aa) in length were predicted, covering 92% of the whole genome. Among them, 61 ORFs were in the forward orientation and 63 were in the reverse orientation. The polyhedrin gene was assigned as the first ORF according to tradition. The 38 baculovirus core genes (Fig 1, red), 22 lepidopteran baculovirus conserved genes (Fig 1, blue), 10 Group I alphabaculovirus unique genes (Fig 1, green) and 51 baculovirus common genes (Fig 1, gray) were annotated using Basic Local Alignment Search Tool (BLAST) comparisons. In addition, three genes were classified as OxocNPV unique genes as no homologs were found in the National Center for Biotechnology Information (NCBI) database (Fig 1, open arrows).
ORFs are indicated by arrows. The direction of the arrows indicates the orientation of gene transcription. The colors represent gene types: red for core genes, blue for lepidopteran baculovirus conserved genes, green for Group I unique genes, and gray for other baculoviral genes. Open arrows represent hypothetical unique genes of OxocNPV. Hrs are represented by pink square boxes. The collinear region conserved in lepidopteran baculoviruses is also indicated.
Phylogenetic analysis of OxocNPV
A phylogenetic tree based on 38 concatenated core genes from 88 whole-genome sequenced baculoviruses (including OxocNPV) was generated (Fig 2). According to the tree, OxocNPV can be classified as a new member of clade “a” within Group I alphabaculoviruses (indicated by a red triangle). It is located on a distinct branch and appears to be a close lineage to the most recent common ancestor of the Group I alphabaculoviruses.
Phylogenetic analysis was performed using the combined aa sequences of the 38 core genes from 88 sequenced baculoviruses using the Maximum Likelihood method. Bootstrap value: 1000 replicates. The numbers on the nodes indicate the bootstrap scores. OxocNPV is indicated by a red triangle.
Relationship with other baculoviruses
Nine representative baculoviruses (either type species/well-studied representatives or in a close phylogenetic relation to OxocNPV) were chosen for the comparative study of OxocNPV: Autographa californica MNPV (AcMNPV, clade “a” Group I alphabaculovirus), Bombyx mori NPV (BmNPV, clade “a” Group I alphabaculovirus), Thysanoplusia orichalcea NPV (ThorNPV, clade “a” Group I alphabaculovirus), Catopsilia pomona NPV (CapoNPV, clade “a” Group I alphabaculovirus), Orgyia pseudotsugata MNPV (OpMNPV, clade “b” Group I alphabaculovirus), Helicoverpa armigera NPV (HearNPV, Group II alphabaculovirus), Cydia pomonella GV (CpGV, betabaculovirus), Neodiprion sertifer NPV (NeseNPV, gammabaculovirus) and Culex nigripalpus NPV (CuniNPV, deltabaculovirus). OxocNPV shares 119 homologous ORFs with AcMNPV, 115 with BmNPV, 114 with ThorNPV, 112 with CapoNPV, 112 with OpMNPV, 97 with HearNPV, 67 with CpGV, 49 with NeseNPV and 39 with CuniNPV. Regarding the 38 core genes, OxocNPV shares an average aa identity of 63.8%, 64.4%, 64.4%, 63.1%, 57.3%, 43.1%, 31.0%, 26.0%, 18.3% with the above nine viruses, respectively (S1 Table). When compared with the five selected Group I alphabaculoviruses (AcMNPV, BmNPV, ThorNPV, CapoNPV and OpMNPV), OxocNPV shares an average aa identity of 58.1%, 51.8%, and 38.3% for the core genes, lepidopteran baculovirus conserved genes and other baculoviral genes, respectively (S2 Table). Eight OxocNPV ORFs share high homology (>75% aa identity on average) with their counterparts in the other five selected Group I alphabaculoviruses, and the majority of them are core genes/lepidopteran baculovirus conserved genes, except ubiquitin (S2 Table). In contrast, among the 14 less conserved ORFs (<30% aa identity in average), only one core gene (desm oplakin) and one lepidopteran baculovirus conserved gene (lef6) were found (S2 Table).
Gene parity plots of OxocNPV against the above nine selected baculoviruses are shown in Fig 3. The OxocNPV gene order is substantially collinear with representatives of both Group I and Group II alphabaculoviruses, with a small region that is collinear with betabaculoviruses; however, the gene order is significantly different from that of gamma- and deltabaculoviruses (Fig 3). The previously identified collinear region, which is conserved among lepidopteran baculoviruses and characterized by containing a highly similar gene contents (harboring ~20 core genes) and gene orders  is also conserved in OxocNPV. In OxocNPV, this region contains 20 core genes, 5 lepidopteran baculovirus conserved genes and 8 other baculoviral genes (Fig 1).
Gene parity plots were constructed of OxocNPV against representative baculoviruses: AcMNPV, BmNPV, ThorNPV and CapoNPV (Group I clade “a”); OpMNPV (Group I clade “b”); HearNPV (Group II); CpGV (a betabaculovirus), NeseNPV (a gammabaculovirus) and CuniNPV (a deltabaculovirus). OxocNPV ORFs are on the x-axes. The accession numbers of these genomes are listed in S1 Table.
A typical characteristic of baculovirus genomes is the presence of interspersed homologous regions (hrs) with high A+T content, tandem repeats and imperfect palindromes, although they do not necessarily exist in all baculoviruses [20, 21]. Hrs have been implicated both as origins of DNA replication and as transcriptional enhancers in a number of baculoviruses [22, 23]. Six hrs were found in the OxocNPV genome, and they had an A+T content of 54.3% (Fig 1, pink, and Fig 4). Hr2 and hr4 are positioned in a counterclockwise direction and the rest are positioned in a clockwise direction in the genome. The length of the OxocNPV hrs ranges from 230–780 bp, and each hr consists of tandem repeats of about 80 bp in length (Fig 4). The secondary structure prediction of the tandem repeats revealed that it contains two imperfect palindromes (Fig 4, red and blue, respectively).
(A) Sequence alignment of OxocNPV hrs. Black background indicates 100% identity among compared regions, and dark and light gray indicates >80% and >60% identity, respectively. The numbers of hrs are listed on the left, and the conserved sequence and two palindromes are listed at the bottom. (B) Secondary structure prediction of conserved hr sequences. The red rectangle indicates the first palindrome and the blue rectangle indicates the second palindrome.
Gene content of OxocNPV
Annotation of the OxocNPV genome revealed that it contains 11 replication-associated genes, 12 transcription-associated genes, 34 structure-related genes, 10 genes essential for oral infection, and 20 auxiliary genes (Table 1). In addition, 37 genes of unknown function including three hypothetical unique OxocNPV genes were predicted (Table 1).
DNA replication and RNA transcription genes
So far, six genes have been found to be essential for baculovirus DNA replication and they are all present in the OxocNPV genome: immediate early gene-1 (ie-1, oxoc6), DNA polymerase (DNA-pol, oxoc72), helicase (oxoc48), late expression factor 1 (lef1, oxoc95), lef2 (oxoc124) and lef3 (lef3, oxoc70) (Table 1) [24, 25]. Among them, IE-1, a major transcriptional activator of early genes, was found to be significant longer for OxocNPV (~714 aa) than for most of the other Group I alphabaculoviruses (~550 aa). Functional analysis showed that the N-terminal half of AcMNPV IE-1 contains two independent transcription stimulatory (transactivation) domains (M1-N125 and A168-G222) interrupted by a basic region, while the C-terminal half contains putative DNA-binding and oligomerization domains (Fig 5) [26, 27]. Sequence alignment showed that the transactivation domain I of OxocNPV IE-1 is quite divergent from the homologs in other clade “a” Group I alphabaculoviruses in that the OxocNPV domain contains several discontinuous insertions (Fig 5). Interestingly, IE-1 of CapoNPV and Lonomia obliqua multiple nucleopolyhedrovirus [LoobNPV], which are also closely related to the ancestral Group I alphabaculovirus (Fig 2), also exhibits obvious differences in this domain compared to other Group I alphabaculoviruses (Fig 5) . Whether the transactivation domain I of OxocNPV, CapoNPV or LoobNPV IE-1 is sufficient to activate transcription (like its counterpart in AcMNPV IE-1) remains to be investigated, and this may provide useful information regarding the function–evolution relationship of the baculovirus IE-1 protein.
Functional domains were annotated according studies of AcMNPV IE-1. The red, green and orange lines indicate the transcriptional transactivation domains, basic region and oligomerization domain, respectively. The putative DNA-binding domain is indicated by the blue line. The GenBank accession numbers for these IE-1 proteins and the virus full names are as follows: YP_758611 (Plutella xylostella multiple nucleopolyhedrovirus, PlxyMNPV), NP_054178 (AcMNPV), NP_703134 (RoMNPV), NP_047544 (BmNPV), YP_002884369 (Bombyx mandarina nucleopolyhedrovirus, BomaNPV), YP_950845 (Maruca vitrata nucleopolyhedrovirus, MaviNPV), YP_007250550 (ThorNPV), YP_009255268 (CapoNPV) and AKN80956.1(LoobNPV).
Additional genes that influence DNA replication were found in OxocNPV: DNA binding protein (dbp, oxoc107), lef11 (oxoc120), me53 (oxoc16), alkaline exonuclease (alk-exo, oxoc20) and ac79 (oxoc59)  (Table 1). However, homologs of lef7 (ac125) and proliferating cell nuclear antigen (pcna, ac49) were absent. Eukaryotic PCNA plays a role in DNA synthesis, DNA repair and cell cycle progression. In AcMNPV, PCNA was not found to play an obvious role in transient DNA replication . However, it was found to accelerate expression of late genes . LEF7 is a stimulating factor for viral DNA replication and it has been proposed to be a single-stranded DNA-binding protein [30, 31]. Deletion of AcMNPV lef7 resulted in a >90% reduction in viral DNA replication in Sf21 and SE1c cells, but not in Tn368 cells . Deletion of lef7 from the BmNPV genome also led to impairment of viral DNA synthesis . Recent study suggested that AcMNPV LEF7 promoted efficient virus replication most likely by hijacking host factors regulating the DNA damage response . So far, homologs of lef7 have been found to be present in all the sequenced Group I alphabaculoviruses except OxocNPV, CapoNPV and LoobNPV .
Early baculovirus genes are transcribed by the host cell RNA polymerase II, but after onset of DNA replication, the transcription of late and very late genes is dependent on viral-encoded RNA polymerase, a 560-kDa protein complex composed of LEF-4, 8, 9 and P47 . In the OxocNPV genome, six core genes (the four components of RNA polymerase plus lef5 [oxoc44], and very late factor 1 [vlf-1, oxoc61]), three lepidopteran baculovirus conserved genes and two other baculoviral genes related to viral late gene transcription were identified (Table 1) [36, 37]. OxocNPV lef6, CapoNPV lef6 and LoobNPV lef6 share low similarity (~20–40% aa identity) with other members of Group I clade “a”. LEF6 is required for late gene transcription and may function as an mRNA exporter. Deletion of AcMNPV lef6 leads to a ~90% reduction in infectious budded virus (BV) production . The homolog of ie-2, a specific gene of Group I alphabaculoviruses, is absent from the genome of OxocNPV. IE-2 contains a predicted really interesting new gene (RING) finger domain and has been found to enhance transactivation when acting synergistically with IE-1 [39–41]. Deletion of ie-2 reduced the plasmid replication level by 3-fold in Sf21 cells . OxocNPV is the first reported Group I alphabaculovirus to lack ie-2.
Eighteen core genes and six lepidopteran conserved genes that encode structural proteins were identified in the OxocNPV genome (Table 1) [43–45]. In addition, nine other baculoviral genes were also identified in the OxocNPV genome (Table 1). Desmoplakin (ac66) is one of the 38 core genes. Knockout of ac66 led to a >99% reduction in BV yield compared to the wild-type virus, as well as the elimination of occlusion-derived virus (ODV) and OB formation . Certain baculoviruses harbor two or three copies of desmoplakin. In the OxocNPV genome, only one desmoplakin gene (oxoc71) is present and its protein length (683 aa) is much shorter than those of other Group I homologs (766–953 aa), due to many deletions in the middle region (S1 Fig, only clade “a” members are shown). Cg30 is present in the genomes of most sequenced alphabaculoviruses and certain betabaculoviruses (such as SpliGV). Regarding Group I alphabaculoviruses, cg30 is missing only in two cases, OxocNPV and Maruca vitrata multiple nucleopolyhedrovirus (MaviNPV). CG30 contains putative RING finger and leucine zipper domains. It is not an essential gene for AcMNPV replication as deletion of cg30 resulted in only a subtle reduction in the BV titer . In a study of BmNPV, CG30 was found to be required for maximum BV production and OB formation . Therefore, the acquisition of cg30 may represent a selective advantage during evolution.
Proteins involved in primary infection
Per os infectivity factors (PIFs) are a group of ODV-specific envelope proteins that are required for the establishment of primary infection [49, 50]. So far, all ten recognized PIF genes have been found in the OxocNPV genome, comprising p74 (oxoc17), pif1 (oxoc29), pif2 (oxoc102), pif3 (oxoc31), pif4 (oxoc47), pif5 (oxoc5), pif6 (oxoc69), pif7 (ac110, oxoc35), pif8 (vp91/p95, oxoc55) and sf58 (oxoc37) (Table 1; S1 Table) [51–54]. Except for the last PIF gene, which is a lepidopteran baculovirus conserved gene, the PIF genes are core genes .
Besides PIFs, other ODV envelope proteins also play important roles in per os infection. ODV-E66 (ac46) is also a major component of ODV envelope proteins. Homologs of odv-e66 are found in the genomes of most alpha- and betabaculoviruses, but not in gamma- or deltabaculoviruses. Deletion of AcMNPV odv-e66 resulted in a 1000-fold increase in the lethal dose that kills 50% of a test sample (LD50) compared to wild-type virus when larvae were infected per os, but there was no difference when the virus was injected into the hemolymph . Recently, ODV-E66 was shown to have chondroitinase activity and it has been suggested that it facilitates the primary infection of ODV by digestion of chondroitin sulfate in the insect midgut peritrophic membrane .
Among the 23 sequenced Group I alphabaculoviruses, all encode odv-e66 except MaviNPV, CapoNPV and Condylorrhiza vestigialis MNPV [CoveNPV]). Interestingly, sequence alignment showed that OxocNPV ODV-E66 (oxoc113) shares low amino acid serquence identity (~25%) with the ODV-E66 sequences of other Group I members. The Group I homologs ODV-E66 normally exhibit a high degree of sequence conservation (>70% identity) (S2 Fig). It appears that the N-terminus of OxocNPV ODV-E66, which contains a polysaccharide lyase family sequence with homology to bacterial chondroitinases, is more conserved than its C-terminus, which is a baculovirus ODV-E66 superfamily domain with unknown function (S2 Fig) [58,59]. Five residues were identified as being essential for the catalytic actively of AcMNPV ODV-E66 , and these residues were also conserved in the protein of OxocNPV (Fig 6A).
(A) Sequence alignment of OxocNPV ODV-E66 with the sequences of eight representative baculoviruses. The five key active residues are marked by rectangles. The GenBank accession numbers for these ODV-E66 proteins and the virus full names are as follows: NP_054075 (AcMNPV), YP_758513 (PlxyNPV), NP_703036 (RoMNPV), NP_047452 (BmNPV), YP_002884277 (BomaNPV), YP_007250454 (ThorNPV), NP_203651 (HearNPV) and NP_148821 (CpGV). (B) Prediction of the 3D structure of OxocNPV ODV-E66. The computational 3D structure (red) of OxocNPV ODV-E66 was modeled using the crystal structure (yellow) of AcMNPV ODV-E66 (PDB code, 3vsm) as the template . The five key active residues of OxocNPV ODV-E66 (N209, H264, Y272, R316 and E370) were superimposed over the corresponding residues of AcMNPV ODV-E66 (N236, H291, Y299, R345 and E395). The image on the right represents the enlarged active site of the left image.
The three-dimensional (3D) structure of OxocNPV ODV-E66 (Fig 6B, magenta) was modeled based on the reported AcMNPV ODV-E66 crystal structure (Fig 6B, yellow, Protein Data Bank (PDB) code: 3vsm) , and the superimposition of these two structures revealed a higher degree of overlap in the N-terminal α-helix-rich domain compared to the C-terminal β-strand-rich region (Fig 6B, left figure). The five key active residues of OxocNPV ODV-E66 (N209, H264, Y272, R316 and E370) were suitably superimposed on the corresponding residues of AcMNPV ODV-E66 (N236, H291, Y299, R345 and E395) (Fig 6B, right image).
These findings suggest that OxocNPV may encode an active ODV-E66, although further investigation is required. The significant difference in ODV-E66 between OxocNPV and other Group I members also suggests a more ancient origin of OxocNPV during the evolution of the Group I lineage.
Auxiliary genes are not essential for viral replication, but they may provide a selective advantage to facilitate virus production/survival . Homologues of auxiliary genes in OxocNPV include but are not limited to 38.7k (oxoc94), ADP-ribose pyrophosphatase (ADPRase, oxoc121), ubiquitin (oxoc118), arif-1 (oxoc101), cathepsin (oxoc106), chitinase (oxoc105), baculovirus J domain protein (bjdp, oxoc84), ecdysteroid UDP glucosyltransferase (egt, oxoc96), fibroblast growth factor (fgf, oxoc116), gp37 (oxoc73), global transactivator (gta, oxoc90), p26 (oxoc19), phosphotyrosine phosphatase (ptp, oxoc122), inhibitor of apoptosis-1 (iap-1, oxoc109), iap-2 (oxoc67), superoxide dismutase (sod, oxoc115), trax-like (oxoc86), MTase (oxoc68), and conotoxin-like (ctl, oxoc114) (Table 1). However, baculovirus repeated ORFs (bro), which are repeated genes that are found in most baculoviruses and contain DNA-binding activity that could influence host DNA replication and transcription, were absent from the OxocNPV genome, which is similar to the situation in MaviNPV and Rachiplusia ou MNPV (RoMNPV) in the Group I lineage.
Three ORFs of OxocNPV, comprising oxoc9 (203 aa), oxoc10 (79 aa) and oxoc27 (154 aa), have no discernible homologues in GenBank. Interestingly, oxoc9 and oxoc10 are linked and their sequences have been validated by PCR. Whether these are functional ORFs of OxocNPV requires further experimentation.
Materials and methods
Viral DNA extraction
OxocNPV-infected O. ochracea larvae have been preserved at the Chinese General Virus Collection Center (CGVCC) under collection number IVCAS 1.0235. The virus OBs were purified from larvae body homogenate by differential centrifugation . Viral genomic DNA was isolated according to the method reported previously [62, 63].
Genomic DNA sequencing and bioinformatics analysis
Genomic DNA sequencing of OxocNPV was performed using the Roche 454 GS FLX pyrosequencing system. The sequenced reads were assembled with 454 Newbler software version 2.7. Low-quality regions or ambiguous bases were further verified by PCR and Sanger sequencing.
The establishment of the full genome sequence of OxocNPV was followed by ORF and repeated regions prediction. The hrs were determined using Tandem Repeats Finder (http://tandem.bu.edu/trf/trf.html)  and the NCBI BLAST server (http://blast.ncbi.nlm.nih.gov/Blast.cgi). Putative ORFs were predicted using FGENESV0 (http://linux1.softberry.com/berry.phtml)  and the NCBI ORF finder (http://www.ncbi.nlm.nih.gov/gorf/gorf.html), using the criteria of protein length >50 aa and minimal overlaps. The predicted ORFs were annotated according to homology using NCBI BLAST. The complete genome sequence and annotation information were submitted to GenBank (accession number: MF143631). Gene parity plots were constructed to compare ORF organization, as previously described .
The concatenated protein sequences encoded by the 38 core genes of OxocNPV and the other 87 sequenced baculovirus genomes were aligned using ClustalW . A phylogenetic tree was reconstructed using the Maximum Likelihood method based on the Jones–Thornton–Taylor (JTT) model with 1000 bootstrap values for core proteins using MEGA6 software . The reliability of the tree was explored via a bootstrap analysis with 1000 replicates .
3D structural modeling
The 3D structural model of OxocNPV ODV-E66 was predicted by the SWISS-MODEL server (http://www.expasy.org/structural_bioinformatics), using the AcMNPV ODV-E66 structure (PDB code, 3vsm) as the template. A superimposition of OxocNPV ODV-E66 and AcMNPV ODV-E66 was generated by PyMOL (www.pymol.org).
S1 Fig. Sequence alignment of desmoplakin of clade “a” of Group I alphabaculoviruses.
Black background indicates 100% identity among compared regions, and dark and light gray indicates >80% and >60% identity, respectively. The GenBank accession numbers for these Desmoplakin proteins and the virus full names are as follows: YP_758533 (PlxyMNPV), NP_054096 (AcMNPV), NP_703056 (RoMNPV), NP_047470 (BmNPV), YP_002884295 (BomaNPV), YP_950780 (MaviNPV), YP_007250473 (ThorNPV), ANF29722 (CapoNPV), AKN81050 (LoobNPV).
S2 Fig. Sequence alignment of ODV-E66 of Group I alphabaculoviruses.
The Black background indicates 100% identity among compared regions, and dark and light gray indicates >80% and >60% identity, respectively. GenBank accession numbers for these ODV-E66 proteins and the virus full names are as follows: YP_758513 (PlxyMNPV), NP_054075 (AcMNPV), NP_703036 (RoMNPV), NP_047452 (BmNPV), YP_002884277 (BomaNPV), YP_007250454 (ThorNPV), YP_803443 (Anticarsia gemmatalis nucleopolyhedronvirus, AngeNPV), NP_932654 (Choristoneura fumiferana DEF multiple nucleopolyhedronvirus, CfDEFMNPV), NP_203211 (Epiphyas postvittana nucleopolyhedronvirus, EppoNPV), YP_611070 (Antheraea pernyi nucleopolyhedronvirus, AnpeNPV), AFY63904 (Philosamia cynthia nucleopolyhedronvirus, PhcyNPV), YP_473294 (Hyphantria cunea nucleopolyhedronvirus, HycuNPV), NP_046206 (OpMNPV), YP_008378455 (C. rosaceana nucleopolyhedroviruses, ChroNPV), YP_008992195 (Choristoneura murinana nucleopolyhedroviruses, ChmuNPV), NP_848356 (Choristoneura fumiferana MNPV, CfMNPV), YP_008378605 (Choristoneura occidentalis nucleopolyhedroviruses, ChocNPV), AKN81064 (Lonomia obliqua multiple nucleopolyhedrovirus), AKR14189 (Dasychira pudibunda nucleopolyhedrovirus).
The authors are grateful for the Roche 454 sequencing services of the Core Facility Center of Wuhan Institute of Virology.
- 1. Possee RD. Baculoviruses as expression vectors. Curr Opin Biotechnol. 1997;8(5):569–72. pmid:9353228
- 2. Summers MD. Milestones leading to the genetic engineering of baculoviruses as expression vector systems and viral pesticides. Adv Virus Res. 2006;68:3–73. pmid:16997008
- 3. Martignoni ME, Iwai PJ. A Catalog of Viral Diseases of Insects, Mites, and Ticks In Microbial Control of Pest and Plant Diseases 1970–80. Academic Press Inc, London;1986. pp. 897–911
- 4. Lauzon HA, Lucarotti CJ, Krell PJ, Feng Q, Retnakaran A, Arif BM. Sequence and organization of the Neodiprion lecontei nucleopolyhedrovirus genome. J Virol. 2004;78(13):7023–35. pmid:15194779
- 5. Hayakawa T, Ko R, Okano K, Seong SI, Goto C, Maeda S. Sequence analysis of the Xestia c-nigrum granulovirus genome. Virology. 1999;262(2):277–97. pmid:10502508
- 6. Jehle JA, Blissard G, Bonning B, Cory J, Herniou E, Rohrmann G, et al. On the classification and nomenclature of baculoviruses: a proposal for revision. Arch Virol. 2006;151(7):1257–66. pmid:16648963
- 7. Carstens EB, Ball LA. Ratification vote on taxonomic proposals to the International Committee on Taxonomy of Viruses (2008). Arch Virol. 2009;154(7):1181–8. pmid:19495937
- 8. Hefferon KL, Oomens AG, Monsma SA, Finnerty CM, Blissard GW. Host cell receptor binding by baculovirus GP64 and kinetics of virion entry. Virology. 1999;258(2):455–68. pmid:10366584
- 9. IJkel WF, Westenberg M, Goldbach RW, Blissard GW, Vlak JM, Zuidema D. A novel baculovirus envelope fusion protein with a proprotein convertase cleavage site. Virology. 2000;275(1):30–41. pmid:11017785
- 10. Monsma SA, Oomens A, Blissard GW. The GP64 envelope fusion protein is an essential baculovirus protein required for cell-to-cell transmission of infection. J Virol. 1996;70(7):4607–16. pmid:8676487
- 11. Ardisson-Araújo DM, Melo FL, Clem RJ, Wolff JL, Ribeiro BM. A betabaculovirus-encoded gp64 homolog codes for a functional envelope fusion protein. J Virol. 2016;90(3):1668–72.
- 12. Ardisson-Araújo DMP, Pereira BT, Melo FL, Ribeiro BM, Báo SN, Paolo MdA, et al. A betabaculovirus encoding a gp64 homolog. BMC Genomics. 2016;17:94. pmid:26847652
- 13. Jehle JA, Lange M, Wang HL, Hu ZH, Wang YJ, Hauschild R. Molecular identification and phylogenetic analysis of baculoviruses from Lepidoptera. Virology. 2006;346(1):180–93. pmid:16313938
- 14. Garavaglia MJ, Miele SAB, Iserte JA, Belaich MN, Ghiringhelli PD. The ac53, ac78, ac101, and ac103 genes are newly discovered core genes in the family Baculoviridae. J Virol. 2012;86(22):12069–79. pmid:22933288
- 15. Herniou EA, Olszewski JA, Cory JS, O’Reilly DR. The genome sequence and evolution of baculoviruses. Annu Rev Entomol. 2003;48(1):211–34.
- 16. Rohrmann GF. Baculovirus molecular biology. 3rd ed. Bethesda (MD): National Center for Biotechnology Information (US); 2013.
- 17. Solovyev A.V. & Witt T.J. The Limacodidae of Vietnam. Entomofauna.2009; 16 (Suppl.):193–311.
- 18. Tang XC. A study on the NPV of Oxyplax ochracea and analysis of its DNA map with restriction endonucleases. Virol Sin. 1993;8(2):158–63.
- 19. Zhu Z, Yin FF, Liu XP, Hou DH, Wang J, Zhang L, et al. Genome sequence and analysis of Buzura suppressaria nucleopolyhedrovirus: a group II alphabaculovirus. PLoS ONE. 2014;9(1):e86450. pmid:24475121
- 20. Kool M, Voeten JT, Goldbach RW, Tramper J, Vlak JM. Identification of seven putative origins of Autographa californica multiple nucleocapsid nuclear polyhedrosis virus DNA replication. J Gen Virol. 1993;74(12):2661–8.
- 21. Hilton S, Winstanley D. The origins of replication of granuloviruses. Arch Virol. 2008;153(8):1527–35. pmid:18612587
- 22. Guarino LA, Gonzalez MA, Summers MD. Complete sequence and enhancer function of the homologous DNA regions of Autographa californica nuclear polyhedrosis virus. J Virol. 1986;60(1):224–9. pmid:16789259
- 23. Guarino LA, Summers MD. Interspersed homologous DNA of Autographa californica nuclear polyhedrosis virus enhances delayed-early gene expression. J Virol. 1986;60(1):215–23. pmid:16789258
- 24. Adam LV, Victor SM, George FR. Baculovirus DNA replication and processing. Curr Drug Targets. 2007;8(10):1096–102. pmid:17979669
- 25. Kool M, Ahrens CH, Goldbach RW, Rohrmann GF, Vlak JM. Identification of genes involved in DNA replication of the Autographa californica baculovirus. Proc Natl Acad Sci U S A. 1994;91(23):11212–6. pmid:7972036
- 26. Olson VA, Wetter JA, Friesen PD. Baculovirus transregulator IE1 requires a dimeric nuclear localization element for nuclear import and promoter activation. J Virol. 2002;76(18):9505–15. pmid:12186932
- 27. Slack JM, Blissard GW. Identification of two independent transcriptional activation domains in the Autographa californica multicapsid nuclear polyhedrosis virus IE1 protein. J Virol. 1997;71(12):9579–87. pmid:9371622
- 28. Kool M, Ahrens CH, Goldbach RW, Rohrmann GF, Vlak JM. Identification of genes involved in DNA replication of the Autographa californica baculovirus. Proc Natl Acad Sci U S A. 1994;91(23):11212–6. pmid:7972036
- 29. Crawford AM, Miller LK. Characterization of an early gene accelerating expression of late genes of the baculovirus Autographa californica nuclear polyhedrosis virus. J Virol. 1988;62(8):2773–81. pmid:3292791
- 30. Lu A, Miller LK. The roles of eighteen baculovirus late expression factor genes in transcription and DNA replication. J Virol. 1995;69(2):975–82. pmid:7815565
- 31. Morris TD, Todd JW, Fisher B, Miller LK. Identification of lef-7: a baculovirus gene affecting late gene expression. Virology. 1994;200(2):360–9. pmid:8178427
- 32. Chen CJ, Thiem SM. Differential infectivity of two Autographa californica nucleopolyhedrovirus mutants on three permissive cell lines is the result of lef-7 deletion. Virology. 1997;227(1):88–95. pmid:9007061
- 33. Gomi S, Zhou CE, Yih W, Majima K, Maeda S. Deletion analysis of four of eighteen late gene expression factor gene homologues of the baculovirus, BmNPV. Virology. 1997;230(1):35–47. pmid:9126260
- 34. Mitchell Jonathan K., Byers Nathaniel M., and Friesen Paul D.. Baculovirus F-Box protein LEF-7 modifies the host DNA Damage Response to enhance virus multiplication. J Virol. 2013: 87(23):12592–9. pmid:24027328
- 35. Wang J, Zhu Z, Zhang L, Hou DH, Wang ML, Arif B, et al. Genome sequencing and analysis of Catopsilia pomona nucleopolyhedrovirus: a distinct species in group I alphabaculovirus. PLOS ONE. 2016;11(5):e0155134. pmid:27166956
- 36. Guarino LA, Xu B, Jin JP, Dong W. A virus-encoded RNA polymerase purified from baculovirus-infected cells. J Virol. 1998;72(10):7985–91. pmid:9733837
- 37. Guarino LA, Dong W, Jin JP. In vitro activity of the baculovirus late expression factor LEF-5. J Virol. 2002;76(24):12663–75. pmid:12438592
- 38. Lin GY, Blissard GW. Analysis of an Autographa californica multicapsid nucleopolyhedrovirus lef-6-null virus: LEF-6 is not essential for viral replication but appears to accelerate late gene transcription. J Virol. 2002;76(11):5503–14. pmid:11991978
- 39. Yoo S, Guarino LA. The Autographa californica nuclear polyhedrosis virus ie2 gene encodes a transcriptional regulator. Virology. 1994;202(2):746–53. pmid:8030237
- 40. Carson DD, Guarino LA, Summers MD. Functional mapping of an AcNPV immediately early gene which augments expression of the IE-1 trans-activated 39K gene. Virology. 1988;162(2):444–51. pmid:3277328
- 41. Carson DD, Summers MD, Guarino LA. Molecular analysis of a baculovirus regulatory gene. Virology. 1991;182(1):279–86. pmid:2024466
- 42. Lu A, Miller LK. Differential requirements for baculovirus late expression factor genes in two cell lines. J Virol. 1995;69(10):6265–72. pmid:7666527
- 43. Thiem SM, Miller LK. Identification, sequence, and transcriptional mapping of the major capsid protein gene of the baculovirus Autographa californica nuclear polyhedrosis virus. J Virol. 1989;63(5):2008–18. pmid:2649691
- 44. Wang ML, Tuladhar E, Shen S, Wang HL, van Oers MM, Vlak JM, et al. Specificity of baculovirus P6. 9 basic DNA-binding proteins and critical role of the C terminus in virion formation. J Virol. 2010;84(17):8821–8. pmid:20519380
- 45. Ferrelli ML, Sciocco-Cap A, Berretta MF, Belaich MN, Ghiringhelli PD, Romanowski V. The baculoviral genome. INTECH Open Access Publisher; 2012.
- 46. Ke JH, Wang JW, Deng RQ, Wang XZ. Autographa californica multiple nucleopolyhedrovirus ac66 is required for the efficient egress of nucleocapsids from the nucleus, general synthesis of preoccluded virions and occlusion body formation. Virology. 2008;374(2):421–31. pmid:18241908
- 47. Passarelli AL, Miller LK. In vivo and in vitro analyses of recombinant baculoviruses lacking a functional cg30 gene. J Virol. 1994;68(2):1186–90. pmid:8289348
- 48. Ishihara G, Shimada T, Katsuma S. Functional characterization of Bombyx mori nucleopolyhedrovirus CG30 protein. Virus Res. 2013;174(1–2):52–9. pmid:23507307
- 49. Peng K, van Oers MM, Hu ZH, van Lent JW, Vlak JM. Baculovirus per os infectivity factors form a complex on the surface of occlusion-derived virus. J Virol. 2010;84(18):9497–504. pmid:20610731
- 50. Kikhno I, Gutiérrez Sn, Croizier L, Croizier G, Ferber ML. Characterization of pif, a gene required for the per os infectivity of Spodoptera littoralis nucleopolyhedrovirus. J Gen Virol. 2002;83(12):3013–22.
- 51. Sparks WO, Harrison RL, Bonning BC. Autographa californica multiple nucleopolyhedrovirus ODV-E56 is a per os infectivity factor, but is not essential for binding and fusion of occlusion-derived virus to the host midgut. Virology. 2011;409(1):69–76. pmid:20970820
- 52. Fang MG, Nie YC, Harris S, Erlandson MA, Theilmann DA. Autographa californica multiple nucleopolyhedrovirus core gene ac96 encodes a per os infectivity factor (PIF-4). J Virol. 2009;83(23):12569–78. pmid:19759145
- 53. Nie YC, Fang MG, Erlandson MA, Theilmann DA. Analysis of the Autographa californica multiple nucleopolyhedrovirus overlapping gene pair lef3 and ac68 reveals that AC68 is a per os infectivity factor and that LEF3 is critical, but not essential, for virus replication. J Virol. 2012;86(7):3985–94. pmid:22278232
- 54. Simón O, Palma L, Williams T, López-Ferber M, Caballero P. Analysis of a naturally-occurring deletion mutant of Spodoptera frugiperda multiple nucleopolyhedrovirus reveals sf58 as a new per os infectivity factor of lepidopteran-infecting baculoviruses. J Invertebr Pathol. 2012;109(1):117–26. pmid:22041202
- 55. Javed MA, Biswas S, Willis LG, Harris S, Pritchard C, van Oers MM, et al. Autographa californica multiple nucleopolyhedrovirus AC83 is a per os infectivity factor (PIF) protein required for occlusion-derived virus (ODV) and budded virus nucleocapsid assembly as well as assembly of the PIF complex in ODV envelopes. J Virol. 2017;91(5): e02115–16. pmid:28031365
- 56. Xiang XW, Chen L, Hu XL, Yu SF, Yang R, Wu XF. Autographa californica multiple nucleopolyhedrovirus odv-e66 is an essential gene required for oral infectivity. Virus Res. 2011;158(1–2):72–8. pmid:21440017
- 57. Sugiura N, Ikeda M, Shioiri T, Yoshimura M, Kobayashi M, Watanabe H. Chondroitinase from baculovirus Bombyx mori nucleopolyhedrovirus and chondroitin sulfate from silkworm Bombyx mori. Glycobiology. 2013;23(12):1520–30. pmid:24052236
- 58. Kawaguchi Y, Sugiura N, Kimata K, Kimura M, Kakuta Y. The crystal structure of novel chondroitin lyase ODV-E66, a baculovirus envelope protein. FEBS Lett. 2013;587(24):3943–8. pmid:24446551
- 59. Sugiura N, Setoyama Y, Chiba M, Kimata K, Watanabe H. Baculovirus envelope protein ODV-E66 is a novel chondroitinase with distinct substrate specificity. J Biol Chem. 2011;286(33):29026–34. pmid:21715327
- 60. O’Reilly DR. The baculoviruses. 1st ed. Springer; 1997.
- 61. O’Reilly DR, Miller LK, Luckow VA. Baculovirus expression vectors: a laboratory manual. 1st ed. Oxford University Press on Demand; 1994.
- 62. Hu ZH, Arif BM, Sun JS, Chen XW, Zuidema D, Goldbach RW, et al. Genetic organization of the HindIII-I region of the single-nucleocapsid nucleopolyhedrovirus of Buzura suppressaria1. Virus Res. 1998;55(1):71–82. pmid:9712513
- 63. Huang GH, Hou DH, Wang ML, Chen XW, Hu ZH. Genome analysis of Heliothis virescens ascovirus 3h isolated from China. Virol Sin. 2017;32:147–154. pmid:28382574
- 64. Benson G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999;27(2):573–80. pmid:9862982
- 65. Solovyev VV, Salamov AA. INFOGENE: a database of known gene structures and predicted genes and proteins in sequences of genome sequencing projects. Nucleic Acids Res. 1999;27(1):248–50. pmid:9847192
- 66. Hu ZH, Arif BM, Jin F, Martens JW, Chen XW, Sun JS, et al. Distinct gene arrangement in the Buzura suppressaria single-nucleocapsid nucleopolyhedrovirus genome. J Gen Virol. 1998;79(11):2841–51.
- 67. Thompson JD, Gibson T, Higgins DG. Multiple sequence alignment using ClustalW and ClustalX. Curr Protoc Bioinformatics; 2002 Aug; Chapter 2: Unit 2.3.
- 68. Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;30(12):2725–9. pmid:24132122
- 69. Sanderson MJ, Wojciechowski MF. Improved bootstrap confidence limits in large-scale phylogenies, with an example from Neo-Astragalus (Leguminosae). Syst Biol. 2000;49(4):671–85. pmid:12116433