Skip to main content
Advertisement
  • Loading metrics

Structures of nucleotide-bound Redondovirus Rep protein link conformation and function

?

This is an uncorrected proof.

Abstract

Circular Rep-encoding single-stranded DNA (CRESS-DNA) virus Rep proteins are multidomain enzymes that mediate viral DNA rolling-circle replication. Reps nick viral DNA to expose a 3’ end for polymerase extension, provide an NTP-dependent helicase activity for DNA unwinding, and join nicked ends to form circular viral genomes. Here, we present the first structures of a Rep protein from the Redondoviridae family, a newly discovered family of human-associated CRESS-DNA viruses that replicates within the oral protozoan Entamoeba gingivalis. Using cryo-EM, we characterized the hexameric structures of a Redondovirus Rep helicase bound with ATPγS, representing the initial ATP-bound state, and with ADP, reflecting the protein state after hydrolysis. The ADP state, but not the ATP state of Rep shows a staircase arrangement of DNA-binding loops that plays a central role in current models for SF3 helicase function. Additionally, we determined a head-to-tail dodecameric structure of ATPγS-bound Rep, in which both the helicase and endonuclease domains are ordered. Conservation of residues involved in stabilizing the dodecamer suggest that this assembly may be functionally relevant for many CRESS-DNA viruses. The positioning of endonuclease domains in the Rep hexamer, combined with our biophysical analyses of Rep oligomerization, provide new insights into Rep function during viral replication.

Author summary

We have used cryo-electron microscopy and solution biophysical techniques to understand the structure of the Rep protein from a Redondovirus, a CRESS DNA virus originally found in human respiratory samples. The small circular Rep-encoding single-stranded (CRESS) DNA viruses produce only two known proteins; one forms the viral capsid, and the other, called Rep, assists in replicating the genome. Rep has at least three distinct biochemical activities. It cleaves the genome to generate a single-strand break, unwinds the double-stranded DNA generated during replication by the host cell’s machinery, and rejoins the break to form a single-stranded DNA circle that is packaged into a new viral particle. Previous studies of Rep have revealed the structures of the isolated DNA-unwinding (helicase) and cleaving/rejoining (endonuclease) domains, but the structure of an entire Rep protein has been elusive. Here, we describe the structure of full-length Redondovirus Rep in an unexpected dodecameric assembly. Together with structures of the helicase and endonuclease domains, the models provide new insights into how Rep might coordinate its cleavage, unwinding, and rejoining activities during the viral replication cycle.

Introduction

The circular Rep-encoding single-stranded DNA (CRESS-DNA) viruses are ubiquitous small viruses of Eukarya and Archaea [1]. CRESS-DNA viruses have been isolated from a wide array of biomes, such as marine, plant, and human-associated communities [2]. Many CRESS-DNA viruses appear to be benign commensals, while others are important pathogens, such as porcine circoviruses and geminiviruses [1]. Redondoviruses are recently discovered CRESS-DNA viruses that were first identified in human respiratory samples [3,4] and subsequently shown to be widespread in human oral and lung samples [57]. Redondovirus recovery was increased in samples from human airway during acute illness, intubation and COVID19 infection [5,812]. Further studies showed that Redondovirus abundance was tightly correlated with that of the oral amoeba Entamoeba gingivalis [13,14]. A xenic culture of E. gingivalis was shown to harbor Redondoviruses, and Hi-C analysis showed that DNA of the two could be crosslinked, supporting E. gingivalis as the host of Redondoviruses [13].

According to present models, CRESS-DNA virus replication initiates following introduction of the viral genome into cells, where the single stranded viral DNA genome is copied into double stranded DNA by cellular enzymes [15,16]. The double stranded DNA is then the substrate for the viral-encoded Rep protein, which nicks the genome at extruded DNA cruciform structures formed at the origin of replication. Rep remains covalently joined to the DNA 5’-end through a phosphotyrosine linkage and the free 3’-end provides a substrate for a host cell DNA polymerase. The Rep protein is also an ATP-dependent DNA helicase, which is believed to assist in viral DNA replication via the rolling circle mechanism [17] and may also play a role in viral DNA packaging. Rep proteins are encoded by all CRESS viruses, and related Reps are encoded in some bacterial plasmids [1].

Despite the ubiquity and importance of CRESS-DNA viruses, understanding of CRESS Rep functional mechanisms is nascent. Rep has two functional domains (Fig 1A). An amino-terminal HUH endonuclease domain is responsible for origin recognition, cleavage, and re-ligation to form ssDNA circles [18]. The HUH domain is tightly coupled to an SF3 helicase domain that contains oligomerization and AAA + ATPase subdomains [19]. Structural models are available for related HUH endonuclease domains [2022] and one structure has recently been described for a CRESS-DNA Rep helicase [23]. The helicase model is a 3.8 Å cryo-EM structure of porcine circovirus type 2 Rep (pcRep) in an ADP-bound state. pcRep forms a hexamer with a narrow channel that can accommodate ssDNA, but not dsDNA. Two well-studied viral SF3 helicases are the large T antigen from Simian Virus 40 (SVTag) and the human papillomavirus E1 protein [24,25]. Compared to pcRep, the SVTag oligomerization subdomains form a larger channel that accommodates dsDNA, so that strand separation occurs between the domains or within the ATPase domains [26].

thumbnail
Fig 1. Properties of fbRep in solution.

A. Domain architecture of fbRep. Rep consists of an N-terminal HUH-endonuclease domain (ED) and a C-terminal SF3 helicase domain. The helicase domain is comprised of oligomerization and AAA + ATPase sub-domains. B. SV-AUC analysis of fbRep at 8.7 µM concentration. C. Size-exclusion chromatography in line with multi-angle light scattering (SEC-MALS) analysis of fbRep at 50 µM concentration. D. Mass photometry analyses of 100 nM apo-fbRep, and fbRep in buffers containing Mg2+, ATPγS + Mg2+, or ADP + Mg2+. Calculated masses of fbRep oligomers are monomer: 40.5 kDa; pentamer: 203 kDa; hexamer: 243 kDa; dodecamer: 486 kDa. All buffers contained 300 mM NaCl.

https://doi.org/10.1371/journal.ppat.1013997.g001

A recent cryo-EM analysis of SVTag reaction intermediates, combined with previous work in the same and related helicase systems, have established a mechanistic framework that is likely to be applicable to all hexameric helicases [2527]. In this “symmetric rotary mechanism”, DNA-binding β-turns of the ATPase subunits form a “spiral staircase” that promotes unidirectional movement of ssDNA through the central channel, and the complementary strand is extruded from the hexameric assembly. The staircase is dynamically manipulated by hydrolysis of ATP at the subunit interfaces. Indeed, the pcRep cryo-EM structure has a staircase arrangement of DNA-binding loops in support of this model [23]. Although the mechanism of ssDNA translocation by the conserved ATPase domains is expected to be similar among SF3 helicases, the functions of CRESS-DNA Rep proteins differ in many ways from the origin opening roles of SVTag and E1 [28,29], and the CRESS Rep functions remain incompletely understood.

Here we present a structural study of the Rep protein of a Vientovirus, a viral species in the Redondoviridae family [30]. Previously Vientovirus Rep protein was purified and shown to be capable of site-specific cleavage at a DNA stem-loop designed to mimic the inferred origin of viral replication [7]. We report that Rep protein from Vientovirus FB (fbRep, Fig 1A) is a tractable model for understanding CRESS-DNA virus Rep mechanisms at the structural level. We first established the oligomeric state of fbRep using biophysical methods, identifying pentamers, hexamers and dodecamers across a broad range of concentrations and ionic strengths. We then generated cryo-EM structures for hexameric fbRep bound with ATPγS at 2.36 Å and bound with ADP at 3.20 Å, revealing the allosteric changes in the ATPase sub-domains that link the two nucleotide bound states. We also report the structure of a novel dodecameric form of fbRep•ATPγS composed of head-to-tail hexamers where the HUH endonuclease domains of one hexamer are fully resolved. These models provide important new insights into Rep function and will facilitate the design of experiments to test Rep functions in CRESS-DNA virus biology.

Results

Oligomeric properties of Redondovirus Rep

In prior work, we established that recombinant full-length fbRep was capable of carrying out DNA nicking and joining activities in vitro [7]. However, those protein preparations were purified and stored in 1 M NaCl, which is poorly suited for structural studies. In addition, we found that fbRep exists primarily as pentamers in high salt by analytical ultracentrifugation (AUC) (S1 Fig). We therefore re-optimized our purification protocol and assessed the monodispersity of fbRep at salt concentrations closer to physiological levels using sedimentation velocity-AUC (SV-AUC) (Fig 1B). In 300 mM NaCl and at 8.7 µM concentration, we observed a single 9.0 S species with a mass of ~228 kDa (calculated monomer 40.5 kDa, pentamer 203 kDa, hexamer 243 kDa). Using size-exclusion chromatography in-line with multi-angle light scattering (SEC-MALS), the absolute mass determined was 224 kDa ± 5%, consistent with a mixture of pentamer and hexamer forms (Fig 1C).

We extended our analyses to include fbRep in the presence of ADP and ATP (or ATPγS) across a broad range of Rep concentrations. We first applied mass photometry to assess the impact of nucleotides on the oligomeric state of fbRep at physiological concentrations (Fig 1D). Hexamers are the predominant species at 100 nM concentration in the apo, Mg, ADP-Mg, and ATPγS-Mg forms, with evidence for dissociation into monomers and/or dimers that are more prevalent in the presence of nucleotides. To further probe the solution properties of nucleotide-bound fbRep, we employed size-exclusion chromatography in-line with synchrotron small-angle X-ray scattering (SEC-SAXS) (S2 and S3 Figs). The scattering profiles were analyzed using singular value decomposition with evolving factor analysis (SVD-EFA), which identifies the scattering contributions from the major species present in solution [31]. For apo-fbRep and fbRep•ADP at ~100 µM concentration, we found evidence for both Rep hexamers and a larger species whose mass corresponds to a dimer of Rep hexamers (Table 1). We also found that apo-Rep vs. Rep•ADP hexamers show distinct P(r) profiles, indicating changes in quaternary structure when nucleotide binds (S2 Fig). For fbRep•ATP at the same concentration, we observed only the larger, dodecameric species (Table 1).

thumbnail
Table 1. Properties of Rep derived using SEC-SAXS with SVD-EFA analyses.

https://doi.org/10.1371/journal.ppat.1013997.t001

Cryo-EM structure of Redondovirus Rep bound by ATPγS

To obtain the structure of fbRep in the ATP-bound state, we prepared Rep complexes using the non-hydrolyzable analog ATPγS. Preliminary experiments indicated that fbRep was prone to aggregation at salt concentrations less than 300 mM. In 300 mM NaCl, cryo-EM micrographs displayed fbRep oligomers as well-defined and easily recognizable particles. Single particle analysis of fbRep•ATPγS revealed a diverse composition of pentamers, hexamers, and double hexamers (S4 Fig). Hexamers were the most prevalent form observed (examples in Fig 2A), followed by pentamers.

thumbnail
Fig 2. Cryo-EM structure of ATPγS-bound fbRep hexamer.

A. Representative 2D class averages of fbRep hexamers. Diffuse density for the endonuclease domains (ED) is indicated in side and tilted views. B. Orthogonal views of reconstructions for the ATPγS-bound fbRep hexamer are shown on top with corresponding cartoon representations below. Bound ATPγS is drawn in yellow. Oligomerization (OD) and ATPase (AD) subdomains of Rep are indicated. The endonuclease domains could not be fit. C. Views of the fbRep•ATPγS helicase pore channel. Blue density in a cutaway of the reconstruction may represent nucleic acid captured during expression and purification. A solvent-accessible profile of the channel reveals a uniform pore with a relatively narrow proximal pore through the oligomerization domains that gradually widens to the distal pore at the bottoms of the ATPase domains. D. The ATP binding site is located between ATPase domain subunits. E. Rotated view of the ATP binding site showing coordination of magnesium ion and surrounding water molecules.

https://doi.org/10.1371/journal.ppat.1013997.g002

Three-dimensional classification of the hexameric particles led to the reconstruction of two distinct fbRep hexamers (S5 Fig and Table 2). The first was reconstructed from two similar hexamers composed of 661,708 particles. High-resolution 3D refinement, without internal symmetry imposed, resulted in a 2.36 Å reconstruction of a hexamer that features highly resolved helicase domains with no visible endonuclease domain density (Fig 2A). The second hexamer reconstruction featured visible, but diffuse endonuclease domain density. This map, however, refined to lower resolution (3.19 Å) with an endonuclease domain density that could not be interpreted.

The 2.36 Å cryo-EM map allowed for the facile building and refinement of an atomic model revealing a six-fold symmetric ring assembly of oligomerization and ATPase sub-domains with overall dimensions of ~72 x 112 Å (Fig 2B). The six nucleotide binding sites located between adjacent ATPase subunits are fully occupied and could be fit with high fidelity. The map also revealed weak density in the central pore formed by the oligomerization subdomains (Fig 2C). A similar density was observed in the pcRep•ADP cryo-EM map [23], but in that case the additional density extended well into the ATPase subdomains and could be modeled as heterogeneous ssDNA that was carried over from E. coli expression and purification. The density in fbRep•ATPγS is also likely to be E. coli-derived nucleic acid but is present at a lower occupancy and we did not attempt to model it. The pore formed by the oligomerization subdomains is ~ 12 Å in diameter and remains constricted into the proximal pore formed by the ATPase subdomains. The channel opens at the distal end of the ATPase hexamer, with a diameter of ~23 Å (Fig 2C).

The high resolution fbRep•ATPγS structure provides a detailed model for the ATP-specific interface formed between adjacent helicase subunits. These interactions, along with alternative interactions in the corresponding ADP-specific interface, control the ATPase domain movements and consequent positioning of the DNA-binding loops responsible for staircase formation and helicase activity [25,26]. A close-up view of the ATP interface between subunits A and B is shown in Fig 2D; the other five interfaces are nearly identical. The triphosphate of ATPγS is buried in the inter-subunit interface, along with a magnesium ion and several tightly bound water molecules (Fig 2E). The adenosine fragment is nestled between two loops and an α-helix of adjacent subunits and is partly solvent exposed. ATP binding to fbRep resembles a key inserted into a keyhole, with the triphosphate ‘key’ matching the internal lock mechanism of the ATPase domains. In addition to interactions involving the canonical Walker A motif (Thr179, Gly180, Gly182, Lys183, and Thr184) and the arginine finger (Arg291, Arg292), the Lys185 amide, the Tyr241 hydroxyl, and the Tyr167 hydroxyl also make hydrogen bonded interactions to ATPγS. The buried Mg2+ ion is coordinated by the β- and γ-phosphates of ATPγS, Thr184 hydroxyl, as well as three water molecules. Details of fbRep interactions with ATPγS and Mg2+ are provided in S6 Fig.

The DNA-binding β-turns of the fbRep•ATPγS structure do not form a spiral staircase arrangement; they are approximately co-planar and share the C6 symmetry of the oligomerization subdomain. A similar arrangement was observed in the SVTag•ATP hexamer structure [27] and is likely due to the rigidity of the ATP interfaces combined with the lack of a ssDNA substrate to provide a staircase interaction template. To quantitate this structural feature and provide a means of comparison to other structures, we transformed the fbRep hexamer coordinates to an inertial frame with the Cartesian Z axis coincident with the symmetry axis defined by the oligomerization subdomains. The z-coordinates of Gly249 in the β-turn ‘step’ then provide the helical pitch, if any, of the staircase (Gly249 in fbRep is the structural homolog of Gly240 in pcRep and His513 in SVTag). This analysis for fbRep•ATPγS confirms that the DNA-binding loops are co-planar (Table 3).

thumbnail
Table 3. Helical parameters for fbRep and other SF3 family helicases.

https://doi.org/10.1371/journal.ppat.1013997.t003

Cryo-EM structure of Redondovirus Rep bound by ADP

We prepared ADP-bound fbRep in a similar manner to ATPγS-bound fbRep and collected images for single particle analysis. In the ADP-bound form, hexamers were also the most abundant Rep oligomers observed in 2D class averages (Fig 3A). The fbRep•ADP structure was determined using 312,272 particles with an overall resolution of 3.20 Å (Table 2 and S7 Fig). Overall, the ADP-bound hexamer resembles the ATPγS form (Figs 3B and S8), but closer inspection reveals that the similarity is primarily confined to the oligomerization domains, which are nearly superimposable with an r.m.s. difference of 0.83 Å. The quaternary structure of the ATPase domains is different in the ADP-bound structure, with an increased distal pore diameter of 45 Å the most conspicuous feature (Figs 3C and S9).

thumbnail
Fig 3. Cryo-EM structure of ADP-bound fbRep hexamer.

A. Representative 2D class averages with diffuse density for the endonuclease domains indicated. B. Orthogonal views of the ADP-bound fbRep hexamer reconstruction with ADP density shown in lime green. Corresponding ribbon representations are shown below. The endonuclease domains could not be fit. C. Views of the pore channel. Pink density in the oligomerization domain channel is likely nucleic acid captured during expression and purification. The solvent-accessible profile of the channel generated using MOLEonline reveals a varying pore channel different than the pore channel profile of fbRep•ATPγS. The variations observed can be attributed to varying degree of ATPase domain movements of the helicase subunits. D. Examples of two of the six unique ADP binding sites. Although the diphosphate-Rep interactions are maintained in each interface, the remaining interactions differ with fewer total interactions compared to ATPγS. E. Spiral staircase formed by the DNA-binding loops of the ATPase domains. The oligomerization domains have been cut away for clarity. View is down the symmetry (Z) axis. The loops form a right-handed downward spiral going from subunit A to F (see also Table 3). F. Superposition of the six oligomerization domains. A successive rotation of ATPase domains A-F results in movement of the DNA-binding loops downwards and movement of the distal β-hairpins outwards. The sixfold symmetry axis is aligned with Cartesian Z as indicated.

https://doi.org/10.1371/journal.ppat.1013997.g003

ADP ligands could be readily fit into the six nucleotide binding site densities. ADP binds primarily to one subunit, where the same interactions are formed in each case. There are fewer interactions formed with neighboring subunits compared to ATPγS and those interactions vary among the six interfaces (Figs 3D and S10). The ADP interface contributes weaker constraints on the quaternary structure of the hexamer, allowing the ATPase domains to undergo small, successive rotations from subunit A to subunit F. These rotations generate a helical arrangement of DNA-binding loops around the pseudo-symmetry axis of the hexamer (Fig 3E and 3F). To simplify analysis of the helical nature of the DNA-binding loops, we transformed the coordinates to align the symmetry axis of the hexamer with the Cartesian Z axis, as described above for the ATPγS hexamer. The Z-coordinates of the DNA-binding loops in the ADP complex show a clear spiral staircase arrangement, with the loops spanning 8 Å in the Z direction and an average helical step size of 2.3 Å (Table 3). The ATPase subunits rotate about an axis in the X-Y plane that passes close to the Cα atom of Tyr199, moving the DNA-binding loops away from the oligomerization domain pore (i.e., lowering the z coordinate) and expanding the distal pore formed by the ATPase domains (Figs 3F and S9). The staircase arrangement of DNA-binding loops decreases the proximal pore size by about 2 Å, going from the symmetric ATPγS structure to the asymmetric ADP form (Table 3).

A similar staircase configuration was observed in the pcRep•ADP structure [23], where we have calculated a 12 Å z-range, a 2.6 Å average helical step, and a similar pore size (Table 3). Unlike the pcRep structure, however, we do not observe strong density within the ATPase subunit channel that could be interpreted as ssDNA. Thus, the helical staircase arrangement of subunits can be formed by the ADP-bound helicase alone and does not require a ssDNA substrate. For comparison, we analyzed the helical parameters and pore size of the SVTag•DNA•AMPpNp structure [26]. In that complex, the synthetic DNA substrate was present at full occupancy, and the helical z-range was larger (14 Å) with a helical step of 2.9 Å (Table 3). The helical pore size and the average domain rotations were similar to those calculated for fbRep•ADP and pcRep•ADP. The HPV-E1•ADP•DNA complex [25] also shows similar parameters, although for both the SVTag and E1 complexes the domain rotations are more complex (Table 3).

Cryo-EM structure of a Redondovirus Rep double hexamer

Although hexamers were the most abundant oligomers observed on cryo-EM grids, we also observed larger particles that could be interpreted as two Rep hexamers (Fig 4A), consistent with the higher mass species observed in SEC-SAXS experiments (Table 1). Because the number of dodecamer particles was smaller than that available for the fbRep hexamer structures, we carried out reconstructions applying C1 and C6 symmetry to ATPγS-bound fbRep dodecamers (Table 2 and S11 and S12 Figs). A symmetric reconstruction with 28,346 particles led to a readily interpretable structure with an overall resolution of 2.95 Å. The same reconstruction carried out without symmetry constraints showed an indistinguishable, but lower quality structure at 3.32 Å.

thumbnail
Fig 4. Cryo-EM structure of fbRep•ATPγS double hexamer.

A. Representative 2D class averages of fbRep dodecamers. Diffuse density for the endonuclease domains is found only at one end of the particles. B. Reconstruction and cartoon representation of the fbRep dodecamer (C1 symmetry). All three domains (ED, OD, AD) are well-defined in fbRep1. The endonuclease domain is poorly ordered in fbRep2, similar to what is seen in the fbRep•ATPγS hexamer. C. Closeup of the interface formed by adjacent endonuclease domains in fbRep1. Few polar or hydrophobic interactions are formed between subunits. D. The distal β-hairpins from fbRep2 (shown in green) interact with the endonuclease domains of fbRep1. E. Closeup of a distal hairpin-ED interaction. Glu279 in the hairpin acts as a phosphate mimic, coordinating magnesium in the endonuclease active site. Gly278 is essential for this interaction because a side chain would be sterically disallowed.

https://doi.org/10.1371/journal.ppat.1013997.g004

The two hexamers in the fbRep dodecamer associate in a head-to-tail fashion, with a clearly resolved endonuclease domain (ED) ring of one Rep (fbRep1) interacting with the helicase domains of a second Rep (fbRep2, Fig 4B). The EDs of fbRep2 have diffuse density similar to that observed for isolated fbRep hexamers. Since the ED density for fbRep1 was easily interpretable, we built the domains manually, including well-defined active site magnesium ions. To provide an independent experimental model of the fbRep endonuclease domain, we crystallized it, measured diffraction data to 1.8 Å resolution, determined the structure using molecular replacement, and refined the model with Rwork/Rfree of 0.18/0.22 (S1 Table and S13 Fig). The fbRep1 EDs superposed onto the high resolution ED with r.m.s.d. values of 1.13-1.34 Å across residues 2–109. The ED loops connecting strands β3 and β4 and the loops connecting α3 and β7 adopt different conformations in the fbRep hexamer compared to the isolated domain, explaining the higher than expected r.m.s.d. values.

The interfaces between EDs in the fbRep1 ring lack the extensive contacts normally associated with an oligomeric protein. A salt bridge between Arg4 and Glu19 and a hydrogen bond between Gln6 and Lys47 provide the only polar interactions, and the remainder is comprised of van der Waals contacts with little hydrophobic surface buried (Fig 4C). The lack of a substantial interface between subunits may explain why the EDs in fbRep hexamers do not form well-defined oligomeric rings, despite being pre-organized to do so by the hexameric helicase domains. The ED hexamer in fbRep1 is instead stabilized through interactions with the distal β-hairpins of the ATPase subdomains from fbRep2 (Fig 4D). Each β-hairpin binds to the ED β-sheet, with the nuclease active site at the center of the interface (Fig 4E). Two β-hairpin residues appear to play key roles. Glu279 functions as a phosphate mimic, coordinating the active site magnesium ion with its carboxylate side chain, and Gly278 allows a close approach of the β-hairpin to the ED β-sheet. Direct interactions with the peptide backbone of the β-hairpin, together with hydrophobic and polar interactions that could be performed by alternative side chains provide additional, but less specific contacts that flank the active site.

Gly278 is invariant among the redondoviruses and Glu279 is strongly conserved, but replaced by Gln or Asn in some cases, either of which could coordinate a carbonyl oxygen to the active site magnesium ion in the endonuclease domain interaction. An alignment of several Redondovirus Rep sequences is provided in S14 Fig. The conservation of Gly278 is particularly compelling because its backbone dihedral angles are within the allowed region for β-strands, implying that it is the lack of a side chain that is most important in this position and that the head-to-tail rep dodecamer is therefore likely to be functionally relevant. Not all CRESS-DNA Rep proteins are expected to have this interaction; the circoviruses and geminiviruses, for example, lack the distal β-hairpin motif. Other viral Reps could, however, form alternative interactions to form head-to-tail hexamer interactions.

We considered two approaches to relate the full-length fbRep structure represented by fbRep1 (Fig 5A) to isolated Rep hexamers, where the EDs adopt an ensemble of configurations. The first was to calculate predicted scattering profiles for fbRep1 and compare to the hexameric component from SEC-SAXS analysis of the fbRep•ATP complex. However, fbRep•ATP exists primarily in the dodecamer form at the concentrations required for SAXS analysis and the hexameric scattering component is too weak to reliably model. We therefore attempted to model the SAXS profiles from fbRep•ADP and apo-fbRep hexamers (S15 Fig). The fit to fbRep•ADP was reasonable (χ2 = 2.1) but the fit to apo-fbRep was poor (χ2 = 4.8), as expected based on the conformational changes that occur upon nucleotide binding (Table 1). The second approach was to dock fbRep1 into the density of the fbRep•ATPγS cryo-EM structure contoured at σ = 1. As shown in Fig 5B, the ED ring occupies the diffuse toroidal density observed in the isolated hexamer. Thus, the arrangement of EDs observed in fbRep1 is likely to be present in the isolated hexamer, but as an ensemble of closely related configurations due to the weak intersubunit contacts that are formed.

thumbnail
Fig 5. Structure of the Rep hexamer.

A. Ribbon representation of Rep as seen in fbRep1 within the fbRep•ATPγS dodecamer. The orthogonal view (right) shows the external face of the endonuclease domains, where a large bowl-shaped cavity is formed by the ED ring. The proximal pore formed by the oligomerization domains is visible at the bottom of the cavity. Magnesium ions are drawn as red spheres. B. Superposition of fbRep1 on the 2.36 Å fbRep•ATPγS hexamer map, contoured at σ = 1 to emphasize the diffuse endonuclease domain density. C. Model of the fbRep hexamer with origin ssDNA nonanucleotide bound to each ED. The ED cavity is large enough to accommodate more than one origin stem-loops, but not six as shown. The model is based on the PCV2 endonuclease domain-ssDNA complex structure [20].

https://doi.org/10.1371/journal.ppat.1013997.g005

The cavity formed by the hexameric ED ring is shaped like a large bowl that could accommodate the DNA stem-loop substrate that is recognized and cleaved by Rep (Fig 5A). To verify this, we generated a model of ssDNA-bound fbRep1 using the ED•ssDNA structure reported for pcRep to place the DNA [20]. Although the fbRep and pcRep EDs share minimal sequence identity overall, the residues involved in origin recognition and the origin sequences themselves are similar. As shown in Fig 5C, more than one DNA stem-loop could bind to and be cleaved within the bowl-shaped ED cavity as observed in fbRep1. The inward-facing ED active sites should therefore have access to either an external origin sequence, an origin sequence translocating into the Rep helicase channel, or an origin sequence on the non-translocating strand that is separated by the helicase. Since Rep is expected to translocate ssDNA through the oligomerization and ATPase domain pores in the N-terminal to C-terminal direction [26,32], separation of dsDNA should occur within the ED bowl where both strands can be accessed by ED active sites. These arguments assume that the same Rep hexamer is responsible for cleaving the origin sequence and translocating along ssDNA. However, it is also possible that different Rep molecules, possibly with distinct oligomeric forms, are responsible for the two activities.

Although the fbRep dodecamer shown in Fig 4 might be expected to represent two SF3 helicases that work in tandem to translocate along ssDNA, the structure suggests that this is unlikely to be the case. The distal β-hairpins of fbRep2 lock the helicase domains into a symmetric ATP configuration and the domain rotations associated with helicase function upon ATP hydrolysis appear unlikely. Similarly, the endonuclease domains of fbRep1 are inactivated by binding of the distal β-hairpins from fbRep2. Thus, the dodecamer may instead represent a form of Rep with one functional endonuclease hexamer (on fbRep2) and one functional helicase hexamer (on fbRep1). Assuming that ssDNA can freely translocate through the wider (ATP state) proximal pore of fbRep2, the dodecamer could function as a highly processive form of the Rep enzyme.

Discussion

CRESS-DNA viruses are ubiquitous on Earth, but the function of their encoded replication initiation proteins—Reps—has been elusive. Here we take advantage of the Redondovirus system to develop multiple views of Rep function, including the first experimental structure of a full-length Rep hexamer. Both biophysical and cryo-EM studies support a hexameric assembly as a major Rep form, but Rep can readily interconvert between pentamers and hexamers, as well as form dodecameric assemblies. Each of these forms may have functional roles.

The cryo-EM structures of ATPγS and ADP-bound fbRep allowed us to compare these states and relate the results to current models for SF3 helicase function. We find that fbRep•ATPγS adopts a symmetric configuration without a helical staircase arrangement of DNA-binding loops. This preference can be explained by optimization of quaternary interactions involving arginine finger motifs interacting with the γ-phosphate of ATPγS. A similar structure was reported for SVTag bound to ATP [27]. In the ADP-bound state, fbRep adopts a conformation where the ATPase domains rotate downwards by ~4° successively around the hexamer, resulting in a helical arrangement of DNA-binding loops and an opening of the helicase distal pore. Similar structures have been observed in other SF3 helicases, but those have been with DNA bound [23,25,26]. The fbRep•ADP structure indicates that ssDNA interactions are not required for the staircase structure to be formed and that this configuration may be preferred in the ADP-bound state. The SVTag•ADP hexamer crystallized in a nearly symmetric configuration, arguing that both forms are energetically accessible to the helicases.

Current models of SF3 helicase function do not involve all-ATP or all-ADP bound states [25,26]. Reaction intermediates of active SVTag•DNA complexes trapped by freezing on cryo-EM grids indicate that the AB and BC subunit interfaces are ATP-bound (least rotated) and the remaining interfaces have ADP or no nucleotide bound (more rotated) [26]. ATP hydrolysis and phosphate release at AB and BC generate ADP sites and ATP binding to DE and FA generate ATP sites, with the associated intersubunit remodeling and domain rotations to generate cyclic permutations of the staircase configuration. These cyclic motions are coupled to unidirectional ssDNA diffusion along the helical ratchet formed by the DNA-binding loops. Since it can be difficult to compare the structural results from different studies, we used simple geometric parameters to describe the helical nature and pore dimension of fbRep and compared the results to other SF3 helicases (Table 3). This quantitative description should be generally useful to other hexameric AAA + ATPases, provided a reference symmetry axis can be suitably defined. Thus, the fbRep complexes described here provide support for emerging models of SF3 helicase function and establish Redondovirus Rep as a useful model system for studying these enzymes. We note, however, that the fbRep structures are without a high-occupancy DNA substrate bound, making comparisons to translocating experimental systems difficult.

The structure of the head-to-tail dodecameric fbRep complex provides new insights into Rep structure and function. Because the endonuclease domains are sequestered into a well-defined hexameric ring, the fbRep dodecamer provides a model for the full-length viral Rep hexamer. Since the fbRep1 hexamer observed in the dodecamer fits well within the isolated fbRep•ATPγS hexamer density, we suggest that fbRep1 represents a functional member of an ensemble of similar structures present in isolated hexamers.

The inward facing active sites of the endonuclease domains may be functionally important in the initiation of replication and in formation of ssDNA genome circles. When Rep initially cleaves the nonanucleotide sequence at the stem-loop viral replication origin, a 5’-phosphotyrosine linkage is formed to the DNA. Once a new copy of the origin is available following polymerase extension of the 3’-end, a second ED can cleave this sequence in the same manner, releasing a 3’-end within the ED ring cavity. The 3’- and 5’- ends of the ssDNA genome would therefore be in proximity and an intact DNA circle can be formed and released by attack of the original 5’-linkage by the new 3’-end. The endonuclease arrangement we observe could also explain how stem-loop destabilization might occur as described for the ‘RCR melting pot’ model of CRESS-DNA viral replication [33]. In the melting pot model, both stem-loops of the supercoiled genome origin are engaged in a destabilized initiation complex where template switching can occur once Rep nicks the DNA. The six DNA-binding surfaces lining the bowl-shaped ED cavity in fbRep1 might be well suited for this mechanism.

The primary interactions we observe between Rep hexamers in the dodecamer involve solvent-exposed β-hairpins emerging from the distal pore of the helicase and the active sites of the endonuclease domains. Conservation of the key residues involved suggests that dodecamers may play a functional role in redondovirus biology and for many other CRESS-DNA viruses. It is not yet clear what that role may be, but one possibility is that a head-to-tail dodecamer could simply be a more processive version of hexameric Rep. This could be important since we know that Rep subunits readily dissociate in solution. The pentamer-hexamer transition could be used for loading ssDNA into the helicase pore, but the same transition could result in premature dissociation. Formation of the presumably more stable dodecamer on a ssDNA substrate could inhibit dissociation and result in higher processivity. There is precedent for two SF3 hexamers working together on a single DNA substrate. SVTag and HPV-E1 both form head-to-head dimers of hexamers at the replication origin and melt the DNA to initiate bidirectional DNA replication in their eukaryotic hosts [34,35].

Like SVTag and HPV-E1, the AAV2 Rep68 isoform also has SF3 helicase activity but with more complex oligomeric properties. Rep68 forms double octamers on ssDNA and dsDNA with a 3’-extension [36], but cryo-EM analysis has revealed that the helicase domains form a heptameric ring on the ssDNA despite the octameric OBD [37]. When ATP binds to this complex, both hexameric and heptameric helicase rings are observed with increased domain mobility and heterogeneity. Rep68 also forms heptameric HD rings on dsDNA when binding to an origin of replication [38]. In this case, the OBD arrangement is quite different, where some subunits bind specifically to the origin sequence and the remainder form associated dimer interactions. Oligomeric heterogeneity has been suggested to play a role in Rep68 function, for example in facilitating helicase loading onto DNA [37,38]. These findings support the idea that the oligomeric heterogeneity we observe for fbRep could also be functionally relevant.

A primary goal of this work has been to use Rep structural models to gain insight into viral replication and DNA packaging. Published models of viral Rep function are mostly schematic in nature and are generalized versions of rolling circle replication from plasmid and related viral systems. The lack of mechanistic models has been partly due to the lack of structural data for CRESS-DNA Reps. Although a number of endonuclease structures have been reported, the only structural data available for hexameric CRESS-DNA Rep helicases has been the recent cryo-EM structure of pcRep bound to ADP. The more distantly related AAV2 Rep has been well-studied, with structural models available, but the oligomeric states, replication mechanism of the virus and nature of the replication origin are all different than that of Redondovirus Rep. The same limitations exist at the biochemical level. Origin recognition and cleavage by Rep have been demonstrated for several CRESS-DNA systems, including fbRep [7]. However, demonstration of helicase activity has been more difficult [39] and how that helicase activity is used in replication is unclear.

A simple but important mechanistic question is which DNA strand is translocated by Rep. If we assume that fbRep translocates in the 3’-5’ direction as found experimentally for other CRESS DNA viral Reps [3941] and AAV2 Rep68 [42] and as expected for an SF3 family helicase, two models can be considered (Fig 6). If Rep engages the (+) strand that it has just cleaved, a dead-end complex is formed on the 3’-side of the nick, since Rep would be bound at the attached 5’-end, with nothing to translocate (Fig 6A, top). If Rep engages the (+) strand on the 5’-side of the break, translocation could occur in the opposite direction from replication, and it is not clear how this would be productive (Fig 6A, bottom). A more plausible mechanism (Fig 6B) is similar to that proposed for PCV2 Rep [43], where Rep engages the (-) strand, perhaps the destabilized (-) strand stem-loop formed during origin recognition [33]. Rep could then move ahead of the replication fork and strand separation would occur within the endonuclease cavity. When a newly synthesized (+) strand origin becomes available, it could be recognized and cleaved by one of the five untethered EDs and a new genome circle could be formed by trans-esterification at the original 5’-linkage. Experiments to test this mechanism and alternative mechanisms can be more readily considered, designed and modeled with the structures reported here.

thumbnail
Fig 6. Possible mechanisms of Rep-assisted viral replication.

(A) Rep recognizes the (+)-strand origin in the dsDNA viral genome that has been generated and supercoiled by the host. The origin is shown as a cruciform structure here, but may adopt alternative forms. The (-)-strand may also be engaged by the Rep hexamer as suggested [33] but that interaction is not shown here. Rep cleaves the ssDNA to form a covalent 5-phosphotyrosine linkage and release a free 3’-end that is extended by host polymerase. Rep could load on the (+)-strand after the nick (top view), resulting in a helicase dead-end. Loading onto the (+)-strand before the nick results in opposing helicase and replication directions. (B) If Rep loads onto the (-)-strand, the DNA can be unwound to provide (+)-strand ssDNA. When a new origin sequence is encountered, Rep could cleave it and catalyze ligation of the extruded DNA to produce ssDNA viral genomes. Rep is represented by a hexamer here for simplicity, but the oligomeric forms responsible for cleavage and strand separation have not yet been demonstrated. It is also not known if cleavage and helicase functions are carried out by the same Rep oligomer. 5’-covalent linkages to Rep are indicated by red spheres. The direction of polymerase extension and Rep translocation are indicated by arrows. The pathway shown in (B) is a modification of that suggested by Cheung [43].

https://doi.org/10.1371/journal.ppat.1013997.g006

There are several limitations of the work described here. It would be valuable to complement the structural studies with further functional studies and efforts are underway on several fronts. It has not yet been possible to grow Redondoviruses experimentally in their natural amoeba hosts, but in the future, it would be valuable to generate predictions from in vitro data and confirm them with in vivo tests. Developing a robust in vitro helicase assay, perhaps coupled to origin cleavage and replication, would also be important to test mechanistic models. We have so far been unable to demonstrate helicase activity with full-length Redondovirus Rep protein using standard flap and fork substrates. Since helicase loading may be coupled to origin recognition and cleavage, more specific substrates may be required that have the sequences and properties of native viral DNAs.

In summary, we present the first high resolution structures for Redondovirus Rep proteins, which provide a tractable model for studies of Rep proteins broadly. We report that conformational differences between ATPγS and ATP-bound fbRep strongly support models of SF3 helicase function and the structure of full-length fbRep and dodecameric fbRep provide key insights into Rep function. Going forward, it will be valuable to generate structures for additional functional forms, such as Rep covalently bound to DNA, and develop diverse in vitro assays of function.

Methods

Expression and purification of Rep

The DNA sequence of Vientovirus FB Rep (fbRep) (GenBank MK059763.1) was codon-optimized for bacterial expression and cloned into a pCDFDuet expression vector (Novagen) containing an N-terminal His7-Flag-SUMO tag. The fbRep endonuclease domain 2–114 (fbRep-ED) was amplified by PCR and cloned into the same vector. Full-length fbRep and fbRep-ED were expressed in HI-Control BL21(DE3) (Lucigen) using 2xYT medium supplemented with 2 mM MgSO4 and 50 μg/mL streptomycin at 37°C until an optical density at 600 nm of 1.8-2.2 was reached. The temperature was reduced to 20°C and expression was induced with 0.1 mM isopropyl-β-d-1-thiogalactopyranoside (IPTG) for 5 hours. Cells were harvested by centrifugation and frozen at -80°C.

Cell pellets of fbRep were resuspended in lysis buffer (20 mM NaKPO4 pH 7, 500 mM NaCl, 10 mM imidazole, 10mM β-mercaptoethanol (β-ME), 1X Protease Inhibitor) and lysed mechanically using an Emulsiflex-C3 high pressure homogenizer (Avestin, Inc). The clarified supernatant was loaded onto 5 mL of Ni-NTA Superflow Resin (Qiagen) equilibrated with 10 column volumes of lysis buffer. Ni-bound Rep was washed with 10 column volumes of 20 mM NaKPO4 pH 7, 500 mM NaCl, 20 mM imidazole, 10 mM β-ME and then eluted in 30 mL of 20 mM NaKPO4 pH 7, 500 mM NaCl, 250 mM imidazole, 10 mM β-ME. The His7-Flag-Sumo affinity tag was cleaved by ULP1 protease (Life Sensors) while dialyzing against 20 mM Tris-HCL pH 7.4, 500 mM NaCl, 10 mM β-ME. The cleaved affinity tag was removed from solution by a second Ni-NTA column. Rep was further purified by anion exchange using a Source 15Q column (GE Healthcare) with a sodium chloride gradient and size exclusion chromatography using Superdex 200 (GE Healthcare) and Superose 6 Increase 10/300 GL (GE Healthcare) columns at room temperature in 20 mM HEPES–NaOH pH 7.5, 300 mM NaCl, 0.1mM TCEP.

fbRep-ED was purified using Ni-NTA chromatography and Ulp1 cleavage as described, except that chromatography buffers contained 300 mM NaCl, and the dialysis buffer contained 100 mM NaCl. fbRep-ED was further purified by cation exchange chromatography using heparin Sepharose (Cytiva) with a NaCl gradient at pH 7.4 and a Ni-NTA flow-through step to remove uncleaved fusion protein. fbRep-ED was further purified by size-exclusion chromatography using a Superose 6 Increase 10/300 GL column (GE Healthcare) at room temperature in 20 mM HEPES pH 7.5, 150 mM NaCl, 0.1mM TCEP.

Proteins were concentrated using Millipore concentrators, concentrations determined using UV absorption at 280 nm, and flash-frozen in liquid nitrogen after addition of glycerol to 10%. Aliquots were stored at - 80°C.

Size-exclusion Chromatography in line with Multi-angle Light Scattering (SEC-MALS)

SEC-MALS experiments were performed on 3–6 mg/mL full-length Rep protein with a Superose 6 10/300 GL column (GE Healthcare) at 0.5 ml/min at room temperature in buffer containing 20 mm HEPES-NaOH pH 7.5, 300 mm NaCl or 1 M NaCl, and 0.1mM TCEP. Absolute molar mass of fbRep was determined in-line using multi-angle light scattering from the column eluent at 18 angles using a DAWN-HELEOS II MALS detector (Waters/Wyatt Technology Corp., Santa Barbara, CA, USA) operating at 658 nm. Protein concentration of the eluent was determined using an in-line Optilab T-rEX Interferometric Refractometer (Waters/Wyatt Technology Corp.). The weight-averaged molar masses of species within defined chromatographic peaks were calculated using the ASTRA software version 8.2.1 (Waters/Wyatt Technology Corp.), by construction of Debye plots (KC/Rθ versus sin2[θ/2]) at one second data intervals. The weight-averaged molar mass was then calculated at each point of the chromatographic trace from the Debye plot intercept, and an overall average molar mass was calculated by weighted averaging across the peak.

Sedimentation Velocity Analytical Ultracentrifugation (SV-AUC)

SV-AUC experiments were performed at 20°C with an Optima AUC analytical ultracentrifuge (Beckman Coulter) and a TiAn50 rotor with two-channel epon charcoal-filled centrepieces and sapphire windows. In SV-AUC analysis of the full-length fbRep in different nucleotide states, 400 μL of 8.7 μM fbRep protein samples (A280: 0.6, 0.35 mg/ml) with and without 5 mM MgCl2 and 5 mM ATPγS (or ATP, or ADP) were prepared in a buffer containing 20 mM HEPES-NaOH pH 7.5, 300 mM NaCl and 0.1 mM TCEP. SV-AUC analyses of fbRep at different salt concentration were performed in a buffer containing 20 mM HEPES-NaOH pH 7.5, 1 mM DTT and salt concentration varying from 250 mM to 1 M. Sedimentation velocity profiles were collected every 30 s for 200 boundaries at 40,000 rpm. Data were fit using the Lamm equation model, as implemented in the program SEDFIT [44]. After optimizing meniscus position and fitting limits, the sedimentation coefficients (s) and best-fit frictional ratio (f/f0) were determined by iterative least squares analysis. Sedimentation coefficients were corrected to S20,w based on the calculated solvent density (ρ) and viscosity (η) derived from chemical composition by the program SEDNTERP [45]. Figures were prepared using the program GUSSI [46].

Size-exclusion chromatography in line with synchrotron small-angle X-ray scattering (SEC-SAXS)

SEC-SAXS data were collected at beamline 16-ID (LiX) of the National Synchrotron Light Source (NSLS) II at Brookhaven National Laboratory (Upton, NY) [47] and the BioSAXS beamline ID7A1 at Cornell MacCHESS [48] using a Superdex 200 5/100 column (GE Healthcare). At both beamlines, the BioXTAS RAW software (Version 2.1.1) [49] was used for data processing and preliminary analysis. All measurements were performed in 20 mM HEPES-NaOH pH 7.5, 300 mM NaCl, 0.1 mM TCEP with or without 1 mM MgCl2 and 1mM nucleotide (or ATP or ADP). Samples were injected at 3–5 mg/mL of fbRep protein in 100 µL volumes at room temperature. Radiation damage was not observed in the samples. A summary of the SAXS parameters is given in Table 1.

At beamline 16-ID, data were collected at a wavelength of 1.0 Å in a three-camera conformation, yielding accessible scattering angle with 0.005 < q < 3.0 Å-1, where q is the momentum transfer, defined as q = 4π sin(θ)/ λ, where λ is the X-ray wavelength and 2θ is the scattering angle. The data to q < 0.5 Å-1 were used in subsequent analyses. Samples were loaded into a 1-mm capillary for ten 1-s X-ray exposures. At ID7A1, data were collected at a wavelength of 1.1 Å in with an Eiger 4.0 detector, yielding accessible scattering angle with 0.008 < q < 0.67 Å-1. A series of 707 images for fbRep only, 500 images each for ADP-bound and ATP-bound fbRep were collected, with 2-s exposures per image.

SAXS analysis

SVD-EFA analyses of the SEC-SAXS datasets were performed as implemented in the RAW program [31]. Buffer subtracted profiles were analysed by singular value decomposition (SVD) and the ranges of overlapping peak data were determined using evolving factor analysis (EFA). The determined peak windows were used to identify the basis vectors for each component and the corresponding SAXS profiles were calculated. When fitting manually, the maximum diameter of the particle (Dmax) was incrementally adjusted in GNOM [50] to maximize the goodness-of-fit parameter, to minimize the discrepancy between the fit and the experimental data, and to optimize the visual qualities of the distribution profile. DENSS [51] was used to calculate the ab initio electron density directly from GNOM outputs, as implemented in the program RAW. For each nucleotide state, twenty reconstructions of electron density were performed in the slow mode with default parameters and subsequently averaged and refined with or without C6 symmetry restraints. Reconstructions were visualized using either UCSF ChimeraX version 1.9 [52] or PyMOL version 3.1 (Schrodinger, LLC, New York, NY, USA).

Hybrid bead-atomistic modeling of hexameric fbRep was performed using the program CORAL [53], where the known structure was fixed in composition and inventory not resolved by X-ray crystallography was modeled as coarse-grain beads. Ten independent calculations for each protein were performed and yielded comparable results. The final models were assessed using the program FoxS [54]. The models were rendered using the program PYMOL.

Mass photometry

Mass photometry experiments were conducted using a Refeyn TwoMP instrument (Refeyn Ltd., Littlemore, Oxford, UK). Prior to the experiment, a standard curve relating particle contrasts to molecular weight was constructed with molecular weight standards made with 3 nM thyroglobulin and 10 nM β-amylase. Samples at 20 μL volumes were evaluated in reaction chambers formed by placing silicone Sample Well Cassettes with 6-3.0 mm diameter, 1.0 mm depth, and 11mm × 15 mm OD onto MassGlass UC coverslips (Refeyn Ltd.). Mass photometry signals were recorded for 60 seconds at room temperature. Data acquisition and analysis were performed using AcquireMP and DiscoverMP (Refeyn Ltd., version 2022 R1). Experiments using 100 nM purified fbRep were conducted in a buffer consisting of 20 mM HEPES-NaOH pH 7.5, 300 mM NaCl and 0.1 mM TCEP. Additionally, experiments were performed in the presence or absence of 5 mM MgCl2 and 5 mM ATPγS or ADP.

Cryo-EM sample preparation and data collection

ATPγS-bound and ADP-bound fbRep samples were prepared by incubating 5 μM fbRep protein with 5 mM MgCl2 and 5 mM ATPγS (or ADP) in 20 mM HEPES-NaOH pH 7.5, 300mM NaCl, 0.1 mM TCEP and 2.5% glycerol for 30 minutes on ice. Quantifoil 1.2/1.3 300 mesh Cu (Electron Microscopy Sciences) grids were glow-discharged for 30 seconds at 10 mA using a PELCO easiGlow glow-discharge system. For grid samples preparation, a 3 μl protein sample was spotted, blotted for 6s at 4°C with 0 blot force in 100% humidity and plunged frozen into liquid ethane using the Vitrobot Mark IV (Thermo Fisher Scientific). Grids were then stored in liquid nitrogen until screening and data collection. Initial screening and data collection were conducted at the Beckman Center for Cryo-Electron Microscopy at the University of Pennsylvania Perelman School of Medicine (RRID: SCR_022375) on Titan Krios G3i 300 kV electron microscope equipped with a Gatan K3 direct electron detector. For the ATPγS-bound fbRep, 5605 movies were collected with EPU (Thermo Fisher Scientific) using aberration-free image shift (AFIS) strategy at 105,000X magnification with a pixel size of 0.42 Å/pixel. For the ADP-bound fbRep, 3790 movies were collected with EPU using AFIS strategy at 105,000X magnification with a pixel size of 0.43 Å/pixel. Movies were collected as 35 frames in super resolution mode with a defocus range from -1.0 to -3.0 μm and a total exposure of 41.3 e-2 for ATPγS-bound fbRep dataset and 42.2 e-2 for ADP-bound fbRep dataset.

Cryo-EM image processing

ATPγS-bound fbRep.

5605 movies were motion corrected with binning by a factor of 2 to 0.84 Å/pix using Relion v5.0-beta [55,56] implementation of MotionCor2-like algorithm [57] followed by estimation of contrast transfer function (CTF) parameters using CTFFIND-4.1.14 [58]. The motion corrected micrographs were then imported into CryoSPARC (v4.5.3) [59] and the CTF parameters were estimated using Patch CTF estimation. Micrographs were manually curated to remove low-quality micrographs, and the remaining 5544 micrographs were denoised using Topaz denoising [60] tool prior to particle picking. Initial sets of particles were blob picked from 200 denoised micrographs to obtain templates for full dataset particle picking and initial models for heterogeneous refinement. Initial 98,258 particles were binned by 4 to a box size of 72 pixels during extraction and subjected to a round of 2D classification. Selected 2D classes were re-extracted with binning by 2 to a box size of 144 pixels and subjected to another round of 2D classification to remove obvious junk particles. The remaining 54,440 particles were used in reconstruction of ab initio models (N = 4), generating two junk models and an intact hexameric and pentameric models. Four decoy models reconstructed from 90 particles were also obtained from the remaining particle stack using ab initio reconstruction. Both hexameric and pentameric models were then aligned to C6 symmetry axis and subjected to non-uniform refinement [61]. Well-resolved 2D classes from the hexameric and pentameric particle datasets were then used as templates for full dataset template-based particle picking.

5,286,720 template-based picked particles were extracted with binning by 4 at a box size of 72 pixels. The extracted particles were subjected to a round of heterogeneous refinement using a hexamer, a pentamer, and four decoy models resulting into classification of 1,454,002 and 817,824 particles into the hexamer and pentamer models, respectively. The hexameric and pentameric particle subsets were then separately subjected to three rounds of heterogeneous refinement using the same initial models. In each round of heterogeneous refinement for the hexameric and pentameric particle subsets, the particles that were classified into the hexameric or pentameric model were combined and used as the input particles for the next round of heterogeneous refinement while all particles classified into the decoy models were discarded. After the third round of heterogeneous refinement, the combined 1,446,088 hexameric and pentameric particles were extracted with binning by a factor of 2 at a box size of 144 pixels and subjected to several rounds of heterogeneous refinement using the same strategy as previously done until less than 2% of particles are classified into decoy models. The hexameric and pentameric particles from the last rounds of heterogeneous refinement were further sorted with 2D classifications, keeping only particles from well-resolved 2D classes. The hexameric and the pentameric particles from selected 2D class averages were separately subjected to extra rounds of heterogeneous refinement using hexameric or pentameric models with four decoys to ensure clean particle stacks. The final hexameric and pentameric particles were classified into 2D classes to visually assess particle orientations and remove any remaining particles sorted into poorly resolved classes. The 963,513 hexameric and 202,611 pentameric particle stacks were extracted without binning, refined using non-uniform refinement, and exported to Relion for further analysis.

In Relion, the final particle stacks were reextracted without binning to a box size of 288 pixels and then subjected to 3D auto-refinement using the volume and mask from non-uniform refinements performed in CryoSPARC. After post-processing, particles were subjected to CTF refinements followed by Bayesian polishing. After 3D auto-refinement and post processing, the particles were subjected to another round of CTF refinements and Bayesian polishing. 3D refinement of polished hexameric and pentameric particles generated 2.35 Å and 3.22 Å resolution maps, respectively.

To analyze for structural heterogeneity in the data, the polished hexameric particles were subjected to unmasked 3D classification with exhaustive angular searches at a sampling rate of 7.5 degrees (T = 4, K = 6). 3D classification generated a junk class of 418 particles, a stacked double hexamer of 5,955 particles, a poorly aligned hexamer of 220,094 particles, a well-aligned hexamer with low-resolution features consisting of 75,338 particles, and two well-aligned hexamers with high-resolution features consisting of 314,908 and 346,800 particles. The stacked double hexamer of 5,955 particles was used for further analysis of double hexameric particles in the dataset. 3D refinement of the well-aligned hexamer of 75,338 particles generated a 3.18 Å resolution map. The other two highly similar well-aligned hexamers were combined and subjected to a 3D refinement. Refinement of the combined hexamers with 661,708 particles generated a final map of 2.36 Å resolution. Local resolution of all final reconstructed maps was estimated in CryoSPARC at 0.143 threshold.

ATPγS-bound Rep double hexamer.

The stacked double hexamer from unmasked 3D classification with exhaustive angular searches refined to a resolution of 3.78 Å with C1 symmetry and 3.27 Å with C6 symmetry. The dataset used in the reconstruction of the double hexamer map consists only of 5,955 particles. To determine if there are more double hexameric particles present in the micrographs, the double hexameric particles were picked from 5544 micrographs using Topaz picking algorithm in CryoSPARC [62]. First, the 5955 particles were exported to CryoSPARC, manually re-picked, extracted with binning by 4 at a box size of 96 pixels, and then used to generate a Topaz picking model. Topaz picking on 5544 micrographs obtained 138,554 particles which were then extracted with binning by 4 a box size of 100 pixels and then subjected to a 2D classification. Well-resolved 2D class averages consisting of 29,474 double hexameric particles were then used to generate an ab initio model. The model was aligned to C6 symmetry axis and subjected to non-uniform refinement. A round of heterogeneous refinement using the refined model and three decoy models was done to remove junk particles. The remaining 28,462 particles were extracted without binning to a box size of 440 pixels and were used to generate an ab initio model. The ab initio model was then aligned to C6 symmetry axis and subjected to a non-uniform refinement, generating a 3.71 Å resolution map. To obtain a higher resolution reconstruction, the particles were then exported to Relion for further processing.

In Relion, the double hexameric particles were re-extracted and subjected to 3D auto-refinement using the volume and mask generated from non-uniform refinement in CryoSPARC. After post-processing, the particles were subjected to CTF refinements followed by Bayesian polishing. 3D refinement of the polished particles improved the resolution to 3.52 Å. To identify inconsistent features in the refined particle dataset, the particles were subjected to masked 3D classification without image alignment (T = 4, K = 6), resulting to a removal of 116 particles. Refinement of the remaining 28,346 particles improved the resolution to 3.42 Å. The final particles were then subjected to another round of CTF refinements followed by Bayesian polishing. 3D auto-refinement of the polished particles with C1 symmetry generated a final model with an overall resolution of 3.32 Å. Imposing C6 symmetry on 3D auto-refinement improved the resolution of the map to 2.95 Å. Local resolution of all final reconstructed maps was estimated in CryoSPARC at 0.143 threshold.

ADP-bound Rep.

Data analysis for 3790 movies was also done in Relion v5.0-beta and CryoSPARC v4.5.3. Micrographs were motion corrected and CTF estimated in the same way as in ATPγS-bound fbRep dataset. Manual curation of 3790 patch CTF-estimated micrographs eliminated 150 suboptimal micrographs, leaving 3640 micrographs for data analysis. Initial 221,667 blob-picked particles from 500 denoised micrographs were extracted with binning by 4 and subjected to a round of 2D classification. Well-resolved 2D classes consisting of 31,160 particles were re-extracted with binning by 2 and subjected to a round of 2D classification. Well-resolved 2D classes consisting of 26,788 particles were then used as templates in template-based picking of particles from whole dataset. 4,802,310 template-picked particles were extracted with binning by 4 at a box size of 72 pixels and subjected to a round of 2D classification. Selected 2D classes consisting of 341,693 particles were then re-extracted with binning by 2 at a box size of 144 pixels and subjected to another round of 2D classification. Selected side and tilted views were subjected to a round of 2D classification, generating well-resolved side and tilted views of 3,093 particles that were used to train a Topaz picking model. 250,032 Topaz picked particles were extracted by binning by factor of 2 at a box size of 144 pixels and subjected to two rounds of 2D classification. 2D class averages of side and tilted views consisting of 8,144 particles were selected for another round of Topaz training. 195,026 Topaz-picked particles were extracted with binning by 2 at a box size of 144 pixels and subjected to a round of 2D classification. Well-resolved 2D classes of tilted and side views of 101,013 particles were then selected and combined with the 341,693 particles. An initial model consisting of 30,000 particles from the combined particle sets were obtained and refined using nonuniform refinement with symmetry relaxation using maximization method. To remove junk particles, several rounds of hetero refinement were performed using the nonuniform refined model and five decoy models. 417,686 particles from the good class were re-extracted without binning at a box size of 288 pixels and subjected to a round of hetero refinement with decoys. 417,679 particles from the good class were then subjected to a round of 2D classification. Well-resolved 2D classes consisting of 342,764 were then selected and used to obtain an ab initio model. Nonuniform refinement using the ab initio model aligned to C6 symmetry axis generated a 3.59 Å resolution map. To obtain an improved reconstruction, the clean particle stack was exported to Relion for particle polishing and further analysis.

In Relion, the clean particle stack was re-extracted and subjected to a 3D auto-refinement and post processing. After post processing, the particles were then subjected to CTF refinements followed by Bayesian polishing. After 3D auto-refinement, the polished particles were subjected to a masked 3D classification without image alignment (T = 4, K = 6), resulting to one good class consisting of 312,272 particles and two bad classes of 12,665 and 17,827 particles. 3D auto-refinement of the 312,272 particles resulted to a 3.39 Å resolution map which were then subjected to CTF refinements followed by Bayesian polishing. 3D auto-refinement of the polished particles at C1 symmetry resulted to a final map with an overall resolution of 3.20 Å. Local resolution of the final reconstructed map was estimated in CryoSPARC at 0.143 threshold.

Cryo-EM model building and refinement

Prior to model building, the final refined maps were first sharpened with phenix.auto_sharpen [63] implemented in Phenix software package version 1.21-5207-000 [64] using b_iso_to_d_cut sharpening method. Molecular models were built using UCSF ChimeraX version 1.9 and Coot version 0.9.8.96 EL [65] and refined with Phenix version 1.21-5207-000. For modeling of 2.36 Å fbRep•ATPγS hexamer, the hexameric model of fbRep generated using AlphaFold Multimer [66] hosted on COSMIC2 cloud platform [67] was used to build the oligomerization and ATPase domains. Initially, the truncated AlphaFold multimer model (120–349) was fitted into the map in ChimeraX. The fitted model was manually adjusted to the map in COOT, truncated to 121–327 amino acids, and then refined with phenix.real_space_refine [68]. Iterative model building and refinements were done with ligands and water molecules manually added and refined during the final cycles of refinement. The models for 3.18 Å fbRep•ATPγS hexamer and 3.20 Å fbRep•ADP hexamer maps were built and refined as previously described but using the refined 2.36 Å fbRep•ATPγS hexamer without ligands as the initial model. The stacked double hexamer models were initially built using two refined fbRep•ATPγS hexamer models without ligands into the map using ChimeraX. The endonuclease domains were then built into the model residue by residue in Coot. Several cycles of model building and refinement were performed with ligands added during the final cycles of refinement. All refined models were validated against cryo-EM maps using phenix.validation_cryoem [69] and EMRinger [70] implemented in Phenix. A summary of cryo-EM data collection, refinement, and validation statistics can be found in Table 2.

Anisotropy analysis

We analyzed the cryo-EM maps of fbRep•ATPγS and fbRep•ADP hexamers for directional anisotropy using FSO [71] and 3DFSC [72] algorithms (S5E, S5F, S8E, and S8F Figs). FSO analyses indicated that each dataset contains intrinsically high-quality signal but is influenced by preferred particle orientation. The global resolution of the fbRep•ATPγS map is estimated at 2.22 Å and the fbRep•ADP map at 3.16 Å. 3DFSC analyses also estimated high global resolutions for fbRep•ATPγS and fbRep•ADP hexamers with similar directional anisotropy observed.

X-ray crystallography

Crystallization trials were performed using the hanging drop method by equilibrating a 1:1 mixture of 200 μM fbRep-ED prepared in 20 mM Tris-HCl pH 7.4, 100mM NaCl, 300 μM MgCl2, 0.1 mM TCEP and crystallization buffer against the crystallization buffer containing 100 mM ammonium malonate pH 5.8, 200 mM potassium fluoride, 15% w/v polyethylene glycol 4000, 15% v/v ethylene glycol. Crystals appeared within a week following incubation at 21°C, and the selected crystals were harvested and stored in liquid nitrogen for data collection.

Diffraction data were collected at 100oK and wavelength of 0.92 Å on beamline 17-ID-1 (AMX) at NSLSII (Brookhaven National Laboratory, Upton, NY). Images were processed using the DIALS software package (Diffraction Integration for Advanced Light Sources) [73]. Molecular replacement was carried out using Phaser-MR [74] as implemented in Phenix. The structure was solved using an AlphaFold [75] model of the Rep endonuclease domain 2–114 as a search model. The model was iteratively refined in reciprocal space with Phenix and in real space with COOT. Crystallographic data collection and refinement statistics can be found in S1 Table.

Structural analysis

Pore channel profiles of ATPγS-bound and ADP-bound fbRep were generated using MOLEonline [76] with cavity parameters of 45 Å probe radius and 3 Å interior threshold, channel parameters of 10 Å origin radius and 20 Å surface cover radius, 1.2 Å bottleneck radius, 3 Å bottleneck tolerance, 1.0 max tunnel similarity, Voronoi scale weight function, starting point at residue 146, and end point at residue 276. Analysis of hydrogen bonding and hydrophobic contacts were performed in Ligplot+ software v2.2 [77] with default parameters. Magnesium ion contacts were calculated in UCSF ChimeraX with center-center distance parameter value of less than or equal 3.10 Å. Structural alignments were also done in UCSF ChimeraX using the matchmaker tool with default parameters. The helical parameters of fbRep and other SF3 family helicases were calculated using locally written programs.

Sequence alignments

Vientovirus Rep sequence alignments were performed using Clustal Omega [78] and rendered using ESPript 3.2 [79].

Map and model visualization

Maps and models were visualized in UCSF ChimeraX version 1.9. Model illustrations were prepared using UCSF ChimeraX version 1.9 or PyMOL version 3.1.

Supporting information

S1 Table. Crystallographic data collection and refinement statistics for fbRep-ED 2–114.

https://doi.org/10.1371/journal.ppat.1013997.s001

(PDF)

S1 Fig. Effects of ionic strength on the oligomeric state of recombinant fbRep.

A&B. Sedimentation Velocity Analytical Ultracentrifugation (SV-AUC) analysis of purified fbRep in 1 M NaCl (A) or 200–400 mM NaCl (B). Shown are c(S) distributions (black lines). In 1 M NaCl (A), a 6.3S species is most prominent, whereas particles with S values ranging from 8.7-9.5 S are observed, changing in response to increasing ionic strength (B). Additional higher order species between 11-16S are also detected in smaller quantities. C&D. Size-exclusion chromatography inline with multi-angle light scattering (SEC-MALS) analysis of purified fbRep in 1 M NaCl (C) or 300 mM NaCl (D). Shown in solid black lines are the SEC elution profiles for fbRep, with absolute weight-averaged molecular mass (Mw, circles), determined by light scattering plotted across the profiles. Dashed horizontal lines denote the masses for pentamers and hexamers, respectively. In 1M NaCl (C), masses most consistent with pentamers are observed (210 kDa ± 1.1), whereas in 300 mM NaCl (D), masses larger than hexamer are observed (260 kDa ± 6.7).

https://doi.org/10.1371/journal.ppat.1013997.s002

(TIF)

S2 Fig. Size-exclusion chromatography inline with synchrotron small-angle X-ray scattering (SEC-SAXS).

A. Experimental scheme for synchrotron SEC-SAXS analysis is shown. See Methods for more details. B-D. Lines show integrated mean intensities of X-ray small-angle scattering vs frames recorded during elution of fbRep from SEC in its apo form (B), and with 1 mM ATP (C), or with 1 mM ADP (D) present in the running buffer. Circles show the derived radii of gyration (Rg) from the background subtracted X-ray scattered profiles. E. Singular value decomposition (SVD) analysis of the exposures denoted by pink shading in panels B-D. In each of the three samples tested, the peak region data was best described by three significant species, comprised of dodecamers, hexamers, and aggregates or buffer artifacts. F. SAXS profiles for hexamers and dodecamers derived from SVD-EFA analysis of SEC-SAXS data for fbRep protein, scaled arbitrarily along the y-axis and shown as a log-log plot. G&H. Shape distribution (P(r)) function analysis for hexamers (G) and dodecamers (H), performed using the program GNOM. Parameters derived from this analysis are provided in Table 1. In the respective lower panels differences in P(r) are shown, illustrating the reorganization of interatomic vectors in response to nucleotide binding of the respective oligomer. I&J. Shown is DENSS analysis of the synchrotron SAXS data. In orthogonal views for each of the hexamers and dodecamers isolated by SVD-EFA SAXS are ab initio electron density reconstructions, docked with the corresponding experimental hexamer models determined (I) or with head-to-tail dodecamer models (J). Asterisks in Panel I denote spatial discrepancies between the experimental SAXS volumes and where N- terminal endonuclease domains would be expected to reside. Electron density is colored with five contour levels of density rendered with these respective colors: 15σ (red), 10σ (green), 5σ (cyan), 2.5σ (blue), and -0.7σ (blue). The sigma (σ) level denotes the standard deviation above the average electron density value of the generated volume. DENSS figures were generated using the program PyMOL.

https://doi.org/10.1371/journal.ppat.1013997.s003

(TIF)

S3 Fig. Guinier plot analyses.

lnI(q) vs. q2 of SAXS data (dots) for fbRep oligomers, with residuals from the fitted lines shown below. Monodispersity is evidenced by linearity in the Guinier region of the scattering data and agreement of the I0 and Rg values determined with inverse Fourier transform analysis by the program GNOM (Table 1). Guinier analyses were performed where qRg ≤ 1.4.

https://doi.org/10.1371/journal.ppat.1013997.s004

(TIF)

S4 Fig. Cryo-EM data processing workflow of the fbRep•ATPγS dataset.

Schematic diagram of data collection, initial data processing, classifications, and refinements used to obtain the final maps (see Methods for details). The orthogonal views of the final maps are shown in black boxes. Resolutions associated with the final maps were calculated in CryoSPARC at FSC threshold of 0.143.

https://doi.org/10.1371/journal.ppat.1013997.s005

(TIF)

S5 Fig. Analysis of cryo-EM data quality of fbRep•ATPγS hexamers.

A. Orthogonal views of 2.36 Å fbRep•ATPγS hexamer and 3.18 Å fbRep•ATPγS hexamer with ED, colored based on estimated local resolution as calculated in CryoSPARC. B. Angular distribution of particles in the final refinement of fbRep•ATPγS hexamer maps. C. Map FSC curves at 0.143 threshold. D. Representative sidechain densities of 2.36 Å fbRep•ATPγS chain A and ATPγS binding site contoured at 4.6σ. Densities for each chain and ATPγS binding site are nearly identical. E, F) Analysis of directional anisotropy in fbRep•ATPγS. E) Calculated Fourier shell occupancy (FSO) values (blue solid line) and Bingham curves (red dashed line) using FSO algorithm. The anisotropy transition zone is indicated by the shaded green region which ranges from FSO 0.9 to 0.1. F) Histogram and directional Fourier Shell Correlation (FSC) curves obtained using 3DFSC algorithm implemented in the remote 3DFSC processing server [72].

https://doi.org/10.1371/journal.ppat.1013997.s006

(TIF)

S6 Fig. Ligplot+ diagrams of hydrogen bonding and hydrophobic interactions of ATPγS.

Hydrogen bonding and hydrophobic interactions of ATPγS in 2.36 Å fbRep•ATPγS hexamer model were calculated in Ligplot+ v2.2 with default parameters. ATPγS and protein side chains are shown in ball-and-stick representation, with ATPγS colored in yellow. Hydrogen bonds are shown as red dashed line while hydrophobic contacts are represented by red spoked arcs. Magnesium ion coordination bonds and its length, calculated in UCSF ChimeraX based on center-center distance parameter value of ≤ 3.10, are also shown and represented as green dashed line.

https://doi.org/10.1371/journal.ppat.1013997.s007

(TIF)

S7 Fig. Cryo-EM data processing workflow of the fbRep•ADP dataset.

Schematic diagram of data collection, initial data processing, classifications, and refinements used to obtain the final map (see Methods for details). Resolution associated with the final map was calculated in CryoSPARC at FSC threshold of 0.143.

https://doi.org/10.1371/journal.ppat.1013997.s008

(TIF)

S8 Fig. Analysis of cryo-EM data quality of fbRep•ADP hexamer.

A. Orthogonal views of 3.20 Å fbRep•ADP hexamer map, colored based on estimated local resolution as calculated in CryoSPARC. B. Angular distribution of particles in the final refinement of fbRep•ADP hexamer map. C. Map FSC curve at 0.143 threshold. D. Representative sidechain densities of 3.20 Å fbRep•ADP chain C and ADP binding site contoured at 5.8σ. Densities for each chain and ADP have subtle differences. E, F) Analysis of directional anisotropy in fbRep•ADP hexamer. E) Calculated Fourier shell occupancy (FSO) values (blue solid line) and Bingham curves (red dashed line) using FSO algorithm. The anisotropy transition zone is indicated by the shaded green region which ranges from FSO 0.9 to 0.1. F) Histogram and directional Fourier Shell Correlation (FSC) curves obtained using 3DFSC algorithm implemented in the remote 3DFSC processing server.

https://doi.org/10.1371/journal.ppat.1013997.s009

(TIF)

S9 Fig. Superposition of ATPγS and ADP-bound fbRep.

A) Superposition of the hexamers. The oligomerization domains are nearly identical but the helicase domains differ in conformation. B) Superposition of the oligomerization domains of individual subunits, showing the changes that occur in the helicase domains.

https://doi.org/10.1371/journal.ppat.1013997.s010

(TIF)

S10 Fig. Ligplot+ diagrams of hydrogen bonding and hydrophobic interactions of ADP.

Hydrogen bonding and hydrophobic interactions of ADP in 3.20 Å fbRep•ADP hexamer model were calculated in Ligplot+ v2.2 with default parameters. ADP and protein side chains are shown in ball-and-stick representation, with ADP colored in lime green. Hydrogen bonds are shown as red dashed line while hydrophobic contacts are represented by red spoked arcs. Magnesium ion coordination bonds and its length, calculated in UCSF ChimeraX based on center-center distance parameter value of ≤ 3.10, are also shown and represented as green dashed line.

https://doi.org/10.1371/journal.ppat.1013997.s011

(TIF)

S11 Fig. Cryo-EM data processing workflow of the fbRep•ATPγS double hexamer.

Schematic diagram of data reprocessing, classifications, and refinements used to obtain the C1 and C6-symmetrized final maps (see Methods for details). The orthogonal views of the C1 and C6-symmetrized final maps are shown. Resolutions associated with the final maps were calculated in CryoSPARC at FSC threshold of 0.143.

https://doi.org/10.1371/journal.ppat.1013997.s012

(TIF)

S12 Fig. Analysis of cryo-EM data quality of fbRep•ATPγS double hexamer.

A. Orthogonal views of 3.32 Å fbRep•ATPγS double hexamer (C1 symmetry) and 2.95 Å fbRep•ATPγS double hexamer (C6 symmetry), colored based on estimated local resolution as calculated in CryoSPARC. B. Angular distribution of particles in the final refinement of fbRep•ATPγS double hexamer maps. C. Map FSC curves at 0.143 threshold. D. Representative sidechain densities of 3.32 Å fbRep•ATPγS double hexamer chain E and ATPγS binding site contoured at 5.4σ. Densities for each chain and ATPγS binding site are nearly identical.

https://doi.org/10.1371/journal.ppat.1013997.s013

(TIF)

S13 Fig. Structure of fbRep endonuclease domain.

A. X-ray crystal structure of fbRep endonuclease domain 2–114 (fbRep-ED) with Mg2 + ion resolved at 1.80 Å resolution. The endonuclease domain contains a central five-stranded anti-parallel β-sheet surrounded by α-helices and disordered loops. The α3 helix contains the catalytic tyrosine at position 99 with its side chain pointing towards the metal coordination site. Magnesium is coordinated by Glu48 (β3), His57 (β4), His59 (β4) and a malonate ion (MLI). The coordination of Mg2+ by malonate ion likely represents a key interaction of ssDNA phosphate backbone during catalysis. B. Superposition of X-ray crystal structure of fbRep-ED and cryo-EM structure of fbRep1 EDs. Structural alignments of the EDs (2–109) were done in UCSF ChimeraX with matchmaker tool using default parameters. The overall Cα r.m.s.d of 108 paired atoms are indicated.

https://doi.org/10.1371/journal.ppat.1013997.s014

(TIF)

S14 Fig. Alignment of a subset of known vientovirus Rep protein sequences.

Sequences from 13 members of Vientovirus species were obtained from GenBank with following accession numbers: QCD25311.1 (Vientovirus FB), QCD25302.1 (Vientovirus AL), QCD25326.1 (Vientovirus EC), QCD25317.1 (Vientovirus JY), QCD25308.1 (Vientovirus ES), QCD25314.1 (Vientovirus JB), QCD25329.1 (Vientovirus LZ), QCD25338.1 (Vientovirus MW), QCD25305.1 (Vientovirus DC), QCD25335.1 (Vientovirus XM), QCD25323.1 (Vientovirus AV), QCD25320.1 (Vientovirus LT), and QCD25332.1 (Vientovirus MC). The secondary structure of the full-length Vientovirus FB Rep, obtained from the cryo-EM model of Rep double hexamer, is indicated above the alignment. The residues in each column are colored according to the percentage of equivalent residues calculated based on their physico-chemical properties. Consensus sequence with a global score greater than 70 is displayed at the bottom of the alignment.

https://doi.org/10.1371/journal.ppat.1013997.s015

(TIF)

S15 Fig. CORAL modeling.

CORAL analysis [53] of Rep hexamers in the presence and absence of ADP. CORAL employs a rigid body approach to optimize atomistic models against experimental SAXS data. In this approach, missing and flexible atomic inventory are represented as beads in coarse grain fashion and flexibly fit. Shown in A are the experimental SAXS data for RepADP, and in B the experimental SAXS data for apo Rep (see Table 1). In both plots, gray circles represent the experimental data in a log–log plot, where intensity I is plotted as a function of q. In this analysis, two hexamer models containing the N-terminal endonuclease domains (ED) were initially considered: the experimental hexamer determined in these studies (ED domains colored purple, C), and an Alphafold3 (AF3) [80] calculated model (ED domains colored black, D). In the ADP state (A), the experimental hexamer (purple dotted lines) showed discrepancies with the experimental data between 0.05 < q < 0.1 (χFoxS2 = 2.1 vs ADP, 4.8 vs APO), and similar discrepancy was apparent with the AF3 derived model (black dotted line, χFoxS2 = 2.9). When missing atomic inventory (residue 323–354) and the N-terminal domains were modelled using the CORAL approach, this discrepancy is relieved (representative fits shown in red, χFoxS2 = 1.31-1.34 for ADP-bound (n = 10) and χFoxS2 = 3.21-3.32 for apo (n = 10)) [54]. In all cases, random positioning of the EDs in proximal and distal configurations is observed. A gallery of representative calculations (n = 10) is shown in E for the ADP-bound form of hexameric Rep and F for apo Rep. Atomic inventory not represented in available crystallographic models is shown as beads and fit flexibly. Residues 120–323 (white) were fixed in these calculations. Application of minimal ensemble search (MES/MultiFoXS) [81] did not provide statistically improved fits relative to single-state models, indicating SAXS degeneracy and consistency with a continuum of ED conformations in solution. Figures were rendered using PYMOL.

https://doi.org/10.1371/journal.ppat.1013997.s016

(TIF)

Acknowledgments

We are grateful to members of the Moiseenkova-Bell, Gupta, Bushman and Van Duyne laboratories for help and suggestions. We acknowledge the use of instruments at the Electron Microscopy Resource Laboratory (EMRL) and at the Beckman Center for Cryo-Electron Microscopy at the University of Pennsylvania Perelman School of Medicine. We are also grateful to former and current staff members of EMRL and Beckman Center for Cryo-EM, especially to Sudheer Molugu for help and training in negative stain and cryo-EM grid sample freezing and Stefan Steimle for assistance in sample grid preparation and data collection on the Titan Krios. We also wish to extend our gratitude to Ronen Marmorstein laboratory especially to Mary Szurgot and Lauren Gardner for assistance with mass photometry data collection. K.G. acknowledges support from the Johnson Research Foundation and NIH Shared Instrumentation Grant S10-OD018483. The SV-AUC, SEC-MALS, and Mass Photometry experiments were performed at the Johnson Foundation Biophysical and Structural Biology Core Facility (University of Pennsylvania, Philadelphia PA). This work is based in part on research conducted at the Center for High-Energy X-ray Sciences (CHEXS), which is supported by the National Science Foundation (BIO, ENG and MPS Directorates) under award DMR-2342336, and the Macromolecular Diffraction at CHESS (MacCHESS) facility, which is supported by award 1-P30-GM124166 from the National Institute of General Medical Sciences, and the National Institutes of Health. The crystallographic data were obtained at the 17-ID-1 beamline, while additional SEC-SAXS data were obtained at 16-ID (LIX) at the National Synchrotron Light Source II, a U.S. Department of Energy (DOE) Office of Science User Facility operated for the DOE Office of Science by Brookhaven National Laboratory under Contract No. DE-SC0012704.

References

  1. 1. Krupovic M, Varsani A, Kazlauskas D, Breitbart M, Delwart E, Rosario K, et al. Cressdnaviricota: a Virus Phylum Unifying Seven Families of Rep-Encoding Viruses with Single-Stranded, Circular DNA Genomes. J Virol. 2020;94(12):e00582–20. pmid:32269128
  2. 2. Zhao L, Rosario K, Breitbart M, Duffy S. Chapter Three - Eukaryotic Circular Rep-Encoding Single-Stranded DNA (CRESS DNA) Viruses: Ubiquitous Viruses With Small Genomes and a Diverse Host Range. In: Kielian M, Mettenleiter TC, Roossinck MJ, editors. Advances in Virus Research. Academic Press; 2019. pp. 71–133.
  3. 3. Abbas AA, Taylor LJ, Dothard MI, Leiby JS, Fitzgerald AS, Khatib LA, et al. Redondoviridae, a family of small, circular DNA viruses of the human oro-respiratory tract associated with periodontitis and critical illness. Cell Host Microbe. 2019;25(5):719–729.e4. pmid:31071295
  4. 4. Cui L, Wu B, Zhu X, Guo X, Ge Y, Zhao K, et al. Identification and genetic characterization of a novel circular single-stranded DNA virus in a human upper respiratory tract sample. Arch Virol. 2017;162(11):3305–12. pmid:28707271
  5. 5. Spezia PG, Macera L, Mazzetti P, Curcio M, Biagini C, Sciandra I, et al. Redondovirus DNA in human respiratory samples. J Clin Virol. 2020;131:104586. pmid:32841923
  6. 6. Tu NTK, Deng X, Hong NTT, Ny NTH, Phuc TM, Tam PTT, et al. Redondoviridae: high prevalence and possibly chronic shedding in human respiratory tract, But no zoonotic transmission. Viruses. 2021;13(4):533. pmid:33804837
  7. 7. Taylor LJ, et al. Redondovirus diversity and evolution on global, individual, and molecular scales. J Virol. 2021;95:e00817-21.
  8. 8. Merenstein C, et al. Signatures of COVID-19 Severity and Immune Response in the Respiratory Tract Microbiome. mBio. 2021;12:e01777–21.
  9. 9. Zhang Y, Wang C, Feng X, Chen X, Zhang W. Redondoviridae and periodontitis: a case-control study and identification of five novel redondoviruses from periodontal tissues. Virus Evol. 2021;7(1):veab033. pmid:35186324
  10. 10. Charlys da Costa A, Mendes-Correa MC, Tozetto-Mendoza TR, Villas-Boas LS, de Paula AV, Paiao HGO, et al. Detailed characterization of Redondovirus in saliva of SARS-CoV-2-infected individuals in Sao Paulo, Brazil. PLoS One. 2023;18(8):e0291027. pmid:37651462
  11. 11. Mohebbi A, Kiani SJ, Khanaliha K, Donyavi T, Emtiazi N, Sharifian K, et al. Dental complications as a potential indicator of Redondovirus infection: a cross-sectional study. BMC Infect Dis. 2024;24(1):673. pmid:38969993
  12. 12. Fang S, Zhu L, Bai S, Tian W, Pan Y, Zhang S, et al. Year-round infectome profiling of acute febrile respiratory illness unveiled complex epidemiological dynamics postlifting of COVID-19 restrictions. Int J Infect Dis. 2025;155:107896. pmid:40164380
  13. 13. Keeler EL, Merenstein C, Reddy S, Taylor LJ, Cobián-Güemes AG, Zankharia U, et al. Widespread, human-associated redondoviruses infect the commensal protozoan Entamoeba gingivalis. Cell Host Microbe. 2023;31(1):58-68.e5. pmid:36459997
  14. 14. Makoa-Meng M, Semmar R, Antezack A, Penant G, La Scola B, Monnet-Corti V, et al. Correlation of redondovirus and entamoeba gingivalis detections in the human oral cavity suggests that this amoeba is possibly the redondovirus host. Int J Mol Sci. 2023;24(7):6303. pmid:37047275
  15. 15. Gutierrez C. Geminivirus DNA replication. Cell Mol Life Sci. 1999;56(3–4):313–29. pmid:11212359
  16. 16. Faurez F, Dory D, Grasland B, Jestin A. Replication of porcine circoviruses. Virol J. 2009;6:60. pmid:19450240
  17. 17. Ruiz-Masó JA, et al. Plasmid rolling-circle replication. Microbiol Spectrum. 2015;3.
  18. 18. Chandler M, de la Cruz F, Dyda F, Hickman AB, Moncalian G, Ton-Hoang B. Breaking and joining single-stranded DNA: the HUH endonuclease superfamily. Nat Rev Microbiol. 2013;11(8):525–38. pmid:23832240
  19. 19. Hickman AB, Dyda F. Binding and unwinding: SF3 viral helicases. Curr Opin Struct Biol. 2005;15(1):77–85. pmid:15718137
  20. 20. Tompkins KJ, Houtti M, Litzau LA, Aird EJ, Everett BA, Nelson AT, et al. Molecular underpinnings of ssDNA specificity by Rep HUH-endonucleases and implications for HUH-tag multiplexing and engineering. Nucleic Acids Res. 2021;49(2):1046–64. pmid:33410911
  21. 21. Smiley AT, Tompkins KJ, Pawlak MR, Krueger AJ, Evans RL 3rd, Shi K, et al. Watson-crick base-pairing requirements for ssDNA recognition and processing in replication-initiating HUH endonucleases. mBio. 2023;14(1):e0258722. pmid:36541758
  22. 22. Hickman AB, Ronning DR, Perez ZN, Kotin RM, Dyda F. The nuclease domain of adeno-associated virus rep coordinates replication initiation using two distinct DNA recognition interfaces. Mol Cell. 2004;13(3):403–14. pmid:14967147
  23. 23. Tarasova E, Dhindwal S, Popp M, Hussain S, Khayat R. Mechanism of DNA interaction and translocation by the replicase of a circular rep-encoding single-stranded DNA virus. mBio. 2021;12:e00763–21.
  24. 24. Li D, Zhao R, Lilyestrom W, Gai D, Zhang R, DeCaprio JA, et al. Structure of the replicative helicase of the oncoprotein SV40 large tumour antigen. Nature. 2003;423(6939):512–8. pmid:12774115
  25. 25. Enemark EJ, Joshua-Tor L. Mechanism of DNA translocation in a replicative hexameric helicase. Nature. 2006;442(7100):270–5. pmid:16855583
  26. 26. Shahid T, Danazumi AU, Tehseen M, Alhudhali L, Clark AR, Savva CG, et al. Structural dynamics of DNA unwinding by a replicative helicase. Nature. 2025;641(8061):240–9. pmid:40108462
  27. 27. Gai D, Zhao R, Li D, Finkielstein CV, Chen XS. Mechanisms of conformational change for a replicative hexameric helicase of SV40 large tumor antigen. Cell. 2004;119(1):47–60. pmid:15454080
  28. 28. Lee SJ, et al. Dynamic look at DNA unwinding by a replicative helicase. Proc Natl Acad Sci. 2014;111:E827–35.
  29. 29. Langston LD, Yuan Z, Georgescu R, Li H, O’Donnell ME. SV40 T-antigen uses a DNA shearing mechanism to initiate origin unwinding. Proc Natl Acad Sci U S A. 2022;119(49):e2216240119. pmid:36442086
  30. 30. Abbas A, Taylor LJ, Collman RG, Bushman FD, Ictv Report Consortium. ICTV virus taxonomy profile: redondoviridae. J Gen Virol. 2021;102(1):jgv001526. pmid:33258767
  31. 31. Meisburger SP, Xu D, Ando N. REGALS: a general method to deconvolve X-ray scattering data from evolving mixtures. IUCrJ. 2021;8(Pt 2):225–37. pmid:33708400
  32. 32. Singleton MR, Dillingham MS, Wigley DB. Structure and mechanism of helicases and nucleic acid translocases. Annu Rev Biochem. 2007;76:23–50. pmid:17506634
  33. 33. Cheung AK. Detection of template strand switching during initiation and termination of DNA replication of porcine circovirus. J Virol. 2004;78(8):4268–77. pmid:15047840
  34. 34. Mastrangelo IA, Hough PV, Wall JS, Dodson M, Dean FB, Hurwitz J. ATP-dependent assembly of double hexamers of SV40 T antigen at the viral origin of DNA replication. Nature. 1989;338(6217):658–62. pmid:2539565
  35. 35. Fouts ET, Yu X, Egelman EH, Botchan MR. Biochemical and electron microscopic image analysis of the hexameric E1 helicase. J Biol Chem. 1999;274(7):4447–58. pmid:9933649
  36. 36. Mansilla-Soto J, Yoon-Robarts M, Rice WJ, Arya S, Escalante CR, Linden RM. DNA structure modulates the oligomerization properties of the AAV initiator protein Rep68. PLoS Pathog. 2009;5(7):e1000513. pmid:19593381
  37. 37. Santosh V, Musayev FN, Jaiswal R, Zárate-Pérez F, Vandewinkel B, Dierckx C, et al. The Cryo-EM structure of AAV2 Rep68 in complex with ssDNA reveals a malleable AAA+ machine that can switch between oligomeric states. Nucleic Acids Res. 2020;48(22):12983–99. pmid:33270897
  38. 38. Jaiswal R, Braud B, Hernandez-Ramirez KC, Santosh V, Washington A, Escalante CR. Cryo-EM structure of AAV2 Rep68 bound to integration site AAVS1: insights into the mechanism of DNA melting. Nucleic Acids Res. 2025;53(3):gkaf033. pmid:39883011
  39. 39. Clérot D, Bernardi F. DNA helicase activity is associated with the replication initiator protein rep of tomato yellow leaf curl geminivirus. J Virol. 2006;80(22):11322–30. pmid:16943286
  40. 40. Choudhury NR, Malik PS, Singh DK, Islam MN, Kaliappan K, Mukherjee SK. The oligomeric Rep protein of Mungbean yellow mosaic India virus (MYMIV) is a likely replicative helicase. Nucleic Acids Res. 2006;34(21):6362–77. pmid:17142233
  41. 41. Tao L, et al. The replication protein of duck circovirus unwinds dsDNA in an ATP-driven and metal ion-dependent manner from 3′ to 5′. Front Vet Sci. 2025;12.
  42. 42. Wu J, Davis MD, Owens RA. Factors affecting the terminal resolution site endonuclease, helicase, and ATPase activities of adeno-associated virus type 2 Rep proteins. J Virol. 1999;73(10):8235–44. pmid:10482574
  43. 43. Cheung AK. Porcine circovirus: transcription and DNA replication. Virus Res. 2012;164(1–2):46–53. pmid:22036834
  44. 44. Schuck P. Size-distribution analysis of macromolecules by sedimentation velocity ultracentrifugation and lamm equation modeling. Biophys J. 2000;78(3):1606–19. pmid:10692345
  45. 45. Philo JS. SEDNTERP: a calculation and database utility to aid interpretation of analytical ultracentrifugation and light scattering data. Eur Biophys J. 2023;52(4–5):233–66. pmid:36792822
  46. 46. Brautigam CA. Chapter Five - Calculations and Publication-Quality Illustrations for Analytical Ultracentrifugation Data. In: Cole JL, editor. Methods in Enzymology. Academic Press; 2015. pp. 109–33.
  47. 47. Yang L, Antonelli S, Chodankar S, Byrnes J, Lazo E, Qian K. Solution scattering at the life science X-ray scattering (LiX) beamline. J Synchrotron Radiat. 2020;27(Pt 3):804–12. pmid:32381785
  48. 48. Miller RC, Cummings C, Huang Q, Ando N, Gillilan RE. Inline small-angle X-ray scattering-coupled chromatography under extreme hydrostatic pressure. Protein Sci. 2022;31(12):e4489. pmid:36320105
  49. 49. Hopkins JB. BioXTAS RAW 2: new developments for a free open-source program for small-angle scattering data reduction and analysis. J Appl Crystallogr. 2024;57(Pt 1):194–208. pmid:38322719
  50. 50. Svergun DI, Semenyuk AV, Feigin LA. Small-angle-scattering-data treatment by the regularization method. Acta Crystallogr A Found Crystallogr. 1988;44(3):244–50.
  51. 51. Sumner J, Qian S. DENSS-multiple: A structure reconstruction method using contrast variation of small-angle neutron scattering based on the DENSS algorithm. BBA Adv. 2022;2:100063. pmid:37082592
  52. 52. Meng EC, Goddard TD, Pettersen EF, Couch GS, Pearson ZJ, Morris JH, et al. UCSF ChimeraX: Tools for structure building and analysis. Protein Sci. 2023;32(11):e4792. pmid:37774136
  53. 53. Petoukhov MV, Franke D, Shkumatov AV, Tria G, Kikhney AG, Gajda M, et al. New developments in the ATSAS program package for small-angle scattering data analysis. J Appl Crystallogr. 2012;45(Pt 2):342–50. pmid:25484842
  54. 54. Schneidman-Duhovny D, Hammel M, Sali A. FoXS: a web server for rapid computation and fitting of SAXS profiles. Nucleic Acids Res. 2010;38(Web Server issue):W540–4. pmid:20507903
  55. 55. Scheres SHW. A Bayesian view on cryo-EM structure determination. J Mol Biol. 2012;415(2):406–18. pmid:22100448
  56. 56. Scheres SHW. RELION: implementation of a Bayesian approach to cryo-EM structure determination. J Struct Biol. 2012;180(3):519–30. pmid:23000701
  57. 57. Zivanov J, Nakane T, Forsberg BO, Kimanius D, Hagen WJ, Lindahl E, et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. Elife. 2018;7:e42166. pmid:30412051
  58. 58. Rohou A, Grigorieff N. CTFFIND4: Fast and accurate defocus estimation from electron micrographs. J Struct Biol. 2015;192(2):216–21. pmid:26278980
  59. 59. Punjani A, Rubinstein JL, Fleet DJ, Brubaker MA. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat Methods. 2017;14(3):290–6. pmid:28165473
  60. 60. Bepler T, Kelley K, Noble AJ, Berger B. Topaz-Denoise: general deep denoising models for cryoEM and cryoET. Nat Commun. 2020;11(1):5208. pmid:33060581
  61. 61. Punjani A, Zhang H, Fleet DJ. Non-uniform refinement: adaptive regularization improves single-particle cryo-EM reconstruction. Nat Methods. 2020;17(12):1214–21. pmid:33257830
  62. 62. Bepler T, Morin A, Rapp M, Brasch J, Shapiro L, Noble AJ, et al. Positive-unlabeled convolutional neural networks for particle picking in cryo-electron micrographs. Nat Methods. 2019;16(11):1153–60. pmid:31591578
  63. 63. Terwilliger TC, Sobolev OV, Afonine PV, Adams PD. Automated map sharpening by maximization of detail and connectivity. Acta Crystallogr D Struct Biol. 2018;74(Pt 6):545–59. pmid:29872005
  64. 64. Liebschner D, Afonine PV, Baker ML, Bunkóczi G, Chen VB, Croll TI, et al. Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in Phenix. Acta Crystallogr D Struct Biol. 2019;75(Pt 10):861–77. pmid:31588918
  65. 65. Casañal A, Lohkamp B, Emsley P. Current developments in Coot for macromolecular model building of Electron Cryo-microscopy and Crystallographic Data. Protein Sci. 2020;29(4):1069–78. pmid:31730249
  66. 66. Evans R, et al. Protein complex prediction with AlphaFold-Multimer. [Preprint]. 2022. [cited 2025 Aug 6]. Available from: https://www.biorxiv.org/content/10.1101/2021.10.04.463034v2
  67. 67. Cianfrocco MA, Wong-Barnum M, Youn C, Wagner R, Leschziner A. COSMIC2: A science gateway for cryo-electron microscopy structure determination in practice and experience in advanced research computing. In: Sustainability, success and impact, PEARC’17. Association for Computing Machinery; 2017. pp. 1–5.
  68. 68. Afonine PV, Poon BK, Read RJ, Sobolev OV, Terwilliger TC, Urzhumtsev A, et al. Real-space refinement in PHENIX for cryo-EM and crystallography. Acta Crystallogr D Struct Biol. 2018;74(Pt 6):531–44. pmid:29872004
  69. 69. Afonine PV, Klaholz BP, Moriarty NW, Poon BK, Sobolev OV, Terwilliger TC, et al. New tools for the analysis and validation of cryo-EM maps and atomic models. Acta Crystallogr D Struct Biol. 2018;74(Pt 9):814–40. pmid:30198894
  70. 70. Barad BA, Echols N, Wang RY-R, Cheng Y, DiMaio F, Adams PD, et al. EMRinger: side chain-directed model and map validation for 3D cryo-electron microscopy. Nat Methods. 2015;12(10):943–6. pmid:26280328
  71. 71. Vilas J-L, Tagare HD. New measures of anisotropy of cryo-EM maps. Nat Methods. 2023;20(7):1021–4. pmid:37248387
  72. 72. Tan YZ, Baldwin PR, Davis JH, Williamson JR, Potter CS, Carragher B, et al. Addressing preferred specimen orientation in single-particle cryo-EM through tilting. Nat Methods. 2017;14(8):793–6. pmid:28671674
  73. 73. Winter G, Waterman DG, Parkhurst JM, Brewster AS, Gildea RJ, Gerstel M, et al. DIALS: implementation and evaluation of a new integration package. Acta Crystallogr D Struct Biol. 2018;74(Pt 2):85–97. pmid:29533234
  74. 74. McCoy AJ, Grosse-Kunstleve RW, Adams PD, Winn MD, Storoni LC, Read RJ. Phaser crystallographic software. J Appl Crystallogr. 2007;40(Pt 4):658–74. pmid:19461840
  75. 75. Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596(7873):583–9. pmid:34265844
  76. 76. Berka K, Hanák O, Sehnal D, Banás P, Navrátilová V, Jaiswal D, et al. MOLEonline 2.0: interactive web-based analysis of biomacromolecular channels. Nucleic Acids Res. 2012;40(Web Server issue):W222–7. pmid:22553366
  77. 77. Laskowski RA, Swindells MB. LigPlot+: multiple ligand-protein interaction diagrams for drug discovery. J Chem Inf Model. 2011;51(10):2778–86. pmid:21919503
  78. 78. Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol. 2011;7:539. pmid:21988835
  79. 79. Gouet P, Robert X, Courcelle E. ESPript/ENDscript: extracting and rendering sequence and 3D information from atomic structures of proteins. Nucleic Acids Res. 2003;31(13):3320–3. pmid:12824317
  80. 80. Abramson J, Adler J, Dunger J, Evans R, Green T, Pritzel A, et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature. 2024;630(8016):493–500. pmid:38718835
  81. 81. Schneidman-Duhovny D, Hammel M, Tainer JA, Sali A. FoXS, FoXSDock and MultiFoXS: Single-state and multi-state structural modeling of proteins and their complexes based on SAXS profiles. Nucleic Acids Res. 2016;44(W1):W424–9. pmid:27151198