DNA binding proteins efficiently search for their cognitive sites on long genomic DNA by combining 3D diffusion and 1D diffusion (sliding) along the DNA. Recent experimental results and theoretical analyses revealed that the proteins show a rotation-coupled sliding along DNA helical pitch. Here, we performed Brownian dynamics simulations using newly developed coarse-grained protein and DNA models for evaluating how hydrodynamic interactions between the protein and DNA molecules, binding affinity of the protein to DNA, and DNA fluctuations affect the one dimensional diffusion of the protein on the DNA. Our results indicate that intermolecular hydrodynamic interactions reduce 1D diffusivity by 30%. On the other hand, structural fluctuations of DNA give rise to steric collisions between the CG-proteins and DNA, resulting in faster 1D sliding of the protein. Proteins with low binding affinities consistent with experimental estimates of non-specific DNA binding show hopping along the CG-DNA. This hopping significantly increases sliding speed. These simulation studies provide additional insights into the mechanism of how DNA binding proteins find their target sites on the genome.
DNA binding proteins efficiently search for their cognitive sites on long genomic DNA in cells to control biological activities. Recent experimental studies have revealed that the proteins use not only three-dimensional diffusion, but also one-dimensional diffusion (sliding) on DNA for this search process. For a better understanding of this biological process, we need to elucidate the mechanism of sliding. We report here molecular simulations using newly developed coarse-grained protein and DNA models for elucidating the nature of the sliding motions. Our simulation results show that: 1) hydrodynamic interactions between protein and DNA reduce sliding rate by 30%, 2) structural fluctuations of DNA give rise to steric collisions between proteins and DNA, which facilitate sliding motions, and 3) proteins with low binding affinities to DNA can hop along the DNA, resulting in a significant increase in sliding speed. These simulation studies provide additional insights into the mechanism of how DNA binding proteins find their target sites on the genome.
Citation: Ando T, Skolnick J (2014) Sliding of Proteins Non-specifically Bound to DNA: Brownian Dynamics Studies with Coarse-Grained Protein and DNA Models. PLoS Comput Biol 10(12): e1003990. doi:10.1371/journal.pcbi.1003990
Editor: G. Marius Clore, NIDDK, National Institutes of Health, United States of America
Received: May 12, 2014; Accepted: October 16, 2014; Published: December 11, 2014
Copyright: © 2014 Ando, Skolnick. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The authors confirm that all data underlying the findings are fully available without restriction. All relevant data are within the paper.
Funding: This work was supported in part by a grant from the National Institute of Health GM-37408. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
In living cells, DNA-binding proteins search for their specific target sites on DNA to initiate many biological processes, such as transcription, repression, activation, etc. How can proteins find their target sites on a long genome DNA? Many experimental and theoretical studies have been done over the past decade to address this issue (see review articles ,  and references therein). The question as to how proteins find their DNA binding sites arose from the experimental observation that the association rate of the lactose repressor and its target site on DNA was significantly (about 100 times) higher than the expected Smoluchowski reaction rate in three-dimensional (3D) space . To explain this discrepancy, Riggs proposed a mechanism that the protein does not rely on 3D diffusion alone for target search, but also non-specifically binds to the DNA at a random location, then undergoes one-dimensional (1D) Brownian diffusion (or sliding) along DNA to their target sites . This is based on the idea of reducing the dimensionality of diffusion based reactions in biological systems as originally suggested by Adam and Delbeück in 1968 . This mechanism, so-called “facilitated diffusion” or a “1D/3D mechanism”, was later expanded on by Berg, Winter, and von Hippel . In their model, DNA binding proteins have three modes for target search: 1) 1D sliding on DNA without dissociation, 2) 1D hopping along the DNA via a series of microscopic dissociation and association events to a nearby location, and 3) jumping or diffusion in 3D for inter-segmental transfer , . The significant facilitation of the rate that lactose repressor finds its DNA target site can be explained by an acceleration due to the electrostatic interactions between a positively charged site on the protein and the negatively charged phosphate groups in DNA at the low salt concentrations used in the experiment . This classical experiment and the idea of facilitated diffusion have driven many experimental and theoretical studies for over a decade. Indeed, single molecule experiments have confirmed 1D sliding motions of several DNA binding proteins along DNA in vitro – as well as in vivo , .
Assuming that only 1D sliding and 3D diffusion are at play, a simple analytical analysis gives the association rate of a DNA-binding protein to its target site , :(1)where kSmol is the Smoluchowski rate for random association to target site with 3D diffusion, is the average distance in units of base-pairs (bp) moved during a single sliding step. τ1D and τ3D are the average durations of 1D sliding and 3D diffusion in one round of a 1D and 3D search, respectively. The Smoluchowski rate has the form kSmol = 4πD3DbfNA with the diffusion coefficient of the protein in 3D, D3D, the spacing between the base-pairs of DNA is b ( = 3.4 Å), and the fraction of the protein's surface that contains the reactive binding interface, f, and NA is Avogadro number. With the measured diffusion coefficients of proteins ranging from D3D≈(1–5)×10−6 cm2 s−1 and assuming f≈0.2–0.5, we obtain kSmol≈108 M−1 s−1 , . The sliding length can be written by with a 1D diffusion coefficient D1D. This equation demonstrates that binding to a non-specific site on DNA slows down the search process by a factor of τ3D/(τ1D+τ3D) and this slow-down is compensated for by the sliding term . By setting dks/dτ1D = 0 and assuming D1D is independent of τ1D, we found that the maximum rate constant is achieved if τ1D = τ3D , . However, in a bacterial cell, the theoretical estimate for τ3D/(τ1D+τ3D) has range of 10−1–10−4 , which is verified by in vivo measurements . This ratio is far from the optimal ratio in the theory. Additionally, is estimated to be in the range of 102–103 bps in vitro  and is likely less in vivo . Therefore, the 1D/3D mechanism does not allow for significant facilitation and may reduce the efficiency of the protein–DNA search in bacterial cells , , . Possible reasons why 1D sliding does not increase the rate of diffusive motion may be related to spatial effects of the genome . Thus, despite significant progress in theoretical and experimental approaches, the detailed mechanisms of protein motion along DNA and its biological role are still not well understood. What are the protein and DNA conformations, energetics, and search dynamics that enable efficient target search along genomic DNA in crowded intracellular environments? For better understanding of the search processes at molecular to cellular levels, simulation studies can play an important role for connecting theory and experiment.
Since DNA has a helical structure, one might expect that proteins rotate along the helical groves of the DNA during 1D sliding. Schurr first derived a theoretical expression for an apparent 1D diffusion coefficient of this rotation-coupled sliding by a non-specifically DNA bound protein along DNA based on hydrodynamic theory . In his model, if the protein is approximated as sphere, the total friction is the sum of translational and rotational friction, in which it is assumed that the center of mass of the protein always remains on the DNA axis. Recently, Bagchi, Blainey, and Xie extended his model to take into account off-axis rotational diffusion of the proteins . Their model, called the “BBX model”, is expressed by(2)where kB is Boltzmann's constant, T is the temperature, η is the viscosity of water, apro is the radius of the protein, BP is the distance between two base pairs of DNA, equal to 3.4 Å, and ROC is the separation between the protein center of mass and the longest axis of DNA. The first and second terms in the denominator describe translational friction along the longest axis and rotational friction on the axis, and are the same as in the Schurr's model. The third term in the denominator, an additional term in the BBX model, accounts for the friction associated with off-axis circular translational motion around the axis. This third term is essentially translational friction. The most important consequence of the BBX model is that a 1/(apro)3 size dependence of diffusion is expected, in contrast to the usual Stokes-Einstein 1/apro dependence for pure translational motion. The same group experimentally measured the apparent 1D diffusion coefficients of various size transcription factors by a single molecule analysis in vitro and showed the 1/(apro)3 dependence of the diffusion coefficients. This strongly suggests that proteins non-specifically bound to DNA undergo rotation-coupled sliding . BBX theory predicts a 100–200-fold reduction in a protein's 1D diffusivities relative to their diffusivities in 3D for typical transcription factors modeled as spherical objects . Interestingly, the diffusion coefficients of most transcription factors predicted by the BBX model are still 2–5 times larger than experimental values, which may be attributed to free energy roughness to sliding that arises from details of the protein-DNA interactions , . Thus, on average diffusion constants on DNA for most transcription factors seems to be less than that expected for 3D diffusion in the absence of the DNA.
Apart from the theoretical analyses, molecular simulation studies also observe rotation-coupled sliding of DNA binding proteins –. In those studies, the dynamics of coarse-grained (CG) DNA and protein models were analyzed by molecular dynamics simulations in a simplified implicit solvent model. Proteins are modeled by strings of beads representing α carbon atoms of amino acids, with each nucleotide represented by a few beads. These studies provide molecular and atomistic views of the sliding process.
Here, we have developed a CG model for protein sliding along DNA. In our simulations, hydrodynamics interactions (HI) are considered. As is well known from polymer dynamics , HI can significantly alter the dynamics of macromolecules. Including HI in the simulations makes it possible to compare to the BBX theory of rotation-coupled sliding. We remind the reader that the BBX model assumes that 1) the protein is a sphere, 2) the protein follows a helical track along, and never detaches from, the DNA, 3) HI between the protein and DNA are ignored, and 4) there is no energy roughness along with sliding. Here, we would like to address the following questions: What are the effects of HI between the protein and DNA on 1D diffusion of proteins along DNA? How does DNA flexibility affect 1D sliding? These effects are difficult to handle in the theoretical analysis. We first build CG models of DNA binding proteins and DNA, and then estimate protein-DNA binding affinities in the CG model by an umbrella sampling method. Using the model, we then perform BD simulations under various conditions to answer these questions.
Brownian dynamics algorithm with hydrodynamic interactions
The BD simulations were performed using a second-order Runge-Kutta algorithm . For constructing the 3N×3N diffusion matrix D of a given simulation system with N particles, we employ the Rotne-Prager-Yamakawa (RPY) tensor , , described by(3)Here, Dij is the 3×3 diffusion tensor for particles i and j, rij is ri−rj, with the particle position vector r, rij is the length of rij, and , I is the 3×3 unit tensor, kB is Boltzmann's constant, T is the temperature, η is the viscosity of water, and ai is the Stokes radius of particle i. is an effective Stokes radius of the i and j particle pair. In this study, was used .
CG-DNA and protein models
A schematic view of our CG protein and DNA model is shown in Fig. 1. Many DNA binding proteins form homo dimers, where the dimeric proteins have two DNA binding domains, e.g. lactose repressor, tryptophan repressor, λ repressor, etc. . In this work, a protein molecule is represented by three beads: one representing a protein body portion, named the “PBP” bead, and the rest of the beads represent the DNA binding portion, named “DBP”, which have positive charges to bind to DNA. For the DNA molecule, the two adjacent nucleotides in the double strand are represented by a pseudo phosphate “PP” bead at the position of the phosphate atom in one strand of the canonical B-form of DNA. Since the bead represents two adjacent nucleotides, we set the effective charge of the PP beads to be −2. PP beads are connected to pseudo backbone “PB” beads, located on the long axis of the DNA. Radii, σ, for excluded volume effects explained below, Stokes radii, a, and effective charges, q, of the beads are listed in Table 1. σ values were determined to represent geometrical features of DNA. The distance between adjacent phosphate atoms is about 12.6 Å in B-DNA, with two adjacent nucleotides represented by one PP bead. To reproduce the excluded volume of the two adjacent nucleotides, the radii of the PP beads were set to 10.4 Å. The center of PBP was placed at 38 Å which gives an off-axis distance of 47 Å ( = 9+38) between Roc values of LacI (55 Å) and hOgg1 (25 Å) as reported in Ref. . The radii of the PBP were set to 27.6 Å ( = 38–10.4 Å). DBP's radii of 6 Å were used to geometrically fit between PP beads. This is slightly smaller than the surface distance between PP beads, 7 Å. The assigned a for beads of CG-DNA give translational diffusion coefficients of small DNA fragments (8 bp to 24 bp) close to the experimental values .
PBP and DBP represent a Protein Body Portion and a DNA Binding Portion of the CG-protein model, respectively. PP and PB represent a Pseudo Phosphate of two adjacent nucleotides and a Pseudo Backbone of double strand DNA, respectively. PBP and DBP beads are connected to each other by a harmonic potential, represented as a black line. The excluded volume radii for each bead used in the simulations are shown.
The pitch in B-form DNA is 33.8 Å, i.e. that of 10 base pairs, so that distance between adjacent PB beads is 3.38 Å and the torsion angle defined by PP(α) – PB(α) – PB(α+1) – PP(α+1) for the α-th pseudo residue is 36 degrees. The distance between PP(α) – PB(α) is 8.973 Å, which is the position of the phosphate atom from the longest axis of B-DNA. Adjacent beads, PP(α) – PB(α) and PB(α) – PB(α+1), are connected by a harmonic potential,(4)where kbond is the force constant, and r0 is the equilibrium distance between beads making the bond. Beads in CG-protein model are also connected by Eq. (4). All bond distances and force constants are listed in Table 2.
The bond angle potential for the CG-DNA molecules is given by(5)where kangle is the force constant, θ is the bond angle, and θ0 is the equilibrium bond angle. The stiffness of bond angles formed by adjacent three PB beads in the backbone is directly related to the persistence length of DNA . All bond angles and force constants are listed in Table 3. The listed parameters for PB(α) – PB(α+1) – PB(α+2) correspond to the typical persistence length of DNA, 50 nm .
Torsion angles defined by PP(α) – PB(α) – PB(α+1) – PP(α+1) are restrained by(6)where ktorsion is the force constant, φ is the torsion angle, and φ0 is the equilibrium torsion angle. All torsion angles and force constants are listed in Table 4.
Excluded volume effects are described by a half-harmonic potential,(7)where kex is the force constant. For electrostatic interactions between beads, the effective Yukawa pair-potential of DLVO (Derjaguin, Landau, Verwey, and Overbeek) theory was used and is given by (8)where e is the elementary charge (4.803×10−10 esu), ε0 is the permittivity of the vacuum, ε is the dielectric constant of the medium, and κ is the inverse of the Debye screening length. It is worth to noting that q in Eq. (8) is the effective charge, which is parameterized to change binding affinities between the modeled CG-protein and CG-DNA based on an umbrella sampling technique as described in the Umbrella sampling section of METHODS. Bead pairs that satisfy |i−j|≤4 are excluded in calculating the non-bonded interactions Vex and Velec.
BD simulations were performed under non-periodic boundary conditions. The simulation temperature was set to 298 K, and the time step was set to 0.25 ps. The diffusion tensor estimated by the RPY tensor and its Cholesky decomposition for computing Brownian displacement satisfying the fluctuation-dissipation theorem were updated every 200 steps. BD simulations were performed for 25 µs. Coordinates and energies were sampled every 10,000 steps (2.5 ns). The dielectric constant of the medium ε was set to 78.5. The Debye length 1/κ was set to 7.8 Å, which corresponds to a NaCl concentration of 0.15 M at 298 K. kex was set to 1 kcal/mol/Å2. A cutoff distance of 40 Å was used for the non-bonded interactions Vex and Velec. The DNA length was 200 bp, in which our CG model has 200 pseudo residues consisting of 200 PP and 200 PB beads.
The geometrical center of the CG-DNA model was placed at the origin of the Cartesian coordinates, and the longest axis of the DNA was placed along Z-axis. The CG-protein was placed just above the DNA molecule. To study the effects of HI on the 1D sliding of the CG-protein model, we performed two different types of BD simulations: one with full HI; that is, HI within each CG-protein and DNA molecules as well as between CG molecules are considered. The other is a simulation with only intramolecular HI, where intermolecular HI are neglected. Hereafter, we call the former “with inter-HI” and the latter “without inter-HI”. We also considered two different treatments for CG-DNA for analyzing effects of DNA flexibility on 1D sliding of the CG-protein: one is “restrained CG-DNA”, where all beads of the CG-DNA molecule are restrained at their initial positions by a harmonic potential with a force constant of 1 kcal/mol/Å2. The other is “flexible CG-DNA”, where two PP and PB beads at both termini are restrained at their initial positions by a harmonic potential with a force constant of 0.01 kcal/mol/Å2. For each condition, ten independent BD simulations were performed with different random seeds.
An umbrella sampling method was employed for estimating the binding free energy of the CG-DNA binding protein, with various charges on the DBP beads, to DNA . The geometrical center of the CG-DNA model was placed at the origin of the Cartesian coordinates, and the longest axis of the DNA was placed along Z-axis. All beads of the DNA were restrained in Cartesian space by a harmonic potential throughout the umbrella sampling simulations. The PBP bead of the CG-protein was placed on the X-axis at a distance of 90 Å as its initial position. PBP beads are allowed to move only in the X-Y plane by applying a harmonic potential along the Z-axis with a force constant of 1 kcal/mol/Å2. An umbrella potential V = 1/2kumb(r−r0)2 was applied between the PBP and a PB bead nearest the origin with r0 = 90, 89, ···, 31 Å (total 60 windows) and kumb = 5 kcal/mol/Å2. For each r0, a 25 ns BD simulation was performed, where the first 5 ns are for equilibration and the rest for sampling. The potential mean force was constructed by WHAM . The umbrella sampling BD simulations and their analysis were performed 5 times with different random seeds.
Estimation of 3D diffusion coefficients and Stokes radii of the model proteins by rigid-particle theory
Rigid-particle theory is a well-known method to compute diffusion properties, such as the translational and rotational diffusion coefficients at infinite dilution, D3DT and D3DR, respectively, of rigid objects constructed from many particles –. The method gives diffusion coefficients of the object very close to the values estimated from BD simulations , . In this study, the Stokes radii of the model CG-protein apro were estimated via the Stokes-Einstein equation connecting the Stokes radius and translational and rotational diffusion coefficients, given by(9)and(10)respectively. Thus, two estimates of the Stokes radii are provided.
Binding affinity estimated by umbrella sampling
Binding affinities for non-specific DNA binding for several proteins have been estimated to be in the range of 10 kBT (5.9 kcal/mol) to 15 kBT (8.9 kcal/mol) at physiological salt concentrations . We employed the umbrella sampling technique to estimate the binding affinities of the CG model with various charges of the DBP beads, q(DBP). In Fig. 2, binding free energies as a function of q(DBP) are shown. CG-proteins with q(DBP) = 8–10 with restrained CG-DNA have a binding free energy of 5.71±0.02 to 9.00±0.02 kcal/mol, which is close to the experimental estimate. For the flexible CG-DNA model, fluctuation of the CG-DNA reduces binding affinity; here CG-proteins with q(DBP) = 8 to 15 have binding free energies within the range of experimental estimates.
Stokes radius of the PBP bead, a(PBP), of 40 Å was used for this estimation.
Comparison with theory
In order to compare our simulation results to the theory, we first performed BD simulations using q(DBP) = 20 with the restrained CG-DNA model in the absence of inter-HI. With this q(DBP), the CG-protein virtually never detaches from DNA during the simulation length due to the protein's very high binding affinity of −30 kcal/mol. In our model, all PP beads have the same charge, and the DNA sequence effect is not considered. Therefore, it is possible that the energy landscape along DNA might be very smooth. These conditions make the simulation closely correspond to the model in the BBX theory. The binding affinity −30 kcal/mol is not biologically relevant. However, our purpose in this section is to compare BD results with the BBX theory. In subsequent analysis, Roc is defined as the distance between the DNA axis and the center of diffusion of the CG-protein calculated by the rigid-particle theory.
In Fig. 3, a representative trajectory of the model CG-protein with q(DBP) = 20 and a(PBP) = 40 Å is shown (see also Movie S1). For this condition, the CG-protein showed rotation-coupled sliding along Z-axis as expected without detaching from the model CG-DNA for all a(PBP) ( = 30, 40 and 50 Å) values examined. The diffusion coefficients and related properties or proteins of various sizes estimated from the BD simulations in the absence of inter-HI and the rigid particle theory are listed in Table 5. In this table, the protein's Stokes radii were estimated by Eq. (9), and used for estimating D1Dtheory via Eq. (2). D1D are much smaller than D3D by a factor of ∼100. This large reduction in diffusion rate is qualitatively consistent with the theoretical expectation from the BBX model. However, D1Dtheory values estimated with aproT are smaller than D1D calculated by the BD simulations, which is especially evident for larger a(PBP).
The BD simulation was done in the absence of intermolecular HI and all beads of the CG-DNA molecule were restrained by a harmonic potential.
To investigate the origin of this deviation, Stokes radii of proteins were also estimated from rotational diffusion coefficients computed by the rigid-particle theory using Eq. (10), which are listed in Table 6. Although aproT values are close to a(PBP), aproR are almost a constant 20 Å for all protein sizes, which is much smaller than a(PBP) as well as aproT values. If a given object is completely spherical, like the protein model in BBX theory, aproR should be equal to aproT. This discrepancy of aproR in between our model and the BBX theory may give rise to the deviation in 1D diffusion coefficient. To check this possibility, we check if D1Dtheory gives a value close to D1D when both aproT and aproR values are used in estimating D1Dtheory. As explained in Introduction, the denominator in Eq. (2) is the sum of translational and rotational friction contributions. So, we re-estimate the 1D diffusion coefficients using aproT and aproR values as follows:(11)The difference of Eq. (11) from Eq. (2) used for calculating D1Dtheory listed in Table 5 is the use of aproR value in the second term in the denominator. The corrected 1D diffusion coefficients, D1Dtheory* listed in Table 6, for a(PBP) = 40 and 50 Å are much closer to the D1D values directly computed from the BD simulations. This result indicates that the small aproR values compared to aproT in our model result in larger D1D than the theoretical estimates from the original D1Dtheory of Eq. (2).
D1D/D1Dtheory and D1D/D1Dtheory* increase with a(PBP). For a(PBP) = 30 Å, the correction does not improve the result and even makes it slightly worse. The distance between PBP and DBP beads is 41.6 Å (Table 2) and a(DBP) = 8 Å (Table 1). Therefore, the Stokes radii of PBP and DBP overlap if a(PBP) is larger than 33.6 Å. The RPY tensor described in Eq. (3) is defined for overlapping particles, which was derived to remain the tensor positive definite. However, the physical meaning of this form is problematic . The increase on D1D/D1Dtheory and D1D/D1Dtheory* values with Stokes radius may be due to particle overlap and use of the RPY tensor for overlapping particles. For a(PBP) = 30 Å, PBP and DBP do not overlap. For a(PBP) = 30 Å, although PBP and DBP do not overlap, D1Dtheory and D1Dtheory* values slightly deviate from D1D obtained from BD simulations. The deviation may be rooted in the shape difference of modeled proteins; BBX theory assumes proteins are completely spherical objects, but our CG-protein models are not completely spherical.
Effects of HI between protein and DNA, and DNA flexibility on 1D sliding speed
In this section, we try to evaluate the effects of HI between the CG-protein and CG-DNA, and DNA flexibility on 1D diffusion. In Table 7, the apparent 1D diffusion coefficients of proteins with a q(DBP) of 20 estimated by the BD simulations in the presence and absence of inter-HI, and with the restrained and flexible CG-DNA are listed. A representative trajectory of CG-protein sliding dynamics on flexible DNA in the absence of inter-HI is shown in Movie S2. Inter-HI reduce the 1D diffusivity by 30% and 40% on average over three different CG-protein sizes for both the restrained and flexible CG-DNA, respectively. Thus, the implication is that inter-HI effects are quite robust and insensitive to details. This reduction could be explained by correlated motions between the CG-protein and DNA models caused by inter-HI. The flexibility of CG-DNA increases 1D diffusivity by factors of 2.6 and 2.1 on average over three different CG-protein sizes in the absence and presence of inter-HI, respectively. This mechanism will be discussed below.
Effect of binding affinity of the protein on its 1D sliding
Finally, the effects of charge q(DBP) on 1D sliding rate are examined. In Fig. 4, representative trajectories in Z position of the model CG-proteins with various q(DBP) and a(PBP) = 40 Å are shown, where the restrained CG-DNA molecules were used. During 25 µs BD simulations, the CG-protein with q(DBP) of 7 to 10 hopped along the CG-DNA molecule (see Movie S3 that shows hopping of the CG-protein). This hopping is prominent for smaller q(DBP) values. This trend was not changed for the simulations with the flexible CG-DNA model. Apparent 1D diffusion coefficients of the CG-protein with a(PBP) = 40 Å and various q(DBP) with the restrained and flexible CG-DNA in the presence and absence of inter-HI are shown in Fig. 5. The CG-proteins with smaller q(DBP) values (<10) tend to slide quickly due to hopping for both CG-DNA models. For q(DBP)>10, D1D reached the same lower bound for both CG-DNA models. CG-proteins with q(DBP) = 8 to 10 have binding affinities within experimental estimates of non-specific DNA binding. Those proteins in our model showed hopping along DNA. Direct observations of the hopping by single molecule experiments are currently very difficult due to the limited experimental resolution. Our simulation supports the possibility of hopping for non-specifically DNA bounded proteins initially envisioned in the theory.
BD simulations shown in this figure were done in the absence of intermolecular HI using the same random seed. Arrows indicate times that hopping was observed.
These values are average over ten BD simulations.
Reduction due to inter-HI was observed for the whole range of q(DBP), which was evident for the larger q(DBP). Acceleration by DNA flexibility was also seen for whole range of q(DBP) values, by about a factor of 2. The CG-proteins with a(PBP) = 30 Å and 50 Å show almost similar results for the reduction by inter-HI and acceleration by fluctuations. Regardless of inter-HI, 1D diffusion for the CG-protein with q(DBP) = 15 and 20 in the system with flexible CG-DNA are 2 to 3 times faster than those with same charge values as well as for q(DBP) = 10 in the restrained CG-DNA model. However, CG-protein binding affinity with q(DBP) = 10 in the restrained CG-DNA is lower than that with q(DBP) = 15 and 20 in the flexible CG-DNA system as shown in Fig. 2. Local and temporary steric repulsion between the CG-protein and CG-DNA model caused by fluctuations of the DNA molecule may enhance the rate of 1D sliding.
Implications of simulation results to experiments and limitations of the CG-model
Free energy roughness along the sliding path caused by the sequence-dependent atomistic interactions, e.g. hydrogen bonding, is also considered to be an important factor for reducing 1D diffusivity of the DNA bound protein. This effect is often evaluated by the Zwanzig formula for diffusion in a rough potential in 1D space, in which the reduction factor is written as exp[(−ε/kBT)2], with energy roughness ε . This reduction is very sensitive to ε. From the Zwanzig formula and experimental results on apparent diffusion coefficients of several DNA binding proteins, ε was estimated 1.1±0.2 kBT , which means a reduction factor of 0.3. In this analysis, all sources of reduction except for helical diffusion are assumed to be due to energy roughness. However, as seen in this work, inter-HI decrease and DNA fluctuation increase the diffusivity of proteins along DNA
Here, the RPY tensor was employed to represent HI between the CG-protein and CG-DNA models. However, the RPY tensor only includes the far-field part of hydrodynamic effects , . A recent simulation study of the association of two non-polar model objects clearly showed that at short distances (<1–2 nm) molecular scale effects dominate, giving rise to deviations from continuum hydrodynamic theory –. Even though a more sophisticated hydrodynamic model was used, the deviation from the atomistic simulation results was not eliminated. Therefore, we may need to find a better description of HI at short distances. However, we believe that our simulation results using the RPY tensor still provide the important qualitative features of HI. Finally, our BD simulations show the possibility that DNA structural fluctuations enhance 1D diffusion of DNA binding proteins by steric collisions between the protein and DNA. However, since proteins are also flexible, this effect may be damped.
In this work, we have developed CG models of DNA binding protein and DNA for dissecting the sliding mechanisms of a protein along the DNA. By considering HI we could compare our simulation results with the theoretical model of rotation coupled sliding along helical path of DNA proposed by Bagchi, Blainey, and Xie . This makes it possible to elucidate the relative importance of hydrodynamic forces between the protein and DNA, DNA flexibility, and binding affinity of the protein to the DNA on 1D diffusivity of the protein along the DNA. Our simulations under conditions similar to the BBX model showed that 1D diffusivity obtained from the BD simulations using our model are 1.2–2.2 times faster than theoretical estimates. This discrepancy is mainly due to a low rotational friction of the CG-protein in our model compared to the theoretical treatment in BBX model. Second, BD simulations with intermolecular HI represented by the RPY tensor show that HI reduce 1D diffusivity by 30%. Third, a CG-protein whose binding affinity to CG-DNA is in the range of experiment, −5 to −10 kcal/mol, shows hopping along DNA. This results in an increase of its apparent 1D diffusion coefficients. Direct observations of the hopping by single molecule experiments are currently very difficult due to the limited time and space resolutions in experiments.
The model developed in this work is quite simple, but still we can do “experiments” to elucidate the sliding mechanisms of DNA binding proteins by changing several parameters. Early BD simulation work on protein diffusion in concentrated DNA solutions using a CG model reasonably well reproduce experimental results , where the protein and DNA are represented one sphere and strings of beads, respectively, and electrostatic interactions are calculated from a Debye-Hückel potential. This work also shows usefulness of a very simple CG model to analyze the dynamics of macromolecules. In this work, we have only changed the effective charge parameter of the CG-DNA binding protein to reproduce the biologically relevant binding affinity. However, since HI are long-range effects and the dominant effect is insensitive to shape of the molecule, we believe that effects of HI on protein sliding observed in our BD simulation would be seen in the simulations with different particle radii and higher-resolution models.
Inclusion of HI in BD simulations is often a computational bottleneck to large-scale simulations. To overcome this difficulty, we developed a Krylov subspace method for computing correlated Brownian noise vectors which scales as O(N2) with an N particle system, whereas an O(N3) computation is required in a conventional BD algorithm . The Krylov method with a particle-mesh Ewald method, which is based on fast-Fourier transform for computing hydrodynamic effects, enables BD simulations with O(NlogN) scaling in computation and O(N) memory storage . Combining these advance BD algorithms and the CG model developed here would give us a possibility to perform large-scale BD simulations of DNA binding proteins in a crowded intracellular environment, which should enable a deeper understanding of the experimental results. These simulations are currently underway.
BD simulation of sliding of the CG-protein along the restrained CG-DNA in the absence of intermolecular hydrodynamic interactions. This trajectory corresponds to Fig. 3 in the main text. The Stokes radius of the PBP bead of the CG-protein, a(PBP), is 40 Å and the charge at q(DBP) of the CG-protein is 20. In this movie, the beads' radii correspond to their excluded volume radii.
BD simulation of the sliding of the CG-protein along flexible CG-DNA in the absence of intermolecular hydrodynamic interactions. The Stokes radius of the PBP bead of the CG-protein, a(PBP), is 40 Å and the charge at q(DBP) of the CG-protein is 20. In this movie, the beads' radii correspond to their excluded volume radii.
BD simulation of the sliding of the CG-protein along the restrained CG-DNA in the absence of intermolecular hydrodynamic interactions. This movie corresponds to the trajectory shown at top left panel in Fig. 4 of the main text. The CG-protein, which has a(PBP) of 40 Å and q(DBP) of 7, shows hopping along the CG-DNA. In this movie, beads' radii correspond to their exclude volume radii.
Conceived and designed the experiments: TA JS. Performed the experiments: TA. Analyzed the data: TA JS. Wrote the paper: TA JS.
- 1. Mirny L, Slutsky M, Wunderlich Z, Tafvizi A, Leith J, et al. (2009) How a protein searches for its site on DNA: the mechanism of facilitated diffusion. J Phys A: Math Theor 42. doi: 10.1088/1751-8113/42/43/434013
- 2. Tafvizi A, Mirny LA, van Oijen AM (2011) Dancing on DNA: kinetic aspects of search processes on DNA. ChempPysChem 12: 1481–1489. doi: 10.1002/cphc.201100112
- 3. Riggs AD, Bourgeoi.S, Cohn M (1970) Lac Repressor-Operator Interaction.3. Kinetic Studies. J Mol Biol 53: 401–417.
- 4. Adam G, Delbruck M (1968) Reduction of dimensionality in biological diffusion processes. In: Rich A, Davidson N, editors. Structural chemistry and molecular biology: W. H. Freeman and Co. San Francisco. pp. 198–215.
- 5. Berg OG, Winter RB, von Hippel PH (1981) Diffusion-Driven Mechanisms of Protein Translocation on Nucleic-Acids.1. Models and Theory. Biochemistry 20: 6929–6948. doi: 10.1021/bi00527a028
- 6. Halford SE (2009) An end to 40 years of mistakes in DNA-protein association kinetics? Biochem Soc Trans 37: 343–348. doi: 10.1042/bst0370343
- 7. Blainey PC, Luo G, Kou SC, Mangel WF, Verdine GL, et al. (2009) Nonspecifically bound proteins spin while diffusing along DNA. Nat Struct Mol Biol 16: 1224–1229. doi: 10.1038/nsmb.1716
- 8. Blainey PC, van Oijen AM, Banerjee A, Verdine GL, Xie XS (2006) A base-excision DNA-repair protein finds intrahelical lesion bases by fast sliding in contact with DNA. Proc Natl Acad Sci U S A 103: 5752–5757. doi: 10.1073/pnas.0509723103
- 9. Gorman J, Chowdhury A, Surtees JA, Shimada J, Reichman DR, et al. (2007) Dynamic basis for one-dimensional DNA scanning by the mismatch repair complex Msh2-Msh6. Mol Cell 28: 359–370. doi: 10.1016/j.molcel.2007.09.008
- 10. Graneli A, Yeykal CC, Robertson RB, Greene EC (2006) Long-distance lateral diffusion of human Rad51 on double-stranded DNA. Proc Natl Acad Sci U S A 103: 1221–1226. doi: 10.1073/pnas.0508366103
- 11. Tafvizi A, Huang F, Leith JS, Fersht AR, Mirny LA, et al. (2008) Tumor suppressor p53 slides on DNA with low friction and high stability. Biophys J 95: L01–03. doi: 10.1529/biophysj.108.134122
- 12. Wang YM, Austin RH, Cox EC (2006) Single molecule measurements of repressor protein 1D diffusion on DNA. Phys Rev Lett 97: 048302. doi: 10.1103/physrevlett.97.048302
- 13. Elf J, Li GW, Xie XS (2007) Probing transcription factor dynamics at the single-molecule level in a living cell. Science 316: 1191–1194. doi: 10.1126/science.1141967
- 14. Hammar P, Leroy P, Mahmutovic A, Marklund EG, Berg OG, et al. (2012) The lac repressor displays facilitated diffusion in living cells. Science 336: 1595–1598. doi: 10.1126/science.1221648
- 15. Slutsky M, Mirny LA (2004) Kinetics of protein-DNA interaction: Facilitated target location in sequence-dependent potential. Biophys J 87: 4021–4035. doi: 10.1529/biophysj.104.050765
- 16. Schurr JM (1979) The one-dimensional diffusion coefficient of proteins absorbed on DNA. Hydrodynamic considerations. Biophys Chem 9: 413–414. doi: 10.1016/0301-4622(75)80057-3
- 17. Bagchi B, Blainey PC, Xie XS (2008) Diffusion constant of a nonspecifically bound protein undergoing curvilinear motion along DNA. J Phys Chem B 112: 6282–6284. doi: 10.1021/jp077568f
- 18. Givaty O, Levy Y (2009) Protein Sliding along DNA: Dynamics and Structural Characterization. J Mol Biol 385: 1087–1097. doi: 10.1016/j.jmb.2008.11.016
- 19. Marcovitz A, Levy Y (2011) Frustration in protein-DNA binding influences conformational switching and target search kinetics. Proc Natl Acad Sci U S A 108: 17957–17962. doi: 10.1073/pnas.1109594108
- 20. Terakawa T, Kenzaki H, Takada S (2012) p53 searches on DNA by rotation-uncoupled sliding at C-terminal tails and restricted hopping of core domains. J Am Chem Soc 134: 14555–14562. doi: 10.1021/ja305369u
- 21. Doi M, Edwards SF (1988) The theory of polymer dynamics. Oxford: Oxford University Press.
- 22. Iniesta A, Delatorre JG (1990) A 2nd-Order Algorithm for the Simulation of the Brownian Dynamics of Macromolecular Models. J Chem Phys 92: 2015–2018. doi: 10.1063/1.458034
- 23. Rotne J, Prager S (1969) Variational treatment of hydrodynamic interaction in polymers. J Chem Phys 50: 4831–4837. doi: 10.1063/1.1670977
- 24. Yamakawa H (1970) Transport properties of polymer chains in dilute solution - hydrodynamic interaction. J Chem Phys 53: 436–443. doi: 10.1063/1.1673799
- 25. Carrasco B, Garcia de la Torre J, Zipper P (1999) Calculation of hydrodynamic properties of macromolecular bead models with overlapping spheres. Eur Biophys J 28: 510–515. doi: 10.1007/s002490050233
- 26. Lamond AI (2002) Molecular biology of the cell, 4th edition. Nature 417: 383–383. doi: 10.1038/417383a
- 27. Fernandes MX, Ortega A, Lopez Martinez MC, Garcia de la Torre J (2002) Calculation of hydrodynamic properties of small nucleic acids from their atomic structure. Nucleic Acids Res 30: 1782–1788. doi: 10.1093/nar/30.8.1782
- 28. Schlick T (2002) Molecular modeling and simulation: an interdisciplinary guide. New York: Springer.
- 29. Bishop KJM, Wilmer CE, Soh S, Grzybowski BA (2009) Nanoscale Forces and Their Uses in Self-Assembly. Small 5: 1600–1630. doi: 10.1002/smll.200900358
- 30. Torrie GM, Valleau JP (1977) Monte-Carlo Study of a Phase-Separating Liquid-Mixture by Umbrella Sampling. J Chem Phys 66: 1402–1408. doi: 10.1063/1.434125
- 31. Kumar S, Rosenberg JM, Bouzida D, Swendsen RH, Kollman PA (1995) Multidimensional Free-Energy Calculations Using the Weighted Histogram Analysis Method. J Comput Chem 16: 1339–1350. doi: 10.1002/jcc.540161104
- 32. Carrasco B, Garcia de la Torre J (1999) Hydrodynamic properties of rigid particles: comparison of different modeling and computational procedures. Biophys J 76: 3044–3057. doi: 10.1016/s0006-3495(99)77457-6
- 33. Garcia De La Torre J, Huertas ML, Carrasco B (2000) Calculation of hydrodynamic properties of globular proteins from their atomic-level structure. Biophys J 78: 719–730. doi: 10.1016/s0006-3495(00)76630-6
- 34. Garcia De La Torre J, Jimenez A, Freire JJ (1982) Monte-Carlo Calculation of Hydrodynamic Properties of Freely Jointed, Freely Rotating, and Real Polymethylene Chains. Macromolecules 15: 148–154. doi: 10.1021/ma00229a030
- 35. Frembgen-Kesner T, Elcock AH (2009) Striking Effects of Hydrodynamic Interactions on the Simulated Diffusion and Folding of Proteins. J Chem Theory Comput 5: 242–256. doi: 10.1021/ct800499p
- 36. Frembgen-Kesner T, Elcock AH (2010) Absolute Protein-Protein Association Rate Constants from Flexible, Coarse-Grained Brownian Dynamics Simulations: The Role of Intermolecular Hydrodynamic Interactions in Barnase-Barstar Association. Biophys J 99: L75–L77. doi: 10.1016/j.bpj.2010.09.006
- 37. Revzin A (1990) The Biology of Nonspecific DNA Protein Interactions: Taylor & Francis.
- 38. Szymczak P, Cieplak M (2011) Hydrodynamic effects in proteins. J Phys-Condens Mat 23. doi: 10.1088/0953-8984/23/3/033102
- 39. Zwanzig R (1988) Diffusion in a Rough Potential. Proc Natl Acad Sci U S A 85: 2029–2030. doi: 10.1073/pnas.85.7.2029
- 40. Li JY, Morrone JA, Berne BJ (2012) Are Hydrodynamic Interactions Important in the Kinetics of Hydrophobic Collapse? Journal of Physical Chemistry B 116: 11537–11544. doi: 10.1021/jp307466r
- 41. Mondal J, Morrone JA, Berne BJ (2013) How hydrophobic drying forces impact the kinetics of molecular recognition. Proc Natl Acad Sci U S A 110: 13277–13282. doi: 10.1073/pnas.1312529110
- 42. Morrone JA, Li J, Berne BJ (2012) Interplay between hydrodynamics and the free energy surface in the assembly of nanoscale hydrophobes. J Phys Chem B 116: 378–389. doi: 10.1021/jp209568n
- 43. Dwyer JD, Bloomfield VA (1993) Brownian Dynamics Simulations of Probe and Self-Diffusion in Concentrated Protein and DNA Solutions. Biophys J 65: 1810–1816. doi: 10.1016/s0006-3495(93)81235-9
- 44. Ando T, Chow E, Saad Y, Skolnick J (2012) Krylov subspace methods for computing hydrodynamic interactions in Brownian dynamics simulations. J Chem Phys 137. doi: 10.1063/1.4742347
- 45. X. L, Chow E. Large-scale Hydrodynamic Brownian Simulations on Multicore and Manycore Architectures; 2014; Phenix, AZ.