Figures
Abstract
The compaction of DNA by phase-separating, DNA-binding proteins has emerged as a key mechanism for organizing chromatin and shaping genome architecture. Although experimental studies have provided insights into the governing principles of such protein-DNA co-condensation, how DNA sequence affects this process remains unclear. Guided by experimental observations, we develop a simple polymer-based model of protein-DNA co-condensation that explicitly accounts for sequence-dependent protein binding. Using coarse-grained Brownian dynamics simulations, we demonstrate that, in the case of a homogeneous DNA, only one condensate forms in equilibrium. In sharp contrast, DNA sequence heterogeneity can result in the coexistence of multiple condensates. Interestingly, we find that interfacial DNA binding affinity controls capillary forces generated by protein-DNA condensates, offering a potential mechanism to regulate chromatin structure and 3D genome organization. To demonstrate the usefulness of our modeling framework, we compare the simulation results against published data for the condensation of DNA via Dps, Sox2, and HP1. We find that DNA sequence dictates the condensation of Sox2 and HP1 with DNA. Overall, our framework provides mechanistic insights into how DNA sequence affects protein-DNA co-condensation and paves the way for developing a deeper understanding of genome organization.
Author summary
Cells rely on a variety of proteins to compact and organize their DNA, enabling them to shape genome architecture and regulate gene expression. Many of these DNA-binding proteins can phase-separate with DNA, but how the underlying DNA sequence influences this behaviour remains elusive. Here, we use a minimal model to study protein-mediated DNA condensation along an end-tethered DNA molecule. Our Brownian dynamics simulations show that non-specific protein binding produces a single condensate at equilibrium, whereas sequence-dependent binding can generate multiple condensates of varying sizes, governed by the interplay between DNA–protein and protein–protein interaction strengths. We further find that the DNA–protein affinity at the condensate interface controls the capillary forces exerted on bare DNA. By comparing our results with experimental data for Dps, Sox2, and HP1, we show that DNA sequence strongly affects the condensation of Sox2 and HP1 but has minimal influence on Dps. Together, these findings provide mechanistic insight into how DNA sequence shapes protein–DNA condensation and advance our understanding of genome reorganization.
Citation: Singh RK, Swain P, Ganji M, Choubey S (2025) Decoding the role of DNA sequence on protein-DNA co-condensation. PLoS Comput Biol 21(12): e1013829. https://doi.org/10.1371/journal.pcbi.1013829
Editor: Changbong Hyeon, Korea Institute for Advanced Study, KOREA, REPUBLIC OF
Received: July 1, 2025; Accepted: December 9, 2025; Published: December 18, 2025
Copyright: © 2025 Singh et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All simulation code is available at the GitHub repository: Decoding-the-role-of-DNA-sequence-on-protein-DNA-co-condensation.
Funding: M.G. gratefully acknowledges funding from Department of Biotechnology, Ministry of Science and Technology, Government of India/Wellcome India Alliance intermediate fellowship (IA/I/21/2/505928). S.C. is supported by the Ramalingaswami Re-entry Fellowship (BT/HRD/35/02/2006), a re-entry scheme of the Department of Biotechnology, Ministry of Science and Technology, Government of India. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
1 Introduction
The genome is spatially organized within the cell nucleus through three-dimensional (3D) compaction, a higher-order architecture essential for regulating key cellular processes such as gene expression, DNA replication, and repair [1–4]. This organization arises from hierarchical chromatin folding, which forms distinct yet pervasive features like loops, domains, and compartments across various length scales [4,5]. In prokaryotes, Nucleoid-Associated Proteins (NAPs) compact DNA to maintain nucleoid structure, thereby modulating DNA accessibility to transcription machinery and directly impacting global gene expression [6]. Despite considerable progress, the physical mechanisms by which DNA-binding proteins compact the genome and regulate gene expression in vivo remain less-understood. Unfurling the principles that govern genome architecture and its dynamic remodeling remains a central goal in regulatory biology.
Recent in vitro experiments suggest that transcription factors and DNA can undergo co-condensation through the collective behavior of thousands of proteins and DNA, driven by protein-protein and protein-DNA interactions [7–10]. The resulting protein-DNA co-condensate ensnares a specific amount of DNA, exerting force on the surrounding free DNA, known as capillary force [10–13]. Capillary forces, arising from the interfacial physics of protein-DNA condensates, have the potential to serve as a mechanism for regulating and reshaping chromatin. This capability holds promise in influencing the 3D architecture of the genome.
Experimental studies have investigated protein-DNA co-condensates and their impact on capillary forces by employing optical tweezers and coverslip-based assays, as shown in Fig 1. For instance, Quail et al. utilized total internal reflection fluorescence (TIRF) microscopy to explore the co-condensation of -phage DNA and the pioneer transcription factor protein FoxA1 [11]. Their findings revealed that the FoxA1-DNA co-condensate could generate forces on the order of 0.2 piconewton (pN). Another study highlighted that PARP1-DNA co-condensates generate capillary forces in the sub-piconewton range [12]. Intriguingly, a more recent investigation demonstrated that the pluripotent factor Sox2 can generate up to 7 pN of capillary forces [10].
A Schematic of experimental setup wherein the ends of a single DNA molecule (blue) are tethered to the glass slide using biotin-streptavidin interactions. Proteins (green) phase-separate with DNA to form protein-DNA co-condensates. B Schematic of the optical tweezers assay: a DNA molecule (blue) is held between two optically trapped beads (black) via biotin–streptavidin interactions.
These experiments provide insights into force–extension behavior as well. While such measurements for individual DNA molecules have been well-characterized [14,15], the presence of DNA-binding proteins has inspired extensive study over the past two decades. In untethered DNA, proteins can link distant segments, leading to collapse at high concentrations through a mechanism known as bridging-induced phase separation (BIPS) [16–20]. Notably, BIPS models typically neglect direct interactions between the proteins themselves. Introducing protein–protein interactions adds an additional layer of complexity by incorporating effects such as surface tension. However, most existing work has focused on protein–protein interactions either in untethered DNA systems [21–24] or in homogeneous tethered DNA setups [25], leaving aspects of these interactions underexplored.
An intriguing observation from several protein–DNA condensation studies is the formation of multiple discrete condensates along a single DNA molecule. For example, Sox2 was reported to form around 4–5 condensates [10], while HP1 typically formed 2 condensates along -phage DNA [26]. From a thermodynamic standpoint, phase separation of self-attracting molecules is expected to produce a single condensate at equilibrium [27,28], yet a clear theoretical framework explaining the persistence of multiple condensates remains incomplete. Similar behavior has also been documented in live cells, where multiple condensates coexist within the same system [29–35]. To address this, theoretical studies have approached the problem from both equilibrium perspectives [36–43] and non-equilibrium models incorporating chemical reactions coupled to phase separation [44–48]. Notably, Brackley et al. demonstrated that bridging-induced phase separation (BIPS) alone can give rise to multiple protein clusters along DNA, even with non-specific protein binding [17]. In this work, we investigate how the interplay between sequence-specific protein–DNA interactions and protein–protein interactions shapes the formation and spatial organization of multiple condensates along DNA, as observed in recent co-condensation experiments [10,26].
While a dialogue between theoretical and experimental studies have advanced our understanding of protein–mediated DNA compaction, the role of DNA sequence heterogeneity in shaping capillary forces and controlling the number and size of co-condensates remains poorly understood. The goal of this study is to develop a modeling framework to systematically investigate how DNA sequence variability influences protein–DNA co-condensation. To this end, we introduce a simple coarse-grained model that explicitly incorporates sequence-dependent protein binding, tailored to capture two common in vitro experimental setups: optical tweezer and coverslip-based assays. Using Brownian dynamics simulations, we show that a homogeneous DNA polymer yields a single condensate at equilibrium. In contrast, sequence heterogeneity promotes the coexistence of multiple condensates with variable sizes along the same DNA molecule. We demonstrate the utility of our framework by comparing simulation results with experimental observations of Dps, Sox2, and HP1 co-condensation with DNA. Our analysis reveals that while Dps binds DNA independently of sequence, Sox2 and HP1 co-condensates are highly sequence-sensitive. Furthermore, we find that the binding affinity of DNA at protein–DNA interfaces governs the magnitude of capillary forces, offering a potential mechanism to tune genome-scale mechanical forces through local DNA sequence changes. In summary, our framework enables the interpretation of in vitro co-condensation experiments and reveals how DNA sequence heterogeneity shapes protein–DNA condensation.
2 Model
To explore the impact of DNA sequence on the co-condensation of protein-DNA complexes and the subsequent generation of capillary forces, we propose a simple polymer-based model for protein-DNA binding. This model explicitly considers both protein-protein interactions and sequence-dependent protein-DNA interactions. The parameters of the model are chosen to mimic different experimental systems (see the Materials and Methods for details). In addition, the parameters used in our study are summarized in Table 1, with corresponding references (See Subsection 5.2 of the Materials and Methods section for more details).
Informed by in vitro studies, as shown in Fig 1, we consider a 5 kb-long DNA as a semiflexible polymer consisting of Nm = 500 monomers, tethered at both ends within a cuboidal box. In accordance with previous studies [49–52], the persistence length of DNA is considered to be 150 bps, which translates to 15 monomers in our model. The DNA is in contact with proteins. We assume that each monomer corresponds to 10 base pairs which is the approximate footprint of a family of transcription factors in bacteria [53–55]. Our study focuses on transcription factors that bind to shorter sequences. For example, Sox2 binds to 7 bp DNA sequences [56], while Klf4 binds to 7 to 10 bp [57]. We assume that the proteins are spherical beads with dimensions identical to the monomers. The sequence heterogeneity of DNA is represented by distinct arrangements of monomers along the DNA contour, where each monomer exhibits varying binding affinity for the proteins. To dissect the impact of DNA sequence on protein-DNA co-condensation, first, we introduce a null model of homogeneous DNA. In this model, each monomer has an identical binding affinity of 2 kBT for proteins (see Fig 2A) [8]. To investigate how DNA sequence heterogeneity influences protein–DNA co-condensation, we analyze three DNA models of increasing complexity. In the first two heterogeneous models, DNA is represented as a polymer composed of two distinct types of monomers, each corresponding to a 10 bp double-stranded DNA segment that characterizes a specific protein-binding motif. These monomer types reflect strong and weak binding affinities for proteins, capturing a binary classification of nucleotide segments that represents biologically relevant binding landscapes. For example, nucleoid-associated proteins like Lsr2 [58] and H-NS [59] exhibit strong binding to AT-rich regions and weaker affinity for GC-rich sequences. 1) Heterogeneous DNA I, in this model the DNA is represented as an ABA block copolymer based on differential attractive interaction with proteins. Here DNA consists of a high-affinity region spanning 250 monomers at the center, with a protein-DNA binding affinity of 2.25 kBT. This high-affinity region is surrounded by low-affinity regions on either side, each containing 125 monomers, with a protein-DNA binding affinity of 1.75 kBT. This ensures that the average protein-DNA binding affinity remains identical to the homogeneous case (see Fig 2B). 2) Heterogeneous DNA II model consisting of two blocks of high-affinity monomers separated by low-affinity monomers, leads to multiple condensates at equilibrium when protein-DNA interactions dominate over protein-protein interactions (see Fig 4 and SI). 3) Partial -DNA, to model a biologically realistic DNA, we consider the last 5 kb of the lambda DNA (see Fig 5A and Methods section). For simplicity, we assume that each protein has a preference for binding to AT-rich regions. Indeed, there exist numerous proteins, in bacteria and higher organisms, that broadly bind AT or GC-rich sequences [58,60,61]. We assign a corresponding interaction energy to each monomer based on the AT content of each 10-base sequence (see the Materials and Methods and Supplementary Information (SI), including Table A-D in S1 Text, for details).
A Schematic represents a homogeneous DNA model; each monomer (blue) binds to proteins (green) with an identical binding affinity (2 kBT). B Schematic represents heterogeneous DNA I model; monomers (red) at the center bind proteins with an affinity of 2.25 kBT, whereas monomers (yellow) at the periphery have an interaction strength of 1.75 kBT. Proteins bind to each other with an interaction strength of 2 kBT. C, D Snapshots of DNA configurations for homogeneous and heterogeneous DNA I, in the absence of proteins, are shown. E, F Snapshots of protein-DNA co-condensates in equilibrium for the two models are shown. To generate the plots, we fixed the normalized end-to-end distance () and varied
. G, H Probability distribution of DNA and proteins along the longitudinal axis at
= 0.6, for different bulk protein concentrations (
), shown for homogeneous and heterogeneous DNA I, respectively. For each
value, data from five independent replicates are included in the plots. I, J Representative kymographs for position of condensates along the contour as a function of time for homogeneous DNA (
=0.6,
= 109.9
) and heterogeneous DNA I (
=0.6,
= 93.01
) are shown.
Following the experiments, we tune the normalized end-to-end distance of DNA and the bulk protein concentration. Normalized end-to-end distance () is defined as the ratio of the distance between the tethered ends (Re) and the contour length of the DNA (s). Bulk protein concentration (
) is the initial concentration of proteins introduced in the simulation box in units of
. We employ coarse-grained Brownian dynamics simulation to study sequence-dependent protein-DNA condensation (see the Methods and SI for details).
3 Results
3.1 DNA sequence governs the capillary forces emanating from protein-DNA co-condensates
We seek to uncover the effect of DNA-sequence heterogeneity on the formation of protein-DNA co-condensates and the resultant capillary forces. In particular, we consider two models: homogeneous DNA (Fig 2A) and heterogeneous DNA I (Fig 2B). For a detailed discussion on the choice of parameters, see the methods section and Sec. S1. Our DNA model is homogeneous or heterogeneous only with respect to protein-binding. In the absence of proteins, both the sequences remain extended as shown in Fig 2C and 2D. In contrast, proteins form droplets in the absence of DNA around =190
(Fig F in S1 Text). When proteins are introduced along with the DNA, we observe phase separation at
= 42.25
for both free-end DNA and double-tethered DNA (Fig F, panels B and C in S1 Text). This observation is consistent with previous theoretical [22] and experimental studies [26]. For subsequent investigation, we keep the normalized end-to-end distance (
) fixed and systematically increase the bulk protein concentration (
). For both homogeneous DNA and heterogeneous DNA I, we observe an increase in condensate size with increasing bulk protein concentration (
), but the condensate remains spherical (Fig 2E and 2F). However, we see a change in the shape of the condensate with variation in protein-protein interaction at constant
. When protein-protein interaction is lower than the protein-DNA binding affinity, elongated condensates emerge (Fig G, panels A and B in S1 Text). As we increase protein-protein interactions, the shape of condensates becomes more and more spherical (Fig G, panels C and D in S1 Text).
For a homogeneous DNA, a single condensate tends to form near the tethered ends as shown in Fig 2G (Fig H, panels A and B in S1 Text). To verify this positional bias in homogeneous DNA, we perform a longer simulation of a flexible homopolymer of shorter length. We find that a single condensate is more likely to remain close to the center in equilibrium due to larger conformational fluctuations of the polymer in the center (Fig H, panels C and D in S1 Text). Thus, the end-position preference observed in Fig 2G can be attributed to computationally expensive simulation time to sample different configurations for a long chain. In the case of the heterogeneous DNA I, a single condensate forms in equilibrium at the high-affinity region of the DNA, therefore maximizing their interactions with proteins, as illustrated in Fig 2H. While multiple co-condensates nucleate along the DNA for homogeneous DNA (Fig 2I) and heterogeneous DNA I (Fig 2J) at higher protein concentrations, these condensates fuse and coarsen over time to become a single condensate in equilibrium. The Laplace pressure facilitates this coarsening process, as a single condensate minimizes the surface area compared to multiple smaller condensates. Note that Laplace pressure is the pressure difference across the interface between the dense and dilute phases of condensates. It arises due to surface tension and is inversely proportional to the radius of a condensate [62].
Next, we investigate the effect of protein-DNA co-condensation on capillary force as a function of bulk protein concentration () and normalized end-to-end distance of DNA (
). See the Materials and Methods, subsection 5.5, for details on the computation of capillary force. For homogeneous DNA, we observe a marginal increase in capillary forces on bare DNA as we increase
and
(Fig 3A and 3B). To validate that the force on bare DNA indeed arises at the interface of the condensate, we calculate the potential energy for each monomer along the contour for homogeneous DNA at
= 0.2 and
= 84.50
(Fig D in S1 Text). Indeed, the potential energy sharply changes for the monomers present at the interface, confirming that the force on the bare DNA is generated at the interface of the protein-DNA co-condensate. For heterogeneous DNA I, the capillary force exhibits a non-linear behavior as we tune
. Initially, an increase in
leads to a small increase in capillary force, followed by a sharp jump at
(Fig 3A). On the other hand, as we increase
, an opposite trend is observed (Fig 3B). The capillary force initially remains constant as we increase
, followed by a gradual decrease after
= 59.15
. The behavior of capillary force for heterogeneous DNA I is dictated by the affinity of the interfacial DNA. When
exceeds a certain threshold (here 0.5), the higher affinity DNA (shown in red) gets exposed at the interface of the condensate (between compacted DNA and bare DNA), resulting in an increased force exerted on bare DNA (Fig I, panel A in S1 Text). In contrast, with an increase in
, condensate volume increases linearly (Fig I, panel B in S1 Text), which leads the condensate interface to be exposed to low-affinity DNA (shown in yellow). As a result, there is a decrease in the capillary force exerted on the bare DNA (Fig 3B and Fig I, panel C in S1 Text). Overall, our results demonstrate that the protein binding affinity of the interfacial DNA dictates the capillary forces. In the ensuing sections, we probe how different DNA sequences affect the capillary forces.
A and B Capillary force is plotted as a function of at
=84.50
and as a function of
at
=0.6 for homogeneous (salmon) and heterogeneous DNA I (purple), respectively. Each point in the plots represents the average of five independent replicates, with error bars denoting the standard deviation. The snapshots show condensates for heterogeneous DNA I where red represents high-affinity monomers while yellow represents low-affinity monomers. Proteins are shown in green. Snapshots (left to right) show condensates for the following parameter sets: (i)
= 0.2 and
= 84.50
, (ii)
= 0.8 and
= 84.50
, (iii)
= 0.6 and
= 42.25
and (iv)
= 0.6 and
= 126.75
.
3.2 DNA sequence heterogeneity leads to the formation of multiple condensates
To further investigate the impact of DNA sequence heterogeneity on protein–DNA co-condensation, particularly how DNA-protein and protein-protein interaction strengths affect the number and size of co-condensates, we examine the heterogeneous DNA II model (Fig 4A). This model features two blocks of high-affinity monomers separated by a block of low-affinity monomers (see the Model section and SI for details). Interestingly, the relative strengths of DNA-protein and protein-protein interactions lead to distinct regimes with specific droplet size and number. We consider three parameter regimes: (1) when protein–protein interactions are much weaker than protein–DNA interactions, (2) when they are comparable but slightly weaker, and (3) when protein–protein interactions are much stronger than protein–DNA interactions. In all cases, the protein concentration () is fixed at 84.50
and the normalized end-to-end distance (
) is maintained at 0.6. When protein-protein interactions are much weaker than DNA-protein interactions (Regime 1), two equal-sized DNA–protein co-condensates form in equilibrium, with their size determined by the DNA bound in each (Fig J, panels A, C, and E in S1 Text). When the two interaction strengths are comparable (Regime 2), two condensates form initially, but Ostwald ripening drives one to grow at the expense of the other, yielding unequal sizes at equilibrium (Fig 4B and 4C). However, the stronger protein-DNA interactions keep the two condensates stable. When protein-protein interactions exceed DNA-protein interactions (Regime 3), proteins prefer to cluster with each other more than binding DNA monomers, leading the system to coarsen into a single large co-condensate with one of the high-affinity regions along the DNA, thereby minimizing the total surface energy (Fig J, panels B, D, and F in S1 Text). Below, we explore the second parameter regime in greater detail.
A Schematic represents heterogeneous DNA II model: two blocks of high-affinity monomers (2.25 kBT, 100 monomers in red) are separated by a block of low-affinity monomers (0.1 kBT, grey) of identical length. Proteins (green) interact with each other with an interaction strength of 2 kBT. B Snapshots shown for cases with two condensates co-existing at equilibrium for all five independent realizations at and
=84.50
. High-affinity monomers, low-affinity monomers, and proteins are shown in red, grey, and green respectively. C Scaled radius of the larger droplet (black) and smaller droplet (red) is shown for all the five trajectories quantifying the arrested coarsening. D Representative kymograph showing visualization of coarsening kinetics for one initial condition.
To ascertain that the two-condensate state is an equilibrium state, we look at the coarsening kinetics in Fig 4C. We follow the time evolution of the normalized radii of the two condensates. Fig 4C shows that two condensates are roughly of the same size till , after which one of the condensates (black) grows at the expense of the other (red). Notably, both the droplets show saturation in size at a late time (
), indicating that coarsening is arrested. Note that we see arrested coarsening irrespective of the initial condition, which suggests that the two-condensate state is an equilibrium state. The representative kymograph for one of the realizations confirms that coarsening is arrested after
(Fig 4D).
Next, we study a biologically realistic DNA sequence, the Partial -DNA (Fig 5A), to gain a deeper understanding of how DNA sequence impacts the number of protein-DNA co-condensates. For a detailed discussion of this model, see the model section and Table D in Sec. S1 of the S1 Text. Fig 5B shows that the number of condensates varies for a fixed
and
in different realizations. While replicates 2 and 3 in Fig 5B show a single condensate as in the homogeneous DNA model, replicates 1, 4, and 5 show two condensates as in the heterogeneous DNA II model. This suggests that a more heterogeneous DNA sequence like the Partial
-DNA sequence may show initial condition dependency, and get trapped in a long-lived metastable state of multiple droplets even though the equilibrium state is that of a single droplet.
A Schematic represents the Partial -DNA: Eleven kinds of monomers (yellow to maroon) are introduced, based on AT content of 10 base pairs, modeled as one monomer. The monomer-protein binding affinities vary in the range of 0.1 to 4 kBT depending on the AT content of the corresponding 10 bp motif that represents the monomer (see the model section). Proteins (green) bind to each other with an affinity of 2 kBT. B Snapshots shown for five independent realizations at
and
=84.50
(top to down). C, D Representative kymograph showing visualization of coarsening kinetics for replicate 2 and 5. E Scaled radius of the larger condensate (black) and smaller condensate (red) is shown for replicate 2 which leads to a single condensate at equilibrium. F Scaled radius of the larger condensate (black) and smaller condensate (red) is shown for replicate 5 which shows coexistence of two condensates as a kinetically trapped state. G Bar plots show the average interfacial affinity (maroon) and volume (orange) for two condensates observed in the case of replicates 1, 4, and 5.
We confirm the initial condition dependency by studying the kinetics leading to a single and double condensate state for two representative replicates. The kymograph in Fig 5C gives a visualization of the early-time coarsening followed by the fusion event for replicate 2, which leads to a single condensate. In contrast, we do not see such a fusion event for the double-condensate case observed in replicate 5 (Fig 5D). To quantify the coarsening kinetics, we look at the time evolution of the size of the condensates. For the single condensate state observed in replicate 2, the normalized radii of both the condensates grow till , after which condensate 1 (shown in black) grows while condensate 2 (shown in red) shrinks. However, at
, condensate 2 fuses with condensate 1, leading to one condensate at equilibrium (Fig 5E). The normalized radii of the two condensates, formed after early-time growth and coarsening, saturate at late times, as shown in Fig 5F. It is possible that for an infinitely long simulation, replicate 5 shows a single condensate. Such a long-lived state of local equilibrium is known as a metastable state and can manifest in experiments due to time scale limitations.
To further investigate the origin of metastable states, we analyze the volumes and average interfacial affinities of individual condensates in replicates that exhibit two condensates (Fig 5G). In all such cases, we find that the smaller condensate consistently displays a higher average interfacial affinity than the larger one. In the next section, we show that interfacial affinity plays a key role in modulating the capillary forces in the Partial -DNA sequence across all Re and
values examined in our study.
3.3 Alterations in the local affinity of interfacial DNA regulate the global capillary forces
To further elucidate the role of DNA sequence in shaping the capillary forces exerted by protein–DNA co-condensates, we measure the capillary force for the partial -DNA model as a function of protein concentration (
) at a fixed extension of
, and as a function of extension (
) at a fixed protein concentration
. For both cases, the capillary force exhibits an intriguing non-monotonic behavior (see Fig 6A and 6B). While the null homogeneous DNA model shows a small and monotonic increase in force with
, the partial
-DNA model shows an initial increase followed by a decrease in force.
A Capillary force as a function of at fixed
=0.6. B Capillary force as a function of
at fixed
. C (top panel) Snapshots of condensates at concentrations marked as (i), (ii), and (iii) in A. Monomers are shown in yellow to red in ascending order protein binding affinity and the proteins are shown in green. C(bottom panel) Bar plots represent monomer-protein binding affinities. The grey-shaded region shows the interface. The red curve shows the probability of the monomer being inside the condensate. D capillary force correlates with average interfacial affinities. Grey points show the capillary forces and interfacial affinities for 30 individual replicates at the six
values in A. The purple points show the mean and standard deviation at each
. The red dashed line is a fit to the mean data.
To understand the non-monotonic behavior of capillary force, we analyze the average interfacial affinities of the DNA sequences at the protein concentrations showing non-monotonic variation. We observe a pN change in capillary force with an increase in
from 59.15
to 76.05
(marked as (i) and (ii) in Fig 6A). Although the DNA segments inside the condensate remain nearly identical (snapshots in Fig 6C), the sequences at the interface get altered, which leads to significant differences in the capillary forces (bottom panels in Fig 6C). A comparison of the interfaces shows that the inclusion of a high-affinity motif at the interface for
= 76.05
, leads to pinning of the condensate along the DNA contour, resulting in a smaller interface width (Fig 6C bottom panel). This results in an increase in the average interfacial affinity from 1.87 kBT to 2.29 kBT. Further increase in
to 92.95
(marked as (iii) in Fig 6A), results in a decrease in force and a corresponding decrease in average interfacial affinity to 2.23 kBT.
To establish the role of the interface in dictating the capillary forces, we plot the average interfacial affinity for all the 30 simulations conducted for the partial -DNA sequence at six
values (Fig 6D). Next, we calculate the mean and standard deviation of the average interfacial affinity over the five replicates at each
. Indeed, we observe a high correlation of 0.82 between the capillary forces and average interfacial affinity over the five simulations. We observe a similar correspondence between the interfacial affinity and capillary force with variation in
at fixed
(Fig K in S1 Text).
Our findings indicate that small variations in motif-level binding affinity, arising from mutations or methylation patterns, can substantially alter the global capillary forces. In fact, if the affinity of protein binding to a given motif at the interface of a co-condensate changes by even one kBT, two-fold changes in global forces are observed in simulations (Fig 6).
3.4 HP1 and Sox2 exhibit DNA sequence-dependent co-condensation
The results obtained from our simulations can be harnessed to interpret recent experimental results [10,11,26] on protein-DNA co-condensation. In particular, our results show that a homogeneous DNA sequence, where a protein binds with uniform affinity along the entire length of DNA, leads to only a single condensate at equilibrium across all tested parameter regimes. In contrast, sequence heterogeneity in DNA can give rise to multiple condensates along the DNA, with their size and number determined by the interplay between protein–protein and protein–DNA interactions. A key implication of our work is that the emergence of multiple, spatially separated condensates requires underlying sequence heterogeneity. Thus, if experiments reveal multiple condensates at long times, the homogeneous DNA model can be ruled out, indicating that sequence features dictate protein–DNA co-condensation. In this light, we analyze the number of co-condensates along the DNA for a set of proteins: the nucleoid-associated protein Dps, which condenses DNA through bridging interactions [63]; the nucleosome-binding pioneer transcription factor Sox2 [10]; and HP1, a key driver of heterochromatin formation [26].
We first examine the co-condensation of Dps, as observed in a recent study [63], which reported the formation of a single condensate across varying normalized end-to-end DNA distances (Fig 7). This behavior is consistent with predictions from both the homogeneous and Partial -DNA models, suggesting that Dps binds DNA in a sequence-independent or only weakly sequence-dependent manner, with co-condensation primarily driven by protein-protein interactions. This interpretation aligns with earlier findings that Dps exhibits sequence-independent DNA binding [63,64]. In contrast, Sox2–DNA co-condensation shows markedly different behavior. Recent experiments reveal the formation of multiple distinct co-condensates along the DNA (Fig 7), which contradicts the predictions of the homogeneous model (Fig 2), thereby falsifying it. Instead, these experimental observations are well captured by our heterogeneous DNA models, highlighting the potential role of DNA sequence in guiding Sox2-mediated co-condensation.
Scatter plot shows the number of condensates as a function of end-to-end distance for Dps (taken from Figure 4 of [63]), Sox2 (taken from Figure 1b and 3i of [10]), and HP1 (taken from Figure 4B of [26]). The dashed line, depicting the condensate number of unity, reflects sequence-independent co-condensation, whereas the region shaded in pink shows sequence-dependent co-condensation of protein and DNA.
Next, we turn our attention to the co-condensation of HP1 with DNA. Hitherto the binding of HP1 to DNA has been considered as relatively sequence-independent [65]. Based on these previous studies, we hypothesize that the homogeneous DNA model would effectively capture the condensation of HP1 with DNA. Contrary to our expectations, HP1 forms two condensates (see Fig 7), with one located near a tethered end and another in the middle of the DNA, thereby falsifying our hypothesis [26]. A heterogeneous DNA model with sequence heterogeneity can explain the coexistence of multiple co-condensates at different locations along the DNA. Our analysis suggests that HP1 exhibits differential binding affinity to DNA.
Note that our models don’t distinguish between Sox2-Sox2, HP1-HP1 or Dps-Dps interactions. However, our work helps us distinguish between the three proteins in terms of their DNA-binding landscape. In other words, based on the different models of DNA we employ, the protein–protein interaction strength is the same for all three proteins, fixed at kBT.; However, for Sox2 and HP1, the protein–DNA interaction strength varies along the DNA (e.g.,
,
kBT), in line with our heterogeneous DNA model II. For Dps, the interaction is uniform,
kBT, consistent with our homogeneous DNA model I. Thus, we do not explicitly consider the molecular identities of these proteins, but rather contrast homogeneous and heterogeneous protein–DNA binding landscapes, using Dps, HP1, and Sox2 as illustrative examples.
4 Discussion
Recent evidence suggests that phase separation of chromatin and DNA might play a crucial role in organizing the genome [7,66–77]. However, probing the biophysical principles of such phase-separated bodies in vivo has proved to be challenging. Alternatively, in vitro biochemical studies with purified transcription factors (TF) and DNA have shed light on protein-mediated condensation of DNA [9–11]. A series of such in vitro studies have shown that protein-protein and protein-DNA interactions lead to condensation of DNA [63,78,79]. However, the role of DNA sequence has thus far remained unclear. We present a simulation framework to investigate how DNA sequence influences protein–DNA co-condensation in double-tethered DNA systems.
To investigate how sequence heterogeneity influences capillary forces and the number of condensates, we use a model in which DNA is represented as a semiflexible polymer and proteins are modeled as interacting particles. We consider four different DNA models: (i) a homogeneous DNA model, where proteins bind uniformly without sequence preference; (ii) heterogeneous model I, having a central high-affinity binding region; (iii) heterogeneous model II, with two high-affinity regions separated by a low-affinity segment; and (iv) a partial -DNA model, representing a biologically realistic, heterogeneous binding landscape from a segment of
-phage DNA. Note that the choice of parameters in these models is informed by earlier work on DNA-protein condensation (See Subsection 5.2 of the Materials and Methods section for more details). Using these models, we address two key questions: (1) How do capillary forces vary with end-to-end distance (
) and protein concentration (
) across different DNA sequences? and (2) What are the minimal sequence and energetic conditions necessary to support the formation of multiple distinct condensates?
We find that for a homogeneous DNA, even if multiple condensates form initially, they tend to coarsen over time, eventually resulting in a single condensate—a phenomenon known as Ostwald ripening. For the homogeneous DNA, the condensate position can be anywhere but with a maximum probability of being at the center (Fig 2). However, for the heterogeneous DNA I model, the presence of strong binding sites at the center pins the condensate position at that location. Although both models lead to a single condensate at equilibrium, the capillary forces resulting from co-condensation differ qualitatively between them. With variations in , the homogeneous DNA model shows no change in the capillary force. In contrast, the heterogeneous DNA I model exhibits a switch-like behavior around
(Fig 3). This behavior can be explained by the composition of monomers at the condensate interface in the heterogeneous DNA I model. In particular, at
, a transition occurs: below this point, low-affinity monomers occupy the interface, while above this threshold, high-affinity monomers take their place (Fig 3). Interestingly, interfacial affinity also accounts for variations in capillary force in biologically relevant sequences, such as Partial
-DNA. For instance, capillary force exhibits a non-monotonic response as a function of protein concentration (
) (Fig 6A). Indeed, further analysis reveals a direct correlation between average interfacial affinity and capillary forces (Fig 6D). While the total interfacial energy reflects contributions from various DNA–protein, and protein-protein interactions, our findings suggest that even small changes in the identity of DNA monomers at the condensate interface—and the corresponding change in interfacial affinity (Ia)—can effectively capture the observed variations in capillary forces. For simplicity, we restrict our measurement of capillary forces to simulations that result in a single condensate at equilibrium. However, the Partial
-DNA sequence also shows initial condition dependency, i.e., at the same parameter values, we observe one or two condensates for different initial conditions. The two condensates observed in some replicates do not coarsen over long simulation time leading to long-lived metastable states (Fig 5). Further analysis shows that in the case of replicates showing two condensates, the smaller condensates contain high-affinity monomers at the interface (Fig 5G). In contrast, the heterogeneous DNA II model shows two condensates for all replicates, when DNA-protein interaction at the high-affinity sites dominates over the protein-protein interaction (Fig 4). We confirm these findings with coarsening kinetics analysis and see that the droplet size has saturated for all replicates. Our results indicate that interfacial affinity explains the minimal condition to sustain multiple condensates, as well as, the non-monotonicity observed in capillary forces for heterogeneous sequences. We summarize our results in Fig 8.
First column Cartoon representations of the four DNA models considered in our study. Second column Probability of each monomer being inside the condensate is plotted as a function of monomer position along the contour for a shorter homogeneous DNA (Nm = 50, = 0.8, and
= 84.5
), heterogeneous DNA I (
= 0.6 and
= 84.5
), heterogeneous DNA II (
= 0.6 and
= 84.5
), and partial
-DNA (
= 0.2 and
= 84.5
). Third column Representative kymographs are shown for all the models for the same conditions as in the second column. Fourth column Coarsening kinetics of a single final condensate is shown for five independent realizations of homogeneous DNA (Nm = 500,
= 0.6, and
= 126.75
) and heterogeneous DNA I (
= 0.6 and
= 126.75
). Coarsening kinetics of two droplets (larger condensate radius in grey and smaller condensate radius in red) are shown for heterogeneous DNA II (
= 0.6 and
= 84.5
), and partial
-DNA (
= 0.6 and
= 126.75
). Panels are arranged as homogeneous DNA, heterogeneous DNA I, heterogeneous DNA II, and Partial
-DNA from top to bottom.
The distinction between homogeneous and heterogeneous DNA sequences is gradual rather than discrete. Heterogeneity arises from variations in sequence patterning and differences in protein-binding affinity among monomers, though protein-protein interactions also influence the biophysical properties of protein-DNA co-condensates, including their number and spatial positioning. Introducing correlated heterogeneity, such as alternating high- and low-affinity blocks, can localize condensate positions along the DNA even with small affinity differences, while uncorrelated variations make the average local affinity the main determinant of condensate position, as we observed for a nucleoid-associated protein Lsr2 [80].
Upon revisiting protein-DNA co-condensation data from previous experimental studies [10,11,26,63], we find that bacterial nucleoid-associated protein Dps compacts DNA in a sequence-independent or weakly sequence-dependent manner. In contrast, pioneer factor Sox2 and heterochromatin protein HP1 exhibit sequence-dependent binding to DNA [10,26,81]. Our findings for Dps are consistent with previous experimental studies [63,64]. Similarly, Sox2 is known to exhibit high sequence specificity; Sox2 recognizes 7 base pair (bp) of (e.g. CTTTGTT) DNA sequences [56,81,82]. Therefore, our discovery of the co-condensation of Sox2 being influenced by DNA sequence is not surprising. Remarkably, our re-analysis of a recent paper on HP1-mediated DNA condensation revealed the sequence-dependence of HP1 binding to DNA [26]. In particular, the existence of multiple condensates implies that DNA sequence governs HP1 condensation with DNA. This observation challenges the prevailing understanding of HP1 binding to DNA, which is generally considered to be relatively sequence-independent. Moreover, in the study involving Sox2-DNA co-condensation [10], the authors confirmed that the system reached equilibrium by monitoring the condensate intensities over an extended period until they plateaued. In contrast, the HP1-DNA co-condensation study did not include such equilibrium measurements; nevertheless, experiments using DNA with varying end-to-end distances consistently showed two condensates along the DNA [26]. For the HP1-DNA system, however, we cannot exclude the possibility that the system becomes trapped in a metastable state. Undoubtedly, further experiments are necessary to test the sequence dependence of HP1 binding to DNA and its contribution to DNA condensation, preferably involving controlled design of DNA sequences.
We have modelled proteins as spherical beads of the same size as DNA monomers, as detailed in the model section and supplementary information. While acknowledging that actual proteins have diverse structural features influencing their phase separation behavior, our coarse-grained approach captures essential features of sequence-dependent protein-DNA co-condensation. We find that the relative strengths of protein-protein and protein-DNA interactions govern key sequence-dependent effects. Our simplified model can capture experimental observations of proteins like Dps, HP1, and Sox2, reproducing known behaviors such as sequence-independent binding of Dps and sequence-dependent binding of Sox2. A recent study on the bacterial nucleoid-associated protein Lsr2 demonstrated that both the single-bead protein model, like the one used in our work, and a more detailed model incorporating Lsr2’s dimeric and multivalent features yield comparable results for the number and positions of Lsr2-DNA condensates, closely recapitulating experimental observations [80]. These results highlight that our simplified model can provide key mechanistic insights into experimental observation of protein-DNA co-condensation and shed light on genome compaction and gene regulation.
Biomolecular condensates in cells have traditionally been thought to form via bulk phase separation [83–87]. However, recent experiments suggest that often the formation of these condensates involves various surfaces inside cells such as DNA, microtubule, cytoskeleton, membranes, etc [88–90]. The interfaces associated with these surface-mediated condensates give rise to interfacial capillary forces [9,11,88]. However, capillary forces and their physiological implications remain relatively understudied. Our findings suggest that capillary forces arising from the interfacial DNA heterogeneity in protein-DNA co-condensates can generate forces up to piconewtons. This range of capillary forces is consistent with earlier findings on DNA condensation mediated by pioneer transcription factors such as Sox2 [10], and viral nucleocapsid proteins [91]. Meanwhile, molecular motors involved in chromatin organization, such as condensin and cohesin, can exert forces up to piconewton on the DNA [92,93]. Interestingly, alterations in protein binding affinity at the single motif level, even by one kBT due to mutations or methylation patterns, can lead to two-fold changes in global forces. These forces have the potential to shape the three-dimensional structure of the genome. Moreover, sequence-dependent co-condensation of DNA and proteins can potentially form spatially heterogeneous chromosomal territories. Future studies can shed light on the feasibility of such a mechanism.
Overall, the protein-mediated condensation of DNA is increasingly recognized as a crucial mechanism for organizing the genome and responsible for other cellular processes [9,11,94]. Nevertheless, a comprehensive theoretical understanding remains in its early stages. In this light, our modeling framework can be harnessed to gain mechanistic insights into biophysical principles governing the sequence-dependent co-condensation of protein and DNA, and pave the way for developing a deeper understanding of genome organization.
5 Materials and methods
5.1 Limitations of the model
While our model offers insights into how sequence heterogeneity regulates the interfacial physics of DNA-protein co-condensation, it has certain limitations. For instance, our model does not take into account the changes in the mechanical properties of the DNA due to protein binding. Previous studies have shown that proteins from the high mobility group (HMG) family bend DNA upon binding, leading to the assembly of nucleoprotein complexes [95,96]. However, our model does not differentiate between the bending rigidity of DNA in its protein-bound and unbound states; we consider the persistence length of DNA to be 150 bp in both cases. It is important to note that although the magnitudes of forces observed in our simulations fall within the range reported in experiments, they may not precisely reflect those measured in specific in vitro studies. Achieving a more precise and quantitative agreement between model-predicted and experimentally observed forces may require detailed consideration of protein binding properties specific to the protein of interest. Nonetheless, our simplified model captures the complex influence of DNA sequence heterogeneity on the physics of protein-DNA co-condensation. Moreover, we carry out Brownian dynamics simulation to study the kinetic and equilibrium properties of the system. Although Brownian dynamics simulations are slower than Monte Carlo methods for sampling equilibrium properties, we choose the former because our focus is on both the equilibrium and the kinetics of DNA-protein co-condensation. Alternatively, we could have employed the kinetic version of the Monte Carlo algorithm, as employed by Tortora et al. [23]. For example, the authors showed that the idealized mouse chromosome 19 model in Ref. [23], exhibits a coarsening kinetic exponent of (Figure 4E in [23]). This model of mouse chromosome 19 has a chromatin (polymer) sequence with a continuous high binding affinity region of HP1 (protein). Our heterogeneous DNA I model is similar to Tortora et al.’s model in terms of sequence heterogeneity and shows the same scaling exponent of
(Fig 8). Although our model operates at a different scale—where one monomer of the DNA represents 10 base pairs compared to 1 kilobase pair per monomer in Ref. [23]—we find good agreement in the coarsening exponents, capturing the qualitative features of the kinetics. Overall, our study demonstrates that sequence heterogeneity dictates capillary forces as well as impacts the coarsening kinetics of DNA-protein condensates.
Our model of protein-DNA co-condensation is designed to capture and explain experimental observations typically made using optical tweezers and coverslip-based assays, which commonly employ simple duplex DNA substrates. However, DNA in physiological contexts is organized into nucleosomes and higher-order chromatin structures. In vitro studies involving nucleosomes remain limited, with recent work by Nguyen et al. [10] providing important insights. They found that in nucleosomal configurations, where DNA is wrapped around histone octamers, capillary forces are significantly reduced. Nguyen et al. [10] experimentally demonstrated that the presence of histones reduces the capillary forces exerted by Sox2-DNA condensates by approximately half compared to naked DNA. These experimental findings highlight the need for theoretical models that explicitly incorporate nucleosomes. While our model offers important biophysical insights into how naked DNA sequence influences protein-DNA co-condensation, extending it to include nucleosomal and higher-order chromatin structures will be important for accurately capturing in vivo physiological conditions. Future studies should systematically explore these effects to deepen our understanding of chromatin organization.
5.2 Parameter selection
Recent experimental studies on protein–DNA co-condensation report that various DNA-binding proteins undergo phase separation over a concentration range of approximately 2.5 to 200
[7,80,97]. For example, FUS proteins alone phase-separate at around 2.5
concentration [97], whereas Lsr2 and nPhos–HP1
(the phosphorylated, functional form of HP1
) exhibit phase-separation thresholds of 5
and 192
, respectively [7,80]. Importantly, the addition of DNA substantially lowers these threshold concentrations, shifting them into the range of 0.1
to 50
; for instance, FUS–DNA co-condensation occurs at approximately 100 nM, while Lsr2–DNA and HP1–DNA co-condensates form at roughly 500 nM and 50
, respectively. To capture these experimental observations, we set the protein–protein interaction strength to 2 kBT, which yields protein-only phase separation near 190
. Moreover, protein–DNA interaction strengths in our study range from 0.1 kBT to 4 kBT across different models. In particular, a protein–DNA interaction strength of 2 kBT leads to co-condensation at approximately 40
(see Fig F in S1 Text), ensuring that our simulations operate within a physiologically relevant regime. Moreover, the parameter values we employ are consistent with previous studies [7–9,17,20,22,24,26,80,98]; for example, Takaki et al. [98] calibrated protein interaction strengths using single-molecule measurements from FoxA1–DNA condensation by Quail et al. [11] and found that an interaction strength of 2 kBT sufficiently captures the experimental behavior, aligning with the values adopted in our work. Table 1 summarizes the model parameters and their agreement with previous studies.
5.3 Simulation methodology
We employ Brownian dynamics to simulate the protein-DNA system using ESPResSo package [99]. The temperature of the system is set to 1.0 kBT using Langevin thermostat whereas the damping coefficient is set to 0.1 . We use a time step of
=0.01
to integrate the equations of motion. Please refer to Sec. S1 in S1 Text of the Supplementary Information (SI), including Table A-D, for details of the model.
We start with a semiflexible polymer in a box of dimensions(x, y, z) =(80 , 80
, 600
), where
is the size of one monomer. Guided by optical tweezer experiments, we fix the ends of the polymer to maintain a constant end-to-end distance (Fig A in S1 Text, top panel). We equilibrate the polymer in the absence of proteins. Next, We save five uncorrelated DNA configurations which are used as initial configurations for simulations in the presence of proteins. For each of these independent realizations, we introduce proteins in the simulation box (Fig A in S1 Text, middle panel), and integrate the system till equilibrium is ensured (Fig A in S1 Text, bottom panel). The equilibration time varies for the DNA models considered in our study, end-to-end distance, and protein concentration. After equilibration, we run the system for
steps and save 3000 independent configurations for all subsequent analyses. All images are created using Visual Molecular Dynamics (VMD 1.9.3) [100]. Note that the field of view of the snapshots is adjusted to highlight specific features, which may result in variations in image dimensions.
5.4 Condensate detection and volume analysis
We use the Density-Based Spatial Clustering of Applications with Noise (DBSCAN) to identify condensates in our simulations [101]. DBSCAN depends on two critical parameters: epsilon () and Minimum Points. The parameter epsilon (
) defines the radius around each particle, within which neighboring points are considered part of the cluster. The other parameter, Minimum Points, specifies the minimum number of points within this radius required to form a cluster. In our study, we set Minimum Points to 6, following the common practice of using twice the dimensionality of the dataset (here, 3D coordinates of particles) [101]. To make use of the DBSCAN algorithm for identifying the condensates, it is essential to determine the optimal value for epsilon (
). To do this, we follow the standard approach which is to calculate the distance between each point and its k-th nearest neighbor (in our study, k is set to 5; k = Minimum Points –1 [102]). Sorting and plotting these distances result in a k-distance plot (see Fig B, panels A and B in S1 Text) revealing any density-based grouping present in the data. The point of maximum curvature, known as the “knee point”, serves as a good estimate for epsilon (
) [102].
We start by plotting the k-distance plot using the distance of each data point from its 5th nearest neighbor (see Fig B, panel A in S1 Text, red curve). However, due to noise in the data, the epsilon value obtained from this raw plot is inaccurate (see Fig B, panel A in S1 Text, red point). We plot a zoomed-in view of the grey-shaded region shown in plot A to better illustrate the fluctuations present in the raw data (see Fig B, panel B in S1 Text, red curve). When this value is used as the input parameter in DBSCAN, the monomers outside the condensates are erroneously classified as part of the condensate (see Fig B (i) in S1 Text).
To address this issue, we apply the Savitzky-Golay filter [103], a digital filter commonly used for signal smoothing. This filter fits polynomials of varying degrees to consecutive data points within a moving window of a specified length. In our study, we use third-degree polynomials with window lengths of 99 (see Fig B, panels A and B in S1 Text (green curves)) and 199 (see Fig B, panels A and B in S1 Text (blue curves)). When the value is estimated by using the Savitzky-Golay filter with window lengths of 99 and 199, and then applied in the DBSCAN algorithm, we observe different outcomes. With a window length of 99 (see Fig B (ii) in S1 Text), the algorithm still fails to accurately detect the condensate. In contrast,
value estimated with window length 199 correctly detects the particles present in the condensates (see Fig B (iii) in S1 Text). We repeat this procedure for all the configurations in each simulation to identify the condensates, which are then used for force, volume, and interfacial affinity analysis.
To compute the volume of the condensates, we use the maximum pairwise distance between the particles identified as part of the condensate along all three axes. We define the condensate radius along each axis as half of the maximum distance. Volume is then calculated as
Here, is the volume of the condensate, rx, ry, and rz are condensate radii along the three axes.
5.5 Force and potential energy analysis
To calculate the force due to condensate, we first find the force due to tethering the DNA at the specified , and subtract this from the force calculated in the presence of interacting proteins. This step distinguishes the intramolecular forces acting on DNA beads from other DNA beads, from the intermolecular forces exerted by proteins present in the system.
(i) For force calculation in the absence of proteins, we compute bond stretching due to fixing the ends of the polymer. We calculate the change in bond lengths of all 499 bonds present in the polymer. As expected, the distribution of bond length for the tethered ends is identical to the rest of the bonds (Fig C, panel A in S1 Text). We calculate the force on polymer in the absence of proteins as
Here, kb is the harmonic bond strength, is the mean bond length of the DNA with tethered ends, and
is the mean bond length of the DNA with free ends. The angular brackets in the above equation denotes the average over 3000 independent configurations. We calculate the force for different normalized end-to-end distances (
). Fig C, panel B in S1 Text shows a marginal change in FDNA up to
= 0.8, followed by a sharp rise at
= 1, and is qualitatively consistent with the theoretical prediction of the worm-like chain (WLC) model [15].
(ii) For force calculation due to protein-DNA co-condensates, we first identify the condensates using DBSCAN. Next, we calculate the mean bond length of monomers outside the condensate () and calculate the force as,
We average the force over 3000 equilibrated configurations. The error bars in the force-extension curves are standard deviations over 5 different initial conditions. Finally, we find the force due to condensation as,
Fig C, panel C in S1 Text shows FDNA,protein and F for a homogeneous DNA as a function of to illustrate the procedure.
(iii) To find the potential energy of each monomer along the DNA contour, we start with computing potential energy contributions from monomer-protein interactions and monomer-monomer interactions. We then add these contributions from different interactions to calculate the potential energy for each monomer of a configuration (Fig D in S1 Text). We observe a sharp decline in the potential energies of the monomers at the left interface and a sharp rise in the potential energy at the right interface of the condensate.
5.6 Interfacial affinity analysis
After identifying the condensate for each configuration using DBSCAN, we assign a score (sm,c) of 1 or 0 to each monomer m if it is inside or outside the condensate in cth configuration. We find the probability of monomer m to be inside the condensate (Pm) by,
where, =3000, is the number of equilibrated configurations chosen for the analysis. We consider the monomers to be part of the interface if
(as shown in Fig E in S1 Text). Finally, we calculate the interfacial affinity (Ia) by,
where is the interaction strength between monomer and proteins, and NmI number of monomers within the interface. In every independent realization, we compute the two interfacial affinities corresponding to each interface of the condensate and report their average.
Supporting information
S1 Text. Model details, parameters and supplementary figures.
https://doi.org/10.1371/journal.pcbi.1013829.s001
(PDF)
Acknowledgments
SC wishes to thank Frank Julicher, Stephan Grill, and Jose Morin for years of stimulating discussions and shared thoughts about the surface condensation of proteins on DNA. The simulations were carried out on the supercomputing machines Nandadevi and Kamet at the Institute of Mathematical Sciences.
References
- 1. Cremer T, Cremer C. Chromosome territories, nuclear architecture and gene regulation in mammalian cells. Nat Rev Genet. 2001;2(4):292–301. pmid:11283701
- 2. Oudelaar AM, Higgs DR. The relationship between genome structure and function. Nat Rev Genet. 2021;22(3):154–68. pmid:33235358
- 3. Fan Y, Nikitina T, Zhao J, Fleury TJ, Bhattacharyya R, Bouhassira EE, et al. Histone H1 depletion in mammals alters global chromatin structure but causes specific changes in gene regulation. Cell. 2005;123(7):1199–212. pmid:16377562
- 4. Dekker J, Heard E. Structural and functional diversity of topologically associating domains. FEBS Lett. 2015;589(20 Pt A):2877–84. pmid:26348399
- 5. Woodcock CL, Ghosh RP. Chromatin higher-order structure and dynamics. Cold Spring Harb Perspect Biol. 2010;2(5):a000596. pmid:20452954
- 6. Dorman CJ. Genome architecture and global gene regulation in bacteria: making progress towards a unified model?. Nat Rev Microbiol. 2013;11(5):349–55. pmid:23549066
- 7.
Larson AG, Elnatan D, Keenen MM, Trnka MJ, Johnston JB, Burlingame AL, et al. Liquid droplet formation by HP1
suggests a role for phase separation in heterochromatin. Nature. 2017;547(7662):236–40. pmid:28636604
- 8. Morin JA, Wittmann S, Choubey S, Klosin A, Golfier S, Hyman AA, et al. Sequence-dependent surface condensation of a pioneer transcription factor on DNA. Nat Phys. 2022;18(3):271–6.
- 9. Renger R, Morin JA, Lemaitre R, Ruer-Gruss M, Jülicher F, Hermann A, et al. Co-condensation of proteins with single- and double-stranded DNA. Proc Natl Acad Sci U S A. 2022;119(10):e2107871119. pmid:35238639
- 10. Nguyen T, Li S, Chang JT-H, Watters JW, Ng H, Osunsade A, et al. Chromatin sequesters pioneer transcription factor Sox2 from exerting force on DNA. Nat Commun. 2022;13(1):3988. pmid:35810158
- 11. Quail T, Golfier S, Elsner M, Ishihara K, Murugesan V, Renger R, et al. Force generation by protein–DNA co-condensation. Nat Phys. 2021;17(9):1007–12.
- 12. Bell NAW, Haynes PJ, Brunner K, de Oliveira TM, Flocco MM, Hoogenboom BW, et al. Single-molecule measurements reveal that PARP1 condenses DNA by loop stabilization. Sci Adv. 2021;7(33):eabf3641. pmid:34380612
- 13. Wang T, Hu J, Li Y, Bi L, Guo L, Jia X, et al. Bloom syndrome helicase compresses single-stranded DNA into phase-separated condensates. Angew Chem Int Ed Engl. 2022;61(39):e202209463. pmid:35922882
- 14. Bustamante C, Marko JF, Siggia ED, Smith S. Entropic elasticity of lambda-phage DNA. Science. 1994;265(5178):1599–600. pmid:8079175
- 15. Marko JF, Siggia ED. Stretching DNA. Macromolecules. 1995;28(26):8759–70.
- 16. Barbieri M, Chotalia M, Fraser J, Lavitas L-M, Dostie J, Pombo A, et al. Complexity of chromatin folding is captured by the strings and binders switch model. Proc Natl Acad Sci U S A. 2012;109(40):16173–8. pmid:22988072
- 17. Brackley CA, Taylor S, Papantonis A, Cook PR, Marenduzzo D. Nonspecific bridging-induced attraction drives clustering of DNA-binding proteins and genome organization. Proc Natl Acad Sci U S A. 2013;110(38):E3605-11. pmid:24003126
- 18. Johnson J, Brackley CA, Cook PR, Marenduzzo D. A simple model for DNA bridging proteins and bacterial or human genomes: bridging-induced attraction and genome compaction. J Phys Condens Matter. 2015;27(6):064119. pmid:25563801
- 19. Kumar A, Chaudhuri D. Cross-linker mediated compaction and local morphologies in a model chromosome. J Phys Condens Matter. 2019;31(35):354001. pmid:31112939
- 20. Ryu J-K, Bouchoux C, Liu HW, Kim E, Minamino M, de Groot R, et al. Bridging-induced phase separation induced by cohesin SMC protein complexes. Sci Adv. 2021;7(7):eabe5905. pmid:33568486
- 21. Ancona M, Brackley CA. Simulating the chromatin-mediated phase separation of model proteins with multiple domains. Biophys J. 2022;121(13):2600–12. pmid:35643628
- 22. Sommer J-U, Merlitz H, Schiessel H. Polymer-assisted condensation: a mechanism for hetero-chromatin formation and epigenetic memory. Macromolecules. 2022;55(11):4841–51.
- 23. Tortora MMC, Brennan LD, Karpen G, Jost D. HP1-driven phase separation recapitulates the thermodynamics and kinetics of heterochromatin condensate formation. Proc Natl Acad Sci U S A. 2023;120(33):e2211855120. pmid:37549295
- 24. Swain P, Choubey S, Vemparala S. Role of protein-protein interactions on organization and dynamics of a model chromatin. J Chem Phys. 2024;161(13):134901. pmid:39351947
- 25. Haugk L, Merlitz H, Sommer J-U. Single chain polymer condensates under external force. Macromolecules. 2024;57(19):9476–88.
- 26. Keenen MM, Brown D, Brennan LD, Renger R, Khoo H, Carlson CR, et al. HP1 proteins compact DNA into mechanically and positionally stable phase separated domains. Elife. 2021;10:e64563. pmid:33661100
- 27. Jacobs WM, Frenkel D. Phase transitions in biological systems with many components. Biophys J. 2017;112(4):683–91. pmid:28256228
- 28. Julicher F, Weber CA. Droplet physics and intracellular phase separation. Annual Review of Condensed Matter Physics. 2024;15(1):237–61.
- 29. Brangwynne CP, Eckmann CR, Courson DS, Rybarska A, Hoege C, Gharakhani J, et al. Germline P granules are liquid droplets that localize by controlled dissolution/condensation. Science. 2009;324(5935):1729–32. pmid:19460965
- 30. Berry J, Weber SC, Vaidya N, Haataja M, Brangwynne CP. RNA transcription modulates phase transition-driven nuclear body assembly. Proc Natl Acad Sci U S A. 2015;112(38):E5237-45. pmid:26351690
- 31. Molliex A, Temirov J, Lee J, Coughlin M, Kanagaraj AP, Kim HJ, et al. Phase separation by low complexity domains promotes stress granule assembly and drives pathological fibrillization. Cell. 2015;163(1):123–33. pmid:26406374
- 32. Langdon EM, Qiu Y, Ghanbari Niaki A, McLaughlin GA, Weidmann CA, Gerbich TM, et al. mRNA structure determines specificity of a polyQ-driven phase separation. Science. 2018;360(6391):922–7. pmid:29650703
- 33. Ladouceur A-M, Parmar BS, Biedzinski S, Wall J, Tope SG, Cohn D, et al. Clusters of bacterial RNA polymerase are biomolecular condensates that assemble through liquid-liquid phase separation. Proc Natl Acad Sci U S A. 2020;117(31):18540–9. pmid:32675239
- 34. Fritsch AW, Diaz-Delgadillo AF, Adame-Arana O, Hoege C, Mittasch M, Kreysing M, et al. Local thermodynamics govern formation and dissolution of Caenorhabditis elegans P granule condensates. Proc Natl Acad Sci U S A. 2021;118(37):e2102772118. pmid:34507991
- 35.
Hardy JC, Pool EH, Bruystens JGH, Zhou X, Li Q, Zhou DR, et al. Molecular determinants and signaling effects of PKA RI
phase separation. Mol Cell. 2024;84(8):1570-1584.e7. pmid:38537638
- 36. Fernández-Rico C, Sai T, Sicher A, Style RW, Dufresne ER. Putting the squeeze on phase separation. JACS Au. 2021;2(1):66–73. pmid:35098222
- 37. Garcia-Jove Navarro M, Kashida S, Chouaib R, Souquere S, Pierron G, Weil D, et al. RNA is a critical element for the sizing and the composition ofphase-separated RNA–protein condensates. Nature Communications. 2019;10(1):3230.
- 38. Cochard A, Garcia-Jove Navarro M, Piroska L, Kashida S, Kress M, Weil D, et al. RNA at the surface of phase-separated condensates impacts their size and number. Biophys J. 2022;121(9):1675–90. pmid:35364105
- 39. Welsh TJ, Krainer G, Espinosa JR, Joseph JA, Sridhar A, Jahnel M, et al. Surface electrostatics govern the emulsion stability of biomolecular condensates. Nano Lett. 2022;22(2):612–21. pmid:35001622
- 40. Style RW, Sai T, Fanelli N, Ijavi M, Smith-Mannschott K, Xu Q, et al. Liquid-liquid phase separation in an elastic network. Phys Rev X. 2018;8(1):011028.
- 41. Rosowski KA, Sai T, Vidal-Henriquez E, Zwicker D, Style RW, Dufresne ER. Elastic ripening and inhibition of liquid-liquid phase separation. Nat Phys. 2020;16(4):422–5. pmid:32273899
- 42. Lee DSW, Wingreen NS, Brangwynne CP. Chromatin mechanics dictates subdiffusion and coarsening dynamics of embedded condensates. Biophysical Journal. 2021;120(3):318a.
- 43. Qi Y, Zhang B. Chromatin network retards nucleoli coalescence. Nat Commun. 2021;12(1):6824. pmid:34819511
- 44. Zwicker D, Hyman AA, Jülicher F. Suppression of Ostwald ripening in active emulsions. Phys Rev E Stat Nonlin Soft Matter Phys. 2015;92(1):012317. pmid:26274171
- 45. Zwicker D, Seyboldt R, Weber CA, Hyman AA, Jülicher F. Growth and division of active droplets provides a model for protocells. Nature Phys. 2016;13(4):408–13.
- 46. Donau C, Späth F, Sosson M, Kriebisch BAK, Schnitter F, Tena-Solsona M, et al. Active coacervate droplets as a model for membraneless organelles and protocells. Nat Commun. 2020;11(1):5167. pmid:33056997
- 47. Kirschbaum J, Zwicker D. Controlling biomolecular condensates via chemical reactions. J R Soc Interface. 2021;18(179):20210255. pmid:34186016
- 48. Nakashima KK, van Haren MHI, André AAM, Robu I, Spruijt E. Active coacervate droplets are protocells that grow and resist Ostwald ripening. Nat Commun. 2021;12(1):3819. pmid:34155210
- 49. Hagerman PJ. Investigation of the flexibility of DNA using transient electric birefringence. Biopolymers. 1981;20(7):1503–35. pmid:7023566
- 50. Rizzo V, Schellman J. Flow dichroism of T7 DNA as a function of salt concentration. Biopolymers. 1981;20(10):2143–63. pmid:7284565
- 51. Hagerman PJ. Flexibility of DNA. Annual Review of Biophysics and Biophysical Chemistry. 1988;17:265–86.
- 52. Garcia HG, Grayson P, Han L, Inamdar M, Kondev J, Nelson PC, et al. Biological consequences of tightly bent DNA: the other life of a macromolecular celebrity. Biopolymers. 2007;85(2):115–30. pmid:17103419
- 53. Stewart AJ, Hannenhalli S, Plotkin JB. Why transcription factor binding sites are ten nucleotides long. Genetics. 2012;192(3):973–85. pmid:22887818
- 54. Aptekmann AA, Bulavka D, Nadra AD, Sánchez IE. Transcription factor specificity limits the number of DNA-binding motifs. PLoS One. 2022;17(1):e0263307. pmid:35089985
- 55.
Fairall L, Schwabe JW. DNA binding by transcription factors. Transcription factors. Garland Science. 2003. p. 62–81.
- 56. Reményi A, Lins K, Nissen LJ, Reinbold R, Schöler HR, Wilmanns M. Crystal structure of a POU/HMG/DNA ternary complex suggests differential assembly of Oct4 and Sox2 on two enhancers. Genes Dev. 2003;17(16):2048–59. pmid:12923055
- 57. Shields JM, Yang VW. Identification of the DNA sequence that interacts with the gut-enriched Krüppel-like factor. Nucleic Acids Res. 1998;26(3):796–802. pmid:9443972
- 58. Gordon BRG, Li Y, Wang L, Sintsova A, van Bakel H, Tian S, et al. Lsr2 is a nucleoid-associated protein that targets AT-rich sequences and virulence genes in Mycobacterium tuberculosis. Proc Natl Acad Sci U S A. 2010;107(11):5154–9. pmid:20133735
- 59. Japaridze A, Yang W, Dekker C, Nasser W, Muskhelishvili G. DNA sequence-directed cooperation between nucleoid-associated proteins. iScience. 2021;24(5):102408. pmid:33997690
- 60. Prieto AI, Kahramanoglou C, Ali RM, Fraser GM, Seshasayee ASN, Luscombe NM. Genomic analysis of DNA binding and gene regulation by homologous nucleoid-associated proteins IHF and HU in Escherichia coli K12. Nucleic Acids Res. 2012;40(8):3524–37. pmid:22180530
- 61. Solomon MJ, Strauss F, Varshavsky A. A mammalian high mobility group protein recognizes any stretch of six A.T base pairs in duplex DNA. Proc Natl Acad Sci U S A. 1986;83(5):1276–80. pmid:3456586
- 62.
De Gennes PG, Brochard-Wyart F, Quéré D. Capillarity and wetting phenomena: drops, bubbles, pearls, waves. Springer Science & Business Media; 2003.
- 63.
Shahu S, Vtyurina N, Das M, Meyer AS, Ganji M, Abbondanzieri E. Bridging DNA contacts allow Dps from E. coli to condense DNA. bioRxiv. 2024:2024–01.
- 64. Orban K, Finkel SE. DPS is a universally conserved dual-action DNA-binding and Ferritin protein. J Bacteriol. 2022;204(5):e0003622. pmid:35380871
- 65. Zhao T, Heyduk T, Allis CD, Eissenberg JC. Heterochromatin protein 1 binds to nucleosomes and DNA in vitro. J Biol Chem. 2000;275(36):28332–8. pmid:10882726
- 66. Ghatak P, Karmakar K, Kasetty S, Chatterji D. Unveiling the role of DPS in the organization of mycobacterial nucleoid. PLoS One. 2011;6(1):e16019. pmid:21283627
- 67. Gibson BA, Doolittle LK, Schneider MWG, Jensen LE, Gamarra N, Henry L, et al. Organization of chromatin by intrinsic and regulated phase separation. Cell. 2019;179(2):470-484.e21. pmid:31543265
- 68. Ma L, Gao Z, Wu J, Zhong B, Xie Y, Huang W, et al. Co-condensation between transcription factor and coactivator p300 modulates transcriptional bursting kinetics. Mol Cell. 2021;81(8):1682-1697.e7. pmid:33651988
- 69. Sharma R, Choi K-J, Quan MD, Sharma S, Sankaran B, Park H, et al. Liquid condensation of reprogramming factor KLF4 with DNA provides a mechanism for chromatin organization. Nat Commun. 2021;12(1):5579. pmid:34552088
- 70. Kim S, Shendure J. Mechanisms of interplay between transcription factors and the 3D genome. Mol Cell. 2019;76(2):306–19. pmid:31521504
- 71. Tang SC, Vijayakumar U, Zhang Y, Fullwood MJ. Tang SC, Vijayakumar U, Zhang Y, Fullwood MJ. Super-enhancers, phase-separated condensates, and 3D genome organization in cancer. Cancers (Basel). 2022;14(12):2866. pmid:35740532
- 72. Kantidze OL, Razin SV. Weak interactions in higher-order chromatin organization. Nucleic Acids Res. 2020;48(9):4614–26. pmid:32313950
- 73. Larson AG, Narlikar GJ. The role of phase separation in heterochromatin formation, function, and regulation. Biochemistry. 2018;57(17):2540–8. pmid:29644850
- 74. Shin Y, Chang Y-C, Lee DSW, Berry J, Sanders DW, Ronceray P, et al. Liquid nuclear condensates mechanically sense and restructure the genome. Cell. 2018;175(6):1481-1491.e13. pmid:30500535
- 75. Sabari BR, Dall’Agnese A, Boija A, Klein IA, Coffey EL, Shrinivas K, et al. Coactivator condensation at super-enhancers links phase separation and gene control. Science. 2018;361(6400):eaar3958. pmid:29930091
- 76. Cho W-K, Spille J-H, Hecht M, Lee C, Li C, Grube V, et al. Mediator and RNA polymerase II clusters associate in transcription-dependent condensates. Science. 2018;361(6400):412–5. pmid:29930094
- 77. Nair SJ, Yang L, Meluzzi D, Oh S, Yang F, Friedman MJ, et al. Phase separation of ligand-activated enhancers licenses cooperative chromosomal enhancer assembly. Nat Struct Mol Biol. 2019;26(3):193–203. pmid:30833784
- 78. Chappidi N, Quail T, Doll S, Vogel LT, Aleksandrov R, Felekyan S, et al. PARP1-DNA co-condensation drives DNA repair site assembly to prevent disjunction of broken DNA ends. Cell. 2024;187(4):945-961.e18. pmid:38320550
- 79. Ramisetty SK, Langlete P, Lale R, Dias RS. In vitro studies of DNA condensation by bridging protein in a crowding environment. Int J Biol Macromol. 2017;103:845–53. pmid:28536019
- 80.
Satheesh T, Singh RK, Gaur P, Shahu S, Ramakrishnan S, Srivastava M, et al. Sequence-dependent co-condensation of Lsr2 with DNA elucidates the mechanism of genome compaction in Mycobacterium tuberculosis. bioRxiv. 2025; p. 2025-01.
- 81. Kamachi Y, Uchikawa M, Kondoh H. Pairing SOX off: with partners in the regulation of embryonic development. Trends Genet. 2000;16(4):182–7. pmid:10729834
- 82. Harley VR, Jackson DI, Hextall PJ, Hawkins JR, Berkovitz GD, Sockanathan S, et al. DNA binding activity of recombinant SRY from normal males and XY females. Science. 1992;255(5043):453–6. pmid:1734522
- 83. Banani SF, Lee HO, Hyman AA, Rosen MK. Biomolecular condensates: organizers of cellular biochemistry. Nat Rev Mol Cell Biol. 2017;18(5):285–98. pmid:28225081
- 84. Söding J, Zwicker D, Sohrabi-Jahromi S, Boehning M, Kirschbaum J. Mechanisms for active regulation of biomolecular condensates. Trends Cell Biol. 2020;30(1):4–14. pmid:31753533
- 85. Keating CD, Pappu RV. Liquid-liquid phase separation: a widespread and versatile way to organize aqueous solutions. J Phys Chem B. 2021;125(45):12399–400. pmid:34788996
- 86. Alberti S, Gladfelter A, Mittag T. Considerations and challenges in studying liquid-liquid phase separation and biomolecular condensates. Cell. 2019;176(3):419–34. pmid:30682370
- 87. Brangwynne CP, Tompa P, Pappu RV. Polymer physics of intracellular phase transitions. Nature Phys. 2015;11(11):899–904.
- 88. Gouveia B, Kim Y, Shaevitz JW, Petry S, Stone HA, Brangwynne CP. Capillary forces generated by biomolecular condensates. Nature. 2022;609(7926):255–64. pmid:36071192
- 89. Ukmar-Godec T, Wegmann S, Zweckstetter M. Biomolecular condensation of the microtubule-associated protein tau. Semin Cell Dev Biol. 2020;99:202–14. pmid:31260737
- 90. Koppers M, Özkan N, Farías GG. Complex interactions between membrane-bound organelles, biomolecular condensates and the cytoskeleton. Front Cell Dev Biol. 2020;8:618733. pmid:33409284
- 91. Gien H, Morse M, McCauley MJ, Kitzrow JP, Musier-Forsyth K, Gorelick RJ, et al. HIV-1 nucleocapsid protein binds double-stranded DNA in multiple modes to regulate compaction and capsid uncoating. Viruses. 2022;14(2):235. pmid:35215829
- 92. Ganji M, Shaltiel IA, Bisht S, Kim E, Kalichava A, Haering CH, et al. Real-time imaging of DNA loop extrusion by condensin. Science. 2018;360(6384):102–5. pmid:29472443
- 93. Kim Y, Shi Z, Zhang H, Finkelstein IJ, Yu H. Human cohesin compacts DNA by loop extrusion. Science. 2019;366(6471):1345–9. pmid:31780627
- 94. Tschurikow X, Gadzekpo A, Tran MP, Chatterjee R, Sobucki M, Zaburdaev V, et al. Amphiphiles formed from synthetic DNA-nanomotifs mimic the stepwise dispersal of transcriptional clusters in the cell nucleus. Nano Lett. 2023;23(17):7815–24. pmid:37586706
- 95. Grosschedl R, Giese K, Pagel J. HMG domain proteins: architectural elements in the assembly of nucleoprotein structures. Trends Genet. 1994;10(3):94–100. pmid:8178371
- 96. Bustin M. Regulation of DNA-dependent activities by the functional motifs of the high-mobility-group chromosomal proteins. Mol Cell Biol. 1999;19(8):5237–46. pmid:10409715
- 97. Ganser LR, Niaki AG, Yuan X, Huang E, Deng D, Djaja NA, et al. The roles of FUS-RNA binding domain and low complexity domain in RNA-dependent phase separation. Structure. 2024;32(2):177-187.e5. pmid:38070499
- 98. Takaki R, Savich Y, Brugués J, Jülicher F. Active loop extrusion guides DNA-protein condensation. Physical Review Letters. 2025;134(12):128401.
- 99. Limbach HJ, Arnold A, Mann BA, Holm C. ESPResSo—an extensible simulation package for research on soft matter systems. Computer Physics Communications. 2006;174(9):704–27.
- 100. Humphrey W, Dalke A, Schulten K. VMD: visual molecular dynamics. J Mol Graph. 1996;14(1):33–8, 27–8. pmid:8744570
- 101.
M E, Kriegel HP, Sander J, Xu X. A density-based algorithm for discovering clusters in large spatial databases with noise. In: KDD. 1996. p. 226–31.
- 102. Sander J, Ester M, Kriegel H-P, Xu X. Density-based clustering in spatial databases: the algorithm GDBSCAN and its applications. Data Mining and Knowledge Discovery. 1998;2(2):169–94.
- 103. Savitzky A, Golay MJE. Smoothing and differentiation of data by simplified least squares procedures.. Anal Chem. 1964;36(8):1627–39.