Skip to main content
Advertisement
  • Loading metrics

4D nucleome equation predicts gene expression controlled by long-range enhancer-promoter interaction

  • Zihao Wang,

    Roles Investigation, Methodology, Visualization, Writing – original draft, Writing – review & editing

    Affiliations Guangdong Province Key Laboratory of Computational, Sun Yat-sen University, Guangzhou, People’s Republic of China, School of Mathematics, Sun Yat-Sen University, Guangzhou, People’s Republic of China

  • Songhao Luo,

    Roles Investigation, Methodology, Visualization, Writing – original draft, Writing – review & editing

    Affiliations Guangdong Province Key Laboratory of Computational, Sun Yat-sen University, Guangzhou, People’s Republic of China, School of Mathematics, Sun Yat-Sen University, Guangzhou, People’s Republic of China

  • Zhenquan Zhang,

    Roles Investigation, Methodology, Visualization, Writing – original draft, Writing – review & editing

    Affiliations Guangdong Province Key Laboratory of Computational, Sun Yat-sen University, Guangzhou, People’s Republic of China, School of Mathematics, Sun Yat-Sen University, Guangzhou, People’s Republic of China

  • Tianshou Zhou ,

    Roles Writing – original draft, Writing – review & editing

    mcszhtsh@mail.sysu.edu.cn (TZ); zhjiajun@mail.sysu.edu.cn (JZ)

    Affiliations Guangdong Province Key Laboratory of Computational, Sun Yat-sen University, Guangzhou, People’s Republic of China, School of Mathematics, Sun Yat-Sen University, Guangzhou, People’s Republic of China

  • Jiajun Zhang

    Roles Conceptualization, Methodology, Project administration, Supervision, Writing – original draft, Writing – review & editing

    mcszhtsh@mail.sysu.edu.cn (TZ); zhjiajun@mail.sysu.edu.cn (JZ)

    Affiliations Guangdong Province Key Laboratory of Computational, Sun Yat-sen University, Guangzhou, People’s Republic of China, School of Mathematics, Sun Yat-Sen University, Guangzhou, People’s Republic of China

Abstract

Recent experimental evidence strongly supports that three-dimensional (3D) long-range enhancer-promoter (E-P) interactions have important influences on gene-expression dynamics, but it is unclear how the interaction information is translated into gene expression over time (4D). To address this question, we developed a general theoretical framework (named as a 4D nucleome equation), which integrates E-P interactions on chromatin and biochemical reactions of gene transcription. With this equation, we first present the distribution of mRNA counts as a function of the E-P genomic distance and then reveal a power-law scaling of the expression level in this distance. Interestingly, we find that long-range E-P interactions can induce bimodal and trimodal mRNA distributions. The 4D nucleome equation also allows for model selection and parameter inference. When this equation is applied to the mouse embryonic stem cell smRNA-FISH data and the E-P genomic-distance data, the predicted E-P contact probability and mRNA distribution are in good agreement with experimental results. Further statistical inference indicates that the E-P interactions prefer to modulate the mRNA level by controlling promoter activation and transcription initiation rates. Our model and results provide quantitative insights into both spatiotemporal gene-expression determinants (i.e., long-range E-P interactions) and cellular fates during development.

Author summary

Gene expression is an essential biological process in all organisms. Numerous experimental studies have reported that the long-range enhancer-promoter (E-P) interaction on three-dimensional (3D) chromatin architecture plays important roles in regulating gene expression and cell functions, but the quantitative and qualitative impact of E-P interaction on gene expression over time is unclear. We develop a theoretically and numerically efficient model (called the 4D nucleome equation) to couple E-P interaction with gene expression and use this equation to characterize dynamic behavior. Then, we obtain the theoretical distribution of mRNAs and predict the gene expression profiles under E-P regulations. Interestingly, we find that E-P interactions can induce bimodal and trimodal shapes of mRNA distribution. When applying this framework to mouse embryonic stem cell data to investigate the dynamical behaviors of E-P interaction and gene expression, we reproduce the experimentally measured E-P contact frequencies and mRNA distributions under different E-P interactions. Our results support the picture of an essential mechanism for explaining phenotypic diversity and cellular decision-making.

Introduction

Gene expression is tightly related to three-dimensional (3D) genome conformation that may change over 1D time [14]. Specific DNA sequences–promoters and enhancers–orchestrate transcription in a highly complex and multilayered manner to ensure accurate spatiotemporal gene expression [14]. Many experimental studies have shown the importance of the roles of distal enhancers in ensuring reliable cell functioning and cellular decision-making [510]. However, the mechanism of how 3D chromatin organization (in particular 3D enhancer-promoter (E-P) interactions) in time (4D) shapes gene-expression dynamics still remains elusive.

Hierarchic genomic structures support various possible E-P topologies and the connection of upstream stochastic E-P interaction to downstream gene transcription [1116]. Many efforts have been made to explore the essential factors of E-P interaction affecting transcription. Recent live-imaging measurements have provided clear evidence that E-P genomic distance effectively controls gene activities [6,10,17]. And a collection of experimental evidence has established that the E-P interaction strength significantly impacts gene expression levels [6,7,18,19]. For example, the sna shadow enhancer, whose strength is determined by chromatin inheritance, generates more bursts than the sna primary enhancer in Drosophila embryos [6], and the hormone or heavy metal exposure, which externally alters the E-P interaction strength [20,21], boosts the mRNA level of c-Fos gene with the increase of dose concentrations [18]. These experimental observations indicate that the E-P genomic distance and the E-P interaction strength are important factors impacting gene expression profiles. However, biological experiments alone are not sufficient to unravel the complete picture of the dynamics of E-P interaction-regulated gene expression, and it is necessary to develop biologically reasonable mathematical models to investigate the underlying mechanism.

The conventional modeling of gene expression kinetics is based on simple models such as the two-state model and multistate model [2230], in which an implicit hypothesis is that chromatin behavior is frozen. Recent work adds the effects of chromatin structure to the model [10,31], but still ignores the spatiotemporal dynamic regulation of chromatin topologies [3135]. So far, we still lack a mechanistic mathematical model that couples stochastic chromatin organizations and stochastic gene expression processes. Addressing this issue faces two challenges. First, genomic structures are stochastic at almost every level of organization, and this stochasticity is suggestively linked to gene transcription and finally affects transcriptional outcomes [14,32]. Overall, the temporal disconnection, as well as the stochasticity of E-P topology and gene expression, lies at the heart of a broad challenge in the physical biology of both establishing a comprehensive theoretical framework of the information transmission from the upstream chromatin organization to the downstream gene expression and forecasting mRNA profiles from the dynamics of underlying molecular processes. Second, the regulation of gene expression by the chromatin spatial structure is a multiscale system. Many experimental studies have indicated distinct timescale differences between upstream chromatin dynamics and downstream gene expression [36,37]. For instance, E-P interaction occurs on a timescale of seconds to minutes [8,38,39], whereas the gene produces over a longer timescale compatible with E-P interaction, precisely the timescale of minutes to hours as suggested in multiple studies [9,36,40].

Here, we developed a general theoretical framework (formulated as the so-called 4D nucleome equation) to investigate how E-P interaction (characterized by E-P genomic distance and E-P interaction strength) affects gene expression dynamics. Specifically, this framework considered upstream chromatin motion on a fast timescale and downstream mRNA production on a slow timescale as well as the temporal connection between the upstream and the downstream. This framework involving space and time is referred as 4D gene expression kinetics [41,42]. With the timescale separation method, we derived analytical mRNA distribution and studied how E-P interaction qualitatively impacts the characteristics of mRNA distribution. Importantly, we found that the E-P interaction can flexibly regulate mRNA patterns by inducing multiple shapes of mRNA distributions including bimodal distribution with two non-origin peaks and even trimodal distribution. And our theoretical analysis and simulations revealed a power-law scaling of gene expression levels in the E-P genomic distance. Finally, by performing statistic inference on the mouse embryonic stem cells (mESCs) data, our framework exhibited good predictions of mRNA distribution and E-P contact probability under different E-P interactions. We emphasize that our 4D nucleome equation provides a general modeling framework for studying how chromatin dynamics affect gene expression kinetics and our results suggest a possible mechanism for explaining phenotypic diversity and cellular decision-making in realistic cases.

Materials and methods

In this section, we introduce a theoretical framework for predicting gene expression regulated by long-range E-P interaction in 4D (Fig 1A). First, we model chromatin as a polymer [4346] discretized into a collection of successive monomers, and assume that there are N monomers on chromatin, spatial positions of which are denoted by r = [r1,⋯,rN]T. Each monomer represents a segment of DNA, whose length depends on the levels of coarse-grained chromatin. Second, we model the gene expression process as a discrete multistate model, and assume that the gene has K different gene states (each state includes promoter’s state (ON or OFF) and the number of mRNA), which altogether constitute the vector s = [s1,⋯,sK]T.

thumbnail
Fig 1. Schematic representation of gene expression regulated by E-P interaction.

(A) E-P interaction in the cell nucleus plays a key role in regulating gene expression. (B) A polymer model involving E-P interaction (the red spring with coefficient kEP) is proposed to simulate chromatin dynamics, where r = [r1,⋯,rN]T represents monomers positions in 3D space, dG is E-P genomic distance, and dS = ‖rErP‖ is E-P spatial distance. (C) A link function vector θ = H(ds), where H is a function vector, bridges the temporal disconnection between upstream E-P topology and downstream gene expression. (D) The two-state telegraph model is used to imitate gene expression, where s = [s1,⋯,sK]T is the vector of the gene’s states and θ = [α,β,μ,δ] is the vector of state-switching rates.

https://doi.org/10.1371/journal.pcbi.1011722.g001

4D nucleome equation governing probabilistic evolutionary behavior in phase space

Let p(r,s;t) be a vector of the joint probability density functions that monomers are in position r and the gene is in state s at time t. Specifically, . Noting , we have (1)

On one hand, the motion of monomers has a continuous trajectory in the region Ω (a connected and bounded domain). The derivative in Eq [1] can be formally written as , where ∇r is the gradient operator and F(r,s;t) is a velocity field. Next, if we consider isotropic diffusion and friction across the region of monomers’ positions, then F(r,s;t) takes a generalized Fokker–Planck approximation, i.e., F(r,s;t) = V(r,s;t)−∇r(Dlogp(r;t)), where the first term represents the deterministic part of the velocity field and the second term is a stochastic ingredient of the velocity under isotropic diffusion with diffusion coefficient D. We further assume that changes in gene state do not contribute to chromatin motion, and V(r,s;t) can be approximated by V(r;t). Thus, we can obtain (2) where is the Laplace operator.

On the other hand, the stochastic gene expression process regulated by chromatin dynamics can be modeled by the master equation of the form (3) where W(r;t) is a monomer position-dependent state transition matrix, and the elements of W are related to gene state switching rates and chromatin position.

We assume that tiny changes in monomers coordinates do not alter gene state, implying that the derivative of conditional probability, ∇rp(s|r;t), approximately equals zero partly because the time interval of gene state transition is generally longer than that of chromatin motion [36]. Thus, substituting Eq [2] and Eq [3] into Eq [1] yields the following equation (4)

The first two terms on the right-hand side of Eq [4] represent chromatin’s spatiotemporal diffusion process. The first term is the deterministic component, and the second is the stochastic component accounting for random fluctuations. The last term captures the gene states’ random switching process. It should be noted that Eq [4] is a comprehensive description of the gene expression process toward the 4D reality, so we called it the 4D nucleome equation. In the Results section, we will derive the mRNA distribution based on Eq [4].

Stochastic model simulating trajectory evolutionary behavior in configuration space

Modeling chromatin dynamics.

Successive monomers in the chromatin are connected with harmonic springs of stiffness kNN (Fig 1B). Each monomer represents a nucleosome or a DNA segment (this length does not affect our qualitative results to be obtained) with the 3D position denoted by ri = (ri1,ri2,ri3), where i = 1,…,N. We employ one nucleosome to represent an enhancer or promoter and posit that there are only one enhancer and one promoter on the chromatin. To mimic E-P interaction and to simplify without loss of generality, we add a harmonic spring with stiffness kEP between the enhancer and the promoter to materialize this abstract concept.

Based on the above assumptions, we model the chromatin dynamics according to the Langevin equation , which is equivalent to Eq [2], where B(t) is a vector of independent Brownian motions. And , where γ is the friction coefficient and U(r;t) is the total potential of chromatin conformation. Note that , where without loss of generality, we set which represents the potential for the chain connection, and which represents the potential for the E-P interaction with E,P∈{1,⋯,N} representing the index of the monomer occupied by enhancer and promoter respectively. We let dG = |E−P| represent the E-P genomic distance, which can be directly measured by an experimental method. However, the kEP representing the E-P interaction strength cannot be directly measured by experiments but can be estimated from experimental data, e.g., Hi-C data [47] or Capture-C data [10].

Modeling gene expression dynamics.

Gene expression process can be characterized by a two-state model with an active ON and a silent OFF state (Fig 1D). The switching rates from OFF to ON and vice versa are α and β, respectively. The transcription rate is μ, and the mRNA degradation rate is δ. We define θ = [α,β,μ,δ] as the gene-state switching rates set.

Bridging chromatin dynamics and gene expression.

After having identified chromatin conformations and transcriptional reactions independently, we next build a biologically reasonable link between them (Fig 1C). E-P interaction carries regulatory information to orchestrate gene expression, and we hypothesize that the time-varying E-P spatial distance dS = ‖rErP‖ encodes the information to regulate the transcription-associated reaction rates θ. That is θ = H(ds), where H is a link function vector. Therefore, the two-state model mentioned above becomes a variable two-state model in which the rates θ depend on the E-P spatial distance dS.

Recent experimental results show that E-P proximity increases the likelihood of gene expression [14]. It is reasonably assumed that if dS is smaller, then α and μ are larger, but β is smaller (in the case that δ is constant). For simplification, we assume β is independent of dS and α(μ) changes between αmax(μmax) and αmin(μmin) with the fluctuating dS. We choose a Hill function [48] to reflect the effect of changes in the E-P spatial distance dS on the rates θ, although the specific shape of the corresponding response curve can also be captured by other functions. Specifically, we assume that the rates α and μ depend nonlinearly on dS, i.e., each is a Hill-like function vector H (see S1 Text).

Under the above settings, we propose a stochastic simulation algorithm to simulate the time evolution of the entire system (see S1 Text). In Result section, we will study how E-P interaction strength kEP and E-P genome distance dG regulate gene expression and how they affect the shape and characterization of mRNA distributions. In short, our modeling strategy provides a possible framework for characterizing 3D chromatin motion and tracking gene-expression processes over 1D time [41,42].

Model-data approach to infer E-P interactions and gene-expression kinetics

We use the above model to fit the mRNA distribution of single-molecule RNA fluorescence in situ hybridization (smRNA-FISH) in mESCs [10]. The data includes 6 cell lines Ck (k = 1,…,6). In each cell line, the enhancer is placed at different positions from the promoter to drive the expression of enhanced green fluorescent protein (eGFP), so we can get the E-P genomic distance and corresponding mRNA distribution with the bin number nk as well as the corresponding steady-state probability distribution Qk(X = xi) (1≤ink).

For each cell line Ck, we theoretically calculate the steady-state probability distribution with parameters Γ (including E-P interaction parameters and gene expression parameters). Then, we discretize the steady-state distribution Pk(X = xi;Γ) (1≤ink) that is comparable with the experiment data Qk(X = xi). Note that the cross entropy of the cell line Ck is given by (5) and the best-fit parameters by minimizing the total cross entropy function (6) In fact, the minimum cross-entropy method is the same as maximum likelihood estimation. To solve this optimization problem, we use the fmincon function (a nonlinear programming solver) in the LBFGS method of MATLAB to find the minimum value of the optimization problem given a set of initial values and parameter intervals (S1 Text).

Results

Analytical results of steady mRNA distribution

To study the qualitative and quantitative effects of E-P interaction (including E-P genomic distance dG and E-P interaction strength kEP) on gene expression (including the mRNA distribution, the mean mRNA expression level and the coefficient of variation (CV) defined as the ratio of standard deviation over mean), we use the 4D nucleome equation (Eq [4]) to solve the steady mRNA distribution P(x) depending on the E-P spatial distance dS.

Note that the entire gene expression system contains two modules–upstream chromatin conformation on a fast timescale and downstream gene expression on a slow timescale. If the typical timescales for the former and latter are denoted by τup and τdown respectively, Eq [4] can be rewritten as the rescaled equation (7) where . Deriving the mRNA distribution directly from the above multiscale system is a particular challenging task. Here we resort to a timescale separation method. Before this, however, we consider the E-P interaction and gene expression dynamics independently.

When focusing only on the motion of upstream chromatin conformation, we find that the stationary probability density function of chromatin conformation r can be expressed as since every monomer moves independently in each dimension. Notably, we find that E-P spatial distance dS obeys the following exact Maxwell-Boltzmann distribution (Fig 2A and S1 Text) (8) where the lumping parameter determines how changes in E-P interaction strength kEP or E-P genomic distance dG alter the shape of the pDS(dS). Experimental measurement of 3D spatial distance between Sox2 and its enhancer in living ESCs [49] or between the even-skipped (eve) locus and its enhancer in the fly embryo [50] can be well fitted by Eq [8], confirming the validity of theoretical analysis (Fig 2B).

thumbnail
Fig 2. Distributions of gene expression model.

(A) The distribution of E-P spatial distance (Maxwell-Boltzmann distribution given by Eq [8]) where red lines represent theoretical results, and histograms represent numerical results. (B) Experimental measurement of 3D spatial distance between Sox2 and its enhancer in living ESCs (histogram, [49]). The blue line is obtained by fitting Eq [8]. (C) The mRNA distribution conditional on frozen chromatin structure (Poisson-Beta distribution) where α = 0.30, β = 0.50, μ = 1 and δ = 0.1. (D) Upstream chromatin fluctuations are much faster compared with the downstream state switching rates (ω = 0.04), where the values in parentheses indicate the weight of the interpolation distribution. Parameter values are set as γ = 0.1, kEP = 0.3. (E) Upstream chromatin fluctuations are slow (ω = 59.95). Parameter values are set as γ = 100, kEP = 0.1. (F) Upstream chromatin fluctuations and downstream state switching rates are comparable (ω = 0.91). Parameter values are set as γ = 5, kEP = 0.3. Other parameters in (D-F) are set as αmax = 0.60, αmin = 0.30, β = 0.50, μmax = 3, μmin = 0.1, δ = 0.1.

https://doi.org/10.1371/journal.pcbi.1011722.g002

When focusing only on the downstream gene expression, that is, given a dS (or a frozen E-P topology), we find that the stationary probability distribution of mRNA abundance PmRNA(x) is a Poisson-Beta distribution (Fig 2C, [51]). That is, (9) where ∧ represents the mixture of two distributions, and the gene-state switching rates are in the unit of the mRNA decay rate, i.e., , , (Fig 2C).

Now, we turn to derive mRNA distribution by using the timescale separation method. First, we assume that the upstream process is much faster than the downstream process, i.e., we only consider the limit case of τup/τdown = ς<<1. By rescaling t to t/τdown, we can then assume the formal solution . Considering the leading order of ς and the stationary condition, we have . Since we have neglected the regulation of gene state by chromatin structure, the zero-th order solution p(0)(r,s;t) can be decoupled as p(0)(r,s;t) = pstat(r)p(s;t), implying that the quasi-stationary of chromatin motion is well separated from the downstream transcription. In general, the stationary mRNA probability distribution is given through (Fig 2D) (10) where 〈DS〉 is the expectation of E-P spatial distance and PmRNA|〈DS(x) (the distribution conditional on the averaged dS) is given through Eq [9].

Second, we consider the opposite limit, i.e., τup/τdown = ς>>1. In this limit and similar to the analysis in the case of τup/τdown = ς<<1, we find that p(0)(r,s;t) can also be decoupled as p(0)(r,s;t) = p(r;t)pstat(s|r), where the equation group determines pstat(s|r). Therefore, the chromatin spatial positions are not independent of the downstream gene expression. But the stationary mRNA probability distribution can be calculated according to (Fig 2E) (11) where (i.e., the distributions conditional on the dS) is given by Eq [9].

Finally, in the intermediate regime, we find that P(x) can be well fit with the following formula (Fig 2F) (12) where ω is a scaling factor defined as the ratio of minimum variable transition rate and maximum chromatin motion velocity. , where , b is E-P encounter distance and dS can be selected as the maximum E-P spatial distance that can be theoretically reached (in fact, the cumulative density function of pDS(dS) reaches 0.99).

It is worth mentioning that Eq [12] provides high-accuracy approximations of mRNA steady distribution and is a useful and simple formula for predicting the dynamics of mRNA over a broad range of timescale separation. We can further trace the respective contributions of the system’s key parameters (e.g., dG and kEP) to the patterns of mRNA distribution, to better reveal the essential mechanism of gene expression.

E-P interaction controls the mean level and variability of gene expression

To understand how E-P interaction modulates gene expression, we use the above theoretical analysis to explore the qualitative impact of E-P genomic distance dG and E-P interaction strength kEP on gene expression. We calculate the mean (〈mRNA〉) and the CV of the mRNA probability distribution in steady state, based on Eq [12].

Fig 3A and 3B depict how changes in dG alter 〈mRNA〉 and CV. We find that when kEP is fixed, 〈mRNA〉 monotonously decreases and CV monotonically increases with increasing dG. Theoretically, a smaller dG corresponds to a shorter E-P spatial distance and a more frequent E-P interaction, thus boosting the expression level. Fig 3C and 3D reveal the distinct effects of different kEP on gene expression. It shows that the tendencies are completely different from those in the case of changing dG. 〈mRNA〉 is monotonously upregulated and CV is downregulated by kEP. Indeed, Eq [8] has shown the opposite effect of dG and kEP on E-P spatial distance and then on gene expression. Log-log plot represents power-law behaviors of 〈mRNA〉 regulated by dG or kEP (Fig 3A and 3C inset). Moreover, it can be seen that the enhancement or the reduction of 〈mRNA〉 (at larger dG or kEP) tends to be saturated, indicating that the effect of E-P interaction on gene expression is not infinite but limited.

thumbnail
Fig 3. Influence of E-P interaction on mRNA mean and CV.

(A-B) The effect of E-P genomic distance dG on mRNA mean and CV, where solid lines represent theoretical results, and color points represent numerical results. (C-D) The effect of E-P interaction strength kEP on mRNA mean and CV. The inset in (A) and (C) shows the log-log plot. Other parameter values are set as αmin = 0.06, β = 0.20, μmax = 3, δ = 0.1, γ = 50.

https://doi.org/10.1371/journal.pcbi.1011722.g003

Notably, the qualitative behaviors of 〈mRNA〉 are consistent with many experimental observations [6,10,17,18,52]. For example, a larger dG between sna shadow enhancer and promoter generates fewer mRNAs than a smaller dG [6,17], and a stronger enhancer (corresponding to a larger kEP) produces more mRNAs than a weaker enhancer [6] in living Drosophila embryos. And with the increase of dose concentrations (corresponding to the increment of kEP), the mRNA of the MS2 reporter gene grows and reaches saturation [52]. Another example is that in mESCs, the mRNA level decreases and tends to remain unchanged in response to the enlargement of E-P genomic distance [10].

In addition, we study the combined effect of state-switching rates and E-P interaction on gene expression. The downstream gene-expression model is a variable ON-OFF model with variable α and μ. We find that increasing α (or μ, no matter increase αmax (or μmax) and αmin (or μmin)) enlarges 〈mRNA〉 and reduces CV (Fig 3). This is because larger α can elongate the ON state residence time, and larger μ makes the time interval of mRNA generation become shorter, and each case leads to enlargement in 〈mRNA〉 and diminishment in CV.

E-P interaction can induce bimodal and multimodal mRNA distributions

Having clarified the qualitative effect of E-P interaction on the 〈mRNA〉 and CV of mRNA, we next analyze how E-P interaction impacts the shape of mRNA distribution, including the peak numbers and peak probabilities. We adjust the E-P interaction strength kEP and E-P genomic distance dG and calculate the theoretical distributions according to Eq [12].

Each peak is defined as the observed local maximum value to identify the peak number of mRNA distributions. Based on this, we draw the boundary lines. As a result, we find that the unimodal (U), bimodal (B), and trimodal (T) distributions can appear (Fig 4A). Besides, the heatmap in Fig 4A is the bimodal coefficient BC = 1/(KS2), where is the skewness of mRNA distribution, is the kurtosis, and vi is the i-th central moment, i = 2,3,4.

thumbnail
Fig 4. E-P interaction can induce multiple shapes of mRNA distribution.

(A) Effects of E-P interaction on mRNA distribution. The lines stand for the boundaries of different peak numbers. The plain is divided into distinctive regions representing unimodal (U), bimodal (B) and trimodal (T), respectively. The heatmap is for the bimodal coefficient. (B) Dependence of the most probable mRNA numbers on kEP, where dG = 70. The color regions represent different peak numbers. The color bar represents peak probability. (C) Dependence of the most probable mRNA numbers on dG, where kEP = 10−1.76. (D) The example of unimodal/bimodal/trimodal mRNA distribution in (B). OP (origin peak), NOP (non-origin peak). The solid lines represent theoretical results, and histograms represent numerical results. Parameter values are set as dG = 70, αmax = 2, αmin = 0.06, β = 0.10, μmax = 9, μmin = 3, δ = 0.1, γ = 50.

https://doi.org/10.1371/journal.pcbi.1011722.g004

Fig 4B shows that when dG is fixed, with the increase of kEP, the distribution can appear in five modes: unimodal with the origin peak (OP), bimodal with one OP and one non-origin peak (NOP), trimodal with one OP and two NOPs, bimodal with two NOPs and unimodal with the NOP. In fact, we can see the evolutionary process of the peak numbers and peak probabilities. The OP goes under with amplifying the kEP, and a NOP begins to grow and a bimodal then occurs. When the distance between the two peaks of the bimodal becomes larger, a peak emerges at the larger mRNA and the bimodal distribution turns to trimodal distribution. Subsequently, the OP vanishes and gets back to the bimodal. The distance between the two NOPs of the bimodal distribution can become smaller, and the peak corresponding to the fewer mRNA disappears when the bimodal becomes unimodal. Fig 4D perfectly demonstrates the above process, and the simulations (histograms) are in good agreement with theoretical predictions (solid lines). In addition, when we fix kEP, the influence of dG on the distribution modes is opposite to that of kEP (Comparing Fig 4B and 4C). It should be pointed out that although the trimodal distribution may be absent under some parameter values, the pattern of distribution along kEP or dG still changes from unimodal to bimodal and then back to unimodal. During the bimidal phase, the peak at the smaller mRNA gradually disappears, resulting in the transition from bimodal to unimodal (see S2 and S3 Figs).

These results indicate that E-P interaction can produce multiple modes of mRNA distribution. More importantly, it can lead to mRNA distributions with two NOPs and even with three peaks. S1 Fig shows more bimodal and trimodal cases under different parameter values. Notably, only the downstream ON-OFF model (Eq [9]) can produce neither bimodal with two NOPs nor trimodal. In fact, the phenomenon of bimodal with two NOPs has been confirmed in biological experiments, such as the random activity of latent HIV-1 promoters [53]. In Ref [54],the researchers studied the transcriptional behavior of GREB1 changes with estrogen dose. Generally, addition of estrogen (17b-estradiol, E2) may increase contacts between the GREB1 gene and the estrogen-receptor-α-bound enhancer [55], thus altering the E-P interaction strength. When increasing the E2 concentration (increasing the E-P interaction strength), the mRNA level grows and the mRNA distribution changes from the single peak to a bimodal distribution. A new peak emerges at a higher mRNA level and gradually increases and the peak near the origin gradually decreases, which is consistent with our model (see S2 Fig). In a word, these phenomena highlight the importance of E-P interaction in regulating gene expression, which may explain the source of multimodality. And the distinctive modes of mRNA distribution for achieving fast responses to stimulations and phenotypic switching would be essential for environmental adaptation and cellular decision-making.

Analysis of mouse embryonic stem cell data indicates that E-P interaction regulates promoter activation and transcription initiation

To check the effectiveness of our model, we used different E-P distances that forming distinctive cell lines Ck (k = 1,…,6) and corresponding mRNA distribution measured by smRNA-FISH in mESCs [10] to infer gene expression dynamics (Materials And Methods). Assume that the mRNA distributions in different cell lines Ck are different only due to the different E-P genomic distances, and that an enhancer is inserted into the upstream or downstream of an promoter with the same genomic distance has no significant difference. The E-P genomic distances for different cell lines are NaN, 112.1710 kb, 39.4530 kb, 23.1110 kb, 17.0190 kb and 6.0600 kb respectively, where the NaN means the eGFP transcription is driven by the Sox2 promoter alone. Then, we suppose that one monomer in the coarse-grained polymer model represents 5kb, the E-P genomic distances for our model in different cell lines Ck are NaN, 22, 8, 5, 3, and 1, respectively. It should be pointed out that the dG is the value after rounding.

Frist, we preliminarily attempt to use the two-state model without considering E-P regulatory to infer gene expression dynamics. We find that the changing α and μ in different cell lines get the minimize cross entropy (Table A in S1 Text). Then, we involve the long-range E-P regulatory into the model and define three variable two-state models (the variable α two-state model, the variable μ two-state model, and the variable α and μ two-state model) to simulate the transcription process (S1 Text). For each cell line Ck, we calculate the steady-state probability distributions by using Eq [12] and use Eq [6] to estimate the optimized parameter values (Materials And Methods). We find that the two-state model with variable α and μ obtains the minmum cross entropy, which indicates E-P communication prefers to regulate the promoter activation (α) and transcription initiation (μ) rates (Table B in S1 Text). The multiscale model fit converged to a good set of parameters describing chromatin and gene expression dynamics (see Table C in S1 Text). In addition, different monomer lengths do not alter the fitting results (see Table D in S1 Text).

As a result, Fig 5A shows that the 4D nucleome equation can well fit the mRNA distributions under different E-P interaction. It should be pointed out that the weight of slow chromatin dynamics (1/(1+ω)) plays an important role in modifying the fitted distribution (see S5 Fig and S1 Text). Based on the best fitting parameter values and Eq [8], we can further study the relationship between E-P genomic distance and E-P contact probability (see S1 Text). Fig 5B shows that with increasing E-P genomic distance, the contact probability steeply decays and tends to be flat, which is in good agreement with the experimental results confirmed using Capture-C analysis [10]. Log-log plot represents a power-law behavior between E-P genomic distance and contact probability (Fig 5B inset), also in accordance with many experimental data in consecutive and nonconsecutive TAD borders [56], even on the genome-wide scale [57,58]. Second, Fig 5C shows the relationship between contact probability and mean gene expression level. We see that the values and the change tendency of the contact probability and average mRNA level obtained by theoretical calculations (light red circle) are basically consistent with the experimental data (light blue circle). If we focus on the six special cell lines shown in Fig 5A, we find that the inferred results (red polygon) and experimental data (blue polygon) are relatively consistent in values and change tendency. Third, for the relationship between CV of eGFP levels and contact probability (Fig 5D), we find that the theoretical result and experimental data are also relatively consistent in values and change tendency. In addition, we consider changing the upstream-downstream nonlinear Hill function in the model to a linear form (i.e., using a linear function within a reasonable range of E-P spatial distances). By using the same fitting method, we find that the linear model fits the experiemental data well (see S6 Fig). Therefore, in our model, non-linear dependence of the activation rate on E-P contacts does not seem to be a dominant cause of the nonlinear relationship between connection probability and mean expression observed in the experiment. Rather, the coupling of the fast-slow chromatin dynamics may be an important factor contributing to the nonlinearity.

thumbnail
Fig 5. Results obtained by analyzing experiment data on mESCs in response to the varying E-P genomic distance.

(A) Distributions of the mRNA numbers of smRNA-FISH in different cell lines. The solid line shows the best fit of our stochastic multiscale model to the experimental data shown in the histograms. dG = NaN represents promoter-only control cell line. Parameter values are listed in Table E in S1 Text. (B) The relationship between dG and contact probability. The blue circles represent the experimental data from the ectopic Sox 2 transgene. The solid line is obtained based on the best-fit values of parameters kNN and kEP. The inset shows the log-log plot. (C) Mean eGFP mRNA level plotted against contact probability between the ectopic Sox2 promoter and SCR insertions. The shadow blue dots show the experimental data presented as mean values +/- standard deviation, whereas the shadow red dots show the theoretical results obtained by best-fit parameter values. The blue polygon shows the experimental data in (A), and the red polygon shows the corresponding theoretical results. (D) CV of eGFP level against contact probability. The meanings of symbols are the same as those in (C).

https://doi.org/10.1371/journal.pcbi.1011722.g005

Finally, we verify that the inferred results are consistent with previous observations. The inferred kNN = 0.7758 is one order of magnitude larger than kEP = 0.0969, indicating that the ectopic enhancer and promoter do not have a prominent specific interaction compared to the connection between successive monomers. This result is consistent with experimental studies [10]. Meanwhile, the fold change between αmax and αmin (18 folds) is larger than that between μmax and μmin (8 folds), indicating that E-P interaction is inclined to adjust OFF state dwell time (related to α) and further verifying that E-P interaction mainly regulates burst frequency [6,7,10,59,60]. According to the inferred results, the ON-state dwell time is in the timescale of minutes, and the OFF-state dwell time is from minutes to hours [6163] (considering that the time unit is mRNA lifetime, expected to be around 1.5 hours [64]). The number of mRNA per burst is about tens of mRNAs, consistent with a previous experimental observation [63].

Taken together, our model results are consistent with previous experimental observations and suggest a possible essential mechanism of gene expression regulated by E-P interaction.

Discussion

Imaging studies, high-resolution chromatin conformation maps, and genome-wide occupancy profiles of architectural proteins have revealed that genome topology encoding E-P interaction information is tightly correlated with gene expression. In this paper, we have proposed a general yet tractable multiscale model, which integrates the E-P interaction information into gene expression, to shed light on the pivotal role of E-P spatial interaction in the control of gene expression profiles.

This theoretical framework characterizes the dynamic process of stochastic gene expression regulated by chromatin movement in the nucleus by using a 4D nucleome equation. First, the fast-scale chromatin motion processes (including E-P interaction) can be described by a generalized Rouse model, which is suitable for modeling the situation that the environmental effects of entanglement and crowding are negligible [65]. Second, the slow-scale gene expression process is described by a two-state model that captures essential events occurring in transcriptional processes. Third, an input-output relation was proposed to link upstream chromatin configurations to downstream gene expression. This equation unifies two stochastic processes with two different time scales, which can be taken as a good starting point for analyzing how chromatin motion affects gene expression in complex cases.

Overcoming time-scale differences and biological-process complexity to solve mRNA distribution analytically in this 4D nucleome equation is challenging. However, the analytical methodology developed here can be used in quantitatively studying distribution characteristics of gene expression involving E-P interaction. In fact, we have used the timescale separation method to obtain analytical mRNA distributions. In addition, our approach also allows us to make meaningful predictions about how upstream chromatin dynamics affect downstream gene-expression phenotypes. First, we have shown that E-P genomic distance dG and E-P interaction strength kEP, two key parameters in our model, have the opposite change trends on mRNA mean levels and CVs, which were qualitatively verified by experimental data. Second, our model demonstrated that the combination of kEP and dG is a flexible regulation strategy to explain possible implications of bimodal distributions with two NOPs and trimodal distributions. The peaks are more sensitive to the change of dG when kEP is small, but to the change of kEP when dG is large (Figs 4, S2, and S3). S7 Fig also shows different parameters in modulating mRNA distribution. System’s parameters would collectively regulate the resulting mRNA distribution, and the parameters related to E-P communication shows a consistent pattern of modulating mRNA distribution across parameter changes. These sensitivity phenomenon provide insights into complex mechanisms of biological processes and are essential for cellular decision-making and environmental adaptation. Third, our mathematical model shows well predictions on the experimental mESCs data, including the mRNA distributions and the relationship between contact probability and mean mRNA level (and CV) under defferent E-P interactions. The inference results indicate that the slow chromatin dynamics plays an important role in regulating the distribution of mRNAs (S5 Fig). And in particularly, for the slow dynamics, the regulation of upstream to downstream is inseparable, which may cause the gene expression levels of the slow limit are not linear dependence on E-P contacts and eventually lead to the nonlinearity observed in the experiment between transcriptional levels and E-P contact probabilities. These results provide a possible mechanism for E-P interaction translated into gene-expression dynamics.

Our model has several advantages compared to previous work that studied the effect of chromatin structure on gene expression. Xiao et al. [31] proposed a mechanism of how E-P signals were involved in the accumulation and removal of the transcription factors favoring transcription to explain the hypersensitivity of transcription to changes in contact frequency. Zuin et al. [10] used the E-P encounter probability obtained from the data analysis as the regulator of the upstream to the downstream and assumed that the OFF to ON process was a multi-step process with cumulative effects. However, we note that the corresponding models did not use dynamically fluctuating E-P spatial distances to regulate downstream transcription processes. In our model, random movement of the enhancer and promoter causes fluctuations in E-P spatial distance and regulates gene transcription in real time. In fact, this direct and dynamic regulation leads to the variable rate from OFF to ON, which is practically an alternative to the multi-step process described above. We point out that in the case of fast chromatin dynamics, upstream-to-downstream regulation is performed through the expectation of the E-P spatial distance, which can be equated, in a general way, to the contact probability in Ref. [10]. However, the slow chromatin dynamics also plays an important role in regulating the distribution of mRNAs (see S1 Text). Moreover, our model has the ability to infer the gene-expression dynamics using less information that can be directly measured and leading to better fitting on mESC data to some extent (S4 Fig), especially for the E-P genomic distance in different cell lines, rather than the E-P encounter probability.

Significantly, our modeling framework can also be extended to more complex situations. For example, some experimental studies reported that gene products could affect chromatin structure [66,67]. We can incorporate this feedback into our model by modifying the gene-state-dependent drift function V(r,s;t) in Eq [4]. In addition, our modeling of chromatin motion is not limited to one-pair E-P interaction. The potential extensions include the cases of multiple enhancers to one promoter [68], one enhancer to multiple promoters [6], or super-enhancers [69]. Finally, using multistate models of gene expression to extract insights from enormous experimental data and complex biological phenomena is impressive. Our two-state model is not the default option, and we may adjust the form of the downstream gene expression model to include more complex biological processes such as mRNA splices [70] and cell cycle [71]. However, we still need to balance the complexity and solvability of the model.

Finally, we point out that our theoretical model, which aims to develop a general modeling framework to study 4D gene-expression kinetics, may provide an opportunity for a dialogue between theoretical studies and biological experiments. We envision that our modeling framework will be helpful for the biophysical analysis of broader in vivo cellular processes.

Supporting information

S1 Text. It consists of two parts: (1) model description and simulation; (2) fitting experimental data using model.

In the first section, we supplement the details of the model and give the simulation algorithm as well as the statistics analyses of the simulation data. In the second section, we present the procedure for fitting experimental data using our model and comparing the experimental data with theoretical results.

https://doi.org/10.1371/journal.pcbi.1011722.s001

(PDF)

S1 Fig. Bimodal and trimodal cases under different parameters.

(A) The bimodal distribution (with a origin peak and a non-origin peak) of gene expression where kEP = 0.5, γ = 1, αmax = 0.10, αmin = 0.01, β = 0.03, μmax = 5, μmin = 2, δ = 0.1. (B) The bimodal distribution (with two origin peaks) of gene expression where kEP = 0.20, γ = 100, αmax = 5, αmin = 0.30, β = 0.50, μmax = 8, μmin = 2, δ = 0.1. (C) The trimodal distribution of gene expression where kEP = 0.1, γ = 300, αmax = 0.5, αmin = 0.05, β = 0.02, μmax = 10, μmin = 2, δ = 0.1.

https://doi.org/10.1371/journal.pcbi.1011722.s002

(TIF)

S2 Fig. E-P interaction can induce bimodal mRNA distribution with two non-origin peaks.

(A) Effects of E-P interaction on mRNA distribution. The black lines stand for the boundaries of different peak numbers. The plain is divided into distinctive regions representing unimodal (U) and bimodal (B). The heatmap is the bimodal coefficient. (B) Dependence of the most probable mRNA numbers on kEP where dG = 60. The color regions represent different peak numbers. The color bar represents the peak probabilities. (C) The example of unimodal/bimodal mRNA distribution. NOP (non-origin peak). The solid lines represent theoretical results and histograms represent numerical results. Parameter values are set as dG = 60, αmax = 5, αmin = 0.3, β = 0.5, μmax = 8, μmin = 2, δ = 0.1, γ = 50.

https://doi.org/10.1371/journal.pcbi.1011722.s003

(TIF)

S3 Fig. E-P interaction can induce bimodal mRNA distribution with one origin peak and one non-origin peak.

(A) Effects of E-P interaction on mRNA distribution. The black lines stand for the boundaries of different peak numbers. The plain is divided into distinctive regions representing unimodal (U) and bimodal (B). The heatmap is the bimodal coefficient. (B) Dependence of the most probable mRNA numbers on kEP where dG = 60. The color regions represent different peak numbers. The color bar represents the peak probabilities. (C) The example of unimodal/bimodal mRNA distribution. OP (origin peak), NOP (non-origin peak). The solid lines represent theoretical results and histograms represent numerical results. Parameter values are set as dG = 60, αmax = 0.10, αmin = 0.01, β = 0.03, μmax = 5, μmin = 2, δ = 0.1, γ = 1.

https://doi.org/10.1371/journal.pcbi.1011722.s004

(TIF)

S4 Fig. Comparisons the fitting results of our model and Zuin’s model to experimental data.

(A) KS distances of different cell lines (the KS distance is defined in Eq [16] in the S1 Text). The blue dots and shadow line stand for the KS distance between our model and experimental data, and the dashed blue line represents the mean KS distance for all cell lines. The red shows the KS distance between Zuin’s model and experimental data. (B) Distribution of mRNA numbers of smRNA-FISH in cell lines C6. The blue line shows the best fit of our model whereas the red line shows the best fit of Zuin’s model to the experimental data shown in the histograms. (C) CDF of the distributions in (B). The and . The small arrows represent the number of mRNAs corresponding to the KS distance.

https://doi.org/10.1371/journal.pcbi.1011722.s005

(TIF)

S5 Fig. The contribution of slow chromatin dynamics to data fitting.

(A) The weight of slow chromatin dynamics in fitting distribution (1/(1+ω)). (B) KS distances of different cell lines. The blue dots and shadow line stand for the KS distance between the fast distribution and experimental data, and the red shows the KS distance between interpolation distribution and experimental data. The dashed lines represent the mean KS distance for all cell lines. (C) The distribution of Cell line 2. (D) The distribution of Cell line 6.

https://doi.org/10.1371/journal.pcbi.1011722.s006

(TIF)

S6 Fig. Comparing the fitting results of model’s Hill and linear dependence to experimental data.

(A) KS distances of different cell lines. The red (/green) dots and shadow line stand for the KS distance between Hill (/linear) (Eq [3]/Eq [4] in S1 Text) dependence and experimental data. The blue shows the KS distance between Zuin’s model and experimental data. the dashed lines represent the mean KS distance for all cell lines. (B) The relationship between dG and contact probability. The blue circles represent the experimental data from the ectopic Sox 2 transgene. The solid red (/green) line is obtained based on the Hill (/linear) dependence. The inset shows the log-log plot. (C) Mean eGFP mRNA level plotted against contact probability between the ectopic Sox2 promoter and SCR insertions. The blue polygon shows the experimental data, and the red (/green) polygon shows the corresponding theoretical results based on the Hill (/linear) dependence. (D) CV of eGFP level against contact probability. The meanings of symbols are the same as those in (C).

https://doi.org/10.1371/journal.pcbi.1011722.s007

(TIF)

S7 Fig. Comparisons of parameter’s changes on the multimodality of mRNA distribution.

(A) Effects of E-P interaction strengths kEP and friction coefficient γ on the pattern of mRNA distribution. Parameters are dG = 60, αmax = 5, αmin = 0.3, β = 0.5, μmax = 8, μmin = 2, δ = 0.1. (B) Effects of kEP and the ratio of μmax/μmin on the pattern of mRNA distribution. The μmin = 0.1, 0.3, 0.5. Parameters are αmax = 0.10, αmin = 0.01. (C) Effects of kEP and the ratio of αmax/αmin on the pattern of mRNA distribution. The αmin = 0.01, 0.04, 0.08. Parameters are μmax = 5, μmin = 2. Each line divides the entire region into two parts, with U indicating a single peak area and M indicating a multimodal area. Other parameter values are set as dG = 60, β = 0.03, δ = 0.1, γ = 1. The solid line divides the entire region into two parts, with the part marked U indicating unimodal and the part marked M indicating multimodal (including bimodal and trimodal).

https://doi.org/10.1371/journal.pcbi.1011722.s008

(TIF)

References

  1. 1. Robson MI, Ringel AR, Mundlos S. Regulatory landscaping: How enhancer-promoter communication is sculpted in 3D. Mol Cell. 2019;74(6):1110–22. pmid:31226276
  2. 2. Andersson R, Sandelin A. Determinants of enhancer and promoter activities of regulatory elements. Nat Rev Genet. 2020;21(2):71–87. pmid:31605096
  3. 3. Schoenfelder S, Fraser P. Long-range enhancer–promoter contacts in gene expression control. Nat Rev Genet. 2019;20(8):437–55. pmid:31086298
  4. 4. Haberle V, Stark A. Eukaryotic core promoters and the functional basis of transcription initiation. Nat Rev Mol Cell Biol. 2018;19(10):621–37. pmid:29946135
  5. 5. Lettice LA, Heaney SJ, Purdie LA, Li L, de Beer P, Oostra BA, et al. A long-range Shh enhancer regulates expression in the developing limb and fin and is associated with preaxial polydactyly. Hum Mol Genet. 2003;12(14):1725–35. pmid:12837695
  6. 6. Fukaya T, Lim B, Levine M. Enhancer control of transcriptional bursting. Cell. 2016;166(2):358–68. pmid:27293191
  7. 7. Bartman CR, Hsu SC, Hsiung CC-S, Raj A, Blobel GA. Enhancer regulation of transcriptional bursting parameters revealed by forced chromatin looping. Mol Cell. 2016;62(2):237–47. pmid:27067601
  8. 8. Chen H, Levo M, Barinov L, Fujioka M, Jaynes JB, Gregor T. Dynamic interplay between enhancer–promoter topology and gene activity. Nat Genet. 2018;50(9):1296–303. pmid:30038397
  9. 9. Rodriguez J, Ren G, Day CR, Zhao K, Chow CC, Larson DR. Intrinsic dynamics of a human gene reveal the basis of expression heterogeneity. Cell. 2019;176(1–2):213–26. pmid:30554876
  10. 10. Zuin J, Roth G, Zhan Y, Cramard J, Redolfi J, Piskadlo E, et al. Nonlinear control of transcription through enhancer-promoter interactions. Nature. 2022;604:571–7. pmid:35418676
  11. 11. Li J, Hsu A, Hua Y, Wang G, Cheng L, Ochiai H, et al. Single-gene imaging links genome topology, promoter–enhancer communication and transcription control. Nat Struct Mol Biol. 2020;27(11):1032–40. pmid:32958948
  12. 12. Su J-H, Zheng P, Kinrot SS, Bintu B, Zhuang X. Genome-scale imaging of the 3D organization and transcriptional activity of chromatin. Cell. 2020;182(6):1641–59. pmid:32822575
  13. 13. Gizzi AMC, Cattoni DI, Fiche J-B, Espinola SM, Gurgo J, Messina O, et al. Microscopy-based chromosome conformation capture enables simultaneous visualization of genome organization and transcription in intact organisms. Mol Cell. 2019;74(1):212–22. pmid:30795893
  14. 14. Bohrer CH, Larson DR. The stochastic genome and its role in gene expression. Cold Spring Harb Perspect Biol. 2021;13:a040386. pmid:33903158
  15. 15. Hübner MR, Eckersley-Maslin MA, Spector DL. Chromatin organization and transcriptional regulation. Curr Opin Genet Dev. 2013;23(2):89–95. pmid:23270812
  16. 16. Oudelaar AM, Higgs DR. The relationship between genome structure and function. Nat Rev Genet. 2021;22(3):154–68. pmid:33235358
  17. 17. Yokoshi M, Segawa K, Fukaya T. Visualizing the role of boundary elements in enhancer-promoter communication. Mol Cell. 2020;78(2):224–35. pmid:32109364
  18. 18. Senecal A, Munsky B, Proux F, Ly N, Braye FE, Zimmer C, et al. Transcription factors modulate c-Fos transcriptional bursts. Cell Rep. 2014;8(1):75–83. pmid:24981864
  19. 19. Bartman CR, Hamagami N, Keller CA, Giardine B, Hardison RC, Blobel GA, et al. Transcriptional burst initiation and polymerase pause release are key control points of transcriptional regulation. Mol Cell. 2019;73(3):519–32. pmid:30554946
  20. 20. Stavreva DA, Coulon A, Baek S, Sung MH, John S, Stixova L, et al. Dynamics of chromatin accessibility and long-range interactions in response to glucocorticoid pulsing. Genome Res. 2015;25(6):845–57. pmid:25677181
  21. 21. Hakim O, Sung MH, Voss TC, Splinter E, John S, Sabo PJ, et al. Diverse gene reprogramming events occur in the same spatial clusters of distal regulatory elements. Genome Res. 2011;21(5):697–706. pmid:21471403
  22. 22. Shahrezaei V, Swain PS. Analytical distributions for stochastic gene expression. Proc Natl Acad Sci USA. 2008;105:17256–61. pmid:18988743
  23. 23. Kumar N, Platini T, Kulkarni RV. Exact distributions for stochastic gene expression models with bursting and feedback. Phys Rev Lett. 2014;113:268105. pmid:25615392
  24. 24. Wang Z, Zhang Z, Zhou T. Exact distributions for stochastic models of gene expression with arbitrary regulation. Sci China Math. 2020;63(3):485–500. https://doi.org/10.1007/s11425-019-1622-8
  25. 25. Zhang J, Zhou T. Markovian approaches to modeling intracellular reaction processes with molecular memory. Proc Natl Acad Sci USA. 2019;116(47):23542–50. pmid:31685609
  26. 26. Raj A, Peskin CS, Tranchina D, Vargas DY, Tyagi S. Stochastic mRNA synthesis in mammalian cells. PLoS Biol. 2006;4(10):e309. pmid:17048983
  27. 27. Peccoud J, Ycart B. Markovian modeling of gene-product synthesis. Theor Popul Biol. 1995;48:222–34. https://doi.org/10.1006/tpbi.1995.1027
  28. 28. Zhang J, Zhou T. Promoter-mediated transcriptional dynamics. Biophys J. 2014;106(2):479–88. pmid:24461023
  29. 29. Harper CV, Finkenstädt B, Woodcock DJ, Friedrichsen S, Semprini S, Ashall L, et al. Dynamic analysis of stochastic transcription cycles. PLoS Biol. 2011;9(4):e1000607. pmid:21532732
  30. 30. Zhou T, Zhang J. Analytical results for a multi-state gene model. SIAM J Appl Math. 2012;72:789–818. http://dx.doi.org/10.1137/110852887.
  31. 31. Xiao JY, Hafner A, Boettiger AN. How subtle changes in 3D structure can create large changes in transcription. eLife. 2021;10:e64320. pmid:34240703
  32. 32. Sood V, Misteli T. The stochastic nature of genome organization and function. Curr Opin Genet Dev. 2022;72:45–52. pmid:34808408
  33. 33. Dekker J, Mirny L. The 3D genome as moderator of chromosomal communication. Cell. 2016;164(6):1110–21. pmid:26967279
  34. 34. Elimelech A, Birnbaum RY. From 3D organization of the genome to gene expression. Curr Opin Syst Biol. 2020;22:22–31. https://doi.org/10.1016/j.coisb.2020.07.006
  35. 35. Brouwer I, Lenstra TL. Visualizing transcription: Key to understanding gene expression dynamics. Curr Opin Chem Biol. 2019;51:122–9. pmid:31284216
  36. 36. Lammers NC, Kim YJ, Zhao J, Garcia HG. A matter of time: Using dynamics and theory to uncover mechanisms of transcriptional bursting. Curr Opin Cell Biol. 2020;67:147–57. pmid:33242838
  37. 37. Johnstone CP, Wang NB, Sevier SA, Galloway KE. Understanding and engineering chromatin as a dynamical system across length and timescales. Cell Syst. 2020;11(5):424–48. pmid:33212016
  38. 38. Heist T, Fukaya T, Levine M. Large distances separate coregulated genes in living Drosophila embryos. Proc Natl Acad Sci USA. 2019;116(30):15062–7. pmid:31285341
  39. 39. Lim B, Heist T, Levine M, Fukaya T. Visualization of transvection in living Drosophila embryos. Mol Cell. 2018;70(2):287–96. pmid:29606591
  40. 40. Lammers NC, Galstyan V, Reimer A, Medin SA, Wiggins CH, Garcia HG. Multimodal transcriptional control of pattern formation in embryonic development. Proc Natl Acad Sci USA. 2020;117(2):836–47. pmid:31882445
  41. 41. Dekker J, Belmont A, Guttman M, Leshyk V, Lis J, Lomvardas S, et al. The 4D nucleome project. Nature. 2017;549(7671):219–26. pmid:28905911
  42. 42. Marti-Renom MA, Almouzni G, Bickmore WA, Bystricky K, Cavalli G, Fraser P, et al. Challenges and guidelines toward 4D nucleome data and model standards. Nat Genet. 2018;50(10):1352–8. pmid:30262815
  43. 43. Doi M, Edwards SF, Edwards S. The theory of polymer dynamics: Oxford University Press; 1988.
  44. 44. Barbieri M, Chotalia M, Fraser J, Lavitas L, Dostie J, Pombo A, et al. Complexity of chromatin folding is captured by the strings and binders switch model. Proc Natl Acad Sci USA. 2012;109(40):16173–8. pmid:22988072
  45. 45. Shukron O, Holcman D. Transient chromatin properties revealed by polymer models and stochastic simulations constructed from Chromosomal Capture data. PLoS Comput Biol. 2017;13(4):e1005469. pmid:28369076
  46. 46. Giorgetti L, Galupa R, Nora EP, Piolot T, Lam F, Dekker J, et al. Predictive polymer modeling reveals coupled fluctuations in chromosome conformation and transcription. Cell. 2014;157(4):950–63. pmid:24813616
  47. 47. Lu L, Liu X, Huang W-K, Giusti-Rodríguez P, Cui J, Zhang S, et al. Robust Hi-C maps of enhancer-promoter interactions reveal the function of non-coding genome in neural development and diseases. Mol Cell. 2020;79(3):521–34. e15. pmid:32592681
  48. 48. Goutelle S, Maurin M, Rougier F, Barbaut X, Bourguignon L, Ducher M, et al. The Hill equation: A review of its capabilities in pharmacological modelling. Fundam Clin Pharmacol. 2008;22(6):633–48. pmid:19049668
  49. 49. Alexander JM, Guan J, Li B, Maliskova L, Song M, Shen Y, et al. Live-cell imaging reveals enhancer-dependent Sox2 transcription in the absence of enhancer proximity. eLife. 2019;8:e41769. pmid:31124784
  50. 50. Brueckner DB, Chen H, Barinov L, Zoller B, Gregor T. Stochastic motion and transcriptional dynamics of pairs of distal DNA loci on a compacted chromosome. 2023. https://doi.org/10.1101/2023.01.18.524527.
  51. 51. Vu TN, Wills QF, Kalari KR, Niu N, Wang L, Rantalainen M, et al. Beta-Poisson model for single-cell RNA-seq data analyses. Bioinformatics. 2016;32(14):2128–35. pmid:27153638
  52. 52. Larson DR, Fritzsch C, Sun L, Meng X, Lawrence DS, Singer RH. Direct observation of frequency modulated transcription in single cells using light activation. eLife. 2013;2:e00750. pmid:24069527
  53. 53. Tantale K, Garcia-Oliver E, Robert M-C, L’hostis A, Yang Y, Tsanov N, et al. Stochastic pausing at latent HIV-1 promoters generates transcriptional bursting. Nat Commun. 2021;12(1):1–20. https://doi.org/10.1038/s41467-021-24462-5.
  54. 54. Fritzsch C, Baumgärtner S, Kuban M, Steinshorn D, Reid G, Legewie S. Estrogen-dependent control and cell-to-cell variability of transcriptional bursting. Mol Syst Biol. 2018;14(2):e7678. pmid:29476006
  55. 55. Hevener AL, Ribas V, Moore TM, Zhou Z. ERα in the control of mitochondrial function and metabolic health. Trends Mol Med. 2021;27(1):31–46. https://doi.org/10.1016/j.molmed.2020.09.006.
  56. 56. Cattoni DI, Gizzi AMC, Georgieva M, Di Stefano M, Valeri A, Chamousset D, et al. Single-cell absolute contact probability detection reveals chromosomes are organized by multiple low-frequency yet specific interactions. Nat Commun. 2017;8(1):1753. pmid:29170434
  57. 57. Ing-Simmons E, Vaid R, Bing XY, Levine M, Mannervik M, Vaquerizas JM. Independence of chromatin conformation and gene regulation during Drosophila dorsoventral patterning. Nat Genet. 2021;53(4):487. pmid:33795866
  58. 58. Bonev B, Cohen NM, Szabo Q, Fritsch L, Papadopoulos GL, Lubling Y, et al. Multiscale 3D genome rewiring during mouse neural development. Cell. 2017;171(3):557–72. pmid:29053968
  59. 59. Larsson AJM, Johnsson P, Hagemann-Jensen M, Hartmanis L, Faridani OR, Reinius B, et al. Genomic encoding of transcriptional burst kinetics. Nature. 2019;565(7738):251–4. pmid:30602787
  60. 60. Ochiai H, Hayashi T, Umeda M, Yoshimura M, Harada A, Shimizu Y, et al. Genome-wide kinetic properties of transcriptional bursting in mouse embryonic stem cells. Sci Adv. 2020;6(25):eaaz6699. pmid:32596448
  61. 61. Molina N, Suter DM, Cannavo R, Zoller B, Gotic I, Naef F. Stimulus-induced modulation of transcriptional bursting in a single mammalian gene. Proc Natl Acad Sci USA. 2013;110(51):20563–8. pmid:24297917
  62. 62. Halpern KB, Tanami S, Landen S, Chapal M, Szlak L, Hutzler A, et al. Bursty gene expression in the intact mammalian liver. Mol Cell. 2015;58(1):147–56. pmid:25728770
  63. 63. Suter DM, Molina N, Gatfield D, Schneider K, Schibler U, Naef F. Mammalian genes are transcribed with widely different bursting kinetics. Science. 2011;332(6028):472–4. pmid:21415320
  64. 64. Horvathova I, Voigt F, Kotrys AV, Zhan Y, Artus-Revel CG, Eglinger J, et al. The dynamics of mRNA turnover revealed by single-molecule imaging in single cells. Mol Cell. 2017;68(3):615–25. pmid:29056324
  65. 65. Zhang Y, Dudko OK. First-passage processes in the genome. Annu Rev Biophys. 2016;45(1):117–34. pmid:27391924
  66. 66. Heinz S, Texari L, Hayes MG, Urbanowski M, Chang MW, Givarkes N, et al. Transcription elongation can affect genome 3D structure. Cell. 2018;174(6):1522–36. pmid:30146161
  67. 67. Busslinger GA, Stocsits RR, Van Der Lelij P, Axelsson E, Tedeschi A, Galjart N, et al. Cohesin is positioned in mammalian genomes by transcription, CTCF and Wapl. Nature. 2017;544(7651):503–7. https://doi.org/10.1038/nature22063
  68. 68. He B, Chen C, Teng L, Tan K. Global view of enhancer–promoter interactome in human cells. Proc Natl Acad Sci USA. 2014;111(21):E2191–E9. pmid:24821768
  69. 69. Pott S, Lieb JD. What are super-enhancers? Nat Genet. 2015;47(1):8–12. pmid:25547603
  70. 70. Kan Z, States D, Gish W. Selecting for functional alternative splices in ESTs. Genome Res. 2002;12(12):1837–45. pmid:12466287
  71. 71. Johnson DG, Walker CL. Cyclins and cell cycle checkpoints. Annu Rev Pharmacol Toxicol. 1999;39:295–312. pmid:10331086