Figures
Abstract
RNA-binding proteins (RBP) are important regulators of RNA metabolism. In neurodegenerative disorders such as Huntington’s Disease (HD), disrupted RBP-RNA interactions contribute to neuronal dysfunction. One such RBP, Midline 1 (MID1), has been shown to aberrantly associate with mutant huntingtin (Htt) RNA, enhancing its translation, yet the mechanism driving this effect remains unknown. Here, we develop a computational model to understand the role of MID1. Based on previously published data, our model predicts that MID1 increases the stability of the Htt RNA. We experimentally validate this prediction, showing that overexpression of MID1 significantly prolongs the half-life of mutant Htt RNA. Furthermore, we evaluate model refinements, including clustering of MID1-bound RNA, which allow capturing all key observations in the data. Together, we provide a data-driven framework that underlines the importance of RBP-RNA interaction in post-transcriptional regulation. This framework also shows how individual molecular reactions jointly determine RNA stability and protein levels in HD.
Author summary
Huntington’s Disease is a severe brain disorder caused by a mutation that makes a part of the huntingtin gene much longer than normal. This change affects how the gene’s RNA interacts with proteins that normally control how long RNAs live and how much protein they produce. One of these proteins, called MID1, attaches more strongly to the mutant huntingtin RNA and increases the amount of mutant huntingtin protein, which may worsen the disease.
In this study, we use mathematical modeling together with experimental data to understand how MID1 affects huntingtin RNA. Our model suggests that MID1 makes the RNA more stable, allowing it to live in the cell longer and produce more protein. We confirm this prediction in biological experiments. We also find that RNAs bound to MID1 may group together inside cells, further boosting protein production. Our findings show how combining modeling and experiments can uncover hidden mechanisms in diseases like Huntington’s Disease and may help identify new ways to control harmful protein buildup inside cells.
Citation: Liu Y, Reisbitzer A, Dorešić D, Hasenauer J, Krauß S, Tchumatchenko T (2026) Data-driven model reveals increased stability of CAG-expanded huntingtin RNA due to MID1 binding. PLoS Comput Biol 22(6): e1014342. https://doi.org/10.1371/journal.pcbi.1014342
Editor: Marc R. Birtwistle, Clemson University, UNITED STATES OF AMERICA
Received: November 5, 2025; Accepted: May 18, 2026; Published: June 2, 2026
Copyright: © 2026 Liu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data, SBML and PEtab files, code, and instructions to perform all analyses are available at https://doi.org/10.5281/zenodo.17478761.
Funding: This work was supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy (EXC 2047—390685813, EXC 2151—390873048), by the European Union via ERC grant MolDynForSyn to TT (GA number 945700), and by the University of Bonn via the Bonn Center for Mathematical Life Sciences, the Schlegel Professorship of J.H., and the Institute for Experimental Epileptology and Cognition Research. We acknowledge the Marvin and Unicorn clusters hosted by the University of Bonn. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Translation of RNA into protein is a fundamental cellular process that influences the amount of proteins available for cellular activity [1,2]. Such a process is subject to multilayered regulation, both temporally [3] and spatially [4–6], and even minor perturbations can result in severe cellular dysfunction [7–9]. A central mechanism for this regulation involves interactions between RNA and RNA-binding proteins (RBP), which coordinate key steps of RNA metabolism such as splicing, localization, stability, and translation efficiency [10–12]. Disruptions in these interactions are particularly detrimental in neurons, where RNA processing plays a pivotal role in maintaining neuronal integrity and function [13,14]. Indeed, aberrant RBP activity is increasingly recognized as a pathogenic driver in various neurological disorders [15,16].
A prominent example is Huntington’s Disease (HD), a neurodegenerative disorder caused by an expansion of the CAG trinucleotide repeats in the huntingtin gene (Htt) [17,18]. The CAG trinucleotide encodes glutamine (Q) in the huntingtin protein (HTT), so, in the following, we use Q number and CAG repeat length interchangeably. The number of CAG repeats in Htt RNA is typically fewer than 36 in the wild-type (wt) [19]. Mutant Htt RNA, however, contains longer CAG repeats, which fold into stable hairpin structures [20, 21]. These aberrant secondary structures found in Htt RNA with expanded CAG repeat lengths are preferentially recognized by the RBP Midline 1 (MID1) (Fig 1A, known interaction 1), an E3 ubiquitin ligase tagging proteins for degradation [22,23]. This specific interaction between MID1 and mutant Htt RNA can thus potentially modify RNA metabolism.
(A) Schematic representation of MID1-dependent RNP complex formation. The legend indicates the color scheme used to represent different CAG repeat or glutamine (Q) lengths. Wild-type is defined as when the CAG repeat or glutamine length is less than 36. (B) Experimental in vitro and in vivo data taken from [20]. In the in vitro panels, the x-axis labels, e.g., Q17, indicate constructs with the corresponding number of CAG repeats, while in the in vivo panels, the subtitles, e.g., wt/wt, denote mouse genotypes. MID1 si-8 and MID1 si-9 refer to two independent small interfering RNA sequences specifically designed to knock down MID1 expression. The error bars represent the standard deviation of the data. An asterisk above a bar indicates a significant difference (p < 0.05) relative to the first bar in the subplot. All in vivo data originate from the same western blot experiment and are directly comparable across genotypes. Bars are absent in the wt/wt and Q111/Q111 subplots where the respective allele is not expressed.
Upon binding to Htt RNA, MID1 orchestrates the assembly of a translationally active ribonucleoprotein (RNP) complex that includes protein phosphatase 2A (PP2A) and S6 kinase (S6K) [21,24]. This interaction also occurs in a CAG repeat length dependent manner [20]: RNA with longer CAG repeat length (e.g., 111 CAG repeats) displays markedly higher affinity for MID1 and its associated complex compared to those with physiological repeat lengths (e.g., 20 CAG repeats) (Fig 1A, known interaction 2). The binding affinity is length-dependent because the CAG-containing RNA hairpin progressively elongates and stabilizes as the CAG repeat length increases, thereby providing a larger binding platform for MID1 [20]. Notably, knockdown of MID1 abolishes PP2A and S6K association with Htt RNA, establishing MID1 as a necessary scaffold for complex formation (Fig 1A, known interaction 3) [20].
These recruited complex compounds, PP2A and S6K, modulate phosphorylation states of translation regulators, thereby enhancing translation initiation of the mutant RNA. The downstream consequence of these interactions is an increase in translation efficiency from RNA with expanded repeats. This is evident in published data for in vitro and in vivo experiments (Fig 1B) [20]. In in vitro experiments, translation of Htt RNA constructs bearing high CAG repeat length yields substantially more protein (e.g., Q49, which indicates HTT with a polyglutamine stretch of 49 residues) than constructs with lower CAG repeat length (e.g., Q17), despite similar RNA levels. This effect is abolished upon MID1 knockdown using MID1-specific short interfering RNA (MID1 si-8 or si-9) for constructs with expanded repeats, while having minimal or no effect on low-repeat constructs—likely due to the weak binding affinity between MID1 and wt Htt RNA [20].
Furthermore, additional in vivo evidence from knock-in mouse models carrying the mutant Htt allele (Q111/+) suggests that simple binding interactions might not be sufficient to explain the observed data (Fig 1B in vivo panel). In both heterozygous and homozygous mutant mice, the mutant allele consistently produces higher levels of HTT than the wt allele, mirroring the in vitro findings. However, the data also reveals more complex protein level patterns that cannot be explained by a simple linear increase in MID1 binding. For example, in heterozygous mice, while the mutant allele shows significantly higher protein output than the wt allele as expected, the total amount of protein is comparable with that in homozygous wt mice. This suggests that MID1 binding alone does not fully account for the differences. Moreover, although homozygous mice carry twice as many mutant alleles as heterozygous mice, their HTT levels are more than double, indicating a supralinear increase that cannot be explained by allele dosage alone.
These experimental findings together demonstrate that MID1 increases the translation of RNA with expanded CAG repeats. While the role of MID1 in recruiting translation-associated kinases is well established, the quantitative link between these molecular interactions and the resulting differences in protein expression remains unresolved. In particular, it is still unclear how the difference in CAG repeat length, MID1 binding affinity, and co-factor recruitment jointly determine translation efficiency and drive the observed expression patterns. Furthermore, these observations indicate that other mechanisms introducing nonlinear dependencies, such as competitive or cooperative effects, may be at play beyond the simple binding affinity or allele dosage.
Mathematical modeling is a powerful approach to unravel such mechanisms. Ordinary differential equation (ODE) models have already proven to be powerful tools for elucidating the quantitative dynamics of RNA–protein interactions [25,26], as well as autoregulatory feedback loops of RBP [27]. Furthermore, ODE models have been used to investigate the role of the yeast RBP Puf3 in regulating RNA stability [28]. However, despite these advances, such quantitative frameworks have yet to be extensively applied to understand RBP-driven RNA translation dynamics in neurodegenerative diseases like HD.
Our work addresses this gap by developing an ODE model describing the influence of CAG repeat length on MID1 binding and the resulting HTT levels based on the data [20] described above (Fig 1B). We derive testable predictions, such as the impact of MID1 binding on Htt RNA stability. To evaluate these predictions, we conduct biological experiments. Furthermore, we use parameter estimation and model selection to assess which model best describes the available data. By integrating mathematical modeling with experimental data, our findings provide a new mechanistic understanding of how MID1 influences RNA regulation and offer potential targets for therapeutic intervention in disorders characterized by abnormal RNA stability and translation dynamics.
Results
MID1-dependent core molecular interactions alone do not account for the observed protein level patterns
To systematically uncover how MID1 influences HTT level, we consider previously published experimental data regarding several characteristic observations of HTT level in the presence of expanded CAG repeats (Fig 1B) [20]. These observations are:
- (Observation 1) HTT levels increase with the CAG repeat length.
- (Observation 2) MID1 level has a minimal effect at low CAG repeat lengths.
- (Observation 3) MID1 level modulates HTT levels at high CAG repeat lengths.
- (Observation 4) The wt allele produces a lower HTT level in wt/Q111 compared to wt/wt mice.
- (Observation 5) In wt/Q111 mice, the Q111 allele is preferentially translated.
- (Observation 6) Q111/Q111 mice show a supralinear increase in the HTT level from the Q111 allele in wt/Q111 mice.
To evaluate whether these characteristics can be explained solely by the core molecular interactions between Htt RNA, MID1, and downstream effectors, we develop a Baseline Model. This model captures the previously identified core molecular interactions (Fig 1A), including allele-specific MID1 binding, MID1-dependent recruitment of the S6K, phosphorylation-dependent translation, and protein degradation.
The Baseline Model considers two Htt RNAs (one from each allele; denoted RNA1, RNA2), unbound MID1, MID1-bound RNA complexes (MID1:RNA1, MID1:RNA2), unphosphorylated and phosphorylated S6K (S6K, S6KP), and the resulting HTT (HTT1, HTT2). The concentrations of all biochemical species in the model are encoded in the state vector:
The characteristics of RNA1 and RNA2, i.e., CAG repeat lengths q of each allele, q1 and q2, depend on the specific experimental context. For instance, for an experiment with heterozygous wt/Q111 mice, the RNA1 is the wt RNA and RNA2 is the Q111 RNA. As the model structure is symmetric with respect to RNA1 and RNA2, the order of the assignment will not influence results (see Sec. 1.1 in S1 Appendix).
The model accounts for four classes of core reactions, which are illustrated in SBGN [29,30] (Fig 2A):
(A) Visualization of the Baseline Model using the Systems Biology Graphical Notation (SBGN) language, showing the binding of MID1 and RNA has a downstream effect on S6K phosphorylation and subsequently translation. (B) Simulation results of the Baseline Model for the maximum likelihood estimate (red dot): only four out of six data observations are reproduced. The error bars represent the standard deviation of the data. An asterisk above a bar indicates a significant difference (p < 0.05) relative to the first bar in the subplot.
(R1) CAG length-dependent binding of MID1 to Htt RNA.
Htt RNA binds MID1 in a manner dependent on the CAG repeat length q of each allele, i.e., q1, q2. We implement this dependence upon q via a function l(q), which represents the length of the RNA hairpin structure to which MID1 binds. Based on in silico RNA secondary structure predictions [20], l(q) is approximately constant at low CAG repeat numbers and increases approximately linearly beyond a threshold repeat length (see Sec. 2. in S1 Appendix). Since MID1 is assumed to bind along this hairpin structure, a longer hairpin provides a larger binding interface and thus increases the effective association rate. We therefore model the forward binding rate by mass action kinetics with k1 proportional to l:
where , and the unbinding rate
is assumed to be independent of CAG repeat length. In the following text, k represents the reaction rate and c represents the reaction rate constant.
(R2) S6K phosphorylation modulated by MID1-bound RNA.
Binding of MID1 to Htt RNA promotes S6K phosphorylation and inhibits its dephosphorylation:
with the reaction rates:
where x is the state vector and K−2 is the inhibition constant for MID1:RNA mediated suppression of S6K dephosphorylation. Since S6K is neither produced nor degraded, the total pool of S6K is conserved, i.e., remains constant. Note that the [S6KP] modeled here only represents the concentration level modulated by MID1. We discuss how phosphorylated S6K, modulated by MID1 or not, affects translation rate in the following reaction (R3).
(R3) Translation as a function of S6KP abundance.
Translation occurs for both free and MID1-bound RNA and is positively regulated by phosphorylated S6K modulated by MID1:
with the reaction rates:
where c3 is the basal translation rate that includes all other sources of translational activity in the cell. For instance, c3 includes all other effects on the translation rate from phosphorylated S6K not directly modulated by MID1. We assume the effect from the S6KP modulated by MID1, , cannot be too large compared to the basal translation and constrain
to values between 1 and 100.
(R4) Degradation of HTT.
The fourth reaction is first-order degradation of HTT:
with the degradation rate .
The list of reactions was translated into an ODE model. The system of ODEs governing the dynamics of the Baseline Model is provided in Sec. 1.1 in S1 Appendix. This model and all subsequent models admit a unique solution that remains non-negative for all t ≥ 0 (see Sec. 3. in S1 Appendix). The initial RNA levels are assumed to be the same and are estimated using the parameter rna0. The initial MID1 level depends on specific experiment conditions, i.e., on the presence of siRNAs (see Sec. 4. in S1 Appendix). The initial S6K level is estimated using the parameter s6k0. All other initial conditions are assumed to be zero.
The CAG repeat length q is generally known for most experimental conditions. For instance, the experimental condition Q111/Q111 means that both alleles carry 111 CAG repeats, i.e., . However, for wt/Q111 and wt/wt settings, the CAG repeat length for the wt allele is not exactly known. We thus define the CAG repeat length of the wt allele in wt/Q111 and wt/wt settings to be an estimated parameter, Qwt (see Sec. 5. in S1 Appendix). The reaction rate constants, e.g., c1, are also unknown parameters.
To test whether the model can explain the data (Fig 1B), the model parameters are estimated by maximum likelihood estimation (see Materials and methods for details on parameter estimation). As the data provides information about the HTT abundance in steady state, we have the observables
with western blot gel-specific scaling factors sg, , capturing difference in detection and normalization effects.
Parameter estimation using a state-of-the-art pipeline provides reproducible results (see Sec. 1.1 in S1 Appendix). The comparison of data and model simulations reveals that four observations are captured (Fig 2B), i.e., Observation 1, Observation 2, Observation 3, and Observation 6. However, the model fails to explain two other essential observations of the data, i.e., Observation 4 and Observation 5. These discrepancies suggest that additional molecular mechanisms—beyond the core MID1-dependent molecular interactions encoded in the Baseline Model—are required to fully capture the observed protein level patterns.
Model-based analysis indicates different degradation rates for free and MID1-bound Htt RNA
To obtain a more consistent description of the experimental data, we adjust the Baseline Model with additional reactions. The Baseline Model assumes a constant translatable RNA resource, i.e., rna0. For a dynamical system with many interacting species, this simplified assumption is, however, rarely holds. Thus, instead of assuming a fixed total Htt RNA level for both alleles, we now explicitly model Htt RNA synthesis and degradation. The Extended Model retains the molecular species and interactions of the Baseline Model but introduces three additional reactions (Fig 3A):
(A) Extended Model builds on Baseline Model by incorporating Htt RNA synthesis and degradation. (B) The model captures additional Observation 5 compared to the Baseline Model, including higher translation of the Q111 allele in heterozygous mice. The error bars represent the standard deviation of the data. An asterisk above a bar indicates a significant difference (p < 0.05) relative to the first bar in the subplot.
(R5) Transcription of Htt RNA.
New RNA for both alleles is assumed to be transcribed at a constant rate:
The transcription rate for both alleles is given by .
(R6) Degradation of free Htt RNA.
Free Htt RNA degrades following first-order kinetics with a constant rate:
The degradation rate for both alleles is given by .
(R7) Degradation of MID1-bound Htt RNA.
MID1-bound Htt RNA degrades following first-order kinetics, independently of free Htt RNA, releasing MID1:
The degradation rate for both alleles is given by .
The initial condition, states, parameter set, and system of ODE governing the dynamics of the extended model are provided in S1 Appendix. The parameters of the model incorporating Htt RNA synthesis and degradation are determined using maximum likelihood estimation. The same setup as for the Baseline Model is applied, and the optimizer outputs indicate the optimization has converged (see Sec. 6. in S1 Appendix).
The analysis of the estimation results reveals that the Extended Model reproduces another important experimental observation compared to the Baseline Model. In particular, it correctly recapitulates Observation 5 in addition to the Observations captured by the Baseline Model (Fig 3B). However, the model still fails to reproduce Observation 4, where the wt allele produces a lower HTT level in wt/Q111 mice compared to in wt/wt mice.
Despite its limitations, the Extended Model now successfully fits Observation 5 and provides a mechanistic explanation for the higher Q111 protein level compared to the wt protein level in wt/Q111 mice. We analyze the steady state total RNA concentrations (free and MID1-bound) for both alleles and find a consistently higher total RNA level for the Q111 allele (see Sec. 7. in S1 Appendix). This asymmetry in RNA abundance, despite equal transcription rates, indicates that differences in degradation rates play a central role.
Indeed, our analysis of the inferred parameters shows that the two degradation processes (Fig 4A) have different rate constants. The degradation rate constant of MID1-bound RNA (c7) is lower than that of free RNA (c6) (Fig 4B). Importantly, no inequality constraint is imposed on c6 and c7 in the Extended Model: both parameters are estimated freely over wide ranges, and the relationship c6 > c7 emerges from the data. However, the confidence intervals of c6 and c7 individually span most of the search space, indicating poor practical identifiability of the individual parameters. More broadly, the Extended Model exhibits wide confidence intervals for most kinetic parameters, reflecting the challenge of constraining 25 model parameters with only 48 measurements at steady state representing 14 distinct experimental conditions (see S1 Table and Sec. 8. in S1 Appendix). Despite this, the two-dimensional confidence region reveals that the ratio of c6 and c7 is consistently estimated at : free RNA degrades approximately 2.5 times faster than MID1-bound RNA. Thus, while the individual rate constants are not identifiable, the key conclusion that MID1 binding reduces the degradation rate is robust across the entire confidence region. This finding implies that MID1 binding not only enhances translation efficiency via S6K phosphorylation but also stabilizes Htt RNA by protecting it from degradation.
(A) Illustration of free and MID1-bound RNA degradation pathways. (B) Scatter plot of (k6, k7) parameter pairs (log10 scale). The maximum likelihood estimate (red) and points along the optimizer path (gray scale values according to the objective function) are shown. Only parameter vectors within the 95% confidence region, as defined via the threshold, are included. The median ratio across all points in the confidence region is
(linear scale).
Experimental data validates MID1 mediated protection of Htt RNA against degradation
To validate the model-based prediction of an altered degradation rate, we conduct a new experiment and assess how Htt RNA degrades under two conditions: a control condition (without MID1 transfection) and a MID1 overexpression condition (Fig 5A). After the establishment of the cell culture, including the induction of a Htt allele with 83 CAG repeats (Q83) in the cell and transfection with plasmids coding for MID1, cells are treated with Actinomycin D to inhibit transcription. Subsequently, the relative RNA levels are monitored using qPCR.
(A) Experimental setup. (B) MID1 level comparison under control (without MID1 transfection) and MID1 overexpression conditions on day 3 (n = 6 biological replicates per condition). (C) Relative Htt RNA (with 83 CAG repeats) level within 2 hours under control (without MID1 transfection) and MID1 overexpression conditions (n = 6 biological replicates per condition). The error bars represent the standard deviation of the data. Asterisks indicate significant differences relative to the 0h time point within each condition (Bonferroni-corrected paired comparisons; represents p < 0.05, *** represent p < 0.001). Between-condition differences were assessed using Mann-Whitney U tests (1h: p = 0.008; 2h: p = 0.041).
The assessment of the MID1 levels three days after plasmid transfection confirms a substantial increase compared to the control condition (Fig 5B). The difference in MID1 levels results in different Htt RNA decay dynamics (Fig 5C). Under the control condition (without MID1 transfection), the Htt RNA level is reduced by 50% after two hours, indicating a half-life time of approximately 2 hours. Under the MID1 overexpression condition, the Htt RNA levels are stable over the same time interval, implying a substantially longer half-life. The difference between conditions is significant at both times (Mann-Whitney U test, p = 0.008 at 1 h, p = 0.041 at 2 h). Individual replicate values are provided in S2 Table.
The experimental data provides a validation of our model-based predictions, illustrating the significant functional role of MID1 in regulating RNA stability.
Clustering and local signaling explain observed HTT levels
The validation experiments show that the Extended Model can provide valuable, testable hypotheses. Yet, the model does not recapitulate Observation 4. As the underlying cause of this limitation is unclear, we employ model-based hypothesis testing.
Since the increased stability of MID1-bound RNA has been experimentally confirmed, we incorporate this finding as a constraint (k7 ≤ k6) that enforces it in all subsequent models as a starting point for the model-based hypothesis testing. To account for the constraint, we reformulate the degradation reaction as follows:
(R6R) Degradation of free Htt RNA.
Free Htt RNA degrades at a constant rate:
The degradation rate for both alleles is given by , where
. The parameter
encodes the stability difference between free and MID1-bound RNA.
In the following part, we explore three models developed upon the Extended Model, each introducing an additional mechanism to explain Observation 4 (the wt allele produces lower HTT levels in wt/Q111 mice than in wt/wt mice), which neither the Baseline Model nor the Extended Model could reproduce.
In the wt/Q111 setting, the mutant allele binds MID1 more strongly than the wt allele, driving stronger S6KP activity. If translation responds linearly to S6KP, this difference in activity would not be sufficient to suppress wt allele translation below its level in the wt/wt setting. However, if translation depends nonlinearly on S6KP, for instance, through a threshold-like response, then the redistribution of S6KP activity toward the mutant allele could markedly reduce wt translation efficiency. Such a nonlinearity might arise as translation is a multi-step process and might depend nonlinearly on the abundance of S6KP. We therefore assess a Hill-type formulation, where translation efficiency increases slowly at low S6KP concentrations but rises sharply beyond a certain threshold. The Extended Model with Nonlinear Translation retains the molecular species and interactions from the Extended Model but utilizes a Hill-type reaction rate for reaction (R3):
(R3H) Translation as a function of S6KP abundance.
Translation occurs for both free and MID1-bound RNA and is positively regulated by phosphorylated S6K:
with the reaction rates:
A second possibility we explore is a l-dependent clustering mechanism of MID1-bound RNA. If MID1-bound RNA is spatially segregated into local clusters, the stronger MID1 binding of the mutant allele would concentrate S6KP activity locally, depleting the global pool available to the wt allele and thereby suppressing its translation. Such spatial segregation is consistent with evidence that cytoplasmic RNA is often non-uniformly distributed, becoming enriched in discrete sites such as membraneless condensates [31], TIS granules [32], or translation factories [33], where local translation rates could differ markedly from those in the surrounding cytosol. A spatial enrichment can arise through a variety of mechanisms, including physical clustering of RNA (often associated with RNA granules) [31,34], targeted localization to translation-permissive compartments [32], or co-translational recruitment of related RNA through nascent chain interactions [35,36].
To capture this possibility, we model “clusters” in a generalized sense as any localized subpopulation of MID1-bound RNA, regardless of the precise mechanism underlying their enrichment, whose translation can be selectively modulated by locally elevated S6KP activity. Such compartmentalized regulation enables precise, context-dependent protein synthesis and may account for the nonlinear dynamics and allele-specific effects observed in the experimental data. The Extended Model with Clustering incorporates spatial structure implicitly through localized translation regulation. It assumes that MID1-bound Htt RNA can aggregate into spatially restricted molecular clusters, where they exert local influence on S6K phosphorylation inside the cluster. This localized S6K activation selectively enhances translation of clustered RNA without affecting global translation rates. Outside the cluster, translation is assumed to proceed as in the Extended Model. The Extended Model with Clustering introduces several new reactions (Fig 6), and expands the state vector to include the cluster-localized species ,
,
, and
. The additional and modified reactions are:
MID1:RNA complexes form a local cluster that selectively modulates nearby S6K phosphorylation and translation activity.
(R8) Cluster association and dissociation.
MID1-bound RNA can reversibly join clusters:
We consider a CAG repeat length dependent association rate for cluster entry, i.e., , and a constant dissociation rate
. In Sec. 9.1 in S1 Appendix, we show that models lacking CAG repeat length dependence (i.e.,
) fail to recapitulate the observations, suggesting that l dependence is a possible mechanism consistent with all observations.
(R2C) Local S6K phosphorylation.
Within the cluster, S6K is phosphorylated in proportion to the amount of MID1-bound RNA present:
The rates follow the same functional form as outside the cluster,
but are not identical: the S6K phosphorylation rate outside the cluster, which remains the same as in the Baseline Model and the Extended Model, depends on the MID1-bound RNA outside the cluster, whereas the cluster rate depends on the cluster-localized MID1-bound RNA.
(R3C) Translation inside clusters as a function of S6KP abundance.
MID1:RNA complexes in the clusters are translated in an S6KP-dependent manner,
with rate:
The translation outside the cluster remains the same as in Baseline Model and in the Extended Model.
(R7C) Degradation of the MID1:RNA in the cluster.
Both global and cluster-associated MID1:RNA species degrade at the same MID1-dependent rate , distinct from the degradation of unbound RNA.
As the two mechanisms, nonlinear translation and clustering, are not exclusive, we also consider the Extended Model with Clustering and Nonlinear Translation. The model is based on Extended Model with Clustering, with the translation outside the clusters changed to (R3H) and the translation inside is defined as the following:
(R3C, H) Translation inside clusters as a function of S6KP abundance.
MID1:RNA complexes in the clusters are translated in an S6KP-dependent manner,
with rate:
The ODE systems governing the dynamics of all three models are provided in Sec. 1. in S1 Appendix. All models respect allele symmetry: swapping the labeling of allele 1 and allele 2 does not change the model output.
The analysis of the estimation results reveals that the three new models with additional mechanisms progressively deepen the suppression of the wt HTT level in the wt/Q111 setting (Fig 7B-7D) compared to the Extended Model (Fig 7A). Such suppression is achieved when the wt allele HTT level in the wt/Q111 setting is less than half of the level in the wt/wt setting (the region below the half line is thus called the sublinear region). The threshold is at half of the HTT level in wt/wt because wt/Q111 has one instead of two wt alleles. The Extended Model is unable to reproduce such suppression, and in fact, the wt allele in the wt/Q111 setting has a higher HTT level. In the model with only nonlinear translation, the simulated wt HTT level in the wt/Q111 setting is still a bit higher than in the wt/wt setting and lies slightly above the half line, outside of the sublinear region. The model with clustering alone produces a downward shift, but combining both non-linearity and clustering yields the strongest suppression effect among the three models, leading the wt HTT level well inside the sublinear region.
The HTT levels of the wt allele are indicated for (A) the Extended Model, (B) the Extended Model with Nonlinear Translation, (C) the Extended Model with Clustering, (D) the Extended Model with Clustering and Nonlinear Translation. Compared to Extended Model, the other three models all improve the fitting regarding the difference in wt allele HTT level between in wt/wt mice and wt/Q111 mice, but only the models with clustering reproduce the sublinear suppression. The red dashed line represents half of the wt allele HTT level of the simulation in the wt/wt setting, and the gray area represents the sublinear HTT level region that is below the half. (E) The Extended Model with Clustering and Nonlinear Translation successfully reproduces all observations. The error bars represent the standard deviation of the data. An asterisk above a bar indicates a significant difference (p < 0.05) relative to the first bar in the subplot.
Looking at full fitting results of all three models, only the models with clustering can capture all observations (Fig 7E), while the model with only nonlinear translation does not fully reproduce Observation 4 (see Sec. 9. in S1 Appendix). Although Extended Model with Clustering also captures all observations, only the simulation results of Extended Model with Clustering and Nonlinear Translation are within the standard deviation of the data (Fig 7E).
To quantitatively compare the performance of all models we consider in this work, we calculate the Negative Log-likelihood (NLLH), the Akaike Information Criterion (AIC), its small-sample corrected version (AICc), and the Bayesian Information Criterion (BIC) for each model (Table 1). The Extended Model with Clustering and Nonlinear Translation achieves the best fit, as indicated by the lowest NLLH. However, this improved fit requires more free parameters compared to the Extended Model with Clustering, which leads to higher complexity penalties and thus worse AIC values. Because our dataset is small (), these penalties are further amplified in AICc and BIC, resulting in the Baseline Model scoring best under these criteria.
Simulation study demonstrates how nonlinear translation and clustering help capture all experimentally reported observations
As demonstrated in the last section, the Extended Model with Clustering and Nonlinear Translation can reproduce all observations, and its simulation results are within the standard deviation of the data. Specifically, the model can reproduce Observation 4 (the wt allele produces lower HTT levels in wt/Q111 compared to wt/wt mice) and Observation 6 (Q111/Q111 mice show a supralinear increase in HTT levels from the Q111 allele).
To understand how the model captures the allele-specific HTT level patterns, we examine how CAG repeat length shapes the steady state of the model species across the three in vivo settings (Fig 8A). As CAG repeat length increases from wt/wt to Q111/Q111, stronger MID1 binding shifts more RNA into MID1-bound and subsequently clustered states. This redistribution has two opposing effects on S6KP: the global pool decreases, as more MID1:RNA is sequestered into clusters and unavailable to drive global S6K phosphorylation, while the cluster pool increases due to locally elevated MID1:RNA concentrations.
(A) Comparison of steady state amounts of model species (bars) and interaction strengths in the cluster compartment (purple-gradient arrows) across wt/wt, wt/Q111, and Q111/Q111 settings. CAG repeat length–dependent clustering strength is shown for wild-type (red bar) and mutant (green pass) alleles. Amounts and interactions are from steady state simulations at the maximum likelihood estimate of the model; the Hill function of R3C,H is shown at the cluster center. (B) Steady state simulations for : left, translatable RNA in global and cluster pools; middle, phosphorylated S6K in both pools; right, HTT levels in each pool and in total. The color gradient represents the CAG repeat length in Htt RNA and the Q length in HTT. Curves in each subplot are normalized by the highest value of any quantity in the respective subplot for visual clarity.
These opposing effects, combined with the nonlinear Hill-type translation response, explain both key observations. For Observation 4, in the wt/Q111 setting, the wt allele resides predominantly in the global environment, while the mutant allele moves into the cluster, causing a reduction of S6KP. The Hill-type response amplifies this effect into a reduction in wt translation, reproducing the sublinear wt HTT level. For Observation 6, in the Q111/Q111 setting, both alleles are strongly clustered, and the locally elevated drives a sharp increase in cluster translation via the Hill function. The combined loss of global translation and gain of cluster translation produces the observed supralinear increase in Q111 HTT levels. Thus, CAG repeat length-dependent clustering coupled to a nonlinear translation response converts differences in MID1 binding into the allele-specific HTT level patterns observed across the three experimental settings.
To go beyond comparing only the limited number of experimental settings, we systematically explored how different CAG repeat lengths influence the steady state abundance of model species in the Extended Model with Clustering and Nonlinear Translation. Specifically, we conduct a simulation study with CAG repeat length between 6 and 120, i.e., and
. These simulations reveal that the model predicts that the patterns observed across the three in vivo settings (Fig 8A) are part of a continuous transition, in which translation shifts from being dominated by the global pool at low q to being dominated by the cluster pool at high q (Fig 8B, right). Such a shift is caused by two components, translatable RNA and MID1-mediated phosphorylated S6K. Specifically, as CAG repeat length increases, we observe a decrease of global translatable resources and an increase of cluster translatable resources as the resources move into the cluster (Fig 8B, left). Interestingly, [S6KP] does not follow a monotone decrease and has a sharp increase just after q = 20 CAG repeats (Fig 8B, middle). The sharp increase is due to the shape of the l function (see Sec. 10. in S1 Appendix). Together, those two components drive the HTT curve, global-produced or cluster-produced.
Discussion
The interaction between RNA and RBP plays a pivotal role in post-transcriptional gene regulation, governing processes such as RNA stability, localization, and translation. Disruptions to these interactions have been increasingly recognized as pathogenic mechanisms, with HD being an example [37]. Motivated by experimental data showing a CAG repeat length dependent increase in protein level, we construct a sequence of ODE models to bridge the molecular biochemistry of MID1 binding and S6K phosphorylation with the emergent allele-specific HTT level patterns observed in vitro and in vivo. The progression—from a minimal Baseline Model, through an explicit RNA turnover model (Extended Model), to its extensions incorporating nonlinear translation and clustering—allows us to pinpoint which mechanistic ingredients are required to reproduce the full experimental phenotype.
The Baseline Model, which encodes only the core known reactions, fails to account for more nuanced observations such as allele-specific HTT level asymmetries. The Extended Model extended the Baseline Model by adding RNA synthesis and decay. While still incomplete, the model suggests that differences in Htt RNA stability, with and without MID1 binding, could explain the preferential translation of the Q111 allele. Indeed, prior work has shown that MID1, besides stimulating translation, may affect RNA stability [38]. This prediction is experimentally confirmed in our work: elevated MID1 levels substantially increase the half-life of mutant Htt RNA compared to wt. This agreement between model and experiment establishes the Extended Model as not only explanatory but also predictive, offering mechanistic insights with direct biological validation.
The prolonged availability of Htt RNA due to increased stability through MID1 binding can substantially impact protein synthesis dynamics, potentially exacerbating the accumulation of mutant HTT, a hallmark of HD pathology. Thus, our findings not only reveal a previously unrecognized aspect of RNA regulation by MID1 but also offer valuable insights into the underlying molecular mechanisms contributing to disease progression. Identifying and characterizing this novel MID1-dependent stabilization pathway represents an important step toward developing targeted therapeutic interventions aimed at modulating mutant HTT level through MID1 interaction.
Neither the Baseline Model nor the Extended Model could explain the suppressed wt HTT level in wt/Q111 (Observation 4), indicating that additional mechanisms may be at play. Firstly, we treat translation downstream of S6KP as cooperative (Hill-type) in the Extended Model with Nonlinear Translation. This modification improves the fit to Observation 4 and suggests that S6KP may drive cooperative, threshold-like translational control.
Another extension replaces the well-mixed assumption with a spatially structured description: MID1-bound RNAs can form a local cluster, where S6K phosphorylation and translation are regulated by the local concentrations of MID1:RNA and S6K. This Extended Model with Clustering reflects accumulating evidence that many cytoplasmic RNAs reside in sub-cytoplasmic microdomains (e.g., TIS granules/rough endoplasmic reticulum) where translation differs from the surrounding cytosol [32].
The concept of a localized cluster with RBP has been studied as a mechanism for protein synthesis [33]. Our study extends this concept specifically to MID1-mediated clustering of Htt RNA. Indeed, MID1 has a known ability to anchor RNA to the microtubule cytoskeleton, facilitating such spatial organization [24,38]. Furthermore, as MID1 preferentially binds to RNA with expanded CAG repeats, it is plausible that such clustering is CAG repeat length dependent. This dependence has been shown in other work [39]. These findings suggest that subcellular localization and compartmentalized signaling may affect translation efficiency in the context of CAG repeat expansion. As such, disrupting RNA-protein clustering may present a promising therapeutic avenue for modulating allele-specific translation in HD and related disorders.
While the model prediction of CAG repeat length dependent MID1-bound RNA clustering is plausible, direct experimental validation would be required to distinguish clustering from alternative mechanisms such as local S6K activation at individual MID1:RNA complexes. Single-molecule RNA FISH [40] could reveal whether mutant Htt RNAs co-localize in discrete cytoplasmic foci in a CAG repeat length-dependent manner, and SunTag-based translation imaging [41] could additionally determine whether translation events are spatially clustered or dispersed. Ideally, the quantitative dependence on CAG-repeat number and MID1 levels should be assessed to recapitulate differences between patients. Such studies would offer critical insights into the spatial architecture of translation regulation in HD.
Finally, to compare the performance of all models, we compute information criteria such as AIC, AICc, and BIC. The Baseline Model achieves the best AICc and BIC scores, as these criteria penalize the additional parameters of the more complex models. However, the Baseline Model does not reproduce all observations. The Extended Model with Clustering and the Extended Model with Clustering and Nonlinear Translation achieve substantially lower NLLHs and are the only models that reproduce all observations, though at the cost of increased model complexity. These models therefore represent plausible mechanistic frameworks that can account for all observations, though the limited dataset does not allow us to draw definitive conclusions.
In addition to nonlinear translation and clustering, another non-exclusive driver of the more-than-doubling of HTT level in Q111/Q111 could be somatic CAG-repeat expansion. In HD, CAG repeat numbers can increase over time within individual neurons, especially in the striatum, creating mosaic populations with effective repeat lengths well above 111 [42]. Although not explicitly modeled here, for steady state data, this mechanism can be incorporated by estimating a higher-than-111 CAG repeat length.
Although the Extended Model with Clustering and Nonlinear Translation captures the key experimental trends, it remains a simplification. We currently represent a single effective cluster, whereas cells may contain multiple, compositionally distinct clusters that could shape translation in parallel. Likewise, we assume one MID1 per RNA, even though expanded Htt RNA could recruit multiple MID1 molecules, potentially amplifying clustering and introducing additional cooperativity not captured here. Moreover, some model assumptions lack direct experimental validation. For instance, several reaction rates, such as free RNA degradation, are assumed to be independent of CAG repeat length, as we are not aware of experimental evidence for such dependencies. Future experiments could clarify whether such dependencies exist and guide a more specific model formulation.
Beyond these structural assumptions, inference is further limited by data availability: while the dataset [20] comprises 48 measurements, these correspond to only 14 distinct experimental conditions. Of the 23–30 estimated parameters, 10 are noise and scaling parameters, leaving 13–20 kinetic parameters to be constrained by the 14 experimental conditions. The 5 noise parameters are directly informed by the replicates within each condition. Furthermore, the wt CAG length in the wt/wt and wt/Q111 cohorts was not reported, requiring inference of Qwt and adding uncertainty to parameter estimates. Indeed, profile likelihood analysis (see Sec. 8. in S1 Appendix) confirms that the majority of kinetic parameters are non-identifiable across all models. This limited identifiability cautions against favoring one mechanism over another, and we therefore consider all three Extended Model with extra components as possible explanations. Future studies with richer time- and dose-response data, explicit reporting of wt repeat lengths, and model extensions (e.g., multiple clusters or multivalent MID1 binding) are needed to improve identifiability and test robustness.
Our integrative modeling and experimental study uncovers a novel mechanistic role for MID1 in promoting both the stability and localized translation of mutant Htt RNA through spatial clustering. These findings not only provide insight into how RBP regulates pathogenic RNA behavior in HD but also highlight the critical role of subcellular localization in shaping translation outcomes. By challenging the traditional assumption of a spatially uniform intracellular environment, our work opens new directions for investigating compartmentalized post-transcriptional regulation and lays a foundation for future therapeutic strategies targeting RNA-protein clustering dynamics in neurodegenerative diseases.
Materials and methods
Experimental data
We consider previously published experimental data (Fig 2B) [20]. A description of the experimental protocols for this published data is in the original publications [20].
For the new validation experiment we conducted in this study, HEK293 cells stably expressing huntingtin exon1 with 83 CAG repeats in a Tet-off system [43] were seeded in poly-L-lysine coated 12 well plates at a density of 2.5 × 105 cells per well. Washing off doxycycline induced the expression of huntingtin exon1. Cells were then cultured for 24 hours and transfected with a plasmid (pCMVTag2A-MID1 [24]) to overexpress MID1. After 48 hours of incubation, the actinomycin D time course was started. For this purpose, 1 μ l actinomycin D (5 mg/ml; Merck) was added per 1 ml medium, and the cells were harvested after 0 h, 1 h, or 2 h. Cells were washed with PBS and harvested using a cell scraper, followed by total RNA isolation using the Monarch Total RNA Miniprep Kit (NEB). cDNA was synthesized using the TaqMan reverse transcription reagents kit (Applied Biosystems), and real-time PCR was carried out using the qPCRBIO SyGreen Mix (PCRBiosystems). The sequences of primers are specified in Table 2.
Mathematical formulation
We model the interaction among all the molecules using systems of ODEs. These ODE models are of the form:
with state vector at time t, parameter vector
, and input vector
that encodes the experimental conditions. The vector field
encodes the dynamics of the process and is constructed to be Lipschitz continuous. The function
defines the initial condition. By nx,
, and nu we denote the numbers of state variables, unknown mechanistic parameters, and known input variables, respectively.
Model observables are the measurable model quantities. They are defined as functions of the state vector x and the parameter vector
with . For notational simplicity, we assume that there is only one measurable quantity, i.e.,
. When the measurements are relative, meaning that they provide data only in a relative form rather than as absolute quantities, it is necessary to introduce scaling factors that scale the observables to the data. For an assumed additive and normally distributed measurement noise, i.e.,
, the measurement-to-observable relationship is given by:
with experimental conditions of each experiment , repetition index
, time point
, and noise standard deviation
. By ne, ni, and nt we denote the number of experiments, the number of data replicates and the number of time points. A dataset is defined as the set of all measurements,
Unknown model parameters are inferred from the measured data using maximum likelihood estimation. The likelihood function of observing the experimental data
given parameters
, s, and
is given by
in which the likelihood can be separated into a product of conditional probabilities of individual measurements because of the assumed pairwise independence of all measurements. For a Gaussian noise model, the conditional probability of observing an individual measurement given the observable value of the model
and the noise level
is given by
Instead of directly maximizing , it is equivalent and numerically often preferable to minimize the negative log-likelihood
. Assuming Gaussian noise, this objective function is given by
For better convergence and efficiency properties [44–46], the unknown model parameters are estimated hierarchically: scaling s and noise parameters of the models are estimated in the inner optimization problem, and all other model parameters
are estimated in the outer optimization problem. The hierarchical optimization problem is given by
in which the inner optimization problem can be solved analytically [46].
The selection of the bounds for the unknown model parameters is outlined in Sec. 11. in S1 Appendix.
Steady state computation
A steady state of an ODE model is defined as a state vector x* satisfying
i.e., all time derivatives vanish. Numerically, steady states can be obtained in different ways. One common approach is to directly solve the nonlinear system using a Newton-type root-finding method, which requires evaluating and factorizing the Jacobian
. Alternatively, the system can be integrated forward in time until the norm of
falls below a predefined tolerance, indicating proximity to a steady state. In practice, simulation tools, including the one used in this study, alternate between these steady state calculation approaches for increased robustness [47]. In all cases, suitable convergence criteria (e.g., relative and absolute tolerances) must be specified to determine when the system is sufficiently close to steady state.
Model selection criteria
We employ the Akaike Information Criterion (AIC) [48], its small-sample corrected version (AICc) [49], and the Bayesian Information Criterion (BIC) [50] as model selection metrics to assess the trade-off between goodness-of-fit and model complexity. These criteria are defined as:
where is the maximum likelihood value, and
is the number of data points.
Uncertainty quantification
Here, we define what it means for a parameter vector to lie in a confidence region of a certain significance. This is done using the likelihood-ratio test, in which the corresponding test statistic is defined as
In the asymptotic case of a large number of data points, the test statistic converges to a chi-square distribution with
degrees of freedom (see [51] for more details). Then the confidence region of significance
is defined as
in which is the
-th percentile of the
distribution. The one-dimensional projection of a confidence region for a specific parameter
of the parameter vector
is defined as the Confidence Interval (CI) of that parameter:
in which is the
-th percentile of the
distribution with one degree of freedom. Prediction uncertainty quantification can then be carried out by propagating parameter uncertainty using the CR to model predictions.
Implementation
We use PEtab [52] to define the parameter estimation problems and SBML [53] for standardized representations of the mathematical models. We create the SBML models using Tellurium’s Antimony [54]. Parameter optimization and uncertainty analysis are implemented using pyPESTO 0.5.6 [55], wherein we use AMICI 0.33.0 [47] for simulation and steady state calculation, and Fides 0.7.8 [56] for optimization. Each model is estimated using gradient-based multi-start local optimization with 50000 randomly sampled starting points. As is commonly done, most parameters are estimated on the logarithmic scale to uniformly explore parameter orders of magnitude [57].
Supporting information
S1 Appendix. Supplementary sections, figures, and tables.
Contains detailed model description, optimization setup, and complementary results.
https://doi.org/10.1371/journal.pcbi.1014342.s001
(PDF)
S1 Table. Parameter estimates and confidence intervals.
Maximum likelihood estimates and 95% profile likelihood confidence interval bounds for all parameters across all models.
https://doi.org/10.1371/journal.pcbi.1014342.s002
(XLSX)
S2 Table. Experimental replicate values for Fig 5.
Individual replicate measurements for MID1 expression levels (Fig 5B) and Htt RNA levels under control and MID1 overexpression conditions (Fig 5C).
https://doi.org/10.1371/journal.pcbi.1014342.s003
(XLSX)
Acknowledgments
We thank the members of the research group of Prof. Dr. Jan Hasenauer for valuable feedback on drafts. We acknowledge the Marvin and Unicorn clusters hosted by the University of Bonn.
References
- 1. Biffo S, Ruggero D, Santoro MM. The crosstalk between metabolism and translation. Cell Metab. 2024;36(9):1945–62. pmid:39232280
- 2. Wu B, Eliscovich C, Yoon YJ, Singer RH. Translation dynamics of single mRNAs in live cells and neurons. Science. 2016;352(6292):1430–5. pmid:27313041
- 3. Borisova E, Newman AG, Couce Iglesias M, Dannenberg R, Schaub T, Qin B, et al. Protein translation rate determines neocortical neuron fate. Nat Commun. 2024;15(1):4879. pmid:38849354
- 4. Ramat A, Haidar A, Garret C, Simonelig M. Spatial organization of translation and translational repression in two phases of germ granules. Nat Commun. 2024;15(1):8020. pmid:39271704
- 5. Dalla Costa I, Buchanan CN, Zdradzinski MD, Sahoo PK, Smith TP, Thames E, et al. The functional organization of axonal mRNA transport and translation. Nat Rev Neurosci. 2021;22(2):77–91. pmid:33288912
- 6. Susor A, Jansova D, Cerna R, Danylevska A, Anger M, Toralova T, et al. Temporal and spatial regulation of translation in the mammalian oocyte via the mTOR-eIF4F pathway. Nat Commun. 2015;6:6078. pmid:25629602
- 7. Kapur M, Monaghan CE, Ackerman SL. Regulation of mRNA Translation in Neurons-A Matter of Life and Death. Neuron. 2017;96(3):616–37. pmid:29096076
- 8. Fernandopulle MS, Lippincott-Schwartz J, Ward ME. RNA transport and local translation in neurodevelopmental and neurodegenerative disease. Nat Neurosci. 2021;24(5):622–32. pmid:33510479
- 9. Jia X, He X, Huang C, Li J, Dong Z, Liu K. Protein translation: biological processes and therapeutic strategies for human diseases. Signal Transduct Target Ther. 2024;9(1):44. pmid:38388452
- 10. Hentze MW, Castello A, Schwarzl T, Preiss T. A brave new world of RNA-binding proteins. Nat Rev Mol Cell Biol. 2018;19(5):327–41. pmid:29339797
- 11. Schneider-Lunitz V, Ruiz-Orera J, Hubner N, van Heesch S. Multifunctional RNA-binding proteins influence mRNA abundance and translational efficiency of distinct sets of target genes. PLOS Computational Biology. 2021;17(12):e1009658.
- 12. Glisovic T, Bachorik JL, Yong J, Dreyfuss G. RNA-binding proteins and post-transcriptional gene regulation. FEBS Lett. 2008;582(14):1977–86. pmid:18342629
- 13. Thelen MP, Kye MJ. The Role of RNA Binding Proteins for Local mRNA Translation: Implications in Neurological Disorders. Front Mol Biosci. 2020;6:161. pmid:32010708
- 14. Nussbacher JK, Batra R, Lagier-Tourenne C, Yeo GW. RNA-binding proteins in neurodegeneration: Seq and you shall receive. Trends Neurosci. 2015;38(4):226–36. pmid:25765321
- 15. Nussbacher JK, Tabet R, Yeo GW, Lagier-Tourenne C. Disruption of RNA Metabolism in Neurological Diseases and Emerging Therapeutic Interventions. Neuron. 2019;102(2):294–320. pmid:30998900
- 16. Wolozin B, Ivanov P. Stress granules and neurodegeneration. Nat Rev Neurosci. 2019;20(11):649–66. pmid:31582840
- 17. Mullari M, Fossat N, Skotte NH, Asenjo-Martinez A, Humphreys DT, Bukh J, et al. Characterising the RNA-binding protein atlas of the mammalian brain uncovers RBM5 misregulation in mouse models of Huntington’s disease. Nat Commun. 2023;14(1):4348. pmid:37468457
- 18. Nguyen TB, Miramontes R, Chillon-Marinas C, Maimon R, Vazquez-Sanchez S, Lau AL, et al. Aberrant splicing in Huntington’s disease accompanies disrupted TDP-43 activity and altered m6A RNA modification. Nat Neurosci. 2025;28(2):280–92. pmid:39762660
- 19. Walker FO. Huntington’s disease. Lancet. 2007;369(9557):218–28. pmid:17240289
- 20. Krauss S, Griesche N, Jastrzebska E, Chen C, Rutschow D, Achmüller C, et al. Translation of HTT mRNA with expanded CAG repeats is regulated by the MID1-PP2A protein complex. Nat Commun. 2013;4:1511. pmid:23443539
- 21. Griesche N, Schilling J, Weber S, Rohm M, Pesch V, Matthes F, et al. Regulation of mRNA Translation by MID1: A Common Mechanism of Expanded CAG Repeat RNAs. Front Cell Neurosci. 2016;10:226. pmid:27774050
- 22. Thapa P, Shanmugam N, Pokrzywa W. Ubiquitin Signaling Regulates RNA Biogenesis, Processing, and Metabolism. Bioessays. 2020;42(1):e1900171. pmid:31778250
- 23. Trockenbacher A, Suckow V, Foerster J, Winter J, Krauß S, Ropers HH, et al. MID1, mutated in Opitz syndrome, encodes an ubiquitin ligase that targets phosphatase 2A for degradation. Nat Genet. 2001;29(3):287–94. pmid:11685209
- 24. Aranda-Orgillés B, Trockenbacher A, Winter J, Aigner J, Köhler A, Jastrzebska E, et al. The Opitz syndrome gene product MID1 assembles a microtubule-associated ribonucleoprotein complex. Hum Genet. 2008;123(2):163–76. pmid:18172692
- 25. Horn T, Gosliga A, Li C, Enculescu M, Legewie S. Position-dependent effects of RNA-binding proteins in the context of co-transcriptional splicing. NPJ Syst Biol Appl. 2023;9(1):1. pmid:36653378
- 26. Nyayanit D, Gadgil CJ. Mathematical modeling of combinatorial regulation suggests that apparent positive regulation of targets by miRNA could be an artifact resulting from competition for mRNA. RNA. 2015;21(3):307–19. pmid:25576498
- 27. Clarke-Whittet M, Rocco A, Gerber AP. Parameterising Translational Feedback Models of Autoregulatory RNA-Binding Proteins in Saccharomyces cerevisiae. Microorganisms. 2022;10(2):340. pmid:35208795
- 28. Jiang H, Xu L, Wang Z, Keene J, Gu Z. Coordinating expression of RNA binding proteins with their mRNA targets. Sci Rep. 2014;4:7175. pmid:25417751
- 29. Balci H, Siper MC, Saleh N, Safarli I, Roy L, Kilicarslan M, et al. Newt: a comprehensive web-based tool for viewing, constructing and analyzing biological maps. Bioinformatics. 2021;37(10):1475–7. pmid:33010165
- 30. Sari M, Bahceci I, Dogrusoz U, Sumer SO, Aksoy BA, Babur Ö, et al. SBGNViz: A Tool for Visualization and Complexity Management of SBGN Process Description Maps. PLoS One. 2015;10(6):e0128985. pmid:26030594
- 31. Ma W, Mayr C. A Membraneless Organelle Associated with the Endoplasmic Reticulum Enables 3’UTR-Mediated Protein-Protein Interactions. Cell. 2018;175(6):1492-1506.e19. pmid:30449617
- 32. Horste EL, Fansler MM, Cai T, Chen X, Mitschka S, Zhen G, et al. Subcytoplasmic location of translation controls protein output. Mol Cell. 2023;83(24):4509-4523.e11. pmid:38134885
- 33. Pichon X, Bastide A, Safieddine A, Chouaib R, Samacoits A, Basyuk E, et al. Visualization of single endogenous polysomes reveals the dynamics of translation in live human cells. J Cell Biol. 2016;214(6):769–81. pmid:27597760
- 34. Khong A, Matheny T, Jain S, Mitchell SF, Wheeler JR, Parker R. The Stress Granule Transcriptome Reveals Principles of mRNA Accumulation in Stress Granules. Mol Cell. 2017;68(4):808-820.e5. pmid:29129640
- 35. Tutucci E, Vera M, Biswas J, Garcia J, Parker R, Singer RH. An improved MS2 system for accurate reporting of the mRNA life cycle. Nat Methods. 2018;15(1):81–9. pmid:29131164
- 36. Shade Q, Estrada M, Kramer G, Bukau B, Shiber A. Distinct cis-acting elements mediate co-localization of mRNAs encoding for co-translational interactors in cytoplasmic clusters. bioRxiv. 2025.
- 37. Conlon EG, Manley JL. RNA-binding proteins in neurodegeneration: mechanisms in aggregate. Genes Dev. 2017;31(15):1509–28. pmid:28912172
- 38. Aranda-Orgillés B, Rutschow D, Zeller R, Karagiannidis AI, Köhler A, Chen C, et al. Protein phosphatase 2A (PP2A)-specific ubiquitin ligase MID1 is a sequence-dependent regulator of translation efficiency controlling 3-phosphoinositide-dependent protein kinase-1 (PDPK-1). J Biol Chem. 2011;286(46):39945–57. pmid:21930711
- 39. Mahendran TS, Wadsworth GM, Singh A, Gupta R, Banerjee PR. Homotypic RNA clustering accompanies a liquid-to-solid transition inside the core of multi-component biomolecular condensates. Nat Chem. 2025;17(8):1236–46. pmid:40603603
- 40. Femino AM, Fay FS, Fogarty K, Singer RH. Visualization of single RNA transcripts in situ. Science. 1998;280(5363):585–90. pmid:9554849
- 41. Wang C, Han B, Zhou R, Zhuang X. Real-Time Imaging of Translation on Single mRNA Transcripts in Live Cells. Cell. 2016;165(4):990–1001. pmid:27153499
- 42. Handsaker RE, Kashin S, Reed NM, Tan S, Lee W-S, McDonald TM, et al. Long somatic DNA-repeat expansion drives neurodegeneration in Huntington’s disease. Cell. 2025;188(3):623-639.e19. pmid:39824182
- 43. Waelter S, Boeddrich A, Lurz R, Scherzinger E, Lueder G, Lehrach H, et al. Accumulation of mutant huntingtin fragments in aggresome-like inclusion bodies as a result of insufficient protein degradation. Mol Biol Cell. 2001;12(5):1393–407. pmid:11359930
- 44.
Weber P, Hasenauer J, Allgöwer F, Radde N. Parameter estimation and identifiability of biological networks using relative data. In: IFAC Proceedings Volumes, 2011. 11648–53.
- 45. Loos C, Moeller K, Fröhlich F, Hucho T, Hasenauer J. A Hierarchical, Data-Driven Approach to Modeling Single-Cell Populations Predicts Latent Causes of Cell-To-Cell Variability. Cell Syst. 2018;6(5):593-603.e13. pmid:29730254
- 46. Schmiester L, Schälte Y, Fröhlich F, Hasenauer J, Weindl D. Efficient Parameterization of Large-Scale Dynamic Models Based on Relative Measurements. Bioinformatics. 2019;36(2):594–602.
- 47. Fröhlich F, Weindl D, Schälte Y, Pathirana D, Paszkowski Ł, Lines GT, et al. AMICI: high-performance sensitivity analysis for large ordinary differential equation models. Bioinformatics. 2021;37(20):3676–7. pmid:33821950
- 48.
Akaike H. Information Theory and an Extension of the Maximum Likelihood Principle. Akademiai Kiado. 1973. 267–81.
- 49. Hurvich CM, Tsai C-L. Regression and time series model selection in small samples. Biometrika. 1989;76(2):297–307.
- 50. Schwarz G. Estimating the Dimension of a Model. Ann Statist. 1978;6(2).
- 51. Tönsing C, Steiert B, Timmer J, Kreutz C. Likelihood-ratio test statistic for the finite-sample case in nonlinear ordinary differential equation models. PLoS Comput Biol. 2023;19(9):e1011417. pmid:37738254
- 52. Schmiester L, Schälte Y, Bergmann FT, Camba T, Dudkin E, Egert J, et al. PEtab-Interoperable specification of parameter estimation problems in systems biology. PLoS Comput Biol. 2021;17(1):e1008646. pmid:33497393
- 53. Hucka M, Finney A, Sauro HM, Bolouri H, Doyle JC, Kitano H, et al. The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics. 2003;19(4):524–31. pmid:12611808
- 54. Choi K, Medley JK, König M, Stocking K, Smith L, Gu S, et al. Tellurium: An extensible python-based modeling environment for systems and synthetic biology. Biosystems. 2018;171:74–9. pmid:30053414
- 55. Schälte Y, Fröhlich F, Jost PJ, Vanhoefer J, Pathirana D, Stapor P, et al. pyPESTO: a modular and scalable tool for parameter estimation for dynamic models. Bioinformatics. 2023;39(11):btad711. pmid:37995297
- 56. Fröhlich F, Sorger PK. Fides: Reliable trust-region optimization for parameter estimation of ordinary differential equation models. PLoS Comput Biol. 2022;18(7):e1010322. pmid:35830470
- 57. Hass H, Loos C, Raimúndez-Álvarez E, Timmer J, Hasenauer J, Kreutz C. Benchmark problems for dynamic modeling of intracellular processes. Bioinformatics. 2019;35(17):3073–82. pmid:30624608