Figures
Abstract
Understanding how the assembly of nascent mRNA into a ribonucleoprotein (mRNP) influences R-loop homeostasis is crucial for gaining insight into the cellular mechanisms that prevent genome instability. Here, we identify three RNA-binding proteins, Rie1, Rim4 and She2, whose expression levels are important to limit R-loop accumulation and, thus, to prevent DNA damage. Interestingly, Rim4 and She2 are overrepresented in CBP80-containing mRNPs formed in the absence of THO. In addition, we found that an excess of the RNA exosome component Dis3 impairs its function, promoting R-loops, particularly from non-coding RNAs, which cause genomic instability. Our results indicate that changes in the availability of different RBPs or RNAs, causes R-loop-mediated DNA damage in the cell. These results may help to understand the mechanism that promotes cancer, as several RBPs are overexpressed in different types of tumors.
Author summary
R-loops are three-stranded structures formed by a DNA-RNA hybrid and the displaced ssDNA. The accumulation of R-loops damages the genome, highlighting the need for their tight regulation. One of the mechanisms that prevents R-loop formation is the co-transcriptional assembly of the nascent RNA into a messenger ribonucleoprotein (mRNP). Our study reveals that not only the absence of mRNP components, but also the overexpression of specific RNA-binding proteins (RBPs) or the persistence of unstable transcripts, results in R-loop accumulation and consequent DNA damage. R-loops are involved in a number of neurodegenerative diseases and cancer, highlighting the importance of understanding the cellular processes that mitigate their accumulation.
Citation: Mérida-Cerro JA, Chevreux G, Palancade B, Rondón AG, Aguilera A (2025) Cellular imbalance of specific RNA-binding proteins associates with harmful R-loops. PLoS Genet 21(7): e1011491. https://doi.org/10.1371/journal.pgen.1011491
Editor: Monica P. Colaiácovo, Harvard Medical School, UNITED STATES OF AMERICA
Received: November 8, 2024; Accepted: June 3, 2025; Published: July 2, 2025
Copyright: © 2025 Mérida-Cerro et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The authors confirm that all data underlying the findings are fully available without restriction. All relevant data are within the paper and its Supporting Information files. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD057248.
Funding: Research was funded by the following grants: PID2019-104270GB-I00 funded by Ministerio de Ciencia e Innovación/Agencia Estatal de Investigación/10.13039/501100011033 and FIUS22/0178 from Fundación de Investigación de la Universidad de Sevilla to AA; P18-FR-655 funded by Consejería de Universidad, Investigación de la Junta de Andalucía and by the European Union to AA and AGR; PP2024/00000588 funded by the University of Seville to AGR; ANR-21-CE12-0040 funded by Agence Nationale pour la Recherche to BP and PJA2022070005310 funded by Fondation ARC pour la recherche contre le Cancer to BP. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
R-loops are non-B DNA structures formed by a DNA-RNA hybrid and the displaced single-stranded DNA (ssDNA). Although R-loops appear naturally in cells and play important roles in physiological processes like class-switch recombination, mitochondrial DNA replication and telomere maintenance, their accumulation can lead to genome instability by interfering with essential processes such as transcription, replication, chromosome segregation and DNA damage repair [1–5]. In order to preserve their genomes, cells have developed several mechanisms to avoid R-loop formation and to resolve them. R-loops are prevented by the combined action of topoisomerases, which remove the negative supercoiling accumulated behind RNA polymerases (RNAPs), and the cotranscriptional assembly of nascent RNA into a mature messenger ribonucleoprotein (mRNP). Both activities reduce the capacity of RNA to tether back to DNA outside the RNAP, either by restraining DNA melting or by reducing RNA availability. Once formed, R-loops are resolved either by RNase H-mediated degradation of the RNA moiety or by the unwinding of the DNA-RNA hybrid through the action of multiple DNA-RNA helicases [5,6]. Defects in these processes might contribute to the development of cancer and human neurodegenerative diseases associated with the accumulation of R-loops [7].
The impact of mRNP assembly on R-loop homeostasis was initially described in mutants of the THO complex, a major mRNP component, and later extended to splicing factors and other mRNP factors [8]. THO was initially identified in yeast as a complex formed by Tho2, Hpr1, Mft1 and Thp2 [9] later on expanded to include Tex1 [10]. Structural analyses revealed that in yeast THO is a dimer [10–12]. During mRNP assembly, THO binds the DEAD-box helicase Sub2 and the mRNA export adaptor Yra1 [11–13], contributing to mRNP compaction and facilitating mRNA export [11,14]. Sub2 ATPase activity is enhanced through its interaction with THO [15]. Based on the phenotypes observed in sub2 mutants and in cells depleted of its human counterpart, UAP56, removal of DNA-RNA hybrids formed co-transcriptionally is one of the many functions of Sub2 [16,17]. The role of THO in assembling the mRNP seems to be conserved in humans [14]. It has been hypothesized that mRNPs formed in the absence of THO would be less compacted, rendering the nascent RNA more susceptible to reannealing with DNA to form an R-loop [18,19].
The accumulation of DNA-RNA hybrids is not only caused by the absence but also by the excess of certain RNA-binding proteins (RBPs). Thus, elevating the cellular concentration of the RNA-binding factor Yra1 increases R-loops, probably due to the stabilization of the hybrids, given that Yra1 binds to DNA-RNA duplexes in vitro and it is recruited to genomic regions with R-loops in vivo in an RNase H-sensitive manner [20]. Additionally, Npl3, a factor involved in mRNP assembly and export that was previously reported to prevent R-loop formation [21], has recently been shown to bind and stabilize TERRA R-loops at short telomeres [22]. Therefore, RBPs could influence R-loops homeostasis either inhibiting the formation of DNA-RNA hybrids or stabilizing them once formed.
Despite the recent advances in elucidating mRNP composition and structure, the specific changes within mRNPs responsible for DNA-RNA hybrids formation remain unclear. To understand how mRNP assembly restricts R-loops, we analysed Cbp80-containing mRNPs formed in the absence of the THO complex and found an increased presence of various RBPs and nuclear pore components. Moreover, we noticed that overexpressing the RBPs Rie1, Rim4 and She2 or altering the activity of the Dis3 RNA exosome subunit promotes R-loop accumulation in the cell, consequently causing DNA damage. Therefore, proper cellular expression levels of RBPs are crucial for preventing R-loop-mediated genome instability.
Results
mRNP proteome is altered in hpr1 cells
To investigate the impact of mRNP biogenesis on R-loop homeostasis, we examined how mRNP composition changes in cells that accumulate R-loops. We chose hpr1∆ as a representative R-loop accumulating mutant and we undertook a quantitative proteomic approach to compare mRNP composition in the absence (hpr1∆) or presence (WT) of the THO complex. We isolated mRNPs from WT or hpr1∆ cells by pulling down a tagged version of Cbp80, a component of the nuclear cap-binding complex (CBC), under conditions that preserve mRNP integrity [23]. Immunoprecipitates were analyzed by SDS-PAGE stained with a fluorescent dye and by mass spectrometry (Figs 1A and S1A). We have previously reported that the proteome of CBP pull-downs is consistent with the capture of nuclear mRNPs, as evidenced by the RNA-dependent association of most regular mRNP components, as assessed both by mass spectrometry analysis and western blotting upon RNase A treatment [24]. Consistently, our proteomic analysis detected a wide range of nuclear RNA-binding proteins: spliceosome components, RNA 3’-end processing and export factors, and RNA helicases among others (Fig 1A and S1 Table). Although the profile and intensity of the bands in the denaturing gel did not change significantly between WT and hpr1∆ (S1A Fig), the quantitative proteomic analysis identified consistent changes. First, mRNPs isolated from hpr1∆ cells are depleted not only of this subunit but of the entire THO complex, confirming that deletion of one subunit disassembled the whole complex [25] (Fig 1A). Interestingly, Sub2 is not reduced, suggesting that although THO might contribute to Sub2 loading, it might be able to bind independently or with the help of other factors that operate in the absence of THO. This idea is supported by Sub2 overexpression suppressing the defects in tho mutants [16]. Among the differences found in the mRNP composition of hpr1∆ cells, we noticed an increase in nuclear pore components (Nsp1, Seh1 and Nup57) and RBPs (Rim4, She2, Yra1, Sbp1 and Npl3). The increased binding to nucleoporins is consistent with the previously reported accumulation of mRNPs at nuclear pores in the absence of THO [26,27] and agrees with R-loops being redirected to the nuclear periphery [26].
(A) Volcano plot of mRNP proteins in a hpr1Δ strain (H) (YAR015-4C) compared with a WT (W)(YDZ7005-16). Each dot represents a protein, with red indicating significant reduction and green indicating significant enrichment in hpr1Δ compared to WT. (B) Sensitivity of WT (YBP249) and hpr1Δ (HPBAR1-R) strains to the overexpression of NPL3 (pYES-NPL3); RIM4 (pYES-RIM4), SBP1 (pYES-SBP1), SHE2 (pYES-SHE2) or YRA1 (pYES-YRA1Δi). Cells were plated in SC media containing either 2% glucose or 2% galactose in order to repress or induce the overexpression. (C) Percentage of S/G2 cells with Rad52-YFP foci in a wild type (YBP249) overexpressing the indicated proteins. Representative images are shown. (D) Percentage of S/G2 cells with Rad52-YFP foci in a wild type (YBP249) overexpressing RIM4 (RIM4 o.e.), or SHE2 (SHE2 o.e.) combined (+) or not (-) with the overexpression of RNase H. Fold change and p-value mean of 4 samples are plotted for (A). Mean and SD of 3 samples are plotted for (C) and (D). *p ≤ 0.05; **p ≤ 0.01; ***p ≤ 0.001; ****p ≤ 0.0001 (two-tailed Student’s t-test).
Yra1 expression is tightly regulated in the cell, and its overexpression is toxic [28,29], especially in mutants that accumulate R-loops [20]. Indeed, YRA1 overexpression causes R-loop-mediated DNA damage, which is detected as an increase in RNase H-sensitive Rad52 foci [20,30]. As Yra1 is overrepresented in hpr1∆ mRNPs (Fig 1A), we examined whether the other enriched RPBs, Rim4, She2, Sbp1, and Npl3, could also affect R-loop homeostasis. To do so, we first assessed hpr1∆ cell fitness when overexpressing each of these RBPs individually. We observed that only SHE2 overexpression caused a mild growth defect in hpr1∆. RIM4 and SBP1 overexpression are already toxic in wild-type cells complicating the detection of a stronger growth defect in hpr1∆ (Fig 1B). Next, we measured the accumulation of Rad52 foci as a readout of DNA damage. Interestingly, overexpression of SHE2 and RIM4 significantly increased Rad52 foci, similarly to YRA1 (Fig 1C). We confirmed that although Npl3 and Sbp1 do not increase DNA damage, they are overexpressed to a similar level as She2 or Rim4, respectively (S1B and S1C Fig). When RNase H was simultaneously overexpressed, DNA damage was reduced to wild-type levels (Fig 1D) although the level of Rim4 or She2 remain unchanged (S1D Fig). Taken together, these results suggest that an excess of She2 and Rim4 in the mRNP or within the cell might disrupt R-loop homeostasis.
Upregulation of four different RBPs induces genotoxic R-loops
Given the increase of certain RBPs in mRNPs assembled without the THO complex, along with the fact that several of them cause RNase H-sensitive DNA damage when overexpressed (Fig 1D), we asked whether overexpressing other RBPs could cause R-loop accumulation. To address this, we performed a screen for genes that reduce growth when overexpressed in hpr1∆ cells, a phenotype previously described for one of these RBPs, Yra1 [20] and observed for She2 (Fig 1B). We used a diploid hpr1∆/HPR1-AID strain expressing a degron version of HPR1 that induces Hpr1 depletion through auxin addition. We transformed it with the MW90 multicopy library and examined its growth in the absence of Hpr1. Overexpression of YRA1 (Yra1∆i) was included as positive control [20]. We isolated two plasmids that caused a mild growth defect: clones 1 (C1) and 2 (C2) (Fig 2A). Next, we assessed whether expression of clone 1 or 2 increased R-loop-dependent DNA damage by quantifying Rad52 foci in cells overexpressing or not RNase H. Both clones increased Rad52 foci similarly to YRA1 overexpression, and this effect was suppressed by RNase H overexpression, consistent with an accumulation in DNA-RNA hybrids (Fig 2B). The first clone (C1) contained a fragment of chr XV including two tRNA genes (SUF17 and SUP3), TAT2, DIS3 and the putative gene YOL019W-A (S2A Fig). Interestingly, Dis3 is an endo-exonuclease that forms part of the RNA exosome, a complex that degrades RNA [31]. To assess whether overexpression of DIS3, but not the other genes cloned in C1, is detrimental to hpr1 cells, we truncated the C1 clone by removing SUF17 and part of (C1del.A) or the entire DIS3 gene (C1del.B) (S2A Fig). Neither C1del.A nor C1del.B expression caused DNA damage. However, overexpressing DIS3 from a multicopy plasmid increased Rad52 foci (S2C Fig and Fig 2C) and caused cell sickness in hpr1∆ cells (Fig 2E). We concluded that an excess of Dis3, rather than the tRNAs or Tat2 protein, is responsible for hpr1∆ sickness.
(A) Growth of diploid hpr1/HPR1AID (HPR1DGK) strain Hpr1 depleted (+Auxin) or non-depleted (-Auxin) transformed with YEp351 (WT), MW90-clone1 (C1) or MW90-clone2 (C2), YEpYRA1Δi (yra1Δi). (B) Percentage of S/G2 cells with Rad52-YFP foci in a wild-type (YBP249) strain overexpressing C1 or C2, combined (+) or not (-) with overexpression of RNase H. (C) Percentage of S/G2 cells with Rad52-YFP foci in a wild type strain (YBP249) overexpressing the indicated clones or DIS3 (DIS3 o.e.). (D) Percentage of S/G2 cells with Rad52-YFP foci in a wild type strain (YBP249) overexpressing C2, RIE1 (RIE1 o.e.) or YRA1 (yra1Δi). (E) Sensitivity of WT (YBP249) and hpr1Δ (HPBAR1-R) to elevated levels of DIS3 (DIS3 o.e.) or RIE1 (RIE1 o.e.). (F) Percentage of S/G2 cells with Rad52-YFP foci in a wild type strain (YBP249) overexpressing DIS3 or RIE1, combined (+) or not (-) with overexpression of RNase H. Mean and SD of ≥3 samples are plotted for (B), (C), (D) and (F). *p ≤ 0.05; **p ≤ 0.01; ***p ≤ 0.001; ****p ≤ 0.0001 (two-tailed Student’s t-test).
The other clone identified (C2) had a 7.2 kb insert of chr VII including MGA1 and RIE1 (YGR250c) genes (S2B Fig). Interestingly, Rie1 is an RNA-binding factor with three RNA recognition motifs (RRMs). Thus, we examined whether RIE1 overexpression was responsible for the increase in DNA damage. An excess of Rie1 increased Rad52 foci in the cells to a similar level than C2 plasmid (S2C and 2D Figs) and strongly decreased fitness in wild-type cells, making difficult to observe any additional effect in hpr1∆ cells (Fig 2E). When we overexpressed RNase H, DIS3 or RIE1 expression was not affected but Rad52 foci decreased (S2D and 2F Figs), suggesting that an excess of Dis3 or Rie1 alters DNA-RNA hybrid homeostasis in the cell. Consistent with these results, Rie1 is overrepresented in Hpr1-depleted mRNPs and, although Dis3 was not detected in the proteomic analysis, several RNA exosome subunits preferentially bind to mRNPs in the absence of THO (S1 Table). Taken together the results obtained in the proteomic and the genetic approaches, we identified four RBPs that cause R-loop dependent DNA damage when overexpressed: Dis3, Rie1, Rim4 and She2. Therefore, not only the removal of mRNP components but also their overexpression increases genomic damage, most probably favoring R-loop accumulation.
To directly examine whether overexpression of DIS3, RIE1, RIM4 or SHE2 increases R-loops, we performed an immunofluorescence assay in yeast chromosome spreads using the S9.6 antibody, which recognizes DNA-RNA hybrids and with less affinity dsRNA [32]. We observed DNA-RNA hybrids in 17–21% of the cells upon overexpression of the genes analyzed, compared to 9% without it. Simultaneous overexpression of RNase H in vivo reduced the S9.6 signal, confirming the specificity of the signal for DNA-RNA hybrids (Fig 3A). We then investigated the localization of these R-loops in the genome by DNA-RNA immunoprecipitation (DRIP) followed by qPCR at GCN4 and ribosomal DNA 18S, two regions previously validated for R-loop detection in yeast [20]. We only observed a significant enrichment of hybrids in these regions upon Rie1 or Rim4 upregulation (Fig 3B), but as Dis3 and She2 overexpression also increased the S9.6 signal in the immunofluorescence experiments (Fig 3A), which provides a global view of R-loop accumulation in the genome, we tested other genomic regions functionally connected to these factors. For Dis3, we looked at NLR021w, NEL025c, and NHR027c genes encoding cryptic unstable transcripts (CUTs) that are degraded by the RNA exosome [33], whereas for She2 we examined ASH1, EAR1 and IST2 genes whose transcripts contain RNA secondary structures specifically recognized by She2 [34,35]. We observed a significant accumulation of DNA-RNA hybrids in CUTs encoding genes when Dis3 is in excess (Fig 3C) and in genes whose transcripts are recognized by She2 when this RBP is overexpressed (Fig 3D), confirming that R-loops also accumulate upon overexpression of these factors. Taken together, our results imply that altering the cellular levels of Rim4, Rie1, She2 or the RNA exosome subunit Dis3, causes R-loop accumulation. Interestingly, whereas Rim4 and Rie1 seem to have a general effect, She2 and Dis3 action is preferentially directed to specific genes whose transcripts are targets of these factors.
(A) Representative pictures and percentage of nuclei with S9.6 signal from a wild type strain (YBP249) overexpressing DIS3, RIE1, RIM4 or SHE2, combined (+) or not (-) with overexpression of RNase H. Data from more than 100 total cells per experiment are shown. (B) DRIP using S9.6 antibody in GCN4 and rDNA 18S genes in YBP249 WT strain overexpressing DIS3, RIE1, RIM4 or SHE2 either non-treated (RNH-) or in vitro treated (RNH+) with RNase H. (C) DRIP in NLR021w, NEL025c and NHR027c CUTs in YBP249 overexpressing DIS3 (DIS3 o.e.) either non-treated (RNH-) or in vitro treated (RNH+) with RNase H. (D) DRIP in ASH1, EAR1 and IST2 genes in YBP249 overexpressing SHE2 (SHE2 o.e.) either non-treated (RNH-) or in vitro treated (RNH+) with RNase H. Mean and SD of 3 samples are plotted for (A), (B), (C) and (D). *p ≤ 0.05; **p ≤ 0.01; ***p ≤ 0.001; ****p ≤ 0.0001 (two-tailed Student’s t-test).
Cellular dysfunction of Dis3 and Rim4 induce harmful R-loops
The expression of each protein in the cell is optimized to favor specific interactions and to preserve the stoichiometry of the complexes. Thus, we reasoned that the drastic increase caused by expressing these RBPs from multiple copies could be disrupting their activity. To test this idea, we assessed whether mutating RIM4, SHE2, DIS3 or RIE1 would increase DNA-RNA hybrids, as previously observed with their overexpression (Fig 3A). By immunofluorescence on chromosome spreads, we observed a significant increase in the percentage of nuclei with S9.6 signal in dis3–1 (24%) and rim4∆ (27%) mutants, similar to the rnh1∆ mutant lacking RNase H1 (25%) (Fig 4A), and DIS3 and RIM4 overexpression (Fig 3A). In contrast, neither she2∆ nor rie1∆ deletion caused hybrid accumulation (Fig 4A). Consistent with the immunostaining data, we detected DNA damage, measured as Rad52 foci accumulation, in dis3–1 and rim4∆ but not in she2∆ or rie1∆ cells (Fig 4B). These experiments suggest that expressing from a multicopy vector DIS3 and RIM4 might disrupt the function of these factors mimicking the dis3 and rim4∆ mutations.
(A) Representative pictures and percentage of nuclei with S9.6 signal from BY4741 (WT), BDIS3-1 (dis3-1), BRIE1D (rie1Δ), BRIM4D (rim4Δ), BSHE2D (she2Δ) and BRNH1D (rnh1Δ) strains. (B) Percentage of S/G2 cells with Rad52-YFP foci in BY4741 (WT), BDIS3-1 (dis3-1), BRIE1D (rie1Δ), BRIM4D (rim4Δ) and BSHE2D (she2Δ) strains. Representative images are shown. (C) Representative pictures and percentage of nuclei with S9.6 signal from BY4741 (WT), BDIS3-1 (dis3-1) and BRIM4D (rim4Δ) strains overexpressing (RNH+) or not (RNH-) RNase H. (D) Percentage of S/G2 cells with Rad52-YFP foci in BY4741 (WT), BDIS3-1 (dis3-1) and BRIM4D (rim4Δ) overexpressing (RNH+) or not (RNH-) RNase H. (E) Northern blot of 5.8S rRNA from BY4741 transformed with YEp351 (WT) or YEpDIS3 (DIS3 o.e.) or BDIS3-1 (dis3-1) strain. Unprocessed 7S rRNA, processed 5.8S L and S forms and intermediaries rRNAs are marked. (F) Diagram of RIM4 and rim4∆488-718. Lysates of a WT strain (YBP249) overexpressing (+) or not (-) Rim4 and Rim4∆488-718 were analyzed by Western blot. (G) Sensitivity of a WT strain (YBP249) to overexpression of RIM4 (pYES-RIM4) or the truncated rim4∆488-718 version (pYES-RIM4∆488-718). (H) Percentage of S/G2 cells with Rad52-YFP foci in a wild type (YBP249) overexpressing Rim4 or Rim4∆488-718. (I) DRIP using S9.6 antibody in GCN4 (left) and rDNA 18S (right) genes in a wild type (YBP249) overexpressing full-length (RIM4) or truncated RIM4 (rim4∆488-718) either non-treated (RNH-) or in vitro treated (RNH+) with RNase H. Mean and SD of ≥3 samples are plotted for (A), (B), (C), (D), (H) and (I). *p ≤ 0.05; **p ≤ 0.01; ***p ≤ 0.001; ****p ≤ 0.0001 (two-tailed Student’s t-test).
Next, we addressed the mechanism by which Dis3 and Rim4 prevent R-loop accumulation. First, we confirmed that the increase in S9.6 signal observed in dis3–1 and rim4∆ is indeed an increase in DNA-RNA hybrids by analyzing RNase H sensitivity. We observed that the number of nuclei with S9.6 signal was significantly reduced by in vivo RNase H overexpression (Fig 4C). Moreover, Rad52 foci were also partially suppressed by RNase H (Fig 4D). Therefore, we confirmed that dis3–1 and rim4∆ mutations caused R-loops accumulation in the cell leading to DNA damage.
Given that DIS3 forms part of the RNA exosome [31], we wondered whether increasing DIS3 expression in the cell was affecting the function of this complex. RNA exosome depletion was previously described to affect 7S rRNA 3’-end processing into mature 5.8S rRNA [36,37]. Therefore, we analyzed 5.8 rRNA maturation by Northern blot in cells overexpressing DIS3 and compared it with the dis3–1 mutant. In wild-type cells, we only detected the 7S precursor and the completely processed 5.8L and 5.8S forms. However, in the RNA exosome mutant or upon DIS3 overexpression, we also observed intermediate forms of varying sizes (Fig 4E), confirming RNA exosome dysfunction and the stabilization of transient RNAs. We reasoned that expressing DIS3 from a multicopy vector might overload the cell with this component, potentially disrupting RNA exosome function and stabilizing short-lived RNAs favoring DNA-RNA hybrids.
Rim4 contains three RNA recognition motifs at the N-terminal region and a prion domain at the C-terminal part of the protein, which is involved in formation of amyloid-like aggregates (Fig 4F) [38]. Because an excess of this protein causes similar phenotypes to the loss, we wondered whether the overexpression might trigger protein aggregation, thereby depleting the cell of functional Rim4. To test this idea, we overexpressed a truncated version of RIM4, missing the 271 C-terminal amino acids (rim4∆448–718) (Fig 4F), previously reported not to form aggregates in vivo [38]. We confirmed that the truncated protein was overexpressed to the same level as the full-length protein (Fig 4F). However, removing the C-terminal IDR and prion domains restored cell viability (Fig 4G) and suppressed the DNA damage observed when the full-length protein is overexpressed (Fig 4H). DRIP analysis confirmed that whereas overexpression of the full protein increased DNA-RNA hybrids at GCN4 and 18S rDNA, overexpression of rim4∆448–718 presented similar levels to the wild-type strain in both genes (Fig 4I). The data suggest that an excess of Rim4 triggers its aggregation, causing a depletion of functional Rim4 in the cells.
Our data suggest that the increase of R-loops caused by an excess of Rim4 and Dis3 may result from the disruption of their physiological functions in the cell.
Nuclear accumulation of She2 or Rie1 alters R-loop homeostasis
Finally, we sought to understand how overexpression of Rie1 and She2 could interfere with R-loop homeostasis, leading to their accumulation. Initially, we wondered whether She2 or Rie1 accumulate in the nucleus when present in excess. This is especially relevant for Rie1, previously described as a cytoplasmic protein [39], but also important for She2, which shuttles between both compartments [40]. We used strains with the endogenous copy of SHE2 or RIE1 tagged with YFP to analyze the localization of these RBPs under normal conditions and we transformed these strains with multicopy plasmids with YFP-tagged versions of SHE2 or RIE1 to overexpress them. As expected, we observed a significant increase in either Rie1 or She2 in cells transformed with the overexpression plasmids (Fig 5A). We quantified the YFP signal in cells overexpressing or not SHE2 or RIE1 and plotted the nucleus/cytoplasm ratio (Fig 5A). We confirmed that Rie1 localized predominantly in the cytoplasm (ratio< 1), but upon overexpression, we observed that it entered the nucleus becoming equally distributed in both compartments (ratio ≅ 1). Conversely, She2 was initially distributed throughout the cell (ratio ≅ 1), and tended to accumulate in the nucleus when present in excess (ratio >1) (Fig 5A).
(A) In vivo localization of Rie1-YFP or She2-YFP in BY4741 overexpressing or not these factors. DNA was stained with DAPI. Ratio of the YFP average intensity detected in the nucleus and the cytoplasm. (B) Localization of poly(A)+ RNA by FISH with Cy3-labeled oligo(dT) in BY4741 overexpressing SHE2 (SHE2 o.e.) and in WMC1-1A strains (mex67-5). DNA was stained with DAPI. A detail of a single cell is highlighted. (C) ChIP using anti-HA antibody in WRIE1HA transformed with pYES2 (WT) or pYES-RIE1HA (RIE1 o.e.) overexpressing (RNH+) or not (RNH-) RNase H. GCN4, ASH1, 18S and HXT1 genes were analysed. (D) As in (C) but in WSHE2HA strain with SHE2-HA overexpression. Individual values and median of 3 samples are plotted for (A) (two-tailed Mann-Whitney t-test). Mean and SD of 3 samples are plotted for (C) and (D) (two-tailed Student’s t-test) *p ≤ 0.05; **p ≤ 0.01; ****p ≤ 0.0001.
Since we have previously reported that mutation in the major mRNA export factor Mex67 increases genomic instability [16], and considering that She2 is involved in transferring specific mRNAs to the yeast bud tip [41,42], we next investigated whether She2 accumulation in the nucleus could induce promiscuous interactions that affected general mRNA export. To explore this, we performed fluorescence in situ hybridization (FISH) using an oligo(dT)-Cy3 probe to detect polyadenylated transcripts. As a control, we included a mutant of MEX67. We observed that cellular mRNA distribution did not change upon SHE2 overexpression, in contrast to control mex67–5 cells (Fig 5B). Thus, despite accumulating in the nucleus, an excess of She2 does not alter bulk mRNA export to the cytoplasm.
A different possibility could be that the accumulation of Rie1 and She2 in the nucleus might cause aberrant binding to R-loops, as we previously observed with the overexpression of Yra1 [20]. To address this possibility, we overexpressed an HA-tagged version of Rie1 or She2 in strains where the endogenous copy is also tagged (S1C and S2C Fig). By Chromatin Immunoprecipitation analysis, we observed that Rie1 was recruited to chromatin when in excess. However, this interaction was not transcription-dependent as Rie1 was also bound to the repressed gene HXT1. In addition, the interaction of Rie1 with chromatin is R-loop-independent, as it persisted in the presence of RNase H (Fig 5C). In contrast, She2 localized to GCN4 and ASH1, but not to HXT1, in a hybrid-dependent manner (Fig 5D), indicating that She2 accumulation in the nucleus may be at R-loop-forming loci, potentially stabilizing transient DNA-RNA hybrids.
Our observations suggest that R-loop homeostasis can be challenged by the increased nuclear localization of She2 and Rie1, which may induce aberrant interactions with either transiently formed R-loops or chromatin, ultimately favoring R-loop accumulation.
Discussion
A major source of mRNP-dependent genome instability are unscheduled R-loops, which disrupt transcription and replication, causing DNA damage [2,6]. Our data provide evidence that an excess of the RNA exosome nuclease Dis3 or the RBPs Rim4, She2 and Rie1 can increase R-loops, leading to DNA damage. The mechanisms by which these proteins promote R-loop accumulation vary, highlighting the different cellular functions that control R-loop homeostasis. Thus, overproduction of Dis3 causes RNA exosome malfunction, resulting in the stabilization of short-life transcripts that may be more prone to hybridize with DNA. Alternatively, Rie1 or She2 overexpression increases their nuclear concentration, causing aberrant interactions with chromatin or nascent mRNPs that in the case of She2 are R-loop mediated. Finally, the amyloid-like protein Rim4, at physiological levels, seems to protect cells from R-loops through a yet uncharacterized mechanism, being more frequently recruited to R-loop prone mRNPs (Fig 1A). Our study, therefore, suggests that precise control of RBP availability is crucial for maintaining genome integrity. We propose that mRNP assembly could be challenged not only by specific RBP depletion, but also by its overexpression. Regardless of the mechanism, if mRNPs are not properly formed, R-loops tend to accumulate in the cell, leading to DNA damage.
The THO complex was the first mRNP component implicated in R-loop prevention, a role later extended to other RBPs [6]. As RNA-binding factors, we proposed that defective mRNP assembly might allow RNA to reanneal with template DNA [18,25]. To determine the nature of these putative aberrant mRNPs prone to forming R-loops, we analyzed their composition by quantitative proteomics, comparing mRNPs in the presence and absence of THO (Figs 1A and S1A). In addition to a marked reduction in the five THO subunits, which confirms they form a structural unit independent of Sub2 and Yra1, in agreement with recent findings [11], we observed a significant increase in several RBPs (Yra1, Npl3, Rim4, She2 and Sbp1) and in nucleoporins (Seh1, Nup57 and Nsp1) (Fig 1A). Several mechanisms could underlie the changes in the mRNP components observed in THO-complex mutants. The increase in RBPs observed could result from nonspecific binding to a less compacted mRNP. Alternatively, in hpr1∆ cells, mRNA release from the transcription site, co-transcriptional assembly with proteins, and export out of the nucleus may be kinetically affected so that different stages of nuclear mRNA metabolism are captured as compared to wild-type cells, in line with earlier studies [13,27]. Finally, higher levels of DNA-RNA hybrids in hpr1∆ cells may further impact the outcome of mRNP pull-outs, since chromatin-bound nascent mRNPs may associate with hybrid-binding proteins. This later possibility could apply to Yra1 and She2, both of which are overrepresented in mRNPs without THO (Fig 1A). We have previously shown that Yra1 interacts with DNA-RNA hybrids in vitro and when present in excess in cells, binds to R-loop-prone genomic regions in an RNase H sensitive manner, increasing genome instability [20,30]. Increased Yra1 association with RNAs engaged in R-loops may account for its capture in hpr1∆ pull-downs (Fig 1A), possibly explaining the difference with a published report detecting decreased Yra1 levels in single molecule analyses of free mRNPs isolated from THO-complex mutant cells [43]. Similarly, overexpressed She2 is recruited to chromatin in a RNase H-sensitive manner and causes R-loop accumulation and DNA damage (Figs 1D, 3A, 3D and 5D) opening the possibility that She2 might bind to and/or stabilize unwanted DNA-RNA duplexes. In the case of the observed increase in nucleoporins, it may be a consequence of the previously reported interaction of THO-deficient mRNPs with the nuclear pore [27] or the recently described relocation of R-loops to the nuclear pores [26]. Therefore, although other RBPs may partially compensate for the absence of the THO complex in mRNPs, they may inadvertently promote the accumulation of DNA-RNA hybrids and their retention at the nuclear envelope.
The identification of Dis3 as a defense against R-loops has revealed another aspect of mRNP metabolism that may contribute to R-loop homeostasis. Interestingly, overexpression of DIS3 alters the function of the RNA exosome, probably by unbalancing the stoichiometry of the complex, leading to accumulation of intermediates in rDNA maturation (Fig 4E). RNA exosome dysfunction, whether due to DIS3 overexpression or mutation, stabilizes RNAs especially non-coding species, increasing the likelihood of RNA hybridizing to DNA. Consistent with this, the TRAMP component Trf4, which helps the RNA exosome to degrade RNAs, protects cells from R-loop-triggered genome instability and the lack of the nuclear RNA exosome nuclease Rrp6 or other TRAMP components increases genome instability [44–46]. The role of the RNA exosome in preventing R-loops is conserved in humans, where DIS3 loss enhances R-loop mediated genome instability [47]. Moreover, increasing ncRNA synthesis, by human SPT6 depletion, or its stabilization, by RNA exosome depletion, enhances R-loops [48,49]. This supports the idea that ncRNAs are prone to hybridizing with DNA if they are not rapidly degraded. However, other indirect mechanisms leading to R-loops cannot be excluded. For instance, RNA accumulation due to RNA exosome malfunction might deplete cells of RBPs, as recently reported for the Nrd1-Nab3-Sen1 complex involved in non-coding transcription termination [50]. Additionally, the absence of the RNA exosome could affect Senataxin, an DNA-RNA helicase involved in hybrid resolution, with which the RNA exosome interacts [51].
Interestingly, two factors that lead to the accumulation of R-loops when overexpressed, Rie1 and Rim4, contain intrinsically disordered regions (IDRs). These unstructured domains enable beneficial interactions with proteins and nucleic acids and were recently proposed to contribute to mRNP compaction [11,52]. However, when present at abnormal levels, they can promote undesirable contacts with non-physiological partners. This might be the case for Rie1, which, when in excess, enters the nucleus and binds to chromatin in a transcription and hybrid independent manner (Fig 5C), altering the chromatin environment in a yet uncharacterized way that enhances R-loops. In the case of Rim4, an important distinction is that, unlike Rie1, both mutation and overexpression lead to R-loop accumulation. Given that the IDR of Rim4 functions as a prion-like domain, capable of folding into amyloid-like aggregates [38], and that this prion-like domain is responsible for Rim4 toxicity and the accumulation of R-loops when Rim4 is overexpressed (Fig 4F-4I), we hypothesize that Rim4 accumulation might trigger its aggregation, which disrupt its normal function in preventing R-loops. There are precedents for this in humans, where TDP-43 and FUS, two RBPs with prion-like domains [53], also promote R-loop accumulation when mutated [54–56]. Overexpression of TDP-43 and mutations in FUS that lead to the formation of cytoplasmic aggregates, depleting the nucleus of functional protein, contributes to the neurodegenerative diseases amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) [57,58]. Although it remains to be determined whether R-loops play a direct role in the development of these diseases, it would be interesting to know how IDR-containing RBPs influence R-loop homeostasis.
In this study, we found that an excess of She2 and Rie1, or lack of a functional RNA exosome or Rim4, increases DNA damage by enhancing R-loops. The mechanism underlying R-loop increment could vary, ranging from increased RNA stability to promiscuous interactions that promote or stabilize R-loops. In line with these diverse mechanisms, several distinct factors contribute to prevent this stress, a list that is still increasing since THO was initially identified, and includes not only structural mRNP components like Npl3, but also processing factors like Rna14 or RNA helicases like Sen1 [6,45,59], and has now been expanded to the RNA exosome, She2 and the IDR-containing proteins Rie1 and Rim4. Taken together, these observations suggest that mRNP biogenesis is a delicate and precise process, and stressing it by removing or increasing specific factors, causes R-loop mediated DNA damage. This might be particularly relevant in natural situations like aging or diseases such as cancer where the level of multiple RBPs is altered, leading to cellular stress [60,61]. RBPs may thus represent a promising class of potential therapeutic targets, not only in line with their known roles in genome expression and alternative splicing, but also in view of their impact on R-loops and genetic stability. Modulating R-loop levels could be important in diseases associated with excess of these structures such as cancer, autoimmune disorders, and neurodegenerative diseases, as increasing R-loops by stressing mRNP biogenesis could result in cell death in combination with inhibition of the DNA damage response, in line with anti-cancer therapeutic approaches [62]. Our work, thus, supports the existence of an additional level of R-loop control that relies on an optimal balance of RBPs that could open new perspectives for therapeutic strategies for R-loop-associated diseases.
Materials and methods
Yeast strains and media
Yeast strains used in this study are derivatives of W303 (MATa his3–11,15 leu2–3,112 Δtrp1 ura3–1, ade2–1 can1–100) or BY4741 (MATa his3∆ leu2∆0 met15∆ ura3∆0) and are listed in S2 Table.
YAR015-4C was generated by genetic cross of YDZ7005–16 and SCHY58A. HPR1DGK is a diploid strain generated by mating HPR1-D2 and SCHY58A. WRIEHA and WSHEHA were generated from YBP249 by fusion in the C-terminal region of the RIE1 or SHE2 ORFs (respectively) of a 3HA::ADH1::kanMX6 cassette amplified by PCR from pFA6a-3HA-kanMX6 plasmid, using the primers RIE1-HA fw, RIE1-HA rv, SHE2-HA fw, SHE2-HA rv and RECOMB rv.
Media used in this study: YPAD (1% yeast extract, 2% bacto-peptone, 2% glucose, and 20 mg/ml adenine), Synthetic defined [63] (0,17% yeast nitrogen base without amino acids, 0,5% ammonium sulfate, supplemented with amino acids), synthetic complete (SC) (SD with 2% glucose, 2% galactose or 2% raffinose). Solid media was prepared by adding 2% agar.
For serial dilution growth assays, mid-log cultures were grown in selective media. Ten-fold serial dilutions of the culture were done using sterile water and 5 μl of each dilution was plated and grown for 2–3 days at 30°C. For Hpr1 depletion (HPR1DGK), 1 mM of auxin (1-Naphthaleneacetic acid, NAA, from Sigma) was added to the media. Cultures were shifted to 37ºC for 2h to analyse dis3–1 (BDIS3–1) mutant and for 1h to analyze the mex67–5 (WMC1-1A) under non-permissive conditions. Yeast strains were defrosted from glycerol stocks and grown at 30°C using standard procedures.
Plasmids
Plasmids used in this study are listed in S3 Table.
pYES-NPL3, pYES-RIE1, pYES-RIM4, pYES-SBP1, pYES-SHE2 and pYES-YFP were generated by cloning of PCR cassettes amplified from their respective ORFs (or from pRS413-GAL::YRA1Δi or pWJ1344) into pYES2 multicloning site, using the listed primers: NPL3-XhoI fw, NPL3-XhoI rv, RIE1-KpnI fw, RIE1-EcoRI rv, RIM4-KpnI fw, RIM4-SacI rv, SBP1-HindIII fw, SBP1-XbaI rv, SHE2-XhoI fw, SHE2-XhoI rv, YFP fw and YFP-EcoRI rv. pYES-YRA1Δi was generated by cloning of the BamHI fragment of pRS413-GAL::YRA1Δi containing YRA1 gene without intron into pYES2 BamHI site. pYES-RIM4∆448–718 was generated by deletion of the last C-terminal 813 pb of RIM4 ORF in pYES-RIM4 using a PCR product amplified from pFA6a-3HA-kanMX6 plasmid using RIM4∆448–718-HA fw and RIM4-HA rv primers. pYES-RIE1YFP and pYES-SHE2YFP were obtained by fusion of their respective ORFs to YFP by a two-step PCR from pWJ1344, pYES-RIE1 and pYES-SHE2 using the primers RIE1-KpnI fw, RIE1-PH rv, SHE2-KpnI fw, SHE2-PH rv, YFP fw, YFP-EcoRI rv and cloned into KpnI/EcoRI sites of pYES2. pYES-RIE1HA and pYES-SHE2HA, were generated by fusion of 3xHA in the C-terminal end of their ORFs by introducing in pYES-RIE1 and pYES-SHE2 a PCR fragment obtained from WRIEHA and WSHEHA using RIE1comp fw, SHE2comp fw and HAtag rv primers. pYES-SBP1HA, pYES-RIM4HA and pYES-RIM4∆448–718HA were generated by fusion of 3xHA in the C-terminal end of their ORFs by introducing in their corresponding pYES vectors a PCR fragment obtained from pYES2-SHE2HA using SBP1-HA fw, SBP1-HA rv, RIM4-HA fw, RIM4-HA rv or RIM4∆-HA rv primers. YEpDIS3 was obtained by cloning into YEp351 SacI/SphI site a PCR product containing DIS3 ORF plus 457pb upstream and 126pb downstream, amplified from YPB249 using DIS3-SacI fw and DIS3-SphI rv primers. YEpDIS3-HA was created by cloning in YEpDIS3 3xHA tag at C-terminal end obtained by PCR from WDISHA genomic DNA using DIS3Sal fw and DIS3-HA rv primers. YEpYRA1Di was generated by cloning of the BamHI fragment of pRS413-GAL::YRA1Δi containing YRA1 gene without intron into YEp351 BamHI site. MW90-C1A and MW90-C1B were obtained by enzymatic digestion using XbaI or BamHI (respectively) and religation of the plasmid MW90-C1. Primer sequences are listed in S4 Table.
mRNP purification
mRNP purification from cells expressing Cbp80-proteinA was performed essentially as described [23]. Briefly, we grew cells in 3L of YPAD media at 30ºC to OD 1-1.2, collected them and washed them and resuspend them in a volume of 20mM Hepes pH7.4 1.2% PVP-40. Cells were extruded from a syringe into liquid nitrogen to quickly freeze them and grinded in a planetary mill (Pulverisette, Fritsch) in a 80 ml vase with 4 balls 3 times 500rpm 3 minutes with 1 min reverse. The jars were place in liquid nitrogen between cycles to keep the cells frozen. 2 gr of frozen cell powder were use per immunoprecipitation. Frozen cells were resuspended in nine volumes of extraction buffer (20 mM Hepes pH 7.4, 110 mM KOAc, 75 mM NaCl, 0.1% Tween-20, 0,5% Triton X-100, 1mM PMSF, 1X protease inhibitors cocktail complete EDTA-free and 1:5000 antifoam B 1:5000 RNasein). The resulting extract was clarified by filtering it through 1.6 μm GD/X Glass Microfiber syringe filters (25mm, Whatman) and incubated for 30 min at 4°C with IgG-conjugated magnetic beads. Beads were then washed four times with extraction buffer and once with 0.1 M NH4OAc, 0.1 mM MgCl2, 0.02% Tween-20. Bound-complexes were eluted with 0.5M NH4OH, 0.5 mM EDTA, lyophilized and resuspended either in SDS-sample buffer for SDS-PAGE or in 25 mM ammonium carbonate for mass spectrometry analysis.
Mass spectrometry
Digestion was performed overnight at 37°C in the presence of 12.5 μg/ml of sequencing grade trypsin (Promega). Digests were analyzed by a LTQ Velos Orbitrap (Thermo Fisher Scientific) coupled to an Easy nano-LC Proxeon system (Thermo Fisher Scientific). Chromatographic separation of peptides was performed with the following parameters: column Easy Column Proxeon C18 (10 cm, 75 μm i.d., 120 Å), 300nl/min flow, gradient rising from 95% solvent A (water - 0.1% formic acid) to 25% B (100% acetonitrile, 0.1% formic acid) in 20 min, then to 45% B in 40 min and finally to 80% B in 10 min. Peptides were analyzed in the Orbitrap in full ion scan mode at a resolution of 30,000 and a mass range of 400–1800 m/z. Fragments were obtained with a collision-induced dissociation (CID) activation with a collisional energy of 40%, an activation Q of 0.25 for 10 ms, and analyzed in the LTQ. MS/MS data were acquired in a data dependent mode in which 20 most intense precursor ions were isolated, with a dynamic exclusion of 20 seconds and an exclusion mass width of 10 ppm. Label Free quantitation was performed using Progenesis QI for proteomics software version 4.2 (Waters). The software was allowed to automatically align data to a common reference chromatogram to minimize missing values. Then, the default peak-picking settings were used to detect features in the raw MS files and a most suitable reference was chosen by the software for normalization of data following the normalization to all proteins method. A between-subject experiment design was chosen to create groups of four biological replicates, i.e., control (no tag), CBP80-pA wt and CPB80-pA hpr1∆. MS/MS spectra were exported and searched against the SwissProt database with the S.cerevisiae taxonomy using Proteome Discoverer 1.4 software (ThermoFisher Scientific) using a search engine node for an in-house Mascot server version 2.4.1 (Matrix Science). A maximum of 2 missed cleavage sites was authorized. Precursor and fragment mass tolerances were set to respectively 7 ppm and 0.5 Da. The following modifications were considered in mass calculation: oxidation (M), phosphorylations (STY), acetylation (Nterm, K), deamidation (NQ). The maximum number of missed cleavages was limited to 2 for trypsin digestion. Peptide spectrum-matches (PSMs) were filtered using a 1% False Discovery Rate (FDR). Identification results were then imported into Progenesis to convert peptide-based data to protein expression data using the Hi-3 based protein quantification method. Data were then processed using multivariate statistics to evidence differentially enriched proteins meeting the following criteria: at least two unique peptides, Fold Change higher than 2, ANOVA p-values lower than 0.05 and power higher than 0.8. Proteins significantly enriched in either CBP80-pA samples relative to the no tag control were further compared for their presence in wt vs. hpr1∆ mRNPs.
Rad52-YFP foci detection
Spontaneous Rad52-YFP foci formation from mid-log growing cultures were visualized by fluorescence microscopy in a Leica DM6000 B microscope at x63 magnification, as described [64]. Cultures were fixed with 2.5% formaldehyde prior to visualization. At least 200 S/G2 cells were manually counted for each experiment.
Nuclear protein localization
Mid-log cultures growing overnight in 2% galactose SC media were fixed and nuclei were stained with DAPI. Images of the YFP-fused proteins and DAPI were visualized and acquired using a Leica DM6 B microscope equipped with a DFC390 camera (Leica) at x100 magnification using the LAS AF software (Leica). YFP Average intensity was measured in the nucleus and the cytoplasm of each cell using Metamorph Offline version 7.8.13.0 (Molecular Devices, LLC) software. At least 150 cells where quantified per replicate.
Chromosome spreads immunostaining of DNA-RNA hybrids
Surface spreading of yeast nuclei were performed as previously described [65]. DNA-RNA hybrids were immunostained with S9.6 antibody and chicken anti-mouse Alexa Fluor 594 conjugated secondary antibody. Nuclei were stained with DAPI. At least 100 nuclei per replicate were acquired using a Leica DM6 B microscope equipped with a DFC390 camera (Leica) at x100 magnification using the LAS AF software (Leica) to obtain the percentage of nuclei with detectable S9.6 signal.
Fluorescence in situ hybridization (FISH)
mRNA export from mid-log cultures grown in 2% galactose SC media was measured as previously described [66] with minor modifications. Fixed cells were spheroplasted with 2mg 20T zymoliase (USBiological) at 30ºC during 15 min., washed and extended in poly-L-Lysine (Sigma-Aldrich) coated slides. Samples were hybridized for 3 h with 200 ng oligodT (40xT) Cy3-labeled (Sigma-Aldrich) in hybridization buffer (2x SSC, 20% formamide, 2.5 mg/ml BSA, 10 mM VRC containing a total of 200 ng of Cy3-oligodT, 100 μg of sonicated salmon sperm DNA and 100 μg g of E. coli tRNA). DAPI was used to stain the nuclei. Images were acquired in a Leica DM6000 B microscope equipped with a DFC365FX camera (Leica).
RNA isolation from yeast
RNA was extracted from mid-log cultures using acid phenol after 3 h of galactose induction, following standard procedures. Briefly, mid-log cultures were collected and washed in cold water before resuspending the cell pellet in TES buffer (10 mM Tris, 1 mM EDTA, 0.5% SDS in DEPC-treated water) and immediately mixing with acid phenol (Sigma). Samples were incubated at 65ºC during 45 minutes with occasional vortexing. Aqueous phase was recovered after two phenol and one chloroform extraction, and RNA precipitated with sodium acetate and cold ethanol.
Northern blot assay
RNA was separated by acrylamide gel electrophoresis and transferred to Hybond-N nitrocellulose membranes (GE Healthcare). 32P-labelled DNA probes were used. Signal was acquired using a FLA-5100 Imager Fluorescence Analyzer (Fujifilm) and quantified with MultiGauge 2.0 analysis software (Science Lab). Signal was plotted as arbitrarily units (a.u.).
RT-qPCR
RNA was converted into DNA using the QuantiTect Reverse Transcription kit following the manufacturer’s instructions. Quantitative PCR were performed at given regions using SYBR Green PCR Master Mix (Biorad) in a 7500 Fast Real Time PCR System (Applied Biosystems).
Western blot assay
Western blots were performed on proteins extracted using 10% TCA following standard procedures. Samples were loaded onto 4–20% acrylamide gels (Bio-Rad), migrated in an SDS-containing buffer, transferred to nitrocellulose membranes and hybridized with anti-HA primary antibody or anti-PGK1 antibody, followed by anti-rabbit or anti-mouse HRP-conjugated secondary antibodies (S5 Table). Images were acquired using Amersham ImageQuant 800 (Cytiva) and quantified with Image Lab (Bio-Rad).
Chromatin immunoprecipitation (ChIP)
Mid-log cultures were grown in 2% galactose SC prior to fixation with 1% formaldehyde. Chromatin was processed as previously described [67] with minor modifications. Cells were resuspended in lysis buffer (50 mM HEPES-KOH at pH 7.5, 150 mM NaCl, 1 mM EDTA, 1% Triton X-100, 0.1% sodium deoxycholate, 0.1% SDS, 1 mM phenylmethanesulfonyl fluoride, EDTA-free protease inhibitor) and broken with glass beads in a VXR basic (Vibrax) shaker at 4ºC. Chromatin was sonicated to an average fragment size of 400–500 bp using a Bioruptor UCD-200 (Diagenode) and immunoprecipitated with 10 µL of HA tag antibody bounded to Protein A Dynabeads (Invitrogen). Quantitative PCR were performed at given regions using SYBR Green PCR Master Mix (Bio-Rad) in a 7500 Fast Real Time PCR System (Applied Biosystems). Relative abundance of the protein for each region was normalized to their input signal.
DNA-RNA hybrid immunoprecipitation (DRIP)
Mid-log cultures were grown overnight in 2% galactose SC media. DNA-RNA hybrids were processed as previously described [20]. Half of the samples was treated with 8μl RNase H (New England BioLabs) overnight at 37°C as negative control. Quantitative PCR were performed at given regions using SYBR Green PCR Master Mix (Bio-Rad) in a 7500 Fast Real Time PCR System (Applied Biosystems). Relative abundance of DNA-RNA hybrids in each region was normalized to their input signal.
Supporting information
S1 Fig. Analysis of mRNP composition in the absence of THO.
(A) mRNPs purified using immunoglobulin gamma (IgG)-conjugated beads pulling down either protein A-tagged Cbp20 or Cbp80 in a wild-type or hpr1∆ strain. Proteins were separated in SDS-PAGE stained with Coomassie or a Deep-Purple fluorescent dye. Plot comparing the fluorescence intensity of the mRNP components isolated from a wild-type or hpr1∆ strain. (B) RNA of WT strain (YBP249) overexpressing (+) or not (-) Npl3, Rim4, Sbp1 and She2 were analyzed by RT-qPCR. (C) Lysates from WT strain (YBP249) overexpressing (+) or not (-) Rim4 or Sbp1 and from a She2-HA tagged strain (WSHEHA) overexpressing (+) or not (-) She2 were analyzed by Western blot. (D) Lysates from WT strain (YBP249) overexpressing Rim4 or She2, with (+) or without (-) RNase H overexpression were analyzed by Western blot. Mean and SD of 3 samples is plotted for (D). *p ≤ 0.05; (two-tailed Student’s t-test).
https://doi.org/10.1371/journal.pgen.1011491.s001
(EPS)
S2 Fig. Analysis of Rie1 and Dis3 overexpression
(A) Diagram of genomic sequence contained in MW90-Clone 1 plasmid (C1), the deletions analysed (C1 del.A and C1 del.B) and YEpDIS3 plasmid (DIS3 o.e.). (B) Diagram of genomic sequence contained in MW90-Clone 2 plasmid (C2) and pYES2-RIE1 plasmid (RIE1 o.e.). (C) Lysates from WT strain (YBP249) overexpressing (+) or not (-) Rie1 or Dis3 were analyzed by Western blot. (D) Lysates from WT strain (YBP249) overexpressing Rie1 or Dis3, with (+) or without (-) RNase H overexpression were analyzed by Western blot.
https://doi.org/10.1371/journal.pgen.1011491.s002
(EPS)
S1 Table. Proteins increased/decreased in mRNPs isolated in the absence of THO.
https://doi.org/10.1371/journal.pgen.1011491.s003
(XLSX)
References
- 1. Brickner JR, Garzon JL, Cimprich KA. Walking a tightrope: The complex balancing act of R-loops in genome stability. Mol Cell. 2022;82(12):2267–97. pmid:35508167
- 2. Garcia-Muse T, Aguilera A. R loops: from physiological to pathological roles. Cell. 2019;179(3):604–18. https://doi.org/10.1016/j.cell.2019.08.055 pmid:31607512
- 3. Giunta S, Herve S, White RR, Wilhelm T, Dumont M, Scelfo A. CENP-A chromatin prevents replication stress at centromeres to avoid structural aneuploidy. Proc Natl Acad Sci U S A. 2021;118(10). https://doi.org/10.1073/pnas.2015634118 pmid:33653953
- 4. Mishra PK, Chakraborty A, Yeh E, Feng W, Bloom KS, Basrai MA. R-loops at centromeric chromatin contribute to defects in kinetochore integrity and chromosomal instability in budding yeast. Mol Biol Cell. 2021;32(1):74–89. pmid:33147102
- 5. Petermann E, Lan L, Zou L. Sources, resolution and physiological relevance of R-loops and RNA-DNA hybrids. Nat Rev Mol Cell Biol. 2022;23(8):521–40. pmid:35459910
- 6. Luna R, Gómez-González B, Aguilera A. RNA biogenesis and RNA metabolism factors as R-loop suppressors: a hidden role in genome integrity. Genes Dev. 2024;38(11–12):504–27. pmid:38986581
- 7. Richard P, Manley JL. R loops and links to human disease. J Mol Biol. 2017;429(21):3168–80. pmid:27600412
- 8. Luna R, Rondón AG, Pérez-Calero C, Salas-Armenteros I, Aguilera A. The THO complex as a paradigm for the prevention of cotranscriptional R-loops. Cold Spring Harb Symp Quant Biol. 2019;84:105–14. pmid:32493765
- 9. Chavez S, Beilharz T, Rondon AG, Erdjument-Bromage H, Tempst P, Svejstrup JQ. A protein complex containing Tho2, Hpr1, Mft1 and a novel protein, Thp2, connects transcription elongation with mitotic recombination in Saccharomyces cerevisiae. EMBO J. 2000;19(21):5824–34. https://doi.org/10.1093/emboj/19.21.5824 pmid:11060033
- 10. Pena A, Gewartowski K, Mroczek S, Cuellar J, Szykowska A, Prokop A. Architecture and nucleic acids recognition mechanism of the THO complex, an mRNP assembly factor. EMBO J. 2012;31(6):1605–16. https://doi.org/10.1038/emboj.2012.10 pmid:22314234
- 11. Bonneau F, Basquin J, Steigenberger B, Schäfer T, Schäfer IB, Conti E. Nuclear mRNPs are compact particles packaged with a network of proteins promoting RNA-RNA interactions. Genes Dev. 2023;37(11–12):505–17. pmid:37399331
- 12. Schuller SK, Schuller JM, Prabu JR, Baumgärtner M, Bonneau F, Basquin J, et al. Structural insights into the nucleic acid remodeling mechanisms of the yeast THO-Sub2 complex. Elife. 2020;9:e61467. pmid:33191913
- 13. Strässer K, Masuda S, Mason P, Pfannstiel J, Oppizzi M, Rodriguez-Navarro S, et al. TREX is a conserved complex coupling transcription with messenger RNA export. Nature. 2002;417(6886):304–8. pmid:11979277
- 14. Pacheco-Fiallos B, Vorländer MK, Riabov-Bassat D, Fin L, O’Reilly FJ, Ayala FI, et al. mRNA recognition and packaging by the human transcription-export complex. Nature. 2023;616(7958):828–35. pmid:37020021
- 15. Ren Y, Schmiege P, Blobel G. Structural and biochemical analyses of the DEAD-box ATPase Sub2 in association with THO or Yra1. Elife. 2017;6:e20070. pmid:28059701
- 16. Jimeno S, Rondón AG, Luna R, Aguilera A. The yeast THO complex and mRNA export factors link RNA metabolism with transcription and genome instability. EMBO J. 2002;21(13):3526–35. pmid:12093753
- 17. Perez-Calero C, Bayona-Feliu A, Xue X, Barroso SI, Munoz S, Gonzalez-Basallote VM, et al. UAP56/DDX39B is a major cotranscriptional RNA-DNA helicase that unwinds harmful R loops genome-wide. Genes Dev. 2020;34(13–14):898–912. https://doi.org/10.1101/gad.336024.119 pmid:32439635
- 18. Aguilera A. mRNA processing and genomic instability. Nat Struct Mol Biol. 2005;12(9):737–8. pmid:16142225
- 19. Huertas P, Aguilera A. Cotranscriptionally formed DNA:RNA hybrids mediate transcription elongation impairment and transcription-associated recombination. Mol Cell. 2003;12(3):711–21. pmid:14527416
- 20. Garcia-Rubio M, Aguilera P, Lafuente-Barquero J, Ruiz JF, Simon MN, Geli V. Yra1-bound RNA-DNA hybrids cause orientation-independent transcription-replication collisions and telomere instability. Genes Dev. 2018;32(13–14):965–77. https://doi.org/10.1101/gad.311274.117 pmid:29954833
- 21. Santos-Pereira JM, Herrero AB, García-Rubio ML, Marín A, Moreno S, Aguilera A. The Npl3 hnRNP prevents R-loop-mediated transcription-replication conflicts and genome instability. Genes Dev. 2013;27(22):2445–58. pmid:24240235
- 22. Pérez-Martínez L, Öztürk M, Butter F, Luke B. Npl3 stabilizes R-loops at telomeres to prevent accelerated replicative senescence. EMBO Rep. 2020;21(3):e49087. pmid:32026548
- 23. Oeffinger M, Wei KE, Rogers R, DeGrasse JA, Chait BT, Aitchison JD, et al. Comprehensive analysis of diverse ribonucleoprotein complexes. Nat Methods. 2007;4(11):951–6. pmid:17922018
- 24. Bretes H, Rouviere JO, Leger T, Oeffinger M, Devaux F, Doye V, et al. Sumoylation of the THO complex regulates the biogenesis of a subset of mRNPs. Nucleic Acids Res. 2014;42(8):5043–58. pmid:24500206
- 25. Huertas P, García-Rubio ML, Wellinger RE, Luna R, Aguilera A. An hpr1 point mutation that impairs transcription and mRNP biogenesis without increasing recombination. Mol Cell Biol. 2006;26(20):7451–65. pmid:16908536
- 26. Penzo A, Dubarry M, Brocas C, Zheng M, Mangione RM, Rougemaille M, et al. A R-loop sensing pathway mediates the relocation of transcribed genes to nuclear pore complexes. Nat Commun. 2023;14(1):5606. pmid:37730746
- 27. Rougemaille M, Dieppois G, Kisseleva-Romanova E, Gudipati RK, Lemoine S, Blugeon C, et al. THO/Sub2p functions to coordinate 3’-end processing with gene-nuclear pore association. Cell. 2008;135(2):308–21. pmid:18957205
- 28. Preker PJ, Guthrie C. Autoregulation of the mRNA export factor Yra1p requires inefficient splicing of its pre-mRNA. RNA. 2006;12(6):994–1006. pmid:16618971
- 29. Rodríguez-Navarro S, Strässer K, Hurt E. An intron in the YRA1 gene is required to control Yra1 protein expression and mRNA export in yeast. EMBO Rep. 2002;3(5):438–42. pmid:11964382
- 30. Gavalda S, Santos-Pereira JM, Garcia-Rubio ML, Luna R, Aguilera A. Excess of Yra1 RNA-binding factor causes transcription-dependent genome instability, replication impairment and telomere shortening. PLoS Genet. 2016;12(4):e1005966. https://doi.org/10.1371/journal.pgen.1005966 pmid:27035147
- 31. Dziembowski A, Lorentzen E, Conti E, Séraphin B. A single subunit, Dis3, is essentially responsible for yeast exosome core activity. Nat Struct Mol Biol. 2007;14(1):15–22. pmid:17173052
- 32. Smolka JA, Sanz LA, Hartono SR, Chedin F. Recognition of RNA by the S9.6 antibody creates pervasive artifacts when imaging RNA:DNA hybrids. J Cell Biol. 2021;220(6).
- 33. Wyers F, Rougemaille M, Badis G, Rousselle J-C, Dufour M-E, Boulay J, et al. Cryptic pol II transcripts are degraded by a nuclear quality control pathway involving a new poly(A) polymerase. Cell. 2005;121(5):725–37. pmid:15935759
- 34. Chartrand P, Meng XH, Singer RH, Long RM. Structural elements required for the localization of ASH1 mRNA and of a green fluorescent protein reporter particle in vivo. Curr Biol. 1999;9(6):333–6. https://doi.org/10.1016/s0960-9822(99)80144-4 pmid:10209102
- 35. Shen Z, St-Denis A, Chartrand P. Cotranscriptional recruitment of She2p by RNA pol II elongation factor Spt4-Spt5/DSIF promotes mRNA localization to the yeast bud. Genes Dev. 2010;24(17):1914–26. https://doi.org/10.1101/gad.1937510 pmid:20713510
- 36. Allmang C, Mitchell P, Petfalski E, Tollervey D. Degradation of ribosomal RNA precursors by the exosome. Nucleic Acids Res. 2000;28(8):1684–91. pmid:10734186
- 37. Mitchell P, Petfalski E, Shevchenko A, Mann M, Tollervey D. The exosome: a conserved eukaryotic RNA processing complex containing multiple 3’-->5’ exoribonucleases. Cell. 1997;91(4):457–66. pmid:9390555
- 38. Berchowitz LE, Kabachinski G, Walker MR, Carlile TM, Gilbert WV, Schwartz TU, et al. Regulated formation of an amyloid-like translational repressor governs gametogenesis. Cell. 2015;163(2):406–18. pmid:26411291
- 39. Huh W-K, Falvo JV, Gerke LC, Carroll AS, Howson RW, Weissman JS, et al. Global analysis of protein localization in budding yeast. Nature. 2003;425(6959):686–91. pmid:14562095
- 40. Shen Z, Paquin N, Forget A, Chartrand P. Nuclear shuttling of She2p couples ASH1 mRNA localization to its translational repression by recruiting Loc1p and Puf6p. Mol Biol Cell. 2009;20(8):2265–75. pmid:19244342
- 41. Böhl F, Kruse C, Frank A, Ferring D, Jansen RP. She2p, a novel RNA-binding protein tethers ASH1 mRNA to the Myo4p myosin motor via She3p. EMBO J. 2000;19(20):5514–24. pmid:11032818
- 42. Long RM, Gu W, Lorimer E, Singer RH, Chartrand P. She2p is a novel RNA-binding protein that recruits the Myo4p-She3p complex to ASH1 mRNA. EMBO J. 2000;19(23):6592–601. pmid:11101531
- 43. Asada R, Dominguez A, Montpetit B. Single-molecule quantitation of RNA-binding protein occupancy and stoichiometry defines a role for Yra1 (Aly/REF) in nuclear mRNP organization. Cell Rep. 2023;42(11):113415. pmid:37963019
- 44. Gavaldá S, Gallardo M, Luna R, Aguilera A. R-loop mediated transcription-associated recombination in trf4Δ mutants reveals new links between RNA surveillance and genome integrity. PLoS One. 2013;8(6):e65541. pmid:23762389
- 45. Luna R, Jimeno S, Marín M, Huertas P, García-Rubio M, Aguilera A. Interdependence between transcription and mRNP processing and export, and its impact on genetic stability. Mol Cell. 2005;18(6):711–22. pmid:15949445
- 46. Wahba L, Amon JD, Koshland D, Vuica-Ross M. RNase H and multiple RNA biogenesis factors cooperate to prevent RNA:DNA hybrids from generating genome instability. Mol Cell. 2011;44(6):978–88. pmid:22195970
- 47. Gritti I, Basso V, Rinchai D, Corigliano F, Pivetti S, Gaviraghi M, et al. Loss of ribonuclease DIS3 hampers genome integrity in myeloma by disrupting DNA:RNA hybrid metabolism. EMBO J. 2022;41(22):e108040. pmid:36215697
- 48. Nojima T, Tellier M, Foxwell J, Ribeiro de Almeida C, Tan-Wong SM, Dhir S. Deregulated expression of mammalian lncRNA through loss of SPT6 induces R-loop formation, replication stress, and cellular senescence. Mol Cell. 2018;72(6):970–84 e7. https://doi.org/10.1016/j.molcel.2018.10.011 pmid:30449723
- 49. Pefanis E, Wang J, Rothschild G, Lim J, Chao J, Rabadan R, et al. Noncoding RNA transcription targets AID to divergently transcribed loci in B cells. Nature. 2014;514(7522):389–93. pmid:25119026
- 50. Villa T, Barucco M, Martin-Niclos M-J, Jacquier A, Libri D. Degradation of non-coding RNAs promotes recycling of termination factors at sites of transcription. Cell Rep. 2020;32(3):107942. pmid:32698007
- 51. Richard P, Feng S, Manley JL. A SUMO-dependent interaction between Senataxin and the exosome, disrupted in the neurodegenerative disease AOA2, targets the exosome to sites of transcription-induced DNA damage. Genes Dev. 2013;27(20):2227–32. pmid:24105744
- 52. Liu Z, Huang Y. Advantages of proteins being disordered. Protein Sci. 2014;23(5):539–50. pmid:24532081
- 53. Gitler AD, Shorter J. RNA-binding proteins with prion-like domains in ALS and FTLD-U. Prion. 2011;5(3):179–87. pmid:21847013
- 54. Giannini M, Bayona-Feliu A, Sproviero D, Barroso SI, Cereda C, Aguilera A. TDP-43 mutations link Amyotrophic Lateral Sclerosis with R-loop homeostasis and R loop-mediated DNA damage. PLoS Genet. 2020;16(12):e1009260. pmid:33301444
- 55. Thompson VF, Wieland DR, Mendoza-Leon V, Janis HI, Lay MA, Harrell LM, et al. Binding of the nuclear ribonucleoprotein family member FUS to RNA prevents R-loop RNA:DNA hybrid structures. J Biol Chem. 2023;299(10):105237. pmid:37690693
- 56. Wood M, Quinet A, Lin YL, Davis AA, Pasero P, Ayala YM, et al. TDP-43 dysfunction results in R-loop accumulation and DNA replication defects. J Cell Sci. 2020;133(20). https://doi.org/10.1242/jcs.244129 pmid:32989039
- 57. Kwiatkowski TJ, Bosco DA, Leclerc AL, Tamrazian E, Vanderburg CR, Russ C. Mutations in the FUS/TLS gene on chromosome 16 cause familial amyotrophic lateral sclerosis. Science. 2009;323(5918):1205–8. https://doi.org/10.1126/science.1166066 pmid:19251627
- 58. Neumann M, Sampathu DM, Kwong LK, Truax AC, Micsenyi MC, Chou TT, et al. Ubiquitinated TDP-43 in frontotemporal lobar degeneration and amyotrophic lateral sclerosis. Science. 2006;314(5796):130–3. pmid:17023659
- 59. Stirling PC, Bloom MS, Solanki-Patil T, Smith S, Sipahimalani P, Li Z, et al. The complete spectrum of yeast chromosome instability genes identifies candidate CIN cancer genes and functional roles for ASTRA complex components. PLoS Genet. 2011;7(4):e1002057. pmid:21552543
- 60. Pereira B, Billaud M, Almeida R. RNA-binding proteins in cancer: old players and new actors. Trends Cancer. 2017;3(7):506–28.
- 61. Varesi A, Campagnoli LIM, Barbieri A, Rossi L, Ricevuti G, Esposito C, et al. RNA binding proteins in senescence: A potential common linker for age-related diseases?. Ageing Res Rev. 2023;88:101958. pmid:37211318
- 62. Pommier Y, O’Connor MJ, de Bono J. Laying a trap to kill cancer cells: PARP inhibitors and their mechanisms of action. Sci Transl Med. 2016;8(362):362ps17.
- 63. Lee S-H, Singh I, Tisdale S, Abdel-Wahab O, Leslie CS, Mayr C. Widespread intronic polyadenylation inactivates tumour suppressor genes in leukaemia. Nature. 2018;561(7721):127–31. pmid:30150773
- 64. Lisby M, Rothstein R, Mortensen UH. Rad52 forms DNA repair and recombination centers during S phase. Proc Natl Acad Sci U S A. 2001;98(15):8276–82. pmid:11459964
- 65. Grubb J, Brown MS, Bishop DK. Surface spreading and immunostaining of yeast chromosomes. J Vis Exp. 2015;102:e53081. https://doi.org/10.3791/53081 pmid:26325523
- 66. Trcek T, Larson DR, Moldón A, Query CC, Singer RH. Single-molecule mRNA decay measurements reveal promoter- regulated mRNA stability in yeast. Cell. 2011;147(7):1484–97. pmid:22196726
- 67. Hecht A, Grunstein M. Mapping DNA interaction sites of chromosomal proteins using immunoprecipitation and polymerase chain reaction. Methods Enzymol. 1999;304:399–414. https://doi.org/10.1016/s0076-6879(99)04024-0 pmid:10372373