R-loops are cellular structures composed of an RNA/DNA hybrid, which is formed when the RNA hybridises to a complementary DNA strand and a displaced single-stranded DNA. R-loops have been detected in various organisms from bacteria to mammals and play crucial roles in regulating gene expression, DNA and histone modifications, immunoglobulin class switch recombination, DNA replication, and genome stability. Recent evidence suggests that R-loops are also involved in molecular mechanisms of neurological diseases and cancer. In addition, mutations in factors implicated in R-loop biology, such as RNase H and SETX (senataxin), lead to devastating human neurodegenerative disorders, highlighting the importance of correctly regulating the level of R-loops in human cells. In this review we summarise current advances in this field, with a particular focus on diseases associated with dysregulation of R-loop structures. We also discuss potential therapeutic approaches for such diseases and highlight future research directions.
Citation: Groh M, Gromak N (2014) Out of Balance: R-loops in Human Disease. PLoS Genet 10(9): e1004630. doi:10.1371/journal.pgen.1004630
Editor: Nancy Maizels, University of Washington School of Medicine, United States of America
Published: September 18, 2014
Copyright: © 2014 Groh, Gromak. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: NG is supported by a Royal Society University Research Fellowship and MRC NIRG (MR/J007870/1). MG is supported by a PhD studentship from Ataxia UK/Motor Neuron Disease Association (Gromak/Jun11/6278). The funders had no role in the preparation of the article.
Competing interests: The authors have declared that no competing interests exist.
R-loops are three-stranded structures, which form when RNA hybridises to a complementary DNA strand, forming an RNA/DNA hybrid, resulting in displacement of the other DNA strand in this process (Figure 1). The first R-loops were described in 1976, when their formation in vitro in the presence of 70% formamide was visualised by electron microscopy (Figure 1) . These structures were thermodynamically more stable than duplex DNA, and they remained intact following removal of formamide. This technique of RNA/DNA hybridisation has been used in over 140 studies to map gene organisation, transcription initiation sites, and the direction of transcription, as well as measure the quantities of cellular RNAs .
The diagram depicts major developments in the R-loop field and diseases associated with R-loop dysregulation.
The first evidence for R-loop formation in live bacteria was obtained in 1994 . This was followed by numerous studies showing that R-loops exist in different organisms (Figure 1) –. In living cells, R-loops are thought to form in cis during transcription, when nascent RNA hybridises to the DNA template behind the elongating RNA polymerase (Pol) . However, in contrast to this popular view of cotranscriptional R-loops, recent studies suggest that RNA transcribed at one locus can hybridise to homologous DNA at another locus, thus leading to R-loop formation in trans . In the last five years, the use of an antibody (S9.6) recognising RNA/DNA hybrids has revolutionised the R-loop field . Initially, the S9.6 antibody, which detects hybrids as small as six bp with an affinity of 0.6 nM, was developed as a tool to enhance the DNA/RNA hybridisation signal in DNA microarray studies , . More recently, it has been used to detect R-loops in vivo and uncover their contribution to fundamental biological processes in yeast , , plants , mice , , and humans –.
The picture emerging from these studies suggests that R-loops can be both beneficial and deleterious to cells. Paradoxically, while they are required for important biological processes, they can also promote DNA damage and genome instability. In particular, R-loops have been shown to play an essential positive function in Escherichia coli plasmid and human mitochondrial DNA replication ,  and during immunoglobulin class switch recombination, which contributes to the antibody isotype diversity in activated B cells . R-loops form on many genes in yeast and human cells ,  and have been implicated in regulation of gene expression. R-loops can repress transcription and promote transcriptional termination , , . Furthermore, R-loops are clearly associated with epigenetic mechanisms governing transcription, including DNA methylation and posttranslational histone modifications , . In spite of this growing list of beneficial R-loop functions, it is also evident that R-loops can be a dangerous source of DNA damage. They can sensitize DNA to damaging agents , induce transcription-associated recombination , double-strand breaks (DSBs) , , chromosome breaks, and fragile site instability –, and cause chromosome loss . Therefore, cells need to tightly regulate the levels of R-loops to exploit their unique features. Altering the physiological R-loop balance can impair R-loop-regulated processes, cause genome instability, and may lead to human diseases. Consequently, defining the roles of R-loops in the multitude of biological processes and human disease is likely to develop into one of the most important and influential areas of R-loop research in the future.
Proteins in R-loop Biology
The number of proteins associated with R-loop biology has increased in the last few years, reflecting the diversity of R-loop processes (Table S1) –. Many proteins can regulate cellular R-loop levels either directly or indirectly, mostly by preventing RNA from hybridising to DNA, thus reducing excessive R-loop accumulation. Among these are proteins required for efficient transcriptional elongation, termination, polyadenylation, RNA splicing, packaging, and export , , , , , , . DNA topology itself can influence hybridisation of RNA to DNA, and topoisomerases consequently play important roles in modulating R-loop levels , . Proteins involved in maintenance of genome integrity can also regulate R-loops, suggesting a dynamic interplay between DNA repair and R-loop formation . Importantly, cells possess dedicated enzymes, including the members of the RNase H family that specifically degrade the RNA in R-loops , and helicases that can unwind RNA/DNA hybrids , .
Recent evidence shows that R-loops can directly affect many gene expression–associated processes, including DNA methylation, posttranslational histone modifications, and transcription, by influencing the function of regulatory proteins ,,,. Despite the growing number of proteins involved in R-loop homeostasis and human disease, many questions still remain unanswered. For many proteins with documented in vitro RNA/DNA helicase activity (e.g., Pif1, the MCM complex), in vivo evidence is generally still lacking (Table S1) , . Moreover, the molecular mechanisms underlying interactions between proteins and R-loops are poorly understood, and in many cases the connections to disease remain obscure.
R-loops and Neurological Diseases
The biological importance of R-loops in humans is supported by the fact that mutations in proteins implicated in R-loop resolution cause devastating human diseases, often related to neurodegeneration. Mutations in the putative RNA/DNA helicase SETX cause neurodegenerative diseases, the dominant juvenile form of amyotrophic lateral sclerosis type 4 (ALS4), and a recessive form of ataxia oculomotor apraxia type 2 (AOA2) (Figure 2A). These diseases are characterised by progressive degeneration of motor neurons in the brain and spinal cord, muscle weakness and atrophy –.
The diagram depicts the role of R-loops in human diseases. Loss of wild type protein function is depicted by red crosses. A. Ataxia and motor neuron diseases. Mutations in human RNA/DNA helicase senataxin are associated with AOA2/ALS4 disorders and lead to R-loop accumulation and defects in transcriptional termination by Pol II , the maintenance of genome integrity , meiotic recombination during spermatogenesis, gene silencing during meiotic sex chromosome inactivation , and neuronal differentiation . B. Aicardi-Goutières syndrome (AGS). AGS is associated with mutations in all three subunits of RNase H2, ssDNA 3′–5′ exonuclease TREX1 (DNASEIII), dsRNA-editing enzyme ADAR1, and dNTP triphosphatase SAMHD1; these trigger accumulation of unprocessed nucleic acids, including genomic DNA with incorporated ribonucleotides, R-loops, and retroelement-derived nucleic acids, and result in the immune response characteristic of AGS . C. Trinucleotide expansion diseases. R-loops form over expanded repeats and result in decreased initiation and elongation of RNA Pol II and formation of repressive chromatin marks, which silence the host gene containing expanded repeats . D. Genome instability in cancer. Loss of proteins protecting against abnormal R-loop accumulation, such as FIP1L1, leads to genome instability, one hallmark of cancer . Yellow stars denote double-stranded DNA breaks. E. AID-mediated mutagenesis and translocations in cancer. Single-stranded DNA in R-loops is a substrate for cytidine deamination by activation-induced cytidine deaminase, leading to mutagenesis as indicated by orange stars , . These mutations can cause DSB formation, leading to chromosomal translocations. The IgH/c-MYC translocation brings the strong IgH enhancers, shown as yellow box, close to c-MYC, leading to its overexpression in Burkitt's lymphoma . Transcription of IgH/c-MYC starts from a previously inactive promoter downstream of the translocation break point. The IgH locus is depicted in blue, c-MYC gene is in grey. The translocation breakpoint is indicated by a dashed black line. F. Senescence. R-loops formed by the noncoding RNA TERRA accumulate at telomeres in cells deficient of Hpr1 and RNase H. In the absence of telomerase, these R-loops promote Rad52-dependent telomere elongation and delayed senescence. In the absence of telomerase and Rad52, R-loops promote telomere shortening and premature senescence .
In addition to its predicted function as an RNA/DNA helicase, SETX interacts with proteins involved in diverse aspects of RNA metabolism . Moreover, a single amino acid mutation, which compromises the function of the yeast homologue Sen1, dramatically changes the Pol II distribution genome-wide, further supporting the view that SETX/Sen1 functions in the regulation of transcription . Recently, we demonstrated that SETX is implicated in transcriptional termination by Pol II in humans . It is required to resolve R-loops at termination elements, releasing RNA for degradation by the 5′–3′ “torpedo” exonuclease Xrn2 prior to termination (Figure 2A) , . Mutations in the yeast homologue, Sen1, also lead to a transcriptional termination defect, associated with accumulation of R-loops and genome instability . In line with its function in R-loop resolution, SETX/Sen1 is also involved in maintaining genome integrity by coordinating transcription, DNA replication, and the DNA damage response –. SETX can target the 3′–5′ RNA degradation complex, the exosome, to sites of transcription-induced DNA damage . Furthermore, SETX protects genome integrity by coordinating meiotic recombination with transcription during spermatogenesis and gene silencing during meiotic sex chromosome inactivation . In particular, Setx knock-out mice accumulated DNA double strand breaks and R-loops and failed to disassemble Rad51 filaments. This resulted in a failure to cross over, likely due to collision between R-loops with Holliday junctions . These defects in Setx knock-out mice lead to male infertility, raising the question as to how this relates to fertility of male AOA2/ALS4 patients.
Studies in neuronal cells have demonstrated a role for SETX in neuronal differentiation through fibroblast growth factor 8 (FGF8) signalling, providing one explanation for the effects of loss-of-function AOA2 mutations . Surprisingly, overexpression of dominant mutant forms of SETX did not affect neuritogenesis, suggesting that a different function of SETX may be affected in ALS4 patients. However, the interplay between the function of SETX in R-loop resolution, genome maintenance, and neuronal differentiation is still unclear. In a recent study, Lavin and colleagues examined cells from mice with disrupted Atm, Tdp1, Setx, or Aptx genes, which cause ataxia telangiectasia (AT), spinocerebellar ataxia with axonal neuropathy 1 (SCAN1), AOA2, and ataxia oculomotor apraxia type 1 (AOA1) disorders, respectively . These diseases are characterised by a defective response to DNA damage, suggesting that R-loops may be implicated in triggering genome instability. Indeed, R-loops were found to be enriched in proliferating cells (testes), but not in the brain tissues from Setx, Atm, Tdp1 or Aptx knock-out mice . The enrichment of R-loops in testes correlated with high levels of DNA damage and apoptosis. The lack of R-loops in brain tissue questions the association between R-loops and neurodegeneration. This result is surprising, because inducible R-loops have been previously detected in neuronal cells at the Snord116 locus, which is associated with the neurodevelopmental disorder Angelman syndrome, as discussed below . Furthermore, R-loops were implicated in inducing DNA damage in nonproliferating cells and post-mitotic neurons and proposed to contribute to the neurodegeneration seen in AT patients . It is possible that R-loops are regulated by different mechanisms in proliferating cells and post-mitotic neurons, thereby leading to different R-loop kinetics and so preventing their detection in some model systems. In particular, R-loop accumulation may reflect collisions between transcription and replication machineries , , events which do not occur in postmitotic neurons. It should be noted that the mouse models currently used may not fully recapitulate all aspects of human neurodegeneration.
RNase H and Aicardi-Goutières Syndrome (AGS)
In addition to their generation during transcription, RNA/DNA hybrids can arise due to incorporation of ribonucleotides into DNA by DNA polymerases during replication. RNase H enzymes are endonucleases that cleave the RNA of RNA/DNA hybrids in a sequence-independent manner, thus maintaining genome stability by resolving R-loops that form during transcription and by removing misincorporated ribonucleotides from the DNA . Eukaryotic cells have two types of these enzymes, RNase H1 and RNase H2, which have different enzymatic and site-specific activities . In particular, RNase H1 requires a tract of at least four ribonucleotides to cleave the RNA/DNA hybrid, whereas RNase H2 can incise 5′ to a single ribonucleotide incorporated within a DNA molecule , . Therefore, only RNase H2 can process single ribonucleotides in the DNA, but both enzymes are capable of eliminating RNA/DNA hybrids. Unlike in bacteria and unicellular eukaryotic organisms, where RNase H enzymes are dispensable for viability, both RNase H enzymes are essential in higher eukaryotes. RNase H1 has been implicated in mitochondrial DNA (mtDNA) replication during mouse development, a process likely to be associated with processing of RNA primers during mtDNA replication .
RNase H2 is composed of three different subunits, the catalytic subunit 2A, and two other subunits, 2B and 2C, all of which are required for enzyme activity. RNase H2 has been implicated in recognition and removal of ribonucleotides incorporated into DNA and hydrolysis of Okazaki fragment RNA primers during DNA replication , –. In addition, recent studies point towards a role of RNase H2 in R-loop resolution during transcription in vivo , . In particular, deletion of Saccharomyces cerevisiae RNase H2 imposes transcriptional blocks and R-loop accumulation over rDNA regions in cells depleted of Topoisomerase I  and transcriptional down-regulation of genes with higher guanine-cytosine (GC) content at the promoter regions, which are likely to form stable R-loops .
In humans, mutations in any of the three subunits of RNase H2 cause Aicardi-Goutières syndrome (AGS), a neurological inflammatory disorder, which resembles a congenital viral infection and is associated with accumulation of ribonucleotides in the DNA (Figure 2B) , . Interestingly, AGS can also be triggered by mutations in single-stranded DNA (ssDNA) 3′–5′ exonuclease TREX1(DNASEIII) , double-stranded RNA (dsRNA)-editing enzyme ADAR1 , and dNTP triphosphatase SAMHD1. These proteins are involved in diverse pathways of nucleic acid metabolism, although their functions are not yet fully understood. They have been implicated in degrading ssDNA arising from endogenous retroelements or replication stress (TREX1), regulating the intracellular dNTPs pool available for replication and reverse transcription of these retroelements (SAMHD1), or altering the immune response to RNA species through RNA editing of retroelements and microRNAs (ADAR1) . Mutations in these proteins are associated with an accumulation of unprocessed nucleic acids, which triggers the immune response characteristic of AGS , .
So far, pathologies linked to AGS mutations in RNase H2 have been mainly attributed to genome instability caused by accumulation of ribonucleotides in DNA , . However, a specific contribution of R-loops and RNA/DNA hybrids to AGS pathology has not been yet investigated. This research has been hampered by the difficulty to uncouple the two activities of RNase H2; its ability to remove ribonucleotides from the DNA and to resolve R-loops, both of which are affected when RNase H2 is deleted , . Nevertheless, several lines of evidence suggest that R-loops may be involved in AGS pathology. Thus, an AGS-related mutation in the yeast RNase H2 enzyme resulted in its reduced RNA/DNA cleavage activity . Since RNase H2 constitutes ∼90% of the total cellular RNA/DNA hybrid cleavage activity, its loss due to AGS mutations may lead to significant accumulation of R-loops . The importance of RNase H2 is further highlighted by the fact that mutations in RNase H1 do not cause AGS, suggesting that RNase H2 may have unique properties to degrade RNA/DNA hybrids . Indeed, R-loops arising during DNA replication may be exclusively degraded by RNase H2, as they may be inaccessible to RNase H1 , . A recently generated S. cerevisiae RNase H2 mutant, which possesses R-loop degrading activity but fails to remove single ribonucleotides from the DNA , will be a useful tool in addressing the contribution of unresolved transcription-associated R-loops to AGS pathology.
TREX1, ADAR1 and SAMHD1 process retroelement-derived nucleic acids and help to suppress retroelements expansion in the host genome and their recognition by the immune system . Interestingly, recent genome-wide studies have demonstrated that RNA/DNA hybrids are particularly enriched at retrotransposon elements in yeast cells , suggesting that expansion of retroelements due to mutations in TREX1, ADAR1 or SAMHD1 may lead to increased RNA/DNA hybrid levels, contributing to autoimmunitity in AGS. Indeed, it has recently been demonstrated that RNA/DNA hybrids can be sensed by toll-like receptor 9 (TLR9) to induce pro-inflammatory cytokine and antiviral interferon production in dendritic cells .
R-loops in Nucleotide Expansion Diseases
Expansions of repetitive sequences have been linked to over forty human diseases , and R-loops have been proposed to play a role in their pathology –. Remarkably, R-loops are formed following transcription of trinucleotide repeats in vitro, in bacteria and human cells , , . Interestingly, the nontemplate DNA strand in many repetitive sequences can adopt unusual DNA structures, including G-quadruplexes and DNA triplexes, which may further stabilise R-loops . Moreover, R-loops formed at CTG repeats promote repeat instability characteristic of these diseases .
Recently, we demonstrated that R-loops form over expanded GAA and CGG repeats in cells from Friedreich's Ataxia (FRDA) and Fragile X syndrome (FXS) patients, respectively (Figure 2C) . The abundance of these stable R-loops correlates with expansion size, and they colocalise with the repressive chromatin marks characteristic of these diseases (Figure 2C). R-loops can also trigger the formation of repressive chromatin and cause transcriptional silencing of the FXN gene, providing a molecular link between R-loops and the pathology of expansion diseases . In line with R-loops formed on expanded “premutation” and “full mutation” CGG-repeat-containing alleles of the FMR1 gene , , promoter-bound FMR1 mRNA containing trinucleotide repeats was shown to promote epigenetic silencing in FXS . Importantly, the involvement of R-loops in expansion diseases is not limited to trinucleotide repeats, since R-loops associated with expanded hexanucleotide GGGGCC repeats in C9orf72 contribute to the molecular event leading to amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) .
R-loops could contribute to the pathology of expansion diseases in various ways. Similar to R-loops at the 3′ends of human genes, expansion-associated R-loops may form a structural block, directly interfering with Pol II transcriptional elongation , . Alternatively, R-loops may nucleate repressive chromatin over the expansion region, by analogy with heterochromatin formation at centromeres in Schizosaccharomyces pombe , or promote chromatin compaction associated with histone H3S10 phosphorylation, as observed in S. cerevisiae, Caenorhabditis elegans, and human cells . Furthermore, R-loops could cause the characteristic intergenerational and somatic instability of repeat sequences .
R-loops in Cancer
Genome instability is a hallmark of cancer, and it may actively drive hereditary tumour development , . Research in the last decade has clearly demonstrated that dysregulation of R-loops can corrupt genome integrity, resulting in increased DNA sensitivity to damaging agents, formation of DSBs, chromosome breaks, fragile site instability, chromosome loss, and recombination events . Several mechanisms have therefore evolved to maintain R-loop levels in balance, and alterations in genome caretaker processes can affect R-loop levels and genome stability . Moreover, mutations in proteins controlling R-loop levels have been identified in tumours (Figure 2D). For example, in eosinophilic leukemia, an oncogenic translocation renders cleavage and polyadenylation factor FIP1L1 inactive, which has been previously shown to cause increased R-loop levels, DNA damage and chromosome instability (Figure 2D) . A similar mechanism was suggested for RNA kinase CLP1, which is associated with a translocation in mixed lineage leukemia (MLL) . The histone ubiquitin ligase BRE1 also limits R-loop levels, and its decreased expression may contribute to the high levels of genomic instability observed in testicular seminoma .
The link between R-loops and cancer has been further substantiated by the finding that the tumour suppressor BRCA2, which is mutated in breast and ovarian cancer, is required to prevent R-loop accumulation and genome instability . These observations raise the interesting possibility that R-loops may provide proliferative advantages to tumour cells by promoting genome instability. This will in turn increase the probability of accumulating mutations favourable to tumour growth and metastasis. Intriguingly, recent evidence demonstrates that human oncogenic viruses may also promote genomic instability through accumulation of R-loops after infection. Kaposi's sarcoma-associated herpesvirus (KSHV), which causes multiple AIDS-related cancers, encodes the ORF57 protein, which can sequester the host hTREX complex, important for mRNA processing and export . Sequestration of hTREX leads to KSHV-induced accumulation of R-loops and causes damage to the host DNA, contributing to tumourigenesis .
Whilst some proteins suppress R-loop formation, others may promote R-loops and so increase genome instability leading to tumour development. This unexpected function has been shown in yeast for transcription elongation factor Spt2 and DNA repair protein Rad51 , . Overexpression of Spt2 leads to transcription-dependent chromosomal rearrangements, which are prevented by RNase H overexpression . Spt2 is structurally related to human HMG1, which is overexpressed in gastric cancers and malignant melanomas . However, it is not clear if increased HMG1 levels promote R-loops and DNA damage in cancer cells. In contrast to its well-established role in DNA strand exchange during homologous recombination and DNA repair , recent studies have shown that Rad51 can also mediate R-loop formation and genome instability in trans, extending the prevailing view that R-loops form cotranscriptionally . Similar to HMG1, RAD51 is overexpressed in human cancers . However, it remains to be elucidated if RAD51 overexpression in cancers is a consequence of activated DNA repair pathways, or a cause of genome instability .
R-loops have been detected in immunoglobulin (Ig) genes, where they initiate class switch recombination by exposing single-stranded DNA, thus providing the substrate for activation-induced cytidine deaminase (AID), which promotes DSBs and subsequent translocation between Ig heavy chains , . Although this process is essential for generation of antibody isotype diversity, AID-mediated mutagenesis has also been implicated in pathological translocations between the Ig loci and other active genes, leading to production of fusion proteins or oncogenic gene expression, observed in B cell malignancies (Figure 2E) . Interestingly, R-loops are also found in common translocation partners of Ig genes, including the oncogene c-MYC , . Therefore, the simultaneous formation of R-loops in Ig and transcribed non-Ig genes may induce AID-mediated DSB formation, leading to pathological translocations (Figure 2E) , , . Interestingly, overexpression of the APOBEC family of AID-related enzymes in breast cancer have been linked to genomic mutations, pointing to a potentially broader role of R-loops and AID/APOBEC-mediated genome instability in cancer .
Changes in gene expression are another central aspect of cancer . In healthy cells, the expression of tumour suppressor genes prevents abnormal proliferation and other aspects of tumourigenesis . Tumour suppressors are frequently silenced in cancer by excessive promoter DNA methylation . It has been proposed that R-loop formation at promoters protects against DNA methylation by de novo DNA methyltransferase DNMT3B, thereby keeping genes active . Since R-loops have been computationally predicted to form at promoters of tumour suppressor genes BRCA1, RASSF1A, and CDKN2A , it is important to investigate if R-loop levels at these genes are reduced in cancer and how this relates to the observed DNA hypermethylation.
In contrast to this, efficient transcription of the oncogene c-MYC requires that R-loop levels are kept low by the activity of DNA topoisomerase IIIB, which is recruited to arginine-methylated histones by the tudor domain containing 3 (TDRD3) protein . This R-loop-mediated mechanism of c-MYC gene regulation may be relevant to tumour progression in breast cancer, which frequently shows overexpression of both c-MYC and TDRD3 , . Therefore, it is tempting to speculate that increased TDRD3 levels suppress R-loops in c-MYC, thereby allowing its enhanced expression, which correlates with poor cancer prognosis . However, it still remains to be determined if R-loops play a specific role in transcription dysregulation in cancer and if this process differs from R-loop-mediated transcriptional programmes associated with housekeeping genes.
More recently, R-loops have been implicated in cell senescence, a mechanism protecting against tumour cell proliferation . In particular, the telomeric noncoding (nc) RNA TERRA forms R-loops which are induced when R-loop suppressors such as RNase H or Thp2 are lost , . In the absence of telomerase, telomeric R-loops promote recombination-mediated telomere elongation via Rad52, and this delays the onset of cellular senescence . In contrast, in Rad52-deficient cells, R-loop accumulation leads to telomere shortening and premature senescence . Interestingly, cells from AOA2 patients with senataxin mutations contain shorter telomeres, suggesting a possible involvement of SETX in telomere stability . Telomeric R-loops therefore play a complex and dynamic role in telomere length maintenance and cellular proliferative potential (Figure 2F).
In conclusion, multiple lines of evidence point to an involvement of R-loops in cancer biology. Yet it still remains to be investigated if R-loop levels are indeed regulated differentially in normal and tumour tissues and if they can directly influence tumourigenesis.
R-loops represent a potential therapeutic target. Despite their importance in gene regulation, they have yet to be fully exploited in drug design . Various ligands can target RNA/DNA hybrids, including ethidium bromide, the aminoglycosides neomycin and paramomycin, and the polyamides distamycin and netropsin . These compounds recognise RNA/DNA hybrids through intercalation and binding to the nucleic acid groove. Although exhibiting high binding affinities to RNA/DNA hybrids, many of these molecules also bind dsDNA and RNA and are mutagenic, limiting their potential biological applications . However, recent studies suggest that combining the properties of these ligands can achieve subnanomolar affinity for RNA/DNA hybrids. In particular, this has been demonstrated for ligands linking aminoglycosides to derivatives of ethidium bromide , providing a possible approach for the development of potent and specific RNA/DNA hybrid ligands in future drug design efforts.
Various compounds that modulate DNA supercoiling and inhibit DNA topoisomerases, including topotecan and camptothecin, can also affect R-loop formation in vivo , . In particular, topoisomerase inhibitors have recently been used to reactivate the silenced paternal Ube3a gene, which encodes a ubiquitin E3 ligase, to compensate for the deleted maternal Ube3a in Angelman syndrome (AS). AS and Prader-Willi syndrome (PWS) are imprinted neurodevelopmental disorders that are often caused by large deletions of human chromosome 15q11–q13 over the Snord116 gene locus, but the deletion differs in its parent-of-origin . In neurons, only the maternal Ube3a allele is expressed, because the paternal Ube3a allele is silenced by expression of the ncRNA Ube3a-ATS (Figure 3A) . AS therapies therefore seek to reactivate the silenced, but genetically intact, paternal Ube3a allele. Interestingly, R-loops were recently shown to regulate the neuronal expression of the paternal Ube3a-ATS transcript, which is essential for transcriptional silencing of the paternal Ube3a gene . In particular, treatment with the topoisomerase inhibitor topotecan increased R-loop levels over the Snord116 locus, resulting in chromatin decondensation, inhibition of Pol II transcription of Ube3a-ATS, and concomitant increase in Ube3a expression from the paternal allele (Figure 3B). This R-loop-mediated reactivation of paternal Ube3a could therefore compensate for the loss of maternal Ube3a in AS and so potentially holds promise for targeted therapies for both AS and PWS (Figure 3B).
A. Neuronal expression of the paternal ncRNA Ube3a-ATS represses paternal Ube3a gene in cis . DNA methylation of the Snord116 locus on the maternal allele prevents Ube3a-ATS transcription, resulting in Ube3a expression from the maternal allele. Transcriptional repression is indicated by red crosses. B. R-loop-mediated re-activation of silent paternal Ube3a gene provides a targeted therapy for AS. Deletion leading to the loss of maternal Ube3a expression detected in AS is indicated by the red dashed line. Topotecan treatment increases R-loop levels over the Snord116 locus, resulting in chromatin decondensation, inhibition of Pol II transcription through Ube3a-ATS, and increased expression of Ube3a from the paternal allele .
It has previously been proposed that R-loops in trinucleotide expansion diseases could be targeted to suppress repeat expansions or reactivate silenced genes . A recent study provided direct evidence that a small molecule is able to suppress R-loop formation at expanded CGG repeats in the FMR1 gene, thereby preventing FMR1 epigenetic silencing in FXS . As an alternative approach, R-loop levels may be indirectly modulated by treatments that target proteins involved in R-loop biology (Table S1). For instance, genomic instability caused by a widespread increase of R-loops due to loss of an R-loop suppressing protein could potentially be reverted by introduction of an alternative R-loop suppressor.
Recent identification of small-molecule inhibitors for RNase H2 may also provide a powerful new tool for the study of R-loop biology in health and disease . Furthermore, the S9.6 antibody offers new opportunities for research and development. In particular, it has already been used in the development of biosensor systems , detection of miRNA targets , and as a key component of human papillomavirus (HPV) diagnostic kit (Qiagen).
The explosion of studies uncovering the role of R-loops in health and disease in recent years provides the exciting prospect of developing new targeted therapeutics for many human disorders. However, due to the ubiquitous nature of R-loops it will be important to ensure that efficient treatments are specific.
Conclusions and Future Challenges
R-loops have been implicated in many biological processes in different organisms. R-loops can play positive and negative roles in gene expression; they can mediate Ig class switch recombination and transcriptional termination, affect genome stability, transcription, cell cycle progression, and cell viability. Despite the diversity of these biological processes, the molecular mechanisms associated with R-loop formation in mammalian cells remain largely unknown. It is unclear how R-loops can regulate gene expression, how they are maintained and eliminated in the cells, and which proteins are involved in the regulation of these processes.
The connections between R-loops and human diseases suggest that cells have evolved mechanisms to distinguish between deleterious and beneficial R-loops. However, the evidence discussed above raises an important question: how can R-loop dysregulation be mechanistically linked to a variety of human diseases with such diverse pathologies? One explanation may be that R-loops form in many genomic locations in healthy cells , , , . Therefore, unsurprisingly, their dysregulation can affect a large number of disease-associated genes. This is in contrast to gene-specific R-loop pathologies, associated with mutations, which result in altered R-loop levels locally, as observed in the repeat expansion diseases FRDA and FXS ,. Furthermore, R-loops can have different intrinsic properties. R-loops at expanded GAA repeats in the FXN gene are highly stable and trigger transcriptional repression, while R-loops in the highly-expressed γ-Actin gene are easily turned over . This could, in part, be due to differential activity of R-loop processing proteins on different classes of genes, as proposed in yeast . Adding another layer of complexity, the formation of R-loops can be influenced by cell type , cell cycle stage , gene length, and/or GC content and transcriptional level , . Epigenetic marks including DNA methylation and post-translational histone modifications can contribute to further modulation of R-loop levels , . Thus, R-loops represent cellular structures that share the same elementary composition, but may possess different dynamic properties, which can be affected by any of the aforementioned processes, thus explaining the wide range of diseases associated with R-loops.
Despite the lack of mechanistic insights into R-loop-associated diseases, some common themes, underlying their pathology, are already becoming obvious. First, there is a strong connection between R-loop dysregulation and induction of DNA damage and loss of genome integrity, which contributes to cancer development , , , repeat expansion diseases , and neurodegeneration , . Secondly, R-loops can mediate changes in transcription locally or globally, contributing to pathologies associated with repeat expansion diseases , , , Angelman syndrome , and cancer . However, it is a strong possibility that both of these pathological themes may overlap in many disorders, as observed in repeat expansion diseases , , , and novel disease themes may be revealed in the future.
One of the major challenges in R-loop field is to investigate the causes and consequences of R-loop formation in additional models of human disease. Uncovering further aspects of R-loop biology in human cells will certainly shed light on many basic biological questions and have major implications for our understanding of human disease. Future studies will undoubtedly reveal more diseases associated with R-loop dysregulation and will provide the basis for novel therapeutic approaches targeting these so far overlooked structures in gene expression.
Proteins implicated in R-loop biology. For multiprotein complexes, only subunits directly implicated in R-loop biology are mentioned in the table. *Asterisk indicates that protein association with R-loops is based on in vitro evidence.
We thank Prof. P. Cook, Prof. N. J. Proudfoot, Prof. C. Norbury and L. M. Silva for critically reading the manuscript.
- 1. Thomas M, White RL, Davis RW (1976) Hybridization of RNA to double-stranded DNA: formation of R-loops. Proc Natl Acad Sci U S A 73: 2294–2298. doi: 10.1073/pnas.73.7.2294
- 2. White RL, Hogness DS (1977) R loop mapping of the 18S and 28S sequences in the long and short repeating units of Drosophila melanogaster rDNA. Cell 10: 177–192. doi: 10.1016/0092-8674(77)90213-6
- 3. Drolet M, Bi X, Liu LF (1994) Hypernegative supercoiling of the DNA template during transcription elongation in vitro. J Biol Chem 269: 2068–2074.
- 4. Aguilera A, Garcia-Muse T (2012) R loops: from transcription byproducts to threats to genome stability. Mol Cell 46: 115–124. doi: 10.1016/j.molcel.2012.04.009
- 5. Hamperl S, Cimprich KA (2014) The contribution of co-transcriptional RNA∶DNA hybrid structures to DNA damage and genome instability. DNA Repair (Amst) 19: 84–94. doi: 10.1016/j.dnarep.2014.03.023
- 6. Skourti-Stathaki K, Proudfoot NJ (2014) A double-edged sword: R loops as threats to genome integrity and powerful regulators of gene expression. Genes Dev 28: 1384–1396. doi: 10.1101/gad.242990.114
- 7. Wahba L, Gore SK, Koshland D (2013) The homologous recombination machinery modulates the formation of RNA-DNA hybrids and associated chromosome instability. Elife 2: e00505.
- 8. Boguslawski SJ, Smith DE, Michalak MA, Mickelson KE, Yehle CO, et al. (1986) Characterization of monoclonal antibody to DNA.RNA and its application to immunodetection of hybrids. J Immunol Methods 89: 123–130. doi: 10.1016/0022-1759(86)90040-2
- 9. Phillips DD, Garboczi DN, Singh K, Hu Z, Leppla SH, et al. (2013) The sub-nanomolar binding of DNA-RNA hybrids by the single-chain Fv fragment of antibody S9.6. J Mol Recognit 26: 376–381. doi: 10.1002/jmr.2284
- 10. Hu Z, Zhang A, Storz G, Gottesman S, Leppla SH (2006) An antibody-based microarray assay for small RNA detection. Nucleic Acids Res 34: e52. doi: 10.1093/nar/gkl142
- 11. El Hage A, French SL, Beyer AL, Tollervey D (2010) Loss of Topoisomerase I leads to R-loop-mediated transcriptional blocks during ribosomal RNA synthesis. Genes Dev 24: 1546–1558. doi: 10.1101/gad.573310
- 12. Mischo HE, Gomez-Gonzalez B, Grzechnik P, Rondon AG, Wei W, et al. (2011) Yeast Sen1 helicase protects the genome from transcription-associated instability. Mol Cell 41: 21–32. doi: 10.1016/j.molcel.2010.12.007
- 13. Sun Q, Csorba T, Skourti-Stathaki K, Proudfoot NJ, Dean C (2013) R-loop stabilization represses antisense transcription at the Arabidopsis FLC locus. Science 340: 619–621. doi: 10.1126/science.1234848
- 14. Becherel OJ, Yeo AJ, Stellati A, Heng EY, Luff J, et al. (2013) Senataxin plays an essential role with DNA damage response proteins in meiotic recombination and gene silencing. PLoS Genet 9: e1003435. doi: 10.1371/journal.pgen.1003435
- 15. Yeo AJ, Becherel OJ, Luff JE, Cullen JK, Wongsurawat T, et al. (2014) R-Loops in Proliferating Cells but Not in the Brain: Implications for AOA2 and Other Autosomal Recessive Ataxias. PLoS ONE 9: e90219. doi: 10.1371/journal.pone.0090219
- 16. Skourti-Stathaki K, Proudfoot NJ, Gromak N (2011) Human senataxin resolves RNA/DNA hybrids formed at transcriptional pause sites to promote Xrn2-dependent termination. Mol Cell 42: 794–805. doi: 10.1016/j.molcel.2011.04.026
- 17. Ginno PA, Lim YW, Lott PL, Korf I, Chedin F (2013) GC skew at the 5′ and 3′ ends of human genes links R-loop formation to epigenetic regulation and transcription termination. Genome Res 23: 1590–1600. doi: 10.1101/gr.158436.113
- 18. Ginno PA, Lott PL, Christensen HC, Korf I, Chedin F (2012) R-loop formation is a distinctive characteristic of unmethylated human CpG island promoters. Mol Cell 45: 814–825. doi: 10.1016/j.molcel.2012.01.017
- 19. Xu B, Clayton DA (1996) RNA-DNA hybrid formation at the human mitochondrial heavy-strand origin ceases at replication start sites: an implication for RNA-DNA hybrids serving as primers. EMBO J 15: 3135–3143.
- 20. Itoh T, Tomizawa J (1980) Formation of an RNA primer for initiation of replication of ColE1 DNA by ribonuclease H. Proc Natl Acad Sci U S A 77: 2450–2454. doi: 10.1073/pnas.77.5.2450
- 21. Yu K, Chedin F, Hsieh CL, Wilson TE, Lieber MR (2003) R-loops at immunoglobulin class switch regions in the chromosomes of stimulated B cells. Nat Immunol 4: 442–451. doi: 10.1038/ni919
- 22. Chan YA, Aristizabal MJ, Lu PY, Luo Z, Hamza A, et al. (2014) Genome-Wide Profiling of Yeast DNA∶RNA Hybrid Prone Sites with DRIP-Chip. PLoS Genet 10: e1004288. doi: 10.1371/journal.pgen.1004288
- 23. Tous C, Aguilera A (2007) Impairment of transcription elongation by R-loops in vitro. Biochem Biophys Res Commun 360: 428–432. doi: 10.1016/j.bbrc.2007.06.098
- 24. Huertas P, Aguilera A (2003) Cotranscriptionally formed DNA∶RNA hybrids mediate transcription elongation impairment and transcription-associated recombination. Mol Cell 12: 711–721. doi: 10.1016/j.molcel.2003.08.010
- 25. Nakama M, Kawakami K, Kajitani T, Urano T, Murakami Y (2012) DNA-RNA hybrid formation mediates RNAi-directed heterochromatin formation. Genes Cells 17: 218–233. doi: 10.1111/j.1365-2443.2012.01583.x
- 26. Castellano-Pozo M, Santos-Pereira JM, Rondon AG, Barroso S, Andujar E, et al. (2013) R loops are linked to histone H3 S10 phosphorylation and chromatin condensation. Mol Cell 52: 583–590. doi: 10.1016/j.molcel.2013.10.006
- 27. Yang Y, McBride KM, Hensley S, Lu Y, Chedin F, et al. (2014) Arginine methylation facilitates the recruitment of TOP3B to chromatin to prevent R loop accumulation. Mol Cell 53: 484–497. doi: 10.1016/j.molcel.2014.01.011
- 28. Santos-Pereira JM, Herrero AB, Garcia-Rubio ML, Marin A, Moreno S, et al. (2013) The Npl3 hnRNP prevents R-loop-mediated transcription-replication conflicts and genome instability. Genes Dev 27: 2445–2458. doi: 10.1101/gad.229880.113
- 29. Sordet O, Redon CE, Guirouilh-Barbat J, Smith S, Solier S, et al. (2009) Ataxia telangiectasia mutated activation by transcription- and topoisomerase I-induced DNA double-strand breaks. EMBO Rep 10: 887–893. doi: 10.1038/embor.2009.97
- 30. Li X, Manley JL (2005) Inactivation of the SR protein splicing factor ASF/SF2 results in genomic instability. Cell 122: 365–378. doi: 10.1016/j.cell.2005.06.008
- 31. Stirling PC, Chan YA, Minaker SW, Aristizabal MJ, Barrett I, et al. (2012) R-loop-mediated genome instability in mRNA cleavage and polyadenylation mutants. Genes Dev 26: 163–175. doi: 10.1101/gad.179721.111
- 32. Helmrich A, Ballarino M, Tora L (2011) Collisions between replication and transcription complexes cause common fragile site instability at the longest human genes. Mol Cell 44: 966–977. doi: 10.1016/j.molcel.2011.10.013
- 33. Tuduri S, Crabbe L, Conti C, Tourriere H, Holtgreve-Grez H, et al. (2009) Topoisomerase I suppresses genomic instability by preventing interference between replication and transcription. Nat Cell Biol 11: 1315–1324. doi: 10.1038/ncb1984
- 34. Wahba L, Amon JD, Koshland D, Vuica-Ross M (2011) RNase H and multiple RNA biogenesis factors cooperate to prevent RNA∶DNA hybrids from generating genome instability. Mol Cell 44: 978–988. doi: 10.1016/j.molcel.2011.10.017
- 35. Herrera-Moyano E, Mergui X, Garcia-Rubio ML, Barroso S, Aguilera A (2014) The yeast and human FACT chromatin-reorganizing complexes solve R-loop-mediated transcription-replication conflicts. Genes Dev 28: 735–748. doi: 10.1101/gad.234070.113
- 36. Cerritelli SM, Crouch RJ (2009) Ribonuclease H: the enzymes in eukaryotes. FEBS J 276: 1494–1505. doi: 10.1111/j.1742-4658.2009.06908.x
- 37. Boule JB, Zakian VA (2007) The yeast Pif1p DNA helicase preferentially unwinds RNA DNA substrates. Nucleic Acids Res 35: 5809–5818. doi: 10.1093/nar/gkm613
- 38. Shin JH, Kelman Z (2006) The replicative helicases of bacteria, archaea, and eukarya can unwind RNA-DNA hybrid substrates. J Biol Chem 281: 26914–26921. doi: 10.1074/jbc.m605518200
- 39. Moreira MC, Klur S, Watanabe M, Nemeth AH, Le Ber I, et al. (2004) Senataxin, the ortholog of a yeast RNA helicase, is mutant in ataxia-ocular apraxia 2. Nat Genet 36: 225–227. doi: 10.1038/ng1303
- 40. Anheim M, Monga B, Fleury M, Charles P, Barbot C, et al. (2009) Ataxia with oculomotor apraxia type 2: clinical, biological and genotype/phenotype correlation study of a cohort of 90 patients. Brain 132: 2688–2698. doi: 10.1093/brain/awp211
- 41. Chen YZ, Bennett CL, Huynh HM, Blair IP, Puls I, et al. (2004) DNA/RNA helicase gene mutations in a form of juvenile amyotrophic lateral sclerosis (ALS4). Am J Hum Genet 74: 1128–1135. doi: 10.1086/421054
- 42. Suraweera A, Lim Y, Woods R, Birrell GW, Nasim T, et al. (2009) Functional role for senataxin, defective in ataxia oculomotor apraxia type 2, in transcriptional regulation. Hum Mol Genet 18: 3384–3396. doi: 10.1093/hmg/ddp278
- 43. Steinmetz EJ, Warren CL, Kuehner JN, Panbehi B, Ansari AZ, et al. (2006) Genome-wide distribution of yeast RNA polymerase II and its control by Sen1 helicase. Mol Cell 24: 735–746. doi: 10.1016/j.molcel.2006.10.023
- 44. West S, Gromak N, Proudfoot NJ (2004) Human 5′→3′ exonuclease Xrn2 promotes transcription termination at co-transcriptional cleavage sites. Nature 432: 522–525. doi: 10.1038/nature03035
- 45. Suraweera A, Becherel OJ, Chen P, Rundle N, Woods R, et al. (2007) Senataxin, defective in ataxia oculomotor apraxia type 2, is involved in the defense against oxidative DNA damage. J Cell Biol 177: 969–979. doi: 10.1083/jcb.200701042
- 46. Yuce O, West SC (2013) Senataxin, defective in the neurodegenerative disorder ataxia with oculomotor apraxia 2, lies at the interface of transcription and the DNA damage response. Mol Cell Biol 33: 406–417. doi: 10.1128/mcb.01195-12
- 47. Alzu A, Bermejo R, Begnis M, Lucca C, Piccini D, et al. (2012) Senataxin associates with replication forks to protect fork integrity across RNA-polymerase-II-transcribed genes. Cell 151: 835–846. doi: 10.1016/j.cell.2012.09.041
- 48. Richard P, Feng S, Manley JL (2013) A SUMO-dependent interaction between Senataxin and the exosome, disrupted in the neurodegenerative disease AOA2, targets the exosome to sites of transcription-induced DNA damage. Genes Dev 27: 2227–2232. doi: 10.1101/gad.224923.113
- 49. Vantaggiato C, Bondioni S, Airoldi G, Bozzato A, Borsani G, et al. (2011) Senataxin modulates neurite growth through fibroblast growth factor 8 signalling. Brain 134: 1808–1828. doi: 10.1093/brain/awr084
- 50. Powell WT, Coulson RL, Gonzales ML, Crary FK, Wong SS, et al. (2013) R-loop formation at Snord116 mediates topotecan inhibition of Ube3a-antisense and allele-specific chromatin decondensation. Proc Natl Acad Sci U S A 110: 13938–13943. doi: 10.1073/pnas.1305426110
- 51. Helmrich A, Ballarino M, Nudler E, Tora L (2013) Transcription-replication encounters, consequences and genomic instability. Nat Struct Mol Biol 20: 412–418. doi: 10.1038/nsmb.2543
- 52. Chon H, Sparks JL, Rychlik M, Nowotny M, Burgers PM, et al. (2013) RNase H2 roles in genome integrity revealed by unlinking its activities. Nucleic Acids Res 41: 3130–3143. doi: 10.1093/nar/gkt027
- 53. Cerritelli SM, Frolova EG, Feng C, Grinberg A, Love PE, et al. (2003) Failure to produce mitochondrial DNA results in embryonic lethality in Rnaseh1 null mice. Mol Cell 11: 807–815. doi: 10.1016/s1097-2765(03)00088-1
- 54. Rydberg B, Game J (2002) Excision of misincorporated ribonucleotides in DNA by RNase H (type 2) and FEN-1 in cell-free extracts. Proc Natl Acad Sci U S A 99: 16654–16659. doi: 10.1073/pnas.262591699
- 55. Nick McElhinny SA, Kumar D, Clark AB, Watt DL, Watts BE, et al. (2010) Genome instability due to ribonucleotide incorporation into DNA. Nat Chem Biol 6: 774–781. doi: 10.1038/nchembio.424
- 56. Reijns MA, Rabe B, Rigby RE, Mill P, Astell KR, et al. (2012) Enzymatic removal of ribonucleotides from DNA is essential for mammalian genome integrity and development. Cell 149: 1008–1022. doi: 10.1016/j.cell.2012.04.011
- 57. Lazzaro F, Novarina D, Amara F, Watt DL, Stone JE, et al. (2012) RNase H and postreplication repair protect cells from ribonucleotides incorporated in DNA. Mol Cell 45: 99–110. doi: 10.1016/j.molcel.2011.12.019
- 58. Arana ME, Kerns RT, Wharey L, Gerrish KE, Bushel PR, et al. (2012) Transcriptional responses to loss of RNase H2 in Saccharomyces cerevisiae. DNA Repair (Amst) 11: 933–941. doi: 10.1016/j.dnarep.2012.09.006
- 59. Crow YJ, Leitch A, Hayward BE, Garner A, Parmar R, et al. (2006) Mutations in genes encoding ribonuclease H2 subunits cause Aicardi-Goutieres syndrome and mimic congenital viral brain infection. Nat Genet 38: 910–916. doi: 10.1038/ng1842
- 60. Crow YJ, Rehwinkel J (2009) Aicardi-Goutieres syndrome and related phenotypes: linking nucleic acid metabolism with autoimmunity. Hum Mol Genet 18: R130–136. doi: 10.1093/hmg/ddp293
- 61. Crow YJ, Hayward BE, Parmar R, Robins P, Leitch A, et al. (2006) Mutations in the gene encoding the 3′-5′ DNA exonuclease TREX1 cause Aicardi-Goutieres syndrome at the AGS1 locus. Nat Genet 38: 917–920. doi: 10.1038/ng1845
- 62. Rice GI, Kasher PR, Forte GM, Mannion NM, Greenwood SM, et al. (2012) Mutations in ADAR1 cause Aicardi-Goutieres syndrome associated with a type I interferon signature. Nat Genet 44: 1243–1248. doi: 10.1038/ng.2414
- 63. Rice GI, Bond J, Asipu A, Brunette RL, Manfield IW, et al. (2009) Mutations involved in Aicardi-Goutieres syndrome implicate SAMHD1 as regulator of the innate immune response. Nat Genet 41: 829–832. doi: 10.1038/ng.373
- 64. Lee-Kirsch MA, Wolf C, Gunther C (2014) Aicardi-Goutieres syndrome: a model disease for systemic autoimmunity. Clin Exp Immunol 175: 17–24. doi: 10.1111/cei.12160
- 65. Rabe B (2013) Aicardi-Goutieres syndrome: clues from the RNase H2 knock-out mouse. J Mol Med (Berl) 91: 1235–1240. doi: 10.1007/s00109-013-1061-x
- 66. Hiller B, Achleitner M, Glage S, Naumann R, Behrendt R, et al. (2012) Mammalian RNase H2 removes ribonucleotides from DNA to maintain genome integrity. J Exp Med 209: 1419–1426. doi: 10.1084/jem.20120876
- 67. Bubeck D, Reijns MA, Graham SC, Astell KR, Jones EY, et al. (2011) PCNA directs type 2 RNase H activity on DNA replication and repair substrates. Nucleic Acids Res 39: 3652–3666. doi: 10.1093/nar/gkq980
- 68. Rigby RE, Webb LM, Mackenzie KJ, Li Y, Leitch A, et al. (2014) RNA∶DNA hybrids are a novel molecular pattern sensed by TLR9. EMBO J 33: 542–558. doi: 10.1002/embj.201386117
- 69. Lopez Castel A, Cleary JD, Pearson CE (2010) Repeat instability as the basis for human diseases and as a potential target for therapy. Nat Rev Mol Cell Biol 11: 165–170. doi: 10.1038/nrm2854
- 70. Grabczyk E, Mancuso M, Sammarco MC (2007) A persistent RNA.DNA hybrid formed by transcription of the Friedreich ataxia triplet repeat in live bacteria, and by T7 RNAP in vitro. Nucleic Acids Res 35: 5351–5359. doi: 10.1093/nar/gkm589
- 71. Lin Y, Dent SY, Wilson JH, Wells RD, Napierala M (2010) R loops stimulate genetic instability of CTG.CAG repeats. Proc Natl Acad Sci U S A 107: 692–697. doi: 10.1073/pnas.0909740107
- 72. McIvor EI, Polak U, Napierala M (2010) New insights into repeat instability: role of RNA*DNA hybrids. RNA Biol 7: 551–558. doi: 10.4161/rna.7.5.12745
- 73. Reddy K, Tam M, Bowater RP, Barber M, Tomlinson M, et al. (2011) Determinants of R-loop formation at convergent bidirectionally transcribed trinucleotide repeats. Nucleic Acids Res 39: 1749–1762. doi: 10.1093/nar/gkq935
- 74. Belotserkovskii BP, Mirkin SM, Hanawalt PC (2013) DNA sequences that interfere with transcription: implications for genome function and stability. Chem Rev 113: 8620–8637. doi: 10.1021/cr400078y
- 75. Groh M, Lufino MM, Wade-Martins R, Gromak N (2014) R-loops associated with triplet repeat expansions promote gene silencing in Friedreich ataxia and fragile X syndrome. PLoS Genet 10: e1004318. doi: 10.1371/journal.pgen.1004318
- 76. Loomis EW, Sanz LA, Chedin F, Hagerman PJ (2014) Transcription-Associated R-Loop Formation across the Human FMR1 CGG-Repeat Region. PLoS Genet 10: e1004294. doi: 10.1371/journal.pgen.1004294
- 77. Colak D, Zaninovic N, Cohen MS, Rosenwaks Z, Yang WY, et al. (2014) Promoter-bound trinucleotide repeat mRNA drives epigenetic silencing in fragile X syndrome. Science 343: 1002–1005. doi: 10.1126/science.1245831
- 78. Haeusler AR, Donnelly CJ, Periz G, Simko EA, Shaw PG, et al. (2014) C9orf72 nucleotide repeat structures initiate molecular cascades of disease. Nature 507: 195–200. doi: 10.1038/nature13124
- 79. Hanahan D, Weinberg RA (2011) Hallmarks of cancer: the next generation. Cell 144: 646–674. doi: 10.1016/j.cell.2011.02.013
- 80. Negrini S, Gorgoulis VG, Halazonetis TD (2010) Genomic instability–an evolving hallmark of cancer. Nat Rev Mol Cell Biol 11: 220–228. doi: 10.1038/nrm2858
- 81. Chernikova SB, Razorenova OV, Higgins JP, Sishc BJ, Nicolau M, et al. (2012) Deficiency in mammalian histone H2B ubiquitin ligase Bre1 (Rnf20/Rnf40) leads to replication stress and chromosomal instability. Cancer Res 72: 2111–2119. doi: 10.1158/0008-5472.can-11-2209
- 82. Bhatia V, Barroso SI, Garcia-Rubio ML, Tumini E, Herrera-Moyano E, et al. (2014) BRCA2 prevents R-loop accumulation and associates with TREX-2 mRNA export factor PCID2. Nature 511: 362–365. doi: 10.1038/nature13374
- 83. Jackson BR, Noerenberg M, Whitehouse A (2014) A novel mechanism inducing genome instability in Kaposi's sarcoma-associated herpesvirus infected cells. PLoS Pathog 10: e1004098. doi: 10.1371/journal.ppat.1004098
- 84. Sikdar N, Banerjee S, Zhang H, Smith S, Myung K (2008) Spt2p defines a new transcription-dependent gross chromosomal rearrangement pathway. PLoS Genet 4: e1000290. doi: 10.1371/journal.pgen.1000290
- 85. San Filippo J, Sung P, Klein H (2008) Mechanism of eukaryotic homologous recombination. Annu Rev Biochem 77: 229–257. doi: 10.1146/annurev.biochem.77.061306.125255
- 86. Chaudhuri J, Tian M, Khuong C, Chua K, Pinaud E, et al. (2003) Transcription-targeted DNA deamination by the AID antibody diversification enzyme. Nature 422: 726–730. doi: 10.1038/nature01574
- 87. Robbiani DF, Nussenzweig MC (2013) Chromosome translocation, B cell lymphoma, and activation-induced cytidine deaminase. Annu Rev Pathol 8: 79–103. doi: 10.1146/annurev-pathol-020712-164004
- 88. Ruiz JF, Gomez-Gonzalez B, Aguilera A (2011) AID induces double-strand breaks at immunoglobulin switch regions and c-MYC causing chromosomal translocations in yeast THO mutants. PLoS Genet 7: e1002009. doi: 10.1371/journal.pgen.1002009
- 89. Duquette ML, Pham P, Goodman MF, Maizels N (2005) AID binds to transcription-induced structures in c-MYC that map to regions associated with translocation and hypermutation. Oncogene 24: 5791–5798. doi: 10.1038/sj.onc.1208746
- 90. Burns MB, Lackey L, Carpenter MA, Rathore A, Land AM, et al. (2013) APOBEC3B is an enzymatic source of mutation in breast cancer. Nature 494: 366–370. doi: 10.1038/nature11881
- 91. Kulis M, Esteller M (2010) DNA methylation and cancer. Adv Genet 70: 27–56. doi: 10.1016/b978-0-12-380866-0.60002-2
- 92. Wongsurawat T, Jenjaroenpun P, Kwoh CK, Kuznetsov V (2011) Quantitative model of R-loop forming structures reveals a novel level of RNA-DNA interactome complexity. Nucleic Acids Res 40: e16. doi: 10.1093/nar/gkr1075
- 93. Hynes NE, Stoelzle T (2009) Key signalling nodes in mammary gland development and cancer: Myc. Breast Cancer Res 11: 210. doi: 10.1186/bcr2406
- 94. Balk B, Maicher A, Dees M, Klermund J, Luke-Glaser S, et al. (2013) Telomeric RNA-DNA hybrids affect telomere-length dynamics and senescence. Nat Struct Mol Biol 20: 1199–1205. doi: 10.1038/nsmb.2662
- 95. Pfeiffer V, Crittin J, Grolimund L, Lingner J (2013) The THO complex component Thp2 counteracts telomeric R-loops and telomere shortening. EMBO J 32: 2861–2871. doi: 10.1038/emboj.2013.217
- 96. De Amicis A, Piane M, Ferrari F, Fanciulli M, Delia D, et al. (2011) Role of senataxin in DNA damage and telomeric stability. DNA Repair (Amst) 10: 199–209. doi: 10.1016/j.dnarep.2010.10.012
- 97. Wheelhouse RT, Chaires JB (2010) Drug binding to DNA×RNA hybrid structures. Methods Mol Biol 613: 55–70. doi: 10.1007/978-1-60327-418-0_4
- 98. Shaw NN, Arya DP (2008) Recognition of the unique structure of DNA∶RNA hybrids. Biochimie 90: 1026–1039. doi: 10.1016/j.biochi.2008.04.011
- 99. Shaw NN, Xi H, Arya DP (2008) Molecular recognition of a DNA∶RNA hybrid: sub-nanomolar binding by a neomycin-methidium conjugate. Bioorg Med Chem Lett 18: 4142–4145. doi: 10.1016/j.bmcl.2008.05.090
- 100. Cassidy SB, Dykens E, Williams CA (2000) Prader-Willi and Angelman syndromes: sister imprinted disorders. Am J Med Genet 97: 136–146. doi: 10.1002/1096-8628(200022)97:2<136::aid-ajmg5>3.0.co;2-v
- 101. Meng L, Person RE, Beaudet AL (2012) Ube3a-ATS is an atypical RNA polymerase II transcript that represses the paternal expression of Ube3a. Hum Mol Genet 21: 3001–3012. doi: 10.1093/hmg/dds130
- 102. White R, Saxty B, Large J, Kettleborough CA, Jackson AP (2013) Identification of small-molecule inhibitors of the ribonuclease H2 enzyme. J Biomol Screen 18: 610–620. doi: 10.1177/1087057113476550
- 103. Sipova H, Zhang S, Dudley AM, Galas D, Wang K, et al. (2010) Surface plasmon resonance biosensor for rapid label-free detection of microribonucleic acid at subfemtomole level. Anal Chem 82: 10110–10115. doi: 10.1021/ac102131s
- 104. Qavi AJ, Kindt JT, Gleeson MA, Bailey RC (2011) Anti-DNA∶RNA antibodies and silicon photonic microring resonators: increased sensitivity for multiplexed microRNA detection. Anal Chem 83: 5949–5956. doi: 10.1021/ac201340s
- 105. Gomez-Gonzalez B, Garcia-Rubio M, Bermejo R, Gaillard H, Shirahige K, et al. (2011) Genome-wide function of THO/TREX in active genes prevents R-loop-dependent replication obstacles. EMBO J 30: 3106–3119. doi: 10.1038/emboj.2011.206