Base J, β-D-glucosyl-hydroxymethyluracil, is a chromatin modification of thymine in the nuclear DNA of flagellated protozoa of the order Kinetoplastida. In Trypanosoma brucei, J is enriched, along with histone H3 variant (H3.V), at sites involved in RNA Polymerase (RNAP) II termination and telomeric sites involved in regulating variant surface glycoprotein gene (VSG) transcription by RNAP I. Reduction of J in T. brucei indicated a role of J in the regulation of RNAP II termination, where the loss of J at specific sites within polycistronic gene clusters led to read-through transcription and increased expression of downstream genes. We now demonstrate that the loss of H3.V leads to similar defects in RNAP II termination within gene clusters and increased expression of downstream genes. Gene derepression is intensified upon the subsequent loss of J in the H3.V knockout. mRNA-seq indicates gene derepression includes VSG genes within the silent RNAP I transcribed telomeric gene clusters, suggesting an important role for H3.V in telomeric gene repression and antigenic variation. Furthermore, the loss of H3.V at regions of overlapping transcription at the end of convergent gene clusters leads to increased nascent RNA and siRNA production. Our results suggest base J and H3.V can act independently as well as synergistically to regulate transcription termination and expression of coding and non-coding RNAs in T. brucei, depending on chromatin context (and transcribing polymerase). As such these studies provide the first direct evidence for histone H3.V negatively influencing transcription elongation to promote termination.
Trypanosoma brucei is an early-diverged parasitic protozoan that causes African sleeping sickness in humans. The genome of T. brucei is organized into polycistronic gene clusters that contain multiple genes that are co-transcribed from a single promoter. Because of this genome arrangement, it is thought that all gene regulation in T. brucei occurs after transcription at the level of RNA (processing, stability, and translation). We have recently described the presence of a modified DNA base J and variant of histone H3 (H3.V) at transcription termination sites within gene clusters where the loss of base J leads to read-through transcription and the expression of downstream genes. We now find that H3.V also promotes termination prior to the end of gene clusters, thus regulating the transcription of specific genes. Additionally, H3.V inhibits transcription of siRNA producing loci. Our data suggest H3.V and base J are utilized for regulating gene expression via terminating transcription within polycistronic gene arrays and regulating the synthesis of siRNAs in trypanosomes. These findings significantly expand our understanding of epigenetic regulatory mechanisms underlying transcription termination in eukaryotes, including divergent organisms that utilize polycistronic transcription, providing the first example of a histone variant that promotes transcription termination.
Citation: Reynolds D, Hofmeister BT, Cliffe L, Alabady M, Siegel TN, Schmitz RJ, et al. (2016) Histone H3 Variant Regulates RNA Polymerase II Transcription Termination and Dual Strand Transcription of siRNA Loci in Trypanosoma brucei. PLoS Genet 12(1): e1005758. https://doi.org/10.1371/journal.pgen.1005758
Editor: Luisa Figueiredo, Universidade de Lisboa Instituto de Medicina Molecular, PORTUGAL
Received: June 30, 2015; Accepted: December 1, 2015; Published: January 21, 2016
Copyright: © 2016 Reynolds et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All sequencing data discussed in this publication have been deposited in NCBI’s Gene Expression Omnibus and are accessible through GEO Series accession number GSE70229 and GSE69929.
Funding: This work was supported by the National Institutes of Health [grant number AI109108](to RS); and funding by the Young Investigator Program of the Research Center of Infectious Diseases (ZINF) of the University of Wuerzburg, Germany; German Research Foundation DFG [SI 1610/2-1] and Human Frontier Science Program (to TNS); funding by UGA Office of the Vice President for Research (to RJS). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Kinetoplastids are early-diverged protozoa that include the human parasites Trypanosoma brucei, Trypanosoma cruzi, and Leishmania major, which cause African sleeping sickness, Chagas disease, and leishmaniasis, respectively. The genomes of kinetoplastids are arranged into long gene clusters, or polycistronic transcription units (PTUs), which are transcribed by RNA polymerase (RNAP) II [1–3]. RNAP II transcription initiation and termination occurs at regions flanking PTUs called divergent strand switch regions (dSSRs) and convergent strand switch regions (cSSRs), respectively . Pre-messenger RNAs (mRNA) are processed to mature mRNA with the addition of a 5’ spliced leader sequence through trans-splicing, followed by 3’ polyadenylation [5–10]. The arrangement of genes into PTUs has led to the assumption that transcription is an unregulated process in these eukaryotes and a model in which gene regulation occurs strictly post-transcriptionally [11, 12]. However, specific chromatin marks have been characterized at sites of transcription initiation and termination, including histone variants and modified DNA base J, which could function to regulate polycistronic transcription and gene expression [13–17].
Base J, β-D-glucosyl-hydroxymethyluracil, is a modified DNA base consisting of O-linked glycosylation of thymine in the genome of kinetoplastids and closely related unicellular flagellates [18, 19]. Whilst J is largely a telomeric modification, it is also found internally within chromosomes at RNAP II transcription initiation and termination sites [13, 20–25]. As reviewed in Borst and Sabatini (2008), analysis of RNAP I transcribed telomeric polycistronic units in T. brucei led to the discovery of base J [20, 26]. Regulation of the ~15 telomeric variant surface glycoprotein expression sites (VSG ESs) allows the parasite to evade the host immune system in a process called antigenic variation [27, 28]. Although the genome of T. brucei has over 1,000 VSG genes, only one VSG is expressed at a given time. This is achieved through regulated transcription of the telomeric ESs, only one of which is productively transcribed at any time. The association of the modified base with silent ESs in the bloodstream life-cycle stage of the parasite has led to the hypothesis that J plays a role in the regulation of antigenic variation. However, no direct evidence has been provided.
Base J is synthesized in a two-step pathway in which a thymidine hydroxylase, JBP1 or JBP2, hydroxylates thymidine residues at specific positions in DNA to form hydroxymethyluracil, followed by the transfer of glucose to hydroxymethyluracil by the glucosyltransferase, JGT [26, 29, 30]. JBP1 and JBP2 belong to the TET/JBP subfamily of dioxygenases, which require Fe2+ and 2-oxoglutarate for activity [31–34]. The synthesis of base J can be inhibited by competitive inhibition of the thymidine hydroxylase domain of JBP1 and JBP2 by dimethyloxalylglycine (DMOG), a structural analog of 2-oxoglutarate [31, 35]. Removal of both JBP1 and JBP2 or the JGT also results in T. brucei cells devoid of base J [29–31, 36].
The co-localization of base J with modified and variant histones at dSSRs and cSSRs suggested a functional role of modified DNA in the regulation of RNAP II transcription . Our work in T. cruzi described a unique role of J in regulating RNAP II transcription initiation, where the loss of base J resulted in the formation of more active chromatin, increased RNAP II recruitment and increased PTU transcription rate [24, 37]. Recent studies have described a role for base J regulating RNAP II termination in T. brucei and Leishmania. van Luenen et al. (2012) found that reduction of base J in L. tarentolae is associated with the generation of RNAs downstream of the cSSR that are antisense to the genes on the opposing gene cluster . Reduction of base J in L. major resulted in similar defects . Strand-specific RT-PCR detection of the nascent transcript confirmed that the J-dependent generation of RNAs downstream of the cSSR is due to read-through transcription at cSSR termination sites. In contrast, loss of J in T. brucei failed to indicate any defect in termination at cSSRs . However, we localized base J at sites within PTUs where the loss of J led to read-through transcription and upregulated expression of downstream genes. Therefore, base J is required for RNAP II termination in both Leishmania and T. brucei, but to different degrees and at different locations. In L. major, J regulates termination at the end of each PTU to prevent read-through transcription and the generation of RNAs antisense to the genes on the opposing PTU. In contrast, although termination occurs at the end of each PTU in T. brucei in a J-independent manner, J-dependent termination within a PTU allows developmentally regulated expression of downstream genes.
The core histones H2A, H2B, H3 and H4, package DNA into nucleosomes and represent a critical component of higher order chromatin. All core histones have variant counterparts. Although histone post-translational modifications (PTMs) and their impact on transcription have been well documented, less is known about the role of histone variants in the regulation of transcription . The most understood are variants of H2A and H3. Several variants of H2A exist, including H2A.Z, H2A.B, and macroH2A. Both H2A.Z and H2A.B are associated with transcriptional activation [39–41]. Knockdown of H2A.Z inhibits transcriptional activation [42–44]. Consistent with this, and perhaps the most direct evidence of a transcriptional role of a histone variant, H2A.Z positively correlates with rates of RNAP II elongation, such that the reduction of H2A.Z increases RNAP II stalling . Presumably, the nucleosome destabilizing effect of H2A.Z  leads to more accessible DNA at promoter regions for transcription factor binding, as well as promoting RNAP II elongation through gene bodies. Like H2A.Z, H2A.B is enriched at promoter regions and its reduction largely results in the downregulation of gene expression [39, 46]. In contrast, macroH2A is enriched at transcriptionally repressed regions  and its reduction results in increased gene expression in an unknown mechanism . Several H3 variants have also been characterized, including H3.3 and CENP-A, both of which are found in most eukaryotes including plants, mammals, and yeast. H3.3 differs from canonical H3 by only 4–5 amino acids and is found predominately at actively transcribed genes, forming more accessible nucleosomes [49–51]. H3.3 also provides a genome stabilization function at repetitive regions such as telomeres and centromeres [49, 52–55]. Recent studies have implicated H3.3 in the maintenance of a repressed chromatin structure [56–58]. Evidence in mouse embryonic stem cells indicates H3.3 is enriched at lowly transcribed developmentally regulated genes where it promotes polycomb repressive complex 2 activity, which catalyzes the formation of the repressive modification H3K27me3 [57, 58]. These findings suggest H3.3 maintains the promoters of developmentally regulated genes in a repressed, but transcriptionally “poised” state important for proper differentiation. H3.3 has also been implicated in the maintenance of H3K9me3 at endogenous retroviral elements in mouse embryonic stem cells . Presence of H3.3 (and H3K9me3) at endogenous retroviral elements repressed retrotransposition and expression of adjacent genes . The centromeric specific histone variant has a well-characterized role in kinetochore formation, but its role in the regulation of transcription, if any, remains unknown . Overall, although much progress has been achieved in the characterization of histone variants, few studies have revealed a direct link between histone variant function and transcriptional regulation.
The J-independent nature of termination at cSSRs in T. brucei led us to characterize the role of H3.V in regulating RNAP II termination. H3.V and base J co-localize at RNAP II termination sites in T. brucei, including cSSRs and PTU internal termination sites [13, 14]. H3.V and base J also co-localize at telomeric repeats involved in regulating RNAP I transcription of the VSG expression sites [14, 60]. T. brucei H3.V shares 45% sequence identity with canonical H3, much of the sequence divergence lying within the N-terminus, outside of the histone fold domain. H3.V appears to be unique to kinetoplastids [14, 60] and aside from its localization to termination sites and telomeres, very little is known about H3.V and its potential role in the regulation of transcription termination. We demonstrate here that, similar to phenotypes associated with the loss of J, loss of H3.V leads to defects in RNAP II termination within gene clusters and increased expression of downstream genes. Interestingly, many of the gene expression changes in the H3.V knockout (KO) are further increased upon the subsequent loss of base J, suggesting that J and H3.V have independent but overlapping roles in regulating transcription termination in T. brucei. Although the loss of H3.V from cSSRs did not indicate any termination defects leading to transcription of the opposing strand of the adjacent convergent gene cluster, it does lead to increased generation of small interfering RNAs (siRNAs) that map to regions of overlapping transcription. Analysis of nascent RNA suggests this is due to increased transcription of the dual strand siRNA loci at cSSRs. We also detect increased expression of VSGs from silent VSG ESs in the H3.V KO, indicating H3.V can act independently in regulating telomeric repression and antigenic variation. Overall these findings provide the first known example of a histone H3 variant that functions as a repressive chromatin mark to promote transcription termination, in this case repressing both mRNAs and non-coding RNAs.
H3.V regulates RNAP II transcription at cSSRs
The co-localization of base J and H3.V at RNAP II termination sites in T. brucei prompted us to examine the role of H3.V in transcription termination. High-throughput sequencing of small RNAs has been shown previously to reveal transcription termination sites in trypanosomatids, as reflected in RNA degradation products [25, 35]. The reduction of base J in L. major by treatment with DMOG resulted in the production of antisense small RNAs corresponding to genes in the opposing PTU due to read-through transcription at cSSRs . In contrast, we found no evidence of termination defects in T. brucei at cSSRs following DMOG treatment and the complete loss of J. Antisense small RNAs, indicative of read-through transcription at cSSRs into the downstream PTU, were not increased following the loss of J . We now show that the loss of H3.V also does not result in read-through transcription at cSSRs. No significant changes in antisense small RNAs corresponding to read-through transcription at cSSRs into the downstream PTU were detected by small RNA-seq in the H3.V KO compared to wild type (WT) T. brucei (Fig 1A) (small RNA sequencing data discussed in this publication have been deposited in NCBI’s Gene Expression Omnibus and are accessible through GEO Series accession number GSE70229). Subsequent loss of base J in the H3.V KO parasites, via DMOG treatment, also failed to uncover any defect. These results suggest H3.V is not required to prevent RNAP II read-through transcription at the end of convergent gene arrays in T. brucei.
(A) Small RNA-sequencing reads for three representative cSSRs are shown (11.9, 11.4, and 10.9; where cSSR 11.9 refers to the ninth termination site on chromosome 11) where H3.V loss does not lead to read-through transcription, but does lead to increased siRNAs. Small RNA reads are plotted as reads per million reads mapped (RPM). ORFs and the genomic location (kb) are shown above the graphs. WT: wild type; KO: H3.V KO; KO+DMOG: H3.V KO + DMOG. Blue: top strand; red: bottom strand. (B and C) Length distribution of small RNAs. (B) Length distribution of small RNAs from cSSR 11.9. Shown is the RPM for each RNA length observed in cSSR 11.9, a site with a statistically significant increase in 21-27nt RNAs in the H3.V KO (S2 Table). The average of three independent small RNA-seq experiments is plotted. White bars, WT; black bars, H3.V KO. Error bars represent the standard deviation. (C) Length distribution of small RNAs genome-wide. The RPM for each RNA length observed in the entire small RNA-seq data set is shown. Data are plotted as in B.
Unique to T. brucei, large peaks of sense and antisense small RNAs map to regions of overlapping transcription at the ends of convergent PTUs at cSSRs (; Fig 1A). These presumably represent the previously characterized Dicer 2-dependent Argonaute-associated siRNAs derived from cSSRs . Consistent with this, we show that the RNAs that map to these regions correspond to the previously characterized siRNA size range in bloodstream form T. brucei (21-27nt) [61–63] and exhibit characteristic phasing of siRNAs biogenesis, as indicated by the mapping of siRNA sequences in phased intervals at the target loci [64–66] (Fig 1B and S1 Fig). This pattern suggests that these siRNAs were enzymatically processed, most likely by the DCR complex , as opposed to random RNA degradation. siRNAs that map to cSSRs are 21-27nt compared to the 18-30nt range of small RNAs genome-wide (Fig 1B and 1C). Although the loss of base J in WT T. brucei parasites has no effect on siRNA levels , the level of siRNAs derived from the cSSRs significantly increased in the H3.V KO (Fig 1A). To confirm these changes, we repeated the small RNA-seq analysis of WT and H3.V KO in triplicate. Quantitation of the small RNAs associated with cSSRs genome-wide indicates a statistically significant increase in 21-27nt RNAs at 30 out of 72 cSSRs in the H3.V KO (Fig 1B and S2 Table). This can be visualized by specifically mapping the siRNA size range of small RNAs (S1A Fig). The increase in siRNAs is not restricted to cSSRs, but occurs at all previously characterized siRNA generating regions of the T. brucei genome [61, 62]; including the SLACS and ingi retrotransposable elements, CIR147 centromeric repeats, and inverted repeats (S2 Fig). These regions are also enriched with H3.V [14, 68].
Small RNA-seq suggests that the siRNA mapping to dual strand transcription regions at cSSRs is due to continued RNAP II transcription of the convergent PTU resulting in overlapping transcription. An example that illustrates this is shown in Fig 2A–2D, where H3.V (and J) is found enriched at cSSR 2.5. Presumably, H3.V interferes with RNAP II elongation in these dual strand transcription regions. Removal of H3.V would then lead to increased nascent RNA corresponding to these regions (siRNA precursor) that is then processed, resulting in increased siRNA levels that map to both strands (Figs 1 and 2D, S1A and S2 Table). We have previously utilized strand-specific RT-PCR to follow read-through transcription and increased nascent RNA production in L. major and T. brucei . Here, primers were designed that span the poly(A) site of the final gene in the PTU, allowing us to specifically detect nascent, unprocessed RNA within cSSR 2.5 (Fig 2E). Strand-specific RT-PCR results indicate an increase in nascent RNA in the H3.V KO at dual strand transcribed loci at cSSR 2.5 as well as cSSR 1.4 (Fig 2F and 2G). Although no significant increase in siRNAs was observed in DMOG treated WT T. brucei , we do detect a slight increase in nascent RNA following the loss of base J at cSSR 2.5 (Fig 2F and 2G). Nascent RNA is further increased following the loss of both H3.V and J at the two cSSRs analyzed. Overall these results indicate that H3.V, and to a lesser extent J, attenuate transcription elongation within dual strand transcribed loci at cSSRs, potentially enabling regulated expression of siRNAs derived from these sites. Loss of H3.V does not lead to read-through transcription downstream of the dual strand transcribed loci however.
A region on chromosome 2 (950–975 kb) representing cSSR 2.5 is shown where H3.V regulates transcription. (A) Base J and H3.V co-localize at sites of RNAP II termination within cSSRs [13, 14]. H3.V ChIP-seq reads and base J IP-seq reads are plotted as reads per million reads (RPM), as previously described [14, 35]. (B) ORFs are shown with the top strand in blue and the bottom strand in red. (C) mRNA-seq reads from wild type T. brucei are plotted as RPM. Reads that mapped to the top strand are shown in blue and reads that mapped to the bottom strand in red. (D) Small RNA-seq reads from WT and H3.V KO are mapped as described in Fig 1A. (E-G) Strand-specific RT-PCR analysis of nascent RNA in cSSRs. (E) Schematic representation (not to scale) of primer location and direction at cSSR 2.5 (primers shown as green arrows in this and all subsequent figures). The arrowhead below the line indicates the poly(A) processing site for the final gene in the PTU. (F) Strand-specific RT-PCR analysis. cDNA was synthesized using the reverse primer. PCR was performed using the same reverse primer to make the cDNA plus the forward primer, as indicated. Data is also presented for an additional cSSR on chromosome 1 (cSSR 1.4, 635–637 kb). Wild type: WT; Wild type+DMOG: -J; H3.V KO: KO; H3.V KO+DMOG: KO-J. 40S ribosomal protein S11 provides a positive control and minus RT (-RT) negative control is shown. (G) Nested qPCR. Primers were designed within the PCR reaction in F to use in subsequent qPCR analysis. White bars: Wild type; grey bars: Wild type+DMOG; dark grey bars: H3.V KO; black bars: H3.V KO+DMOG. All products were normalized to 40S ribosomal protein S11. The average of three independent strand-specific RT-PCR nested qPCR experiments is plotted. Error bars represent the standard deviation. P values were calculated using Student’s t test. *, p value ≤ 0.05; **, p value ≤ 0.01.
H3.V inhibits RNAP II elongation within PTUs
We have recently shown that base J is present along with H3.V at termination sites within a PTU where J loss results in read-through transcription and increased expression of downstream genes in T. brucei . Because RNAP II elongation and gene expression is inhibited prior to the end of these PTUs, we refer to this as PTU internal termination. To explore the impact of H3.V on RNAP II elongation at these PTU internal termination sites we analyzed the downstream genes by RT-qPCR in the H3.V KO cell line. At three representative PTU internal termination sites we detect increased expression of downstream genes in the H3.V KO (Fig 3A–3D and S3A–S3D Fig). In each of these cases gene derepression is enhanced upon the subsequent loss of J in the H3.V KO following DMOG treatment (Fig 3D and S3D Fig). Importantly, derepression is limited to genes downstream or within the peak of H3.V/base J. Strand specific RT-PCR using oligos flanking the termination site (based on mRNA-seq and base J localization [13, 35]) detects increased RNA in the H3.V KO (Fig 3E–3F and S3F Fig). This is consistent with an increase in nascent read-through RNA resulting from continued transcription elongation at PTU internal termination sites marked by base J and H3.V. Consistent with the gene expression changes, read-through is visibly enhanced at region 7.3 upon the subsequent loss of J in the H3.V KO following DMOG treatment (Fig 3F). These results, combined with our previous study of J regulation of termination , suggest J and H3.V have independent and overlapping roles in regulating RNAP II termination through the inhibition of transcription elongation and the expression of downstream genes in T. brucei.
A region on chromosome 3 (617–670 kb) representing cSSR 3.3 and chromosome 7 (453–525 kb) representing cSSR 7.3 is shown where H3.V regulates transcription of a cluster of genes. (A-C) Base J and H3.V co-localize at sites of RNAP II termination within a PTU. H3.V ChIP-seq reads and base J IP-seq reads, ORFs, and mRNA-seq reads from wild type T. brucei are plotted for cSSR 3.3 (left) and cSSR 7.3 (right) as described in Fig 2. (D) RT-qPCR analysis of genes numbered according to the ORF maps above in panel B. As described in Fig 2G, white bars: Wild type; grey bars: Wild type+DMOG; dark grey bars: H3.V KO; black bars: H3.V KO+DMOG. Transcripts were normalized against 40S ribosomal protein S11, and are plotted as the average and standard deviation of three replicates. P values were calculated using Student’s t test. *, p value ≤ 0.05; **, p value ≤ 0.01. The silent gene cluster at cSSR 7.3 consists of nine highly similar retrotransposon hot spot protein genes, therefore the primers used to analyze gene 2 also amplify the additional upstream genes. (E and F) Strand-specific RT-PCR analysis of read-through transcription of the two cSSRs analyzed in A-D. Above each panel is a schematic representation (not to scale) of primer location and direction at a transcription termination site (TTS). The vertical arrow indicates the proposed TTS as described in the text . The long solid arrow indicates the direction of transcription and the dashed arrow indicates read-through transcription past the TTS. cDNA was synthesized using the reverse primer (relative to transcription). PCR was performed using the same reverse primer to make the cDNA plus the forward primer, as indicated. 40S ribosomal protein S11 provides a positive control and a minus RT (-RT) negative control is shown.
To further explore the role of H3.V in the regulation of termination and whether H3.V functions similarly across the genome, we performed mRNA-seq to compare the expression profiles of WT, WT+DMOG, H3.V KO and H3.V KO+DMOG cells (GEO accession number GSE69929). This led to the detection of 153 mRNAs that are increased at least 2-fold in one or more of the treatments (S1 Table). Many of the gene expression changes have been confirmed by RT-qPCR (S4 Fig and below). Consistent with our previous mRNA-seq results, in WT cells treated with DMOG we observe similar increases in the expression of genes downstream of base J (and H3.V), which we previously demonstrated is caused by an RNAP II transcription termination defect within a PTU . However, we now see that a significant number of genes downstream of J/H3.V within other PTUs are upregulated following the loss of H3.V, and that many are further increased following the subsequent loss of J (Fig 4, S1 and S4 Tables). In the H3.V KO we identified 71 genes that are upregulated (Fig 5A and S1 Table). Although many of these genes are not increased by 2-fold in the WT+DMOG condition, some respond at least slightly to the loss of J in WT cells: 28 of the 71 genes upregulated in the H3.V KO are also increased at least 1.3-fold in the WT+DMOG condition, suggesting J and H3.V have overlapping functions in regulating termination at these sites, where H3.V plays a dominant role. Consistent with this, 42 of the 71 H3.V regulated genes are upregulated even further upon subsequent loss of base J in the H3.V KO. This trend is evident in the heatmap shown in Fig 5A. A specific example is shown in Fig 5B–5F where a cluster of genes is affected by the loss of H3.V, and to a lesser extent base J. Both chromatin marks are enriched upstream of and within this gene cluster, which consists of genes annotated as VSG pseudogenes, an atypical VSG, and a hypothetical protein. mRNA-seq indicates gene upregulation is largest in the absence of H3.V and J, which we confirmed by RT-qPCR (Fig 5E and 5F). An additional example is shown in S5 Fig Chromosome maps indicating the genomic location of upregulated genes upon the loss of H3.V and J are shown in S6 Fig. These results also confirm our initial analyses of nascent and steady-state RNA indicating a role for H3.V (and J) regulating termination and expression of downstream genes (Fig 3 and S3 Fig).
The average reads per kilobase per million reads mapped (RPKM) of triplicate mRNA-seq libraries is plotted on a log2 scale. Genes differentially expressed by 2-fold or more in T. brucei following the loss of base J and/or H3.V fall above or below the dotted lines. Red dots indicate genes that are adjacent to H3.V (S1 Table). Only mRNAs with an RPKM≥1 are included.
(A) Heatmap of genes upregulated in the H3.V KO. For the list of genes represented on the heatmap see S1 Table. Clustering of genes at the top indicate those that are further upregulated upon loss of base J in the H3.V KO. J and H3.V columns indicate whether each gene is located within 10 kb of the modification (filled black box), as described in the Materials and Methods section. (B-D) H3.V/J localization, gene map, and mRNA-seq reads plotted for a gene cluster on chromosome 7 at cSSR 7.7 (position 1750–1800 kb shown) is illustrated as described in Fig 3. (E) Plot of the mRNA-seq data for the genes indicated (numbered) in the ORF map. The average RPKM of triplicate mRNA-seq libraries was used to determine fold changes, with wild type set to 1. Error bars indicate the standard deviation between mRNA-seq replicates and p values, determined in Cuffdiff, are indicated by asterisks: *, p value ≤ 0.05; **, p value ≤ 0.01. (F) RT-qPCR analysis of gene expression for the indicated genes (according to the ORF map) as described in Fig 3D. White bars: Wild type; grey bars: Wild type+DMOG; dark grey bars: H3.V KO; black bars: H3.V KO+DMOG. P values were calculated using Student’s t test. *, p value ≤ 0.05; **, p value ≤ 0.01.
80% of the affected genes are found within 10 kb of H3.V and base J (see S1 Table and S7 Fig) and thus fit a model where derepression occurs as a result of deregulated transcription elongation/termination within a PTU following the loss of the two chromatin marks. In support of this model, strand-specific RT-PCR analysis links gene derepression with increased nascent RNA production downstream of J/H3.V marks within the PTU (Figs 3E–3F and S3F). In a few cases, genes that are further than 10 kb downstream of base J and H3.V are still within a regulated cluster, explaining why some genes are indicated as not adjacent to J/H3.V but can still be regulated by these epigenetic marks (e.g. genes Tb427.07.6730-Tb427.07.6780, S1 and S4 Tables). Although clusters of genes downstream of a J and H3.V enriched region are similarly upregulated in many cases, the 153 upregulated genes localize to 91 different PTUs (S4 Table). Therefore, H3.V and J repress gene expression genome-wide, often repressing the expression of a single gene, usually the last gene, within a PTU. Overall, the data indicate that base J and H3.V have independent but overlapping roles in regulating RNAP II transcription termination within specific PTUs and enable regulated expression of downstream genes.
Although the majority of the differentially expressed genes are upregulated, we also see some downregulated (S1 Table). 35 genes are downregulated by at least 2-fold in the H3.V KO cells. Unlike the upregulated genes, where many were increased furthest following the combined loss of J and H3.V, only 7 of the 35 downregulated genes decreased more in the H3.V KO+DMOG condition. However, 33 of the 35 downregulated genes are found proximal (within 10kb) to H3.V and J. The effect is also locus specific and mainly restricted to arrays of genes transcribed from the same strand. These points indicate a strong link between H3.V/J localization and gene expression changes observed. A few examples are indicated in S8 Fig. Among the genes with most significant downregulation upon loss of H3.V were procyclins and procyclin associated genes (PAGs), which include surface protein encoding genes most highly expressed during the (procyclic) insect stage of the parasite . Although PAGs are located in multiple copies in the genome, within RNAP II transcribed arrays or within RNAP I transcribed procyclin arrays [69, 70], the PAGs downregulated following the loss of H3.V (PAG1, PAG2, PAG4 and PAG5) are specifically arranged in an RNAP I transcribed array. For example, there are two PAG2 genes located on chromosome 10, one in an RNAP II transcribed PTU and the other in the RNAP I transcribed procyclin locus. The only gene that is significantly downregulated when H3.V is deleted is the one within the RNAP I procyclin locus. A similar locus specific alteration of PAG expression was seen upon the depletion of histone H1 . Interestingly, the PAGs within this locus undergo overlapping RNAP II and RNAP I transcription, i.e. continued RNAP II transcription of the upstream opposing PTU produces antisense PAG RNAs . This suggests a possible mechanism of PAG (and procyclin) downregulation resulting from increased formation of dsRNAs upon the loss of H3.V (see discussion). Similarly, other downregulated genes are arranged in opposing transcriptional genes pairs where extended RNAP II transcription would result in dsRNA for each mRNA (S8 Fig). We also identified 25 genes that are downregulated specifically in the H3.V KO+DMOG condition, though 22 of these genes are not located near H3.V or J. We therefore assume many of these changes are an indirect effect of genes that are upregulated in this cell line. For example, we have demonstrated that the genome-wide increase in RNAP II transcription in T. cruzi results in a global increase in gene expression that includes proteins that degrade specific mRNAs .
H3.V regulates expression of RNAP I transcribed VSG genes
In addition to RNAP II termination sites, H3.V co-localizes with base J at telomeres, which in the T. brucei genome contain the RNAP I transcribed polycistronic units involved in antigenic variation (so called VSG expression sites, ESs) (Fig 6A) [27, 28]. mRNA-seq results indicate that the deletion of H3.V leads to increased expression of VSG genes from silent ESs, which we have confirmed by RT-qPCR (S1 Table and Fig 6B). Although VSGs within expression sites are not affected by the loss of J in WT cells, five of the VSGs are further upregulated upon the loss of J in the H3.V KO (S1 Table and Fig 6B).
(A) A schematic diagram of the silent ES15 (not to scale). The box with stripes represents the 70 bp repeats. Numbers indicate ESAG genes. Grey box represents the VSG pseudogene 11 (Tb427.BES126.13). (B-C) mRNA-seq and RT-qPCR analysis of the indicated VSG genes in silent expression sites. As described in Figs 3D and 5E, white bars: Wild type; grey bars: Wild type+DMOG; dark grey bars: H3.V KO; black bars: H3.V KO+DMOG. For mRNA-seq analysis, p values determined by Cuffdiff are indicated by asterisks: *, p value ≤ 0.05; **, p value ≤ 0.01. For RT-qPCR analysis, p values were calculated using Student’s t test. *, p value ≤ 0.05; **, p value ≤ 0.01.
mRNA-seq indicates several expression site associated genes (ESAGs) within silent ESs with differential expression. Because the repetitive nature of ESAGs complicates read alignment of mRNA-seq data, we analyzed several genes by RT-qPCR using primers that are specific to ES ESAGs (S9 Fig). Consistent with mRNA-seq results, with the apparent exception of ESAG8, most of the ESAGs are not upregulated in the H3.V KO (S9 Fig). The lack of significant change in the majority of ESAGs within the ESs suggests derepression of telomere-proximal VSG genes after H3.V depletion is not due to transcriptional activation of silent promoters. Repression of silent ESs is mediated in part by the inhibition of RNAP I elongation within the ES preventing the production of VSG mRNA from the silent ESs . Similar to its inhibition of RNAP II transcription elongation at termination sites within PTUs genome-wide, H3.V may function at telomeric regions to attenuate transcription elongation within the silent ESs, thereby preventing transcription of silent VSGs. To further investigate this possibility and determine how telomere-proximal and telomere distal genes are affected after loss of H3.V, we used RT-qPCR to compare the derepression of a unique gene in the silent ES15 that is located 10 kb (VSG pseudogene 11, Tb427.BES126.13) and 1 kb (VSG11) upstream of the telomere (Fig 6; ). Although VSG11 was upregulated in the H3.V KO, we found that the VSG pseudogene is not significantly derepressed in the H3.V KO or upon the loss of H3.V and J (Fig 6B). These results overall suggest H3.V is involved in the repression of VSGs in silent ESs, but does not significantly impact silencing of the entire ES, similar to the role of telomere localized TbRAP1 . ESAG8 derepression is consistent with ES derepression from the promoter, however upregulation of multiple ESAGs would be expected if derepression of the silent ESs occurs from the promoter, which was observed after ISWI depletion . Although we cannot rule out possible alterations in VSG switching in the H3.V KO (see discussion), the data here suggest H3.V is involved in repressing RNAP I transcription of VSGs within the silent ESs as well as RNAP II transcription of VSG genes within genome internal PTUs.
H3.V is a kinetoplastid-specific H3 variant and appears to be the only H3 variant found in these early-diverged eukaryotes. The T. brucei H3.V shares only 45% sequence identity with the canonical H3 . Although H3.V localizes to centromeres, it is not essential for viability and does not contain sequence variations common to all identified centromeric H3 variants [14, 60, 68]. Aside from its localization to RNAP II termination sites and telomeres, the functional significance of H3.V and its potential role in the regulation of RNAP II termination has been unexplored. We have demonstrated that H3.V negatively regulates transcription elongation and promotes RNAP II termination in T. brucei. Several lines of evidence support this conclusion. At many of the same sites within gene clusters where we have previously shown J regulates transcription termination and expression of downstream genes, we detect similar increases in the expression of genes downstream of the termination site following the loss of H3.V. Also similar to J loss, detection of increased nascent, unprocessed RNA by strand-specific RT-PCR at these sites supports the conclusion that the loss of H3.V leads to read-through transcription. Furthermore, the loss of H3.V from regions where dual strand transcription naturally occurs at cSSRs leads to increased levels of siRNAs, and strand-specific RT-PCR indicates this is due to increased transcription of the cSSR. These findings overall suggest H3.V imparts a repressive chromatin structure that is refractory to transcription elongation, or potentially recruits other repressive factors or transcription termination factors at RNAP II termination sites. To our knowledge, this is the first example of a histone H3 variant that has been shown to repress the expression of mRNAs and non-coding RNAs by promoting transcription termination.
These results extend our previous findings that base J functions to prevent read-through transcription at termination sites within gene clusters in T. brucei, revealing an overlapping role of J and H3.V in the regulation of transcription termination. However, H3.V appears to have a broader and in many cases a more dominant role. In this study mRNA-seq indicated 71 genes were upregulated by 2-fold or more following the loss of H3.V. 39% of those genes were also affected by at least 1.3-fold following the loss of J alone, and 59% were further increased upon the subsequent loss of base J in the H3.V KO. Thus H3.V and J appear to function similarly, but independently in the regulation of transcription termination. Although we cannot exclude a potential role of H3.V (and J) in the regulation of RNA processing, the increase in both unprocessed (nascent) and processed RNAs (mRNAs and siRNAs) strongly suggests H3.V regulates RNA abundance at the level of transcription and that the defects we observe are not simply due to an alteration of RNA processing. We propose a model in which both H3.V and base J inhibit RNAP II elongation, and therefore stimulate termination at sites within gene clusters (S10 Fig). According to this model, the loss of J within a gene cluster results in read-through transcription and expression of genes that were previously silent, but the presence of H3.V within the downstream cSSR prevents increased dual strand transcription and thus siRNAs are not significantly increased. Similarly, the loss of H3.V leads to read-through transcription at termination sites within a gene cluster and subsequent gene derepression. H3.V loss also results in increased transcription of dual strand transcribed loci at cSSRs, giving rise to more siRNAs. Therefore, base J is a chromatin modification that specifically regulates the expression of a subset of genes in the bloodstream form of T. brucei parasites, whereas H3.V regulates a similar, but larger subset of genes, in addition to the generation of siRNAs.
Recent description of the trypanosome stress response has indicated that the location of a gene within a PTU can impact its expression, presumably via regulated transcription elongation . Here we provide evidence that regulated transcription and expression of genes within PTUs can be achieved through their spatial organization and position relative to H3.V and base J. However, the biological significance of the gene expression changes we describe following the loss of these chromatin marks, remains unclear. It does not help that the majority of the regulated genes are annotated as hypothetical proteins of unknown function. Many of the H3.V and base J regulated genes include VSGs, ESAGs, RHS proteins, and pseudogenes that are normally lowly expressed (or not at all) in wild type T. brucei. Interestingly, consistent with base J synthesis, VSGs and ESAGs are developmentally regulated, typically exclusively expressed in bloodstream form trypanosomes from the telomeric PTU (VSG ESs). Monoallelic expression of a VSG ES leads to the expression of a single VSG on the surface of the parasite, a key aspect of trypanosome antigenic variation. Therefore the repression of silent VSGs by H3.V/J allows the parasite to maintain this monoallelic expression. Another important aspect of antigenic variation is the periodic switching of the VSG protein expressed on the surface allowing the parasite to remain a step ahead of the host immune response. DNA recombination (i.e. gene conversion events) of silent VSG genes into the active ES is the dominant driver of trypanosome antigenic variation. Transcription of a donor DNA sequence has been shown to increase its use during gene conversion events in human cells . It has also been demonstrated that active transcription in T. brucei stimulates DNA recombination . Therefore, the regulation of transcription of silent VSGs by H3.V/J, including VSGs in silent telomeric ESs and VSG pseudogenes at the end of genomic internal PTUs, could play a role in gene conversion events. It is well characterized that during late phases of mammalian infection, trypanosomes predominately express mosaic VSGs comprised of multiple VSGs and pseudogenes . These findings thus raise the possibility that regulated transcription of silent VSGs by H3.V/J, in particular VSG pseudogenes at internal PTUs, contributes to gene conversion events that result in the formation of these mosaic VSGs.
If H3.V and base J are utilized to effect specific gene expression changes, then mechanisms likely exist to overcome the silencing effects of these modifications, i.e. regulated addition and/or removal. Histone chaperones and chromatin remodeling complexes incorporate histone variants at specific chromatin locations. Thus, regulation of (unidentified) histone chaperones that incorporate H3.V could enable regulated gene expression. Chromatin remodeling proteins could also be involved in the removal of H3.V. The first step of J synthesis consists of thymidine oxidation by JBP1 and 2, which utilize oxygen and 2-oxoglutarate and require Fe2+ as a cofactor. Changes in oxygen concentrations or metabolic changes could thus impact J synthesis and effect gene expression changes. We previously demonstrated oxygen regulation of JBP1/2 and J synthesis, which led to changes in gene expression and pathogenesis of T. cruzi [24, 31]. JBP1/2 have been shown to have differential chromatin substrates for de novo J synthesis in vivo . Therefore, regulation of JBP1/2, or associated factors, could provide differential regulation of J synthesis at specific loci. Reiterative oxidations of thymidine residues by JBP1/2 , similar to TET mediated oxidation of cytosines , may also contribute to regulated J synthesis at specific loci.
Loss of H3.V also led to derepression of VSG genes within the silent telomeric ESs. We hypothesize, similar to its effect on RNAP II termination within PTUs genome-wide, H3.V localized to telomeric repeats limits basal levels of RNAP I transcription elongation within silent ESs . A similar telomeric VSG derepression effect was observed following the loss of RAP1 in T. brucei . The lack of significant ESAG gene derepression suggests loss of H3.V does not result in derepression from the promoters of silent ESs, though ESAG8 upregulation is consistent with this possibility. Therefore, we acknowledge that further detailed analysis is required, including the use of tagged silent ESs, to fully understand the role of H3.V on ES transcription. Because of our inability to effectively measure VSG switching rates in our H3.V KO cell line, we also cannot exclude the possibility that H3.V restricts VSG switching, though a recent study has indicated that switching frequency does not appear to change significantly in the H3.V KO or upon the loss of H3.V and J (Schulz, Papavasiliou, and Kim, personal communication). While base J has no apparent independent role in telomeric repression or VSG switching, the additional derepression of VSG genes observed in the H3.V KO upon loss of J suggests the novel modified base can act synergistically with H3.V in telomeric silencing and antigenic variation. This function is consistent with the distinct localization of base J in the silent ESs, with J density highest close to the telomeres .
We also found a specific RNAP I transcribed procyclin gene cluster that was downregulated following the loss of J and even more so by the loss of H3.V. As mentioned above, this locus has been shown to undergo overlapping RNAP II and RNAP I transcription in the procyclic form T. brucei . In the procyclic form, transcription from the opposite strand was detectable from GU2 to EP1 (S8 Fig). Increased antisense transcription from the RNAP II PTU on the opposite strand led to antisense RNA and co-transcriptional silencing of PAG genes . In contrast, there was little transcription from the opposing strand in bloodstream forms. The presence of H3.V, and to some extent J, may inhibit this dual strand transcription in bloodstream forms, thus reducing the formation of dsRNA and/or transcriptional interference by RNAP II that could interfere with the expression of the procyclin locus. However, analysis of this locus in procyclic cells has indicated that the loss of Argonaute did not appear to alter the expression of the procyclin genes , thus the role of dsRNAs at this locus, if any, remains unknown. Interestingly, several of the other genes that are downregulated in the H3.V KO are arranged in opposing transcriptional gene pairs where extended RNAP II transcription would result in dsRNA for each mRNA (S8 Fig). Additional studies are needed to characterize the role of H3.V in regulating transcription at these loci. As we described for the effect of base J in T. cruzi , it is also possible that some transcripts are downregulated in the H3.V KO due to secondary effects of derepressed genes, which could include regulatory proteins (destabilizing specific mRNAs).
Future studies are necessary to further elucidate the mechanisms by which H3.V regulates transcription, including what proteins interact with H3.V, the impact of H3.V on nucleosome structure/stability, whether H3.V undergoes any post-translational modifications, and how chromatin structure is affected by the loss of H3.V. It is not clear how sequence variation in H3.V confers its specific localization to transcription termination sites or its function. H3 PTMs have not been well characterized in T. brucei, however those identified by mass spectrometry analyses include S1 and K23 acetylation and K4, K32, and K76 methylation . In comparison to the canonical H3, the T. brucei H3.V differs in that it contains an A1 and R23, and thus lacks the corresponding acetylation. However, differences in PTMs between H3.V and H3 have not been investigated.
Interestingly, the loss of H3.V and base J in T. brucei did not result in read-through transcription that extends into the downstream gene cluster encoded on the opposite strand and generation of antisense RNAs. In L. major the loss of J alone led to such read-through transcription at a majority of the cSSRs sites in the genome  whereas the loss of H3.V had no effect . Overall these findings indicate that the function of epigenetic modifications in kinetoplastid parasites is not necessarily conserved. One obvious difference between T. brucei and L. major is the absence of a complete RNAi pathway in L. major [86, 87]. It would therefore be interesting to investigate the role of H3.V in regulating siRNAs in a Leishmania species with an intact RNAi pathway.
In addition to its role in the regulation of dual strand transcription at cSSRs and the generation of siRNAs, we also find H3.V regulates other characterized siRNA generating loci, including the SLACS and ingi retrotransposable elements, CIR147 centromeric repeats, and inverted repeats. Aside from the SLACS and ingi derived siRNAs, the function of siRNAs in T. brucei is unclear. Mature SLACS and ingi transcripts are present at low levels in WT T. brucei due to the presence of a functional RNAi pathway . Surprisingly, despite the increase in SLACS and ingi siRNAs following the loss of H3.V, we also observe a modest increase in SLACS (Tb427tmp.211.5010) and ingi transcripts by mRNA-seq (S1 Table). Although the relative increase in steady-state level of siRNAs is greater than that of the mature mRNA transcript, presumably the increased transcription of these loci in the absence of H3.V increases both RNA species. Dicer 2 is responsible for the formation of siRNAs derived from cSSRs, and its removal (and corresponding decrease in siRNAs) did not have a significant effect on the expression of genes located at cSSRs that coincide with the siRNA peak , suggesting that cSSR derived siRNAs do not regulate mRNA abundance. Consistent with this, at cSSRs where we detect increased siRNAs we do not observe significant decreases in mRNAs from genes that overlap the siRNA peak. Therefore the function, if any, of cSSR derived siRNAs remains unknown.
In summary, we have provided evidence for the connection between a histone H3 variant and transcription termination for the first time. These findings highlight the importance of chromatin modifications in the regulation of transcription termination, particularly in early-diverged eukaryotes with unique polycistronic transcription. These findings also have direct implications for a strictly post-transcriptional model of gene expression regulation in kinetoplastids.
Materials and Methods
Parasite cell culture
WT and H3.V KO bloodstream form T. brucei 221a cell lines of strain 427 were cultured in HMI-9 medium as described previously . The bloodstream form T. brucei H3.V KO cell line, generated by deleting both H3.V alleles by homologous recombination , was provided by George Cross. DMOG treatment of cells was performed by supplementing media with 1mM DMOG for 5 days as described previously .
Strand-specific RNA-seq library construction
Small RNA-sequencing was performed using two different methods. The analysis of WT, H3.V KO, and H3.V KO+DMOG (Figs 1A, 2D and S2 Fig) was performed as previously described . Briefly, small RNAs were isolated from T. brucei (5x107 cells) using a Qiagen miRNeasy kit according to the manufacturer’s instructions. The small RNA-seq libraries were prepared using approximately 250ng small RNA by Vertis Biotechnology AG, Germany. The small RNA sample was poly(A)-tailed using poly(A) polymerase. Then, the 5'PPP and cap structures were removed using tobacco acid pyrophosphatase (TAP, Epicentre). Afterwards, an RNA adapter was ligated to the 5'-monophosphate of the RNA. First-strand cDNA synthesis was performed using an oligo(dT)-adapter primer and the M-MLV reverse transcriptase. The resulting cDNAs were PCR-amplified to about 10–20 ng/μL using a high fidelity DNA polymerase. The cDNAs were purified using the Agencourt AMPure XP kit (Beckman Coulter Genomics). Quality and concentration of all libraries was determined by capillary electrophoresis and high throughput sequencing was performed on a HiSeq2000 (Illumina). Sequencing reads were mapped to the T. brucei reference genome using Bowtie2 version 2.2.3 with local sensitive mode, all other parameters default,  and further processed using Samtools 1.2 . Reads shorter than 18 bp were discarded before mapping. Genome and gene annotations of strain 427 version 6.0 were downloaded from EuPathDB  and used as the reference in all small RNA-seq analyses. RPM were calculated using a window size of 101 bp and a step size of 101 bp. Total sequence reads and overall alignment rate for all RNA-seq libraries discussed in this publications are listed in S3 Table.
Small RNA-sequencing of the triplicate analysis of WT and H3.V KO (Figs 1B, 1C, S1A and S2 Tables) was performed in a similar manner. Briefly, total RNA was isolated from log phase T. brucei cultures (5x107 cells) using Trizol according to the manufacturer’s instructions. The small RNA-seq libraries were prepared using approximately 250ng total RNA using the Illumina-compatible NEBNext small RNA library preparation kit following the manufacturer protocol (New England Biolabs). Quality and concentration of all libraries was determined using a Bioanalyzer 2100 (Agilent). Libraries were pooled using equi-molar amounts and sequenced on a NextSeq500 (Illumina). Both library construction and sequencing were done at the Georgia Genomics Facility (GFF). Small RNA reads were quality and adapter trimmed using Cutadapt  and reads shorter than 18 nucleotides were discarded. Reads were mapped to the T. brucei reference genome using Bowtie2 version 2.2.3 with the following parameters “-a -D 10 -R 5 -N 1 -L 15 -i S,1,0.50” and further processed using Samtools 1.2 , BEDTools , and custom scripts. RPM shown in Fig 1B and 1C were calculated by dividing the total number of reads in each size class by the total million reads mapped. For Fig 1B, only reads that mapped to the dual strand transcribed region on cSSR 11.9 (3826–3835 kb) were included, whereas Fig 1C includes all mapped reads. Differential expression analysis on small RNA-seq read count data (WT versus H3.V KO) was performed using EdgeR (S2 Table). Significance testing was pairwise using Fisher’s Exact test. Significance was assessed in both the total small RNA-seq reads and in the 21-27nt reads.
For the mRNA-seq, total RNA was isolated from log phase T. brucei cultures (5x107 cells) using Trizol. 12 mRNA-seq libraries were constructed (triplicate WT, WT+DMOG, H3.V KO, and H3.V KO+DMOG) using Illumina TruSeq Stranded RNA LT Kit following the manufacturer’s instructions with limited modifications. The starting quantity of total RNA was adjusted to 1.3 μg, and all volumes were reduced to a third of the described quantity. High throughput sequencing was performed at the Georgia Genomics Facility (GFF) on a NextSeq500 (Illumina). Raw reads from mRNA-seq were first trimmed using Trimmomatic version 0.32 . The single-end reads were trimmed for TruSeq3 adapters; leading and trailing bases with quality less than 15 and reads with average quality less 20 were removed. Finally, any reads shorter than 50 base pair were discarded. Remaining reads were locally aligned to the T. brucei Lister 427 version 9.0 genome, from EuPathDB , using Bowtie2 version 2.2.3 . All settings were default except specifying sensitive local and further processed with Samtools 1.2 . Transcript abundances were computed using the Cufflinks suite version 2.2.1 . For individual replicates, Cuffnorm was used with the library type fr-firststrand flag and the T. brucei Lister 427 version 9.0 annotation (downloaded from EuPathDB ). To estimate gene expression levels for a condition, replicates were used together and analyzed by Cuffdiff with the T. brucei Lister 427 version 9.0 annotation. Default parameters were used except specifying library type fr-firststrand. All p values reported here, determined by Cuffdiff, reflect the FDR-adjusted p value. Correlation coefficients for mRNA-seq replicates of WT, WT+DMOG, and H3.V KO were all greater than 0.96 and H3.V KO+DMOG replicates were greater than 0.91. To express the transcripts levels for individual mRNA encoding genes as shown in S1 Table, we determined the number of reads per kilobase per million reads (RPKM) . Briefly, we counted the number of reads mapped to all annotated transcriptomic features (e.g. mRNA) on the same strand (i.e. sense) and opposite strand (i.e. antisense). Both the sense and antisense read numbers were normalized by length of the feature (in kilobase) and the total number of reads (in millions) mapped to non-structural RNAs in the corresponding library (i.e. number of mappable reads excluding rRNA and tRNA reads). mRNA-seq data shown in Figs 2C, 3C, 5D, S3C and S5C are from our previously published dataset  and are consistent with mRNA-seq performed in this study (see S3 Table for the RNA-seq datasets used in each figure). Genes were considered adjacent to base J and/or H3.V if the gene, according to the T. brucei Lister 427 annotation, overlapped within 10,000 base pairs upstream or downstream of the modification. All J IP-seq and H3.V ChIP-seq data shown here are from previously published work [13, 14]. Fold changes for the heatmaps were computed as (RPKMvar + pseudocount) / (RPKMwt + pseudocount), where pseudocount = 0.5. Once all fold changes were computed, any fold change value above five was set equal to five to improve visualization.
Strand-specific RT-PCR analysis of read through transcription
Total RNA was isolated using the hot phenol method, as described previously . To ensure complete removal of contaminating genomic DNA, purified RNA was treated with Turbo DNase, followed by phenol:chloroform extraction. RNA concentration was determined using a spectrophotometer. Strand specific RT-PCR was performed as previously described . ThermoScript Reverse Transcriptase from Life Technologies was used for cDNA synthesis at 60–65°C. 1–2 μg of RNA were used to make cDNA using a reverse primer as described in the Figure legends. PCR was performed using GoTaq DNA Polymerase from Promega. A minus-RT control was used to ensure no contaminating genomic DNA was amplified. Primer sequences used in the analysis are available upon request.
Reverse transcription quantitative PCR (RT qPCR)
Total RNA was obtained using Qiagen RNeasy kits according to manufacturer’s instructions. First-strand cDNA was synthesized from 1 μg of total RNA using an iScript cDNA synthesis kit (Bio-Rad Laboratories, Hercules, CA) per the manufacturer's instructions. Quantification of selected genes were performed on an iCycler with an iQ5 multicolor real-time PCR detection system (Bio-Rad Laboratories, Hercules, CA). Primer sequences used in the analysis are available upon request. The reaction mixture contained 5 pmol forward and reverse primer, 2x iQ SYBR green super mix (Bio-Rad Laboratories, Hercules, CA), and 2 μl of template cDNA. Standard curves were prepared for each gene using 5-fold dilutions of known quantity (100 ng/μl) of WT DNA. The quantities were calculated using iQ5 optical detection system software.
S1 Fig. Regulation of siRNAs by H3.V.
(A) Mapping of 23-26nt small RNAs to cSSR 11.9 in WT and H3.V KO. (B) Phasing of siRNAs mapping to a cSSR on chromosome 5 in WT cells. Position is indicated in kb. Colors indicate nucleotide: green, A; red, T; blue, C; and orange, G.
S2 Fig. Regulation of siRNAs by H3.V at other loci.
Mapping of small RNAs to SLACS, INGI, CIR147 and IR3.
S3 Fig. Regulation of termination and gene expression by H3.V.
(A-C) Localization of H3.V, J, ORFs, and mRNA-seq reads from wild type T. brucei are plotted for cSSR 9.2 (position 1110–1190 kb is shown). (D-F) Gene expression changes and termination defects are analyzed as described in Fig 3. P values were calculated using Student’s t test. *, p value ≤ 0.05; **, p value ≤ 0.01.
S4 Fig. Confirmation of mRNA-seq transcript changes in T. brucei by RT-qPCR.
RT-qPCR was performed as described in Fig 3D. 5350: Tb427tmp.160.5350; 1960: Tb427.07.1960; 1565: Tb427tmp.02.1565; 1580: Tb427tmp.02.1580; 5384: Tb427.02.5384; and 6820: Tb427.07.6820. P values were calculated using Student’s t test. *, p value ≤ 0.05; **, p value ≤ 0.01.
S5 Fig. Regulation of termination and gene expression by H3.V.
(A-C) Localization of H3.V, J, ORFs and mRNA-seq reads from wild type T. brucei are ploted for cSSR 10.5 (1120–1140). (D) mRNA-seq transcript fold changes of the genes indicated in the ORF map in B, as described in Fig 5E. White bars: Wild type; grey bars: Wild type+DMOG; dark grey bars: H3.V KO; black bars: H3.V KO+DMOG. The fold change in the wild type+DMOG condition for gene 4 is 12.2, with a standard deviation of 4.3 and p value of 0.03.
S6 Fig. Chromosome maps.
Whole chromosomes (Chr. 1–11, T. brucei Lister 427 version 9.0 genome) and the localization of base J (blue), H3.V (red), and mRNA coding genes (black lines; top strand is indicated by a line in the top half of the panel, bottom strand by a line in the bottom half) are shown. Genes on the top strand are transcribed from left to right and those on the bottom strand are transcribed from right to left. Position along each chromosome is indicated in kilobases (KB) or megabases (MB). Bottom two panels: mRNAs found upregulated by at least 2-fold or more in the H3.V KO (top) and H3.V KO+DMOG (bottom) relative to WT are indicated by a green line. Only mRNAs with an RPKM≥1 and significantly differentially expressed relative to wild type, as determined by Cuffdiff, are included. Boxes indicate sites examined in more detail in other figures. Genes are listed in S1 and S4 Tables.
S7 Fig. Enrichment of genes adjacent to H3.V and J following the loss of H3.V and/or J.
(A) Genes were defined as adjacent to H3.V or J if located within 10 kb of an H3.V and J enriched region, respectively, and are indicated in grey. Genes not adjacent to H3.V or J are indicated in white [13, 14]. 2463 (27%) genes are adjacent to H3.V and 2837 (31%) are adjacent to J out of a total of 9266 annotated genes in the T. brucei genome. (B) 153 genes were upregulated in the absence of H3.V and/or J, 122 are adjacent to H3.V and 121 are adjacent to J (80%). 65 genes were downregulated in the absence of H3.V and/or J, 35 are adjacent to H3.V (54%) and 40 are adjacent to J (62%).
S8 Fig. Genomic context of genes downregulated in the H3.V KO.
(A) The EP/PAG1 loci. Small arrow indicates the RNAP I transcription start site in the promoter region. Genes in bold are downregulated in the H3.V KO. Genes in blue are transcribed by RNAP II on the top strand and EP and PAG genes in red are transcribed by RNAP I. MARP: microtubule-associated repetitive protein; EP1-2: procyclin; PAG; procyclin associated gene; T: ‘T region’ encoding transcripts containing small ORFs of <240 bp; GU2: gene of unknown function. The Fig is drawn to scale. (B) Gene cluster on chromosome 6. Genes in bold are downregulated in the H3.V KO and identities are listed in S1 Table.
S9 Fig. RT-qPCR analysis of ES associated ESAGs.
RT-qPCR analysis of the indicated ESAGs was performed as described in Fig 3D. P values were calculated using Student’s t test. *, p value ≤ 0.05; **, p value ≤ 0.01.
S10 Fig. Working model for H3.V regulating RNAP II transcription and mRNA and siRNA expression.
O-linked glycosylation of DNA (base J) is indicated by black line and dot. Nucleosomes are indicated by circles where green represents canonical histones and red represents histone H3 variant and an additional histone variant found at termination sites in T. brucei , histone H4 variant (H4.V). In the H3.V KO the H3.V is replaced with a canonical H3, with no change in nucleosome structure or H4.V, since it is currently unclear what happens to the nucleosome upon the loss of H3.V. According to the model, (A) the loss of base J leads to read-through transcription (indicated by the thicker red arrow) at internal termination sites within the cluster that is once again attenuated once it reaches H3.V within the cSSR. The loss of H3.V leads to read-through transcription at the internal site and continues into the cSSR, thus allowing increased dual strand transcription and generation of siRNAs. (B) At regions without an internal termination site, loss of base J has no effect on dual strand transcription. But, as described above, the loss of H3.V leads to increased transcription at cSSRs and generation of siRNAs.
S1 Table. T. brucei gene expression changes following H3.V and/or J loss.
mRNAs found up or downregulated by 2-fold or more by mRNA-seq are listed, along with available gene descriptions, RPKM, fold change, and an indication of whether the genes are located within 10kb of H3.V and/or J. P values determined by Cuffdiff are listed for each condition compared to WT. For upregulated mRNAs, only those with an RPKM≥1 upon the loss of H3.V and/or J are included. For downregulated mRNAs, only those with an RPKM≥1 in WT cells are included.
S2 Table. Small RNA-seq RPM and statistical significance at cSSRs.
The average small RNA-seq RPM of triplicate libraries at cSSRs is listed. Chromosome number and the 5’ and 3’ position of regions quantified are included. Statistical significance was assessed using pairwise Fisher’s Exact test on both total reads and specifically 21-27bp reads. Yellow highlight indicates significance at a p value ≤ 0.05.
S3 Table. High-throughput sequencing information.
Information about all sequencing experiments performed in this study is listed. Also indicates the figures in which the data are presented.
S4 Table. T. brucei upregulated genes following H3.V and/or J loss.
Similar to S1 Table, but upregulated genes are organized according to their location within PTUs. Genes sharing the same number are located in the same PTU and genes with different numbers are located in different PTUs.
We are grateful to Jessica Lopes da Rosa-Spiegler, Rudo Kieft, Whitney Bullard, and Piet Borst for critical reading of the manuscript. We also thank Nick Rohr for preparing mRNA-seq libraries. This paper is dedicated to the memory of Dr. Laura Cliffe.
Conceived and designed the experiments: DR LC RS. Performed the experiments: DR LC. Analyzed the data: DR BTH MA TNS. Contributed reagents/materials/analysis tools: BTH LC MA TNS RJS. Wrote the paper: DR RS.
- 1. Berriman M, Ghedin E, Hertz-Fowler C, Blandin G, Renauld H, Bartholomeu DC, et al. The genome of the African trypanosome Trypanosoma brucei. Science. 2005; 309:416–22. pmid:16020726
- 2. Jackson AP, Sanders M, Berry A, McQuillan J, Aslett MA, Quail MA, et al. The genome sequence of Trypanosoma brucei gambiense, causative agent of chronic human african trypanosomiasis. PLoS Negl Trop Dis. 2010; 4:e658. pmid:20404998
- 3. El-Sayed NM, Myler PJ, Blandin G, Berriman M, Crabtree J, Aggarwal G, et al. Comparative genomics of trypanosomatid parasitic protozoa. Science. 2005; 309:404–9. pmid:16020724
- 4. Martinez-Calvillo S, Yan S, Nguyen D, Fox M, Stuart K, Myler PJ. Transcription of Leishmania major Friedlin chromosome 1 initiates in both directions within a single region. Mol Cell. 2003; 11:1291–9. pmid:12769852
- 5. Boothroyd JC, Cross GA. Transcripts coding for variant surface glycoproteins of Trypanosoma brucei have a short, identical exon at their 5' end. Gene. 1982; 20:281–9. pmid:7166234
- 6. Van der Ploeg LH, Liu AY, Michels PA, De Lange TD, Borst P, Majumder HK, et al. RNA splicing is required to make the messenger RNA for a variant surface antigen in trypanosomes. Nucleic Acids Res. 1982; 10:3591–604. pmid:6287414
- 7. De Lange T, Liu AY, Van der Ploeg LH, Borst P, Tromp MC, Van Boom JH. Tandem repetition of the 5' mini-exon of variant surface glycoprotein genes: a multiple promoter for VSG gene transcription? Cell. 1983; 34:891–900. pmid:6313212
- 8. Nelson RG, Parsons M, Barr PJ, Stuart K, Selkirk M, Agabian N. Sequences homologous to the variant antigen mRNA spliced leader are located in tandem repeats and variable orphons in Trypanosoma brucei. Cell. 1983; 34:901–9. pmid:6313213
- 9. Sutton RE, Boothroyd JC. Evidence for Trans splicing in trypanosomes. Cell. 1986; 47:527–35. pmid:3022935
- 10. Agabian N. Trans splicing of nuclear pre-mRNAs. Cell. 1990; 61:1157–60. pmid:2142018
- 11. Clayton CE. Life without transcriptional control? From fly to man and back again. EMBO J. 2002; 21:1881–8. pmid:11953307
- 12. Campbell DA, Thomas S, Sturm NR. Transcription in kinetoplastid protozoa: why be normal? Microbes and Infection. 2003; 5:1231–40. pmid:14623019
- 13. Cliffe LJ, Siegel TN, Marshall M, Cross GA, Sabatini R. Two thymidine hydroxylases differentially regulate the formation of glucosylated DNA at regions flanking polymerase II polycistronic transcription units throughout the genome of Trypanosoma brucei. Nucleic Acids Res. 2010; 38:3923–35. pmid:20215442
- 14. Siegel TN, Hekstra DR, Kemp LE, Figueiredo LM, Lowell JE, Fenyo D, et al. Four histone variants mark the boundaries of polycistronic transcription units in Trypanosoma brucei. Genes Dev. 2009; 23:1063–76. pmid:19369410
- 15. Respuela P, Ferella M, Rada-Iglesias A, Aslund L. Histone acetylation and methylation at sites initiating divergent polycistronic transcription in Trypanosoma cruzi. J Biol Chem. 2008; 283:15884–92. pmid:18400752
- 16. Thomas S, Green A, Sturm NR, Campbell DA, Myler PJ. Histone acetylations mark origins of polycistronic transcription in Leishmania major. BMC Genomics. 2009; 10:152. pmid:19356248
- 17. Wright JR, Siegel TN, Cross GA. Histone H3 trimethylated at lysine 4 is enriched at probable transcription start sites in Trypanosoma brucei. Mol Biochem Parasitol. 2010; 172:141–4. pmid:20347883
- 18. van Leeuwen F, Taylor MC, Mondragon A, Moreau H, Gibson W, Kieft R, et al. beta-D-glucosyl-hydroxymethyluracil is a conserved DNA modification in kinetoplastid protozoans and is abundant in their telomeres. Proc Natl Acad Sci U S A. 1998; 95:2366–71. pmid:9482891
- 19. Dooijes D, Chaves I, Kieft R, Dirks-Mulder A, Martin W, Borst P. Base J originally found in kinetoplastid is also a minor constituent of nuclear DNA of Euglena gracilis. Nucleic Acids Res. 2000; 28:3017–21. pmid:10931915
- 20. Gommers-Ampt JH, Van Leeuwen F, de Beer AL, Vliegenthart JF, Dizdaroglu M, Kowalak JA, et al. beta-D-glucosyl-hydroxymethyluracil: a novel modified base present in the DNA of the parasitic protozoan T. brucei. Cell. 1993; 75:1129–36. pmid:8261512
- 21. van Leeuwen F, Kieft R, Cross M, Borst P. Tandemly repeated DNA is a target for the partial replacement of thymine by beta-D-glucosal-hydroxymethyluracil in Trypanosoma brucei. Mol Biochem Parasitol. 2000; 109:133–45. pmid:10960172
- 22. van Leeuwen F, Wijsman ER, Kieft R, van der Marel GA, van Boom JH, Borst P. Localization of the modified base J in telomeric VSG gene expression sites of Trypanosoma brucei. Genes Dev. 1997; 11:3232–41. pmid:9389654
- 23. van Leeuwen F, Wijsman ER, Kuyl-Yeheskiely E, van der Marel GA, van Boom JH, Borst P. The telomeric GGGTTA repeats of Trypanosoma brucei contain the hypermodified base J in both strands. Nucleic Acids Res. 1996; 24:2476–82. pmid:8692684
- 24. Ekanayake DK, Minning T, Weatherly B, Gunasekera K, Nilsson D, Tarleton R, et al. Epigenetic regulation of transcription and virulence in Trypanosoma cruzi by O-linked thymine glucosylation of DNA. Mol Cell Biol. 2011; 31:1690–700. pmid:21321080
- 25. van Luenen HG, Farris C, Jan S, Genest PA, Tripathi P, Velds A, et al. Glucosylated hydroxymethyluracil, DNA base J, prevents transcriptional readthrough in Leishmania. Cell. 2012; 150:909–21. pmid:22939620
- 26. Borst P, Sabatini R. Base J: discovery, biosynthesis, and possible functions. Annu Rev Microbiol. 2008; 62:235–51. pmid:18729733
- 27. Horn D. Antigenic variation in African trypanosomes. Mol Biochem Parasitol. 2014; 195:123–9. pmid:24859277
- 28. Horn D, McCulloch R. Molecular mechanisms underlying the control of antigenic variation in African trypanosomes. Curr Opin Microbiol. 2010; 13:700–5. pmid:20884281
- 29. Bullard W, Lopes da Rosa-Spiegler J, Liu S, Wang Y, Sabatini R. Identification of the glucosyltransferase that converts hydroxymethyluracil to base J in the trypanosomatid genome. J Biol Chem. 2014; 289:20273–82. pmid:24891501
- 30. Sekar A, Merritt C, Baugh L, Stuart K, Myler PJ. Tb927.10.6900 encodes the glucosyltransferase involved in synthesis of base J in Trypanosoma brucei. Mol Biochem Parasitol. 2014; 196:9–11. pmid:25064607
- 31. Cliffe LJ, Hirsch G, Wang J, Ekanayake D, Bullard W, Hu M, et al. JBP1 and JBP2 Proteins Are Fe2+/2-Oxoglutarate-dependent Dioxygenases Regulating Hydroxylation of Thymidine Residues in Trypanosome DNA. J Biol Chem. 2012; 287:19886–95. pmid:22514282
- 32. Tahiliani M, Koh KP, Shen Y, Pastor WA, Bandukwala H, Brudno Y, et al. Conversion of 5-methylcytosine to 5-hydroxymethylcytosine in mammalian DNA by MLL partner TET1. Science. 2009; 324:930–5. pmid:19372391
- 33. Iyer LM, Tahiliani M, Rao A, Aravind L. Prediction of novel families of enzymes involved in oxidative and other complex modifications of bases in nucleic acids. Cell Cycle. 2009; 8:1698–710. pmid:19411852
- 34. Reynolds D, Cliffe L, Sabatini R. 2-Oxoglutarate-dependent hydroxylases involved in DNA base J (β-D-Glucopyranosyloxymethyluracil) synthesis. In: Schofield CJ, Hausinger RP, editors. 2-Oxoglutarate-Dependent Oxygenases. 1. Cambridge, U.K: Royal Society of Chemistry; 2015.
- 35. Reynolds D, Cliffe L, Forstner KU, Hon CC, Siegel TN, Sabatini R. Regulation of transcription termination by glucosylated hydroxymethyluracil, base J, in Leishmania major and Trypanosoma brucei. Nucleic Acids Res. 2014; 42:9717–29. pmid:25104019
- 36. Cliffe LJ, Kieft R, Southern T, Birkeland SR, Marshall M, Sweeney K, et al. JBP1 and JBP2 are two distinct thymidine hydroxylases involved in J biosynthesis in genomic DNA of African trypanosomes. Nucleic Acids Res. 2009; 37:1452–62. pmid:19136460
- 37. Ekanayake D, Sabatini R. Epigenetic regulation of Pol II transcription initiation in Trypanosoma cruzi: Modulation of nucleosome abundance, histone modification and polymerase occupancy by O-linked thymine DNA glucosylation. Eukaryot Cell. 2011; 10:1465–72. pmid:21926332
- 38. Weber CM, Henikoff S. Histone variants: dynamic punctuation in transcription. Genes Dev. 2014; 28:672–82. pmid:24696452
- 39. Tolstorukov Michael Y, Goldman Joseph A, Gilbert C, Ogryzko V, Kingston Robert E, Park Peter J. Histone Variant H2A.Bbd Is Associated with Active Transcription and mRNA Processing in Human Cells. Mol Cell. 2012; 47:596–607. pmid:22795134
- 40. Weber CM, Ramachandran S, Henikoff S. Nucleosomes are context-specific, H2A.Z-modulated barriers to RNA polymerase. Mol Cell. 2014; 53:819–30. pmid:24606920
- 41. Chen Y, Chen Q, McEachin RC, Cavalcoli JD, Yu X. H2A.B facilitates transcription elongation at methylated CpG loci. Genome Res. 2014; 24:570–9. pmid:24402521
- 42. Draker R, Sarcinella E, Cheung P. USP10 deubiquitylates the histone variant H2A.Z and both are required for androgen receptor-mediated gene activation. Nucleic Acids Research. 2011; 39:3529–42. pmid:21245042
- 43. Cuadrado A, Corrado N, Perdiguero E, Lafarga V, Muñoz-Canoves P, Nebreda AR. Essential role of p18Hamlet/SRCAP-mediated histone H2A.Z chromatin incorporation in muscle differentiation. The EMBO Journal. 2010; 29:2014–25. pmid:20473270
- 44. Gévry N, Hardy S, Jacques P-É, Laflamme L, Svotelis A, Robert F, et al. Histone H2A.Z is essential for estrogen receptor signaling. Genes Dev. 2009; 23:1522–33. pmid:19515975
- 45. Bönisch C, Hake SB. Histone H2A variants in nucleosomes and chromatin: more or less stable? Nucleic Acids Research. 2012; 40:10719–41. pmid:23002134
- 46. Chen Y, Chen Q, McEachin R, Cavalcoli J, Yu X. H2A.B facilitates transcription elongation at methylated CpG loci. Genome Res. 2014.
- 47. Gamble MJ, Frizzell KM, Yang C, Krishnakumar R, Kraus WL. The histone variant macroH2A1 marks repressed autosomal chromatin, but protects a subset of its target genes from silencing. Genes Dev. 2010; 24:21–32. pmid:20008927
- 48. Buschbeck M, Uribesalgo I, Wibowo I, Rue P, Martin D, Gutierrez A, et al. The histone variant macroH2A is an epigenetic regulator of key developmental genes. Nat Struct Mol Biol. 2009; 16:1074–9. pmid:19734898
- 49. Mito Y, Henikoff JG, Henikoff S. Genome-scale profiling of histone H3.3 replacement patterns. Nat Genet. 2005; 37:1090–7. pmid:16155569
- 50. Delbarre E, Jacobsen BM, Reiner AH, Sørensen AL, Küntziger T, Collas P. Chromatin Environment of Histone Variant H3.3 Revealed by Quantitative Imaging and Genome-scale Chromatin and DNA Immunoprecipitation. Mol Biol Cell. 2010; 21:1872–84. pmid:20375147
- 51. Ooi SL, Henikoff JG, Henikoff S. A native chromatin purification system for epigenomic profiling in Caenorhabditis elegans. Nucleic Acids Research. 2010; 38:e26–e. pmid:19966274
- 52. Wong LH, Ren H, Williams E, McGhie J, Ahn S, Sim M, et al. Histone H3.3 incorporation provides a unique and functionally essential telomeric chromatin in embryonic stem cells. Genome Res. 2009; 19:404–14. pmid:19196724
- 53. Santenard A, Ziegler-Birling C, Koch M, Tora L, Bannister AJ, Torres-Padilla M-E. Heterochromatin formation in the mouse embryo requires critical residues of the histone variant H3.3. Nat Cell Biol. 2010; 12:853–62. pmid:20676102
- 54. Drané P, Ouararhni K, Depaux A, Shuaib M, Hamiche A. The death-associated protein DAXX is a novel histone chaperone involved in the replication-independent deposition of H3.3. Genes Dev. 2010; 24:1253–65. pmid:20504901
- 55. Lewis PW, Elsaesser SJ, Noh K-M, Stadler SC, Allis CD. Daxx is an H3.3-specific histone chaperone and cooperates with ATRX in replication-independent chromatin assembly at telomeres. Proceedings of the National Academy of Sciences. 2010; 107:14075–80.
- 56. Elsasser SJ, Noh KM, Diaz N, Allis CD, Banaszynski LA. Histone H3.3 is required for endogenous retroviral element silencing in embryonic stem cells. Nature. 2015; 522:240–4. pmid:25938714
- 57. Banaszynski Laura A, Wen D, Dewell S, Whitcomb Sarah J, Lin M, Diaz N, et al. Hira-Dependent Histone H3.3 Deposition Facilitates PRC2 Recruitment at Developmental Loci in ES Cells. Cell. 2013; 155:107–20. pmid:24074864
- 58. Goldberg AD, Banaszynski LA, Noh K-M, Lewis PW, Elsaesser SJ, Stadler S, et al. Distinct Factors Control Histone Variant H3.3 Localization at Specific Genomic Regions. Cell. 2010; 140:678–91. pmid:20211137
- 59. Law C, Cheung P. Histone Variants and Transcription Regulation. In: Kundu TK, editor. Epigenetics: Development and Disease. Subcellular Biochemistry. 61: Springer Netherlands; 2013. p. 319–41.
- 60. Lowell JE, Cross GA. A variant histone H3 is enriched at telomeres in Trypanosoma brucei. J Cell Sci. 2004; 117:5937–47. pmid:15522895
- 61. Tschudi C, Fau SH, Ullu E. Small interfering RNA-producing loci in the ancient parasitic eukaryote Trypanosoma brucei. BMC Genomics. 2012; 13.
- 62. Zheng LL, Wen YZ, Yang JH, Liao JY, Shao P, Xu H, et al. Comparative transcriptome analysis of small noncoding RNAs in different stages of Trypanosoma brucei. RNA. 2013; 19:863–75. pmid:23704326
- 63. Wen Y-Z, Zheng L-L, Liao J-Y, Wang M-H, Wei Y, Guo X-M, et al. Pseudogene-derived small interference RNAs regulate gene expression in African Trypanosoma brucei. Proceedings of the National Academy of Sciences. 2011; 108:8345–50.
- 64. Zamore PD, Tuschl T, Sharp PA, Bartel DP. RNAi: Double-Stranded RNA Directs the ATP-Dependent Cleavage of mRNA at 21 to 23 Nucleotide Intervals. Cell. 2000; 101:25–33. pmid:10778853
- 65. Elbashir SM, Lendeckel W, Tuschl T. RNA interference is mediated by 21- and 22-nucleotide RNAs. Genes Dev. 2001; 15:188–200. pmid:11157775
- 66. Allen E, Xie Z, Gustafson AM, Carrington JC. microRNA-Directed Phasing during Trans-Acting siRNA Biogenesis in Plants. Cell. 2005; 121:207–21. pmid:15851028
- 67. MacRae IJ, Zhou K, Li F, Repic A, Brooks AN, Cande WZ, et al. Structural Basis for Double-Stranded RNA Processing by Dicer. Science. 2006; 311:195–8. pmid:16410517
- 68. Akiyoshi B, Gull K. Discovery of unconventional kinetochores in kinetoplastids. Cell. 2014; 156:1247–58. pmid:24582333
- 69. Haenni S, Renggli CK, Fragoso CM, Oberle M, Roditi I. The procyclin-associated genes of Trypanosoma brucei are not essential for cyclical transmission by tsetse. Mol Biochem Parasitol. 2006; 150:144–56. pmid:16930740
- 70. Kim HS, Park SH, Gunzl A, Cross GA. MCM-BP is required for repression of life-cycle specific genes transcribed by RNA polymerase I in the mammalian infectious form of Trypanosoma brucei. PLoS One. 2013; 8:e57001. pmid:23451133
- 71. Pena AC, Pimentel MR, Manso H, Vaz-Drago R, Pinto-Neves D, Aresta-Branco F, et al. Trypanosoma brucei histone H1 inhibits RNA polymerase I transcription and is important for parasite fitness in vivo. Mol Microbiol. 2014; 93:645–63. pmid:24946224
- 72. Liniger M, Bodenmüller K, Pays E, Gallati S, Roditi I. Overlapping sense and antisense transcription units in Trypanosoma brucei. Mol Microbiol. 2001; 40:869–78. pmid:11401694
- 73. Vanhamme L, Poelvoorde P, Pays A, Tebabi P, Van Xong H, Pays E. Differential RNA elongation controls the variant surface glycoprotein gene expression sites of Trypanosoma brucei. Mol Microbiol. 2000; 36:328–40. pmid:10792720
- 74. Hertz-Fowler C, Figueiredo LM, Quail MA, Becker M, Jackson A, Bason N, et al. Telomeric expression sites are highly conserved in Trypanosoma brucei. PLoS One. 2008; 3:e3527. pmid:18953401
- 75. Yang X, Figueiredo LM, Espinal A, Okubo E, Li B. RAP1 is essential for silencing telomeric variant surface glycoprotein genes in Trypanosoma brucei. Cell. 2009; 137:99–109. pmid:19345190
- 76. Stanne TM, Kushwaha M, Wand M, Taylor JE, Rudenko G. TbISWI regulates multiple polymerase I (Pol I)-transcribed loci and is present at Pol II transcription boundaries in Trypanosoma brucei. Eukaryot Cell. 2011; 10:964–76. pmid:21571922
- 77. Kelly S, Kramer S, Schwede A, Maini PK, Gull K, Carrington M. Genome organization is a major component of gene expression control in response to stress and during the cell division cycle in trypanosomes. Open Biol. 2012; 2:120033. pmid:22724062
- 78. Schildkraut E, Miller CA, Nickoloff JA. Transcription of a Donor Enhances Its Use during Double-Strand Break-Induced Gene Conversion in Human Cells. Mol Cell Biol. 2006; 26:3098–105. pmid:16581784
- 79. Alsford S, Horn D. RNA polymerase I transcription stimulates homologous recombination in T. brucei. Mol Biochem Parasitol. 2007; 153:
- 80. Hall JPJ, Wang H, Barry JD. Mosaic VSGs and the Scale of Trypanosoma brucei Antigenic Variation. PLoS Pathog. 2013; 9:e1003502. pmid:23853603
- 81. Ito S, Shen L, Dai Q, Wu SC, Collins LB, Swenberg JA, et al. Tet Proteins Can Convert 5-Methylcytosine to 5-Formylcytosine and 5-Carboxylcytosine. Science. 2011; 333:1300–3. pmid:21778364
- 82. Bernards A, van Harten-Loosbroek N, Borst P. Modification of telomeric DNA in Trypanosoma brucei; a role in antigenic variation? Nucleic Acids Research. 1984; 12:4153–70. pmid:6328412
- 83. Haenni S, Studer E, Burkard GS, Roditi I. Bidirectional silencing of RNA polymerase I transcription by a strand switch region in Trypanosoma brucei. Nucleic Acids Res. 2009; 37:5007–18. pmid:19531741
- 84. Mandava V, Fernandez JP, Deng H, Janzen CJ, Hake SB, Cross GA. Histone modifications in Trypanosoma brucei. Mol Biochem Parasitol. 2007; 156:41–50. pmid:17714803
- 85. Anderson BA, Wong IL, Baugh L, Ramasamy G, Myler PJ, Beverley SM. Kinetoplastid-specific histone variant functions are conserved in Leishmania major. Mol Biochem Parasitol. 2013; 191:53–7. pmid:24080031
- 86. Lye LF, Owens K, Shi H, Murta SM, Vieira AC, Turco SJ, et al. Retention and loss of RNA interference pathways in trypanosomatid protozoans. PLoS Pathog. 2010; 6:e1001161. pmid:21060810
- 87. Robinson KA, Beverley SM. Improvements in transfection efficiency and tests of RNA interference (RNAi) approaches in the protozoan parasite Leishmania. Mol Biochem Parasitol. 2003; 128:217–28. pmid:12742588
- 88. Djikeng A, Shi H, Tschudi C, Ullu E. RNA interference in Trypanosoma brucei: cloning of small interfering RNAs provides evidence for retroposon-derived 24-26-nucleotide RNAs. RNA. 2001; 7:1522–30. pmid:11720282
- 89. DiPaolo C, Kieft R, Cross M, Sabatini R. Regulation of trypanosome DNA glycosylation by a SWI2/SNF2-like protein. Mol Cell. 2005; 17:441–51. pmid:15694344
- 90. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012; 9:357–9.
- 91. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009; 25:2078–9. pmid:19505943
- 92. Aslett M, Aurrecoechea C, Berriman M, Brestelli J, Brunk BP, Carrington M, et al. TriTrypDB: a functional genomic resource for the Trypanosomatidae. Nucleic Acids Res. 2010; 38:D457–62. pmid:19843604
- 93. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnetjournal. 2011; 17:10–2.
- 94. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010; 26:841–2. pmid:20110278
- 95. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014; 30:2114–20. pmid:24695404
- 96. Trapnell C, Hendrickson DG, Sauvageau M, Goff L, Rinn JL, Pachter L. Differential analysis of gene regulation at transcript resolution with RNA-seq. Nat Biotech. 2013; 31:46–53.
- 97. Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008; 5:621–8. pmid:18516045
- 98. Roditi I, Schwarz H, Pearson TW, Beecroft RP, Liu MK, Richardson JP, et al. Procyclin gene expression and loss of the variant surface glycoprotein during differentiation of Trypanosoma brucei. J Cell Biol. 1989; 108:737–46. pmid:2645304
- 99. Al Husini N, Kudla P, Ansari A. A role for CF1A 3' end processing complex in promoter-associated transcription. PLoS Genet. 2013; 9:e1003722. pmid:23966880