Leishmania are unicellular eukaryotes responsible for leishmaniasis in humans. Like other trypanosomatids, leishmania regulate protein coding gene expression almost exclusively at the post-transcriptional level with the help of RNA binding proteins (RBPs). Due to the presence of polycystronic transcription units, leishmania do not regulate RNA polymerase II-dependent transcription initiation. Recent evidence suggests that the main control points in gene expression are mRNA degradation and translation. Protein-RNA interactions are involved in every aspect of RNA biology, such as mRNA splicing, polyadenylation, localization, degradation, and translation. A detailed picture of these interactions would likely prove to be highly informative in understanding leishmania biology and virulence. We developed a strategy involving covalent UV cross-linking of RBPs to mRNA in vivo, followed by interactome capture using oligo(dT) magnetic beads to define comprehensively the mRNA interactome of growing L. donovani amastigotes. The protein mass spectrometry analysis of captured proteins identified 79 mRNA interacting proteins which withstood very stringent washing conditions. Strikingly, we found that 49 of these mRNA interacting proteins had no orthologs or homologs in the human genome. Consequently, these may represent high quality candidates for selective drug targeting leading to novel therapeutics. These results show that this unbiased, systematic strategy has the promise to be applicable to study the mRNA interactome during various biological settings such as metabolic changes, stress (low pH environment, oxidative stress and nutrient deprivation) or drug treatment.
Citation: Nandan D, Thomas SA, Nguyen A, Moon K-M, Foster LJ, Reiner NE (2017) Comprehensive Identification of mRNA-Binding Proteins of Leishmania donovani by Interactome Capture. PLoS ONE 12(1): e0170068. https://doi.org/10.1371/journal.pone.0170068
Editor: Thomas Preiss, John Curtin School of Medical Research, AUSTRALIA
Received: August 12, 2016; Accepted: December 28, 2016; Published: January 30, 2017
Copyright: © 2017 Nandan et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The mass spectrometry proteomics data files available from ProteomeXchange Consortium identifier PXD005601.
Funding: This work was funded by Grants MOP – 125879 to NER and 77688 to LJF from the Canadian Institutes for Health Research. The funding agencies had no role in design, collection, analysis, or interpretation of data; neither were they involved in writing the manuscript; or in the decision to submit the manuscript for publication.
Competing interests: The authors have declared that no competing interests exist.
Regulation of gene expression requires an array of coordinated mechanisms that are used by cells to modulate the production of specific gene products. In eukaryotic cells, gene regulation starts at the level of transcription initiation, followed by regulation at the post-transcriptional level. Nascent mRNAs are subject to regulation via RNA processing, transport, localization, translation and degradation. During biosynthesis and after completion of transcription, nascent mRNAs associate with RNA binding proteins to form messenger ribonucleoprotein (mRNP) complexes that regulate most aspects of mRNA metabolism and function. In their dynamic association with mRNP complexes, a repertoire of RBPs plays critical roles in regulating RNA processing, transport, localization, translation and degradation. In fact, there is evidence for the existence of an extensive network of RBPs, which through combinatorial binding affects the post-transcriptional fate of every mRNA in the cell [1–4]. It is becoming increasingly clear that post-transcriptional processes play equally important roles in the output of gene products when compared to transcriptional control. Notably, Leishmania is one genus that has evolved to bypass the regulation of gene expression at the transcription initiation stage .
Leishmania are the causative agents of leishmaniases in humans. Transmitted by sand flies, these infections are responsible for severe morbidity and mortality, worldwide. To date, there are no effective vaccines against these parasites. Moreover, currently available drugs have significant toxicities and over time, these organisms are known to develop drug resistance. Leishmania have a digenetic life-cycle alternating between their mammalian hosts (amastigote stage) and their sandfly vectors (promastigote stage). Adaptation to distinct environments requires activation of specific gene networks. As described above, most eukaryotes activate gene networks by transcriptional mechanisms singularly, or in parallel with post-transcriptional regulatory mechanisms. However, in leishmania and other related trypanosomatids, these responses appear to be accomplished mainly by post-transcriptional mechanisms . In this setting, RNA-binding proteins (RBPs) are emerging as key regulatory factors that aid in post-transcriptional gene regulation. The shift from transcriptional to post-transcriptional control in these organisms appears to be due to an anomalous arrangement of trypanosomatid protein coding genes . In addition to this specialized arrangement, the genes lack canonical promoters . Transcriptional control of gene expression poses a challenge in these organisms as RNA polymerase II transcribes all protein encoding genes at comparable rates . Nevertheless, the final output of mature mRNAs and proteins can vary tremendously even between adjacent genes. One post-transcriptional regulatory mechanism used by leishmania to control gene expression involves the processing of polycistronic pre-mRNAs into mature mRNAs. This RNA processing tool results in the trans-splicing of a capped spliced-leader exon at the 5’-end and a polyadenylation at the 3’-end [9, 10].
The association of RBPs with the 3’-UTRs of mRNAs usually leads to changes in stability, translation, or localization of target mRNAs, and this mechanism seems to be operative in trypanosomatids . Over the past two decades, researchers have identified and characterized numerous RBPs in trypanosomatids, particularly Trypanosoma cruzi and Trypanosoma brucei. Recent advancements in sequencing and annotation of the trypanosomatid genomes have led to various predictions of novel RBPs [12, 13]. These predictions were based on finding classical mRNA structural motifs that serve as ligands for different RBPs . Different transcripts carrying the same signature motifs are likely to be regulated in similar ways . RBPs interact with mRNA motifs via unique functional domains such as RNA-Recognition Motifs (RRM), Zinc Finger domains (ZF) and Pumilio domains (PUM); and these motifs have been found in the RBPs of trypanosomatids . For example, in silico screening of the genomes of Trypanosoma brucei, Trypanosoma cruzi and Leishmania major identified unique zinc finger proteins which are indicative of RNA binding . In addition, computational analysis and bioinformatics have been used to predict and characterize RBPs of trypanosomes . In vitro approaches have been used to pull down proteins bound to specific target 3’-UTR transcripts or, conversely, to pull down RNA transcripts bound to known RNA-binding proteins . However, all of these approaches are limited and cannot provide a comprehensive picture of RBPs in the organisms of interest. Identification of the repertoire of trypanosomatid mRNA interacting proteins is essential for organizing our current molecular and genetic understanding of gene expression and in particular, post-transcriptional gene regulation. Accordingly, it was pertinent to develop an alternative strategy for characterizing RBPs in these pathogenic protozoa.
Recently, as part of an effort to identify mammalian in vivo mRNA-bound proteomes, three independent studies used a combination of UV RNA-protein cross-linking in vivo followed by Poly(A)-RNA pull downs [19–21]. This comprehensive approach is termed ‘‘interactome capture”, and these studies generated an atlas of mammalian mRNA-binding proteins in vivo. Recently, using a genome-wide tethering screen, in which random protein fragments were artificially bound to a reporter mRNA as bait, over 300 proteins were identified as possible mRNA-fate regulators by post-transcriptional mechanisms in T. brucei . Since random protein fragments were artificially bound to a reporter mRNA, the results obtained may not be true for full-length proteins. Subsequently, the tethering screen was extended by the same group using a small-scale library of full-length open reading frames and this resulted in a catalogue of about 100 proteins that may regulate T. brucei mRNA-fate . This same study also reported an mRNA-bound proteome consisting of 155 high-confidence RBPs from bloodstream forms of T. brucei. However, to the best of our knowledge no efforts have been reported to identify comprehensively the repertoire of RBPs in leishmania. We used Leishmania donovani, in its amastigote stage, as a paradigm for this study. We developed the interactome by combining UV cross-linking and oligo(dT) capture to pull down proteins bound to mRNAs in viable L. donovani amastigotes. We show that the in vivo leishmania mRNA interactome consists of at least 79 proteins, 49 of which show no significant homology to identified human RBPs in the database and we discuss resulting insights into the RNA biology of leishmania.
Materials and Methods
Leishmania promastigote culture.
L. donovani Sudan strain S2 promastigotes were routinely cultured in M199 (Sigma-Aldrich) with 10% heat inactivated fetal bovine serum (FBS, Gibco), 20 mM HEPES (Sigma-Aldrich), 6 μg/mL hemin (Sigma-Aldrich), 10 μg/mL folic acid (Sigma-Aldrich), 2 mM L-glutamine (Stemcell), 100 U/mL penicillin/streptomycin (Sigma-Aldrich) and 100 μM adenosine (Sigma-Aldrich) at 26°C. Every 3 days the parasites were subcultured 1:10 in fresh medium and were kept in culture for a maximum of 20–25 passages. In order to maintain virulence and infectivity, fresh parasites were routinely obtained by purification of amastigotes from spleens of 6–8 week infected Syrian Golden hamsters followed by in vitro transformation into promastigotes by culturing for 5–7 days at 26°C in promastigote medium. Prior approval for animal experiments was obtained from the Animal Care Committee of University of British Columbia (approval# A14-00218)
Leishmania axenic amastigote culture.
In vitro culture of amastigotes was carried out as follows: late logarithmic phase cultures of L. donovani promastigotes were transferred from promastigote medium at 26°C and pH 7.4 to RPMI 1640 medium supplemented with 10% fetal bovine serum and incubated at 37°C (5% CO2) for 16–18 h. Subsequently, cells were collected by centrifugation (1200×g at room temperature for 10 min) and resuspended in amastigote medium (RPMI 1640 with 25% fetal bovine serum, 100 μM adenosine and pH was titrated to pH 5.5 with 1M MES) at 37°C in 5% CO2 environment. Three days after initiation of transformation, cells were diluted in fresh amastigote medium and maintained by changing media every 4th day.
UV-cross linking of leishmania amastigotes
Dividing amastigotes (10 x 109) were washed once with cold phosphate buffered saline (PBS) and resuspended in cold PBS to a concentration of 1.7 x 109 per ml. The cell suspension (6 ml) was dispersed into a 100 mm x 20 mm cell culture plate without lid, placed on ice pack (immediately before cross-linking), and UV-irradiated 3 times (1 min each with 2 mins gap on ice) with 0.15 J cm− 2 at 254 nm using Ultra-LUM electronic ultraviolet cross linker at a distance of 15 cm from the UV source. After UV irradiation, cells were collected by centrifugation.
Preparation of parasite protein extracts
Pellets from non irradiated control and irradiated amastigotes were lysed separately in 10 ml of ice cold stringent lysis buffer (20 mM Tris-HCl (pH 7.5), 500 mM LiCl, 0.5% lithium dodecyl sulfate, 1 mM EDTA and 5 mM DTT) containing 1 mM phenylmethylsulfonyl fluoride, 10 μg of aprotinin/ml, 10 μg of leupeptin/ml, and 2 μg of pepstatin A/ml. Pellets were resuspended by pipetting up and down. The suspended samples were then passed at least three times through a syringe with a narrow needle or until the homogenate became clear. The lysates were incubated for 10 min at 4°C.
300 μl of Oligo(dT)25 magnetic beads (New England Biolabs) per sample, previously equilibrated in three volumes of lysis buffer, was added to the lysate and incubated them for 1 h at 4°C with gentle end-over-end rotation. The tubes were placed on a magnet at 4°C for 5 mins to completely capture the magnetic beads. The beads were resuspended in 5 ml of ice-cold lysis buffer and washed for 5 mins at 4°C by gentle end-over-end rotation, and then pelleted with the help of a magnet. Recovered beads were resuspended in 5 ml of ice-cold wash buffer 1 ((20 mM Tris-HCl (pH 7.5), 500 mM LiCl, 0.1% LiDS, 1 mM EDTA and 5 mM DTT) and incubated for 5 mins at 4°C by gentle end-over-end rotation and beads were pelleted with the help of a magnet. 5 ml of ice-cold wash buffer 2 (20 mM Tris-HCl (pH 7.5), 500 mM LiCl and 1 mM EDTA) was added to pelleted beads which were resuspended and incubated for 5 mins at 4°C by gentle end-over-end rotation and beads were pelleted using a magnet. A final wash was given to pelleted beads with low salt buffer (20 mM Tris-HCl (pH 7.5), 200 mM LiCl and 1 mM EDTA). Elution of the mRNA-protein complexes was performed by incubating washed beads with 250 μl/ml no salt elution buffer (20 mM Tris-HCl (pH 7.5) and 1 mM EDTA) for 3 mins at 55°C. RNA concentration was determined using a Nanodrop device.
The RNA quality was assessed by Bioanalyzer chips (Agilent) using buffers and instructions provided by the manufacturer.
RNA digestion and protein concentration
To recover mRNA bound proteins, mRNA-protein complexes were incubated with RNAse A (Fermentas) for 1h at 37°C. The RNAse-treated eluate was then concentrated by transfer to a 2-ml Amicon Ultra 10 3-kDa cutoff device and topped up with buffer 4 (20 mM Tris-HCl (pH 7.5) and 50 mM NaCl).
Preparations of samples for mass spectrometry analysis
Resulting protein solutions were reduced and alkylated as per . and digested with 1μg of trypsin (Promega) at 37°C overnight. Peptide samples were purified by solid phase extraction on C-18 STop And Go Extraction (STAGE) tips , and each treatment was chemically labeled by reductive dimethylation using formaldehyde isotopologues . The labelled peptides were mixed and purified again by C-18 STAGE tips.
HPLC and mass spectrometery anslysis
Final peptides were loaded onto quadrupole–time of flight mass spectrometer (Impact II; Bruker Daltonics) on-line coupled to an Easy nano LC 1000 HPLC (ThermoFisher Scientific) using a 40-50cm long analytical column packed with 1.9 μm-diameter Reprosil-Pur C-18-AQ beads (Dr. Maisch, www.Dr-Maisch.com), fixed on an in-house constructed column heater set at 50°C. Buffer A consisted of 0.1% aqueous formic acid, and buffer B consisted of 0.1% formic acid and 80% acetonitrile in water. Samples were resuspended in buffer A and loaded with the same buffer. Samples were analyzed over 90 min peptide separation and the column was washed with 100% Buffer B for 15 min, and re-equilibrated with Buffer A. The analysis was performed at 0.25 μl/min flow rate and the Impact II was set to acquire in a data-dependent auto-MS/MS mode with inactive focus fragmenting the 20 most abundant ions (one at the time at 18 Hz rate) after each full-range scan from m/z 200 Th to m/z 2000 Th (at 5 Hz rate). The isolation window for MS/MS was 2 to 3 Th depending on parent ion mass to charge ratio and the collision energy ranged from 23 to 65 eV depending on ion mass and charge. Parent ions were then excluded from MS/MS for the next 0.4 min and reconsidered if their intensity increased more than 5 times. Singly charged ions were excluded since in ESI mode peptides usually carry multiple charges. Strict active exclusion was applied. Mass accuracy: error of mass measurement is typically within 5 ppm and is not allowed to exceed 10 ppm. The nano ESI source was operated at 1700 V capillary voltage, 0.20 Bar nano buster pressure, 3 L/min drying gas and 150°C drying temperature.
Mass spectrometry data analysis
Analysis of Mass Spectrometry Data was performed using MaxQuant 184.108.40.206. The search was performed against a database comprised of the protein sequences from Uniprot’s Leishmania donovani (bpk282a1) plus common contaminants using the following parameters: peptide mass accuracy 10 ppm; fragment mass accuracy 0.05 Da; trypsin enzyme specificity, fixed modifications—carbamidomethyl, variable modifications—methionine oxidation and N-acetyl protein. Only those peptides exceeding the individually calculated 99% confidence limit (as opposed to the average limit for the whole experiment) were considered as accurately identified.
The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD005601 
Results and Discussion
In vivo capture of leishmania amastigote mRNA binding proteins
To characterize leishmania amastigote in vivo mRNA interacting proteins, we developed an approach based on UV irradiation of live amastigotes. UV irradiation generates highly reactive, short-lived states of the nucleotide bases within RNAs, resulting in covalent bond formation only with amino acids in direct contact. In order to covalently couple RBPs to mRNAs in a physiological in vivo state, dividing amastigotes were irradiated with UV light at 254 nm. In fact, it has been shown that UV light at 254 nm cross-links the naturally photoreactive nucleotide bases, especially pyrimidines, with amino acids such as Phe, Trp, Tyr, Cys, and Lys [28, 29]. Notably, this method offered several advantages over previous leishmania RBP identification methods: (i) UV irradiation induces covalent bond formation between nucleotide bases within RNAs only with amino acids in direct contact (zero distance] ; however, UV irradiation does not promote protein-protein cross-linking [30, 31], (ii) UV light was directly applied to living amastigotes, thus capturing physiological in vivo RNA-protein interactions, and (iii) mRNA poly(A) tail-oligo(dT) hybridization is very stable in high salt and ionic detergent containing buffers thus allowing efficient removal of non-specific interactions. In order to increase the efficiency of UV cross-linking, approximately 10 x 109 leishmania amastigotes were used as starting material for each analysis. After UV irradiation of living amastigotes at 254 nm, cells were lysed in buffer containing high salt and ionic detergent. Following cell lysis, RBPs covalently bound to polyadenylated RNAs in vivo were captured on oligo(dT) magnetic beads. As a negative control, we included nonirradiated amastigotes cell lysates in the oligo(dT) capture assay. This nucleic acid hybridization based strategy allowed washings of affinity beads using highly stringent biochemical conditions including 500 mM lithium chloride and 0.5% lithium dodecyl sulfate. This protein-denaturing condition during the purification ensured the stringent isolation of proteins in direct contact with leishmania mRNAs through covalent bonds and allowed us to minimize contaminants. Following stringent washes, proteins were released from complexes by RNase treatment and were identified using MS (Fig 1). To test the efficiency of oligo(dT) pull-downs, we determined the amounts of beta-tubulin and glyceraldehyde 3-phosphate dehydrogenase (GAPDH) mRNAs in oligo(dt) pull-down material using RT-qPCR as compared with total RNA. There was a five to six fold enrichment in β-tublin and GAPDH mRNAS, in the oligo(dT) pull downs in comparison to the starting material (S1 Fig). This confirmed that the mRNAs were efficiently captured by the oligo(dT) affinity beads. Enrichment of mRNAs over rRNAs was independently confirmed using a Bioanalyzer Chip (S2 Fig). DNA did not copurify as no PCR amplification of beta-tubulin or GAPDH occurred when samples were RNase treated before the oligo(dT) pull-down or when reverse transcriptase (RT) was omitted from the RT reaction during complementary DNA (cDNA) preparation (data not shown).
Live L. donovani amastigotes were UV irradiated to stabilize mRNA-protein interactions. These complexes were isolated by oligo(dT) magnetic beads capture and stringent washings. Bound protein-mRNA complexes were eluted with RNase treatment and identified by MS.
Identification of leishmania mRNA-bound proteome by mass spectrometry
To identify proteins cross-linked to mRNAs in UV irradiated leishmania, we performed oligo(dT) purifications followed by the release of bound proteins by RNAse treatment. The isolated proteins were concentrated using an Amicon Ultra 10 3-kDa device. The concentrated proteins were separated by SDS-PAGE followed by in-gel digestion with trypsin. Peptides were analyzed by nanoflow high-performance liquid chromatography and online mass spectrometry. By processing the data with MaxQuant 220.127.116.11, we selected proteins which were identified in proteomic analyses of three biological UV cross-linked replicates. In parallel, non-cross-linked mRNA bound samples were analyzed for specificity control. To perform statistical data analysis, protein enrichment in UV irradiated samples, as compared to non-irradiated samples, was assessed by considering intensity values. These values were derived by summing up all the extracted ion current of mass-to-charge values over time associated with the protein using MaxQuant [32, 33] and the results are presented in Supporting Information (SI) S1 Table. For statistical analysis we used one-tailed, paired t-test with the confidence interval of 90%. This analysis resulted in a list of 79 proteins found in three independent cross-linked samples (S1 Table). p-values were corrected using Benjamini-Hockberg procedure . It is likely that the coverage of RBPs for the amastigote form of L. donovani is not complete. This protocol will not detect RBPs that are (i) not interacting with poly (A) RNAs; (ii) not expressed in amastigote form of L. donovani; (iii) not active under the experimental conditions; or (iv) not in sufficient quantity in the starting material used for the analysis. Nevertheless, the mRNA-bound proteome of the amastigote form of L. donovani is comprised of a complex mixture of proteins enriched in RNA-related activities (S2 Table).
Overview of identified mRNA-interacting proteins
We first classified the identified proteins into functional categories based on gene annotation. As expected, ribosomal proteins, RNA helicases, translation factors, and classical RNA-binding proteins were most frequent, making up close to 70% of the identified proteins (S2 Table, Fig 2A & 2B). The low numbers of highly expressed cellular proteins such as metabolic enzymes suggested that the oligo(dT) purification was specific.
A and B. Gene Ontology enrichment analysis of L. donovani mRNA interacting proteins.
In addition to known mRNA-interacting proteins, we identified 19 proteins (S2 Table), that have not previously been annotated as RNA binding. Interestingly, 2 out of 19 seem to have RNA binding motifs (Protein ID: E9BH26 has 2 RNA-Recognition Motifs (RRMs) and protein ID: E9B8L4 has tryptophan-aspartic acid (WD) repeats regions (S2 Table). These findings suggest that the approach we used had the capability to identify RNA-interacting proteins that use distinct or highly divergent RNA-binding domains.
Some of the RBPs we identified are involved in protein translation. Elongation factor 1-alpha (S2 Table) is an abundant multifunctional protein. In addition to its canonical role in translation, it has also been involved in other cellular processes such as nuclear transport, cellular organization, protein quality control, co-translational degradation, mitochondrial tRNA import [35, 36] and macrophage SHP-1 activation . Elongation factor-2 (S2 Table) is involved in translocation of the ribosome along the mRNA strand during translation [38, 39].
The Leishmania genome encodes three poly(A)-binding proteins and we were able to identify all three (Protein ID: E9BHB2, EBSU0 and E9BT55) (S2 Table). We also identified three proteins (Protein ID: E9BE15, E9BM42 and E9BTJ1) that are involved in energy metabolism. These were unexpected results, since these proteins do not possess known RNA binding domains in their structures. However, in yeast and other organisms, metabolic multifunctional enzymes capable of binding to RNA sequences have been previously reported [40, 41]. Indeed, an increasing number of multifunctional enzymes are being described in parasitic protozoa . We identified five helicases (Protein ID: E9B8A4, E9B909, E9BNE6, E9BRR9 and E9BS1) (S2 Table) in the proteome of leishmania mRNA interacting proteins. DEAD-box RNA helicases mediate conformational change in RNA containing complexes and these changes are required to unwind RNA duplexes or disrupt RNA-protein interactions . Two proteins -alpha tubulin and beta tubulin- bound to mRNAs which have no RNA-related gene ontology annotation. However, interestingly this unexpected interaction has been shown in recently published plant mRNA interactome . Moreover, participation of the cytoskeleton in the spatial organization and regulation of translation has recently been suggested . In addition, we also found three molecular chaperones (HSP-70, HSP-83-1 and mitochondrial HSP-60) (S2 Table). RNA-binding roles are not unprecedented for HSPs as mammalian HSP70 is able to bind to the 3’UTR of labile mRNAs . In addition, it is known that human HSP-70 binds to and stabilizes endogenous AU-rich element-containing mRNAs . Moreover, HSP90 is involved in RNA interference in human and in yeast . It is also possible these HSPs are interacting nonspecifically due to their sticky nature.
It was of interest to ask whether any of the leishmania RBPs that we identified had previously been implicated in leishmania pathogenesis. In fact, we found that three of the RBPs (A0A0R4J963): Elongation factor 1-alpha , E9BTN1: Universal minicircle sequence binding protein  and E9BTJ1: Fructose-bisphosphate aldolase  are considered to potentially contribute to the pathogenesis of leishmania infection (S2 Table).
mRNA binding domains of leishmania RBPs
To gain insights into modes of interaction of RBPs with mRNAs, the RBPs we recovered were analyzed for the presence of associated structures like functional domains (S2 Table). For this we performed Pfam and Inter-Pro domain enrichment analyses  which showed that most of the significantly enriched domains were various RNA-interaction motifs (Fig 3). In most eukaryotes, many RBPs interact with mRNAs via modular RNA binding domains (RBDs), including the RNA recognition motif (RRM), heterogeneous nuclear RNP K-homology domain (KH), zinc fingers (Znf) and others. . These motifs have informed in silico algorithms to identify other proteins harboring similar signatures as putative novel RBPs . However, numerous noncanonical RBDs have also been reported [52–55], reflecting limitations in the scope of computational predictions. Analysis of the leishmania interactome showed that about half of the mRNA interacting proteome contained known RBDs. This included several classical and nonclassical RBDs (Fig 3). The RNA-Recognition Motif (RRM) is one of the most commonly found protein domains in nature. Our analysis identified eleven RRM containing proteins (S2 Table, Fig 3). RRMs are most often involved in sequence-specific interactions with single-stranded RNAs. Proteins that contain RRMs are involved in all pathways of the RNA cycle, from splicing to mRNA turnover. RRMs are usually present in multiple copies within a protein and this multiplicity enhances ligand specificity .
Number of proteins harboring (A) classical and (B) non-classical RNA binding motifs in L. donovani mRNA interacting proteome. (C) Consolidated number of RNA binding motifs in 79 identified proteins, including 32 unknown RNA binding domains (RBD unknown).
Another commonly recognized RNA binding domain is the zinc finger domain. This domain was first described as a DNA-binding domain and was subsequently found in proteins that interact with RNA . We found seven in our analysis, four containing CCHC type and three containing C3H1-type Zinc finger motifs (S2 Table, Fig 3). Besides the commonly recognized RNA-binding domains, we also found PUM and hnRNP K-homology (KH) RNA binding domains (Fig 3). The best characterized function of the PUM containing Pumilio protein is as a post-transcriptional repressor . We found two PUM containing proteins in our analysis (S2 Table, Fig 3). hnRNP K-homology domain containing proteins are associated with multiple gene expression regulatory mechanisms, such as transcriptional regulation, and translational control . We found only one hnRNP K-homology domain containing protein in the leishmania interactome (S2 Table, Fig 3).
Comparison with previously known RBPs of leishmanial
To estimate the depth of the mRNA-bound proteome we captured in our oligo(dT) precipitations, we compared the number of identified proteins with previously known RBPs of leishmania. For this purpose, we began by examining the genome of L. donovani for potential RBPs. Recently, the whole genome of L. donovani strain bpk282A1 has been sequenced and annotated. Searching this genome for predicted proteins containing RNA binding domains from the complete predicted proteome of L. donovani resulted in 67 proteins. Interestingly, out of these 67 predicted RBPs, 13 were part of the L. donovani interactome reported here (S3 Table) providing experimental support for their in silico identification as putative RNA binding proteins.
Recently, L. braziliensis HSP70 mRNAs interacting proteins have been reported . We also compared our list of leishmania RBPs with RNA-binding domain containing proteins observed in this in vitro study to identify proteins that interact with L. braziliensis HSP70 mRNAs . In this study the 5’ UTR and the two types of 3’ UTRs from L. braziliensis HSP70 genes were used as bait in pull down assays using total protein parasite extracts. The bound proteins were separated by two-dimensional gel electrophoresis and identified by MS. This study resulted in the identification of 52 different proteins based on their interaction with L. braziliensis HSP70-mRNAs . Out of this list of 52 proteins, 12 were represented in our in vivo proteome of L. donovani amastigote mRNA interacting proteins (S3 Table). The differences in composition of our mRNA interacting proteome and the HSP70 study could be due to in vivo versus in vitro mRNA interactions, different leishmania species or different washing conditions to capture bound proteins. Very recently, the mRNA-bound proteome of the bloodstream forms of T. brucei has been reported . For this proteome, the authors performed UV-crosslinking of bloodstream forms of T. brucei followed by lysing of parasites and oligo (dt) capture of RBPs. Bound proteins were analyzed by mass spectrometry. This analysis resulted in the identification of 155 proteins highly enriched in RNA-related activities. Out of this list of 155 proteins, only 25 were represented in our in vivo proteome of L. donovani amastigotes mRNA interacting proteins (S3 Table). This low level of similarity (approximately 16%) between mRNA interactomes of L. donovani and T. brucei was a bit surprising as both parasites belong to family Trypanosomatidae. These differences in number and composition of mRNA interacting proteins could represent differences in their quite divergent life styles; T. brucei bloodstream being extracellular and leishmania amastigotes living inside phagolysosomes of mononuclear phagocytes with much reduced metabolic activities.
As discussed above, one group of RNA binding proteins is defined by the presence of a CCCH type zinc finger motif that directly binds to RNA . Zinc finger proteins are RNA binding proteins with regulatory functions at all stages of mRNA metabolism. Genome-wide in silico screens for CCCH-type zinc finger proteins of Trypanosoma brucei, Trypanosoma cruzi and Leishmania major resulted in a list of CCCH-type finger proteins [16, 17]. Surprisingly, our proteome analysis did not identify any CCCH motif containing proteins. This lack of congruence could be due to species differences, low levels of expression in the amastigote stage of L. donovani or inactivity under the conditions of the experiment, such as not being efficiently cross-linked upon UV irradiation or possibly to false positive computational predictions. Despite the fact that our analysis did not identify any CCCH motif containing proteins, we were able to identify seven proteins containing C3H-1 type (4/7) and CCHC-type Zinc finger motifs (3/7) (Fig 3).
Identification of mRBPs in the secretome of L. donovani
Recently, we characterized the secretome of L. donovani and this led to the discovery of previously unrecognized exosome-based secretion pathway . Using liquid chromatography-tandem mass spectrometry, we found that the leishmania secretome contained 360 proteins and 188 of them were detected in exosomes . We also showed that leishmania exosomes are involved in the delivery of protein and RNA cargo to host macrophages to promote cell deactivation . Functional annotation of the leishmania secretome revealed the presence of proteins involved in mRNA metabolism and synthesis . It seems likely that the secretion of at least some of these leishmania RBPs may be part of the strategy used by leishmania to survive inside macrophages. One potential mechanism could be that leishmania secretome RBPs compete with host RBPs to skew host gene expression and create a phenotype which is permissive for infection. In fact, leishmania elongation factor 1 alpha and fructose-bisphosphate aldolase detected in the leishmania mRNA interactome (S2 Table) have previously been found in the cytosol of leishmania infected cells [37, 50] and may contribute to macrophage deactivation [37, 50]. Thus, it was of interest to ask how many other identified leishmania RBPs were represented in the leishmania secretome. Notably, from the list of 360 proteins in the leishmania secretome, 43 were represented in our in vivo proteome of L. donovani mRNA interacting proteins (S4 Table, Fig 4 panel A). Remarkably, 32/43 of these proteins were exclusively present in the exosome portion of the secretome (S4 Table, Fig 4 panel B).
(A) Forty three L. donovani mRNA interacting proteins were detected in previously identified L. donovani secretome. (B) Thirty two L. donovani mRNA interacting proteins out of the above 43 proteins were detected in previously identified L. donovani exosome proteome.
In order to search for human orthologs for the leishmania mRNA interacting proteins we identified, BLAST search was performed against the human RBPs database (http://rbpdb.ccbr.utoronto.ca/). The protein list presented in S2 Table shows 32 human orthologs. The remaining 47 leishmania mRNA interacting proteins showed no significant homology to identified human RBPs in the database. This list of 47 leishmania specific mRNA interacting proteins is of particular interest as these proteins are potential leishmania drug targets.
In summary, the findings presented herein identify for the first time the in vivo mRNA bound proteome of living leishmania amastigotes. This work presents a novel picture of the RNA biology of the life cycle stage of L. donovani responsible for causing visceral leishmaniasis in humans. The approach of interactome capture can also be applied to study changes in interactome composition as a function of various biological contexts, such as metabolic changes, low pH environment, oxidative stress, nutrient deprivation or response to drugs. Notably, this study identified a novel group of leishmania RBPs with no orthologs or homologs in the human genome and these may represent high quality candidates for selective drug targeting leading to novel therapeutics.
S1 Table. L. donovani mRNA binding proteins with Max Quant intensity values in non-crosslinked and UV-crosslinked conditions.
S2 Table. L. donovani mRNA binding proteins categorization.
S3 Table. Comparison of our RBPS with published trypanosomatid RBPs.
S4 Table. L. donovani mRNA interacting proteins identified in previously published L. donovani secretome/exosomes.
S1 Fig. Levels of β-tubulin and GAPDH RNAs in oligo(dT) captured material.
Equal amounts of RNA from oligo(dt) captured material and from starting input RNA were analyzed for the levels of GAPDH and β-tubulin RNAs by RT-qPCR (n = 3;±s.d).
S2 Fig. Agilent Bioanalyzer RNA profiles of oligo(dT) captured RNA alongside input RNA.
Equal amounts of input RNAs and oligo (dT) beads captured RNAs were analyzed using Agilent 2100 Bioanalyzer. The middle three prominent peaks in each panel (A-D) represent leishmania ribosomal RNAs.
- Conceptualization: DN NER.
- Data curation: DN NER.
- Formal analysis: DN LJF AN KM.
- Funding acquisition: NER LJF.
- Investigation: DN SAT.
- Methodology: DN NER LJF.
- Project administration: DN NER.
- Supervision: DN NER.
- Validation: DN NER.
- Visualization: DN NER.
- Writing – original draft: DN NER.
- Writing – review & editing: DN NER.
- 1. Hieronymus H, Silver PA. Genome-wide analysis of RNA-protein interactions illustrates specificity of the mRNA export machinery. Nature genetics. 2003 Feb;33(2):155–61. pmid:12524544
- 2. Gerber AP, Herschlag D, Brown PO. Extensive association of functionally and cytotopically related mRNAs with Puf family RNA-binding proteins in yeast. PLoS biology. 2004 Mar;2(3):E79. Pubmed Central PMCID: 368173. pmid:15024427
- 3. Keene JD. Biological clocks and the coordination theory of RNA operons and regulons. Cold Spring Harbor symposia on quantitative biology. 2007;72:157–65. pmid:18419273
- 4. Keene JD. RNA regulons: coordination of post-transcriptional events. Nature reviews Genetics. 2007 Jul;8(7):533–43. pmid:17572691
- 5. Clayton C, Shapira M. Post-transcriptional regulation of gene expression in trypanosomes and leishmanias. Molecular and biochemical parasitology. 2007 Dec;156(2):93–101. pmid:17765983
- 6. Daniels JP, Gull K, Wickstead B. Cell biology of the trypanosome genome. Microbiology and molecular biology reviews: MMBR. 2010 Dec;74(4):552–69. Pubmed Central PMCID: 3008170. pmid:21119017
- 7. El-Sayed NM, Myler PJ, Blandin G, Berriman M, Crabtree J, Aggarwal G, et al. Comparative genomics of trypanosomatid parasitic protozoa. Science. 2005 Jul 15;309(5733):404–9. pmid:16020724
- 8. Ouellette M, Papadopoulou B. Coordinated gene expression by post-transcriptional regulons in African trypanosomes. Journal of biology. 2009;8(11):100. Pubmed Central PMCID: 2804284. pmid:20017896
- 9. Liang XH, Haritan A, Uliel S, Michaeli S. trans and cis splicing in trypanosomatids: mechanism, factors, and regulation. Eukaryotic cell. 2003 Oct;2(5):830–40. Pubmed Central PMCID: 219355. pmid:14555465
- 10. Preusser C, Jae N, Bindereif A. mRNA splicing in trypanosomes. International journal of medical microbiology: IJMM. 2012 Oct;302(4–5):221–4. pmid:22964417
- 11. Fernandez-Moya SM, Estevez AM. Posttranscriptional control and the role of RNA-binding proteins in gene regulation in trypanosomatid protozoan parasites. Wiley interdisciplinary reviews RNA. 2010 Jul-Aug;1(1):34–46. pmid:21956905
- 12. El-Sayed NM, Myler PJ, Bartholomeu DC, Nilsson D, Aggarwal G, Tran AN, et al. The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas disease. Science. 2005 Jul 15;309(5733):409–15. pmid:16020725
- 13. Berriman M, Ghedin E, Hertz-Fowler C, Blandin G, Renauld H, Bartholomeu DC, et al. The genome of the African trypanosome Trypanosoma brucei. Science. 2005 Jul 15;309(5733):416–22. pmid:16020726
- 14. Lunde BM, Moore C, Varani G. RNA-binding proteins: modular design for efficient function. Nature reviews Molecular cell biology. 2007 Jun;8(6):479–90. pmid:17473849
- 15. Najafabadi HS, Lu Z, MacPherson C, Mehta V, Adoue V, Pastinen T, et al. Global identification of conserved post-transcriptional regulatory programs in trypanosomatids. Nucleic acids research. 2013 Oct;41(18):8591–600. Pubmed Central PMCID: 3794602. pmid:23877242
- 16. De Gaudenzi J, Frasch AC, Clayton C. RNA-binding domain proteins in Kinetoplastids: a comparative analysis. Eukaryotic cell. 2005 Dec;4(12):2106–14. Pubmed Central PMCID: 1317496. pmid:16339728
- 17. Kramer S, Kimblin NC, Carrington M. Genome-wide in silico screen for CCCH-type zinc finger proteins of Trypanosoma brucei, Trypanosoma cruzi and Leishmania major. BMC genomics. 2010;11:283. Pubmed Central PMCID: 2873481. pmid:20444260
- 18. Ramirez CA, Dea-Ayuela MA, Gutierrez-Blazquez MD, Bolas-Fernandez F, Requena JM, Puerta CJ. Identification of proteins interacting with HSP70 mRNAs in Leishmania braziliensis. Journal of proteomics. 2013 Dec 6;94:124–37. pmid:24060997
- 19. Baltz AG, Munschauer M, Schwanhausser B, Vasile A, Murakawa Y, Schueler M, et al. The mRNA-bound proteome and its global occupancy profile on protein-coding transcripts. Molecular cell. 2012 Jun 8;46(5):674–90. pmid:22681889
- 20. Castello A, Fischer B, Eichelbaum K, Horos R, Beckmann BM, Strein C, et al. Insights into RNA biology from an atlas of mammalian mRNA-binding proteins. Cell. 2012 Jun 8;149(6):1393–406. pmid:22658674
- 21. Kwon SC, Yi H, Eichelbaum K, Fohr S, Fischer B, You KT, et al. The RNA-binding protein repertoire of embryonic stem cells. Nature structural & molecular biology. 2013 Sep;20(9):1122–30.
- 22. Erben ED, Fadda A, Lueong S, Hoheisel JD, Clayton C. A genome-wide tethering screen reveals novel potential post-transcriptional regulators in Trypanosoma brucei. PLoS pathogens. 2014 Jun;10(6):e1004178. Pubmed Central PMCID: 4055773. pmid:24945722
- 23. Lueong S, Merce C, Fischer B, Hoheisel JD, Erben ED. Gene expression regulatory networks in Trypanosoma brucei: insights into the role of the mRNA-binding proteome. Molecular microbiology. 2016 May;100(3):457–71. pmid:26784394
- 24. Foster LJ, De Hoog CL, Mann M. Unbiased quantitative proteomics of lipid rafts reveals high specificity for signaling factors. Proceedings of the National Academy of Sciences of the United States of America. 2003 May 13;100(10):5813–8. Pubmed Central PMCID: 156283. pmid:12724530
- 25. Ishihama Y, Rappsilber J, Andersen JS, Mann M. Microcolumns with self-assembled particle frits for proteomics. Journal of chromatography A. 2002 Dec 6;979(1–2):233–9. pmid:12498253
- 26. Parker R, Guarna MM, Melathopoulos AP, Moon KM, White R, Huxter E, et al. Correlation of proteome-wide changes with social immunity behaviors provides insight into resistance to the parasitic mite, Varroa destructor, in the honey bee (Apis mellifera). Genome biology. 2012;13(9):R81. Pubmed Central PMCID: 3491398. pmid:23021491
- 27. Vizcaino JA, Csordas A, del-Toro N, Dianes JA, Griss J, Lavidas I, et al. 2016 update of the PRIDE database and its related tools. Nucleic acids research. 2016 Jan 04;44(D1):D447–56. Pubmed Central PMCID: 4702828. pmid:26527722
- 28. Brimacombe R, Stiege W, Kyriatsoulis A, Maly P. Intra-RNA and RNA-protein cross-linking techniques in Escherichia coli ribosomes. Methods in enzymology. 1988;164:287–309. pmid:3071669
- 29. Hockensmith JW, Kubasek WL, Vorachek WR, von Hippel PH. Laser cross-linking of nucleic acids to proteins. Methodology and first applications to the phage T4 DNA replication system. The Journal of biological chemistry. 1986 Mar 15;261(8):3512–8. pmid:3949776
- 30. Pashev IG, Dimitrov SI, Angelov D. Crosslinking proteins to nucleic acids by ultraviolet laser irradiation. Trends in biochemical sciences. 1991 Sep;16(9):323–6. pmid:1835191
- 31. Suchanek M, Radzikowska A, Thiele C. Photo-leucine and photo-methionine allow identification of protein-protein interactions in living cells. Nature methods. 2005 Apr;2(4):261–7. pmid:15782218
- 32. Cox J, Mann M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nature biotechnology. 2008 Dec;26(12):1367–72. pmid:19029910
- 33. Beck S, Michalski A, Raether O, Lubeck M, Kaspar S, Goedecke N, et al. The Impact II, a Very High-Resolution Quadrupole Time-of-Flight Instrument (QTOF) for Deep Shotgun Proteomics. Molecular & cellular proteomics: MCP. 2015 Jul;14(7):2014–29. Pubmed Central PMCID: 4587313.
- 34. Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society Series B (Methodological). 1995;57(1):289–300.
- 35. Mateyak MK, Kinzy TG. eEF1A: thinking outside the ribosome. The Journal of biological chemistry. 2010 Jul 9;285(28):21209–13. Pubmed Central PMCID: 2898402. pmid:20444696
- 36. Mittal N, Subramanian G, Butikofer P, Madhubala R. Unique posttranslational modifications in eukaryotic translation factors and their roles in protozoan parasite viability and pathogenesis. Molecular and biochemical parasitology. 2013 Jan;187(1):21–31. pmid:23201129
- 37. Nandan D, Yi T, Lopez M, Lai C, Reiner NE. Leishmania EF-1alpha activates the Src homology 2 domain containing tyrosine phosphatase SHP-1 leading to macrophage deactivation. The Journal of biological chemistry. 2002 Dec 20;277(51):50190–7. pmid:12384497
- 38. Justice MC, Hsu MJ, Tse B, Ku T, Balkovec J, Schmatz D, et al. Elongation factor 2 as a novel target for selective inhibition of fungal protein synthesis. The Journal of biological chemistry. 1998 Feb 6;273(6):3148–51. pmid:9452424
- 39. Noble CG, Song H. Structural studies of elongation and release factors. Cellular and molecular life sciences: CMLS. 2008 May;65(9):1335–46. pmid:18213444
- 40. Tsvetanova NG, Klass DM, Salzman J, Brown PO. Proteome-wide search reveals unexpected RNA-binding proteins in Saccharomyces cerevisiae. PloS one. 2010;5(9). Pubmed Central PMCID: 2937035.
- 41. Scherrer T, Mittal N, Janga SC, Gerber AP. A screen for RNA-binding proteins in yeast indicates dual functions for many enzymes. PloS one. 2010;5(11):e15499. Pubmed Central PMCID: 2988813. pmid:21124907
- 42. Collingridge PW, Brown RW, Ginger ML. Moonlighting enzymes in parasitic protozoa. Parasitology. 2010 Aug;137(9):1467–75. pmid:20233494
- 43. Yang Q, Jankowsky E. ATP- and ADP-dependent modulation of RNA unwinding and strand annealing activities by the DEAD-box protein DED1. Biochemistry. 2005 Oct 18;44(41):13591–601. pmid:16216083
- 44. Reichel M, Liao Y, Rettel M, Ragan C, Evers M, Alleaume AM, et al. In Planta Determination of the mRNA-Binding Proteome of Arabidopsis Etiolated Seedlings. The Plant cell. 2016 Oct;28(10):2435–52. Pubmed Central PMCID: 5134986. pmid:27729395
- 45. Kim S, Coulombe PA. Emerging role for the cytoskeleton as an organizer and regulator of translation. Nature reviews Molecular cell biology. 2010 Jan;11(1):75–81. pmid:20027187
- 46. Henics T, Nagy E, Oh HJ, Csermely P, von Gabain A, Subjeck JR. Mammalian Hsp70 and Hsp110 proteins bind to RNA motifs involved in mRNA stability. The Journal of biological chemistry. 1999 Jun 11;274(24):17318–24. pmid:10358092
- 47. Kishor A, Tandukar B, Ly YV, Toth EA, Suarez Y, Brewer G, et al. Hsp70 is a novel posttranscriptional regulator of gene expression that binds and stabilizes selected mRNAs containing AU-rich elements. Molecular and cellular biology. 2013 Jan;33(1):71–84. Pubmed Central PMCID: 3536313. pmid:23109422
- 48. Wang Y, Mercier R, Hobman TC, LaPointe P. Regulation of RNA interference by Hsp90 is an evolutionarily conserved process. Biochimica et biophysica acta. 2013 Dec;1833(12):2673–81. pmid:23827255
- 49. Singh R, Purkait B, Abhishek K, Saini S, Das S, Verma S, et al. Universal minicircle sequence binding protein of Leishmania donovani regulates pathogenicity by controlling expression of cytochrome-b. Cell & bioscience. 2016;6:13. Pubmed Central PMCID: 4756535.
- 50. Nandan D, Tran T, Trinh E, Silverman JM, Lopez M. Identification of leishmania fructose-1,6-bisphosphate aldolase as a novel activator of host macrophage Src homology 2 domain containing protein tyrosine phosphatase SHP-1. Biochemical and biophysical research communications. 2007 Dec 21;364(3):601–7. pmid:18028878
- 51. Anantharaman V, Koonin EV, Aravind L. Comparative genomics and evolution of proteins involved in RNA metabolism. Nucleic acids research. 2002 Apr 1;30(7):1427–64. Pubmed Central PMCID: 101826. pmid:11917006
- 52. Lee I, Hong W. RAP—a putative RNA-binding domain. Trends in biochemical sciences. 2004 Nov;29(11):567–70. pmid:15501674
- 53. Niessing D, Huttelmaier S, Zenklusen D, Singer RH, Burley SK. She2p is a novel RNA binding protein with a basic helical hairpin motif. Cell. 2004 Nov 12;119(4):491–502. pmid:15537539
- 54. Zalfa F, Adinolfi S, Napoli I, Kuhn-Holsken E, Urlaub H, Achsel T, et al. Fragile X mental retardation protein (FMRP) binds specifically to the brain cytoplasmic RNAs BC1/BC200 via a novel RNA-binding motif. The Journal of biological chemistry. 2005 Sep 30;280(39):33403–10. pmid:16006558
- 55. Rammelt C, Bilen B, Zavolan M, Keller W. PAPD5, a noncanonical poly(A) polymerase with an unusual RNA-binding motif. Rna. 2011 Sep;17(9):1737–46. Pubmed Central PMCID: 3162338. pmid:21788334
- 56. Brown RS. Zinc finger proteins: getting a grip on RNA. Current opinion in structural biology. 2005 Feb;15(1):94–8. pmid:15718139
- 57. Chritton JJ, Wickens M. Translational repression by PUF proteins in vitro. Rna. 2010 Jun;16(6):1217–25. Pubmed Central PMCID: 2874173. pmid:20427513
- 58. Valverde R, Edwards L, Regan L. Structure and function of KH domains. The FEBS journal. 2008 Jun;275(11):2712–26. pmid:18422648
- 59. Silverman JM, Clos J, de'Oliveira CC, Shirvani O, Fang Y, Wang C, et al. An exosome-based secretion pathway is responsible for protein export from Leishmania and communication with macrophages. Journal of cell science. 2010 Mar 15;123(Pt 6):842–52. pmid:20159964
- 60. Silverman JM, Clos J, Horakova E, Wang AY, Wiesgigl M, Kelly I, et al. Leishmania exosomes modulate innate and adaptive immune responses through effects on monocytes and dendritic cells. Journal of immunology. 2010 Nov 1;185(9):5011–22.
- 61. Silverman JM, Chan SK, Robinson DP, Dwyer DM, Nandan D, Foster LJ, et al. Proteomic analysis of the secretome of Leishmania donovani. Genome biology. 2008;9(2):R35. Pubmed Central PMCID: 2374696. pmid:18282296