The Neurospora crassa mitochondrial tyrosyl-tRNA synthetase (mtTyrRS; CYT-18 protein) evolved a new function as a group I intron splicing factor by acquiring the ability to bind group I intron RNAs and stabilize their catalytically active RNA structure. Previous studies showed: (i) CYT-18 binds group I introns by using both its N-terminal catalytic domain and flexibly attached C-terminal anticodon-binding domain (CTD); and (ii) the catalytic domain binds group I introns specifically via multiple structural adaptations that occurred during or after the divergence of Peziomycotina and Saccharomycotina. However, the function of the CTD and how it contributed to the evolution of splicing activity have been unclear. Here, small angle X-ray scattering analysis of CYT-18 shows that both CTDs of the homodimeric protein extend outward from the catalytic domain, but move inward to bind opposite ends of a group I intron RNA. Biochemical assays show that the isolated CTD of CYT-18 binds RNAs non-specifically, possibly contributing to its interaction with the structurally different ends of the intron RNA. Finally, we find that the yeast mtTyrRS, which diverged from Pezizomycotina fungal mtTyrRSs prior to the evolution of splicing activity, binds group I intron and other RNAs non-specifically via its CTD, but lacks further adaptations needed for group I intron splicing. Our results suggest a scenario of constructive neutral (i.e., pre-adaptive) evolution in which an initial non-specific interaction between the CTD of an ancestral fungal mtTyrRS and a self-splicing group I intron was “fixed” by an intron RNA mutation that resulted in protein-dependent splicing. Once fixed, this interaction could be elaborated by further adaptive mutations in both the catalytic domain and CTD that enabled specific binding of group I introns. Our results highlight a role for non-specific RNA binding in the evolution of RNA-binding proteins.
The acquisition of new modes of post-transcriptional gene regulation played an important role in the evolution of eukaryotes and was achieved by an increase in the number of RNA-binding proteins with new functions. RNA-binding proteins bind directly to double- or single-stranded RNA and regulate many cellular processes. Here, we address how proteins evolve new RNA-binding functions by using as a model system a fungal mitochondrial tyrosyl-tRNA synthetase that evolved to acquire a novel function in splicing group I introns. Group I introns are RNA enzymes (or “ribozymes”) that catalyze their own removal from transcripts, but can become dependent upon proteins to stabilize their active structure. We show that the C-terminal domain of the synthetase is flexibly attached and has high non-specific RNA-binding activity that likely pre-dated the evolution of splicing activity. Our findings suggest an evolutionary scenario in which an initial non-specific interaction between an ancestral synthetase and a self-splicing group I intron was fixed by an intron RNA mutation, thereby making it dependent upon the protein for structural stabilization. The interaction then evolved by the acquisition of adaptive mutations throughout the protein and RNA that increased both the splicing efficiency and its protein-dependence. Our results suggest a general mechanism by which non-specific binding interactions can lead to the evolution of new RNA-binding functions and provide novel insights into splicing and synthetase mechanisms.
Citation: Lamech LT, Mallam AL, Lambowitz AM (2014) Evolution of RNA-Protein Interactions: Non-Specific Binding Led to RNA Splicing Activity of Fungal Mitochondrial Tyrosyl-tRNA Synthetases. PLoS Biol 12(12): e1002028. https://doi.org/10.1371/journal.pbio.1002028
Academic Editor: Daniel Herschlag, Stanford University, United States of America
Received: June 30, 2014; Accepted: November 12, 2014; Published: December 23, 2014
Copyright: © 2014 Lamech et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The authors confirm that all data underlying the findings are fully available without restriction. All relevant data are within the paper and its Supporting Information files.
Funding: This work was supported by the National Institutes of Health grant GM037951. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Abbreviations: aaRS, aminoacyl-tRNA synthetase; C-tail, C-terminal tail; CTD, C-terminal anticodon-binding domain; Dmax, maximum dimension of the particle; mt, mitochondrial; mtTyrRS, mitochondrial tyrosyl-tRNA synthetase; Nc mt LSU intron, Neurospora crassa mitochondrial large subunit rRNA-ΔORF intron; Nc ND1m intron, Neurospora crassa NADH dehydrogenase subunit 1-ΔORF intron; NSD, normalized spatial discrepancy; NTD, N-terminal domain; P(r), distance distribution function; q, momentum transfer; Rg, radius of gyration; SAXS, small angle X-ray scattering; Sc mtTyrRS, Saccharomyces cerevisiae mitochondrial tyrosyl-tRNA synthetase; TEV, tobacco etch virus
RNA-binding proteins play critical roles in post-transcriptional regulation of gene expression in all domains of life . However, the complexity of this regulation is far greater in eukaryotes than in prokaryotes, reflecting both the larger number of RNAs requiring regulation and the evolution of new RNA processing and regulatory mechanisms. The latter include extensive RNA splicing and alternative splicing to produce different protein isoforms; an increased importance of RNA localization in larger and more complex eukaryotic cells; nonsense-mediated decay to prevent translation of intron-containing RNAs; and combinatorial regulation of mRNA translation and stability by RNA-binding proteins and miRNAs acting in ribonucleoprotein complexes –. These new modes of post-transcriptional regulation necessitated and were enabled by corresponding increases in the number and diversity of RNA-binding proteins and the evolution of new RNA-binding functions ,. Thus far, however, the molecular mechanisms underlying the evolution of new RNA-binding functions have remained unclear.
Cellular proteins that adapted to splice autocatalytic group I and group II introns provide powerful model systems for investigating how proteins evolve new RNA-binding functions. Group I and group II introns are found in prokaryotes and in the mitochondrial (mt) and chloroplast DNAs of some eukaryotes, with group I introns also found in the nuclear rRNA genes of certain fungi and protozoa ,. Both types of introns are ribozymes that catalyze their own splicing as well as mobile genetic elements that can be horizontally transferred to different hosts where they propagate by inserting into new genomic sites ,. Although some group I and II introns self-splice in vitro, most have acquired mutations that impair formation of a catalytically active RNA structure, necessitating the recruitment of cellular proteins to promote RNA folding for efficient splicing in vivo ,. These group I and group II intron splicing factors include both host-encoded proteins, such as aminoacyl-tRNA synthetases (aaRSs) and translation factors, and intron-encoded proteins, such as DNA endonuclease and reverse transcriptases, that evolved secondarily to function in RNA splicing . Such co-option of pre-existing proteins to function in splicing is pertinent to the evolution of splicing mechanisms in higher organisms, as emphasized by recent findings that a key spliceosomal protein, Prp8, was derived from a group II intron-like reverse transcriptase ,.
One of the most extensively studied examples of a cellular protein that evolved to function in RNA splicing is the Neurospora crassa mtTyrRS (CYT-18 protein), which acts as a splicing factor for mt group I introns –. Biochemical and structural studies showed that CYT-18 functions in splicing by recognizing and stabilizing the conserved phosphodiester backbone structure of group I intron RNAs –. This splicing function has been found only for those mtTyrRS of fungi belonging to the subphylum Pezizomycotina and can be traced to a series of structural adaptations of the protein that were acquired during or after the divergence of Pezizomycotina from Saccharomycotina .
CYT-18 and other mtTyrRSs are class 1 aaRSs that are closely related to bacterial TyrRSs . They consist of an N-terminal catalytic domain, which binds the acceptor stem of tRNATyr, followed by an intermediate α-helical domain and a C-terminal anticodon-binding domain (CTD), which bind the anticodon and variable arms (Figure 1A and 1B; the catalytic and intermediate α-helical domains together are denoted the N-terminal domains or NTDs). Like its bacterial counterparts, CYT-18 functions as a homodimer, with each dimer binding either one molecule of tRNATyr or group I intron RNA –. CYT-18 binds group I introns by using both its N-terminal catalytic domain and CTD, but only some introns require the CTD for RNA splicing –.
(A) Schematic of a CYT-18 homodimer. CYT-18 consists of N-terminal catalytic and α-helical domains (NTDs, blue) that are connected via a flexible linker to a CTD (red). The protein dimerizes via interactions between the catalytic domains. (B) Domain architecture of the N. crassa (Nc), A. nidulans (An), and S. cerevisiae (Sc) mtTyrRS compared to E. coli (Ec) TyrRS. The mitochondrial targeting sequence (MT) and C-terminal extension (C-tail) are in white, and Pezizomycotina-specific insertions in the NTD and CTD are highlighted with blue and red diagonal stripes, respectively. Insertions in the Sc mtTyrRS relative to E. coli TyrRS are indicated in gray and are not homologous to the Pezizomycotina insertions . (C) CYT-18 protein constructs used in this study. The wild-type CYT-18 protein corresponds to the full-length mature protein lacking the mitochondrial targeting sequence. CYT-18* additionally lacks most of the non-essential C-tail and ends nine residues after Ins 5 (residue 583) to match the short C-tail of the A. nidulans mtTyrRS for which an NMR structure was determined . CYT-18 NTDs contains the N-terminal catalytic and α-helical domains and lacks both the CTD and C-tail. The CTD construct begins with a few residues of the flexible linker (Ins 3) and ends at the same position as CYT-18*.
Both the N-terminal catalytic domain and CTD of Pezizomycotinia mtTyrRSs have distinctive structural adaptations that are absent in non-splicing mtTyrRSs, including the closely related Saccharomycotina mtTyrRSs . These structural adaptations include a small N-terminal α-helical extension (denoted H0) and a series of small insertions (Ins 1–5), whose presence correlates with RNA splicing activity (Figure 1B) ,. The Pezizomycotina mtTyrRS also have a non-essential C-terminal tail of variable length (C-tail; 13–152 amino acids) appended to the CTD ( and this work).
Structural studies, including a co-crystal structure of a splicing-active CYT-18 protein lacking the CTD (here denoted CYT-18 NTDs) bound to a group I intron RNA (the bacteriophage Twort orf142-I2 ribozyme), provided insight into group I intron binding by the N-terminal catalytic domain ,. These studies showed that CYT-18 binds group I introns asymmetrically across the two subunits of the homodimer by using a newly evolved group I intron-binding surface on the side of the catalytic domain opposite that which binds tRNATyr. This new RNA-binding surface includes the N-terminal extension H0, Ins 1, and Ins 2 and provides an extended scaffold for the conserved phosphodiester backbone structure of the group I intron catalytic core.
The CYT-18 constructs used for crystallography lacked the flexibly attached CTD, which has been problematic for X-ray crystallography of TyrRSs –. Recently, we determined an NMR structure of the isolated CTD of the splicing-active Aspergillus nidulans mtTyrRS, which is closely related to CYT-18 . The structure showed that the mtTyrRS CTD resembles those of bacterial TyrRSs in having a fold similar to that of bacterial ribosomal protein S4, but with novel structural features. The latter include three Pezizomycotina-specific insertions (Ins 3–5), with Ins 3 corresponding to an expansion of the flexible linker between the NTDs and CTD. Modeling of the NMR structure onto the CYT-18 NTDs+Twort co-crystal structure using distance constraints from directed hydroxyl-radical cleavage assays suggested that the two CTDs of the homodimeric protein bind opposite ends of a group I intron RNA. This model requires that the CTD of one subunit of the CYT-18 homodimer undergo a large shift on its flexible linker to interact with either tRNATyr or the group I intron RNA bound on opposite sides of the catalytic domain . Thus far, however, there has been no structural data for a CYT-18 protein that contains both the NTDs and CTD, and the role of the CTD in promoting group I intron splicing has remained unclear.
CYT-18 has been used as a model for the theory of constructive neutral evolution (referred to here as “pre-adaptive evolution”). This theory holds that complex multi-protein or RNP complexes arise by a “ratchet-like process” in which a pre-existing neutral or mildly deleterious interaction is “fixed” by a mutation in one partner that makes it dependent upon the other to perform a biological function. Once fixed, this dependence can be further elaborated by adaptive changes in both partners, which increase reaction efficiency and co-dependence –. In the case of CYT-18, this hypothesis suggests that an ancestral non-splicing fungal mtTyrRS had a pre-existing ability to bind group I introns, which became fixed when the intron RNA acquired mutations that impaired self-splicing, resulting in dependence upon the bound protein for structural stabilization . After the interaction was fixed, further adaptive mutations in both the RNA and protein increased both the efficiency of RNA splicing and its protein-dependence. Early studies suggesting that CYT-18 recognized tRNA-like structural features of group I intron RNAs were cited as a prime example of a pre-adaptive interaction leading to the evolution of a new RNA-splicing function ,. However, subsequent findings that CYT-18's N-terminal catalytic domain binds group I introns specifically by using a separate non-tRNA-binding surface , made the nature of the initial non-adaptive interaction unclear.
Here, we used small angle X-ray scattering (SAXS) and biofchemical assays to investigate the solution structures of full-length CYT-18 protein and its CTDs and their mode of interaction with group I intron RNAs. The SAXS analysis shows that the CTDs of both subunits of the CYT-18 homodimer extend outward from the NTDs, but move inward to bind opposite ends of the group I intron RNA. Surprisingly, we find that the CTD of CYT-18 has a high non-specific RNA binding affinity, which may contribute to its interaction with group I intron RNAs, and that the Saccharomyces cerevisiae (yeast) mtTyrRS, which diverged prior to the evolution of splicing activity, can also bind intron RNAs non-specifically via its CTD. Finally, experiments with chimeric proteins show that the yeast CTD can replace CYT-18's to promote aminoacylation but not group I intron splicing. Our results suggest a scenario of pre-adaptive evolution in which the initial non-adaptive interaction between an ancestral mtTyrRS and group I intron RNA was non-specific binding by the CTD and highlight a role for non-specific binding in the evolution of new RNA-binding functions.
SAXS Analysis of CYT-18 and Its CTD
First, we used SAXS to investigate the conformational changes of CYT-18 and the position of its CTDs in the absence and presence of a group I intron RNA. Scattering data were collected for three CYT-18 constructs: CYT-18*, a wild-type protein truncated to delete most of the non-essential C-tail in order to simplify modeling and analysis; CYT-18 NTDs, which contains the N-terminal catalytic and α-helical domains, but lacks both the CTD and C-tail; and CTD, the isolated C-terminal anticodon-binding domain (Figure 1C). CYT-18* is fully active in tyrosyl-adenylation, which measures the number of TyrRS active sites, and it functions similarly to full-length CYT-18 both in aminoacylation of Escherichia coli tRNATyr, a standard substrate for this protein, and in splicing the N. crassa mt large subunit rRNA (Nc mt LSU) intron, which requires a functional CTD (Figure S1). The CYT-18 NTDs construct is also fully active in tyrosyl-adenylation, but cannot aminoacylate tRNATyr as expected because of the lack of the CTD (Figure S1A and S1B) .
Figure 2 shows SAXS curves for all three proteins, and Table 1 summarizes size parameters calculated from the SAXS curves, including the protein molecular weight; the maximum dimension of the particle (Dmax); and the radius of gyration (Rg), which is the root mean square distance to the center of mass of a particle and provides an estimate of the overall particle size . For all three proteins, Kratky plots of the SAXS data show a bell shape curve with a distinct peak, indicative of a folded globular protein (Figure S2).
(A) Scattering curves for the CYT-18 NTDs, CTD, and CYT-18*. The plots show the log of the scattering intensity (I, arbitrary units [a.u.]) as a function of momentum transfer (q = 4πsin(θ)/λ) and are displaced along the y-axis for visualization. The top and middle curves for CYT-18 NTDs and CTD show the SAXS data (black) overlaid with the expected scattering profiles calculated by CRYSOL from the CYT-18 NTDs crystal structure (blue, PDB:1Y42; χ = 1.9) or a CYT-18 CTD homology model based on the A. nidulans mtTyrRS CTD NMR structure (red, PDB:2KTL; χ = 2.8), respectively. The bottom curve for CYT-18* shows the SAXS data (black) overlaid with the calculated scattering profile for the CORAL model of this protein (gray; χ = 1.8). (B) Normalized pair distance distribution functions (P(r)) calculated from the scattering profiles by AUTOGNOM for CYT-18 CTD (red), CYT-18 NTDs (blue), and CYT-18* (gray). The P(r) curves for the CYT-18 NTDs and CTD are single peaks with a short tail, consistent with an elongated protein shape. (C–E) Ab initio models built by DAMMIN with the fit of the SAXS envelope to the corresponding high-resolution structure (CYT-18 NTDs), homology model (CYT-18 CTD), or CORAL model (CYT-18*) superimposed by SUPCOMB shown below the model . The χ values shown in parentheses in the figure indicate the fit of the ab initio models to the scattering data.
Focusing first on the CYT-18 NTDs protein, the scattering curve overlays well (χ = 1.9) with a theoretical scattering curve calculated from the previous CYT-18 NTDs crystal structure  by using the program CRYSOL (Figure 2A, top curve) . The SAXS curve gave an estimated molecular weight of 84.4 kDa and Rg and Dmax values of 35.6 and 123 Å, respectively, in good agreement with the molecular weight calculated from protein sequence (89.6 kDa) and with Rg and Dmax values calculated from the crystal structure using CRYSOL (35.2 and 125 Å, respectively) (Table 1). The distance distribution function P(r) for the CYT-18 NTDs displays a single peak with a tail (Figure 2B), a pattern indicative of a protein having an elongated structure . Ab initio models of the CYT-18 NTDs protein were built from the SAXS data by simulated annealing of either dummy atoms by DAMMIN or a chain-like ensemble of dummy residues by GASBOR (Figures 2C and S3, respectively) ,. The DAMMIN and GASBOR models show good fits to the experimental SAXS curve (χ = 1.8 for both models) and are similar in shape to each other and to the high-resolution structure as shown by the superposition of the crystal structure within the SAXS model envelopes. The final DAMMIN and GASBOR models are the result of analyzing multiple solutions and either averaging the models (DAMMIN) or picking the most representative one (GASBOR). The normalized spatial discrepancy (NSD) value, which describes the similarity between the different models produced by the programs, is low for both the DAMMIN and GASBOR models (0.63±0.03 and 1.10±0.02, respectively), indicating that the multiple solutions built by the programs are similar to each other (Table 2). Taken together, these results indicate that the conformation adopted by CYT-18 NTDs in solution is similar to that in the crystal structure .
The SAXS data for the isolated CTD overlays well with a theoretical scattering curve calculated from a homology model of CYT-18's CTD constructed from the NMR structure of the A. nidulan CTD using I-TASSER (χ = 2.8) (Figure 2A, middle curve) . The molecular weight of 13.3 kDa estimated from the scattering data (Table 1) indicates that the CTD is monomeric in solution. The Rg and Dmax values for the CTD calculated from the SAXS data (17.7 and 62 Å, respectively) are in good agreement with those for the I-TASSER model (17.0 and 55.9 Å, respectively) (Table 1). The DAMMIN and GASBOR models of the CYT-18 CTD (χ = 1.7 and 2.2, respectively) also superpose well with the homology model (Figures 2D and Figure S3, respectively). Thus, the SAXS analysis indicates that the CYT-18 CTD folds independently of the remainder of the protein and that the I-TASSER model provides a good representation of the structure of the CYT-18 CTD in solution. The latter finding validates the use of the I-TASSER model in building high-resolution structures of CYT-18* from the SAXS data (see below).
CYT-18* is the first CYT-18 protein to be investigated structurally that contains both the NTDs and CTD. The molecular weight for this protein estimated from the SAXS data is 119 kDa, confirming that CYT-18* is a dimer in solution (Table 1). The Rg and Dmax values from the CYT-18* scattering data are 46.9 and 170 Å, respectively, both larger than that for the CYT-18 NTDs, as expected. Ab initio models of the CYT-18* homodimer indicate an open conformation with both CTDs extending outward from the NTDs (χ = 1.5 and 2.1 for the DAMMIN and GASBOR models, respectively) (Figures 2E and S3). A rigid-body model of CYT-18* was also built by CORAL, which constructs models that fit the SAXS data by combining high-resolution models of individual components, in this case the crystal structure of the CYT-18 NTDs and the I-TASSER model of the CTD (see above), with different conformations of flexible dummy residue linkers . The CORAL model overlays well with the scattering curve (χ = 1.8) (Figure 2A, bottom curve) and superposes well into the SAXS envelopes of the ab initio models (Figures 2E and S3). These findings indicate that in the absence of intron RNA, CYT-18* adopts an S-shaped configuration with the two CTDs of the homodimer extending outward in opposite orientations.
CYT-18 Forms a Compact Structure When Bound to a Group I Intron RNA
The co-crystal structure of the CYT-18 NTDs bound to Twort intron RNA indicated that the RNA binds asymmetrically across the dimer interface and that the structure of the NTDs does not change substantially upon binding the intron RNA . To elucidate interacting regions and conformational changes of the CTDs upon binding to the intron RNA, we obtained SAXS data for complexes of both the CYT-18 NTDs and CYT-18* bound to the same Twort group I intron RNA. The experimental scattering curve of the CYT-18 NTDs+Twort RNA complex overlays reasonably well with the scattering curve calculated from the co-crystal structure (χ = 4.4) (Figure 3A, top curve), and gave Rg and Dmax values (39.2 and 137 Å, respectively) in agreement with those calculated from the co-crystal structure (39.1 and 134 Å, respectively) (Table 1). Likewise, a rigid-body model of the CYT-18 NTDs+Twort RNA complex built using CORAL shows a good fit to the experimental SAXS curve (χ = 2.2) (Figure 3B; top curve) and is similar in shape to the co-crystal structure (Figures 3A and 3B, compare insets above the top curves). These findings indicate that the CYT-18 NTDs+Twort RNA co-crystal structure is similar to the structure of the complex in solution and can be used as a component for structural modeling of the CYT-18*+Twort RNA complex from the SAXS data.
(A, B) Scattering profiles of the CYT-18 NTDs and CYT-18* bound to the Twort ribozyme displaced along the logarithm axis for visualization. (A) shows CRYSOL fits of the SAXS curves (black) to theoretical scattering curves calculated from the CYT-18 NTDs+Twort co-crystal structure (blue; χ = 4.4) and CYT-18+Twort RNA model structure based on biochemical data  (gray; χ = 8.8). The CYT-18 NTDs+Twort RNA co-crystal structure and CYT-18+Twort RNA biochemical model structure are shown alongside the plots together with the Rg values calculated from the SAXS data and the structures. (B) shows fits of the SAXS curves (black) to the CORAL models of the CYT-18 NTDs+Twort (blue; χ = 2.2) and CYT-18*+Twort (gray; χ = 3.2). The CORAL models are shown alongside the plots together with the radii of gyration (Rg) calculated from the SAXS data and the models. (C, D) CORAL and biochemical models of the CYT-18+Twort RNA complex, respectively. Three views of the models are shown with the CYT-18 NTDs colored blue and the two CTDs of the homodimer colored red. The unmodeled flexible linkers in the biochemical model are shown as dashed lines. The Twort group I intron RNA is depicted in gray cartoon representation with the P4–P6 domain in orange and the P3–P9 domain in cyan. Parts of intron RNA regions P2, P4–P5, P6–P6a, P8, and P9 are near and potentially interact with the CTDs.
Finally, the scattering data for the Twort RNA complex with CYT-18*, which contains both the NTDs and CTD, gave Rg and Dmax values of 41.9 Å and 146 Å, respectively (Figure 3A, bottom curve; Table 1). The relatively small difference between these values and those for the CYT-18 NTDs+Twort complex (Rg and Dmax values of 39.2 Å and 137 Å, respectively; Table 1) suggests that CYT-18* forms a compact complex with the RNA in which the CTDs make a smaller than expected contribution to the overall particle dimensions. This conclusion was supported by ensemble optimization analysis using the program EOM, which generates a large random pool of conformations and picks an optimized ensemble that best fits the scattering data (see Materials and Methods). This optimized ensemble pool displayed a smaller, tighter range of Rg and Dmax values than a random pool of protein-RNA conformations consistent with a compact rigid complex (Figure S4).
A rigid-body model of the CYT-18*+Twort RNA complex built using CORAL indicates that both CTDs are positioned near the Twort intron (Figure 3C). This CORAL model can be compared to a previous model of CYT-18*+Twort built using biochemical data (Figure 3D) . While both the CORAL and biochemical models show that both CTDs are located near the intron RNA, the CORAL model better fits to the scattering data than does the biochemical model (χ = 3.2 and 8.8, respectively). In both models, the CTD of one subunit of the CYT-18 homodimer is close to and may interact with P2, P6–P6a, and P8 of the intron RNA, while the CTD of the other subunit is close to and may interact with P4–P5, and P9 of the intron RNA (Figure 3C and 3D). Considered together, the SAXS analyses indicate that upon binding a group I intron RNA, CYT-18* forms a compact complex in which both CTDs of the CYT-18 homodimer clamp down to interact with opposite ends of the group I intron RNA.
The CYT-18 CTD Is a Non-specific RNA Binding Domain
To investigate how the RNA-binding properties of CYT-18's CTD enable it to interact with the two structurally distinct ends of a group I intron RNA, we analyzed the interaction of the CYT-18 NTDs and the isolated CTD with various RNAs by equilibrium-binding assays at 25°C and 37°C (Figures 4 and S5, respectively). The RNAs compared were three group I introns (the N. crassa mt large ribosomal subunit-ΔORF intron (Nc mt LSU); the N. crassa NADH dehydrogenase subunit 1-ΔORF intron (Nc ND1m); and the Twort intron RNA); a group II intron RNA (Lactococcus lactis Ll.LtrB-ΔORF; Ll.LtrB); and poly(U)30, which presumably lacks higher-order structure.
Binding assays of CYT-18 NTDs (blue) or CTD (red) to the (A–C) N. crassa mt LSU (Nc mt LSU), N. crassa ND1m (Nc ND1m), and Twort ribozyme group I intron RNAs, respectively; (D) L. lactis Ll.LtrB group II intron RNA; and (E) poly(U)30. For the binding assays, 32P-labeled RNAs were incubated with increasing concentrations of protein at 25°C for 30 min, a time verified to be sufficient to reach equilibrium (see Materials and Methods). The plots show the fraction of RNA bound to a nitrocellulose filter as a function of protein concentration with the CYT-18 NTDs-group I intron binding data fit to hyperbolic curves and the CYT-18 NTDs-Ll.LtrB and all CYT-18 CTD-RNA binding data fit to sigmoidal curves. Dissociation constant (Kd) or K1/2 values and Hill coefficient (n) are shown in (A–E) and are the mean for three experiments with the error bars indicating the standard deviation. Equilibrium-binding assays for the same proteins and RNAs at 37°C are shown in Figure S4.
The binding curves for the CYT-18 NTDs to the Nc mt LSU, Nc ND1m, and Twort group I intron RNAs were best fit by hyperbolic functions with Kds ranging from 200 to 590 nM at 25°C (Figure 4A–4C) and 440 to 740 nM at 37°C (Figure S5A–S5C). The Kd values for the Nc ND1m intron are substantially higher than that calculated from previous koff measurements, which assumed that the kon of the construct lacking the CTD is the same as that for the wild-type protein (71±24 pM) . This difference suggests that the CTD might make a major contribution to kon by mediating the initial interaction with intron RNA substrates. At both temperatures, the strongest binding group I intron RNA was the Nc ND1m intron and the weakest was the Nc mt LSU intron, consistent with previous findings that the CTD is required for tight binding and splicing of the Nc mt LSU, but not the Nc ND1m intron . At 25°C, the CYT-18 NTDs bound the Ll.LtrB group II intron RNA with a K1/2 = 240 nM, within the range of Kds for group I intron RNAs, but the binding curve was sigmoidal, with n = 1.6, suggesting cooperative and possibly non-specific binding (Figure 4D), whereas at 37°C, binding of the group II intron RNA was weaker (Kd = 440 nM) and the binding curve was hyperbolic (Figure S5D). The CYT-18 NTDs did not bind appreciably to poly(U)30 at either 25°C or 37°C (Figures 4E and S5E).
Surprisingly, the isolated CYT-18 CTD bound group I and II intron RNAs and poly(U)30 more strongly than did the CYT-18 NTDs and with similar affinities for all five RNAs tested (Figure 4). Indeed, the binding curves for the isolated CTD to these radically different RNAs were remarkably similar to each other, each being sigmoidal with K1/2s = 57–64 nM at 25°C and 51–77 nM at 37°C. These sigmoidal binding curves (i.e., Hill coefficients (n)>1) suggest cooperative binding of two or more CTDs to each RNA. The ability of the isolated CTD to bind similarly to group I and II intron RNAs, as well as unstructured poly(U)30 indicates that it is a non-specific RNA binding domain.
The Non-splicing S. cerevisiae mtTyrRS Binds Group I Introns Non-specifically Via Its CTD
The finding that the CYT-18 CTD is a non-specific RNA-binding domain led us to wonder whether an ancestral Pezizomycotina mtTyrRS might have initially bound group I intron RNAs non-specifically. To address this question, we turned to the S. cerevisiae (Sc) mtTyrRS, which is closely related to CYT-18 but branched from the Pezizomycotina mtTyrRSs prior to the evolution of splicing activity . We compared two constructs that were expressed in E. coli: recombinant wild-type Sc mtTyrRS and a derivative lacking the CTD (Sc NTDs). We confirmed that both proteins are fully active in tyrosyl-adenylation, indicating correct folding of the catalytic domain (Figure S1A). Aminoacylation assays showed that the Sc mtTyrRS has higher activity with E. coli tRNATyr than does CYT-18 (Figure S1B), possibly reflecting that the E. coli tRNATyr and the Sc mt tRNATyr are more similar to each other than to the Nc mt tRNATyr. All three tyrosyl-tRNAs share the same N73 identity element (A73) and anticodon, but differ in the N1-N72 identity element at the end of the acceptor stem (G-C in E. coli tRNATyr and Sc mt tRNATyr, but A-U in Nc mt tRNATyr) and the length of the variable arm (13–14 nt in E. coli tRNATyr and Sc mt tRNATyr, but unusually long at 16 nt in Nc mt tRNATyr) (Figure S6) –.
Equilibrium binding assays showed that the Sc mtTyrRS, although incapable of splicing group I intron RNAs , can bind both the Nc mt LSU group I intron RNA and Ll.LtrB group II intron RNA with Kds = 430 and 440 nM, respectively (Figure 5A and 5B), within the range of Kds for specific binding of CYT-18 NTDs to group I intron RNAs (see above). However, the similar affinity of the Sc mtTyrRS for the group I and group II intron RNAs suggests that this binding is non-specific. Strikingly, the ability of the Sc mtTyrRS to bind group I and II intron RNAs was entirely dependent upon its CTD, with the Sc NTDs protein showing no detectable binding of either intron RNA over the concentration range tested (Figure 5A and 5B).
(A, B) Binding of Sc mtTyrRS (orange) and Sc NTDs (green) to the Nc mt LSU group I intron and L. lactis Ll.LtrB group II intron, respectively. (C) Binding of the CTD of the yeast mtTyrRS to the Nc mt LSU (blue), Nc ND1m (black), and Twort group I introns (pink); the Ll.LtrB group II intron (green); and poly(U)30 (purple). Binding assays were done at 25°C, as described in Materials and Methods. Kd or K1/2 values and Hill coefficients (n) are shown in boxes in (A–C) and are the mean for three experiments, with the error bars indicating the standard deviations. The Sc NTDs protein showed no detectable binding (n.b.) over the protein concentration range tested.
To investigate if the Sc mtTyrRS CTD is a non-specific RNA-binding domain like CYT-18's CTD, we expressed and purified this domain separately including a small segment of the upstream linker region (denoted Sc CTD). We then assayed equilibrium binding of the Sc CTD to three group I introns (Nc mt LSU, Nc ND1m and Twort), a group II intron (Ll.LtrB), and poly(U)30 at 25°C. These assays showed that the Sc CTD is capable of binding all the RNAs tested with Kd or K1/2 values ranging from 110 nM to 1 µM (Figure 5C). The Sc CTD had the highest affinity for the Ll.LtrB group II intron RNA (K1/2 = 110 nM), which it bound in a cooperative manner (n = 1.8), and the lowest affinity for poly(U)30 (K1/2 = 1 µM; n = 1.6). Notably, for all RNAs tested, the non-specific binding by the isolated Sc mtTyrRS CTD is 1.5- to 15-fold weaker than the binding of CYT-18's CTD to the same RNA (K1/2s at 25°C = 57–64 nM) (Figure 4). Together, these findings indicate that the CTD of the Sc mtTyrRS is also a non-specific RNA-binding domain and that the Sc mtTyrRS binds both group I and group II intron RNAs non-specifically via its CTD.
Splicing Properties of CYT-18/Sc mtTyrRS Chimeric Proteins
Finally, we investigated whether the Sc CTD could replace the CYT-18 CTD to promote group I intron splicing by making CYT-18/Sc mtTyrRS chimeric proteins. Two chimeric constructs were made differing in whether they contain the flexible linker region from CYT-18 or the Sc mtTyrRS (Figure 6). Chimera 1 consists of the CYT-18 catalytic and α-helical domains fused to the Sc CTD via the Sc mtTyrRS linker region, whereas chimera 2 contains the same CYT-18 NTDs fused to the Sc CTD via the CYT-18 linker, which includes Ins 3. Both chimeric proteins showed tyrosyl-adenylation activity similar to wild-type CYT-18, but displayed differences in aminoacylation activity (Figure 7A and 7B). Chimera 1, which contains both the Sc mtTyrRS linker and CTD, is similar to the yeast Sc mtTyrRS in having higher aminoacylation activity with E. coli tRNATyr than does CYT-18, likely due to its better recognition of the bacterial tRNA (see above). By contrast, chimera 2, which contains the CYT-18 linker followed by the yeast mtTyrRS CTD, has substantially lower aminoacylation activity than CYT-18, indicating that the CYT-18 linker impairs charging of E. coli tRNATyr. The findings for chimera 1 indicate that higher TyrRS activity with E. coli tRNATyr correlates with presence of the yeast CTD and linker, regions of the TyrRS that recognize the tRNA variable arm and anticodon stem, and not with the catalytic domain, which recognizes the acceptor stem ,,. The Sc mtTyrRS linker may contribute to the recognition of E. coli tRNATyr, either by contacting the tRNA directly or by facilitating binding of the CTD to the variable arm and/or anticodon stem.
CYT-18 and Sc mtTyrRS C-terminal regions starting from the end of the α-helical domain were aligned to the chimeric proteins using the multiple sequence alignment tool, MUSCLE . The CYT-18 sequence is in black, and the Sc mtTyrRS sequence is in green, with regions corresponding to the CYT-18 CTD insertions in red boxes. The secondary structure of the CYT-18 CTD predicted by I-TASSER is shown above the CYT-18 CTD sequence as either black helices or arrows representing β-sheets. The first residues of the CYT-18 and Sc CTD constructs are highlighted in orange, and the last residue of the CYT-18* construct highlighted in blue.
Chimera 1 consists of the CYT-18 NTDs fused to the Sc mtTyrRS linker region and CTD, while chimera 2 consists of the CYT-18 NTDs and linker region (including Ins 3) fused to Sc CTD. Tyrosyl-adenylation, aminoacylation, and RNA-splicing assays were done at 30°C, as described in Materials and Methods. (A) Tyrosyl-adenylation activity is displayed as a bar graph showing the mean for three experiments, with the error bars indicating the standard deviation. (B) Aminoacylation assay showing the formation of [3H]-Tyr-tRNATyr over a 60-min time course (black open circles, wild-type CYT-18; purple open squares, chimera 1; pink open diamonds, chimera 2; black closed circles, no protein). (C, D) End-point splicing assays of 32P-labeled precursor RNAs (200 nM) containing the Nc mt LSU and ND1m group I introns, respectively, with 100 nM protein and 1 mM GTP for 60 min at 30°C in splicing reaction medium containing 25 or 100 mM KCl. Additional end-point splicing assays with 32P-labeled precursor RNA at these protein and RNA concentrations at 25°C and 37°C are shown in Figure S7A and S7B. (E, F) Splicing time courses of 32P-labeled precursor RNA containing the ND1m intron (200 nM) with 100 nM protein and 1 nM GTP at 30°C in splicing reaction medium containing 25 mM or 100 mM KCl, respectively. The plots show disappearance of precursor RNA as a function of time (black open circles, wild-type CYT-18; purple open squares, chimera 1; pink open diamonds, chimera 2). (G) End-point splicing assays with unlabeled precursor RNA containing 200 nM Nc mt LSU intron and [α-32P]GTP (500 nM; 3,000 Ci mmol−1) with 500 nM protein. Reactions were incubated for 60 min at 30°C and 37°C. Darker exposures of the same gels are shown below. Minor labeled products in the CYT-18 lanes, including one co-migrating with precursor RNA and others migrating above precursor RNA (not shown), appear in time-course experiments after the 5′-labeled intron products and likely reflect secondary reactions catalyzed by group I intron RNAs (see ). The left panel shows splicing reactions for the same concentrations of wild-type CYT-18 protein and 32P-labeled precursor RNA at 30°C run in parallel as a control. Abbreviations: E1-E2, ligated exons; E1-I, 5′ exon+intron; I, excised intron; I-E2, intron+3′ exon; P, precursor RNA.
To determine whether the Sc mtTyrRS CTD could function in splicing, we compared the ability of the chimeric proteins to splice the Nc mt LSU and ND1m group I introns, which do and do not require the CTD for splicing, respectively . Group I introns splice via two sequential transesterification reactions initiated by the addition of guanosine nucleotide to the 5′ end of the intron, resulting in ligated exons and excised linear intron RNA with a non-coded G residue at its 5′ end . The ability of the chimeric proteins to splice the Nc mt LSU and ND1m group I introns was assayed by using 200 nM 32P-labeled precursor RNA containing the introns, 100 nM protein, and unlabeled GTP at three different temperatures (25°C, 30°C, and 37°C) at either 100 mM KCl (the standard condition for CYT-18) or a lower salt concentration, 25 mM KCl (Figures 7C–7F, S7A, and S7B). The assays showed that the chimeric proteins could splice the Nc ND1m intron, which does not require the CTD, but could not splice the Nc mt LSU intron, which requires the CTD, under all conditions examined. The inability of the chimeric proteins to splice the Nc mt LSU intron was additionally confirmed by splicing assays done with higher protein concentration (500 nM; protein excess conditions) to compensate for potentially weaker binding of the intron RNA by the Sc CTD (Figure S7C), and by using a more sensitive assay in which [α-32P]GTP is incubated with unlabeled precursor RNA to label the 5′ end of the intron RNA during the first step of splicing (Figures 7G and S7D).
Notably, although the chimeric proteins were capable of splicing the ND1m intron (Figures 7D and S7B), they did so at a slower rate than full-length CYT-18, with the rate decreasing further at higher salt conditions (Figure 7E and 7F). The slower rate of ND1m intron splicing by the chimeric proteins is similar to that found previously for the CYT-18 NTDs alone , suggesting that it reflects a lack of contributing but nonessential interactions with the CTD. Interestingly, chimera 2 has higher splicing activity with the ND1m intron than does chimera 1, the reverse of what was found for TyrRS activity (Figure 7B). This finding suggests that the longer CYT-18-linker region, which is present in chimera 2 but not chimera 1, contributes to splicing activity. This contribution could involve either specific or non-specific interaction of Ins 3 with the group I intron RNA or increased conformational flexibility of the CTD due to expansion of the linker. Considered together, the findings for chimera 1 and chimera 2 indicate that although both the Sc mtTyrRS and CYT-18 CTDs bind group I intron RNAs non-specifically, the Sc CTD lacks further adaptations required for group I intron splicing activity.
Our results provide insight into the function of CYT-18's CTD and its contribution to the evolution of group I intron splicing activity, highlighting a role for non-specific binding interactions in the evolution of new RNA-binding functions. First, the SAXS analysis indicates that the CTDs of both subunits of the CYT-18 homodimer have a preferred orientation in solution extending outward in opposite directions from the NTDs, but move inward to bind opposite ends of a group I intron RNA. The CORAL model of CYT-18* bound to Twort intron RNA based on the SAXS data suggests that the CTD of one subunit binds the intron near P2, P6–P6a, and P8, while the CTD of the other subunit likely interacts with P4–P5 and P9 (Figure 3C). These interaction sites agree with a previous biochemical model based on directed hydroxyl-radical cleavage assays in which Fe-EPD with a cleavage radius of 25 Å was conjugated at two sites in the CTD (G493C and C494) . These assays found cleavage sites in P6–P6a, P3–P8, and P5 in the Nc ND1m intron and P2, P4, and P6–P6a in the Nc mt LSU intron ,. The additional cleavages in the Nc mt LSU intron P2 helix, which is considerably longer than P2 of the Twort or Nc ND1 introns, are consistent with its proximity to P8. The putative interaction sites between the CTDs and intron RNA in our CORAL model are also consistent with genetic experiments showing that CTD binding can suppress intron RNA mutations that impair long-range tertiary interactions P5-L9 and P2-L8 on opposite ends of the intron RNA .
The relatively fixed orientation of the CTDs in the free CYT-18 protein agrees with previous 15N-1H- two-dimensional NMR analysis showing that the CTDs of the full-length A. nidulans mt and Geobacillus stearothermophilus TyrRSs do not tumble independently in solution . Nevertheless, the linker must be sufficiently flexible to allow the CTDs to bind to group I introns or tRNATyr on opposite sides of the catalytic domain, and the SAXS analysis provides the first direct evidence for this conformational flexibility by showing the two CTDs of the homodimer swing downward from their starting position in the free protein to interact with different regions of a group I intron RNA.
We were surprised to find that the CTDs of both CYT-18 and the non-splicing yeast mtTyrRS are non-specific RNA-binding domains. The isolated CTDs of both proteins bind structured group I and group II intron RNAs, or the simple homopolymer, poly(U)30 with similar affinities, with this non-specific binding 1.5- to 15-fold stronger for the CYT-18 CTD than the yeast mtTyrRS CTD (see Results). The non-specific RNA-binding activity of the TyrRS CTDs may contribute to its function in aminoacylation by augmenting its specific-binding interactions with tRNATyr, which include recognition of the variable arm and anticodon bases ,. Likewise, the high non-specific binding activity of the CYT-18 CTD does not preclude and may bolster specific-binding interactions of this domain with group I intron RNAs. The latter could result either from further adaptive evolution of the CTD or simply from positioning of the CTD on the intron RNA via specific binding of the NTDs.
Although non-specific RNA binding was unexpected for an aaRS domain involved in tRNA recognition, yeast and higher eukaryotic aaRSs have been shown previously to have appended non-specific RNA-binding domains that are not present in their bacterial counterparts and contribute to aminoacylation efficiency. Thus, the yeast glutaminyl-tRNA synthetase (GlnRS) has an N-terminal non-specific RNA binding domain, which when fused to a bacterial GlnRS enabled it to functionally replace the yeast enzyme in vivo, as did fusion of the yeast Arc1 protein, a non-specific RNA-binding protein that ordinarily helps mediate tRNA/aaRS interactions in trans ,. Similarly, some higher eukaryotic aaRSs have tandem repeats of a small non-specific RNA-binding motif that enhances tRNA binding . These non-specific RNA-binding domains are thought to act by adding sufficient binding energy to compensate for relatively weak specific binding interactions of aaRSs with tRNA substrates, similar to the augmentation of specific binding of tRNA and intron RNA substrates suggested above for the TyrRS CTD.
Notably, the ribosomal protein S4-like fold, which forms the core of bacterial and mitochondrial TyrRS CTDs, has been identified previously as an ancient RNA-binding domain. This domain is found in all three kingdoms of life in a variety of proteins that bind structurally different RNAs, including two families of pseudouridine synthetases, a family of predicted RNA methylases, an RNA-modification enzyme with both pseudouridine synthetase and cytidine deaminase activity, threonyl-tRNA synthetases, and a heat-shock protein –. The S4-like fold consists of two α-helices arranged as a helical hairpin packed against three or four β-sheets. Connecting two of the β-sheets is a characteristic L-shaped loop, which together with the two α-helices is termed the αL motif. This motif generally contains clusters of basic and polar residues that are capable of interacting with various nucleic acid substrates in the different S4-like fold containing proteins. In TyrRSs, the αL motif interacts in a region between the variable and anticodon arms ,. We suggest that the inherently high non-specific RNA-binding affinity of the S4-like fold was the key factor enabling it to evolve interactions with different RNA substrates in the course of evolution. Indeed, the fungal mtTyrRSs provide a dramatic example of a case in which the S4-like fold of a single enzyme may bolster specific-binding interactions with three different regions of two different RNA substrates, a mt tRNATyr and a group I intron RNA.
Although we suggest that the non-specific binding of the CTD played a key role in initial interaction with group I intron RNAs, the CTDs of present-day fungal mtTyrRS appear to have evolved specific interactions with group I intron RNAs. Thus chimeric proteins containing the CYT-18 NTDs linked to the yeast CTD can efficiently aminoacylate E. coli tRNATyr, as well as splice the Nc ND1 intron, which requires only the NTDs . However, the chimeric proteins splice the Nc ND1m intron less efficiently than full-length CYT-18 at a rate expected for loss of contributing CTD interactions, and they are unable to splice the Nc mt LSU intron, which requires the CTD . Additional adaptations of the CYT-18 CTD required to promote splicing may include RNA-binding contacts by Ins 3–5, which are found in the CTDs of splicing-competent Pezizomycotina mtTyrRS, but not in the Sc mtTyrRS . Both the previous biochemical model of CYT-18*+Twort  and the new CORAL model based on the SAXS data (Figure 3C) place Ins 4 and 5 in position to bind group I intron RNAs.
Since the discovery of the splicing function of CYT-18 , there have been numerous additional examples of aaRSs that have acquired new functions unrelated to translation, in most cases via addition of non-catalytic domains ,. The acquisition of these new domains and functions is thought to reflect that aaRS are ancient essential enzymes whose presence early in evolution of the cell provided a robust scaffold for the addition of new structural elements . In archael and eukaryotic TyrRSs, the N-terminal catalytic domain is followed by a different anticodon-binding domain, known as the C-W/Y domain, which is homologous to the anticodon-binding domain of TrpRSs . Two additional structural elements were acquired during the evolution of higher eukaryotes and function in receptor-mediated signaling pathways associated with angiogenesis: the ELR motif in the catalytic domain and a C-terminal EMAP II-like domain, which has non-specific RNA-binding properties ,–. The ELR motif is on the intron-binding side of the catalytic domain  and incorporated in the same α-helix as Ins1 in the fungal mtTyrRS, suggesting that this region may be a particularly robust location for insertion of new functional elements.
Finally, our results provide evidence that non-specific binding can play a key and perhaps widespread role in pre-adaptive interactions that lead to the evolution of new RNA-binding functions of proteins. For the group I intron splicing activity of fungal mtTyrRSs, our findings suggest a scenario outlined in Figure 8 in which an initial non-specific interaction between the CTD of an ancestral mtTyrRS and a group I intron RNA was fixed by an intron RNA mutation that made formation of active ribozyme structure dependent upon interaction with the protein. After the interaction was fixed, the mtTyrRS and group I intron were forced to co-evolve, with further adaptive mutations in the protein leading to specific binding of both the catalytic domain and CTD to the intron RNA. These specific-binding interactions extended the intron RNA-binding surface, both increasing the efficiency of splicing and permitting additional mutations in the intron RNA that made it more dependent upon the protein for structural stabilization. RNA-editing enzymes such as APOBEC1, which evolved from enzymes that acted on mononucleotide substrates, may be additional examples of constructive neutral evolution in which a relatively non-specific pre-adaptive interaction with an RNA substrate was fixed by a deleterious mutation, in this case one that could be corrected by RNA editing, and then elaborated by further adaptive mutations ,. Indeed, a similar evolutionary pathway may have been used more generally for other RNA-modification enzymes, including the ones mentioned above that contain an S4-like non-specific RNA-binding domain.
(A) A Pezizomycotina ancestor contained a self-splicing group I intron and an mtTyrRS, which functions in aminoacylation by specifically binding tRNATyr. (B) The ancestral mtTyrRS could bind non-specifically to the group I intron and other RNAs via its CTD, and its interaction with a group I intron RNA was fixed by an intron RNA mutation (orange star) that made it dependent upon the binding of the mtTyrRs for formation or stabilization of the active ribozyme structure. (C) Once fixed, this initial non-specific interaction was elaborated by further adaptive mutations in both the protein (orange spheres) and intron RNA (orange stars) that increased both the efficiency and protein-dependence of RNA splicing. The adaptive mutations in the fungal mtTyrRSs resulted in specific binding of the conserved tertiary structure of the group I intron RNA catalytic core, conferring the ability to bind and splice additional group I introns (blue, pink).
Beyond the initial pre-adaptive phase, the extensive structural data for the interaction of fungal mtTyrRSs with group I intron RNAs provide strong evidence for a ratchet-like process in which multiple adaptive mutations, including six different Peziomycotina-specific insertions, led to the evolution of an efficient splicing apparatus for group I introns. It is highly unlikely that the multiple adaptive mutations in the protein leading to an extensive group I intron-binding surface occurred in one step. The surprising finding that the structural adaptations of the mtTyrRS catalytic domain utilized a non-tRNA-binding surface could reflect that the tRNA-binding site in the catalytic domain could not be easily modified to function in group I intron splicing without inhibiting mtTyrRS activity, which is essential in an obligate aerobe. Additionally, the non-tRNA-binding side of the catalytic domain may have had a pre-existing auxiliary RNA-binding function, as found for some aaRSs ,. By contrast to the catalytic domain, the regions of the CTD needed for splicing activity overlap tRNA-binding regions requiring co-evolution with both the intron RNA and mt tRNATyr. Indeed, the unusually long variable arm of Pezizomycotina mt tRNATyrs (see Results) (Figure S6) may be an example of a feature that co-evolved with the CTD to allow it to better accommodate group I intron RNAs . We also note that although the initial interaction of an ancestral fungal mtTyrRS likely involved a single group I intron RNA, perhaps the mt LSU intron, which is dependent upon the mtTyrRS for splicing in all Pezizomycotina fungi examined , the fungal mtTyrRSs ultimately evolved to function in splicing multiple group I introns by recognizing the conserved phosphodiester backbone structure of the catalytic core. This binding mode has the evolutionary advantages of enabling the fungal mtTyrRSs to coordinate the splicing of multiple group I introns as well as the ability to accommodate new group I introns that invade genomes as mobile genetic elements.
Materials and Methods
Recombinant plasmids used for protein expression in E. coli are derivatives of the phage T7 promoter-driven expression vectors pET3a, pET11a, or pET11d (EMD Millipore). pEX560, which expresses a wild-type CYT-18 protein (amino acids 33–669), contains the cyt-18 ORF (nucleotides 97–2,010) cloned downstream of the T7 promoter in pET3a . pCYT18/ΔC-tail, which expresses CYT-18* (C-terminal truncation of the non-essential C-tail; amino acids 584–669), was derived from pEX560 by introducing three stop codons (TAATAGTAG) after Leu583 by site-directed mutagenesis (QuikChange; Agilent Technologies). pHISTEV602 expresses the CYT-18 NTDs (C-terminal truncation of both the CTD and C-tail; amino acids 424–669), with an N-terminal tobacco etch virus (TEV) protease-cleavable 6× His-tag. It was constructed by PCR of pEX560 using primers that amplify nucleotides 97–1,251 of the CYT-18 ORF and append NcoI and BamHI sites, and then cloning the resulting PCR product between the NcoI and BamHI sites of pET11d. pCYT18-CTD, which expresses the CYT-18 CTD (amino acids 448–583) with an N-terminal TEV-cleavable 6× HIS-tag, was constructed by PCR of pEX560 using primers that amplify nucleotides 1,342–1,749 of the CYT-18 ORF and append NdeI and BamHI sites, and then cloning the resulting PCR product between the NdeI and BamHI sites of pET11a. All CYT-18 expression constructs lack the mt targeting sequence (amino acids 1–32). Wild-type CYT-18 and CYT-18* have an extra N-terminal methionine, while CYT-18 NTDs and CTD have an extra N-terminal glycine resulting from TEV-protease cleavage of the N-terminal 6× His-tag.
pHISTEVScTyrRS, which expresses the full-length mature S. cerevisiae mtTyrRS with an N-terminal TEV-cleavable 6× HIS-tag, contains Sc mtTyrRS codons 38–492 (lacking the mt target sequence; amino acids 1–37) cloned between the Nco1 and BamHI sites of pET11d . pHISTEVSc/ΔCTD expresses Sc mtTyrRS lacking the CTD (denoted Sc NTDs) and was derived from pHISTEVScTyrRS by using site-directed mutagenesis to add three stop codons (TAATAATAA) after Asp400. pMAL-ScCTD, which expresses the Sc mtTyrRS CTD (denoted Sc CTD), contains Sc CTD codons 414–492 cloned between the BamHI and HindIII sites of pMAL-c2t , a derivative of plasmid pMAL-c2x (New England Biolabs) that expresses the protein with an N-terminal maltose-binding protein tag followed by a TEV-protease site.
Chimeric proteins containing the N-terminal catalytic domain of CYT-18 and the CTD of the Sc mtTyrRS were made by overlap PCR. Chimera 1 contains the CYT-18 NTDs (amino acids 33–417) fused to the Sc mtTyrRS flexible linker and CTD (amino acids 397–492). Chimera 2 contains the CYT-18 NTDs and linker including Ins 3 (amino acids 33–451) fused to the Sc mtTyrRS CTD (amino acids 416–492). The chimeric protein ORFs were cloned between the BamHI and HindIII sites of pMAL-c2t (see above), enabling the expression of fusion proteins with an N-terminal TEV-protease cleavable maltose-binding protein tag.
Recombinant plasmids used for in vitro transcription contain group I or II introns cloned downstream of a phage T3 or T7 promoter. pBD5a contains the N. crassa mt large subunit rRNA-ΔORF (Nc mt LSU) intron cloned downstream of a T3 promoter in pBS(+) . Transcription of pBD5a linearized with BanI yields a 503-nt RNA containing a 65-nt 5′ exon, the 388-nt mt LSU intron, and a 50-nt 3′-exon. pND1m contains the N. crassa NADH dehydrogenase subunit 1-ΔORF (Nc ND1m) intron cloned downstream of a T7 promoter in pUC18 . Transcription of pND1m linearized with NdeI yields a 209-nt RNA containing a 6-nt 5′ exon, the 196-nt ND1 intron, and a 7-nt 3′ exon. pTWORT-P2 contains a ribozyme derivative of a group I intron of the Staphylococcus aureus bacteriophage Twort orf142 gene (intron nucleotides 9-250) cloned downstream of a T7 promoter in pUC19 . Transcription of pTWORT-P2 linearized with EarI yields a 242-nt transcript of the Twort ribozyme. pSSltrBΔA contains a derivative of the L. lactis Ll.LtrB-ΔORF intron with a deletion of the branch-point nucleotide to prevent splicing during binding assays cloned downstream of a T7 promoter in pUC19 . Transcription of a DNA template made by PCR of the pSSltrBΔA plasmid (forward primer 5′-ATGAATTCTAATACGACTCACTATAGGGTTATAATTATCCTTACACATCCATAAC and reverse primer 5′-CGCTGCAGAATTGATATCAAAAATGATATG) yields an 807-nt RNA containing a 28-nt 5′ exon, the 749-nt intron, and a 30-nt 3′ exon.
Protein Expression and Purification
Proteins were expressed from the recombinant plasmids indicated above in E. coli HMS174(DE3) (CYT-18, CYT-18*, and CYT-18 NTDs); BL21(DE3) (CYT-18 CTD, chimera 1, and chimera 2); or Rosetta 2(DE3) (EMD Millipore) (Sc mtTyrRS, Sc NTDs, and Sc CTD). Overnight cultures of fresh transformants were inoculated into LB media, and the proteins expressed via auto-induction . Cells expressing CYT-18 and CYT-18* were grown at 35°C overnight with shaking at 260 rpm. Cells expressing all other proteins were grown at 37°C for 4 h then shifted to 25°C overnight with shaking at 260 rpm.
Wild-type CYT-18 and CYT-18* were purified as described ,. Briefly, cells were lysed by incubation with lysozyme at 1 mg/ml for 30 min followed by polyethyleneimine precipitation to remove nucleic acids, and ammonium sulfate precipitation . The ammonium sulfate pellet was dissolved in 500 mM KCl, 25 mM Tris-HCl (pH 7.5) and then dialyzed overnight in 25 mM KCl, 25 mM Tris-HCl (pH 7.5). The protein was purified from the dialysate by using a HiTrap SP XL cation exchange column (GE Healthcare Life Sciences), followed by a size-exclusion column (HiLoad 16/60 Superdex 200; GE Healthcare Life Sciences) .
The 6× HIS-tagged proteins CYT-18 NTDs, CYT-18 CTD, Sc mtTyrRS, and Sc NTDs were purified similarly, except that the ammonium sulfate pellet was dissolved in 500 mM KCl, 25 mM Tris-HCl (pH 7.5), and 30 mM imidazole, and the proteins were purified by nickel-affinity chromatography using a HisTrap HP column (GE Healthcare Life Sciences) , followed by TEV protease-cleavage of the 6× HIS-tag in dialysis buffer (500 mM KCl, 25 mM Tris-HCl [pH 7.5], 5 mM DTT) to remove imidazole. The proteins were then further purified by an additional round of nickel-affinity chromatography, followed by size-exclusion chromatography (HiLoad 16/60 Superdex 200; GE Healthcare Life Sciences).
The maltose-binding protein (MalE) fusions MalE-ScCTD, MalE-chimera 1, and MalE-chimera 2 were purified by polyethyleneimine precipitation of nucleic acids, as described above for CYT-18, and then loaded onto an amylose affinity column (New England Biolabs) in buffer containing 25 mM Tris-HCl (pH 7.5), 500 mM KCl, 1 mM DTT, 1 mM EDTA, and 10% glycerol followed by elution with 10 mM maltose in the same buffer. The proteins were further purified using a heparin-sepharose column (HiTrap heparin HP column; GE Healthcare Life Sciences) in 300 mM KCl, 25 mM Tris-HCl (pH 7.5), 1 mM DTT, and 1 mM EDTA and eluted with a salt gradient of 300 mM to 1.5 M KCl in the same buffer. The final purification step was size-exclusion chromatography (HiLoad 16/60 Superdex 200; GE Healthcare Life Sciences) in 25 mM Tris-HCl (pH 7.5), 200 mM KCl, and 10% glycerol.
Proteins used for SAXS were stored in buffer containing 100 mM KCl, 5 mM MgCl2, 10 mM Tris-HCl (pH 7.5), 5% glycerol at −80°C. Proteins used for biochemical assays were dialyzed into 100 mM KCl, 25 mM Tris-HCl (pH 7.5), and 50% glycerol and stored at −80°C. Protein yields ranged from 14 to 44 mg/l (monomer concentrations), and all proteins were >99% pure as judged by SDS-polyacrylamide gels stained with Coomassie blue. Protein concentrations were determined by measuring A280 under denaturing conditions (6 M guanidine hydrochloride). Concentrations of wild-type CYT-18, the Sc mtTyrRS, and C-terminal truncations of these proteins refer to the homodimer, while CTD concentrations refer to the monomer.
Preparation of RNA Substrates for SAXS and Biochemical Assays
Intron-containing RNA substrates for SAXS and biochemical assays were transcribed from the linearized recombinant plasmids indicated above. The Twort intron for SAXS was synthesized by large-scale in vitro transcription reactions (10–30 ml) with T7 polymerase at 37°C in reaction buffer containing 40 mM Tris-HCl (pH 8.1), 1 mM spermidine, 10 mM DTT, 8 mM NTPs, and 15 mM MgCl2. Transcription reactions were incubated at 37°C for 8 h and terminated by adding 50 mM EDTA followed by extraction with phenol-chloroform-isoamyl alcohol (25∶24∶1; phenol-CIA). The RNA was then purified through a 5-ml HiTrap desalting column (GE Healthcare Life Sciences) and a size exclusion column (HiLoad 16/60 Superdex 200; GE Healthcare Life Sciences). T7 RNA polymerase for the large-scale transcriptions was expressed with an N-terminal 6× HIS-tag from pRC9 and purified as described .
32P-labeled Nc mt LSU, Nc ND1m, and Twort RNAs for equilibrium-binding assays were synthesized by using a MAXIscript transcription kit (Life Technologies), with the concentration of unlabeled UTP changed from that recommended in the manufacturer's protocol (0.5 mM) to 10 µM UTP to obtain higher specific activity transcripts (300 Ci/mmol). The Ll.LtrB group II intron was synthesized by using a mutant T7 polymerase that can read through a T7 polymerase transcription termination site within the intron  in reaction medium containing 40 mM Tris-HCl (pH 7.9), 6 mM MgCl2, 10 mM DTT, 2 mM spermidine, 1 mM GTP, 1 mM CTP, 1 mM ATP, 250 nM UTP, and 1 µM [α-32P]UTP (3,000 Ci mmol−1; Perkin Elmer). After transcription and DNase treatments (MAXIscript transcription kit; Life Technologies), transcripts were purified by extraction with phenol-CIA, followed by gel filtration through two consecutive 1-ml Sephadex G-50 columns (Sigma-Aldrich).
Intron-containing RNA substrates for splicing reactions were transcribed from linearized DNA template using a MEGAscript transcription kit (Life Technologies) with 1 µCi [α-32P]UTP (3,000 Ci mmol−1; PerkinElmer) added for standard 32P-labeled substrates and 3 µCi [α-32P]UTP (3,000 Ci mmol−1) added for higher specific activity subtrates (Figures 7C, S7A, and S7C). The Nc mt LSU intron substrate was synthesized by in vitro transcription of pBD5a (BanI digested) using a MEGAscript T3 kit, while the Nc ND1m intron substrate was synthesized by in vitro transcription of pND1m (NdeI digested) using a MEGAscript T7 kit. The intron RNAs were purified as described above.
The poly(U)30 oligonucleotide used for binding assays was synthesized and HPLC-purified by Integrated DNA Technologies. The oligonucleotide was dissolved in 10 mM HEPES (pH 7.5), 1 mM EDTA and stored at a concentration of 25 µM. For equilibrium-binding assays, 25 pmoles of the oligonucleotide was 5′-end labeled with [γ-32P]ATP (3,000 Ci mmol−1; PerkinElmer) using T4 kinase (New England Biolabs) and then purified by phenol-CIA extraction followed by desalting through a Sephadex G-25 column.
SAXS Data Collection and Analysis
Proteins and RNAs used for SAXS analysis were prepared as described above. RNA-protein complexes were formed by mixing protein dimer and RNA at a 1∶1 molar ratio in 1.2 ml of 100 mM KCl, 5 mM MgCl2, 10 mM Tris-HCl (pH 7.5), and 5% glycerol. After incubation at room temperature for 15 min, RNP complexes were purified by size-exclusion chromatography (Hi-Load 16/60 Superdex 200 column; GE Healthcare Life Sciences) in the same buffer. RNP complexes and proteins for SAXS were concentrated by using Amicon Ultra-4 centrifugal filter units (EMD Millipore) and frozen for storage at −80°C. The size-exclusion chromatography column buffer was used as a solvent blank for SAXS.
SAXS data were collected on beamline 12-ID-C at the Advanced Photon Source (Argonne, Illinois). Each sample had 20 1-s exposures taken at a sample-to-detector distance of 2.0 m, covering a momentum transfer range of 0.007<q<0.35 Å−1. Samples were continuously passed through the beam using a flow-cell to minimize radiation damage. The 20 consecutive exposures were compared and showed no change in scattering intensity, indicating no radiation damage. Radially averaged scattering data were buffer subtracted and analyzed by using ATSAS  and IGOR-Pro (WaveMetrics). Scattering curves were displayed as the scattering intensity (I(q)) as a function of momentum transfer q = (4πsinθ)/λ, where λ is the wavelength of the incident X-ray beam and θ is half the angle between the incident and scattering radiation. SAXS data were obtained for least three different concentrations of each protein and checked for aggregation and interparticle interference by examination of the Guinier region . Guinier plots (log(I(q)) versus log(q)) were checked for linearity in the Guinier region, a diagnostic of sample quality. For globular proteins, the Guinier approximation is valid for qRg<1.3. The q range used for SAXS analysis was 0.015<q<0.3 Å−1 for CYT-18 protein constructs and 0.02<q<0.3 Å−1 for CYT-18+Twort complexes. The I(0) (extrapolated forward scattering at zero angle) and Rg (radius of gyration) were evaluated using the Guinier approximation for scattering intensity (I(q)) according to the equation:I(0) and Rg were also computed from the scattering curve by using the indirect Fourier transform program AUTOGNOM, which additionally provides an estimate of the maximum particle dimension (Dmax) from the distance distribution function P(r) . The Rg values determined by using the Guinier approximation were consistent with those determined by AUTOGNOM. Molecular weights were calculated by comparing the extrapolated forward scattering at zero angle, I(0), with that of a protein standard, bovine serum albumin (BSA), by using the equation:where MMp and MMst are the molecular weights of the protein sample and protein standard, respectively, cp and cst are their concentrations in g/l, and I(0)p and I(0)st are the forward scattering intensities of the protein and standard, respectively. Agreement with the calculated molecular weights of the samples indicates sample quality and monodispersity . Experimental scattering curves were compared with theoretical scattering curves calculated by the program CRYSOL (for qmax<0.3) from the crystal structures of those macromolecules with known atomic structures (CYT-18 NTDs, CYT-18 NTDs+Twort, CYT-18 CTD homology model) .
SAXS Ab Initio Shape Reconstructions
Ab initio shape reconstructions were done by using DAMMIN (for qmax<8/Rg) and GASBOR, which use simulated annealing methods to build low resolution protein models from dummy atoms or residues, respectively ,. The program DAMMIN uses dummy atoms packed into a sphere with the beads determined to be either protein or solvent. The final DAMMIN model was obtained by using the DAMAVER program suite to align ten models from independent DAMMIN runs and produce an average model. The latter was further refined by using DAMMIN to produce the final model . GASBOR represents the protein as a chain-like ensemble of dummy residues equal to the number of residues in the protein. The final GASBOR model was chosen as the one with the lowest NSD value after running DAMSEL to compare ten models from independent GASBOR runs . No symmetry was specified for the building of CYT-18* or CYT-18 CTD ab initio models, while P2 symmetry was specified for the CYT-18 NTDs models based on prior knowledge from the CYT-18 NTDs crystal structure. DAMMIN and GASBOR produced similar models of CYT-18* with or without P2 symmetry enforced.
Rigid-body models of CYT-18* by itself and of CYT-18* and the CYT-18 NTDs bound to Twort RNA were built by using the program CORAL . This program employs a simulated annealing method to place high resolution models of individual components in orientations that minimize the discrepancy between the calculated SAXS profile and the experimental SAXS data, with distances between the structured components constrained by randomized dummy residue linkers chosen from a generated library of non-clashing loop structures. To build models of CYT-18*, a homology model of the CYT-18 CTD (amino acids 448–583) was generated by I-TASSER , using the A. nidulans CTD NMR structure (PDB:2KTL) as a template for modeling . The confidence (C-score) and TM-scores of the CYT-18 CTD homology model, which are indicators of model quality, are high at 0.98 and 0.85, respectively. This CYT-18 CTD model and available high-resolution crystal structures for CYT-18 NTDs+Twort RNA (PDB:2RKJ) were used for rigid-body modeling. The final CORAL models were chosen from among ten independently derived models based on the best fit to the experimental scattering data as indicated by a low χ value .
Ensemble optimization analysis to characterize the flexibility of CYT-18*+Twort system was conducted by using the program, EOM ,. This program generates a random pool of 10,000 structures and creates an optimized ensemble from this pool, such that the average scattering pattern of the ensemble fits the experimental SAXS data. Comparison of the shape of the Rg and Dmax distributions of the optimized ensemble with those of the random pool provides information about the size and flexibility of the structure, with a broad peak resembling that of the random pool suggesting a flexible, extended structure and a peak narrower than the random pool suggesting a more rigid structure.
Tyrosyl-Adenylation and TyrRS Assays
Tyrosyl-adenylation assays were done by incubating 100 nM protein in a 50-µl reaction containing 5 mM ATP, 100 mM KCl, 10 mM MgCl2, 144 mM Tris-HCl (pH 7.5), 2 mM DTT, 0.1 mg/ml BSA (New England Biolabs), 0.1 unit of yeast inorganic phosphatase (New England Biolabs), and 5 µCi of L-[3,5-3H]-tyrosine (53 Ci mmol−1; Amersham Biosciences Corp.) . Reactions were initiated by adding protein and incubated at 30°C for 10 min. Reactions were terminated by adding 1 ml of reaction medium and immediately filtering through a nitrocellulose membrane to trap protein bound tyrosyl-adenylate. Radioactivity was measured by Beckman Coulter LS 6500 scintillation counter using Ready Protein scintillation cocktail (Beckman).
Aminoacylation assays were done as described previously with protein concentrations normalized to tyrosyl-adenylation activity . Reactions of 120 µl contained 100 nM protein and 6 µM E. coli tRNATyr (Sigma-Aldrich) in 100 mM KCl, 15 mM MgCl2, 50 mM Tris-HCl (pH 7.5), 5 mM ATP, and 10 mM L-tyrosine (a 1∶10 mixture of L-[3,5-3H]-tyrosine and unlabeled L-tyrosine). Reactions were initiated by adding protein and incubated at 30°C. For time courses, 20-µl portions were removed after times ranging from 2 to 60 min, and the reaction was terminated by precipitation with 0.8 ml of a solution containing 10% trichloroacetic acid and 20 mM sodium pyrophosphate. Reactions were filtered through Whatman 3 MM filter paper to collect the precipitates, and the filters were washed three times with 1 ml of a solution containing 5% trichloroacetic acid and 20 mM sodium pyrophosphate followed by 2 ml of 95% ethanol. The filters were then dried and quantified by using a Beckman Coulter LS 6500 scintillation counter as above.
32P-labeled RNAs (5 pM; 300 Ci/mmol) were incubated with increasing concentrations of protein in a 50-µl reaction containing 100 mM KCl, 5 mM MgCl2, 20 mM Tris-HCl (pH 7.5), 5 mM DTT, 0.1 mg/ml BSA, and 10% glycerol at either 25°C (Figures 4 and 5) or 37°C (Figure S4). Binding reactions were initiated by adding 10 µl protein and terminated after 30 min by filtering 10 µl of the reaction through a nitrocellulose membrane (Amersham Hybond ECL nitrocellulose; GE Healthcare Life Sciences) backed by a nylon membrane (Amersham Hybond-N+; GE Healthcare Life Sciences). The nitrocellulose membrane retains protein-bound RNA and the nylon membrane retains free RNA. The end point (30 min) was chosen after determining that incubations times of 20, 30, or 60 min gave indistinguishable results for all proteins assayed. After application of samples, the membranes were washed three times with 20-µl wash buffer containing 100 mM KCl, 5 mM MgCl2, and 20 mM Tris-HCl (pH 7.5), then dried and quantified using a PhosphorImager and the program ImageQuant (GE Healthcare Life Sciences).
RNA Splicing Assays
Splicing time courses for the Nc mt LSU and Nc ND1m introns were done by pre-incubating 32P-labeled precursor intron RNA (50 or 200 nM; 0.13–0.4 Ci mmol−1) with protein (25 or 100 nM) in a 100-µl reaction containing 100 mM KCl, 5 mM MgCl2, 20 mM Tris-HCl (pH 7.5), 1 mM DTT, 0.1 mg/ml BSA, and 10% glycerol for 10 min on ice, followed by 5 min at reaction temperature. Reactions were initiated by adding 1 mM GTP-Mg2+. Portions (8 µl) were removed at different times, and the reaction terminated by adding 50 mM EDTA, followed by phenol-CIA extraction and mixing 10 µl of sample with 10 µl of 2× gel loading dye (95% formamide, 0.02% SDS, 0.02% bromophenol blue, 0.01% xylene cyanol, and 1 mM EDTA). End-point splicing assays were done similarly in reaction medium containing 25 or 100 mM KCl for 60 min. Splicing assays comparing wild-type CYT-18 and chimera CYT-18/Sc mtTyrRS proteins were also done with higher concentrations of 32P-labeled precursor RNA (200 nM, 0.13–0.4 Ci mmol−1) and protein (100 nM or 500 nM dimer) and with 200 nM unlabeled precursor, 500 nM protein, and 500 nM [α-32P]GTP (3,000 Ci mmol−1; PerkinElmer). In all cases, samples were analyzed by electrophoresis in a denaturing 4% polyacrylamide gel, which was dried and quantified with a PhosphorImager, and data were analyzed by using ImageQuant TL.
Excel spreadsheet containing the numerical data and statistical analysis for Figures 2A, 2B, 3A, 3B, 4A–4E, 5A–5C, 7A, 7B, 7E, 7F, S1A–S1D, S2, S4A, S4B, and S5A–S5E.
Characterization of the CYT-18* construct. (A) Tyrosyl-adenylation activity of wild-type CYT-18, CYT-18*, CYT-18 NTDs, Sc mtTyrRS, and the Sc NTDs. Assays were done at 30°C, as described in Materials and Methods. The bar graphs show the mean of three experiments, with the error bars indicating the standard deviation. All CYT-18 and S. cerevisiae constructs synthesize tyrosyl-adenylate, which remains bound at the active site with a stoichiometry of close to one molecule of tyrosyl-adenylate per protein homodimer, as expected for fully active proteins. (B) Aminoacylation assays of CYT-18 and S. cerevisiae protein constructs. The plots show the formation of [3H]-Tyr-tRNATyr synthesized over time at 30°C (black open circles, wild-type CYT-18; gray open squares, CYT-18*; blue open diamonds, CYT-18 NTDs; orange open triangles, Sc mtTyrRS; green open diamonds, Sc NTDs; black closed circles, no protein control). The assays were done in triplicate with the error bars indicating the standard deviation. (C, D) Splicing activity of wild-type CYT-18 (black open circles) and CYT-18* (gray open squares) with the Nc mt LSU intron at 30°C and 37°C, respectively. Assays were done with 50 nM 32P-labeled precursor RNA, 25 nM protein, as described in Materials and Methods. The disappearance of unspliced precursor RNA is plotted over a time period of 60 min. The assays were done in triplicate with the error bars indicating the standard deviation.
Kratky plots of CYT-18 protein constructs. The scattering data for (A) CYT-18 NTDs, (B) CYT-18 CTD, and (C) CYT-18* are plotted as q2×I versus q, where I is the scattering intensity and q is the scattering angle (q = 4πsin(θ)/λ). The Kratky plots are normalized to the maximum q2×I value and show a bell-shape curve with a distinct peak, indicative of a folded globular protein.
Ab initio models of CYT-18 proteins built using GASBOR. CYT-18 NTDs (blue), CTD (red), and CYT-18* (gray) models built by GASBOR, a simulated annealing program which uses a chain-like ensemble of dummy residues . The dummy residue representations are shown above, and the low-resolution SAXS envelopes of the models fit with the CYT-18 NTDs high-resolution structure, the CYT-18 CTD homology model, and the CYT-18* CORAL model using SUPCOMB are shown below. χ alues shown in parentheses indicate the fit of the ab initio models to the experimental scattering data. The GASBOR model shown had the lowest NSD among ten calculated models (Table 2).
Size distributions of CYT-18*+Twort optimized ensembles built by EOM. (A) Radius of gyration (Rg) and (B) maximum dimension (Dmax) distributions of an optimized ensemble that best describes the experimental scattering data (gray) compared to those of a random pool of conformations (black). The optimized ensemble is selected from the random pool, which consists of 10,000 randomly generated structures. The narrower peak for the size and shape distributions and smaller values of Rg and Dmax for the optimized ensemble compared to the random pool, suggests that CYT-18*+Twort is a rigid, compact complex.
Equilibrium-binding assays of CYT-18 deletion mutants to various RNAs at 37°C. Binding assays of CYT-18 NTDs (blue) and CTD (red) to the (A–C) N. crassa mt LSU (Nc mt LSU), N. crassa ND1m (Nc ND1m), and Twort ribozyme group I intron RNAs; (D) L. lactis Ll.LtrB group II intron RNA; and (E) poly(U)30. The binding assays were done at 37°C, as described in Materials and Methods. The plots show the fraction of RNA retained on a nitrocellulose membrane as a function of protein concentration. The binding data for the CYT-18 NTDs were fit to hyperbolic curves, while CYT-18 CTD binding data were fit to sigmoidal curves. Kd or K1/2 values and Hill coefficients (n) are shown in boxes and are the mean for three experiments with the error bars indicating the standard deviation. The CYT-18 CTD binds similarly to all RNAs tested at 37°C, as it did at 25°C.
Sequence cloverleaf structure diagrams of tRNATyr. The E. coli (species 1) tRNATyr, S. cerevisiae mt tRNATyr, and N. crassa mt tRNATyr are shown as cloverleafs with the four major identity elements highlighted. The latter are: (1) the N73 nucleotide (purple); (2) the N1–N72 base pair (orange); (3) the anticodon (blue); and (4) the variable arm (green). Modified nucleotides are indicated for each tRNATyr: D, dihydrouridine; Gm, 2′-O-methylguanosine; i6A, N-6-isopentenyladenosine; m5C, 5-methylcytidine; m2,2G, N2,N2-dimethylguanosine; (m5G) 5-methylguanosine. ms2i6A, N6-(delta 2-isopentenyl)-2-methylthioadenosine; ψ, pseudouridine; Q, queuosine; s4U, 4-thiouridine; T, ribothymidine.
Splicing of the N. crassa mt LSU and ND1m group I introns by CYT-18/Sc mtTyrRS chimeric proteins. (A–C) Splicing assays of wild-type CYT-18 and chimeric proteins 1 and 2 (Chi.1 and Chi. 2, respectively) with 32P-labeled Nc mt LSU group I intron (A and C) or Nc ND1m group I intron (B) at two temperatures (25°C or 37°C) and two salt concentrations (25 mM or 100 mM KCl). The splicing reactions were done with 200 nM 32P-labeled RNA and 100 nM protein (A and B) or 200 nM 32P-labeled RNA and 500 nM protein (C) for 60 min, as described in Materials and Methods. (D) Splicing assays of wild-type CYT-18, Chi.1, and Chi. 2, with unlabeled precursor RNA and [α-32P]GTP. Splicing assays were done as above with 500 nM protein, 200 nM unlabeled Nc mt LSU RNA, and 500 nM [α-32P]GTP (3,000 Ci mmol−1) for 60 min at 25°C. A darker exposure of the autogradiogram is shown below. The chimeric proteins spliced the Nc ND1m intron, which is not dependent upon the CYT-18 CTD, but were unable to splice Nc mt LSU intron, which is dependent upon the CYT-18 CTD, under any condition tested. The splicing of the Nc ND1m intron by the chimeric proteins in 100 mM KCl decreased with increasing temperature relative to the wild-type protein, likely reflecting loss of CTD interactions that contribute to but are not essential for splicing. Abbreviations: E1–E2, ligated exons; E1-I, 5′ exon+intron; I, excised intron; I-E2, intron+3′ exon; P, precursor RNA.
We thank Paul Paukstelis for the plasmid pHISTEV602 and Mark Del Campo for pCYT18/ΔC-tail construct and contributions to SAXS experimental design, data collection, and analysis. We thank Soenke Seifert at the Advanced Photon Source beamline 12-ID-C for assistance with data collection. Use of the Advanced Photon Source, an Office of Science User Facility operated for the US Department of Energy (DOE) Office of Science by Argonne National Laboratory, was supported by the US DOE under contract number DE-AC02-06CH11357. We thank Mark Del Campo, Georg Mohr, and Rick Russell for comments on the manuscript.
The author(s) have made the following declarations about their contributions: Conceived and designed the experiments: LTL AML. Performed the experiments: LTL. Analyzed the data: LTL ALM AML. Wrote the paper: LTL ALM AML.
- 1. Hall KB (2002) RNA–protein interactions. Curr Opin Struct Biol 12: 283–288.
- 2. Martin W, Koonin EV (2006) Introns and the origin of nucleus–cytosol compartmentalization. Nature 440: 41–45.
- 3. Hogan DJ, Riordan DP, Gerber AP, Herschlag D, Brown PO (2008) Diverse RNA-binding proteins interact with functionally related sets of RNAs, suggesting an extensive regulatory system. PLoS Biol 6: e255.
- 4. Keene JD (2007) RNA regulons: coordination of post-transcriptional events. Nat Rev Genet 8: 533–543.
- 5. Gurtan AM, Sharp PA (2013) The role of miRNAs in regulating gene expression networks. J Mol Biol 425: 3582–3600.
- 6. Glisovic T, Bachorik JL, Yong J, Dreyfuss G (2008) RNA-binding proteins and post-transcriptional gene regulation. FEBS Lett 582: 1977–1986.
- 7. Cannone JJ, Subramanian S, Schnare MN, Collett JR, D'Souza LM, et al. (2002) The comparative RNA web (CRW) site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs. BMC Bioinformatics 3: 2.
- 8. Haugen P, Simon DM, Bhattacharya D (2005) The natural history of group I introns. Trends Genet 21: 111–119.
- 9. Lambowitz AM, Belfort M (1993) Introns as mobile genetic elements. Annu Rev Biochem 62: 587–622.
- 10. Belfort M, Derbyshire V, Parker M, Cousineau B, Lambowitz AM (2002) Mobile introns: pathways and proteins. Craig N, Craigie R, Gellert M, Lambowitz AM, editors. Mobile DNA II. Washington (D.C.): American Society for Microbiology. pp. 761–783.
- 11. Lambowitz AM, Perlman PS (1990) Involvement of aminoacyl-tRNA synthetases and other proteins in group I and group II intron splicing. Trends Biochem Sci 15: 440–444.
- 12. Vicens Q, Paukstelis PJ, Westhof E, Lambowitz AM, Cech TR (2008) Toward predicting self-splicing and protein-facilitated splicing of group I introns. RNA 14: 2013–2029.
- 13. Dlakić M, Mushegian A (2011) Prp8, the pivotal protein of the spliceosomal catalytic center, evolved from a retroelement-encoded reverse transcriptase. RNA 17: 799–808.
- 14. Galej WP, Oubridge C, Newman AJ, Nagai K (2013) Crystal structure of Prp8 reveals active site cavity of the spliceosome. Nature 493: 638–643.
- 15. Mannella CA, Collins RA, Green MR, Lambowitz AM (1979) Defective splicing of mitochondrial rRNA in cytochrome-deficient nuclear mutants of Neurospora crassa. Proc Natl Acad Sci U S A 76: 2635–2639.
- 16. Collins RA, Lambowitz AM (1985) RNA splicing in Neurospora mitochondria. Defective splicing of mitochondrial mRNA precursors in the nuclear mutant cyt18-1. J Mol Biol 184: 413–428.
- 17. Akins RA, Lambowitz AM (1987) A protein required for splicing group I introns in Neurospora mitochondria is mitochondrial tyrosyl-tRNA synthetase or a derivative thereof. Cell 50: 331–345.
- 18. Wallweber GJ, Mohr S, Rennard R, Caprara MG, Lambowitz AM (1997) Characterization of Neurospora mitochondrial group I introns reveals different CYT-18 dependent and independent splicing strategies and an alternative 3′ splice site for an intron ORF. RNA 3: 114–131.
- 19. Guo Q, Lambowitz AM (1992) A tyrosyl-tRNA synthetase binds specifically to the group I intron catalytic core. Genes Dev 6: 1357–1372.
- 20. Mohr G, Zhang A, Gianelos JA, Belfort M, Lambowitz AM (1992) The Neurospora CYT-18 protein suppresses defects in the phage T4 td intron by stabilizing the catalytically active structure of the intron core. Cell 69: 483–494.
- 21. Caprara MG, Mohr G, Lambowitz AM (1996) A tyrosyl-tRNA synthetase protein induces tertiary folding of the group I intron catalytic core. J Mol Biol 257: 512–531.
- 22. Paukstelis PJ, Chen JH, Chase E, Lambowitz AM, Golden BL (2008) Structure of a tyrosyl-tRNA synthetase splicing factor bound to a group I intron RNA. Nature 451: 94–97.
- 23. Paukstelis PJ, Lambowitz AM (2008) Identification and evolution of fungal mitochondrial tyrosyl-tRNA synthetases with group I intron splicing activity. Proc Natl Acad Sci U S A 105: 6010–6015.
- 24. Eriani G, Delarue M, Poch O, Gangloff J, Moras D (1990) Partition of tRNA synthetases into two classes based on mutually exclusive sets of sequence motifs. Nature 347: 203–206.
- 25. Jakes R, Fersht AR (1975) Tyrosyl-tRNA synthetase from Escherichia coli. Stoichiometry of ligand binding and half-of-the-sites reactivity in aminoacylation. Biochemistry 14: 3344–3350.
- 26. Carter P, Bedouelle H, Winter G (1986) Construction of heterodimer tyrosyl-tRNA synthetase shows tRNATyr interacts with both subunits. Proc Natl Acad Sci U S A 83: 1189–1192.
- 27. Saldanha RJ, Patel SS, Surendran R, Lee JC, Lambowitz AM (1995) Involvement of Neurospora mitochondrial tyrosyl-tRNA synthetase in RNA splicing. A new method for purifying the protein and characterization of physical and enzymatic properties pertinent to splicing. Biochemistry 34: 1275–1287.
- 28. Yaremchuk A, Kriklivyi I, Tukalo M, Cusack S (2002) Class I tyrosyl-tRNA synthetase has a class II mode of cognate tRNA recognition. EMBO J 21: 3829–3840.
- 29. Kittle JD, Mohr G, Gianelos JA, Wang H, Lambowitz AM (1991) The Neurospora mitochondrial tyrosyl-tRNA synthetase is sufficient for group I intron splicing in vitro and uses the carboxy-terminal tRNA-binding domain along with other regions. Genes Dev 5: 1009–1021.
- 30. Mohr G, Rennard R, Cherniack AD, Stryker J, Lambowitz AM (2001) Function of the Neurospora crassa mitochondrial tyrosyl-tRNA synthetase in RNA splicing. Role of the idiosyncratic N-terminal extension and different modes of interaction with different group I introns. J Mol Biol 307: 75–92.
- 31. Chen X, Mohr G, Lambowitz AM (2004) The Neurospora crassa CYT-18 protein C-terminal RNA-binding domain helps stabilize interdomain tertiary interactions in group I introns. RNA 10: 634–644.
- 32. Cherniack AD, Garriga G, Kittle JD, Akins RA, Lambowitz AM (1990) Function of Neurospora mitochondrial tyrosyl-tRNA synthetase in RNA splicing requires an idiosyncratic domain not found in other synthetases. Cell 62: 745–755.
- 33. Paukstelis PJ, Coon R, Madabusi L, Nowakowski J, Monzingo A, et al. (2005) A tyrosyl-tRNA synthetase adapted to function in group I intron splicing by acquiring a new RNA binding surface. Mol Cell 17: 417–428.
- 34. Brick P, Blow DM (1987) Crystal structure of a deletion mutant of a tyrosyl-tRNA synthetase complexed with tyrosine. J Mol Biol 194: 287–297.
- 35. Qiu X, Janson CA, Smith WW, Green SM, McDevitt P, et al. (2001) Crystal structure of Staphylococcus aureus tyrosyl-tRNA synthetase in complex with a class of potent and specific inhibitors. Protein Sci 10: 2008–2016.
- 36. Kobayashi T, Takimura T, Sekine R, Vincent K, Kamata K, et al. (2005) Structural snapshots of the KMSKS loop rearrangement for amino acid activation by bacterial tyrosyl-tRNA synthetase. J Mol Biol 346: 105–117.
- 37. Paukstelis PJ, Chari N, Lambowitz AM, Hoffman D (2011) NMR structure of the C-terminal domain of a tyrosyl-tRNA synthetase that functions in group I intron splicing. Biochemistry 50: 3816–3826.
- 38. Covello PS, Gray MW (1993) On the evolution of RNA editing. Trends Genet 9: 265–268.
- 39. Stoltzfus A (1999) On the possibility of constructive neutral evolution. J Mol Evol 49: 169–181.
- 40. Lynch M (2007) The evolution of genetic networks by non-adaptive processes. Nat Rev Genet 8: 803–813.
- 41. Gray MW, Lukeš J, Archibald JM, Keeling PJ, Doolittle WF (2010) Irremediable complexity? Science 330: 920–921.
- 42. Petoukhov MV, Konarev PV, Kikhney AG, Svergun DI (2007) ATSAS 2.1 - towards automated and web-supported small-angle scattering data analysis. J Appl Crystallogr 40: s223–s228.
- 43. Svergun D, Barberato C, Koch M (1995) CRYSOL - a program to evaluate X-ray solution scattering of biological macromolecules from atomic coordinates. J Appl Crystallogr 28: 768–773.
- 44. Svergun DI, Koch MHJ (2003) Small-angle scattering studies of biological macromolecules in solution. Rep Prog Phys 66: 1735.
- 45. Svergun DI (1999) Restoring low resolution structure of biological macromolecules from solution scattering using simulated annealing. Biophys J 76: 2879–2886.
- 46. Svergun DI, Petoukhov MV, Koch MH (2001) Determination of domain structure of proteins from X-ray solution scattering. Biophys J 80: 2946–2953.
- 47. Roy A, Kucukural A, Zhang Y (2010) I-TASSER: a unified platform for automated protein structure and function prediction. Nat Protoc 5: 725–738.
- 48. Petoukhov MV, Svergun DI (2005) Global rigid body modeling of macromolecular complexes against small-angle scattering data. Biophys J 89: 1237–1250.
- 49. Goodman HM, Abelson JN, Landy A, Zadrazil S, Smith JD (1970) The nucleotide sequences of tyrosine transfer RNAs of Escherichia coli. Eur J Biochem 13: 461–483.
- 50. Heckman JE, Alzner-Deweerd B, RajBhandary UL (1979) Interesting and unusual features in the sequence of Neurospora crassa mitochondrial tyrosine transfer RNA. Proc Natl Acad Sci U S A 76: 717–721.
- 51. Sibler A-P, Dirheimer G, Martin RP (1983) The primary structure of yeast mitochondrial tyrosine tRNA. FEBS Lett 152: 153–156.
- 52. Quinn CL, Tao N, Schimmel P (1995) Species-specific microhelix aminoacylation by a eukaryotic pathogen tRNA synthetase dependent on a single base pair. Biochemistry 34: 12489–12495.
- 53. Cech TR, Zaug AJ, Grabowski PJ (1981) In vitro splicing of the ribosomal RNA precursor of Tetrahymena: Involvement of a guanosine nucleotide in the excision of the intervening sequence. Cell 27: 487–496.
- 54. Myers CA, Kuhla B, Cusack S, Lambowitz AM (2002) tRNA-like recognition of group I introns by a tyrosyl-tRNA synthetase. Proc Natl Acad Sci U S A 99: 2630–2635.
- 55. Bedouelle H, Guez-Ivanier V, Nageotte R (1993) Discrimination between transfer-RNAs by tyrosyl-tRNA synthetase. Biochimie 75: 1099–1108.
- 56. Whelihan EF, Schimmel P (1997) Rescuing an essential enzyme–RNA complex with a non-essential appended domain. EMBO J 16: 2968–2974.
- 57. Wang CC, Schimmel P (1999) Species barrier to RNA recognition overcome with nonspecific RNA binding domains. J Biol Chem 274: 16508–16512.
- 58. Cahuzac B, Berthonneau E, Birlirakis N, Guittet E, Mirande M (2000) A recurrent RNA-binding domain is appended to eukaryotic aminoacyl-tRNA synthetases. EMBO J 19: 445–452.
- 59. Aravind L, Koonin EV (1999) Novel predicted RNA-binding domains associated with the translation machinery. J Mol Evol 48: 291–302.
- 60. Korber P, Zander T, Herschlag D, Bardwell JC (1999) A new heat shock protein that binds nucleic acids. J Biol Chem 274: 249–256.
- 61. Staker BL, Korber P, Bardwell JC, Saper MA (2000) Structure of Hsp15 reveals a novel RNA-binding motif. EMBO J 19: 749–757.
- 62. Volpon L, Lievre C, Osborne MJ, Gandhi S, Iannuzzi P, et al. (2003) The solution structure of YbcJ from Escherichia coli reveals a recently discovered αL motif involved in RNA binding. J Bacteriol 185: 4204–4210.
- 63. Guo M, Yang X-L, Schimmel P (2010) New functions of aminoacyl-tRNA synthetases beyond translation. Nat Rev Mol Cell Biol 11: 668–674.
- 64. Smirnova EV, Lakunina VA, Tarassov I, Krasheninnikov IA, Kamenski PA (2012) Noncanonical functions of aminoacyl-tRNA synthetases. Biochemistry 77: 15–25.
- 65. Park SG, Schimmel P, Kim S (2008) Aminoacyl tRNA synthetases and their connections to disease. Proc Natl Acad Sci U S A 105: 11043–11049.
- 66. Bonnefond L, Giegé R, Rudinger-Thirion J (2005) Evolution of the tRNA(Tyr)/TyrRS aminoacylation systems. Biochimie 87: 873–883.
- 67. Kleeman TA, Wei D, Simpson KL, First EA (1997) Human tyrosyl-tRNA synthetase shares amino acid sequence homology with a putative cytokine. J Biol Chem 272: 14420–14425.
- 68. Wakasugi K, Schimmel P (1999) Two distinct cytokines released from a human aminoacyl-tRNA synthetase. Science 284: 147–151.
- 69. Kim Y, Shin J, Li R, Cheong C, Kim K, et al. (2000) A novel anti-tumor cytokine contains an RNA binding motif present in aminoacyl-tRNA synthetases. J Biol Chem 275: 27062–27068.
- 70. Yang X-L, Skene RJ, McRee DE, Schimmel P (2002) Crystal structure of a human aminoacyl-tRNA synthetase cytokine. Proc Natl Acad Sci U S A 99: 15369–15374.
- 71. Navaratnam N, Bhattacharya S, Fujino T, Patel D, Jarmuz AL, et al. (1995) Evolutionary origins of apoB mRNA editing: catalysis by a cytidine deaminase that has acquired a novel RNA-binding motif at its active site. Cell 81: 187–195.
- 72. Gray MW (2012) Evolutionary origin of RNA editing. Biochemistry 51: 5235–5242.
- 73. Beebe K, Mock M, Merriman E, Schimmel P (2008) Distinct domains of tRNA synthetase recognize the same base pair. Nature 451: 90–93.
- 74. Roy H, Ibba M (2010) Bridging the gap between ribosomal and nonribosomal protein synthesis. Proc Natl Acad Sci U S A 107: 14517–14518.
- 75. Kristelly R, Earnest BT, Krishnamoorthy L, Tesmer JJG (2003) Preliminary structure analysis of the DH/PH domains of leukemia-associated RhoGEF. Acta Crystallogr D Biol Crystallogr 59: 1859–1862.
- 76. Golden BL, Kim H, Chase E (2005) Crystal structure of a phage Twort group I ribozyme-product complex. Nat Struct Mol Biol 12: 82–89.
- 77. Mohr S, Ghanem E, Smith W, Sheeter D, Qin Y, et al. (2013) Thermostable group II intron reverse transcriptase fusion proteins and their use in cDNA synthesis and next-generation RNA sequencing. RNA 19: 958–970.
- 78. Studier FW (2005) Protein production by auto-induction in high density shaking cultures. Protein Expr Purif 41: 207–234.
- 79. He B, Rong M, Lyakhov D, Gartenstein H, Diaz G, et al. (1997) Rapid mutagenesis and purification of phage RNA polymerases. Protein Expr Purif 9: 142–151.
- 80. Lyakhov DL, He B, Zhang X, Studier F, Dunn JJ, et al. (1997) Mutant bacteriophage T7 RNA polymerases with altered termination properties. J Mol Biol 269: 28–40.
- 81. Jacques DA, Trewhella J (2010) Small-angle scattering for structural biology—expanding the frontier while avoiding the pitfalls. Protein Sci 19: 642–657.
- 82. Vladimir V. Volkov DIS (2003) Uniqueness of ab initio shape determination in small-angle scattering. J Appl Crystallogr 36: 860–864.
- 83. Bernadó P, Mylonas E, Petoukhov MV, Blackledge M, Svergun DI (2007) Structural characterization of flexible proteins using small-angle X-ray scattering. J Am Chem Soc 129: 5656–5664.
- 84. Petoukhov MV, Franke D, Shkumatov AV, Tria G, Kikhney AG, et al. (2012) New developments in the ATSAS program package for small-angle scattering data analysis. J Appl Crystallogr 45: 342–350.
- 85. M B Kozin DIS (2001) Automated matching of high-and low-resolution structural models. J Appl Crystallogr 34: 33–41.
- 86. Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32: 1792–1797.
- 87. Inoue T, Sullivan FX, Cech TR (1986) New reactions of the ribosomal RNA precursor of Tetrahymena and the mechanism of self-splicing. J Mol Biol 189: 143–165.