A Transcript Cleavage Factor of Mycobacterium tuberculosis Important for Its Survival

After initiation of transcription, a number of proteins participate during elongation and termination modifying the properties of the RNA polymerase (RNAP). Gre factors are one such group conserved across bacteria. They regulate transcription by projecting their N-terminal coiled-coil domain into the active center of RNAP through the secondary channel and stimulating hydrolysis of the newly synthesized RNA in backtracked elongation complexes. Rv1080c is a putative gre factor (MtbGre) in the genome of Mycobacterium tuberculosis. The protein enhanced the efficiency of promoter clearance by lowering abortive transcription and also rescued arrested and paused elongation complexes on the GC rich mycobacterial template. Although MtbGre is similar in domain organization and shares key residues for catalysis and RNAP interaction with the Gre factors of Escherichia coli, it could not complement an E. coli gre deficient strain. Moreover, MtbGre failed to rescue E. coli RNAP stalled elongation complexes, indicating the importance of specific protein-protein interactions for transcript cleavage. Decrease in the level of MtbGre reduced the bacterial survival by several fold indicating its essential role in mycobacteria. Another Gre homolog, Rv3788 was not functional in transcript cleavage activity indicating that a single Gre is sufficient for efficient transcription of the M. tuberculosis genome.


Introduction
Once transcription is initiated by RNAP, the enzyme must carry out elongation and termination to ensure full-length RNA synthesis. However, the movement of RNAP along the template during transcription elongation is not uniform and is interrupted either accidentally or by regulatory mechanisms [1]. Inadvertent disruption of the elongation complex would lead to the accumulation of nonfunctional RNA, which can be deleterious to the cell [2]. To overcome these interruptions, a number of transcription factors act during elongation and termination by modifying the properties of RNAP [1,3,4]. These factors deal with accidental disruption of the elongation process and affect transcription processivity and fidelity by modulating pausing, arrest, termination or anti-termination of the enzyme [1,5]. The prokaryotic transcript cleavage factors GreA and GreB [6,7] and their eukaryotic analog, elongation factor TFIIS [8], stimulate the intrinsic transcript cleavage activity of RNAP [9,10] to remove aberrant RNA 3′ ends so that polymerization can resume from the end of the cleaved RNA. They suppress RNAP pausing and rescue arrested [7,11] or road-blocked [12] transcription complexes, giving RNAP a second chance to resume elongation [13], by directly accessing the RNAP active center through the secondary channel [10,14]. Although homologs of the Gre factors are found in most bacteria, they are well characterized from only a few species, viz. E. coli [6,7], Thermus thermophilus and Thermus aquaticus [15,16,17].
No information on the properties of transcript cleavage factors is available from the genus Mycobacterium, which harbors several pathogenic species. In this manuscript we describe the characteristics of the M. tuberculosis Gre factor.
The genome of M. tuberculosis harbors a gre factor, Rv1080c [18], sharing 32% and 26% identity (48% and 43% similarity) with E. coli GreA and GreB, respectively. Other ORFs in the genome that show a low degree of similarity with the E. coli Gre factors are Rv3788, which shares 16% identity and 33% similarity with E. coli GreA (Figure S1A), and Rv2103, a hypothetical protein with much lower similarity (9% identity and 21% similarity with E. coli GreA). The former has a Gre-like domain organization, while the latter lacks the key acidic amino acids and the domains required for Gre-like activity.
A number of molecular processes show significant differences in mycobacteria compared to the other well-studied bacterial systems [19]. Presence of a large number of sigma factors recognizing unique sequences of the promoters in their GC rich genomes [20], slow rates of transcription and macromolecular synthesis [21,22] and occurrence of novel transcription activators [18] etc. point towards the differences in the transcription process. The GC rich genome of M. tuberculosis (65.6% G+C) may pose additional challenges to the transcribing RNAP and hence the role of Gre factor could be critical for high fidelity transcription. We demonstrate that Rv1080c, the primary Gre factor of the genome is essential for cell survival unlike the Gre factors characterized from other eubacteria. The protein is needed for efficient promoter escape by reducing the abortive initiation and anti-arrest action during transcription elongation. Although its properties resemble E. coli GreA in many respects, it does not appear to collaborate with E. coli RNAP during elongation process and much of its properties seem to be tailored for the mycobacterial transcription.

Results
Rv1080c has Gre factor like domain organization
Rv1080c encodes a 164 amino-acid protein having sequence similarity with the E. coli transcript cleavage factors GreA and GreB (Figure S1). A homology model of the protein was generated using the crystal structure of E. coli GreA (PDB code: 1GRJ) [23] as a template (Figure S1B). GreA and GreB of E. coli have two distinct domains: an N-terminal coiled-coil domain (Gre-NTD) and a C-terminal globular domain (Gre-CTD) [14,24,25]. The NTD is responsible for the stimulation of specific nucleolytic and anti-arrest activities, whereas residues in the Gre-CTD interact with the coiled-coil domain of the RNAP β′ subunit [26,27]. From the model, it is evident that Rv1080c is more similar to E. coli GreA than to GreB in its surface charge distribution (Figure S1C). The homology model of Rv3788, the other Gre homolog in the M. tuberculosis genome, shows that most of the features of the Gre factor are conserved in the ORF (Figure S1A, S1B and S1C). The M. smegmatis Gre (MsGre) has 97% similarity with the M. tuberculosis protein in amino acid sequence and shares a similar domain architecture. To understand the function and the nature of the transcript cleavage stimulatory activity of the mycobacterial Gre factor and the Gre factor homolog Rv3788, the genes were cloned in pET20b for over-expression of the ~18 kDa proteins in E. coli (Figure S2A, S2B and S2C). The identities of the expressed proteins were confirmed by peptide-mass-fingerprinting using MALDI-TOF (data not shown).

MtbGre stimulates the intrinsic cleavage activity of mycobacterial RNAP
A stalled elongation complex comprising a 20 nt RNA was generated from the T7A1 promoter (T7A1-TEC) for studying transcript cleavage on elongation complexes (Figure S3A). RNAPs from both M. smegmatis (MsRNAP) and M. tuberculosis (MtbRNAP) were proficient in carrying out transcription from this template (Figure S3B). Transcript cleavage is an intrinsic property of the catalytic center of RNAP [9] but is very slow and requires prolonged incubation. First, the intrinsic cleavage activities of the enzymes from E. coli, M. smegmatis and M. tuberculosis were compared. In all three enzyme systems, RNA fragments of varied length were generated after incubation for a few hours. Varied amounts of short RNA fragments generated from the 3′ end of the stalled TEC could be detected at the bottom of the gels (Figure 1A). Both MtbRNAP and MsRNAP had lower intrinsic cleavage activity than E. coli RNAP (EcRNAP) (Figure 1A), but the cleavage activity was stimulated at alkaline pH, as with the E. coli enzyme (Figure 1B), indicating conservation of the mechanism across bacterial species. However, cleavage of the TEC was not complete for the mycobacterial RNAPs even at alkaline pH. The slower nuclease activity seen above was inherent to the mycobacterial polymerases and not due to co-purification of endogenous Gre factor (Figure S4).
MtbGre factor stimulated the cleavage of short fragments (2-3 nt) from the 3′ end of the nascent RNA in the 20-mer T7A1-TEC, and 50% cleavage could be achieved in less than 12 minutes (Figure 2A and 2B), indicating that Rv1080c indeed functions like a Gre factor. The pattern seen with MsGre was nearly identical, mirroring their high degree of similarity (Figure S5A). However, the transcript cleavage activity of MsGre appears to be higher than that of MtbGre. In E. coli, GreA-induced hydrolysis generates mostly short di- and tri-nucleotides (type I cleavage), while GreB-induced hydrolysis generates fragments of variable length, up to 18 nt (type II cleavage), depending on the extent of RNAP backtracking [6,7,28]. The patterns shown in Figure 2A and 2B and Figure S5A indicate that the mycobacterial Gre factor follows type I cleavage.
The MtbGre homolog, Rv3788, is a protein of 161 amino acids with a predicted coiled-coil N-terminal domain and a C-terminal globular domain (Figure S1A and S1B). The key acidic residues required for the transcript cleavage activity of Gre factors and the hydrophobic residues in the C-terminal RNAP interaction region are conserved in Rv3788. However, the transcript cleavage assays presented in Figure 2C show that Rv3788 lacks cleavage stimulatory activity on stalled elongation complexes under the assay conditions used for the canonical Gre factor, and hence it was not investigated further.

Gre factor knock-down results in growth retardation in mycobacteria
To check the importance of the gre factor for cell growth, an antisense construct was generated by cloning the M. tuberculosis gre gene in reverse orientation under the control of the constitutive hsp60 promoter in pMV261 (Figure S5B). This strategy has been successfully employed to assess the physiological importance of several other mycobacterial genes [29,30,31]. Expression of the M. tuberculosis gre anti-sense reduced the viability of M. tuberculosis (Figure 3A) by several fold compared to control cells transformed with the pMV261 vector alone. M. smegmatis cells transformed with the MtbGre anti-sense construct also showed reduced viability (Figure 3A) and were compromised in growth when compared to cells transformed with the vector or the MtbGre over-expressing construct (Figure 3B). Western blots of the cell lysates probed with anti-Gre antibody showed a highly reduced level of Gre protein in cells carrying the anti-sense construct, suggesting that the decreased survival could be due to the reduction in Gre concentration in the cells (Figure 3C). The M. smegmatis cells over-expressing MtbGre factor also showed an elongated phenotype (Figure S5C).
From the above data, it is apparent that the decrease in intracellular Gre levels could have caused the growth defects in both organisms. This would also mean that a balanced pool of Gre may be required to sustain cell viability. To measure the endogenous levels of the protein, semi-quantitative western blot analysis was carried out at different stages of cell growth. The expression level of endogenous Gre was highest in mid-exponential phase, both in M. smegmatis and in M. tuberculosis (Figure S6A). The Gre concentration in M. smegmatis was ~82 fmol/mg total protein in early exponential stage cells and remained at almost the same level during late exponential phase, after which it declined slightly to 66 fmol/mg total protein in stationary phase (Figure S6B). Gre levels in exponentially growing M. tuberculosis cells were comparable to the levels seen in M. smegmatis cells (Figure S6A). Interestingly, the combined amount of GreA (~53 fmol/mg of total protein) and GreB (~13 fmol/mg of total protein) [32] in exponentially growing E. coli cells is comparable to the level of the single Gre protein found in mycobacteria. The RNAP concentration also seems to be comparable between the two species (Gupta and Nagaraja, unpublished results). Next, the expression of Gre in response to different cellular stress conditions in M. smegmatis was determined by measuring the protein content, and was found to be mostly unperturbed (Figure S6C). RT-PCR experiments under various conditions also did not show significant alterations in gre mRNA levels (data not shown). Together, these results indicate that a constant level of Gre is retained irrespective of growth phase or environmental conditions. These findings are in contrast to observations in several other organisms, where the GreA level was found to be altered under different stress conditions [33,34].
Thus from all the results presented in Figure 3A to 3C (gre knock-down) and Figure S6A to S6C, we surmise that although amount of Gre in mycobacteria is found to be comparable to E. coli, maintaining the level is critical for cell survival.
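The comparison of Gre pool sizes made above is simple arithmetic and can be restated as a short sketch. The four values are those quoted in the text; the function names are illustrative only and are not part of the original analysis.

```python
# Gre levels quoted in the text, in fmol per mg of total protein.
E_COLI_GREA = 53.0            # exponentially growing E. coli [32]
E_COLI_GREB = 13.0            # exponentially growing E. coli [32]
MSMEG_GRE_EXPONENTIAL = 82.0  # M. smegmatis, early exponential phase
MSMEG_GRE_STATIONARY = 66.0   # M. smegmatis, stationary phase


def combined_e_coli_gre():
    """Total Gre pool (GreA + GreB) in E. coli."""
    return E_COLI_GREA + E_COLI_GREB


def ratio_to_e_coli(mycobacterial_level):
    """Mycobacterial Gre level relative to the combined E. coli pool."""
    return mycobacterial_level / combined_e_coli_gre()


if __name__ == "__main__":
    print("E. coli GreA + GreB:", combined_e_coli_gre(), "fmol/mg")
    print("exponential ratio:", round(ratio_to_e_coli(MSMEG_GRE_EXPONENTIAL), 2))
    print("stationary ratio:", round(ratio_to_e_coli(MSMEG_GRE_STATIONARY), 2))
```

On these numbers the single mycobacterial Gre is present at roughly the same level as the two E. coli factors combined, which is the sense in which the levels are called comparable.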

Reduction of abortive transcription initiation, and antiarrest activity of MtbGre
To determine the activity of MtbGre, in vitro transcription reactions were carried out using M. smegmatis PrrnPCL1 as a template. Efficient open complex (RPO) formation at this promoter does not translate into efficient synthesis of full-length transcripts, owing to high abortive RNA synthesis [35]. One of the properties of the E. coli Gre factors is to reduce abortive RNA synthesis and enhance promoter clearance [36,37]. MtbGre enhanced full-length transcript synthesis from PrrnPCL1 by reducing the abortive transcripts (Figure 4A). Notably, the intermittent pauses seen above the abortive transcripts in transcription from PrrnPCL1 were also reduced in the presence of MtbGre (Figure 4B). After cleavage of the transcript in the paused elongation complex, the trimmed TEC was capable of restarting transcription in the presence of all NTPs on both the T7A1 promoter and the mycobacterial PrrnB promoter templates (Figure 5A, 5B). The minor differences in the patterns in Figures 5A and 5B could be a template-specific effect. It is possible that some of the stalled elongation complexes generated on the T7A1 template had entered an inactive arrested state and could not be elongated further. Taken together, data from these experiments indicate that MtbGre factor can function on pre-formed stalled elongation complexes and induce transcript cleavage-restart activity.

Structural features of Gre factors are conserved in MtbGre
Alignment of MtbGre with its E. coli counterparts revealed the following conserved features (Figure 6A). (i) Acidic amino acids at the tip of the predicted coiled-coil domain found in the N-terminus of the protein. In E. coli Gre factors, these residues are involved in Mg²⁺ co-ordination with the RNAP active center [10]. (ii) A short basic patch of residues on one side of a helix, which interacts with the 3′ end of RNA in E. coli [38]. (iii) A globular domain at the C-terminus of the protein. Residues in this domain of E. coli GreB interact with the carboxyl-terminal coiled-coil domain of the RNAP β′ subunit [27]. D43 and E46 at the acidic tip of the coiled-coil domain (equivalent to D36 and E39 of E. coli GreA) and S127 in the C-terminal globular domain of MtbGre factor (equivalent to E. coli GreA S119) (Figure 6A) were mutated to D43N, E46R and S127E to address their function in MtbGre. The D43N and S127E mutations completely abolished the activity of MtbGre factor. On the other hand, the E46R mutant retained the cleavage stimulation activity (Figure 6B). These results indicate that, of the two acidic residues at the tip of the predicted N-terminal coiled-coil domain, D43 is essential for transcript cleavage activity. The loss of activity of the S127E mutant was probably due to loss of its interaction with RNAP. Ni-NTA pull-down assays were carried out to assess the direct interaction between purified MtbRNAP and histidine-tagged MtbGre or its S127E variant. MtbGre factor bound MtbRNAP (Lane 4 of Figure 6C), and, as predicted, the S127E mutant did not interact with RNAP (Lane 6 of Figure 6C).

MtbGre factor is specific to the mycobacterial RNAP
The MtbGre factor shares similar structural features (Figure 7A) with E. coli GreA and could rescue halted elongation complexes. Therefore, the ability of MtbGre to functionally complement the E. coli Gre factors was tested using an E. coli ΔgreA/ΔgreB double knock-out strain [39], which shows a cold-sensitive phenotype. MtbGre factor expressed from a pTrc construct could not complement E. coli ΔgreA/ΔgreB grown at 28°C (Figure 7B), although the protein was expressed in E. coli (Figure S7A). The failure to complement could be due to the lack of interaction between E. coli RNAP and MtbGre (Figure 7C). In support of this, in vitro assays showed that MtbGre factor functions only on mycobacterial TECs, i.e., those of M. smegmatis and M. tuberculosis (Figure 7D). It did not stimulate transcript cleavage on TECs containing E. coli RNAP even at a very high concentration (>10 µM). Similarly, E. coli GreA was not functional on the mycobacterial elongation complexes (Figure S7B).

Discussion
In this study, we describe the characterization of Rv1080c, the Gre factor present in the M. tuberculosis genome. MtbGre increased transcription efficiency during both the initiation and elongation phases of the process. During initiation, it reduced the abortive transcripts and enhanced promoter clearance. During elongation, the protein rescued RNAP from transcription pauses by inducing transcript cleavage. Knocking down the gene resulted in growth retardation and cell death, indicating its essentiality for cell survival.
In organisms where Gre factors have been analyzed so far, they show remarkably similar structural features. Functional characterization of the Gre factors from E. coli [6,7], T. aquaticus and T. thermophilus [15,17,40] revealed the conserved nature of the transcript cleavage stimulation activity required for efficient transcription. However, gre genes were found to be dispensable in E. coli; the ΔgreA ΔgreB double knock-out strain showed only a mild cold-sensitive phenotype [39]. In contrast, in M. tuberculosis the protein appears to have a more pronounced and indispensable role. At first glance our results appear to contradict the earlier transposon mutagenesis studies, which led to the isolation of an insertional mutation of M. tuberculosis gre (http://mylims2.cvmbs.colostate.edu/tnlist/). We have noticed that the point of insertion of the transposon is at the 493rd position out of the 495 bases of Rv1080c. Thus it is likely that the gene was not inactivated in the mutant strain. Also, cell survival was affected when intracellular Gre levels decreased. Notably, a significant amount of the protein is present at all growth phases, indicating its house-keeping function. Further, the Gre protein level was not altered to a great extent under different stress conditions, indicating that an optimum level of the protein may be required for cell survival. MtbGre can rescue a pre-formed halted elongation complex to exert its anti-arrest activity, similar to E. coli GreA, and ensure efficient transcription elongation. The transcript cleavage pattern of MtbGre showed type I cleavage products, i.e. predominantly 2-3 nt fragments, similar to the activity of E. coli GreA. The longer transcript cleavage pattern (2-18 nt, type II) seen with E. coli GreB is mediated by a large stretch of positively charged residues in its N-terminal domain [38]. MtbGre does not have such a large stretch of basic amino acids, and its surface charge distribution is similar to that of E. coli GreA (Figure S1C). In organisms having GreB, RNAP could backtrack farther, leaving a larger RNA 3′ end fragment to be processed. Indeed, in such conditions, high affinity interaction between RNAP and GreB results in transcript cleavage activity [16,27]. Earlier studies have revealed lower transcription elongation rates in mycobacteria [41,42]. Organisms such as E. coli with faster transcription rates seem to require two Gre factors to process shorter and longer RNA.
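The argument about the transposon insertion site can be checked with codon arithmetic. The sketch below assumes that the 495 bases quoted for Rv1080c include the stop codon (164 codons for the 164 amino-acid protein plus one stop codon) and that positions are counted 1-based from the first base of the start codon; both are assumptions made for illustration, and the function names are hypothetical.

```python
# Values quoted in the text.
GENE_LENGTH_BP = 495       # length of Rv1080c in bases
PROTEIN_LENGTH_AA = 164    # length of the encoded protein
INSERTION_POSITION = 493   # reported transposon insertion point

# Assumption: the 495 bases comprise 164 sense codons plus one stop codon.
assert PROTEIN_LENGTH_AA * 3 + 3 == GENE_LENGTH_BP


def codon_index(position_bp):
    """Return the 1-based codon number containing a 1-based base position."""
    return (position_bp - 1) // 3 + 1


def disrupts_coding_codon(position_bp, protein_length_aa):
    """True if the position lies within a codon that encodes an amino acid."""
    return codon_index(position_bp) <= protein_length_aa


if __name__ == "__main__":
    print("codon hit:", codon_index(INSERTION_POSITION))
    print("disrupts a coding codon:",
          disrupts_coding_codon(INSERTION_POSITION, PROTEIN_LENGTH_AA))
```

Under these assumptions position 493 is the first base of codon 165, i.e. the stop codon, so all 164 amino-acid codons would remain intact. This is consistent with the suggestion that the gene was not inactivated in the mutant strain.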
The action of MtbGre seems to be restricted to the mycobacterial transcription machinery, as it did not rescue a halted elongation complex of E. coli RNAP. Lack of interaction between these heterologous partners could account for the observation. The interaction surface on E. coli RNAP for E. coli GreB was mapped to a conserved hydrophobic loop in the coiled-coil domain in the C-terminus of the β′ subunit [27]. The region is also conserved in mycobacterial RNAP (Figure 7E), indicating the conserved architecture of the transcription machinery. However, the C-terminal globular domain of Gre factors (GreA and GreB of E. coli, and MtbGre), which interacts with RNAP, shows considerable variation, although certain specific residues in the hydrophobic patch are conserved in all these proteins. The importance of specific interactions between RNAP and Gre is suggested by studies in T. aquaticus: GreA of T. aquaticus failed to induce transcript cleavage in EcRNAP elongation complexes [15], similar to the present observation with MtbGre. Thus it appears that transcript cleavage activity requires species-specific interactions, although both partners, viz. RNAP and Gre, have conserved characteristics across species. Gre may have a more important function in mycobacteria to compensate for the low intrinsic cleavage activity of mycobacterial RNAP compared to its E. coli and thermophilic counterparts. This deficiency could affect the recovery of backtracked MtbRNAP from arrest in the absence of MtbGre. A similar mechanism has recently been proposed to explain growth inhibition of yeast strains expressing a cleavage-deficient mutant of the eukaryotic Gre analog, TFIIS [43]. The results presented here and the data that have emerged to date from a number of studies with Gre factors from diverse organisms emphasize the biological importance of these secondary channel binding proteins. Deletion of greA led to a hypersensitivity phenotype under various stress conditions in E. coli [39], Sinorhizobium meliloti [44] and Rhizobium tropici [45], implicating Gre factors in the survival of the organism in restrictive environments. In contrast, a decrease in Gre levels under normal growth conditions was itself sufficient to reduce the viability of M. tuberculosis. The indispensability of the Gre factor in M. tuberculosis but not in E. coli [39] or T. thermophilus [17] indicates that the intracellular role of the factor is likely to vary between different species of bacteria.
MtbGre seems to be the only transcription elongation factor in the genome possessing cleavage activity, as the other ORF, Rv3788, found in the genome with a lower degree of relatedness does not appear to participate in the process. The lack of transcript cleavage stimulatory activity in Rv3788 may be attributed to the absence of several key residues in the N-terminus that are found in Gre factors across different organisms. Although the two acidic residues needed for Mg²⁺ co-ordination are conserved in Rv3788 (Figure S1A), Asn47 and Tyr50 (present in MtbGre), which are required for binding to the protruding nascent RNA of the backtracked complex, are absent. Nevertheless, Rv3788 has several features similar to the RNAP secondary channel binding proteins and hence may have some other intracellular role. It is also apparent that the RNAP secondary channel binding proteins are emerging as key regulators of different cellular functions apart from the transcript cleavage stimulatory functions [5].
In conclusion, Rv1080c functions like a bona fide Gre factor with transcript cleavage stimulatory activity in M. tuberculosis. Gre function is required for the optimal growth of mycobacteria, in contrast to its dispensability in E. coli. GC rich templates are known to impose blockage during transcription due to the formation of stable RNA-DNA hybrids [46]. Such strong barriers have to be overcome to ensure high fidelity RNA synthesis. Slower transcription rates in mycobacteria may lead to intermittent pauses and stalling at specific signals. Under these circumstances RNAP has to ensure completion of the elongation process. Transcription factors like Gre, which maintain efficiency by preventing premature pauses, appear to have a more profound role in maintaining the genomic integrity of M. tuberculosis.

Materials and Methods
Knock-down of gre expression in M. smegmatis mc²155 and M. tuberculosis H37Ra was carried out by generating the plasmid pMVgreAS (Mtbgre in anti-sense orientation) in pMV261 [51]. The coding sequence was amplified using primers with a BamHI site (Table 1) and cloned downstream of the hsp60 promoter at the BamHI site of the vector pMV261 to generate plasmid pMVgreOE (Table 1) for over-expression of MtbGre in both M. smegmatis and M. tuberculosis. Growth rates of the different strains were compared by inoculating (1% inoculum) 30 ml of Middlebrook 7H9 medium containing 25 µg ml⁻¹ kanamycin to obtain an initial OD600 of 0.02 to 0.04. Growth of the strains was monitored by dilution plating from 8-day cultures of M. tuberculosis or 20 h cultures of M. smegmatis grown at 37°C with shaking. The cells were diluted in fresh medium and plated onto Middlebrook 7H10 agar plates to determine cell viability by counting the cfu.
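Colony counts from dilution plating are conventionally converted to viable counts per ml of the original culture. The following is a minimal sketch of that conversion and of the fold reduction relative to a control. The colony numbers, dilution factors and plated volume in the usage section are hypothetical and are not taken from the experiments described here.

```python
def cfu_per_ml(colonies, dilution_factor, plated_volume_ml):
    """Viable count of the undiluted culture, in cfu per ml.

    colonies          -- colonies counted on the plate
    dilution_factor   -- total fold dilution of the plated sample (e.g. 1e5)
    plated_volume_ml  -- volume spread on the plate, in ml
    """
    if dilution_factor <= 0 or plated_volume_ml <= 0:
        raise ValueError("dilution factor and plated volume must be positive")
    return colonies * dilution_factor / plated_volume_ml


def fold_reduction(control_cfu_per_ml, test_cfu_per_ml):
    """Fold decrease in viability of a test strain relative to the control."""
    if test_cfu_per_ml <= 0:
        raise ValueError("test count must be positive")
    return control_cfu_per_ml / test_cfu_per_ml


if __name__ == "__main__":
    # Hypothetical counts: vector control versus anti-sense strain.
    control = cfu_per_ml(colonies=150, dilution_factor=1e5, plated_volume_ml=0.1)
    antisense = cfu_per_ml(colonies=30, dilution_factor=1e4, plated_volume_ml=0.1)
    print("control cfu/ml:", control)
    print("anti-sense cfu/ml:", antisense)
    print("fold reduction:", fold_reduction(control, antisense))
```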

Western blots
To detect the protein level at different growth phases, cell lysates were probed for Gre factor with a polyclonal antibody raised in mice and for SigA with an anti-SigA antibody raised in rabbit. The primary antibodies were probed with an HRP-coupled secondary antibody and the blots were developed using a chemiluminescence substrate (GE Health Care). Expression of Gre factor under different stress conditions was also checked by growing M. smegmatis cells to mid-log phase and subjecting them to varied stresses as described [52]. The amount of Gre protein present in M. smegmatis cells was determined by western blot. Varying concentrations of purified M. smegmatis Gre were loaded on the same gel as standards along with 120 µg of cell extract from cultures at different growth phases, and the blot was subsequently probed with anti-Gre antibody.
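Quantification against purified standards loaded on the same gel amounts to fitting a standard curve and interpolating the sample band. The sketch below assumes a linear relation between band intensity and the amount of Gre over the range of the standards. The intensity values are hypothetical, and the source does not state how the densitometry was analyzed.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least two paired standards")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def gre_fmol_per_mg(band_intensity, slope, intercept, extract_mg):
    """Interpolate the amount of Gre in a lane and normalise to protein loaded."""
    fmol_in_lane = (band_intensity - intercept) / slope
    return fmol_in_lane / extract_mg


if __name__ == "__main__":
    # Hypothetical standards: fmol of purified Gre versus band intensity.
    standards_fmol = [5.0, 10.0, 20.0]
    standards_intensity = [500.0, 1000.0, 2000.0]
    slope, intercept = fit_line(standards_fmol, standards_intensity)

    # Hypothetical sample band from a lane loaded with 120 µg (0.12 mg) extract.
    level = gre_fmol_per_mg(984.0, slope, intercept, extract_mg=0.12)
    print("Gre level (fmol/mg total protein):", round(level, 1))
```

Sample bands falling outside the range of the standards would need a different dilution rather than extrapolation, since chemiluminescent signals are linear only over a limited range.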

Microscopy
M. smegmatis cells harboring pMV261, pMVgreAS or pMVgreOE constructs were grown in Middlebrook 7H9 medium at 37°C to mid-exponential phase. Cells were pre-fixed in PBS, 1% (v/v) Triton X-100 (Sigma) and 2% (v/v) toluene (Merck) solution and incubated overnight at 4°C. Cells were stained with DAPI (4′,6-diamidino-2-phenylindole) solution, which binds DNA.

Expression and purification of MtbGre, MsGre and Rv3788
The gre (Rv1080c) and Rv3788 genes were PCR amplified from M. tuberculosis genomic DNA with specific primers (Table 1) and cloned between the NdeI and HindIII sites of pET20b (pET20bgre and pET20brv3788). The M. smegmatis gre (MSMEG_5263) gene was PCR amplified from M. smegmatis mc²155 genomic DNA and cloned in pET20b (between the NdeI and HindIII sites). Site-directed mutants of Mtbgre were generated using the mega-primer inverse PCR method with pET20bgre as a template (primer sequences are listed in Table 1). The purification of MtbGre, its mutants and MsGre was carried out as follows. E. coli BL21 cells [53] carrying pET20bgre, its mutants or pET20bmsgre were grown to an OD600 of 0.6 at 37°C and induced with 0.3 mM IPTG. Cells were lysed by sonication and centrifuged at 100,000 × g for 2 h. The supernatants were subjected to 0-65% ammonium sulfate precipitation, re-suspended in 3 ml of TGE buffer [10 mM Tris-HCl, pH 8.0, 5% glycerol, 0.1 mM EDTA] with 50 mM NaCl and subsequently resolved on a 120 ml Sephacryl S-100 gel filtration column. The fractions containing Gre protein were further purified by DEAE-Sephacel chromatography, eluting with a linear NaCl gradient of 50 mM to 400 mM. The Rv3788 protein was purified from E. coli BL21 cells harboring pET20brv3788. The purification involved a 45-60% ammonium sulfate precipitation of the cell lysate followed by DEAE-Sephacel chromatography. All the purified proteins were approximately 95% pure as judged by SDS-PAGE (Figure S2C). From 2 liters each of the cultures over-expressing the proteins (MtbGre, MsGre and Rv3788), about 5 mg of each protein was obtained. E. coli greA was cloned with a C-terminal His-tag in pET20b and the protein was purified from over-expressing E. coli BL21 cells [53] using a Ni-NTA column. M. smegmatis RNAP was purified by the method described earlier [49]. M. tuberculosis RNAP was purified from 2 liters of M. tuberculosis H37Ra cells grown for 8 days at 37°C in MB7H9 medium with ADC supplement (Difco). The purification involved gel filtration on Superdex S-200 matrix and subsequent heparin-Sepharose chromatography, following the method described for native M. smegmatis RNAP purification [49].

Stalled TEC preparation
Transcription assays were carried out using the T7A1 promoter and RNAPs from E. coli, M. smegmatis and M. tuberculosis. Ternary elongation complexes were generated on a 5′ biotinylated T7A1 promoter-containing DNA template (Figure S3A). The TECs for E. coli or the mycobacterial RNAPs were prepared by following the methods described for the E. coli and T. thermophilus enzymes [15,26].

Intrinsic cleavage activity of RNAP
Intrinsic cleavage activity of the M. smegmatis, M. tuberculosis and E. coli RNAPs was detected by prolonged incubation (up to 4 h) of the TECs (prepared with 15 nM template and 100 nM RNAP) in transcription buffer (pH 7.5) at 37°C, followed by resolution on a 20% urea-acrylamide gel. pH-induced transcript cleavage reactions were carried out in three different buffer systems at 37°C for 30 min.

Cleavage-restart activity of MtbGre
The 20-mer T7A1 TEC or the 39-mer M. smegmatis PrrnB TECs were prepared using 15 nM biotinylated template and 100 nM RNAP. The RNAP was stalled on the T7A1 template by using only 100 µM each of ATP and GTP and 2 µCi of [α-³²P] ATP (300 Ci mmol⁻¹, Perkin Elmer) in each 10 µl reaction volume. For generating the +39 stalled elongation complex at the PrrnB promoter, 100 µM each of ATP, GTP and CTP were used along with 2 µCi of [α-³²P] ATP. To detect the cleavage-restart activity of MtbGre, the TECs were incubated with MtbGre factor in the presence or absence of all four NTPs. Initially the complexes were incubated with 2 µM MtbGre for 30 min, followed by addition of the NTPs; incubation was continued for another 10 min and the products were resolved by 20% urea PAGE.

MtbGre-RNAP interaction
C-terminal His-tagged MtbGre and its S127E mutant were cloned in pET20b and purified using a Ni-NTA column. 5 µg each of RNAP (Ec or Mtb) and Gre protein were used for analyzing direct interactions. Proteins were incubated together for 15 min at room temperature in a 50 µl volume of incubation buffer containing 50 mM Tris-HCl (pH 8.0), 100 mM potassium glutamate, 5% glycerol and 20 mM imidazole. 20 µl of Ni-NTA pre-equilibrated with incubation buffer was then added to the protein mixture and incubated for an additional 30 min in a rotary mixer. The supernatant was separated and the pellet was washed thrice with 400 µl of the incubation buffer. Finally, the pellet was re-suspended in 50 µl of buffer mixed with SDS-gel loading buffer, boiled and loaded onto an 11% SDS-PAGE gel along with the supernatant fractions, followed by silver staining of the gel.

Complementation of E. coli ΔgreA/ΔgreB strain with M. tuberculosis gre
The M. tuberculosis gre gene was cloned in the pTrc99c vector to obtain the pTrc99gre construct, which was used for complementing the E. coli TK1021 strain (Table 1). The parental strain TK1001 was used as the wild type E. coli control. The E. coli greA-expressing plasmid pMS002 was used as a positive control in these experiments [39]. The cells were grown in liquid culture and different dilutions were spotted on LB plates containing 0.3 mM IPTG and appropriate antibiotics (Table 1).