Convergence of the Transcriptional Responses to Heat Shock and Singlet Oxygen Stresses

Cells often mount transcriptional responses and activate specific sets of genes in response to stress-inducing signals such as heat or reactive oxygen species. Transcription factors in the RpoH family of bacterial alternative σ factors usually control gene expression during a heat shock response. Interestingly, several α-proteobacteria possess two or more paralogs of RpoH, suggesting some functional distinction. We investigated the target promoters of Rhodobacter sphaeroides RpoHI and RpoHII using genome-scale data derived from gene expression profiling and the direct interactions of each protein with DNA in vivo. We found that the RpoHI and RpoHII regulons have both distinct and overlapping gene sets. We predicted DNA sequence elements that dictate promoter recognition specificity by each RpoH paralog. We found that several bases in the highly conserved TTG in the −35 element are important for activity with both RpoH homologs; that the T-9 position, which is over-represented in the RpoHI promoter sequence logo, is critical for RpoHI–dependent transcription; and that several bases in the predicted −10 element were important for activity with either RpoHII or both RpoH homologs. Genes that are transcribed by both RpoHI and RpoHII are predicted to encode for functions involved in general cell maintenance. The functions specific to the RpoHI regulon are associated with a classic heat shock response, while those specific to RpoHII are associated with the response to the reactive oxygen species, singlet oxygen. We propose that a gene duplication event followed by changes in promoter recognition by RpoHI and RpoHII allowed convergence of the transcriptional responses to heat and singlet oxygen stress in R. sphaeroides and possibly other bacteria.


Introduction
Transcriptional responses to stress are critical to cell growth and survival. In bacteria, stress responses are often controlled by alternative s factors that direct RNA polymerase to transcribe promoters different from those recognized by the primary s factor [1,2]. Therefore, identifying the target genes for a particular alternative s factor can help identify the functions necessary to respond to a given stress. For example, the transcriptional response to heat shock in Escherichia coli uses the alternative s factor s 32 to increase synthesis of gene products involved in protein homeostasis or membrane integrity [3]. From available genome sequences, proteins related to E. coli s 32 are conserved across virtually all proteobacteria. This so-called RpoH family of alternative s factors is characterized by a conserved amino acid sequence (the ''RpoH box'') that is involved in RNA polymerase interactions [4,5]. RpoH family members also possess conserved amino acid sequences in s factor regions 2.4 and 4.2 that interact with promoter sequences situated approximately 210 and 235 base pairs upstream of the transcriptional start sites, respectively [6].
However, the definition of functional promoters for this family of alternative s factor using only the presence or the extent of sequence identity for the predicted 210 and 235 binding regions is not a sufficient predictor of transcription activity [7].
While bacteria often possess many alternative s factors, they usually possess only one member of the RpoH family. However, several a-proteobacteria, including Brucella melitensis [8], Sinorhizobium meliloti [9,10], Bradyrhizobium japonicum [11,12], Rhizobium elti [13] and Rhodobacter sphaeroides [14], possess two or more RpoH homologs. In some cases, one or more of these RpoH homologs completely or partially complement the phenotypes of E. coli DrpoH mutants, suggesting that these proteins can functionally interact with RNA polymerase and recognize similar promoter elements [8][9][10][11]14,15]. However, in the nitrogen-fixing plant symbiont Rhizobium elti, the DrpoH 1 mutant was sensitive to heat and oxidative stress while the DrpoH 2 mutant was sensitive to osmotic stress [13]. Therefore, the additional members of the RpoH family in aproteobacteria may have roles in other stress responses.
Previous work demonstrated that either R. sphaeroides RpoH I or RpoH II can complement the temperature sensitive phenotype of an E. coli DrpoH mutant; that singly mutant R. sphaeroides strains lacking either rpoH I or rpoH II are able to mount a heat shock response; and that RNA polymerase containing either RpoH I or RpoH II can initiate transcription from a common set of promoters in vitro [14][15][16]. Combined, these observations suggest that RpoH I and RpoH II have some overlapping functions in R. sphaeroides. On the other hand, in vitro transcription assays identified promoters that were selectively transcribed by either RpoH I or RpoH II [14,15]. Moreover, rpoH II is under direct transcriptional control of RpoE, a Group IV alternative s factor that acts as the master regulator of the response of R. sphaeroides to singlet oxygen stress [17][18][19]. These later results and the recent observation that a DrpoH II mutant is more sensitive to singlet oxygen stress than the wild-type strain [15,17] suggest that RpoH I and RpoH II also have distinct functions in R. sphaeroides. Finally, global protein profiles of R. sphaeroides mutants lacking rpoH I , rpoH II , or both genes, suggested that RpoH I and RpoH II have distinct and overlapping regulons [15,17,20]. However, the extent of genes that are direct targets for RpoH I and RpoH II is still unknown because past studies have been unable to distinguish direct from indirect effects on gene expression or identify all the direct targets for either of these s factors.
In this study, we characterized the RpoH I and RpoH II regulons using a combination of expression microarrays, chromatin immunoprecipitation and computational methods which have been previously been shown to predict correctly direct targets for other alternative s factors or DNA binding proteins [19,21]. We found that the genes predicted to be common to the RpoH I and RpoH II regulons function in protein repair or turnover, membrane maintenance, and DNA repair. Genes specific to the RpoH I regulon encode other proteins involved in protein maintenance and DNA repair, whereas genes specific to the RpoH II regulon include proteins involved in maintaining the oxidation-reduction state of the cytoplasmic thiol pool. We used information on the members of each regulon to generate and test hypotheses about DNA sequences that determine promoter specificity of these two RpoH homologs. The observed properties of these two R. sphaeroides RpoH homologs illustrate how duplication of an alternative s factor and subsequent changes in promoter recognition could have allowed convergence of transcriptional responses to separate signals. In the case of R. sphaeroides, we predict that these events allowed convergence of the transcriptional responses to heat shock and singlet oxygen stresses to be under control of these two RpoH paralogs.

Results
Defining the distinct and overlapping regulons of R. sphaeroides RpoH I and RpoH II To define members of the RpoH I and RpoH II regulons, we monitored transcript levels and protein-DNA interactions in R. sphaeroides strains ectopically expressing either RpoH I or RpoH II . To generate these strains, we constructed low copy plasmids carrying rpoH I or rpoH II under the control of an IPTG-inducible promoter [22] and conjugated them into R. sphaeroides mutant strains lacking rpoH I [16] or rpoH II [15], respectively. To induce target gene expression, we exposed exponentially growing aerobic cultures to IPTG for one generation before cells were either harvested to extract total RNA for analysis of transcript levels or treated with formaldehyde to prepare samples for chromatin immunoprecipitation on a chip (ChIP-chip) assays. The Western blot analysis used to measure levels of these alternative s factors demonstrates that cells ectopically expressing RpoH I and RpoH II contained each protein at levels comparable to those following either heat shock or singlet oxygen stress ( Figure 1). Thus, these strains can be used to characterize members of the RpoH I and RpoH II regulons.
As controls for this experiment, we measured the abundance of individual RpoH proteins and a control transcription factor (PrrA) [23], which is not known to be dependent on either alternative s factor for its expression, when wild type cells were exposed to either heat or singlet oxygen stress. This analysis showed that RpoH I is detectable prior to heat stress, but its levels increase 10 and 20 minutes after the shift to increased temperature ( Figure 1A). RpoH I levels remain elevated after the temperature shift but they decline within 60 minutes after heat shock, suggesting that as in the case of E. coli s 32 , there is an initial rise in RpoH I levels immediately on heat shock before they return to a new steady state level at elevated temperature [24]. RpoH II was also detected prior to exposure to singlet oxygen and within 10 minutes of exposure to this reactive oxygen species, levels of this protein were increased ( Figure 1B). Levels of RpoH II found within 20 minutes after exposure to singlet oxygen remained relatively constant over the time course of this experiment, suggesting a continuous requirement for RpoH II during this stress response ( Figure 1B). The abundance of the control transcription factor PrrA did not follow these same trends, suggesting that the observed increases in individual RpoH proteins was associated with these stress responses. In addition, the abundance of individual RpoH proteins did not increase significantly to both stress responses, as expected if these increases were not due to a general increase in protein levels in response to different signals.
To identify transcripts that were increased in abundance as a result of RpoH I or RpoH II activity, we compared mRNA levels of cells expressing RpoH I or RpoH II ectopically to those of control cells lacking either rpoH I or rpoH II . We selected differentially expressed genes with a significance level set for a false discovery rate #5% and that displayed at least 1.5-fold higher transcript levels in cells expressing either RpoH family member. This analysis revealed that transcripts from 241 and 186 genes were increased by expression of RpoH I and RpoH II , respectively ( Figure 2). These two sets of differentially expressed genes have 60 genes in common.

Author Summary
An important property of living systems is their ability to survive under conditions of stress such as increased temperature or the presence of reactive oxygen species. Central to the function of these stress responses are transcription factors that activate specific sets of genes needed for this response. Despite the central role of stress responses across all forms of life, the processes driving their organization and evolution across organisms are poorly understood. This paper uses genomic, computational, and mutational analyses to dissect stress responses controlled by two proteins that are each members of the RpoH family of alternative s factors. RpoH family members usually control gene expression during a heat shock response. However, the photosynthetic bacterium Rhodobacter sphaeroides and several other a-proteobacteria possess two or more paralogs of RpoH, suggesting some functional distinction. Our findings predict that a gene duplication event followed by changes in DNA recognition by RpoH I and RpoH II allowed convergence of the transcriptional responses to heat and singlet oxygen stress in R. sphaeroides and possibly other bacteria. Our approach and findings should interest those studying the evolution of transcription factors or the signal transduction pathways that control stress responses.
We recognize that some of these differentially-expressed transcripts might be not be direct targets for RpoH I and RpoH II . Therefore, to determine which of the above genes were directly transcribed by RNA polymerase holoenzyme containing either RpoH I or RpoH II , we performed ChIP-chip assays from comparable cultures to map direct interactions of RpoH I or RpoH II with genomic DNA. We were able to raise specific antibodies against RpoH II that performed well for the ChIP-chip assay, but repeated attempts to raise suitable antibodies against RpoH I failed. Therefore, we placed a FLAG polypeptide tag [25] at the N-terminus of the RpoH I protein sequence and used anti-FLAG monoclonal antibodies to perform the ChIP-chip assay. As a control we tested and showed that addition of the polypeptide tag did not alter the activity and specificity of RpoH I by comparing the mRNA level profiles of cells expressing the tagged version of RpoH I with cells expressing wild-type RpoH I ( Figure  S1). In addition, other control experiments showed there was no detectable cross-reaction between FLAG-RpoH I and the antibody used to precipitate RpoH II , and vice versa (data not shown). From the ChIP-chip analysis we identified 812 and 1353 genomic regions enriched after immunoprecipitation with antibodies against RpoH I and RpoH II , respectively, using a significance level set for a false discovery rate #5%. Because the signal from a single s factor binding site extends on average over a 1 kb region, some enriched regions may contain multiple binding sites. To increase the resolution of the putative RpoH I and RpoH II binding sites, we identified the modes of the ChIP-chip signal distributions within each enriched region. This adjustment increased the number of putative binding sites for RpoH I and RpoH II to 1085 and 1765, respectively.
We then identified all the annotated genes that contained a ChIP-chip peak within 300 base pairs upstream of their start codons as a way to define candidate genes or operons in the RpoH I or RpoH II regulons. Included in this list of potential regulon members were genes that are predicted to be cotranscribed using a previous computational analysis of R. sphaeroides operon organization (http://www.microbesonline.org/operons/) [26]. Therefore, by these criteria, the upper limits of the total numbers of genes potentially regulated by RpoH I or RpoH II are 1120 and 1616, respectively ( Figure 2). We recognized that a significant number of the putative RpoH I or RpoH II promoters may not be assigned from the ChIP-chip dataset alone, especially because promoter orientation needs to be considered and that because s factor or RNA polymerase binding events do not always promote transcription. Therefore, we refined the respective RpoH I and RpoH II regulons by intersecting the lists of target genes identified from the ChIP-chip analysis with the lists of candidate genes identified from the expression profiling analysis. After this intersection, we predict that the RpoH I regulon contains 175 genes and the RpoH II regulon contains 144 genes with 45 genes common to both regulons ( Figure 2).
Upon examining the annotations of these predicted target genes, the 45 genes that are members of both the RpoH I and RpoH II regulons are predicted to encode mainly for functions related to the electron transport chain, protein homeostasis, and DNA repair (Table 1 and Table S1). The 130 predicted members of the RpoH I regulon also encode functions in these three groups, Figure 1. RpoH I and RpoH II accumulation following heat and singlet oxygen stresses. Western blots illustrating the levels of RpoH I and RpoH II in wild-type R. sphaeroides (WT) at different times following (A) a shift of temperature from 30uC to 42uC (heat shock) or (B) addition of the photosensitizer methylene blue in the presence of oxygen (singlet oxygen stress). On the same western blots, the levels of FLAG-RpoH I and RpoH II obtained from ectopic expression vectors used in the expression profiling and ChIP-chip experiments under normal conditions. Note that because of the addition of the FLAG polypeptide, RpoH I -FLAG migrates slower than the wild-type RpoH I . The abundance of RpoH I and RpoH II in wild-type cells in the absence of added stress are shown in the first lane. As a gel loading control, the membranes were also subsequently treated polyclonal antibodies against the response regulator PrrA, a control transcription factor who's expression is not known to be dependent on either of the RpoH homologs. The experiment was designed to analyze changes in levels of RpoH I , RpoH II and PrrA before and after a stress, so the differences between panels reflect different exposure times used when developing the Western blots. doi:10.1371/journal.pgen.1002929.g001 Figure 2. Overlap between the RpoH I and RpoH II regulons. Venn diagram representing the overlaps between genes that were significantly induced by the expression of RpoH I or RpoH II , and genes whose promoters were bound by RpoH I or RpoH II containing RNA polymerase holoenzyme in vivo. The total numbers of genes identified in each study are indicated in the parentheses. The RpoH I (solid outline) and RpoH II (dashed outline) regulons, as defined in this study, are identified by the emphasized outlines. The total numbers of genes contained in each regulon are indicated below the arrows. doi:10.1371/journal.pgen.1002929.g002  Predicted differences in promoter sequences recognized by RpoH I or RpoH II Previous work indicated that RpoH I and RpoH II can recognize and initiate transcription from similar promoter sequences [14,15,20]. The characterization of their respective regulons also suggests that some promoters can be transcribed by both s factors while others are specific to either RpoH I or RpoH II . Therefore, we hypothesized that while the promoter sequences of the two s factors may be similar, different sequence-specific interactions of RpoH I or RpoH II with promoter elements are the basis of promoter specificity for transcription initiation by RNA polymerase.
To overcome the limited resolution of the ChIP-chip experiment and predict determinants of promoter specificity for RpoH I or RpoH II , we searched the regions upstream of genes in each regulon for conserved sequence elements (137 sequences for RpoH I and 120 sequences for RpoH II ). The conserved sequence elements we identified mapped to putative promoter elements that were within 100 bp of the coordinates of the modes of the distributions of the ChIP-chip signal. Thus, the predictions of these searches identified conserved sequence elements that were in agreement with the experimental data. In addition, even though we analyzed the individual RpoH I and RpoH II regulons independently for these motifs, the sequence alignment algorithm converged to the same sequence elements for promoters that were predicted to be recognized by both RpoH I and RpoH II . This result is not surprising given that both s factors have similar amino acid sequences in their DNA recognition regions and are thus expected to recognize similar promoter sequences. However, this observation supports the hypothesis that RpoH I and RpoH II recognize common promoter sequences in their respective target genes as opposed to distinct promoters.
To predict specificity sequence determinants for each RpoH paralog, the putative distinct and overlapping promoter sequences were sorted into three groups according to the expression profiling and ChIP-chip data sets and converted into sequence logos ( Figure 3, Table S2). The sequence logos derived from the three groups include: two groups that are preferentially or selectively bound and transcribed by either RpoH I or RpoH II and one group that is bound and transcribed by both s factors. As noted above, some promoters appear to be bound by RpoH I or RpoH II without inducing detectable changes in transcript levels. We aligned these promoters separately to determine if they possessed unique characteristics, but no significant differences were detected (data not shown).
The conservation of a TTG motif in the 235 region in all three logos is consistent with the importance of this triplet in a previous analysis of at least one promoter known to be recognized by both RpoH I and RpoH II [27]. However, there was also evidence for sequence-specific elements in the logos for each RpoH paralog. In the logo for the RpoH I -dependent promoters, a cytosine is overrepresented at position 237 and a thymine is overrepresented at position 29. In the logo for RpoH II -dependent promoters, cytosine and thymine are overrepresented at positions 214 and 213, respectively.
Overall, the comparison between RpoH I and RpoH II -specific promoter logos allowed us to identify significant differences in the promoter sequences that may be used to adjust promoter selectivity and strength for RpoH I or RpoH II . In addition, the predicted sequence elements for RpoH I or RpoH II promoters are not mutually exclusive. Rather, it appears that promoter specificities for RpoH I or RpoH II are distributed along a gradient using a combination of specific bases at various positions of the 235 or 210 promoter elements.

Degrees of promoter specificity of RpoH I and RpoH II
To test predictions about specificity determinants derived from these logos, we cloned several putative promoters upstream of a lacZ reporter gene and integrated these into the genome of a R. sphaeroides DrpoH I DrpoH II mutant [15] via homologous recombination. The activity of each promoter was measured by assaying bgalactosidase activity in these R. sphaeroides reporter strains ectopically expressing either RpoH I or RpoH II (Figure 4) at levels comparable to those found during a stress response (see above and Figure 1). The RSP_1173, RSP_1408, and RSP_1531 promoters (which were either predicted to be members of the RpoH I regulon or, in the case of RSP_1173, known to be heat inducible and transcribed by RpoH I [16], had significant activity in the strain expressing RpoH I , but not when the same strain expressed RpoH II ( Figure 4). In contrast, the RSP_2314, RSP_2389, and RSP_3274 promoters (which were either predicted to be members of the RpoH II regulon by our analysis or known to be induced by conditions that generate singlet oxygen [17,18,20]) showed activity in the presence of RpoH II but not RpoH I (Figure 4). Finally, the RSP_1207 and RSP_2617 promoters (which were predicted to be transcribed by both RpoH proteins and, in the case of RSP_1207, known to be transcribed by RNA polymerase holoenzyme containing either RpoH homolog [15] showed activity in cells containing either RpoH I or RpoH II (Figure 4). Overall, these results support predictions about members of the RpoH I or RpoH II regulons derived by combining the transcription profiling, ChIP-chip and computational analyses.
To test the predictions about the contributions of individual bases to promoter recognition, we measured the activity of R. sphaeroides RpoH I with an existing library of mutant E. coli groE promoters fused to a lacZ reporter in an E. coli tester strain [7]. The data from this analysis revealed that base substitutions in the TTG motif of the 235 region of this RpoH-dependent promoter (positions 236, 235, and 234) reduced its activity by at least 80% with RpoH I (Figure 5A), as expected from the predictions of promoter logo. We also found a slight increase in promoter activity when position 232 was changed to a cytosine, even though the C-32 is not conserved in RpoH I promoters. This observation is consistent with the results of a previous mutational analysis showing that E. coli s 32 prefers a cytosine at position 232 when the alanine at position 264 of its amino acid sequence is substituted to an arginine (corresponding to R267 of RpoH I ) [28], but also suggests that the 232 position is not utilized to distinguish between RpoH I -and RpoH II -specific promoters. In the 210 region of the groE promoter, substitutions of the cytosine at position 214 for an adenine or guanine, the cytosine at position 213 for an adenine, or substitution of the thymine at position 211 for a cytosine, each reduced RpoH I -dependent promoter activity. In addition, a substitution of the adenine at position 212 for a cytosine or changing the thymine at position 29 for any other base reduced RpoH I -dependent activity by .90%. These observations are consistent with the conservation of a thymine at position 29 of the derived RpoH I promoter logo (Figure 3).
To test the predicted requirement of RpoH I for a thymine at position 29, we also analyzed the properties of two R. sphaeroides promoters in this E. coli tester strain. Activity of the RpoH Idependent RSP_1531 promoter was reduced by 90% when the thymine at position 29 was changed to a cytosine, whereas the RpoH II -dependent RSP_2314 promoter had higher RpoH Idependent activity when a thymine was placed at position 29 ( Figure 5B). Therefore, this analysis confirmed that position 29 plays a critical role in promoter specificity for RpoH I . In conclusion, the measured effects of mutations in the E. coli groE promoter on RpoH I -dependent transcription confirmed that our . Conserved promoter sequences recognized by RpoH I and RpoH II . The logos were constructed from promoter sequences alignments sorted into three categories according to their predicted specificity. The consensus sequence for s 32 -dependent promoters in E. coli,as determined by Nomaka et al. [37], is shown as a reference. The heights of the letters represent the degree of conservation across sequences (information in bits, logos generated using WebLogo: http:// weblogo.berkeley.edu/). The coordinates on the x-axes represent the positions relative to the predicted transcription start site. The numbers of promoter sequences used to create the logos are indicated in parentheses on the left of the logos. Below the logos are the sequence alignments of selected promoters that were used for direct experimental validation. doi:10.1371/journal.pgen.1002929.g003 models captured elements that are critical for promoter recognition by RpoH I .
We were unable to test activity of R. sphaeroides RpoH II against this groEL promoter library in the same E. coli tester strain (data not shown). Instead, we generated a small set of point mutations in the P1 promoter of the R. sphaeroides cycA promoter ( Figure 3) which was previously shown to be transcribed by both RpoH I and RpoH II [27] and measured activity from single-copy fusions of these mutant promoters to lacZ in cells that either lacked both RpoH homologs or that contained a single rpoH gene under control of an IPTG-inducible promoter (Materials and Methods).
By analyzing this promoter library, we found that a G to T mutation at position 236 of cycA P1 (G-36T) increased its transcription by both RpoH I and RpoH II ( Figure 5C). This result is consistent with the high predicted information content for T at this position for both RpoH I and H II (Figure 3), as well as the previous observation that the overall increase in activity of cycA P1 is caused by the G-36T mutation [27]. While our RpoH I and RpoH II promoter models (Figure 3) predict that a C could be allowed at position 236, a G-36C mutation lowered activity with RpoH II and had no positive impact on transcription by RpoH I (Figure 5C). Due to the significantly increased in activity from the G-36T mutation in cycA P1, all of the other promoter mutations we tested were generated in this background. Mutations we tested in the 235 region, T-35C and G-34C, resulted in virtually complete loss of cycA P1 activity with either RpoH I and RpoH II when compared to their G-36T parent promoter ( Figure 5C), indicating that these bases are essential for transcription initiation by both RpoH homologs. Based on the relatively low information content predicted by our models for other positions in the 235 element (Figure 3), we did not test the effects of other mutations in this region on promoter selectivity by RpoH homologs.
In the predicted 210 region, A-12 has very high information content for both RpoH I and RpoH II , but the sequence logo suggests a T at this position might allow selective recognition by RpoH I (Figure 3). Indeed, a promoter containing a T at position 212 is still active only with RpoH I , suggesting that A-12 is essential for RpoH II activity but not RpoH I activity. The T at position 29 of cycA P1 is also predicted to have significantly higher information content for RpoH I than RpoH II , while a C at this position should have more information content for RpoH II than RpoH I (Figure 3). As predicted, we found RpoH II retained significant activity after placing a T-9C mutation in the context of the G-36T cycA P1 promoter. Furthermore, we found that this mutation completely abolished its activity with RpoH I , illustrating the high information content of a T at this position for transcription by this RpoH homolog. The importance for a T at the analogous position was also observed when testing activity of mutant E. coli groE promoters with RpoH I (T-9C mutation Figure 5A) or assaying function of the R. sphaeroides RSP_1531 promoter (which contains a T, Figure 5B) that is only transcribed to a detectable level by RpoH I (Figure 4). Finally, we also replaced the A at position 210 of the cycA P1 promoter with a G, as the sequence logo suggests there to be little information content at this position for either RpoH I or RpoH II ( Figure 3). As predicted, there is little impact of the A-10G mutation on promoter function, though activity with RpoH II is more significantly reduced than that with RpoH I activity ( Figure 5C).

Discussion
When organisms encounter environmental or internal stress they often increase the transcription of genes encoding proteins that help mitigate damage to cellular components. Therefore, identifying functions that are involved in transcriptional stress responses is critical to understand both the nature of the damage caused to cellular components and how organisms respond to these challenges. Singlet oxygen and increased temperature are very different phenomena, but in R. sphaeroides the transcriptional responses to these two stresses involve two alternative s factors, RpoH I and RpoH II , that each belong to the RpoH family [15,16,18]. Several other a-proteobacteria contain two or more members of the RpoH family that appear to control different stress responses [13,29,30]. However, as it is the case in R. sphaeroides, little is known about the target genes for these multiple RpoH homologs. In this work, we characterized genes that are directly transcribed by R. sphaeroides RpoH I and RpoH II to gain a better understanding of the biological response to heat shock and singlet oxygen stresses. We found that each of these RpoH paralogs control transcription of over 100 genes, suggesting that each of these phenomena lead to large changes in gene expression. However, we also found that there is significant overlap in the RpoH I and RpoH II regulons, creating an unexpectedly extensive connection between the transcriptional responses to these two signals. In addition, we investigated the characteristics of RpoH Iand RpoH II -dependent promoters. This effort allowed us to identify sequence elements that define promoter specificity for each s factor, thereby allowing cells to selectively partition target genes for each RpoH paralog into different stress responses.

R. sphaeroides RpoH I and RpoH II control the expression of a common set of functions
This work revealed a surprisingly extensive overlap of the RpoH I and RpoH II regulons even though these two homologs activate transcriptional responses to different signals in R. sphaeroides. This suggests that genes activated by these two pathways of the transcriptional regulation network play a role in the physiological response to both these, and even possibly, other stresses. Indeed, the genes regulated by both RpoH I and RpoH II encode known or annotated functions involved in protein homeostasis, DNA repair, and maintenance of cell membrane integrity (Table 1). These types of functions are central to cell viability and may be relevant for the physiological responses to multiple stresses that can have broad primary and secondary effects on cells. Indeed, the predicted functions of the overlapping members of the RpoH I and RpoH II regulons encode functions that are also part of the general stress response regulons for s S in E. coli or s B in Bacillus subtilis [31,32]. Interestingly, s S homologs are mostly present in band c-proteobacteria, but to date absent from sequenced genomes of a-proteobacteria like R. sphaeroides (http://img.jgi.doe.gov/) [33]. Thus, it is possible that the set of genes controlled by both RpoH I and RpoH II is part of a general stress response that is common to the heat shock, singlet oxygen and possibly other uncharacterized signals in R. sphaeroides [14,15,17,18,20]. This hypothesis is supported by the observation that R. sphaeroides and R. elti strains lacking both RpoH I and RpoH II are more sensitive to several conditions than strains lacking only one of these proteins [13,15,20].
In considering the scope of functions that are regulated by both RpoH I and RpoH II , it is also important to note that this set of genes may be larger than the one we characterized because some promoters known to be transcribed by both s factors were only marginally affected by ectopic expression of either RpoH I or RpoH II . For example, the RSP_2310 (groES) promoter was shown to be transcribed by both RpoH I and RpoH II in previous in vitro experiments [14] and was detected by our ChIP-chip experiment to be bound by both RpoH I and RpoH II , but did not meet all the criteria of our analysis. Thus, the groES promoter, like other promoters, may be subject to complex regulation in vivo.
RpoH I and RpoH II each control functions specific to heat shock or singlet oxygen stresses, respectively Our data also significantly extend the number and types of functions that are specifically controlled by RpoH I or RpoH II (Table 1). We expected to find specific sets of target genes because strains lacking either RpoH I or RpoH II displayed different phenotypes [14,15,17,20]. While previous results indicated that accumulation of ,25 proteins was dependent on RpoH II [17], our data indicate that some 150 genes are directly controlled by each R. sphaeroides RpoH paralog.
Genes in the direct but RpoH I -specific regulon encode functions that are involved in protein homeostasis, maintaining membrane integrity, and DNA repair, as is found for the E. coli s 32 regulon [3] (Table 1) The RpoH I specific regulon is also predicted to encode cation transporters and proteins in the thioredoxindependent reduction system (Table 1). Ion transporters can aid the heat shock stress response since exporting cations like iron, which may be released by thermal denaturation of damaged ironsulfur or other metalloproteins, decreases secondary effects caused by formation of toxic reactive oxygen species [34]. The thioredoxin-dependent reduction system reduces disulfide bonds and peroxides, which are created by protein oxidation, and thereby helps maintain cytoplasmic proteins in a reduced state [35]. Inclusion of these functions in the RpoH I regulon suggests that oxidative damage may be an important secondary effect of heat shock, perhaps caused by protein denaturation or permeabilization of the cell envelope. Overall, these results support the hypothesis that the function of RpoH I in R. sphaeroides is similar to that of s 32 in E. coli for the response to heat shock stress. In addition, it is also possible that RpoH I plays a role in the R. sphaeroides response to other forms of stress. There is precedent for roles of s 32 homologs in other stress responses by other bacteria since the activity of RpoH in Caulobacter crescentus is increased by heavy metal stress [36].
In contrast, rpoH II transcription is under direct control of a Group IV alternative s factor (RpoE) that serves as the master regulator of the singlet oxygen stress response [18]. In addition, an R. sphaeroides DrpoH II mutant is more sensitive to singlet oxygen than a wild-type or DrpoH I strain [15,17]. Therefore, members of the direct RpoH II -specific regulon might be expected to play an important role in the response to singlet oxygen stress. Among the genes in the RpoH II -specific regulon are others predicted to function in maintaining membrane integrity and performing DNA repair, both potential targets for damage by singlet oxygen. However, the RpoH II -specific regulon contains fewer genes encoding functions related to protein homeostasis than found in the RpoH I regulon (Table 1). Other functions apparently unique to the RpoH II regulon include the glutathione-dependent reduction system, which like the thioredoxin-dependent system repair oxidized protein residues and maintain a reduced cytoplasm (Table 1). Even though the thioredoxin-and gluthationedependent reduction systems serve similar cellular functions, they are apparently under the control of different RpoH-dependent transcriptional networks in R. sphaeroides. Thus, it is possible that the thioredoxin-and gluthatione-dependent reduction systems preferentially function on different oxidized substrates. Glutathione-dependent reduction systems are known to function on lipids or other types of protein oxidative damage that might be experienced by the cell following singlet oxygen damage [35]. We also found that the RpoH II -specific regulon includes the multisubunit NADH:quinone oxidoreductase and genes encoding enzymes in heme and quinone biosynthesis ( Table 1). Each of these functions are critical for the respiratory and photosynthetic electron transport chains of R. sphaeroides and are known or predicted to contain one or more oxidant-sensitive metal centers. Thus, placement of these genes in the RpoH II -specific regulon suggests that these membrane or bioenergetic functions are damaged by and need to be replaced in the presence of singlet oxygen. Overall, our data indicates that the RpoH II -specific regulon controls expression of functions in the repair of oxidized proteins and replacement or assembly of critical electron transport chain components. Furthermore, the different types of repair functions found in the RpoH II regulon predict that singlet oxygen can damage numerous cellular components.

RpoH I and RpoH II recognize different but compatible promoter sequence elements
Our global gene expression data, results from analysis of gene fusions, as well as previously reported in vitro experiments [14,15] all indicate that RNA polymerase containing either RpoH I or RpoH II can recognize some promoters in common. This observation is not surprising considering that RpoH I and RpoH II have similar amino acid sequences in their respective promoter recognition regions and are each able to rescue growth of E. coli s 32 mutants [14][15][16]. Likewise, the sequence logos derived here revealed that the promoter sequences recognized by each of the R. sphaeroides RpoH homologs are similar to both each other and to that recognized by E. coli s 32 [37].
Our experiments provide definitive evidence that some promoters are transcribed either exclusively or predominantly by RpoH I or by RpoH II . We were also able to predict and confirm the importance of bases for activity with individual RpoH homologs (particularly those in the 235 element). We have computational and experimental observations that can explain some aspects of promoter selectivity by RpoH I and RpoH II . For example, our experiments identify T-9 and other positions in the 210 element as potential candidates in this discrimination, as one or more substitutions have larger effects on activity with individual RpoH I homologs. Mutation of T-9 to any other base reduced RpoH I -driven expression of GroE promoter by more than 90%, and this same effect was observed using an authentic RpoH I promoter from R. sphaeroides. Importantly, changing the 29 position of an RpoH II R. sphaeroides promoter to T permitted expression by RpoH I . Together, these data suggest that T-9 is either required for or significantly enhances expression of RpoH I promoters, but is likely to be less important for expression of RpoH II promoters, as there is only weak conservation of -9T in RpoH II promoters. Our data also predict that other bases, which are overrepresented in the RpoH II promoters, could be critical for expression by that s factor. As is the case with E. coli s 32 there are likely to be specificity determinants that lie outside the canonical 235 and 210 elements [7,37]. Thus, additional in vivo and in vitro experiments with a larger suite of mutant promoters and a library of mutant RpoH proteins are needed to better define the determinants of promoter selectivity by RpoH I and RpoH II .
In conclusion, our results suggest that, at least in R. sphaeroides, RpoH I controls functions that are necessary for maintenance of protein homeostasis and membrane integrity after temperature increase and other cytoplasmic stress, similar to the wellcharacterized role of E. coli s 32 in the heat shock response [3]. However, we propose that, in R. sphaeroides, some RpoH I -regulated functions are also useful for survival in the presence of other forms of stress because these target genes also contain promoters that are recognized by RpoH II . We propose that the duplication of an ancestral RpoH protein to create a second homolog of this alterative s factor provided R. sphaeroides the opportunity to connect stress response functions to another stimulus. In this model, rpoH II was placed under the control of the master regulator of the singlet oxygen stress response and the two RpoH proteins evolved to recognize somewhat different but compatible promoter elements to assure the optimal regulation of distinct but overlapping stress regulons. As a result of these events, the transcriptional responses of R. sphaeroides to heat shock and singlet oxygen stress were separable but allowed to converge and contain a common set of functions. It will be interesting to identify and examine other examples of such convergence across bacteria and other organisms that possess multiple homologs of RpoH or other transcription factors.

Materials and Methods
Bacterial strains and growth conditions E. coli strains were grown in Luria-Bertani medium [38] at 30uC or 37uC. R. sphaeroides strains were grown at 30uC in Sistrom's succinate-based medium [39]. E. coli DH5a was used as a plasmid host, and E. coli S17-1 was used as a donor for plasmid conjugation into R. sphaeroides. The media were supplemented with kanamycin (25 mg/ml), ampicillin (100 mg/ml), chloramphenicol (30 mg/ml), spectinomycin (50 mg/ml), tetracycline (10 mg/ml for E. coli and 1 mg/ml for R. sphaeroides), trimethoprim (30 mg/ml), or 0.1% of L-(+)-arabinose when required. Unless noted, all reagents were used according to the manufacturer's specifications. The list of bacterial strains and plasmids used in this study are summarized in Table S3.

Construction of plasmids for controlled expression of RpoH I and RpoH II in R. sphaeroides
Plasmids for ectopically expressing RpoH I or RpoH II were constructed by separately cloning the rpoH I or rpoH II genes downstream of the IPTG-inducible promoter in pIND4 [22]. DNA fragments containing rpoH I or rpoH II were amplified from R. sphaeroides 2.4.1 genomic DNA using oligonucleotides containing BsrDI and BglII restriction sites (for RpoH II , RSP_0601_BsrDI_F GTAGCAATGCATGGCACTGGACGGATATACCGATC, RSP _0601_BglII_R GTAAGATCTTCATAGGAGGAAGTGATG-CACCTCC, and for RpoH I , RSP_2410_BsrDI_F GTAG-CAATGCATGAGCACTTACACCAGCCTTC, and RSP_2410 _BglII_R GTAAGATCTTCAGGCGGGGATCGTCATGCC). These resulting fragments were digested with BsrDI and BglII and ligated into pIND4 that was digested with BseRI and BglII to create pYSD40 (rpoH I ) and pYSD41 (rpoH II ), respectively. The pYSD42 plasmid expressing the FLAG-tagged version of RpoH I was constructed following the same procedure but with an oligonucleotide primer containing a sequence encoding for three consecutive copies of the FLAG epitope (DYKDDDDK) at the Nterminus (RSP_2410_3FLAG_BsrDI GTAGCAATGCATGGA-CTACAAGGACCACGACGGCGACTACAAGGACCACGAC-ATCGACTACAAGGACGACGACGACAAGAGCACTTACAC-CAGCCTTCCCGCTC). pYSD40, pYSD41, and pYSD42 were conjugated into R. sphaeroides DrpoH I [16] and R. sphaeroides DrpoH II respectively.

Western blot analysis for the expression of RpoH I and RpoH II
To monitor levels of RpoH I and RpoH II after heat shock, exponential phase aerobic cultures (69% nitrogen, 30% oxygen and 1% carbon dioxide) of wild type R. sphaeroides strain 2.4.1 grown at 30uC, were transferred to a 42uC warm bath with samples collected before heat treatment and at 10 min time intervals after heat shock, up to 60 min. To assess induction resulting from singlet oxygen stress, similarly grown wild type cells were treated with 1 mM methylene blue and exposed to 10 W/m 2 incandescent light with samples collected before treatment and at 10 min time intervals after treatment, up to 60 min. Exponentially growing aerobic cultures of R. sphaeroides DrpoH I and DrpoH II mutants carrying the pYSD40 or pYSD42 plasmids respectively, were treated with 100 mM IPTG for one generation and harvested. All cell samples were resuspended in 3 M urea containing 16 protease inhibitor cocktail (Thermo Scientific, Rockford, IL) and sonicated. Samples were centrifuged to remove debris and total protein concentration of the samples determined with a protein assay kit following the manufacturer protocol (Bio-Rad, Hercules, CA). An equal amount of total protein for each sample was loaded onto a NuPAGE acrylamide gel (Invitrogen, Carlsbad, CA) and run in 16 4-morpholineethanesulfonic acid running buffer at 150 V for ,90 min. Proteins were transferred to Invitrolon PVDF membranes (Invitrogen, Carlsbad, CA), which were subsequently incubated for 1 hr in 16 Tris-buffered saline, 0.1% Triton-X, and 5% milk protein. The membranes were incubated with rabbit polyclonal antibodies raised against either RpoH I , RpoH II or PrrA. Horseradish-peroxidase-conjugated goat anti-Rabbit IgG antibody (Thermo Scientific, Rockford, IL) was used as secondary antibody for detection with Super Signal West Dura extended duration substrate (Thermo Scientific, Rockford, IL).

Gene expression microarrays
Triplicate 500 ml cultures were grown aerobically with bubbling (30%O 2 , 69% N 2 , 1% CO 2 ) until they reached early exponential phase (OD at 600 nm of 0.15). At this point IPTG (Isopropyl b-D-1-thiogalactopyranoside) was added to a final concentration of 100 mM to induce gene expression from the pIND4 derivatives. After 3 hours incubation (OD at 600 nm of 0.30), 44 ml of cell culture were collected and 6 ml of 5% v/v phenol in ethanol was immediately added. Cells were collected by centrifugation at 6,000 g and frozen at 280uC until sample preparation. RNA extraction, cDNA synthesis, labeling, and hybridization were performed as previously described on Genechip Custom Express microarrays (Affymetrix, Santa Clara, CA) [40]. Processing, normalization, and statistical analysis of the expression profile data were performed in the R statistical software environment (http://www.r-project.org/) [41]. Data were normalized using the affyPLM package with default settings [42][43][44]. The expression microarray data have been deposited in the NCBI's Gene Expression Omnibus [45] and are accessible through GEO Series accession number GSE39806 (http://www. ncbi.nlm.nih.gov/projects/geo/query/acc.cgi?acc = GSE39806).

Chromatin immunoprecipitation on a chip
Cells were harvested at mid-exponential growth (OD at 600 nm of 0.30) from the same cell cultures used for the expression microarray experiment to prepare samples for a ChIP-chip assay [19]. RpoH I -FLAG was immunoprecipitated using commercial monoclonal antibodies against the FLAG polypeptide (DYKDDDDK ) (Sigma Aldrich, St Louis MO). RpoH II was immunoprecipitated with anti-R. sphaeroides RpoH II rabbit serum. Labeled DNA was hybridized on a custom-made tiling microarray, synthesized by NimbleGen (Roche NimbleGen Inc, Madison, WI), covering the genome of R. sphaeroides 2.4.1 [19]. Before data analysis, dye intensity bias and array-to-array absolute intensity variations were corrected using quantile normalization across replicates (limma package in the R environment) [46]. Regions of the genome enriched for occupancy by RpoH I or RpoH II were identified using CMARRT with a false-discovery rate #0.05 [47]. The ChIP-chip data have been deposited in the NCBI's Gene Expression Omnibus [45] and are accessible through GEO Series accession number GSE39806 (http://www.ncbi.nlm.nih.gov/ projects/geo/query/acc.cgi?acc = GSE39806).
Construction of plasmid vectors, lacZ reporter promoter fusions, and b-galactosidase assays to assay promoter activity in vivo To assay the in vivo activity of RpoH I and RpoH II at target promoters, b-galactosidase assays were conducted with R. sphaeroides DrpoH I DrpoH II mutant strains containing individual reporter gene fusions. To construct this set of reporter strains ,350 base pair regions upstream of putative target genes: RSP_1173, RSP_1408, RSP_1531, RSP_2314, RSP_2389, RSP_3274, RSP_1207 and RSP_2617, were amplified from genomic DNA using sequence specific primers, with NcoI and XbaI restriction sites at the ends of the upstream and downstream primers respectively. The amplified DNA fragments were purified, digested with NcoI-XbaI and then cloned in a pSUP202 suicide vector containing a promoterless lacZ gene (pYSD51). These Tc r plasmids were then conjugated into an R. sphaeroides DrpoH I DrpoH II mutant [15], generating single copy promoter-lacZ fusions integrated in the genome. pYSD40, pYSD41 or pIND4 (empty vector) were then conjugated into each of these reporter strains. Exponential phase cultures of these strains, grown by shaking 10 mL in 125 mL conical flasks, were then treated with 100 mM IPTG for one generation and samples analyzed for b-galactosidase activity. b-galactosidase assays were performed as previously described [52]. The data, presented in Miller units, represents the average of three independent replicates.
To test bases that contribute to RpoH I and RpoH II promoter specificity, b-galactosidase assays were conducted in R. sphaeroides tester strains containing reporter gene fusions of the cycA (RSP_0296) P1 promoter with a variety of point mutations (see Results). These reporter strains were constructed as described above, with individual point mutations being generated by overlap extension PCR [53]. b-galactosidase assays were conducted as described above and the data represents the average of three independent replicates. Background LacZ activity from control strains for each promoter fusion containing only the empty pIND4 plasmid (i.e. not expressing either RpoH I or RpoH II ) was subtracted from the measured LacZ activity for each mutant promoter.
The construction of the E. coli CAG57102 mutant strain, the promoter library, and the b-galactosidase assay used to test the activity of R. sphaeroides RpoH I in vivo on mutant promoters were described previously [7]. To express R. sphaeroides RpoH I the E. coli rpoH gene of pSAKT32 [7] was replaced with the R. sphaeroides rpoH I gene. At least triplicate assays for b-galactosidase activity were performed on all strains. Figure S1 Scatter plot of RpoH I versus FLAG-RpoH I dependent change in gene transcription levels.