Global Analysis of Extracytoplasmic Stress Signaling in Escherichia coli

The Bae, Cpx, Psp, Rcs, and σE pathways constitute the Escherichia coli signaling systems that detect and respond to alterations of the bacterial envelope. Contributions of these systems to stress response have previously been examined individually; however, the possible interconnections between these pathways are unknown. Here we investigate the dynamics between the five stress response pathways by determining the specificities of each system with respect to signal-inducing conditions, and monitoring global transcriptional changes in response to transient overexpression of each of the effectors. Our studies show that different extracytoplasmic stress conditions elicit a combined response of these pathways. Involvement of the five pathways in the various tested stress conditions is explained by our unexpected finding that transcriptional responses induced by the individual systems show little overlap. The extracytoplasmic stress signaling pathways in E. coli thus regulate mainly complementary functions whose discrete contributions are integrated to mount the full adaptive response.


Introduction
Bacteria possess various stress signaling systems that sense and respond to specific stimuli and allow the cell to cope with changing environmental conditions. One or several stress stimuli may activate multiple stress response pathways to constitute an integrated and complex response. Adaptation to envelope stress illustrates the complexity of these regulatory networks.
The bacterial envelope is involved in necessary processes including nutrient transport, respiration, secretion, adhesion, virulence and maintenance of bacterial integrity. In Gram-negative bacteria such as Escherichia coli, the envelope comprises an inner membrane, a periplasmic space that contains the cell wall, an outer membrane and bacterial appendages such as pili and flagella. Being in contact with the external medium, the envelope is the initial target of physical (e.g., hyperthermia, osmolarity), chemical (e.g., ethanol, pH, detergent) or biological (e.g., adhesion, infection) stresses that may alter envelope components, thus inducing an extracytoplasmic stress response. The E. coli σE, Psp, Cpx and Bae signaling pathways are the main elements of this response described to date (reviewed in [1]). The σE and Psp (phage shock protein) pathways are both regulated via sequestration and release of a transcriptional factor in response to specific signals: Accumulation of specific misfolded outer membrane proteins (OMP) within the periplasm induces sequential regulated intramembrane proteolysis (RIP) events leading to degradation of the inner membrane protein RseA, the σE sequestrator [2][3][4], and resulting in σE release in the cytoplasm. Free σE associates with RNA polymerase to allow σE-regulated gene transcription. PspF is a σ54 enhancer binding protein: In the absence of signals, PspF-enhanced transcription is inhibited by PspA binding to PspF [5]. According to the current model, one or both inner membrane proteins PspB and PspC sense the inducing signal (possibly a decrease of proton motive force) and then bind PspA, disrupting its interaction with PspF (reviewed in [6]). PspA, PspB and PspC thus act as regulators and effectors of the Psp response [7,8], although another cascade might also exist [9].
The two other signal transduction pathways that respond to extracytoplasmic stress, Cpx for conjugative plasmid expression (for a review, see [10]) and Bae for bacterial adaptative response [11], are classical two component regulatory systems. Upon stimulation, the sensor (CpxA or BaeS) autophosphorylates a conserved histidine residue of its transmitter domain. The phosphoryl group is then transferred to a conserved aspartate of the receiver domain of the response regulator (CpxR or BaeR), resulting in its activation. In the absence of signals, sensor proteins are thought to function as phosphatases to deactivate their phosphorylated effector proteins. Additional proteins can participate in signal transduction prior to the sensor step: For example, the outer membrane lipoprotein NlpE stimulates CpxA following bacterial adhesion [12,13], whereas the periplasmic protein CpxP inhibits CpxA autokinase activity in the absence of signal [14]. In the presence of an extracytoplasmic stress such as accumulation of P pili subunits, CpxP is titrated away from the CpxA periplasmic domain and degraded, together with bound misfolded proteins, by the periplasmic protease DegP [15]. P pili accumulation also induces the Bae pathway [11].
The Rcs system is a complex phosphorelay signaling pathway that also participates in the extracytoplasmic stress response. Initially described as a regulator of colanic acid capsule synthesis [16], the Rcs regulon was later shown by mutational analyses to also affect envelope composition [17,18]. Recently, the Rcs phosphorelay was shown to be activated by stresses affecting the peptidoglycan layer, and to contribute to intrinsic antibiotic resistance [19]. The Rcs phosphorelay was also proposed to sense the extent of phosphorylation of the undecaprenyl carrier lipid, which is also involved in colanic acid synthesis [20,21]. The Rcs pathway presents several differences as compared to classical two-component systems: RcsC is a hybrid sensor kinase having both a classical histidine kinase transmitter domain and an additional receiver domain with a conserved aspartate. Phosphate transfer from RcsC to RcsB is mediated by RcsD (formerly, YojN), a histidine-containing phosphotransmitter (Hpt). Finally, RcsB, the transcriptional regulator, utilizes an auxiliary cytoplasmic protein, RcsA, to regulate expression of some genes (for reviews, see [21][22][23]). Targets of the Rcs regulon can thus be classified as RcsA-dependent or RcsA-independent. RcsA is degraded by the Lon protease. Its instability has significant regulatory consequences, since the amount of RcsA is generally low. Formation of the RcsA-RcsB heterodimer protects RcsA from proteolysis and leads to transcriptional activation of RcsA-dependent genes and consequent capsule production.
The five envelope stress response systems have mainly been investigated individually. To gain further insights into the specificity of each pathway and their possible interconnections, here we compare the conditions leading to induction of each of the five pathways, and investigate the global transcriptional responses in parallel. This constitutes the first fully integrated transcriptomic study of extracytoplasmic stress response in E. coli.

Results/Discussion
Exogenous and genetic signals induce several extracytoplasmic stress pathways
Although molecular mechanisms leading to signal transduction and extracytoplasmic stress responses are well documented for several systems, the environmental conditions that act as natural inducers remain obscure. For each stress response system, several genetic defects or different treatments were shown to induce adaptive responses (i.e., Bae [11,46]; Cpx [47]; Psp [6,48,49,50]; Rcs [21,22]; σE [4,47,50]). Some conditions are known to concomitantly activate several pathways. For instance, indole induces the Bae and Cpx pathways [11]; ethanol, verapamil (a calcium channel inhibitor), or dibucaine (an amide local anesthetic that alters membrane fluidity) induce the σE and Psp pathways [50]; and antibiotics targeting penicillin binding proteins induce Rcs, Bae, Cpx and σE [19]. However, the effects of activating signals have never been simultaneously compared for all five systems.
To determine whether a given stress condition could be specific to a single pathway, we investigated the impact of several stress conditions on activation of the five known extracytoplasmic stress response pathways. To do this, we first used strains with transcriptional gene fusions that place the lacZ reporter gene (encoding β-galactosidase) under the control of a promoter representative of each system: Bae, Cpx, Psp, Rcs and σE pathways were monitored using spy::lacZ, cpxP::lacZ, pspA::lacZ, rprA::lacZ and P3rpoH::lacZ, respectively (Table S1).
Strains were grown under various stress conditions using both external and genetic stimuli, and β-galactosidase activity was determined (see Materials and Methods). In the absence of their corresponding transcriptional regulators, cpxP::lacZ and pspA::lacZ fusions had no detectable activity and rprA::lacZ had strongly reduced activity when compared to the wild type background (data not shown). In the case of the σE reporter, P3rpoH::lacZ, σE is essential [51], and a basal level of β-galactosidase activity was observed in the unstressed condition.
Since the spy::lacZ fusion is dependent upon both BaeR and CpxR, we also analyzed the effects of these signals in a baeR background [52]. To our knowledge, no genes subjected only to baeR regulation have been described. We chose an rprA::lacZ reporter to monitor the Rcs pathway. rprA encodes a small regulatory RNA that stimulates rpoS (encoding σS) translation [53]: This gene was used rather than a cps::lacZ fusion, since the latter fusion is also RcsA-dependent, and thus reports as much on the level of RcsA (which is potentially limiting) as on activation of RcsB [53].

Author Summary
Bacteria possess various signaling systems that sense and respond to environmental conditions. The bacterial envelope is at the front line for most external stress conditions; its components sense perturbations and transmit signals to induce transcriptional reprogramming, leading to an adaptive response. In Escherichia coli, at least five response pathways, called Bae, Cpx, Psp, Rcs, and σE, are induced in response to envelope stress. To date, these pathways have been studied mainly individually, and the interconnections and/or overlaps between them have not been extensively characterized. The present study establishes two important characteristics of stress response in E. coli: first, that a given stress solicits the combined responses of several pathways; second, that each individual pathway controls a discrete set of genes involved in the response, and shows little overlap with other pathways. Based on previous knowledge and the present data, we propose that an environmental stress probably impacts on the cell envelope by inducing numerous alterations, each of which may be perceived by different pathways of the stress response and contributes to adapting the cell to different aspects of the stress damage. The extracytoplasmic stress signaling pathways in E. coli thus regulate mainly complementary functions whose discrete contributions are integrated to mount the full adaptive response.
None of the external tested stimuli were found to be strictly specific for a single signaling system (Table 1). Indeed, 5% ethanol and 4 mM indole induced all the transcriptional fusions tested. Dibucaine activated the Cpx, Rcs, Bae, and Psp systems whereas 0.6 M NaCl activated only the Rcs and Psp pathways. However, we observed differences in the response levels of reporter fusions to different signals: Cpx, Bae and σE pathways were induced preferentially by indole and ethanol, Rcs by NaCl in addition to ethanol and indole, and Psp by ethanol and dibucaine, in keeping with previous results [50]. Interestingly, the Rcs pathway was induced in all membrane-altering stress conditions tested, in accordance with a bona fide role of this pathway in extracytoplasmic stress response.
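The specificity argument above can be restated as a small set computation. The following is only an illustrative sketch: the dictionary transcribes the qualitative induction pattern stated in the text for the four external stimuli (not the graded values of Table 1), and the names `induced_by`, `specific_stimuli` and `pathways_induced_by_all` are hypothetical helpers introduced here.

```python
# Qualitative induction pattern for the four external stimuli, as stated in
# the text. Response levels ("+", "++", "+++") are deliberately not encoded.
PATHWAYS = {"Bae", "Cpx", "Psp", "Rcs", "sigmaE"}

induced_by = {
    "5% ethanol": set(PATHWAYS),
    "4 mM indole": set(PATHWAYS),
    "dibucaine": {"Cpx", "Rcs", "Bae", "Psp"},
    "0.6 M NaCl": {"Rcs", "Psp"},
}


def specific_stimuli(table):
    """Return the stimuli that induce exactly one pathway."""
    return sorted(s for s, pathways in table.items() if len(pathways) == 1)


def pathways_induced_by_all(table):
    """Return the pathways induced by every stimulus in the table."""
    return set.intersection(*table.values())


print(specific_stimuli(induced_by))                  # []
print(sorted(pathways_induced_by_all(induced_by)))   # ['Psp', 'Rcs']
```

With this encoding, no stimulus maps to a single pathway, and Rcs is among the pathways induced under every condition listed.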
During a genetic screen using an E. coli genomic DNA library [54], we observed that overexpression of yedR (encoding a putative integral inner membrane protein conserved in enteric bacteria) led to a strong mucoid phenotype. This could indicate that YedR is a component of the Rcs pathway, or alternatively, that it acts as an internal inducer of the Rcs response. Further analysis showed that induction of capsule production by yedR depended on RcsB and RcsC proteins (data not shown). However, the rprA::lacZ fusion was responsive to different stresses (0.5 mM dibucaine, 3% ethanol or 0.6 M NaCl) in a ΔyedR background (data not shown), leading us to conclude that YedR is not part of the Rcs signal transduction pathway. Furthermore, multicopy yedR also strongly stimulated the Psp pathway, suggesting that YedR accumulation generates an envelope stress that is sensed preferentially by these two signaling systems (Table 1).
All the above results show that stress activates multiple pathways. Nevertheless, some signals are considered specific activators of individual pathways: for example, exposure of the C-terminal part of certain OMPs to the PDZ domain of DegS activates σE [3,55,56], a drop of the proton motive force activates Psp [6], and accumulation of P pili subunits activates Cpx [13]; in the case of Rcs and Bae, specific signal sensing mechanisms remain to be identified. For Rcs, although it has been proposed that the signal could be a perturbation of the peptidoglycan [19], alterations in other envelope compartments can also be efficient inducers, as previously reported [22] and illustrated by the impact of yedR overexpression (Table 1).
These observations may be reconciled, as the above tested conditions likely alter several aspects of bacterial envelope integrity, generating multiple signals that are in turn specifically sensed by different pathways. In addition, connections between stress response regulons could also account for indirect activation of some pathways.
In conclusion, one exogenous signal induces multiple defects by affecting different envelope components, which may be sensed by specific signal transducing mechanisms that activate all five pathways. It is expected that induction of all five extracytoplasmic stress responses is required to fully protect the cell against the variety of damages caused by a single stress.

Extracytoplasmic stress signaling outputs: Parallel transcriptome analyses
To gain further insight into specifically regulated functions and interconnections between all five extracytoplasmic stress signaling systems, we carried out a global analysis of the general transcriptional responses following activation of each pathway. As discussed above, in view of the possible secondary signals generated by the use of inducers, we continued the study using an approach based on overexpression of each of the five pathway regulators. This methodology was previously used to characterize in detail the σE regulon [57]. We point out a possible limit to this approach, in cases where the regulator requires phosphorylation for its activity (i.e., for BaeR, RcsB, and CpxR); nevertheless, experimental evidence indicates that overproduction of such regulators can be effectively used in such studies, as a proportion of the molecules is phosphorylated in the absence of signal (e.g., [46,58]).
baeR, cpxR, rcsB, pspF and rpoE were cloned under the control of the PLtetO-1 promoter in cloning vector pZE21, and the corresponding plasmids introduced into the wild-type strain MG1655Z1 (Table S1); a strain containing the plasmid with no insert was used as a control. Expression from the cloned genes was induced by addition of anhydrotetracycline (aTc). Since overexpression of CpxR [59], PspF (data not shown) and σE [35] was toxic, we determined the minimal aTc concentration that resulted in minimal cell toxicity. Accordingly, induction was carried out with 10 ng/mL aTc in exponential phase in LB medium at 37°C for 45 min, to limit indirect effects of regulator overexpression. Western blot experiments indicated that these conditions led to accumulation (approximately three- to ten-fold) of CpxR, RcsB, and σE as compared to a strain having the control plasmid (data not shown; not determined for BaeR and PspF). Bacteria were harvested and total RNA was extracted for microarray experiments (Materials and Methods). In addition, 30 genes were selected to follow expression by qRT-PCR in six strains (overexpressing BaeR, CpxR, RcsB, PspF and σE, or having the control plasmid).
Table 1 (legend). The induction level of the stress response pathways was determined using transcriptional gene fusion reporters expressing β-galactosidase. Cpx, Rcs, Bae, Psp, and σE pathways were tested using derivatives of strains TR50, GEB658, TR530, MC3, and CAG16037, respectively (Table S1). Results are presented as the ratio of β-galactosidase activity under stress conditions to standard conditions (growth in LB) from at least three independent experiments with the following codification: "+++", ratio >5; "++", ratio
The qRT-PCR tested genes were chosen because: i) they were not previously known to be modulated during one of the studied conditions; ii) they were expected from published studies to be modulated by overexpression of one of the studied regulators, but the predictions were not confirmed by our micro-array experiments or, iii) the statistical significance of our micro-array data for these genes was inconclusive. As detailed below, the increased transcription of cpxR resulted in induction of only a subset of genes previously shown to belong to the CpxR regulon. We therefore complemented these experiments with a transcriptome analysis of a cpxR mutant relative to the wild type MG1655 parental strain in late exponential phase. Since the Cpx pathway is activated in this condition [52,59,60], this comparison was expected to give access to at least some genes of the cpxR regulon.
The σE regulon. 69 transcription units containing 114 genes were differentially expressed in response to σE overexpression. A vast majority (>90%) were induced, which is consistent with σE being a sigma factor (Table 2). Two previous studies reported an analysis of the σE regulon following transient overexpression [37,57]. Our results are in good agreement with these studies; 35% of the operons that we found are in common with the two studies, and 64% are in common with one of the two studies. As previously described, and in keeping with the known function of the σE regulon, a large majority of the regulated genes are related to cell surface maintenance. Among the repressed genes, we found many that encode outer membrane proteins (lamB, ompC, ompF, ompN, ompS1, ompW, and phoE; Table 2) whose expression contributes to extracytoplasmic stress. Some of these genes are under the control of several sRNAs (many of which are also regulated by σE) [61][62][63][64][65][66]. Among the σE up-regulated genes were those encoding sigma factors (rpoD, rpoH, rpoN), suggesting that regulatory cascades are triggered, as well as ptsN, a suppressor of rpoE essentiality when overexpressed [67]. As was previously reported [37,57], the σE consensus binding site was not found upstream of several of the induced genes (e.g., the imp-surA-pdxA operon, and djlA, a gene divergent from imp, encoding a membrane associated co-chaperone of the DnaJ family [68,69]). In the case of these genes, regulation by σE might be indirect, for instance by sRNAs, or by sigma factors that are increased by σE induction.
The RcsB regulon. Among genes whose expression is altered by RcsB overexpression, the vast majority (>90%) encode proteins related to the envelope or its metabolism, or localized in the envelope (Table 3). All affected genes were induced, showing that RcsB is mainly a positive transcriptional regulator. Previous transcriptomic studies have explored Rcs regulated genes under different conditions: i) following DjlA overexpression in the absence of rcsC [30], ii) in exponential phase in the absence of rcsB or rcsD [70], and iii) in the absence of rcsC or rcsF at low temperature with or without zinc excess [71]. More than 70% of the genes that we identified as being differentially expressed after RcsB overexpression were also found in at least one of these studies; these include bacterial capsule synthesis genes (e.g., wcawza-cps) or genes induced by an osmotic stress (osmB and osmY). The relatively low fold induction of the colanic acid synthesis operon in our conditions could be due to a limiting amount of RcsA. Additional genes of interest, previously not described, are involved in O antigen biosynthesis. qRT-PCR experiments also indicate that RcsB induces the expression of spy and yaiY genes encoding a periplasmic protein under the control of both BaeR and CpxR (see below) and a putative inner membrane protein, respectively (Table 3).
The BaeR regulon. The Bae pathway is auto-regulated and specifically deals with toxic compounds by induction of the mdt-bae operon, which encodes a multidrug transporter of the RND family, and the tolC gene, which ensures efflux through the outer membrane ([72] and Table 4). Transcriptome analysis in conditions of BaeR overexpression provides the smallest gene list in this study. Indeed, up-regulation of only 8 genes, corresponding to 5 transcription units, was statistically significant, even considering a minimal ratio of induction of 1.4, confirming that, like RcsB, BaeR acts mainly by stimulating transcription (Table 4). Only four BaeR binding sites, upstream of spy, the mdt-bae operon, acrD, and ycaC, have been reported in E. coli. All genes were previously reported as having their expression activated by the Bae pathway [46,70]; however, spy, mdtA and mdtB, described as highly induced [46], were moderately induced in the microarray study (although spy was found to be upregulated more than 70-fold by qRT-PCR, variation in the expression of the other genes was not determined). This variation could reflect differences in parameters such as overexpression conditions, medium, growth phase, or method used (microarray technology tends to underestimate the variation of expression). It could also be due to (i) a role of CpxR in modulating BaeR activity [58,73] (see also below), (ii) low affinity of BaeR for its target sequences, and/or (iii) differences in results depending on transient (our study) or constitutive overexpression [46]. We also observed induction of the ynjABCD operon that encodes a putative membrane transporter (Table 4). Induction of ynjA was confirmed by Northern blot (data not shown). Neither the ynjABCD operon nor the tolC gene displayed an upstream BaeR binding consensus motif, suggesting that BaeR-mediated effects on these operons may be indirect.
The Psp Regulon. PspF is a member of the enhancer-binding protein family of transcriptional regulators, which stimulates transcription by the alternative sigma factor σ54. PspA, encoded by the pspABCDE operon, has a dual function. In normal conditions, it binds PspF, preventing transcription of the operon. But it is also thought to be the major effector of the Psp pathway, by promoting proton motive force maintenance in the inner membrane (for a review, see [6]). Only two previous studies addressed the nature and extent of the PspF regulon in E. coli. In one, overexpression of the pIV protein (a secretin from the filamentous phage f1) was shown to induce the pspABCDE operon, as well as an additional gene, pspG [48]. Subsequent transcriptome analyses were performed on E. coli strains disrupted for pspA, pspD and pspG. The psp genes, including pspG, were all found to be highly expressed in the pspA mutant [27]. In the present study, overproduction of PspF led to induction of the pspABCDE operon and pspG (Table 5) as previously reported [48]. These genes are directly involved in combating stress and/or regulating expression of the Psp pathway [7]. In addition, we identified 11 genes corresponding to 10 transcription units that were derepressed following PspF overproduction, and not previously reported (Table 5, [48]). Among them, tolB encodes a member of the Tol-Pal trans-envelope complex, which is required to maintain cell envelope integrity [74]. Interestingly, the interaction between TolA and the Pal lipoprotein is driven by the proton motive force [74]. Another significantly derepressed gene was hyfR, which controls expression of genes responsible for the proton-translocating formate hydrogenase system and formate transport. Derepression of the above functions is in keeping with the implication of PspF in maintaining bacterial proton motive force.
norW, which encodes a flavorubredoxin reductase important for nitric oxide reduction and detoxification that is regulated by σ54 (RpoN) [75], was also strongly up-regulated (Table 5). Since PspF is a σ54 enhancer-binding protein [5], its overproduction may lead to a strong transcription of norW. Several genes/operons were also found to be repressed in PspF overproduction conditions. In particular, glpABC, encoding glycerol-3-phosphate dehydrogenase, is down regulated. Consistent with this observation, these genes were up regulated in a strain depleted for pspA and/or pspD [27], whereas in our conditions, pspA was overexpressed 12-fold. Note that this level of induction is within the range observed by Lloyd et al. [48] after overproduction of the filamentous phage f1 protein IV. Previous transcriptome studies indicated that Psp responds to decreased proton motive force due to membrane perturbations [27]. Lower glpABC expression in PspF overproduction conditions could reduce utilization of glycerol-3-phosphate as an electron donor for the respiratory chain, and help maintain the pool of glycerol-3-phosphate required for phospholipid synthesis. Glycerol-3-phosphate is a substrate of glycerol-3-phosphate acyltransferase encoded by pls, itself a gene under the positive control of σE (Table 2).
Also repressed were several genes encoding functions related to the anaerobic respiration of nitrate (i.e., several nar and nap genes) as well as genes required for cytochrome c biogenesis (ccmA). These results do not support the conclusion of Jovanovic et al. [27] that the psp regulon responds to the dissipation of proton motive force by favoring anaerobic respiration of nitrate. In addition, contrary to their suggestion that a function of the Psp pathway was to downregulate motility and chemotaxis, motility genes were conspicuously unaffected in our study (Table 5). These differences could be due to the fact that PspA induction was about 8-fold lower in our study than in the one previously reported [27].
Table 2 (legend). Genes are grouped by putative or known operons (ordered in the direction of transcription). Genes whose expression was found to be significantly modulated are in bold (see Materials and Methods). Genes whose expression was determined by Q-PCR are underlined, and the fold change thus determined is indicated in parentheses after the microarray value. Genes previously reported to be σE-regulated are indicated by uppercase letters that refer to the concerned study with the following codification: D, reported in [99]; K, reported in [37]; Re, reported in [100]; Rh, reported in [57]; and J, reported in [64]. (b) b numbers correspond to genes of the first column.
Table 3 (legend). Genes are grouped by putative or known operons (ordered in the direction of transcription). Genes whose expression was found to be significantly modulated are in bold (see Materials and Methods). Genes whose expression was determined by Q-PCR are underlined, and the fold change thus determined is indicated in parentheses after the microarray value. Genes previously reported to be RcsB-regulated are indicated by uppercase letters that refer to the concerned study with the following codification: F, reported in [30]; H, reported in [71]. (b) b numbers correspond to genes of the first column.
Table 4 (legend). Genes are grouped by putative or known operons (ordered in the direction of transcription). Genes whose expression was found to be significantly modulated are in bold (see Materials and Methods). Genes whose expression was determined by Q-PCR are underlined, and the fold change thus determined is indicated in parentheses after the microarray value. Genes previously reported to be BaeR-regulated are indicated by uppercase letters that refer to the concerned study with the following codification: N, reported in [46]; O, reported in [70]. (b) b numbers correspond to genes of the first column.
The CpxR Regulon. Previous characterization of the CpxR regulon consisted mainly of using a CpxR sequence recognition weight matrix and identifying reliable target promoters within the E. coli genome [52]. This search identified genes that were directly regulated by CpxR, either positively or negatively. In addition, transcriptome analysis following a copper stress revealed twelve genes that were depressed by copper in a Cpx dependent manner [76]. Regulation of many of those genes was confirmed by a recent study using lux fusions in a MC4100 cpxA* background, where the Cpx pathway is constitutively activated [77]. This study also showed that the presence of a CpxR box upstream of the gene (or operon) was not a good predictor of the extent of regulation by CpxR.
We aimed to generate in vivo data supporting these predictions and observations, and investigated transcriptome modifications following CpxR overexpression in exponential phase. Results were surprising, as well-known targets of CpxR, such as degP/htrA, or dsbA [78] were not significantly regulated upon cpxR induction. cpxP and spy were derepressed at a level insufficient to be included in our gene selection (1.4 and 1.8 fold, respectively). Analysis of gene expression by qRT-PCR, which is more sensitive than microarrays, showed a two-fold induction of dsbA, a three-fold induction of cpxP, and strong induction of spy (40 times), but no induction of degP or smpA (Table 6). In the latter two cases, this could be explained by the dependence of gene transcription upon σE, whose amount, in the absence of an inducing signal, might be limiting. To complement our data, we performed a comparative transcriptome analysis of MG1655 and its ΔcpxR derivative in late exponential phase, a condition in which the Cpx pathway is proposed to be activated [52,59,60]. Deletion of cpxR did not have a strong effect on most of the known cpxR-regulated genes (the maximum effect was 4 fold, see Table 7), as recently reported [77]. However, we found a four-fold reduction of expression of yccA, a modulator of the membrane protease HflB, a 2.5-fold reduction of smpA expression, and a 2.2-fold reduction of ftnB, encoding a ferritin-like protein.
Table 5 (legend). Genes are grouped by putative or known operons (ordered in the direction of transcription). Genes whose expression was found to be significantly modulated are in bold (see Materials and Methods). Genes whose expression was determined by Q-PCR are underlined, and the fold change thus determined is indicated in parentheses after the microarray value. Genes previously reported to be PspF-regulated are indicated by uppercase letters that refer to the concerned study with the following codification: J, reported in [27]; L, reported in [48]. (b) b numbers correspond to genes of the first column.
Table 6. Genes whose expression is significantly modulated by CpxR. Genes are grouped by putative or known operons (ordered in the direction of transcription). Genes whose expression was found to be significantly modulated are in bold (see Materials and Methods). Genes whose expression was determined by Q-PCR are underlined, and the fold change thus determined is indicated in parentheses after the microarray value. Genes reported to be CpxR-regulated are indicated by uppercase letters that refer to the concerned study with the following codification: O, reported in [70]; Y, reported in [76]; W, reported in [52]. (b) b numbers correspond to genes of the first column.
Altogether our results confirmed just 8 genes/transcription units as regulated by CpxR, out of 33 gene clusters that were assigned by previous in vivo, in silico and/or in vitro studies as members of the CpxR regulon (Tables 6 & 7, [52,70,77,79]). ppiD, which encodes a periplasmic peptidyl-prolyl isomerase, was previously proposed to be part of the CpxR regulon. In agreement with Price and Raivio [78], this gene was not affected by cpxR overproduction or by deletion, as also confirmed by qRT-PCR (data not shown). In addition, we identified 71 new genes belonging to 49 transcription units that lacked a CpxR box upstream of their promoters [52,79], which yet were modulated by overproduction or absence of CpxR (Table 6 & 7). Regulation of many of these additional genes can be an indirect consequence of CpxR overproduction or inactivation, and would be expected, as several transcriptional regulators are putative members of the CpxR regulon (Table 6 & 7).
The weak effect of overproducing CpxR on the MG1655 transcriptome could be due to a low basal level of CpxR phosphorylation. Alternatively, this could be explained by a dependence upon other regulators, which might be limiting in the absence of an inducing signal. Indeed, most genes thus far identified as being regulated by CpxR are also dependent on other factors, such as σE or BaeR (Table 4, [58], see also the compilation of CpxR-regulated genes in EcoCyc, http://ecocyc.org/). This phenomenon would be especially important if the other regulator is a σ factor. Out of 33 gene clusters listed as regulated by CpxR in EcoCyc, 7 depend upon σE, one (the mdt operon) depends upon σS and one upon σF. Many other genes transcribed by σ70 are subject to multiple regulations (e.g., BaeR, catabolic repression, and other specific regulators) in addition to being regulated by CpxR, pointing to the interdependency of such factors in mounting a full response. In favor of this explanation, spy expression, which is regulated independently by CpxR and BaeR [58], responded well to CpxR overproduction (Table 6). In the case of the acrD gene and the mdt operon, their regulation by CpxR was previously shown to be strictly dependent upon BaeR [58]; accordingly, we observed no effect of CpxR overexpression.
Table 7. Genes whose expression is significantly modulated by the absence of CpxR. Genes are grouped by putative or known operons (ordered in the direction of transcription). Genes whose expression was found to be significantly modulated are in bold (see Materials and Methods). Genes reported to be CpxR-regulated are indicated by uppercase letters that refer to the concerned study with the following codification: O, reported in [70]; Y, reported in [76]; W, reported in [52]. b: b numbers correspond to genes of the first column.

Comparison of extracytoplasmic stress response pathways
Principal components analysis (PCA) is a useful statistical technique that reduces noise in complex data sets by reducing their dimensionality, and helps discriminate the key factors of variation [80,81]. PCA has proved useful for finding significant patterns in microarray analyses (for a complete explanation, see [82,83]). Briefly, given m observations (the gene expression ratios) on n variables (our 6 different conditions), the goal of PCA is to find r significant variables, where r is less than n, that best explain the observed variance in the observations. PCA was used to analyze our microarray data on the mean log-ratio measures obtained, each condition corresponding to a variable (see Materials and Methods). The first dimension, accounting for 33% of the variance, could not discriminate between the conditions, whereas the second dimension separated the CpxR, PspF and ΔcpxR conditions from the σE, BaeR and RcsB conditions (data not shown). The third and fourth components (accounting together for 29% of the variance) discriminated the σE overproduction and ΔcpxR conditions from the BaeR/RcsB/CpxR/PspF overproduction conditions (Figure 1A). In the case of σE, both the size of the regulon and the nature of the regulation (i.e., by a σ factor or by transcriptional regulators) could account for this result, whereas in the case of ΔcpxR, the result might be explained by the difference in the experimental strategy (deletion vs. overproduction).
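The dimensionality reduction described above can be illustrated with a minimal sketch in plain Python. It is not the ADE4/FactoMineR procedure used in this study: the function names, the power-iteration approach and the toy data are assumptions made for illustration only. Rows stand for observations (genes) and columns for variables (conditions); the code returns the fraction of the total variance carried by each of the first components.

```python
import math

def covariance_matrix(rows):
    """Sample covariance between the columns (variables) of the observation rows."""
    m, n = len(rows), len(rows[0])
    means = [sum(row[j] for row in rows) / m for j in range(n)]
    centered = [[row[j] - means[j] for j in range(n)] for row in rows]
    return [[sum(c[i] * c[j] for c in centered) / (m - 1) for j in range(n)]
            for i in range(n)]

def dominant_eigenpair(matrix, iterations=1000):
    """Power iteration on a symmetric positive semi-definite matrix.

    Returns (eigenvalue, unit eigenvector). The asymmetric start vector
    (1, 2, 3, ...) is an arbitrary choice for this sketch.
    """
    n = len(matrix)
    vec = [float(i + 1) for i in range(n)]
    norm = math.sqrt(sum(v * v for v in vec))
    vec = [v / norm for v in vec]
    for _ in range(iterations):
        product = [sum(matrix[i][j] * vec[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(p * p for p in product))
        if norm < 1e-12:  # no variance left along any direction
            return 0.0, vec
        vec = [p / norm for p in product]
    product = [sum(matrix[i][j] * vec[j] for j in range(n)) for i in range(n)]
    eigenvalue = sum(vec[i] * product[i] for i in range(n))  # Rayleigh quotient
    return eigenvalue, vec

def explained_variance_ratios(rows, n_components):
    """Fraction of total variance explained by each of the first components.

    Assumes the data have non-zero total variance.
    """
    cov = covariance_matrix(rows)
    n = len(cov)
    total = sum(cov[i][i] for i in range(n))
    ratios = []
    for _ in range(n_components):
        eigenvalue, vec = dominant_eigenpair(cov)
        ratios.append(eigenvalue / total)
        # Deflation: remove the component just found before searching for the next.
        cov = [[cov[i][j] - eigenvalue * vec[i] * vec[j] for j in range(n)]
               for i in range(n)]
    return ratios
```

With the six conditions of this study as columns, `explained_variance_ratios(rows, 4)` would give the kind of per-component percentages quoted above; keeping r components then amounts to retaining the first r entries.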
A hierarchical clustering that grouped together genes with similar expression patterns was performed on the set of genes differentially expressed in each of the five overexpression conditions (Figure 1B). Results were in agreement with those observed with PCA. The PspF response clustered with the CpxR response, while the σE and RcsB responses were further away. However, the main conclusion of this analysis is the striking specialization of each pathway, with very limited overlap between responses (Figure 1B). This is also revealed by a Venn diagram representation showing genes regulated by the extracytoplasmic pathways (Figure 2). The results of this analysis are unexpected, since redundancy is often proposed as an important property of robust networks. In addition, in the case of the extracytoplasmic stress response, redundancy was expected because many genes that are regulated by σE or BaeR are also regulated by CpxR. Furthermore, several genes regulated by PspF were also affected by CpxR (Figure 2). One surprising finding in our study is that the overproduction of CpxR had a limited effect on many known CpxR-regulated genes, in sharp contrast with the situation in the case of RcsB. We propose that in many cases, CpxR acts in conjunction with other regulators, which are limiting in the absence of a stress signal. Hence, rather than controlling specific genes by itself, an important role of CpxR may be to amplify the response promoted by the other regulons. It should be noted that σS promotes transcription of the cpxRA operon [25] and that CpxR can cross-talk with the EnvZ-OmpR response [84,85]. Thus, CpxR may integrate diverse stimuli associated with growth and central metabolism [60]. In view of these analyses, and of some previous results (e.g., [58]), our results suggest that CpxR functions more as a modulator of the other extracytoplasmic stress responses, especially σE, BaeR and PspF, than as a standalone regulator.
σE appears to be the major regulator, with at least 69 transcription units affected. It is mostly in charge of envelope biogenesis maintenance, especially genes required for synthesis, assembly, and homeostasis of outer membrane proteins and lipopolysaccharides ([57], Table 2). The other responses are more limited and specialized in certain categories of envelope components. PspF is dedicated to maintenance of energy, and has an important role associated with the cytoplasmic membrane in prokaryotes. RcsB affects additional envelope structures such as capsular exopolysaccharide production and O-antigen ([30,71], Table 3), and BaeR controls the production of several drug export systems that might be important to extrude toxic compounds during an extracytoplasmic stress ([46], Table 4). Hence, each system has its raison d'être in terms of restoring various aspects of envelope physiology.
Given the high specialization of each pathway, genes that are regulated by several of these pathways are likely to play a crucial role in cell physiology in response to an extracytoplasmic stress. Although responses are mainly distinct, a handful of genes were found to be in common between some of the pathways. For example, the trans-envelope protein components of the Tol-Pal system, which, driven by proton motive force, bring the inner and outer membranes into close proximity, have a major role in maintaining envelope integrity [74,86]. Genes encoding the Tol-Pal system were positively regulated by the σE, Cpx and possibly Psp pathways. In contrast, the tnaLAB operon was repressed by these same pathways (Figure 2). This could be related to the fact that tnaA encodes tryptophanase, an enzyme that degrades tryptophan and generates indole, itself toxic to the cell and an extracytoplasmic stress inducer. lamB, encoding an outer membrane protein, was also found to be repressed in several conditions (Figure 2), which may reflect the extra demands imposed on the cell for folding factors controlled by the extracytoplasmic stress regulons [63].

Concluding remarks
Extensive studies of stress response in E. coli have established the existence of several extracytoplasmic pathways, and suggest that expression of numerous genes is affected by more than one of these pathways. These findings suggest redundancy and raise questions concerning the reason for such a multiplicity of pathways. For the first time, we explored all five extracytoplasmic stress response pathways under comparable conditions in E. coli. We found that they can be activated simultaneously in response to exogenous or endogenous stimulation. Thus, although activation of a single pathway has been demonstrated experimentally using specific substrates or conditions, our results suggest that natural environmental stimuli provoke bacterial modifications that lead to multiple pathway responses.
To determine the contributions of each stress response pathway, we avoided the use of non-specific inducers, and opted for overexpression of each regulator. Transcriptome analyses show that induction of specific target genes via multiple pathways is an uncommon occurrence. Some genes can also be subject to cooperative regulation between different pathways, or to a cascade of pathway responses. In addition, the pathways might cross-talk through transcriptional regulation, as is the case between σE and σS, and as is possibly suggested by our results.
Each of four stress response systems (σE, Rcs, Psp and Bae) appears to be specialized in assuring a specific aspect of envelope biogenesis and maintenance, whereas CpxR might have a role as a modulator of the response by integrating other endogenous signals. We conclude that all five pathways are needed to mount a full response to extracytoplasmic stress.

Materials and Methods
Bacterial strains, plasmids, oligonucleotides, and culture conditions
The E. coli strains and oligonucleotides used in this study are listed in Supplementary Tables S1 and S2. Several plasmids constructed for this study were derived from pZE21, which has a ColE1 replication origin, confers kanamycin resistance and has a PLtetO-1 promoter upstream of a multicloning site [87]. pZE21-baeR, expressing baeR under the control of PLtetO-1, was constructed as follows: the MG1655 baeR gene was PCR-amplified using oligonucleotides 450 and 451 (for oligonucleotide sequences, see Table S2). The resulting fragment was digested by KpnI and BamHI and cloned into KpnI-BamHI restricted pZE21-MCS. pZE21-cpxR, pZE21-pspF, pZE21-rcsB and pZE21-rpoE, containing cpxR, pspF, rcsB and rpoE, were constructed using the same approach as for pZE21-baeR but with the 452/453, 454/455, 456/457 and 458/459 oligonucleotide pairs, respectively. Plasmid pGem-T-easy is a high copy number plasmid conferring ampicillin resistance (Promega, Madison, WI, USA). pGem-T-easy-yedR, expressing yedR, was constructed as follows: the MG1655 yedR gene was PCR-amplified using oligonucleotides 143 and 144. The resulting fragment was digested by KpnI and BamHI and cloned into KpnI-BamHI restricted pGem-T-easy.
P1 vir-mediated transduction was carried out as described [88]. The chromosomal baeR gene was deleted by targeted gene substitution using a combination of two published protocols as described [89]. The baeR deletion was confirmed by PCR.
Cells were grown in LB broth or on solid LB containing 15 mg.mL⁻¹ agar [90]. When necessary, antibiotics were added at the following concentrations: ampicillin 100 µg.mL⁻¹, chloramphenicol 30 µg.mL⁻¹, kanamycin 20 µg.mL⁻¹, and tetracycline 12.5 µg.mL⁻¹. Growth of strains containing pZE21-MCS or related plasmids was analyzed as follows: overnight cultures were adjusted to an OD600 of 0.3, and 5 µl of serial dilutions in M9 medium [90] were spotted on solid LB medium containing kanamycin and 0, 2, 10 or 100 ng.mL⁻¹ of anhydrotetracycline (aTc), and incubated at 37°C for 24 h.

Recombinant molecular techniques
Plasmid preparations, DNA cloning and ligation, classical PCR amplification and DNA transformations were carried out according to standard protocols [90] and manufacturers' instructions. Northern blots (see Supplementary Table S2 for information on primers used for probe synthesis) were performed as previously described [90,91] with 10 to 20 µg of RNA, except that hybridization was performed at 42°C using a NorthernMax prehybridization/hybridization buffer (Ambion, Austin, TX, USA) according to the manufacturer's instructions. The ssrA gene was used as a reference to normalize RNA quantities in Northern blot experiments.

β-galactosidase assay
Overnight cultures were diluted in LB broth to an OD600 of 0.004. For parallel analyses of inducing and non-inducing growth conditions, cultures were incubated under agitation at 37°C using 96-well culture plates in a total volume of 1 ml. Various growth conditions were investigated: i) standard LB, ii) LB containing ethanol (3 or 5%), iii) 0.5 mM dibucaine, iv) NaCl (0.6 M), v) 5 mM EDTA or vi) indole (2 or 4 mM). The stock indole solution was prepared by dissolving indole in hot LB before use. After five hours, three samples of 200 µl of each culture were taken: one was used to measure OD600 in 96-well plates in the Biolumin (Molecular Dynamics) or Chameleon (Bioscan Inc., Washington DC, USA). The two others were used to evaluate β-galactosidase activity according to Miller's protocol adapted to 96-well plate assays [88]. For other β-galactosidase assays, cultures were prepared in individual tubes in 5 mL LB broth under agitation at 37°C. Strains containing pGem-T-easy and related plasmids were grown with 100 µg.mL⁻¹ ampicillin. β-galactosidase activities were measured in duplicate from 200 µl samples taken at an OD600 of 0.4 (exponential phase).
[...] (−) regulation found in the present study, is indicated for each regulon. The Cpx regulon comprises genes affected by CpxR overproduction or deletion (genes repressed or induced by the absence of cpxR are counted + and −, respectively). Only genes that are common to at least two regulons and confirmed by qRT-PCR and/or the literature are listed (red: induced, green: repressed). Genes described in the literature as dependent on CpxR and σE or BaeR are also included.
doi:10.1371/journal.pgen.1000651.g002

Gene expression profiling using microarrays
Transcriptome analysis of the effect of transient overexpression of extracytoplasmic stress response regulators was performed with three independent RNA preparations for each of the six biological conditions tested, namely pZE21 (control plasmid), pZE21-baeR, pZE21-cpxR, pZE21-pspF, pZE21-rcsB and pZE21-rpoE. Addition of aTc (for 45 min) to the medium resulted in a 16-, 166-, 225-, 13- and 19-fold increase of baeR, cpxR, pspF, rcsB and rpoE mRNA in strains containing pZE21-baeR, pZE21-cpxR, pZE21-pspF, pZE21-rcsB and pZE21-rpoE, respectively, as compared to the reference strain containing the control plasmid, pZE21. Additionally, a ΔcpxR strain was included in the study, using four independent RNA preparations. To assess data reproducibility and minimize dye bias effects, one of the samples (two in the case of ΔcpxR) was measured with Cy3 instead of Cy5. To ensure robustness and comprehensiveness in data analysis, a reference design was used, with an equimolar mixture of all the biological conditions serving as a baseline for the comparisons. Such a design does not require pre-definition of the subgroups for comparison, allows discovery of non-anticipated classes among the samples and is compatible with subsequent additional sampling. Strain MG1655 (Table S1) containing pZE21 or its derivative plasmids was grown overnight in LB broth supplemented with kanamycin and diluted in 7 mL LB broth to an OD600 of 0.004. After two hours, expression from the PLtetO-1 promoter was induced by addition of 10 ng.mL⁻¹ aTc to the medium. After 45 minutes, 7 mL of cold absolute ethanol was added to the bacterial cultures (at an OD600 of about 0.4). Cells were then harvested by centrifugation for 15 min at 3000 g and stored at -80°C to prevent RNA degradation. For parental and cpxR strain transcriptome analysis, LB overnight cultures were inoculated at an OD600 of 0.004.
When cultures reached an OD600 of 2 (late exponential phase), they were harvested in the presence of cold absolute ethanol and frozen at -80°C. The next steps were carried out as for the overexpression transcriptome experiments: cells were lysed and RNA was extracted three times with an equal volume of acidic hot phenol and once with chloroform. RNA was ethanol precipitated, air dried and dissolved in water. RNA integrity was evaluated using RNA 6000 nano chips and the Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA) according to the manufacturer's instructions. RNA quality control was performed using user-independent classifiers as described [92].
Ten µg of total RNA from each biological condition and the RNA reference mixture were supplemented with RNA corresponding to known sequences to serve as a hybridization control (Spikes, Universal ScoreCard, GE Healthcare), reverse transcribed and labeled using the Superscript Indirect cDNA labeling System (Invitrogen, Carlsbad, CA, USA) according to the manufacturer's instructions, except that purification steps were done using the QIAquick PCR mini column system (Qiagen, Hilden, Germany). Labeling efficiency and product integrity were checked according to [93]. For each condition, the hybridization experiment was performed against the reference sample; a mixture of 0.75 µg Cy3- and 0.75 µg Cy5-labeled targets was incubated at 95°C for 3 min in the presence of a 2× Hybridization Buffer (Agilent Technologies, Santa Clara, CA, USA). Denatured targets were placed on an E. coli v2 whole genome array [63] (ArrayExpress accession: A-MEXP-1516, http://www.ebi.ac.uk/microarray-as/ae/) and hybridized for 17 hours at 60°C, in a rotating oven (6 rpm), using an Agilent hybridization chamber system. The hybridized slides were washed once for 10 min in 2× SSC/0.1% SDS at 50°C and once in 0.5× SSC/0.1% SDS at room temperature, then twice for 5 min in 0.1× SSC at room temperature. Any traces of water were eliminated immediately by air drying with ozone-safe dry air ("canned air"). Slides were scanned using a GenePix 4000B scanner (Molecular Devices, Sunnyvale, CA, USA) at 10-µm resolution. All slides were scanned using 100% laser power; PMT voltages were automatically adjusted using the GenePix Pro 6.0 software acquisition system to obtain maximal signal intensities with <0.005% probe saturation.
The resulting 16-bit images were processed using the GenePix Pro 6.0 image analysis software (v6.0.1.26). Data were processed using the MAnGO software [94], an R script that allows integrated analysis of two-color microarrays. Raw data were normalized using the print-tip loess method [95]. The average log2 expression ratios were then calculated [log2(pZE21-geneX/REF) − log2(pZE21/REF) = log2(pZE21-geneX/pZE21), where geneX is the gene of interest in the condition of interest, REF the value obtained for X in the case of the pool of all conditions (reference sample), and pZE21 the value obtained in the condition corresponding to the vector alone (biological reference)] and used for all subsequent statistical analyses. MIAME-compliant data [96] were deposited in the ArrayExpress database (http://www.ebi.ac.uk/microarray-as/ae/) under the accession number E-MEXP-2139.
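The reference-design calculation above can be written out as a short sketch. The function name and the numerical values are hypothetical, and the inputs are assumed to be already normalized intensities; the point is only that the common reference cancels out when the two log2 ratios are subtracted.

```python
import math

def log2_ratio_vs_vector(condition_signal, vector_signal, reference_signal):
    """log2(condition/REF) - log2(vector/REF), which equals log2(condition/vector).

    condition_signal: intensity in the overexpression condition (pZE21-geneX)
    vector_signal:    intensity in the vector-only condition (pZE21)
    reference_signal: intensity in the pooled reference sample (REF)
    All three values are assumed to be positive, normalized intensities.
    """
    m_condition = math.log2(condition_signal / reference_signal)
    m_vector = math.log2(vector_signal / reference_signal)
    return m_condition - m_vector

# Hypothetical example: condition is 2x the reference, vector is 0.5x the reference,
# so the condition is 4-fold above the vector control (log2 ratio of 2).
example = log2_ratio_vs_vector(800.0, 200.0, 400.0)
```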

Data analysis of microarrays
Gene functions were assigned using data from EcoCyc (http://ecocyc.org/) and UniProt (http://www.uniprot.org/). Adjacent genes that were coordinately regulated, possibly involved in the same function, and separated by a short distance with no apparent terminator were considered as belonging to a putative operon.
Dimensionality reduction. To reduce the dimensionality of the expression data set (where the 6 experimental conditions are the variables, and the gene expression measurements are the observations), a principal component analysis (PCA) [80] in gene space, using normalized log-ratio measures, was performed with the R packages ADE4 and FactoMineR [97] for calculating three-dimensional projections of the biological samples. We kept the first four components, which altogether accounted for over 78% of the explained variance.
Differential analysis. Statistical comparisons were performed using multiple testing procedures to evaluate statistical significance for differentially expressed genes. A modified t-test was computed to measure the significance associated with each differential expression value. An error rate (p-value), measuring the risk of false predictions of differentially expressed genes, was associated with each test value. A gene expression value was considered significantly different under an extracytoplasmic stress response condition when the p-value was less than 0.01 (unless otherwise mentioned), the signal intensity (A = 0.5 log2(condition 1 × condition 2)) was >6.0, and the expression ratio was ≥1.9 (or ≥1.4 for genes that were part of a putative transcriptional unit containing at least one gene with a fold change ≥1.9), except in the case of the Bae pathway, where the fold change cutoff was fixed at 1.4. Genes from prophages were systematically removed from the analysis, as fluctuating intensities may be linked to the presence of numerous paralogs in the genome and were thus difficult to interpret.
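The selection rule described above combines three thresholds, and a small sketch may make the logic easier to follow. The function and parameter names are hypothetical, and one assumption is made explicit: the fold change is passed as a magnitude of at least 1, whatever the direction of regulation. The modified t-test itself is not reproduced here; the p-value is taken as an input.

```python
import math

def mean_log_intensity(signal_1, signal_2):
    """A = 0.5 * log2(signal_1 * signal_2), the average log2 intensity of two signals."""
    return 0.5 * math.log2(signal_1 * signal_2)

def is_differentially_expressed(p_value, a_value, fold_change,
                                operon_has_strong_gene=False,
                                bae_pathway=False,
                                p_cutoff=0.01):
    """Apply the p-value, intensity and fold-change cutoffs stated in the text.

    fold_change is assumed to be a magnitude >= 1 (assumption of this sketch).
    operon_has_strong_gene: the gene belongs to a putative transcriptional unit
    containing at least one gene with a fold change >= 1.9.
    bae_pathway: use the relaxed 1.4 cutoff applied to the Bae pathway.
    """
    if p_value >= p_cutoff or a_value <= 6.0:
        return False
    if bae_pathway:
        return fold_change >= 1.4
    if fold_change >= 1.9:
        return True
    return operon_has_strong_gene and fold_change >= 1.4
```

Under these rules, a gene with a fold change of 1.8 and no strongly regulated operon partner is not selected outside the Bae condition, whatever its p-value.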
Hierarchical clustering. Unsupervised average-linkage hierarchical clustering with uncentered Pearson correlation as a similarity metric was done using Cluster on the gene set defined above [98]. This method leads to an expression matrix such that genes and samples with similar expression patterns are adjacent to each other. Results were visualized with the help of heat-maps and dendrograms using the TreeView program [98].
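As an illustration of the similarity metric named above, the following sketch computes an uncentered Pearson correlation (the cosine of the angle between two expression profiles, without subtracting the means) and uses it in a naive average-linkage agglomeration. It is not the Cluster program: the function names and the toy profiles are hypothetical, averaging pairwise similarities between cluster members is the linkage assumed here, and the actual software may differ in its details.

```python
import math
from itertools import combinations

def uncentered_pearson(x, y):
    """Uncentered correlation: sum(x*y) / (sqrt(sum(x^2)) * sqrt(sum(y^2))).

    Profiles are assumed to be non-zero vectors of equal length.
    """
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)

def average_linkage(profiles):
    """Agglomerate profiles (dict: name -> list of log-ratios).

    Returns the list of merges as (cluster_a, cluster_b, similarity),
    where clusters are tuples of names, in the order they were joined.
    """
    def cluster_similarity(a, b):
        # Average of all pairwise similarities between members of a and b.
        pairs = [(i, j) for i in a for j in b]
        return sum(uncentered_pearson(profiles[i], profiles[j])
                   for i, j in pairs) / len(pairs)

    clusters = [(name,) for name in profiles]
    merges = []
    while len(clusters) > 1:
        a, b = max(combinations(clusters, 2),
                   key=lambda pair: cluster_similarity(*pair))
        merges.append((a, b, cluster_similarity(a, b)))
        clusters = [c for c in clusters if c not in (a, b)] + [a + b]
    return merges
```

Because the means are not subtracted, two genes that are both induced in every condition score as similar even when their profiles are flat, which is one reason this metric is commonly paired with log-ratio data.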

Quantitative real-time PCR (qRT-PCR)
One microgram of total RNA was reverse-transcribed in a 30 µl final reaction volume using the High Capacity cDNA Reverse Transcription Kit with RNase inhibitor (Applied Biosystems, Foster City, CA, USA) following the manufacturer's instructions. For each sample, a negative reverse transcription reaction was done to verify the absence of genomic contamination in subsequent q-PCR. Primer sequences (see Supplementary Table S2) were designed using Primer Express 3.0 software (Applied Biosystems). BLAST searches were performed to confirm gene specificity and the absence of multi-locus matching at the primer site. SYBR-Green q-PCR reactions were performed using the ABI Prism 7900 HT sequence detection system (Applied Biosystems) in 384-well optical reaction plates. 3 µl of cDNA (5 ng/reaction), standard or water (no-template control) were used as template for q-PCR reactions with Fast SYBR Green PCR Master Mix (Applied Biosystems) and primers at 500 nM final concentration. Real-time q-PCR amplifications were carried out (95°C for 20 sec, followed by 40 cycles of 95°C for 1 sec and 60°C for 20 sec, and a final dissociation curve analysis step from 65°C to 95°C). Technical replicate experiments were performed for each biological triplicate sample. The amplification efficiencies of each probe were determined from the slopes of the standard curves obtained with a ten-fold dilution series. The efficiency of the q-PCR amplifications for all of the genes tested was higher than 90%. Amplification specificity for each q-PCR reaction was confirmed by the dissociation curve analysis. The Ct values thus determined were then used for further analysis.
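The link between a standard-curve slope and an amplification efficiency can be sketched as follows. The text does not give the formula used; the sketch assumes the commonly used relation E = 10^(-1/slope) - 1, where the slope is that of Ct plotted against log10 of template quantity, and the function names and example values are hypothetical.

```python
def standard_curve_slope(log10_quantities, ct_values):
    """Least-squares slope of Ct against log10(template quantity)."""
    n = len(ct_values)
    mean_x = sum(log10_quantities) / n
    mean_y = sum(ct_values) / n
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(log10_quantities, ct_values))
    sxx = sum((x - mean_x) ** 2 for x in log10_quantities)
    return sxy / sxx

def amplification_efficiency(slope):
    """Efficiency as a fraction (1.0 means perfect doubling at each cycle).

    Assumes the relation E = 10 ** (-1 / slope) - 1.
    """
    return 10 ** (-1.0 / slope) - 1.0
```

For a ten-fold dilution series, a slope close to -3.32 corresponds to an efficiency close to 100%, and the 90% threshold mentioned above corresponds to a slope of roughly -3.59.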
The gene expression levels were analyzed using relative quantification (delta-Ct method). Sixteen housekeeping genes were tested, and the GeNorm and Normfinder functions in Genex 4.3.8 (MultiD, Göteborg, Sweden) were used to select the most stable genes. The geometric mean of 5 housekeeping genes (dnaQ, glnD, pcnB, uvrB and gyrA) was used to normalize our samples. Data were analyzed with StatMiner 3.0.0 Software (Integromix, Madrid, Spain). Analyses were done with biological replicates, and a relative quantification (RQ) value was calculated for each gene with the control group as a reference. RQ values were adjusted according to specific amplification efficiency. A p-value was computed using a moderated t-test to measure the significance associated with each RQ value. Variations were considered statistically significant when the p-value was <0.05 unless otherwise specified in the tables.
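The normalization described above can be sketched in a few lines. This is not the StatMiner implementation: the function names, the example Ct values and the exact form of the efficiency correction (amounts computed as (1 + efficiency)^(-Ct)) are assumptions made for illustration. The target amount is divided by the geometric mean of the reference-gene amounts, and the RQ value is the ratio of this normalized expression between a sample and the control.

```python
import math

def geometric_mean(values):
    """Geometric mean of positive values."""
    return math.exp(sum(math.log(v) for v in values) / len(values))

def relative_amount(ct, efficiency=1.0):
    """Efficiency-corrected amount on an arbitrary scale: (1 + efficiency) ** (-Ct)."""
    return (1.0 + efficiency) ** (-ct)

def normalized_expression(target_ct, reference_cts,
                          target_efficiency=1.0, reference_efficiencies=None):
    """Target amount divided by the geometric mean of the reference-gene amounts."""
    if reference_efficiencies is None:
        reference_efficiencies = [1.0] * len(reference_cts)
    reference_amounts = [relative_amount(ct, eff)
                         for ct, eff in zip(reference_cts, reference_efficiencies)]
    return relative_amount(target_ct, target_efficiency) / geometric_mean(reference_amounts)

def relative_quantification(sample, control):
    """RQ: normalized expression in the sample relative to the control.

    sample and control are dicts of keyword arguments for normalized_expression.
    """
    return normalized_expression(**sample) / normalized_expression(**control)

# Hypothetical Ct values for one target gene and five reference genes.
control = {"target_ct": 22.0, "reference_cts": [18.0, 19.0, 17.5, 21.0, 20.0]}
sample = {"target_ct": 20.0, "reference_cts": [18.0, 19.0, 17.5, 21.0, 20.0]}
rq = relative_quantification(sample, control)  # 2 cycles earlier at 100% efficiency
```

Using the geometric rather than the arithmetic mean keeps a single highly expressed reference gene from dominating the normalization factor.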