Skip to main content
Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Covert Genetic Selections to Optimize Phenotypes

  • Di Wu,

    Affiliation Monash Institute of Medical Research, Monash University, Monash Medical Centre, Melbourne, Victoria, Australia

  • Elizabeth Townsley,

    Affiliation Department of Pathology, Cell Biology Program, Case Western Reserve University School of Medicine, Cleveland, Ohio, United States of America

  • Alan Michael Tartakoff

    To whom correspondence should be addressed. E-mail:

    Affiliation Department of Pathology, Cell Biology Program, Case Western Reserve University School of Medicine, Cleveland, Ohio, United States of America


In many high complexity systems (cells, organisms, institutions, societies, economies, etc.), it is unclear which components should be regulated to affect overall performance. To identify and prioritize molecular targets which impact cellular phenotypes, we have developed a selection procedure (“SPI”–single promoting/inhibiting target identification) which monitors the abundance of ectopic cDNAs. We have used this approach to identify growth regulators. For this purpose, complex pools of S. cerevisiae cDNA transformants were established and we quantitated the evolution of the spectrum of cDNAs which was initially present. These data emphasized the importance of translation initiation and ER-Golgi traffic for growth. SPI provides functional insight into the stability of cellular phenotypes under circumstances in which established genetic approaches cannot be implemented. It provides a functional “synthetic genetic signature” for each state of the cell (i.e. genotype and environment) by surveying complex genetic libraries, and does not require specialized arrays of cDNAs/shRNAs, deletion strains, direct assessment of clonal growth or even a conditional phenotype. Moreover, it establishes a hierarchy of importance of those targets which can contribute, either positively or negatively, to modify the prevailing phenotype. Extensions of these proof-of-principle experiments to other cell types should provide a novel and powerful approach to analyze multiple aspects of the basic biology of yeast and animal cells as well as clinically-relevant issues.


Cell growth, migration, and ability to resist stress and infection depend on many factors which have been the subject of focused investigations. Further elucidation of these known determinants is of intrinsic interest and could lead to the design of corresponding molecular therapies. Nevertheless, when multiple components are involved–only some of which are known-it generally is unclear which should be targeted in order to have maximal effects on cellular performance. This optimization task becomes especially daunting when the goal is to manipulate performance–not with cells in culture–but in the full complexity of the animal. Moreover, the task of identifying growth inhibitory factors from high complexity cDNA or shRNA libraries is problematic since these are the components which disappear and therefore can be identified only by implementation of specialized strategies, e.g. [1], [2], [3], [4], [5].

The search procedure described below (SPI) starts with a complex library and selects–in one step and without requiring assays which depend on clonal growth-single cDNAs which can have positive or negative impact on growth or on other complex phenotypes. It thus identifies groups of “contributory” cDNAs, and establishes an approximate hierarchy of their importance. Moreover, it makes possible the identification of relevant genes without requiring that they be able to overtly correct a given phenotype. These features of SPI distinguish it from most classical cloning strategies.

To exemplify this approach–which could be extended to shRNAs and to animal cells using viral libraries at low multiplicity-we have used a bank of yeast transformants which differ from each other with regard to single ectopic cDNAs. We have then quantitated the abundance of each cDNA/transformant as the pool of cells grows in liquid culture (Figure 1A). The central premise is that cDNAs which promote growth or survival will cause the corresponding cells and their cDNAs to become more abundant, while those which impair growth will have inverse effects. The selection is “covert,” since the critical data pertain to relative abundance of cDNAs over time rather than to gross alteration of phenotype.

Figure 1. Overview A: Covert Selection Strategy Each color symbolizes the presence of a distinct ectopic cDNA.

Significant changes occur when the cells are grown. cDNAs which sensitize cells to the culture conditions are expected to become depleted, while those which promote growth are expected to become enriched. B: cDNA Quantitation Upper: A typical yeast plasmid for cDNA expression, indicating the flanking targets (red) for PCR amplification. Lower: Comparison of transcriptional profiling to SPI. Both procedures generate biotinylated cRNAs using T7 RNA polymerase. In SPI, 30 cycles with the reverse primer are designed to yield linear single-stranded products, which are converted into a double-stranded copy in a single final step upon addition of the forward primer. C: GC Content of Subsets of cDNAs Having determined which of 933 cDNAs are detected on the microarrays, we subdivide the group into true positives, false positives, true negatives and false negatives. As shown, each group has nearly the same GC content. False negatives therefore are not due to difficulties in copying sequences of high GC content. D: Proportionality of Readout Two groups of 1000 strains were mixed in different proportions and then analyzed. The mean signal ratio between the groups is a near linear function of the relative DNA input, with a Pearson Correlation Coefficient of 0.986. The rectangular insert at the bottom of the figure represents the strategy of mixing different proportions of two pools of transformants.

By identifying the universe of enriched and depleted cDNAs which are characteristic of a given cell, SPI establishes a signature of positive and negative synthetic genetic relations, which is the equivalent of what a mathematician would refer to as a “sensitivity analysis.” Such information will allow inroads into the understanding of seemingly asymptomatic knockout cells. Moreover, adaptations of SPI should provide novel access to optimization in the context of malignant growth, cell migration and susceptibility to infection.

SPI can be effective without requiring panels of deletion strains, “bar-coded” strains or plasmids, or further design of novel expression systems, e.g. [6], [7]. Since it is also likely to be able to make use of conventional cDNA libraries, it should be able to encompass the full diversity of splice variants of transcripts, e.g. for investigation of animal cells. The present study–in addition to identifying cDNAs which govern growth of yeast-provides a proof of principle for such undertakings.



The pool of yeast strains which we have studied carries a defined synthetic library of 5885 plasmids in which each cDNA is under control of a galactose-inducible promoter [8], [9]. The availability of these constructs made it possible to generate an approximately uniform mixture of corresponding transformants without induction, and then to test the impact of their expression when galactose was added. To develop procedures for plasmid recovery, copying of cDNA inserts, and probing of microarrays, we have first worked with small subsets of transformants.

The cDNA inserts are all present in a constant context in the library plasmids. To copy them without introducing the length and abundance bias which is characteristic of conventional PCR, we developed an efficient extraction procedure (Figure S1A) and a linear PCR procedure (“ds-Linear PCR”) (Figure 1B, S1B, S2) which in a final step generates double-stranded products. The rest of the procedure flows directly into the established chemistry, microarray probing with biotinylated cRNAs, and analytic algorithms developed for transcriptional profiling.

By processing a pooled subset of 933 defined transformants (before growth), we were able to tabulate the number of loci for which microarray signals were both expected and were detected (true positives), loci for which signals were expected but were not detected (false negatives), loci for which signals were not expected but were detected (false positives), and loci for which signals were not expected and were not detected (true negatives). In this situation, there were 94% real positives and 6% false negatives. We tabulated the GC content of cDNAs in each in these four categories and observe that false negatives were not enriched in cDNAs of high CG content, as might have been expected if they were difficult to copy-Figure 1C–and there was only modest depletion of longer cDNAs.

When DNA was recovered from two distinct pools of ∼1000 defined transformants (before growth) and different proportions were mixed and processed, the ratio of the mean signal values for each pool was nearly proportional to its relative abundance over the range examined–Figure 1D. In biological experiments, a two-fold change in relative signal intensity therefore corresponds to a comparable relative change in cDNA input.

We have inquired whether the pool of 5885 transformants retains its initial cDNA spectrum (i.e. the distribution of fluorescent intensities among all cDNAs) in liquid culture in selective medium under several conditions. Those which change to the greatest extent–the tails of the distributions–are referred to as “SPI Extremes.”

When quadruplicate cultures were maintained for 30 generations at 30°C without induction (i.e. in glucose medium), there was little change of relative abundance of most cDNAs; however, some SPI Extremes could already be identified, with negative changes being obvious. Increases appear very modest (Figure 2A), in part because the algorithm which was used for these calculations places upper (and lower) limits on the numerical estimates. Figures 2B and 2C show that the spectrum which persisted at 37°C in glucose medium was nearly identical to that which persisted at 30°C in glucose medium. (Full data sets for all cDNAs are provided in Table S1, S2, S3, S4, S5 and S6.)

Figure 2. Differential cDNA Enrichment and Depletion (Fold Change) Upon Growth Replicate data sets were used to calculate fold-enrichment for six comparisons (see Methods).

In each case, the horizontal axis (gene index) displays all cDNAs in order of their relative fold change. A: Cells grown at 30°C in glucose vs the initial sample. B: Cells grown at 37°C in glucose vs the initial sample. C: Fold-changes (B) divided by fold-changes (A). D: Cells grown at 30°C in galactose vs 30°C in glucose. E: Cells grown at 37°C in galactose vs 37°C in glucose. F: Fold-changes (E) divided by fold-changes (D).

Analysis of Growth

To identify cDNAs which affect growth upon deliberate overexpression, we have cultured samples of the full mixture of 5885 transformants in galactose medium for 30 generations at 30°C and compared their cDNA spectra to that of cells cultured in glucose medium-Figure 2D. Small numbers of cDNAs showed distinct differential enrichment or depletion (Table 1 and 2). Their identity is discussed below. For the 100 most enriched or depleted cDNAs, the quantitative estimates of fold change have a normalized standard deviation of ∼10% (see Methods). Although SPI can identify growth inhibitory cDNAs, there is little reason to expect functional coherence within this group since interference with many functions can be detrimental. No candidates are obviously related to plasmid maintenance.

By comparing data sets for cells cultured in galactose medium vs glucose medium, we minimized the possible contribution to SPI Extremes of host cell mutations which affect growth. To validate the impact of single SPI Extreme cDNAs we have used three growth tests at 30°C, with the understanding that the SPI protocol of growth for 30 generations greatly amplifies differences which are expected to be only modest after briefer growth intervals.

In the first test, we evaluated growth on solid media including galactose vs glucose (e.g. Figure 3A/B). All transformants gave colonies of comparable size on glucose plates. Of 70 strains which showed SPI depletion at 30°C less than −10 or SPI enrichment greater than +10, ∼80% were confirmed as being stimulatory or inhibitory by these plate tests (In Figure 3B, 9/13 of the transformants which showed SPI depletion were smaller than the average of the five controls, while 5/13 of the transformants which showed SPI enrichment generated colonies larger than the average size of the controls.) None showed growth characteristics on plates opposite those which are registered by SPI fold-change calculations.

Figure 3. Validations on Solid Media Five control strains (#1–3, 18–19) which show little or no SPI enrichment/depletion and 27 SPI extreme strains including representatives from Table 3 were streaked on glucose dropout plates (Glc: A) or galactose dropout plates (Gal: B) and allowed to grow at 30°C.

Strains which show SPI depletion are in the inner circle, while strains which show enrichment are in the outer circle. Growth of all strains is comparable on the glucose plates. On the galactose plate, the colony sizes of the strains often parallels expectations, both for increases and decreases. Among those illustrated, the concordance is modest, but more extensive surveys raise it to ∼80% (see text). The strains which do not conform to expectations could reflect differences between the requirements for growth in liquid vs solid media. The identities of the strains are given in Table 3. Strains #1–3 and #18–19 are controls whose SPI values range from −0.6 to 1.19

The second test involved growth over 24 hr in liquid galactose vs glucose medium. This test appeared to be more sensitive for detection of the inhibitory cDNAs than the plate test, in that 11/13 of the strains in Figure 3 which showed SPI depletion grew more slowly in liquid culture than the controls. The normalized growth rate of the entire group of 13 strains was 0.57+/−0.15–Table 3. The liquid test appears less sensitive than the plate test for transformants which had consistently shown SPI enrichment. We observe, for example, a normalized growth rate of 0.93+/−0.12 for a set of 22 such strains (Table 3), i.e. comparable to the controls. As illustrated in the enrichment plot in Figure 2D, the positive SPI extremes at 30°C were less pronounced than for the depletions.

In a final liquid medium validation test, we have mixed small pools of transformants, grown them in glucose or galactose medium and assessed the relative abundance of their cDNAs both at t0 and after ten generations, using ds-Linear PCR. As shown in Figure S3, those which had shown enrichment persisted, while those which had been depleted vanished.

Thus, the predictive power of SPI is high. SPI therefore could be applicable under circumstances in which clonal validation is not possible.

Since transformants can be cultured in galactose medium at temperatures up to 37°C without loss of viability (as judged by staining with FUN1), parallel experiments have also been conducted at this modestly stressful temperature (Figure 2E). Under these conditions, there were more examples of strong enrichment than at 30°C, while decreases remained modest. Table 1 lists those which are most enriched at both 30°C and at 37°C.

SPI can identify cDNAs which show differential enrichment upon exposure to stress. It therefore can also make predictions as to how to protect cells against stress or sensitize them to stress. For this purpose, we have evaluated the fold-enrichment of cDNAs after growth at 37°C vs 30°C in galactose–Figure 2F. Several groups of cDNAs were among the 100 which showed the highest differential enrichment (Table 4).


Gene complementation strategies can be used to investigate diseases of monogenic origin but are seldom applied to diseases of more complex origin. The present study shows that related strategies can be extended to the analysis of both normal growth and growth at elevated temperature, providing an example of a complex phenotype. A fundamental difference between SPI and classical complementation cloning is that the goal is to identify a spectrum of “contributory genes” (or cDNAs), rather than a single entity (“command gene”), which by definition have graded impact on cellular phenotypes. Such genes are not identified by classical genetic strategies unless they overtly change the phenotype. A second fundamental difference is a loosening of the concept of genetic selection, since in SPI the selection is at the level of differential cDNA enrichment, rather than overt phenotypic correction. The two attendant characteristics of SPI which are integrally part of this strategy are the expression of single ectopic cDNAs, and the inclusive quantitative readout which is afforded by microarrays.

Two related plasmid-enrichment strategies have recently been described for yeast, both of which used microarrays and were based on two-way (two-color) comparisons. The first used a centromeric galactose-inducible cDNA library to investigate aspects of drug resistance. Plasmids were recovered and transcribed to yield single-stranded fluorescent probes which identified at least one logical target, but appeared also to generate non-specific products [10]. The second study used a high copy yeast genomic library to identify proteins which are related to the exosome. Plasmids recovered after growth were amplified through E. coli and then copied to yield fluorescent probes [11]. In neither study was attention given to the information content of depleted cDNAs.

The realization that multiple cDNAs can promote cell growth may seem at variance with the concept of there being a single rate-limiting-step. Nevertheless, such multiplicity is characteristic of optimization strategies for complex systems, reflecting the intricate interactions among numerous components which contribute to the sustenance of the whole. Unlike other complex systems, when working with cells the availability of gene libraries make it possible to learn which components are most critical. Clearly, the optimization strategy of SPI could be made more stringent by prolonging the growth/selection interval. Moreover, it could be iterated to identify dependent groups of functionally significant genes, e.g. by starting with a cell in which one SPI extreme cDNA is already overexpressed (or depleted).

Our control experiments (Figure 1) using defined mixtures of cDNA transformants show that SPI can yield data sets in which real positives overwhelmingly dominate the readout. Moreover, their recovery does not depend on GC content, the accuracy of replicates appears satisfactory (fold changes having a normalized standard deviation are ∼10%), and signal strength on the microarrays is a monotonic approximately linear function of input. Analysis of pools of transformants grown without induction already identified some differential depletion of cDNAs, presumably because unspecified factors were sequestered by the inserts.

Upon induction, considering that the library which we have used results in significant production of most of the proteins encoded by cDNAs [9], [12], it is notable that yeast tolerates overexpression of a large fraction of its genome.

To our knowledge, there has been no previous identification of single proteins which can increase the rate of growth or survival of yeast under standard growth conditions at 30°C. The cDNAs which become enriched could either directly accelerate the cell cycle or possibly bypass checkpoints or other events which delay rapid growth. Strikingly, twenty-nine cDNAs among the 100 which were most enriched at 30°C were also among the 100 most enriched at 37°C (Table 1). Several of these pertain to events which have previously been recognized as important control points: ribosome synthesis and translation initiation, for example, are generally considered to limit the speed of cell growth and are implicated in oncogenesis [13]. Moreover, censorship of glycoprotein exit from the ER is certainly needed for growth and expansion of the cell wall.

The six of these which encode proteins which are critical for protein synthesis or ribosome genesis-and therefore potentially affect the titer of many other proteins-are:

  • Fun12p/eIF5B, a GTPase which functions in translation initiation by promoting binding of Met-tRNAiMet to small ribosomal subunits and in subunit joining [14].
  • Egd2p, a subunit of the NAC complex which binds nascent polypeptide chains and is thought to influence their delivery to the ER [15].
  • Rps9A/B, a conserved small ribosomal subunit protein which is a major determinant of translational fidelity [16].
  • Utp10p, a protein associated with U3 snoRNA which is required for 18S rRNA synthesis [17].
  • Rpl5p/L1, which is required for assembly of large ribosomal subunits [18].
  • Lsm3p, which functions in mRNA decay [19].

Three cDNAs encode proteins which are relevant to ER-Golgi transport:

  • Gls2p/Rot2p, one of the subunits of the endoplasmic reticulum glucosidase II. This enzyme trims the N-glycans of newly-synthesized glycoproteins after their folding in the ER and prior to exit to the Golgi. It is also required for efficient ER-associated degradation [20], [21]. Suboptimal maturation and expression of one or more of its glycoprotein substrates–or perhaps a cell wall component such as 1,6-β-glucan–could normally limit cell growth.
  • Bet2p, a prenyl transferase which is required to anchor Ypt1p and Sec4p, which function in ER-Golgi transport [22].
  • Sfb2p, a probable component of COP II vesicles [23].

Two encode components which impact the actin cytoskeleton

  • Slm1p, which regulates actin organization in response to stress [24].
  • Mlc2p, the regulatory light chain of myosin I [25].

SPI readily identifies cDNAs which become depleted upon growth. As mentioned above, many of these can be validated qualitatively, both by following the colony size of single transformants on solid media (Figure 3) and by monitoring the growth of pure cultures in liquid medium (Table 3).

Previous studies have used panels of S. cerevisiae transformants to evaluate the ability of individual cDNAs, GST fusions and gene fragments to inhibit growth (Table 5, Table S7) [8], [26], [27], [28], [29], [30], [31]. These mostly qualitative assays monitor clonal growth on solid media, which places distinct demands on cells (e.g. [32]), while SPI is based on a competition assay in liquid culture and lends itself to quantitative comparisons. Table 5 summarizes the vectors used and numbers of inhibitory cDNAs which were identified. Table 6 tabulates the extent of overlap with SPI. From the SPI data set we have used the 600 most depleted cDNAs for this comparison since the two recent large-scale studies based on colony growth of cDNA transformants have identified 454 and 759 inhibitory cDNAs, respectively [8], [12]. Judging from this comparison, SPI shows the greatest overlap with the one study which has used the same GST-ORF expression library that we have used [8]–although it uses a different host cell. The Venn diagram in Figure 4 illustrates the overlap between SPI and this study as well as a recent study which used an ORF-GST fusion library [12].

Figure 4. Venn Diagram comparison of data sets which have identified growth-inhibitory cDNAs.

See Table 5 and 6 for detail.

Table 7 enumerates the 27 cDNAs which are shared by the SPI 600, the second study which used the same library, and the study which is based on expression of ORF-GST fusions [12]. Of this group, only 5 are also shared by the investigation which surveys random transcriptional fragments [27]. Proteins which affect growth can do so as part of complexes. Since the functionality of such complexes may be perturbed by alterations of their stoichiometry [33], we have asked whether the group of 27 depleted SPI extremes enumerated in Table 7 is enriched in proteins which are known to be in stable complexes. Judging from the database [], there is no obvious enrichment. A similar conclusion has been reached in the second study which surveys the same GST-ORF library [8].

With regard to stress resistance, among the 100 which show the greatest differential enrichment at 37°C (vs 30°C) are 21 cDNAs which encode mitochondrial proteins, reminiscent of longstanding observations of the importance of functional mitochondria for survival at elevated temperature [34]. Additionally, six contribute to expression of GPI-anchored proteins, which concentrate at the cell surface [35], [36]. Table 4 also lists groups of differentially enriched cDNAs which are implicated in protein turnover or stress responses.

Calculations of differential enrichment at 37°C are clearly a composite, including cDNAs which were enriched at 37°C as well as those which were depleted at 30°C. Figure S4 separates out these two subsets by comparing the 37°C vs 30°C data to both the enrichment at 37°C and the depletion at 30°C. The subgroup encoding mitochondrial proteins accounts for ∼40% of those which show an increment at 37°C.

One might expect that the SPI Extreme (enriched) cDNAs would correspond to essential genes; however, as for the total genome, only about 20% of the cDNAs which are most enriched at 30°C (or 37°C) correspond to genes which are essential under standard growth conditions (Saccharomyces Genome Database). Thus, survival of the organism under laboratory conditions cannot require those genes which have the potential to be most beneficial. These “accessory” beneficial genes represent an evolutionary opportunity (or therapeutic opportunity) which can be detected by SPI.

It is also of interest to ask whether SPI Extreme enriched cDNAs correspond to mRNAs which are upregulated at 37°C (in either glucose or galactose medium). The comparison is greatly simplified since the same microarrays can be used for both purposes. As shown in Figure S5, there is minimal concordance. This discrepancy could signify that the normal circuitry of gene expression seldom allows the cell to manipulate the level of single transcripts, i.e. bystander transcripts which would be co-modulated would sabotage any attempt to up- or down-regulate those which–by themselves–could be most useful.

We expect that the greatest prospect for implementation of SPI will be in the context of animal cell biology, where it should again exemplify the utility of genetic approaches outside the normal realm of genetic inquiry. This is especially because of the difficulty of choosing optimal therapeutic targets for many diseases. In each case, subtle selective events are surely always at work. SPI makes it possible to use a covert selection “in situ” to identify the single genes which should be manipulated to influence such phenotypes.

Materials and Methods

Cells and cDNA Library

These materials were obtained from M. Snyder and D. Gelperin (Yale University) [9]. The haploid host cell was YC123 = SF657-2D = Snyder strain 258 (MATa pep4-3 his4-580 ura3-52 leu2-3, 112). The 10 kb pEG(KG) 2μ vector used for cDNA expression carried both URA3 and leu2-d selectable markers and appends GST to the N-terminus of each product [37]. Frozen stocks of single transformants and pools of transformants were prepared by standard methods.

Cell Growth, Plasmid and RNA Recovery

Liquid cultures were established from single colonies. After growth in uracil dropout medium at room temperature, aliquots were diluted to A600 = 0.1 using uracil drop-out glycerol-lactate medium (2% glycerol, 2% lactate, 0.05% glucose, 0.67% bacto-yeast nitrogen base without amino acids, pH 5.5), grown overnight at room temperature to A600 = 1−2 and then harvested by sedimentation.

5 ml samples of cells were washed with water, broken by vortexing with glass beads, and extracted with a Qiagen DNA extraction kit.

To evaluate cell viability, samples were stained with FUN1 (Molecular Probes (F-7030)) and examined by epifluorescence. Living cells showed bar-shaped orange structures in the vacuole while dead cells lacked this signal and were predominantly green.

To study growth at 30°C and 37°C, frozen pools including all 5885 strains were thawed, washed, and then adjusted to A600 = 0.05−0.1 in glycerol-lactate medium. 5 ml at OD600 = 1−2 was set aside and refrozen to provide a t0 sample. Duplicate cultures were supplemented with 2% glucose or 2% galactose and then shaken at 30 and 37°C. Growth was monitored at 600 nm and aliquots of each culture were rediluted to A600 = 0.05−0.1 so that the A600 never exceeded 1.5. Duplicate 5 ml cultures were snap frozen in liquid nitrogen after a total of 30 generations.

For RNA analysis, we used hot phenol [38] to extract logarithmic cultures growing in glycerol-lactate medium supplemented with 2% glucose- or 2% galactose and processed the samples in accordance with Affymetrix protocols.

Ds-Linear PCR Amplification

Linear amplification of cDNA inserts was performed using a mixture of DNA polymerases (Taq and pfu) and the reverse primer (5′-TGTAATACGACTCACTATAGGGGATCCCCGGGAATTGCCATG-3) which includes the T7 phage RNA polymerase sequences (5 min pre-denaturation at 94°C, followed by 30 cycles of 1 min at 94°C, 1 min at 58°C and 7 min at 67°C, and concluding with 10 min at 70°C). The concluding step with addition of forward primer (5′-TGGTGGTGGTGGAATTCCAGCTGACCACC-3′) and Hotstart taq polymerase consisted of 6 min at 94°C, 5 sec at 95°C, 1 min at 58°C, 7 min at 67°C and 10 min at 70°C. The primers were complementary to sequences which flank the inserts and do not include the regions which encode GST or the GAL promoter.

Microarray analysis

Ds-linear PCR products were pooled, concentrated and purified using a Qiagen column. In general, 200 ng of PCR product was used to generate biotinylated cRNA probes for Affymetrix S98 DNA microarrays. Samples were processed at the University Affymetrix facility, scanning the microarrays with a GeneArray scanner and preprocessing data using RMA (Robust multiarray average) [39], [40]. For background-correction, normalization, and signal intensity calculations. In all experiments described below we studied independent triplicates or quadruplicates and based analysis on those cDNAs for which signals are classified as “Present” in each sample, using the MAS5 algorithm. Normalized signals from totally independent replicates have a mean correlation coefficient of 0.963014.

To learn whether cDNAs which give only a weak signal at t = 0 can be studied reproducibly, we asked whether there is any correlation between initial signal intensity and the consistency of their presence (or absence) after 30 generations of growth. We did not detect any such correlation using Student's t-test. Signal intensity data from replicate samples were used to perform differential expression analysis by fitting a linear model (one way ANOVA) using the limma package. The P values were adjusted with an empirical Bayesian method and the Benjamini and Hochberg False Discover Rate. Fold changes of >2 with p<0.05 were considered meaningful. Average fold-changes were then used to produce the enrichment or “S-plots” and to identify the most enriched and most depleted cDNAs. Fractional values reflect depletion and are represented according to the following convention: 0.1 = −10; 0.2 = −5 etc. Standard deviations of the logarithm of fold change values were also estimated using the limma (Linear Models for Microarray Analysis) package [41]. For the groups of fold changes listed in Table S1, S2, S3, S4, S5 and S6, the means +/− standard deviation are −6.43+/−0.60 (30°C decreases); 4.24+/−0.48 (30°C increases); −6.24+/−0.47 (37°C depletion); 4.90+/−0.47 (37°C increases). Given the normalization procedures of RMA, the fold change estimates are relative.


Growth of individual transformants was studied by streaking single colonies onto solid media (2% glycerol, 2% lactate in uracil drop-out medium, pH5.5+2% glucose or+0.05% raffinose and 2% galactose) at 30°C and following their growth for increasing periods of time. We also have monitored the growth of duplicate cultures of single transformants in liquid uracil drop-out glycerol-lactate medium supplemented with 2% glucose vs 2% galactose at 30°C over 24 hrs. Cultures were initiated with an A600 = 0.03. Light scattering at 620 nm was measured with a Coulter-Beckman plate reader as a function of time. Kinetic rate constants were estimated using Origin and the galactose data were compared to the rate of growth of parallel cultures of each strain in glucose medium. To provide a uniform point of comparison between experiments, the five control strains (#1–3 and #18–19 in Figure 3) were included in each experiment, their average rate of growth was normalized to 1.0 and individual strains were then compared to them.

Alternatively, mixtures of transformants were cultured in liquid media identical to those used for the initial experiment, processed using ds-Linear PCR and the resulting double-stranded DNA products were then resolved on Agarose gels.

To monitor growth in the absence of galactose, selected cDNAs (without the GST moiety) were copied by conventional PCR and subcloned into a URA3 vector in which transcription was under control of a MET25, methionine-represssible, promoter [42]. Corresponding transformants were then tested on plates made with complete medium vs methionine-dropout medium.

Supporting Information

Figure S1.

Plasmid Extraction and Examples of ds-Linear PCR Products A)DNA was extracted from a set of six transformants, restricted with SmaI, and fractionated on a gel and stained with ethidium bromide. Note the progressively increasing sizes of the inserts, whose abundance is comparable to that of the endogenous 2-micron circle. B) An equivalent pool of eight plasmids was used for ds-Linear PCR. The fractionated products are illustrated adjacent to size standards (*)

(0.10 MB DOC)

Figure S2.

Determination of the End-Point for ds-Linear PCR Figure S1 shows that the products of ds-Linear PCR yield sharp bands, which is surprising. To determine the approximate point of arrest of the linear cycles, the products of the first 30 cycles (30R)-which included only the reverse primer-were incubated with any of three forward primers for the final step of the reaction. Our conventional primer (F)-not shown-and one of the test primers (F1) caused an obvious increase of intensity of the product bands. A second test primer (F2) did not. The point of arrest is therefore between sites complementary to F1 and F2. Primer F2 is 103 base upstream of primer F1, which is 234 bases upstream of primer F. The sequence of F1 is 5′-ATGTGCCTGGATGCGTTCC-3′. The sequence of F2 is 5′-TGAAAATGTTCGAAGATCGTTTATGTC-3′.

(0.14 MB DOC)

Figure S3.

Validation by PCR. A small pool of transformants (a–d) was either extracted at once or allowed to grow for ten generations at 30°C in glucose or galactose medium, before DNA extraction. Their degree of enrichment or depletion (fold-change) in the initial experiment with 5885 transformants is indicated in parentheses. Triplicate samples of the initial and final pools were copied by ds-Linear PCR and the products were analyzed. Note that the enriched cDNA becomes dominant and that those which had been depleted vanish, by comparison to a control. The control plasmid had become neither enriched nor depleted when part of the complete pool of strains in the initial experiment.

(0.18 MB TIF)

Figure S4.

Analysis of Differential Enrichment (37°C vs 30°C) cDNAs which show strong differential enrichment in Table 2 can do so either because of enrichment at 37°C or depletion at 30°C. Correspondingly, cDNAs which show strong differential depletion at 37°C can do so either because of depletion at 37°C or enrichment at 30°C. The three panels to the left concern the 100 cDNAs which show the greatest relative depletion at 37°C, while those at the right concern the 100 which show the greatest enrichment. Note, in each case, the presence of cDNAs which show both types of behavior.

(0.05 MB TIF)

Figure S5.

Comparison of 30°C SPI Data to Transcriptional Profiles. Panel A represents SPI second order fold data which are calculated by dividing the 37°C vs 30°C fold change in galactose by the 37°C vs 30°C fold change in glucose. In panels B–D, the second order SPI data are compared to RNA transcript profiles of the same host cell (without plasmid) cultured at 30°C or 37°C in glucose or galactose medium. In panel B the RNA signals at 37°C in glucose are compared to RNA data at 30°C in glucose. In panel C the RNA signals at 37°C in galactose are compared to 30°C in galactose. In panel D the (37°C galactose/30°C galactose) ratio is compared to the (37°C glucose/30°C glucose) ratio. As can be readily seen, there is no widespread correspondence between the levels of mRNAs and SPI data. The transcripts which do show strong induction upon addition of galactose include the familiar set of genes GAL1, GAL2, etc.

(0.07 MB TIF)

Table S1.

glc30 vs control. For Tables S1, S2, S3, S4, S5 and S6: Full SPI Data Sets. The averaged data are divided into six groups as in Figure 2, comparing S1) Cells cultured in glucose medium at 30°C to t0 samples, S2) Cells cultured in glucose medium at 37°C to t0 samples, S3) Cells cultured in glucose medium at 37°C to cells cultured in glucose medium at 30°C, S4) Cells grown at 30°C in galactose vs 30°C in glucose, S5) Cells grown at 37°C in galactose vs 37°C in glucose, and S6) Cells grown at 37°C in galactose vs 30°C in galactose. The successive columns indicate the Affymetrix identification number interrogated, the Systematic and Gene names, the logarithm of fold change (FC), the t statistic (1/normalized standard deviation), the P value, adjusted P value, and B statistic.

(1.82 MB XLS)

Table S7.

Inhibitory cDNAs, Comparison to Previous Studies. The citations of earlier investigations which have identified yeast cDNAs and gene fragments which inhibit growth are included in the text. In the study of Sopko et al. the magnitude of growth inhibition is designated 1–4, with 1 being the strongest inhibition.

(0.28 MB XLS)


We thank Drs. D. Gelperin and M. Snyder for materials and B. Dujon, T. Hattier, P. Leahy, S. Lemmon, C. Li, G. Smyth, M. Sy, M. Veigl, H. Weinstein, C. Widnell, P. Zhang and Y. Zhang for information and advice.

Author Contributions

Conceived and designed the experiments: AT. Performed the experiments: DW ET. Analyzed the data: AT DW ET. Wrote the paper: AT.


  1. 1. Deiss LP, Feinstein E, Berissi H, Cohen O, Kimchi A (1995) Identification of a novel serine/threonine kinase and a novel 15-kD protein as potential mediators of the gamma interferon-induced cell death. Genes Dev 9: 15–30.
  2. 2. Moffat J, Grueneberg DA, Yang X, Kim SY, Kloepfer AM, et al. (2006) A lentiviral RNAi library for human and mouse genes applied to an arrayed viral high-content screen. Cell 124: 1283–1298.
  3. 3. Ossovskaya VS, Mazo IA, Chernov MV, Chernova OB, Strezoska Z, et al. (1996) Use of genetic suppressor elements to dissect distinct biological effects of separate p53 domains. Proc Natl Acad Sci U S A 93: 10309–10314.
  4. 4. Pestov DG, Lau LF (1994) Genetic selection of growth-inhibitory sequences in mammalian cells. Proc Natl Acad Sci U S A 91: 12549–12553.
  5. 5. Singhi AD, Kondratov RV, Neznanov N, Chernov MV, Gudkov AV (2004) Selection-subtraction approach (SSA): a universal genetic screening technique that enables negative selection. Proc Natl Acad Sci U S A 101: 9327–9332.
  6. 6. Giaever G, Flaherty P, Kumm J, Proctor M, Nislow C, et al. (2004) Chemogenomic profiling: identifying the functional interactions of small molecules in yeast. Proc Natl Acad Sci U S A 101: 793–798.
  7. 7. Parsons AB, Brost RL, Ding H, Li Z, Zhang C, et al. (2004) Integration of chemical-genetic and genetic interaction data links bioactive compounds to cellular target pathways. Nat Biotechnol 22: 62–69.
  8. 8. Sopko R, Huang D, Preston N, Chua G, Papp B, et al. (2006) Mapping pathways and phenotypes by systematic gene overexpression. Mol Cell 21: 319–330.
  9. 9. Zhu H, Bilgin M, Bangham R, Hall D, Casamayor A, et al. (2001) Global analysis of protein activities using proteome chips. Science 293: 2101–2105.
  10. 10. Butcher RA, Schreiber SL (2006) A microarray-based protocol for monitoring the growth of yeast overexpression strains. Nat Protoc 1: 569–576.
  11. 11. Abruzzi K, Denome S, Olsen JR, Assenholt J, Haaning LL, et al. (2007) A novel plasmid-based microarray screen identifies suppressors of rrp6Delta in Saccharomyces cerevisiae. Mol Cell Biol 27: 1044–1055.
  12. 12. Gelperin DM, White MA, Wilkinson ML, Kon Y, Kung LA, et al. (2005) Biochemical and genetic analysis of the yeast proteome with a movable ORF collection. Genes Dev 19: 2816–2826.
  13. 13. Mamane Y, Petroulakis E, Rong L, Yoshida K, Ler LW, et al. (2004) eIF4E–from translation to transformation. Oncogene 23: 3172–3179.
  14. 14. Guillon L, Schmitt E, Blanquet S, Mechulam Y (2005) Initiator tRNA binding by e/aIF5B, the eukaryotic/archaeal homologue of bacterial initiation factor IF2. Biochemistry 44: 15594–15601.
  15. 15. Reimann B, Bradsher J, Franke J, Hartmann E, Wiedmann M, et al. (1999) Initial characterization of the nascent polypeptide-associated complex in yeast. Yeast 15: 397–407.
  16. 16. Stansfield I, Jones KM, Herbert P, Lewendon A, Shaw WV, et al. (1998) Missense translation errors in Saccharomyces cerevisiae. J Mol Biol 282: 13–24.
  17. 17. Dragon F, Gallagher JE, Compagnone-Post PA, Mitchell BM, Porwancher KA, et al. (2002) A large nucleolar U3 ribonucleoprotein required for 18S ribosomal RNA biogenesis. Nature 417: 967–970.
  18. 18. Deshmukh M, Stark J, Yeh LC, Lee JC, Woolford JL Jr (1995) Multiple regions of yeast ribosomal protein L1 are important for its interaction with 5 S rRNA and assembly into ribosomes. J Biol Chem 270: 30148–30156.
  19. 19. Beggs JD (2005) Lsm proteins and RNA processing. Biochem Soc Trans 33: 433–438.
  20. 20. Helenius A, Aebi M (2004) Roles of N-linked glycans in the endoplasmic reticulum. Annu Rev Biochem 73: 1019–1049.
  21. 21. Simons JF, Ebersold M, Helenius A (1998) Cell wall 1,6-beta-glucan synthesis in Saccharomyces cerevisiae depends on ER glucosidases I and II, and the molecular chaperone BiP/Kar2p. Embo J 17: 396–405.
  22. 22. Rossi G, Yu JA, Newman AP, Ferro-Novick S (1991) Dependence of Ypt1 and Sec4 membrane attachment on Bet2. Nature 351: 158–161.
  23. 23. Peng R, De Antoni A, Gallwitz D (2000) Evidence for overlapping and distinct functions in protein transport of coat protein Sec24p family members. J Biol Chem 275: 11521–11528.
  24. 24. Fadri M, Daquinag A, Wang S, Xue T, Kunz J (2005) The pleckstrin homology domain proteins Slm1 and Slm2 are required for actin cytoskeleton organization in yeast and bind phosphatidylinositol-4,5-bisphosphate and TORC2. Mol Biol Cell 16: 1883–1900.
  25. 25. Luo J, Vallen EA, Dravis C, Tcheperegine SE, Drees B, et al. (2004) Identification and functional analysis of the essential and regulatory light chains of the only type II myosin Myo1p in Saccharomyces cerevisiae. J Cell Biol 165: 843–855.
  26. 26. Akada R, Yamamoto J, Yamashita I (1997) Screening and identification of yeast sequences that cause growth inhibition when overexpressed. Mol Gen Genet 254: 267–274.
  27. 27. Boyer J, Badis G, Fairhead C, Talla E, Hantraye F, et al. (2004) Large-scale exploration of growth inhibition caused by overexpression of genomic fragments in Saccharomyces cerevisiae. Genome Biol 5: R72.
  28. 28. Espinet C, de la Torre MA, Aldea M, Herrero E (1995) An efficient method to isolate yeast genes causing overexpression-mediated growth arrest. Yeast 11: 25–32.
  29. 29. Liu H, Krizek J, Bretscher A (1992) Construction of a GAL1-regulated yeast cDNA expression library and its application to the identification of genes whose overexpression causes lethality in yeast. Genetics 132: 665–673.
  30. 30. Ramer SW, Elledge SJ, Davis RW (1992) Dominant genetics using a yeast genomic library under the control of a strong inducible promoter. Proc Natl Acad Sci U S A 89: 11589–11593.
  31. 31. Stevenson LF, Kennedy BK, Harlow E (2001) A large-scale overexpression screen in Saccharomyces cerevisiae identifies previously uncharacterized cell cycle genes. Proc Natl Acad Sci U S A 98: 3946–3951.
  32. 32. Meunier JR, Choder M (1999) Saccharomyces cerevisiae colony growth and ageing: biphasic growth accompanied by changes in gene expression. Yeast 15: 1159–1169.
  33. 33. Papp B, Pal C, Hurst LD (2003) Dosage sensitivity and the evolution of gene families in yeast. Nature 424: 194–197.
  34. 34. Ogur M, Ogur S, St John R (1960) Temperature Dependence of the Spontaneous Mutation Rate to Respiration Deficiency in Saccharomyces. Genetics 45: 189–194.
  35. 35. Lim T, Loh W, Shih Y (2000) A comparison of prediction accurary, complexity, and training time of thirty-three old and new classification algorithms. Machine Learning 40: 203–228.
  36. 36. Tomishige N, Noda Y, Adachi H, Shimoi H, Takatsuki A, et al. (2003) Mutations that are synthetically lethal with a gas1Delta allele cause defects in the cell wall of Saccharomyces cerevisiae. Mol Genet Genomics 269: 562–573.
  37. 37. Mitchell DA, Marshall TK, Deschenes RJ (1993) Vectors for the inducible overexpression of glutathione S-transferase fusion proteins in yeast. Yeast 9: 715–722.
  38. 38. Kohrer K, Domdey H (1991) Preparation of high molecular weight RNA. Methods Enzymol 194: 398–405.
  39. 39. Bolstad BM, Irizarry RA, Astrand M, Speed TP (2003) A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics 19: 185–193.
  40. 40. Speed TP (2003) Statistical analysis of gene expression microarray data. Boca Raton, FL: Chapman & Hall/CRC. xiii, 222 p., [224] p. of plates p.
  41. 41. Wettenhall JM, Smyth GK (2004) limmaGUI: a graphical user interface for linear modeling of microarray data. Bioinformatics 20: 3705–3706.
  42. 42. Niedenthal RK, Riles L, Johnston M, Hegemann JH (1996) Green fluorescent protein as a marker for gene expression and subcellular localization in budding yeast. Yeast 12: 773–786.