Transcriptome-wide profiling of acute stress induced changes in ribosome occupancy level using external standards

Annie W. Shieh; Sandeep K. Bansal; Zhen Zuo; Sidney H. Wang

doi:10.1371/journal.pone.0294308

Abstract

Acute cellular stress is known to induce a global reduction in mRNA translation through suppression of cap dependent translation. Selective translation in response to acute stress has been shown to play important roles in regulating the stress response. However, accurately profiling translational changes transcriptome-wide in response to acute cellular stress has been challenging. Commonly used data normalization methods operate on the assumption that any systematic shifts are experimental artifacts. Consequently, if applied to profiling acute cellular stress-induced mRNA translation changes, these methods are expected to produce biased estimates. To address this issue, we designed, produced, and evaluated a panel of 16 oligomers to serve as external standards for ribosome profiling studies. Using Sodium Arsenite treatment-induced oxidative stress in lymphoblastoid cell lines as a model system, we applied spike-in oligomers as external standards. We found our spike-in oligomers to display a strong linear correlation between the observed and the expected quantification, with small ratio compression at the lower concentration range. Using the expected fold changes constructed from spike-in controls, we found in our dataset that TMM normalization, a popular global scaling normalization approach, produced 87.5% false positives at a significant cutoff that is expected to produce only 10% false positive discoveries. In addition, TMM normalization produced a systematic shift of fold change by 3.25 fold. These results highlight the consequences of applying global scaling approaches to conditions that clearly violate their key assumptions. In contrast, we found RUVg normalization using spike-in oligomers as control genes recapitulated the expected stress induced global reduction of translation and resulted in little, if any, systematic shifts in the expected fold change. Our results clearly demonstrated the utility of our spike-in oligomers, both for constructing expected results as controls and for data normalization.

Citation: Shieh AW, Bansal SK, Zuo Z, Wang SH (2023) Transcriptome-wide profiling of acute stress induced changes in ribosome occupancy level using external standards. PLoS ONE 18(11): e0294308. https://doi.org/10.1371/journal.pone.0294308

Editor: Joel McManus, Carnegie Mellon University, UNITED STATES

Received: November 2, 2022; Accepted: October 30, 2023; Published: November 21, 2023

Copyright: © 2023 Shieh et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All sequencing data files are available from Gene Expression Omnibus (accession number GSE212714). The Gene Expression Omnibus (GEO) can be found at the following website. https://www.ncbi.nlm.nih.gov/geo/.

Funding: This work was partly supported by National Institute of Health grant R01GM139980 to SHW, AWS, SKB.The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Advances in high throughput gene expression profiling technologies, such as microarrays and next generation sequencing technologies, have revealed many technical challenges in making seemingly straightforward comparisons. Oftentimes, despite great care in implementing experimental procedures, systematic shifts in quantification range are observed between samples, and sometimes even between samples that are otherwise considered identical. In response to these technical challenges, many procedures have been developed to identify and to remove these unwanted variations [1–9]. Approaches such as standardizing read count by library size and global scaling to homogenize quantitative range across samples are among the most commonly used strategies for increasing statistical power to detect differential expression and to improve accuracy in effect size estimates [2, 5]. These powerful approaches, however, make certain assumptions that need to be met in order to deliver the intended results [10]. For example, a global scaling normalization approach, such as TMM, assumes that any systematic shift in quantitative range is an experimental artifact, and scales the data accordingly to mitigate the systematic bias. Such assumption is often valid, as a clear global shift would indicate an unlikely scenario that most, if not all, genes are changing expression level in the same direction relative to the control samples/conditions. In some scenarios, however, a systematic shift in gene expression is a part of the biological process of interest. For example, a key step of the Unfolded Protein Response (also known as Integrated Stress Response) is the phosphorylation of a translation initiation factor eIF2-alpha, which in turn suppresses cap-dependent translation in order to reduce protein synthesis load in the ER [11–15]. In such a scenario, a global reduction in mRNA translation is expected. Applying global scaling normalization on a dataset profiling stress induced changes in mRNA translation level will therefore erroneously normalize away many of the stress induced changes by severely distorting the effect size of most, if not all, quantitated genes.

In a scenario where a global reduction in gene expression is expected, a normalization approach alternative to global scaling is needed to accurately quantitate gene expression. External standards designed for quality control to facilitate cross platform comparison, such as ERCC RNA spike-in control mixes [16, 17], could be used in such experiments to provide reference points for normalization. In theory, a straightforward regression based standard curve approach could connect estimates of gene expression level to RNA concentration (i.e. absolute quantification), however, this approach has been shown to be ineffective for normalization efforts to increase power in differential expression tests: Using libraries constructed with ERCC spike-in pool, Jiang et al. found results from a standard-curve-based normalization to produce significant deviations in the observed fold change from the expected, despite high overall Pearson correlation for spike-in quantification between libraries [17]. The observed deviations exceeded two fold for spike-in controls in the lower abundance range, likely arising simply from sampling variation [17]. In other words, despite achieving overall accuracy, a standard curve approach is not precise enough for detecting biologically relevant fold changes, especially at the extreme ends of the quantitative range.

To provide a feasible alternative, Risso et al. contemplated between different approaches of using spike-in controls for data normalization and demonstrated potential issues with spike-in based global scaling approaches and a spike-in based local regression approach [1, 2, 18]. Instead, Risso et al. adapted a factor analysis approach for control-gene-based normalization (RUVg) by treating quantifications of spike-in oligomers as control genes. RUVg estimates unwanted effects from control genes, agnostically to the effect of origin, and uses these estimates as covariates in a linear model for differential expression tests and for fold change estimates. Using quantification of ERCC spike-in oligomers as control genes, Risso et al. found RUVg to outperform its contemporaries [1].

While ERCC spike-in oligos are available for RNA-Seq studies, they are not easily adaptable to ribosome profiling experiments, an approach enabling transcriptome-wide profiling of mRNA translation [19]. Consequently, transcriptome-wide profiling of translation level changes in response to external stimuli, especially for those that have the potential to induce global shift in quantification, has been challenging [20]. Earlier stress response ribo-seq studies therefore opted for either not normalizing the data or normalizing the data with available methods that were not necessarily suited for the purpose [21, 22]. On the other hand, Andreev et al. used a single oligomer spike-in as external control for data standardization in their study on the impact of Sodium Arsenite treatment (oxidative stress) on mRNA translation in HEK293T cells [23]. Their analysis, however, was limited by a rather small sample size and by their design of a single spike-in oligomer control, which does not enable more rigorous assessments on normalization impact. Work from Iwasaki et al. and subsequently Liu et al used mitochondrial ribosome footprints as controls for normalization and achieved apparent success [24, 25]. However, as was articulated by the authors, the underlying assumptions supporting the use of mitochondrial footprints as controls for normalization may or may not be met in different experimental conditions. Such an approach is therefore unlikely to be generalizable. Even in studies where the results appear to meet expectations, the caveat of observing false results introduced by potential systematic shifts in mitochondrial ribosome footprints could undermine the conclusion.

To meet this apparent need, a few recent studies have explored the utility of different spike-in formulations as external standards for data normalization [26, 27]. Here we report our efforts on this front. We developed and further characterized a panel of 16 spike-in control oligomers designed for ribosome profiling experiments. To characterize these spike-in oligomers and demonstrate their utility in ribo-seq data normalization, we conducted a study to profile transcriptome-wide changes in ribosome occupancy level induced by Sodium Arsenite treatment. Our spike-in formulation and study design provides the expected fold changes between samples, which enables rigorous evaluation of normalization impact on effect size and false positive rate.

Results

A set of short spike-in control oligos designed for human ribo-seq experiments

Our spike-in oligomer design follows Lutzmayor et al. oligomer design for small RNA-Seq [28]. The key features are 1. A core sequence at position 5–24 nt that mimics the base composition of the first 20 nts of human miRNA. 2. Four-nucleotide-random-sequences flanking each end. 3. A free energy profile mimicking endogenous miRNA. 4. End modifications to mimic ribosome footprints (S1A Fig in S1 File). We designed the spike-in oligomers based on endogenous miRNA minimum free energy profile, because we frequently observed “ribosome protected fragments” pileup at miRNA loci. While it remained unclear what proportion of these miRNA fragments were true ribosome footprints and what proportion were resulted from miRNA RISC complex co-purified with monosomes during size exclusion column purification, these miRNA fragments clearly made their way into the ribo-seq data. We therefore reason that artificial oligomers mimicking miRNA folding and base composition will make good candidates as spike-in controls for ribo-seq studies.

Using this design, we generated 993 core sequences each in combination with 65,536 different permutations of flanking sequences. We selected 16 oligo sequences, one from each category of core sequence, based on their free energy profile similarity to the endogenous miRNA (S1B Fig in S1 File); 16 different spike-in oligomers were purchased from the manufacturer in two batches of 8 (S1 Table in S2 File). These synthetic oligomers were then mixed to create spike-in pools, following the modified Latin square design by Pine et al. [29], to create pools of spike-in oligos that cover a relative quantitative range of 1~17,920 (Fig 1A). We applied these spike-in pools for ribo-seq experiments by adding the oligo-mix to the RNA sample prepared from the elute of size exclusion column purification of macromolecular complexes (intended to isolate individual ribosomes loaded with footprinting RNA fragments), which we termed digested macromolecular RNA. Because the majority of the digested macromolecular RNA are not ribosome footprints, through trial and error, we empirically determined the ratio between the spike-in oligo pool and the digested macromolecular RNA that will result in a desirable amount of spike-in oligomers sequenced in the final ribo-seq library. We found that, in our system, a 1:20,000 weight ratio between the spike-in pool and the digested macromolecular RNA results in an average of 2.30% final spike-in count/ total footprint count in the sequencing data.

Download:

Fig 1. The design and application of spike-in oligomers for stress response ribo-seq studies.

(a) Spike-in oligomer pooling design: A stacked barplot showing proportion of spike-in mixes used in each spike-in pool. Each mix is composed of 4 spike-in oligomers (e.g. D1, D2, D3, D4) mixed in the same 8 fold increment from oligomer 1 to oligomer 4 (proportion in percentages presented in the table to the left for mix D). Below the stacked barplot, samples receiving each spike-in pool are labeled with color code distinguishing different effects of interests to highlight the study design. Sodium Arsenite treatment (Control vs. Experiment). Donor (18505 vs. 19204 vs. 19193). Library preparation batch (T vs. U). For true negative and true positive control comparisons we compared quantification of the same oligomer between treated (Experiment) and untreated (Control) cell lines that either received the same oligomer pool (true negative; black brackets) or the cell lines were derived from the same individual but received different oligomer pools (true positive; the red bracket marked one such example), respectively. (b) Western blots for GM18505 indicate that our treatment conditions induced integrated stress response. Primary antibodies used were labeled to the left of the blots. Treatment types and durations were labeled below the blots. Treatment type "None” indicated baseline conditions, while treatment type "H2O” indicated the control condition used in the ribo-seq study. Note the band of phosphorylated eIF2α, a stress marker, only visible in Sodium Arsenite treated samples in contrast to the loading controls. (c, d, e) Boxplots summarizing impact of different normalization strategies on overall distribution of ribosome occupancy level (log2 counts) across all genes analyzed. Boxes are color coded to distinguish control samples (blue) from Sodium Arsenite treated samples (orange). The maximum and minimum values for the boxplot (i.e. the whiskers) are defined by the genes with quantification levels closest to (but without exceeding) 1.5 times of the interquartile range extending from the box.

https://doi.org/10.1371/journal.pone.0294308.g001

A stress response ribo-seq study for evaluating spike-in utility

We aim to evaluate the utility of spike-in oligomers in a well-established cell line model system in a scenario where a global shift in translation level is expected. As such, we collected ribo-seq data with spike-in from three unrelated HapMap LCL cell lines (GM18505, GM19193, GM19204) with and without 30 minutes of Sodium Arsenite induced Oxidative stress. A 30 minute treatment with Sodium Arsenite at a final concentration of 134 μM is sufficient to induce phosphorylation of eIF2α in LCL (Fig 1B and S2 Fig in S1 File). Phosphorylation of eIF2α is a hallmark of integrated stress response and is expected to result in a global reduction of cap-dependent mRNA translation [11, 12]. From each sample we created two replicates of sequencing libraries, one of which was treated with an additional CRISPR depletion step aimed to further reduce the level of rRNA contamination in ribo-seq libraries.

We generated a total of 873 million sequencing reads, which, after filtering out the rRNA, tRNA, and snoRNA reads, resulted in an average of ~2 million ribosome footprint reads uniquely mapped to the GRCh38 human genome and ~32K reads mapped to the spike-in oligo sequences (S2 Table in S2 File). Consistent with prior publications, we found more than 80% of the ribo-seq reads are derived from either ribosomal RNA, transfer RNA, or small nucleolar RNA (S3 Table in S2 File). On the other hand, in comparison to our past experience with ligation based ribo-seq protocol, we found the current set of libraries, generated using a ligation free protocol, to have a higher proportion of sorter-than-25-nt ribo-seq reads, which resulted in an overall lower proportion of uniquely mapped reads (S2, S3 Tables in S2 File). Nonetheless, the ribo-seq reads, when viewed in aggregate, show a clear subcodon periodicity pattern in the coding region at the expected positions (S3 Fig in S1 File). A clear pattern of subcodon periodicity reflects the mechanism of ribosomes decoding messages and indicates the quality of our footprinting data. Using principal component analysis (PCA), we found the stress treatment (separated along the first PC, 31.9% variance) and the cell line identify (separated along the second and the third PC, 13.5% and 11.6% variance, respectively) contributed to the majority of variation in our ribo-seq data (S4 Fig in S1 File). We found no significant differences in the proportion of rRNA reads between libraries prepared with and without the CRISPR depletion step (P = 0.82, Wilcoxon rank sum). Libraries with and without CRISPR treatment were then treated as technical replicates in this study to evaluate the unwanted effects introduced from separate rounds of library preparations (i.e. library batch).

Quantitative properties of spike-in oligo pools

We next evaluate our spike-in oligo pools for their quantitative range and their correlations with expected concentrations. To focus our analysis on quantitated genes, we required, for each GENCODE annotated gene, at least one sequencing read mapped to the gene in each sample of our dataset. With this criteria our sequencing coverage enabled analysis of 12,357 genes. We found that the panel of 16 spike-in oligomers span similar quantitative ranges across samples, which on average covers 97% of the quantitated genes (S5A Fig in S1 File). Without further normalization, we readily observed the expected global reduction in ribosome footprint counts across quantitated genes in the Sodium Arsenite treated samples (S5A Fig in S1 File and Fig 1D). Overall we observe strong positive correlations between spike-in sequencing counts and their corresponding nominal concentrations (median Spearman’s rho at 0.87, ranging between 0.81 to 0.92; S5B Fig in S1 File). Importantly, we found no significant effect from treatment conditions (P = 0.86, ANOVA) nor from technical replication batches (i.e. library batches, P = 0.19, ANOVA) on the correlation between the observed and the expected quantification levels for spike-in oligo pools. On the other hand, we noticed that the oligos ordered from two separate manufacturing batches show clear batch effects (S5C Fig in S1 File; P = 1.43e-4, ANOVA); indicating the importance of considering the batch effect from manufacturing, especially if the oligomers were intended for use in a standard curve.

Quantitative properties of individual spike-in oligo

To evaluate the quantitative properties of each individual oligomer across samples, we first evaluate the monotonicity of oligomer quantification (i.e. whether we observe the same ranking between the expected, based on concentration, and the observed quantification). Using Spearman’s Rho, we found all spike-in oligos used in this study were monotonic. We next evaluated the Pearson correlation between the expected, based on concentration, and the observed quantification level. Overall We found strong correlations between the expected and the observed (median 0.94, interquartile range 0.92~0.97). On the other hand, for spike-in oligos used in the lower concentration range, we observed a decrease in correlation between the observed and the expected (Fig 2A). This decrease in correlation is likely reflecting the increased coefficient of variation (i.e. increased variance relative to the mean) commonly observed in RNA-seq data at the low count region (i.e. an over-dispersed Poisson).

Download:

Fig 2. Quantitative properties of spike-in oligomers.

(a) Correlations (Pearson) between the observed quantification and expected for each spike-in oligomer is plotted against the acrosspool-average-expected concentration of the spike-in oligomer. Note the downward trend towards the lower concentration range. Blue loess trend line and its corresponding 95% confidence interval (shaded area) indicated oligomer A2 (highlighted in red) as an outlier. (b) Observed log2 fold change for spike-in constructed true positives plotted against the expected. Each data point represents the fold difference of the same oligomer between 2 different spike-in pools. The blue trendline and shaded area represents the linear regression coefficients and corresponding 95% confidence intervals calculated using the expected log fold change as predictor. (c) Observed log2 fold change for spike -in constructed true positives plotted against the expected concentrations of the quantifications used as the denominator for the observed log2 fold change calculation. The horizontal lines mark the expected log2 fold change. Each data point is represented by corresponding oligomer identification abbreviations and color coded in the same way as the expected fold change lines. Note the ratio compression at the lower concentration range.

https://doi.org/10.1371/journal.pone.0294308.g002

Of note, we found oligo A2_752 to be an outlier of the observed trend (Fig 2A). A Pearson correlation between the observed quantification and the expected based on input concentration for A2_752 is 0.54, which fell far below the 95% confidence interval of a loess fit adjusting for the concentration range effect. Upon further exploration of the properties of oligomers, we found A2_752 to have the highest minimum free energy of 0 among the 16 spike-in oligomers. On the other hand, we found no significant association between oligomer minimum free energy and the Pearson correlation between the observed and the expected oligomer quantification (P = 0.60, S6 Fig in S1 File).

Next, for each spike-in oligomer we compare the observed between-sample fold change to the expected fold change, which is calculated based on our modified Latin square pooling design (Fig 1A). Overall, we found a strong positive correlation with a small ratio compression effect in the observed fold change relative to the expected (Fig 2B). When using the expected fold change of spike-in oligomers in a linear model to predict the observed fold change, we found a regression slope of 0.87 +/- 0.023, which is close to but significantly different from the expected regression slope of 1 (P = 2.28e-6). This ratio compression effect appears to be stronger in the lower concentration range (Fig 2C). In fact, by stratifying data into three groups of the concentration range covered in our study, we found a progressive increase in ratio compression in the lower concentration group (S7 Fig in S1 File), which points to the possibility of an additive process such as consistent pipetting bias during serial dilution as the cause of the observed ratio compression. On the other hand, the overall strong correlation observed without normalization provides strong support for the high technical consistency of our experimental operations and our choice of loading reference.

Spike-in based data normalization preserved the expected global shift in quantification while global scaling normalization produced high level of false positives

It has previously been shown that a standard-curve-based-normalization approach using control oligos is technically challenging [17]. Consistently, evaluation of our spike-in oligos has shown both manufacturing batch effect and ratio compression effect, each as a practical example of the technical challenge of a standard-curve-based approach for absolute quantification. Alternatively, Risso et al. have developed a factor-analysis-based approach—RUV, which uses “control genes” for normalizing RNA-seq data [1]. Here using our spike-in oligomers as control genes we applied RUVg for an external-standard-based normalization of our ribo-seq stress response dataset and compared the results with those that we generated from applying a popular global-scaling-normalization approach, TMM [2]. Note that as a result of adapting the RUVg approach, despite using external standards, our approach doesn’t provide absolute quantification, instead it provides between condition fold changes (i.e., relative quantification), which is estimated from sequencing counts after accounting for the unwanted factors.

The first, and often challenging, step of applying RUVg for data normalization is to determine the number of unwanted variables (k) to remove. As a factor analysis method, RUVg approaches the data normalization issue by identifying unknown factors that contribute to the inter-sample distribution differences in quantification level. These nuisance variables, such as variation in sequencing depth, technical differences between batches or between sequencing centers, that contribute to unwanted variation are termed unwanted variables (k). While RUVg provides a formal procedure for estimating these unwanted variables, it is not always clear how many of these unwanted variables should be removed. Removing too many k could end up inadvertently removing biological variations and, in some designs, overfit the data.

Our external spike-in provides a principled approach to determine k. We evaluated the impact of removing unwanted variables on the correlation between the observed spike-in quantification and the expected. By incrementally removing unwanted variables, we found that the increased correlation between the observed and the expected spike-in quantification started to plateau at k = 3. We therefore decided on removing the top 3 unwanted factors, which increased the correlation between the observed spike-in quantification and the expected from R² = 0.73 to 0.83. This choice of k is robust against the exact composition of spike-in oligomers used for normalization (S8 Fig in S1 File). Using spike-in based RUVg normalization, we found the resulting normalized data to preserve the expected global reduction in ribosome occupancy level (Fig 1E). In contrast, when applying TMM normalization to the same dataset, the expected global shift is lost (Fig 1C).

The loss of an expected global shift will clearly result in false negatives. On the other hand, we suspect the same underlying shift in expression level created by TMM method could also introduce false positives. To evaluate the extent of false positive discoveries in such a scenario, we constructed true negative comparisons using quantifications for the same oligos between samples receiving the same pool of spike-in oligomer mixtures (black brackets in Fig 1A) and performed tests for differential expression between control and treated (e.g. comparing D1 quantifications between C505 and E193 samples; Fig 1A). We compared the number of discoveries between unnormalized data, TMM normalized data, and RUVg normalized data. At a P value cutoff of 0.1, which is expected to result in 10% false positives by chance, we found ~20% false positives from tests using either unnormalized data or RUVg normalized data. Conversely, at the same 10% cutoff, using TMM normalization we found 42 false positives out of a total of 48 tests (87.5%). The extremely high proportion of false positives identified using TMM normalized data is in clear contrast to the proportion of false positives identified either using unnormalized data or RUVg normalized data (S9 Fig in S1 File).

Global scaling normalization introduced false positives in stress response dataset by distorting fold change

To identify the underlying cause of false positive and false negative results introduced by TMM normalization of our stress response dataset, we used the input concentration of spike-in oligomers to construct true positive comparisons with expected fold changes. More specifically, we used quantifications of the same spike-in oligomer from cell lines derived from the same individual but received different pools of spike-in oligomer mixtures (see red brackets in Fig 1A for an example) and performed tests for differential expression between control and treated (e.g. comparing D1 quantifications between C505 and E505 samples). Using the expected fold change calculated from known oligomer concentrations for the true positives, we found TMM normalization to clearly distort the fold change (Fig 3A and S10 Fig in S1 File). When modeling the observed log2 fold change in TMM normalized spike-in quantification using the expected log2 fold change as the predictor, we found TMM normalization to shift the log2 fold change of spike-in quantification from the expected by 1.612 +/- 0.119 (i.e. a systematic upward shift) while maintaining a regression coefficient of 0.824 +/- 0.038 and an r-squared of 0.910 (P < 2e-16; Fig 3A). In contrast, only -0.082 +/- 0.105 log2 deviations from the expected fold change were observed for unnormalized data, with a comparable regression coefficient of 0.832 +/- 0.033 and an r-squared of 0.930 (Fig 3A).

Download:

Fig 3. Impact of TMM normalization and spike-in based RUVg normalization on expression quantification.

(a) Observed log2 fold change for spike-in constructed true positives plotted against the expected; comparing between results from different normalization approaches. Data points represent mean plus/minus standard errors calculated from each group of true positives that share the same expected log2 fold change. Black data points (i.e. "Original”) are results from log2 transformed counts without further normalization. Black line indicates the ideal correlation (i.e. an intercept of 0 and a slope of 1). (b) A volcano plot showing the relationship between fold change (treatment versus control) and p value from differential expression tests for RUVg normalized data. Data points are color-coded in green for spike-in oligomers, in red for endogenous genes that are significantly differentially expressed at 5% FDR, and in blue for endogenous genes that are not differentially expressed. (c, d) Venn diagrams summarizing the number of shared and distinct differentially expressed genes found with and without normalization. Unnormalized (unnorm) in blue. Normalized (either TMM or RUVg) in pink. Numbers labeling each area indicate the number of genes belonging to each group, with the size of the area drawn in proportion to the size of the group. (e) Spike-in based RUVg normalization results are robust against the exact set of spike-in oligomers used as control genes. Boxplots comparing between the across sample biological CV to the across subsampling technical CV for endogenous genes. The technical CV here is calculated based on the normalized quantification level of endogenous genes across 10 iterations of spike-in oligomer subsampling for control gene selection.

https://doi.org/10.1371/journal.pone.0294308.g003

The distortion in fold change estimates observed in TMM normalized data explains the high false positive rates observed in the aforementioned true negative control comparisons and indicates the possibility of introducing false negatives by the global scaling normalization approach shifting true differences towards zero. On the other hand, RUVg normalization appears to have nudged the observed fold change towards the expected (Fig 3A), which resulted in a slight increase in the regression coefficient, from 0.832 (+/- 0.033) of the unnormalized to 0.885 (+/- 0.039) and the estimated intercept changed from -0.082 (+/- 0.105) of the unnormalized to -0.044 (+/- 0.123), both parameter values inching closer to the expected (i.e. a slope of one and an intercept of zero).

Spike-in based RUVg normalization recovered 99% of the discoveries from unnormalized data and identified an additional 5% of differentially translated genes

After evaluating the impact of normalization on spike-in oligomer quantification, we next evaluated the impact of different normalization procedures on the 12,357 endogenous genes that were sufficiently quantitated. We compared test results for identifying differentially translated genes between control and treated samples across datasets normalized using either a spike-in based RUVg normalization or TMM normalization. In these comparisons, we evaluate two main aspects of data normalization impact, namely, improvement in detection power and introduction of biases in effect estimates. Raw data collected without biased manipulations reflects true effects buried in both technical and biological noises. In other words, assuming no biased manipulations involved, effects estimated from unnormalized data are accurate but imprecise. Data normalization, when applied appropriately, should increase power without introducing bias. Using spike-in based RUVg normalized data, at 5% FDR, we identified 9,912 stress-induced differentially translated genes. Consistent with the observed global reduction in translation activities, 96% of stress induced changes were down regulations (Fig 3B). When compared with test results from unnormalized data, RUVg normalization replicated 99% of the discoveries (i.e. 9,420 out of 9,516) and identified an additional 492 genes (~5%) (Fig 3C); a moderate increase in power. In contrast, using TMM normalization, at 5% FDR, we identified 7,421 stress-induced differentially translated genes. Of these differentially translated genes only 48% were down regulations (S11B Fig in S1 File). When compared with test results from unnormalized data (Fig 3D and S11 Fig in S1 File), TMM normalization only replicated 51% of the discoveries (i.e. 4,806 out of 9,516). On the other hand, 2,615 of the differentially translated genes found in TMM normalized data (i.e. 35% of TMM discoveries) were not found in test results derived from the unnormalized data (Fig 3D). The rather limited overlap between differentially translated genes identified from unnormalized data and the ones identified from TMM normalized data indicates potential biases introduced from TMM normalization.

Global scaling normalization of stress response datasets leads to high rates of false discoveries among endogenous genes

Following our observation with spike-in oligomers, we consider the possibility that TMM normalization introduces biases among endogenous genes through the same upward shift of quantifications in treated samples (Fig 3A). Indeed, for genes that were found differentially translated in unnormalized data but not in TMM normalized data (i.e. potential false negatives), we found TMM normalization to shift the log2 fold change towards zero, which resulted in a shift of log2 fold change by 1.733 +/- 0.001 (treated samples relative to control sample; S12A Fig in S1 File). This upward shift is comparable in magnitude to the upward shift observed in spike-in oligomers (1.612 +/- 0.143) (S10 Fig in S1 File). This systematic shift is not limited to those potential-false-negative genes, rather the systematic shift can be clearly visualized when we plot the log2 fold change of TMM normalized data against the unnormalized data for each individual gene across all quantitated genes (S12B Fig in S1 File).

In addition to introducing false negatives, a systematic upward shift could also produce false positives (i.e. by shifting true negatives away from zero). In such a scenario, these differences in discoveries between TMM normalized data and unnormalized data were resulted from biased effect estimates instead of a difference in statistical power. In other words, we do not expect relaxing significance cutoffs to recover these mismatches. Consistent with our expectations, for differentially translated genes found in TMM normalized data (at 5% FDR) but not in unnormalized data, we found limited recovery from relaxing significance cutoffs for unnormalized data (S12C Fig in S1 File). For example, when relaxing significance cutoff for testing differential expression in unnormalized data to 20% FDR, we recovered only 25.6% of the TMM-normalization-specific findings. Importantly, amongst the recovered genes, 69.4% had an opposite direction of effect between datasets indicating that for these genes TMM normalization distorted the original effect to an extent that flipped the sign. In contrast, for RUVg normalization, we found a much higher recovery rate (S12D Fig in S1 File). Using 20% FDR for the unnormalized data, we recovered 86% of the normalization specific findings. All of the recovered effects were found in the same direction between the unnormalized data and RUVg normalized data. Taken together, these results highlight the issues in using a global scaling approach, such as TMM, to normalize ribo-seq datasets in scenarios of an expected global shift in mRNA translation, such as acute cellular stress response.

Spike-in-based RUVg normalization results are consistent regardless of the exact combinations of spike-in oligomers used

To evaluate the robustness of spike-in-based RUVg normalization, we took a subsampling approach. For each iteration we randomly subsampled half of our spike-in quantification data as control genes. We compared RUVg normalization results across ten iterations of subsampling. We first consider the variation in normalized quantifications of spike-in oligomers across iterations of subsampling. For the spike-in comparisons designed to have expected fold changes (i.e. true positives) we found the observed fold changes to have a tight range of values across subsampling iterations (S13A, S13B Fig in S1 File). The deviation of the observed log2 fold change from the expected is centered at zero with the percentage of deviation consistently at around 30% across iterations (33.8% +/- 0.3% of the expected). Similarly when using the expected fold change as the predictor in a linear model for the observed, we found rather consistent regression coefficients across subsampling interactions and the intercept to center at zero (regression coefficients: 0.888 +/- 0.002, intercept:-0.016 +/- 0.027) (S13C, S13D Fig in S1 File). For the true negatives, we found the distribution of fold change to consistently center near the expected zero across iterations (S14A Fig in S1 File) and the proportion of false positive discoveries to consistently fell at ~10% above the expected (S14B Fig in S1 File), which is similar to the results from normalizing using the full set of spike-in oligomers (S9C Fig in S1 File). Qualitatively similar, albeit noisier, results were observed when analyzing only spike-in quantification data that were not included as control genes for normalization (fold change: -0.021 +/- 0.045, proportion false positives: S14C Fig in S1 File).

We next consider the variation in normalized quantifications of endogenous genes across iterations of subsampling. We found an average CV of 14.7% across subsampling iterations (interquartile range: 10% to 17.7%) and the CV decreases steadily as a function of quantification levels (i.e. log2 counts; S15A Fig in S1 File). These properties are similar to the technical variations observed in published RNA-Seq data [30]. When comparing the subsampling CV to the across-sample-biological CV (i.e. variations mainly originated from the between treatment group differences) we found the across-sample-biological CV to be more than 4 times higher than the subsampling CV (average at 64.7% vs. 14.7%; P < 2.2e-16; Fig 3E). When limiting the CV comparison to genes with low quantification levels (here we set the quantification cutoff at an average count of less than 32), from which we often see strong overdispersion, we found similar large differences between the biological CV and the subsampling CV (58.5% vs. 15.4%; P < 2.2e-16; S15B Fig in S1 File). The large difference between biological CV and subsampling CV indicated that subsampling variations likely had limited contribution to false discoveries, which is in agreement with the stable false positive rates observed across iterations of subsampling (S14 Fig in S1 File). Taken together these results indicated that the spike-in based normalization approach developed here is robust against the exact combination of spike-in oligomers used for the purpose of identifying stress induced differences in mRNA translation.

Discussion

We developed a panel of 16 spike-in oligomers and a corresponding pooling scheme for applications in ribosome profiling studies to identify acute cellular stress induced changes in mRNA translation across the transcriptome. To evaluate the utility of this set of spike-in oligos, we performed ribosome profiling experiments to identify Sodium Arsenite induced changes in translation level from three LCLs in two conditions, each with two library preparation replicates. Note that the changes in ribosome occupancy level observed here is used as a proxy for the changes in the level of translation, which reflects the combined effect of both transcriptional and translational regulation of gene expression.

Evaluation of spike-in oligomers overall found strong positive correlations between the observed and the expected. At the same time, we identified one oligomer, A2, as potentially problematic. Because of the fact that our design requires a consistent length and has a rather homogeneous base composition across oligomers, we were unable to identify length bias or sequence features associated with unwanted effects. On the other hand, our spike-in oligomers were designed to span a spectrum of minimum free energy (i.e. a numeric proxy for oligomer folding structure) that resembles endogenous miRNAs, which allows us to evaluate the correlations between minimum free energy and the quantitative properties of the spike-in oligomers. We found no significant association between oligomer minimum free energy and the correlation between the observed and the expected oligomer quantification. A lack of correlation observed between spike-in performance and minimum free energy is reassuring, in that the quantification level is in general not biased by the minimum free energy of RNA fragments. On the other hand, we found the outlier A2 oligomer to have the highest minimum free energy of zero among the 16 spike-in oligomers. Unfortunately, because A2 is the only spike-in oligomer with a minimum free energy of zero, we were unable to conclude if a minimum free energy of zero led to the lower correlation between the observed and the expected.

Our analyses on the quantitative property of spike-in oligomers revealed two major unwanted effects, a manufacturing batch effect and a ratio compression effect.

Based on the observed pattern of manufacturing batch effect, we postulate that the between batch variations in either the quantification process from the manufacturer or variations in preparing the stock solution when we first received the oligomers (e.g. different sets of pipetman used or pipetman calibration shifted over time) were the likely culprit. The progressive nature of ratio compression led us to postulate that a consistent pipetting error could have resulted in the observed compression, which is more pronounced at the lower concentration range. These unwanted technical effects will clearly have a negative impact on a study relying on a standard-curved-based absolute quantification approach. On the other hand, their potential impact on results from a factor-analysis-based relative quantification approach is expected to be limited: unwanted effects that are not shared by endogenous genes, especially the ones that are properly randomized across the effect of interest, such as the oligomer manufacturing batch effect and ratio compression effect observed here, will have limited contribution to the test results when fitted as covariates in the linear model for differential expression.

In contrast to features that are unique to spike-in oligomers, features that are shared between spike-in oligomers and endogenous genes are the critical attributes forming the basis of spike-in-based RUVg normalization. RUVg has previously been used for spike-in-based normalization [1]. Risso et al. have shown that using ERCC spike-in as control genes for RUVg normalization appeared to outperform other popular methods, despite the fact that certain key assumptions of the RUVg normalization method were violated [1]. Following work from Risso et al. we used RUVg to perform spike-in-based data normalization. In the RUVg framework, these spike-in oligomers are treated as control genes and the differences in quantification levels found between samples in these control genes are used to estimate unwanted factors. A key assumption made by this approach is therefore that the spike-in oligomers shared the same impact from the unwanted factors with the endogenous genes. As an example of unwanted effects, we visualized the impact of library preparation batch effects comparing between spike-in oligomers and endogenous genes (S16 Fig in S1 File). Overall, we found a rather similar trend between spike-in oligomers and endogenous genes, with the exception of a bump in the middle of the distribution (S16 in S1 File), which could potentially be attributed to sampling variation.

Applying RUVg normalization to our stress response ribo-seq dataset, using our 16 spike-in oligomers as control genes, preserved the expected global reduction in mRNA translation in response to stress treatment and slightly increased power for detecting differences. Although the power increase was rather limited, RUVg normalization increased accuracy in fold change estimates, as was indicated by the increase in regression coefficient (i.e. approaching the expected value of one) of a linear model fit between the observed and the expected fold changes for control spike-in oligomers. This regression coefficient increase was observed consistently across subsampling from groups of spike-in oligomers. As the first study evaluating the panel of spike-in oligomers, we paid careful attention to avoid introducing biases. Importantly, we did not attempt to increase sequencing coverages only for libraries that appear to have lower coverages (a rather common practice for sequencing studies). This is especially relevant for our study, because most (if not all) of the low coverage libraries are from the treated group; by submitting only libraries from the treated group for additional rounds of sequencing we will introduce bias in the data. Our careful experimental design and operation in conjunction with the large effect from stress treatment likely resulted in the majority of differentially translated genes readily detectable without normalization, which could explain the limited power increase from RUVg normalization. On the other hand, deliberately introducing sequencing coverage differences between samples in future studies will be useful for evaluating the robustness of our spike-in-based RUVg normalization approach against such bias.

A fundamental feature of our study design is the expected fold change of the same oligomer between samples receiving different pools of spike-in oligomers. This expected fold change (or the lack of it) is determined during spike-in pool preparation and this determination is independent of the samples and the downstream experimental conditions. We used the expected fold changes to construct true negatives and true positives, which we used as the yardstick to evaluate our results from quantitating spike-in oligomers. An underlying assumption of this approach is an unbiased (e.g., equal) loading of spike-in oligomer pools between samples. An important decision we made for this study is therefore the loading reference used to determine the amount of spike-in pools to add to each sample. Given that we are working with cultured cells and are testing for differences in cellular response to acute stress, a straightforward choice for loading reference would be the number of cells in each sample. However, our preliminary results indicated high variability between cell counts. Instead, we used the amount of digested macromolecular RNA as our loading reference, which assumes no stress-induced macromolecular RNA turnover within the 30-min treatment time window. Our choice of loading reference is supported both by the literature and by our results from quantitating the between-pool fold change for each spike-in oligomer, which, as mentioned above, is independent of the choice of loading reference. Although, in theory, the amount of loading control used could be determined in absolute scale (i.e. final molar concentration of spike-in oligomers) and the between sample loading variation could then be estimated as unwanted factors by RUVg. In reality, how such an approach would perform requires further investigation with a different study design.

Our spike-in oligomers, when used as control oligos, clearly demonstrated the dramatic consequences of using a global scaling normalization approach to transform ribo-seq data collected from an acute cellular stress response study. When evaluating the expected true negatives constructed based on our spike-in pooling design, we found an extremely high proportion of false discoveries in TMM normalized data. In contrast, in data without normalization or normalized using the spike-in-based factor analysis approach we found the proportion of false discoveries to consistently fall slightly above the expected. When evaluating the expected fold changes of true positives, we found TMM normalization to shift fold changes based on differences in sequencing coverage between libraries. In the current dataset, such shifts ended up creating both false positives and false negatives. We want to emphasize here that the troubling results observed from TMM normalization does not indicate problematic behaviors of the TMM normalization method per se. Instead, it reflects the consequences of violating an important underlying assumption for all global scaling normalization approaches, i.e. any systematic shifts were assumed to be experimental artifacts. In contrast, a factor-analysis-based normalization approach doesn’t stipulate such an assumption. In our analysis, using spike-in oligomer quantification as controls genes, RUVg normalization maintained the expected global reduction of ribosome occupancy level in the treated samples, despite heavily transforming the data (i.e., removing the top three unwanted factors).

In order to keep the linear model parameterization consistent across comparisons, in our differential expression tests, we extracted normalized counts from RUVg as input data. While we designed our study this way to keep a fair comparison between normalization approaches, for studies focusing on identifying genes that are differentially translated, we suggest following RUVg authors’ advice to fit these unwanted factors as covariates. By extracting the normalized counts (i.e. equivalent to regressing out the unwanted factors), one risks potentially removing biologically relevant variations. These considerations are not unlike the ones to contend with when using a typical linear modeling approach for adjusting batch effects. Similarly, when deciding on the number of k to fit as covariates, the degrees of freedom available from the dataset could put some constraints on the available options. The costs and benefits of including additional samples should therefore be carefully considered during study design in order to enable effective factor-analysis-based normalization.

While our study aimed specifically to address the unmet need of external standards for transcriptome-wide profiling of translational regulation in stress response, the exceedingly high proportion of false positives observed as a result of applying TMM normalization to our oxidative stress response ribo-seq dataset and the almost ubiquitous use of global scaling normalization approaches in the field of genomics prompted the following unsettling question: How common have such error occurred without the researchers acknowledging it? It is easy to envision scenarios of subtler global shifts in gene expression for studies comparing between developmental stages or testing for drug treatment effects. Without appropriate use of external standards, these subtle shifts could escape researchers and the subsequent application of global scaling normalization will lead to distortion of effect size, which, as we have shown, results in both false positives and false negatives. On the other hand, while an appropriate application of external standards could safeguard against such pitfalls, further studies to identify effective approaches for determining the amount of spike-in oligomers to use for each sample is needed. Nonetheless, a collective consciousness in reducing experimental biases and a wide adoption of external spike-in oligomers could enable future retrospective meta-analyses to provide insight.

Materials and methods

Cell culture and oxidative stress

Three lymphoblastoid cell lines (LCLs) (GM18505, GM19193, and GM19204), each derived from a separate Yoruba people from Nigeria, were purchased from Coriell Institute for Medical Research (NIGMS Human Genetic Cell Repository). The cells were maintained at 37°C with 5% CO₂ in RPMI media supplemented with 15% FBS, 2 mM L-glutamate, 100 IU/ml penicillin, and 100 μg/ml streptomycin, in accordance with instructions provided by Coriell. Of note, cell cultures were vigilantly maintained at a cell density between 600,000 to 700,000 cells/ml to avoid inadvertent induction of stress response. To induce oxidative stress, cells were treated with 134 μM (i.e. the final concentration in cell culture) Sodium Arsenite (NaAsO₂; Sigma-Aldrich, Cat # S7400), a heavy metal oxidative stressor, for 30 minutes in otherwise the same cell culture conditions. For the control group, nuclease free water was used in place of the Sodium Arsenite solution. After treatment, cells were pelleted by centrifugation at 100g for 10 min and washed twice with cold PBS (4°C). Cell pellets were flash frozen in liquid nitrogen and stored at -80°C.

Western blot

Proteins were prepared from flash frozen cell pellets using M-PER Mammalian Protein Extraction Reagent (Thermo Scientific, Cat # 78503) following vendor’s instructions. Protein concentration was estimated using bicinchoninic acid (BCA) assay (Pierce™ BCA Protein Assay Kit, Cat# 23227) and 5 μg of proteins from each sample were used for western blot. All antibodies (e.g. anti eIF2α, anti Phospho-eIF2α) used were purchased from Cell Signaling Technologies (S2 Table in S2 File). XCell II Blot module was used for wet electrophoretic transfer of proteins from SDS-PAGE gel to nitrocellulose membrane (Biorad, Cat # 1620145, 0.45 μm) in Tris-Glycine Electroblotting buffer (National Diagnostic, Cat # EC-880). ProtoBlock Solution (National Diagnostics, Catalog number: CL-252) was used for blocking the nitrocellulose membrane following manufacturer’s instructions. After wet transfer, membranes were incubated with ProtoBlock solution for 1 hour at room temperature before overnight incubation with primary antibodies at 4°C (in blocking solution). After primary antibody incubation, membranes were washed in TBST buffer for three times (5 minutes each) before incubation with HRP conjugated secondary antibody for 2 hours at room temperature (in blocking solution). After secondary antibody incubation, membranes were washed three times in TBST buffer for 5 minutes each. ProtoGlow ECL (National Diagnostics, Cat # CL-300) and HyBlot CL autoradiography films (Thomas Scientific, Cat # NC1601219) were used for signal detection. Quantification of digital images was performed using ImagJ.

Spike-in oligomers

Spike-in oligomer sequences were designed following Lutzmayer et al. [28]. While we followed the design principle and the programming scripts developed by Lutzmayer et al. a few key aspects were modified to suit our purpose of using these oligomers as external standards in ribosome profiling experiments for human samples. Instead of mimicking the free energy profile and base composition of Arabidopsis miRNA, our design mimics human miRNAs. In addition, we extended the oligomer length to 28 nts with flanking random tetramers to mimic ribosome footprint length. We generated/selected 1000 permutations of 20 nts RNA sequences that have a base composition resembling human miRNA (calculated based on a high confidence set of 896 miRNA sequences downloaded from miRBase [31]). Seven of these permutation sequences mapped to the human genome and were therefore removed. For the remaining 993 permutations of RNA sequences, we added random tetramers to each end of the sequence, which resulted in 993 sets of 65,536 sequences. Using RNAfold [32], we determined the minimum free energies of all sequences. Based on the minimum free energy profile, we selected a total of 16 sequences (S1 Table in S2 File), each from a different set, to produce spike-in oligomers for the current study. These 16 spike-in oligomers were purchased in two separate batches (S1 Table in S2 File) from Sigma Aldrich and resuspended in 10 mM Tris (pH 8) to a stock concentration of 100 μM based on the quantifications provided by the vendor. To create the final spike-in pools used in experiments, four spike-in mixes (A, B, C, D) each composed of 4 different oligomers (1,2,3,4) in eight-fold concentration increments were created separately through 2-fold serial dilutions. The four mixes (A, B, C, D) were then combined in a defined ratio of (2%,6%,22%,70%) in 3 permutations to create 3 different spike-in pools each have individual oligomers in different concentrations but together covering the same concentration range (Fig 1A). The working solution for spike-in pools was prepared in a weight concentration of 50 pg/μl, which we found convenient for an application of 50 pg spike-in pool for each 1μg of digested macromolecular RNA before gel isolating the ribosome footprints.

Ribosome profiling

Ribosome footprint profiling experiments were performed following the ligation free protocol described in Hornstein et al. [33] with a few specific modifications made for the current study. Key steps include, RNase I digestion to generate ribosome protected fragments (100 U per 200 μl of cell lysate), size exclusion spin column (Sephacryl S400, GE: 27-5140-01) isolation of ribosomes, spike-in oligomer control addition, gel isolation of ribosome footprints, sequencing library construction using Clontech SMARTER smRNA SEQ KIT (Fisher Scientific: NC1098027), rRNA depletion using subtraction oligo [34], and finally PCR amplified Indexed libraries were pooled to sequence on an Illumina HiSeq 4000. For incorporating spike-in oligomers, for each sample, 50 pg of spike-in pool was used for each 1μg of digested macromolecular RNA.

CRISPR/Cas9 targeted depletion of rRNA

CRISPR/Cas9 mediated rRNA removal was performed following Han et al. [35] and Mito et al. [36]. Ten target-specific oligomers for rRNA (see sequences in Han et al. [35]) were ordered from Sigma Aldrich and separately PCR amplified to prepare each oligomer as the template for in vitro transcription (T7-Scribe™ Standard RNA IVT, CELLSCRIPT, Cat# C-AS3107) to produce guide RNAs. CRISPR/Cas9/gRNA solution for the targeted removal of rRNA sequence was prepared and applied to ribo-seq libraries following Mito et al. [36].

Data processing

Sequencing read processing and mapping.

Sequence reads were processed and mapped to the human genome following a modified procedure used by Ingolia et al. [34]. Before aligning to the genome, the adapter and polyA sequences as well as 4 nucleotides at the 5’ end were removed from each read using FASTX-Toolkit. Processed reads mapped to a reference FASTA file composed of human rRNA, tRNA and snoRNA sequences were removed. The remaining sequence reads were mapped to the GRCh38 human genome using TopHat2 [37] with splice-junction information from GENCODE GTF (release 37). The mapping procedure allowed a maximum of 2 mismatches and only uniquely mapped reads were retained. Levels of mRNA translation were estimated by counting the number of ribosome profiling reads aligning to each gene based on GENCODE annotation (i.e. the ENSG entries) using featureCounts [38].

Data filtering and transformation.

To focus our analysis on sufficiently quantitated genes, we only analyzed genes containing at least one sequencing read from each sample, which resulted in a dataset including 12,357 genes (Data S1). For spike-in oligomers, all available quantifications were included in the analyses. Unless otherwise specified, before downstream analyses, count data were log2 transformed after the addition of 0.25 pseudo-count to avoid creating singular values in log scale.

Analyses.

Statistical analyses were performed in the R statistical computing environment (version 4.0.4). Linear regression analyses were performed using the lm() function. Spearman and Pearson correlations and corresponding tests were performed using cor() and cor.test(). Student’s t-Test were performed using t.test(). Wilcoxon Rank Sum tests were performed using wilcox.test().

Normalization

TMM normalization was performed using the cpm() function from the edgeR package [39] with effective library size calculated using the calcNormFactors() function. Default parameters were used for the trimmed mean of M value calculation.

RUVg normalization was performed using the RUVSeq package [1]. Factor analysis for identifying unwanted variables was performed by applying the RUVg() function on raw ribo-seq counts using the default settings with the spike-in oligomers as control genes. Note that the factor analysis was done on count data after filtering out genes that do not have at least one count per sample, but without the commonly applied preprocessing of upper quartile normalization, which could remove biologically meaningful coverage differences resulting from the stress treatment. RUVg normalized counts were extracted using the normCounts() function for downstream analyses to visualize the impact of normalization.

Differential expression tests

Differential expression analyses were performed using the limma package [40]. After processing with various normalization and transformation procedures, functions lmFit() and eBayes() were applied sequentially to perform differential expression tests under a linear modeling framework. When testing for treatment effect for endogenous genes, both library construction batch and donor identity were fitted as covariates in the linear model. More specifically, for each gene, expression level E across sample j is modeled by the treatment effect T as the predictor and the library batch effect B and the cell line effect C as covariates in the following equation: Where μ is the intercept term and ε is the residual. To identify differentially expressed genes, we tested the null hypothesis of β₁ is zero using the empirical Bayes moderated t-statistics from the eBayes() function. False discovery rates were calculated using the Benjamini-Hochberg procedure.

Supporting information

S1 File. Supplemental S1 to S16 Figs.

https://doi.org/10.1371/journal.pone.0294308.s001

(PDF)

S2 File. Supplemental S1-S3 Tables.

https://doi.org/10.1371/journal.pone.0294308.s002

(PDF)

S1 Data. Ribo-seq count table for all genes analyzed.

https://doi.org/10.1371/journal.pone.0294308.s003

(XLSX)

Acknowledgments

We thank members of the Wang lab for helpful discussions. The University of Chicago Genomics Facility for collecting the sequencing data. The journal editors and reviewers for their helpful comments.

References

1. Risso D., Ngai J., Speed T. P. & Dudoit S. Normalization of RNA-seq data using factor analysis of control genes or samples. Nat Biotechnol 32, 896–902 (2014). pmid:25150836
- View Article
- PubMed/NCBI
- Google Scholar
2. Robinson M. D. & Oshlack A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biology 11, R25 (2010). pmid:20196867
- View Article
- PubMed/NCBI
- Google Scholar
3. Leek J. T., Johnson W. E., Parker H. S., Jaffe A. E. & Storey J. D. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics 28, 882–883 (2012). pmid:22257669
- View Article
- PubMed/NCBI
- Google Scholar
4. Stegle O., Parts L., Piipari M., Winn J. & Durbin R. Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses. Nat Protoc 7, 500–507 (2012). pmid:22343431
- View Article
- PubMed/NCBI
- Google Scholar
5. Bullard J. H., Purdom E., Hansen K. D. & Dudoit S. Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments. BMC Bioinformatics 11, 94 (2010). pmid:20167110
- View Article
- PubMed/NCBI
- Google Scholar
6. Dillies M.-A. et al. A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis. Brief Bioinform 14, 671–683 (2013). pmid:22988256
- View Article
- PubMed/NCBI
- Google Scholar
7. Hansen K. D., Irizarry R. A. & Wu Z. Removing technical variability in RNA-seq data using conditional quantile normalization. Biostatistics 13, 204–216 (2012). pmid:22285995
- View Article
- PubMed/NCBI
- Google Scholar
8. Risso D., Schwartz K., Sherlock G. & Dudoit S. GC-Content Normalization for RNA-Seq Data. BMC Bioinformatics 12, 480 (2011).
- View Article
- Google Scholar
9. Sun Z. & Zhu Y. Systematic comparison of RNA-Seq normalization methods using measurement error models. Bioinformatics 28, 2584–2591 (2012). pmid:22914217
- View Article
- PubMed/NCBI
- Google Scholar
10. Evans C., Hardin J. & Stoebel D. M. Selecting between-sample RNA-Seq normalization methods from the perspective of their assumptions. Briefings in Bioinformatics 19, 776–792 (2018). pmid:28334202
- View Article
- PubMed/NCBI
- Google Scholar
11. Spriggs K. A., Bushell M. & Willis A. E. Translational regulation of gene expression during conditions of cell stress. Mol. Cell 40, 228–237 (2010). pmid:20965418
- View Article
- PubMed/NCBI
- Google Scholar
12. Krishnamoorthy T., Pavitt G. D., Zhang F., Dever T. E. & Hinnebusch A. G. Tight binding of the phosphorylated alpha subunit of initiation factor 2 (eIF2alpha) to the regulatory subunits of guanine nucleotide exchange factor eIF2B is required for inhibition of translation initiation. Mol. Cell. Biol. 21, 5018–5030 (2001). pmid:11438658
- View Article
- PubMed/NCBI
- Google Scholar
13. Baird T. D. & Wek R. C. Eukaryotic Initiation Factor 2 Phosphorylation and Translational Control in Metabolism12. Adv Nutr 3, 307–321 (2012).
- View Article
- Google Scholar
14. Hinnebusch A. G. The scanning mechanism of eukaryotic translation initiation. Annu. Rev. Biochem. 83, 779–812 (2014). pmid:24499181
- View Article
- PubMed/NCBI
- Google Scholar
15. Hinnebusch A. G. & Lorsch J. R. The Mechanism of Eukaryotic Translation Initiation: New Insights and Challenges. Cold Spring Harb Perspect Biol 4, (2012). pmid:22815232
- View Article
- PubMed/NCBI
- Google Scholar
16. Lemire A. et al. Development of ERCC RNA Spike-In Control Mixes. J Biomol Tech 22, S46 (2011).
- View Article
- Google Scholar
17. Jiang L. et al. Synthetic spike-in standards for RNA-seq experiments. Genome Res 21, 1543–1551 (2011). pmid:21816910
- View Article
- PubMed/NCBI
- Google Scholar
18. Lovén J. et al. Revisiting global gene expression analysis. Cell 151, 476–482 (2012). pmid:23101621
- View Article
- PubMed/NCBI
- Google Scholar
19. Ingolia N. T., Ghaemmaghami S., Newman J. R. S. & Weissman J. S. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science 324, 218–223 (2009). pmid:19213877
- View Article
- PubMed/NCBI
- Google Scholar
20. Ingolia N. T. Ribosome Footprint Profiling of Translation throughout the Genome. Cell 165, 22–33 (2016). pmid:27015305
- View Article
- PubMed/NCBI
- Google Scholar
21. Sendoel A. et al. Translation from unconventional 5’ start sites drives tumour initiation. Nature 541, 494–499 (2017). pmid:28077873
- View Article
- PubMed/NCBI
- Google Scholar
22. Hsieh A. C. et al. The translational landscape of mTOR signalling steers cancer initiation and metastasis. Nature 485, 55–61 (2012). pmid:22367541
- View Article
- PubMed/NCBI
- Google Scholar
23. Andreev D. E. et al. Translation of 5′ leaders is pervasive in genes resistant to eIF2 repression. eLife 4,. pmid:25621764
- View Article
- PubMed/NCBI
- Google Scholar
24. Iwasaki S., Floor S. N. & Ingolia N. T. Rocaglates convert DEAD-box protein eIF4A into a sequence-selective translational repressor. Nature 534, 558–561 (2016). pmid:27309803
- View Article
- PubMed/NCBI
- Google Scholar
25. Liu T.-Y. et al. Time-Resolved Proteomics Extends Ribosome Profiling-Based Measurements of Protein Synthesis Dynamics. Cell Systems 4, 636–644.e9 (2017). pmid:28578850
- View Article
- PubMed/NCBI
- Google Scholar
26. Wang Y. J. & Gilbert W. V. Quantitative Comparisons of Translation Activity by Ribosome Profiling with Internal Standards. Methods Mol Biol 2252, 127–149 (2021). pmid:33765273
- View Article
- PubMed/NCBI
- Google Scholar
27. Hoerth K., Reitter S. & Schott J. Normalized Ribo-Seq for Quantifying Absolute Global and Specific Changes in Translation. Bio Protoc 12, e4323 (2022). pmid:35340296
- View Article
- PubMed/NCBI
- Google Scholar
28. Lutzmayer S., Enugutti B. & Nodine M. D. Novel small RNA spike-in oligonucleotides enable absolute normalization of small RNA-Seq data. Scientific Reports 7, 5913 (2017). pmid:28724941
- View Article
- PubMed/NCBI
- Google Scholar
29. Pine P. S. et al. Evaluation of the External RNA Controls Consortium (ERCC) reference material using a modified Latin square design. BMC Biotechnology 16, 54 (2016).
- View Article
- Google Scholar
30. Law C. W., Chen Y., Shi W. & Smyth G. K. voom: Precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 15, R29 (2014). pmid:24485249
- View Article
- PubMed/NCBI
- Google Scholar
31. Kozomara A., Birgaoanu M. & Griffiths-Jones S. miRBase: from microRNA sequences to function. Nucleic Acids Research 47, D155–D162 (2019). pmid:30423142
- View Article
- PubMed/NCBI
- Google Scholar
32. Hofacker I. L. Vienna RNA secondary structure server. Nucleic Acids Res 31, 3429–3431 (2003). pmid:12824340
- View Article
- PubMed/NCBI
- Google Scholar
33. Hornstein N. et al. Ligation-free ribosome profiling of cell type-specific translation in the brain. Genome Biology 17, 149 (2016). pmid:27380875
- View Article
- PubMed/NCBI
- Google Scholar
34. Ingolia N. T., Brar G. A., Rouskin S., McGeachy A. M. & Weissman J. S. The ribosome profiling strategy for monitoring translation in vivo by deep sequencing of ribosome-protected mRNA fragments. Nat Protoc 7, 1534–1550 (2012). pmid:22836135
- View Article
- PubMed/NCBI
- Google Scholar
35. Han P. et al. Genome-wide Survey of Ribosome Collision. Cell Reports 31, 107610 (2020). pmid:32375038
- View Article
- PubMed/NCBI
- Google Scholar
36. Mito M., Mishima Y. & Iwasaki S. Protocol for Disome Profiling to Survey Ribosome Collision in Humans and Zebrafish. STAR Protocols 1, 100168 (2020). pmid:33377062
- View Article
- PubMed/NCBI
- Google Scholar
37. Trapnell C., Pachter L. & Salzberg S. L. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25, 1105–1111 (2009). pmid:19289445
- View Article
- PubMed/NCBI
- Google Scholar
38. Liao Y., Smyth G. K. & Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014). pmid:24227677
- View Article
- PubMed/NCBI
- Google Scholar
39. Robinson M. D., McCarthy D. J. & Smyth G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2010). pmid:19910308
- View Article
- PubMed/NCBI
- Google Scholar
40. Ritchie M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucl. Acids Res. gkv007 (2015) pmid:25605792
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Risso D., Ngai J., Speed T. P. & Dudoit S. Normalization of RNA-seq data using factor analysis of control genes or samples. Nat Biotechnol 32, 896–902 (2014). pmid:25150836
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Robinson M. D. & Oshlack A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biology 11, R25 (2010). pmid:20196867
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Leek J. T., Johnson W. E., Parker H. S., Jaffe A. E. & Storey J. D. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics 28, 882–883 (2012). pmid:22257669
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Stegle O., Parts L., Piipari M., Winn J. & Durbin R. Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses. Nat Protoc 7, 500–507 (2012). pmid:22343431
View Article
PubMed/NCBI
Google Scholar

[14] View Article

[15] PubMed/NCBI

[16] Google Scholar

[ref5] 5. Bullard J. H., Purdom E., Hansen K. D. & Dudoit S. Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments. BMC Bioinformatics 11, 94 (2010). pmid:20167110
View Article
PubMed/NCBI
Google Scholar

[18] View Article

[19] PubMed/NCBI

[20] Google Scholar

[ref6] 6. Dillies M.-A. et al. A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis. Brief Bioinform 14, 671–683 (2013). pmid:22988256
View Article
PubMed/NCBI
Google Scholar

[22] View Article

[23] PubMed/NCBI

[24] Google Scholar

[ref7] 7. Hansen K. D., Irizarry R. A. & Wu Z. Removing technical variability in RNA-seq data using conditional quantile normalization. Biostatistics 13, 204–216 (2012). pmid:22285995
View Article
PubMed/NCBI
Google Scholar

[26] View Article

[27] PubMed/NCBI

[28] Google Scholar

[ref8] 8. Risso D., Schwartz K., Sherlock G. & Dudoit S. GC-Content Normalization for RNA-Seq Data. BMC Bioinformatics 12, 480 (2011).
View Article
Google Scholar

[30] View Article

[31] Google Scholar

[ref9] 9. Sun Z. & Zhu Y. Systematic comparison of RNA-Seq normalization methods using measurement error models. Bioinformatics 28, 2584–2591 (2012). pmid:22914217
View Article
PubMed/NCBI
Google Scholar

[33] View Article

[34] PubMed/NCBI

[35] Google Scholar

[ref10] 10. Evans C., Hardin J. & Stoebel D. M. Selecting between-sample RNA-Seq normalization methods from the perspective of their assumptions. Briefings in Bioinformatics 19, 776–792 (2018). pmid:28334202
View Article
PubMed/NCBI
Google Scholar

[37] View Article

[38] PubMed/NCBI

[39] Google Scholar

[ref11] 11. Spriggs K. A., Bushell M. & Willis A. E. Translational regulation of gene expression during conditions of cell stress. Mol. Cell 40, 228–237 (2010). pmid:20965418
View Article
PubMed/NCBI
Google Scholar

[41] View Article

[42] PubMed/NCBI

[43] Google Scholar

[ref12] 12. Krishnamoorthy T., Pavitt G. D., Zhang F., Dever T. E. & Hinnebusch A. G. Tight binding of the phosphorylated alpha subunit of initiation factor 2 (eIF2alpha) to the regulatory subunits of guanine nucleotide exchange factor eIF2B is required for inhibition of translation initiation. Mol. Cell. Biol. 21, 5018–5030 (2001). pmid:11438658
View Article
PubMed/NCBI
Google Scholar

[45] View Article

[46] PubMed/NCBI

[47] Google Scholar

[ref13] 13. Baird T. D. & Wek R. C. Eukaryotic Initiation Factor 2 Phosphorylation and Translational Control in Metabolism12. Adv Nutr 3, 307–321 (2012).
View Article
Google Scholar

[49] View Article

[50] Google Scholar

[ref14] 14. Hinnebusch A. G. The scanning mechanism of eukaryotic translation initiation. Annu. Rev. Biochem. 83, 779–812 (2014). pmid:24499181
View Article
PubMed/NCBI
Google Scholar

[52] View Article

[53] PubMed/NCBI

[54] Google Scholar

[ref15] 15. Hinnebusch A. G. & Lorsch J. R. The Mechanism of Eukaryotic Translation Initiation: New Insights and Challenges. Cold Spring Harb Perspect Biol 4, (2012). pmid:22815232
View Article
PubMed/NCBI
Google Scholar

[56] View Article

[57] PubMed/NCBI

[58] Google Scholar

[ref16] 16. Lemire A. et al. Development of ERCC RNA Spike-In Control Mixes. J Biomol Tech 22, S46 (2011).
View Article
Google Scholar

[60] View Article

[61] Google Scholar

[ref17] 17. Jiang L. et al. Synthetic spike-in standards for RNA-seq experiments. Genome Res 21, 1543–1551 (2011). pmid:21816910
View Article
PubMed/NCBI
Google Scholar

[63] View Article

[64] PubMed/NCBI

[65] Google Scholar

[ref18] 18. Lovén J. et al. Revisiting global gene expression analysis. Cell 151, 476–482 (2012). pmid:23101621
View Article
PubMed/NCBI
Google Scholar

[67] View Article

[68] PubMed/NCBI

[69] Google Scholar

[ref19] 19. Ingolia N. T., Ghaemmaghami S., Newman J. R. S. & Weissman J. S. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science 324, 218–223 (2009). pmid:19213877
View Article
PubMed/NCBI
Google Scholar

[71] View Article

[72] PubMed/NCBI

[73] Google Scholar

[ref20] 20. Ingolia N. T. Ribosome Footprint Profiling of Translation throughout the Genome. Cell 165, 22–33 (2016). pmid:27015305
View Article
PubMed/NCBI
Google Scholar

[75] View Article

[76] PubMed/NCBI

[77] Google Scholar

[ref21] 21. Sendoel A. et al. Translation from unconventional 5’ start sites drives tumour initiation. Nature 541, 494–499 (2017). pmid:28077873
View Article
PubMed/NCBI
Google Scholar

[79] View Article

[80] PubMed/NCBI

[81] Google Scholar

[ref22] 22. Hsieh A. C. et al. The translational landscape of mTOR signalling steers cancer initiation and metastasis. Nature 485, 55–61 (2012). pmid:22367541
View Article
PubMed/NCBI
Google Scholar

[83] View Article

[84] PubMed/NCBI

[85] Google Scholar

[ref23] 23. Andreev D. E. et al. Translation of 5′ leaders is pervasive in genes resistant to eIF2 repression. eLife 4,. pmid:25621764
View Article
PubMed/NCBI
Google Scholar

[87] View Article

[88] PubMed/NCBI

[89] Google Scholar

[ref24] 24. Iwasaki S., Floor S. N. & Ingolia N. T. Rocaglates convert DEAD-box protein eIF4A into a sequence-selective translational repressor. Nature 534, 558–561 (2016). pmid:27309803
View Article
PubMed/NCBI
Google Scholar

[91] View Article

[92] PubMed/NCBI

[93] Google Scholar

[ref25] 25. Liu T.-Y. et al. Time-Resolved Proteomics Extends Ribosome Profiling-Based Measurements of Protein Synthesis Dynamics. Cell Systems 4, 636–644.e9 (2017). pmid:28578850
View Article
PubMed/NCBI
Google Scholar

[95] View Article

[96] PubMed/NCBI

[97] Google Scholar

[ref26] 26. Wang Y. J. & Gilbert W. V. Quantitative Comparisons of Translation Activity by Ribosome Profiling with Internal Standards. Methods Mol Biol 2252, 127–149 (2021). pmid:33765273
View Article
PubMed/NCBI
Google Scholar

[99] View Article

[100] PubMed/NCBI

[101] Google Scholar

[ref27] 27. Hoerth K., Reitter S. & Schott J. Normalized Ribo-Seq for Quantifying Absolute Global and Specific Changes in Translation. Bio Protoc 12, e4323 (2022). pmid:35340296
View Article
PubMed/NCBI
Google Scholar

[103] View Article

[104] PubMed/NCBI

[105] Google Scholar

[ref28] 28. Lutzmayer S., Enugutti B. & Nodine M. D. Novel small RNA spike-in oligonucleotides enable absolute normalization of small RNA-Seq data. Scientific Reports 7, 5913 (2017). pmid:28724941
View Article
PubMed/NCBI
Google Scholar

[107] View Article

[108] PubMed/NCBI

[109] Google Scholar

[ref29] 29. Pine P. S. et al. Evaluation of the External RNA Controls Consortium (ERCC) reference material using a modified Latin square design. BMC Biotechnology 16, 54 (2016).
View Article
Google Scholar

[111] View Article

[112] Google Scholar

[ref30] 30. Law C. W., Chen Y., Shi W. & Smyth G. K. voom: Precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 15, R29 (2014). pmid:24485249
View Article
PubMed/NCBI
Google Scholar

[114] View Article

[115] PubMed/NCBI

[116] Google Scholar

[ref31] 31. Kozomara A., Birgaoanu M. & Griffiths-Jones S. miRBase: from microRNA sequences to function. Nucleic Acids Research 47, D155–D162 (2019). pmid:30423142
View Article
PubMed/NCBI
Google Scholar

[118] View Article

[119] PubMed/NCBI

[120] Google Scholar

[ref32] 32. Hofacker I. L. Vienna RNA secondary structure server. Nucleic Acids Res 31, 3429–3431 (2003). pmid:12824340
View Article
PubMed/NCBI
Google Scholar

[122] View Article

[123] PubMed/NCBI

[124] Google Scholar

[ref33] 33. Hornstein N. et al. Ligation-free ribosome profiling of cell type-specific translation in the brain. Genome Biology 17, 149 (2016). pmid:27380875
View Article
PubMed/NCBI
Google Scholar

[126] View Article

[127] PubMed/NCBI

[128] Google Scholar

[ref34] 34. Ingolia N. T., Brar G. A., Rouskin S., McGeachy A. M. & Weissman J. S. The ribosome profiling strategy for monitoring translation in vivo by deep sequencing of ribosome-protected mRNA fragments. Nat Protoc 7, 1534–1550 (2012). pmid:22836135
View Article
PubMed/NCBI
Google Scholar

[130] View Article

[131] PubMed/NCBI

[132] Google Scholar

[ref35] 35. Han P. et al. Genome-wide Survey of Ribosome Collision. Cell Reports 31, 107610 (2020). pmid:32375038
View Article
PubMed/NCBI
Google Scholar

[134] View Article

[135] PubMed/NCBI

[136] Google Scholar

[ref36] 36. Mito M., Mishima Y. & Iwasaki S. Protocol for Disome Profiling to Survey Ribosome Collision in Humans and Zebrafish. STAR Protocols 1, 100168 (2020). pmid:33377062
View Article
PubMed/NCBI
Google Scholar

[138] View Article

[139] PubMed/NCBI

[140] Google Scholar

[ref37] 37. Trapnell C., Pachter L. & Salzberg S. L. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25, 1105–1111 (2009). pmid:19289445
View Article
PubMed/NCBI
Google Scholar

[142] View Article

[143] PubMed/NCBI

[144] Google Scholar

[ref38] 38. Liao Y., Smyth G. K. & Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014). pmid:24227677
View Article
PubMed/NCBI
Google Scholar

[146] View Article

[147] PubMed/NCBI

[148] Google Scholar

[ref39] 39. Robinson M. D., McCarthy D. J. & Smyth G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2010). pmid:19910308
View Article
PubMed/NCBI
Google Scholar

[150] View Article

[151] PubMed/NCBI

[152] Google Scholar

[ref40] 40. Ritchie M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucl. Acids Res. gkv007 (2015) pmid:25605792
View Article
PubMed/NCBI
Google Scholar

[154] View Article

[155] PubMed/NCBI

[156] Google Scholar

Figures

Abstract

Introduction

Results

A set of short spike-in control oligos designed for human ribo-seq experiments

A stress response ribo-seq study for evaluating spike-in utility

Quantitative properties of spike-in oligo pools

Quantitative properties of individual spike-in oligo

Spike-in based data normalization preserved the expected global shift in quantification while global scaling normalization produced high level of false positives

Global scaling normalization introduced false positives in stress response dataset by distorting fold change

Spike-in based RUVg normalization recovered 99% of the discoveries from unnormalized data and identified an additional 5% of differentially translated genes

Global scaling normalization of stress response datasets leads to high rates of false discoveries among endogenous genes

Spike-in-based RUVg normalization results are consistent regardless of the exact combinations of spike-in oligomers used

Discussion

Materials and methods

Cell culture and oxidative stress

Western blot

Spike-in oligomers

Ribosome profiling

CRISPR/Cas9 targeted depletion of rRNA

Data processing

Sequencing read processing and mapping.

Data filtering and transformation.

Analyses.

Normalization

Differential expression tests

Supporting information

S1 File. Supplemental S1 to S16 Figs.

S2 File. Supplemental S1-S3 Tables.

S1 Data. Ribo-seq count table for all genes analyzed.

Acknowledgments

References