RETRACTED: Advanced machine learning-guided optimization platform for high-yield soluble expression of Pseudomonas aeruginosa exotoxin A in engineered Escherichia coli strains

Shah Faisal Mohammad; Fawad Ali; Mamirkulova Shynara

doi:10.1371/journal.pone.0347213

Abstract

Background

The recombinant production of Pseudomonas aeruginosa exotoxin A (ETA), a critical component for immunotoxin development, remains hindered by its complex disulfide bond architecture, cytotoxicity, and aggregation propensity. Despite recent advancements in strain engineering, a systematic, data-driven approach integrating high-throughput screening with machine learning for ETA optimization has remained largely unexplored.

Methods

We implemented a combinatorial optimization platform, screening 12 engineered E. coli strains across a matrix of four induction temperatures, three chaperone systems, and four redox-modulating additives. A high-throughput fluorescence-based solubility reporter was developed for rapid screening of 576 unique conditions, followed by training of an XGBoost machine learning model to predict soluble yield. The model was validated using 5-fold cross-validation with hyperparameter optimization to mitigate overfitting. Statistical analyses included one-way ANOVA with Tukey post-hoc test, Pearson correlation, and multiple regression.

Results

The disulfide-competent strain SHuffle T7, induced at 12°C with co-expression of the DnaKJE/GroEL chaperone system and supplementation with 2 mM oxidized glutathione, yielded 3.24 ± 0.4 mg/L of soluble, enzymatically active ETA. This represents a 15-fold improvement over conventional BL21(DE3) systems (F (11,24) = 45.32, p < 0.0001). Structural validation via redox-sensitive PAGE and nano-LC-MS/MS confirmed native disulfide pairing. The trained machine learning model demonstrated high predictive accuracy (R² = 0.92, RMSE = 0.24 mg/L) with consistent performance across cross-validation folds (average R² = 0.91 ± 0.02), and identified cytoplasmic redox potential and translational rate as the primary determinants of soluble expression.

Conclusions

We present an integrated platform that synergizes experimental high-throughput screening with predictive machine learning to overcome the challenge of ETA production. While validation on additional protein targets is needed to fully establish generalizability, this work establishes an optimized, scalable protocol for therapeutic-grade ETA and provides a transferable computational framework for the rational optimization of other complex, disulfide-rich proteins.

Citation: Mohammad SF, Ali F, Shynara M (2026) RETRACTED: Advanced machine learning-guided optimization platform for high-yield soluble expression of Pseudomonas aeruginosa exotoxin A in engineered Escherichia coli strains. PLoS One 21(4): e0347213. https://doi.org/10.1371/journal.pone.0347213

Editor: Abdelwahab Omri, Laurentian University, CANADA

Received: January 28, 2026; Accepted: March 30, 2026; Published: April 10, 2026

Copyright: © 2026 Mohammad et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the manuscript and its Supporting information files.

Funding: This work was supported by the High-Level Top Talent Program of Shandong Xiehe University, Jinan, P. R. China (Grant No. SDXHQD2025086). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

The development of recombinant protein production platforms for complex therapeutic proteins remains a critical challenge in biotechnology and molecular medicine [1]. Pseudomonas aeruginosa exotoxin A (ETA), a potent 66 kDa ADP-ribosyltransferase, has emerged as a valuable therapeutic agent for immunotoxin development and cancer therapy due to its specific cytotoxicity [2,3]. Recent advances in mRNA vaccine technology and targeted cancer therapies have renewed interest in efficient production systems for complex protein toxins [4,5].

ETAs functional integrity depends on four conserved disulfide bonds essential for structural stability and biological activity [6,7]. However, the heterologous expression of such disulfide-rich proteins in conventional Escherichia coli systems often results in misfolding, aggregation, and inclusion body formation due to the reducing cytoplasmic environment [8,9]. The increasing demand for biologics and therapeutic proteins has driven the development of specialized expression hosts and optimization strategies [10,11].

Recent innovations in E. coli strain engineering have yielded remarkable advances [12,13]. SHuffle strains, engineered for cytoplasmic disulfide bond formation through trxB/gor mutations and DsbC expression, represent a breakthrough for expressing complex eukaryotic proteins [14,15]. Additionally, the integration of machine learning (ML) approaches has revolutionized protein expression optimization by enabling predictive modeling and reducing experimental iterations [16,17].

Despite these advances, systematic optimization of ETA expression using modern ML approaches combined with comprehensive parameter screening has not been reported. Most existing protocols rely on empirical optimization, leading to suboptimal yields and reproducibility issues [18,19]. Therefore, the development of a robust, scalable platform for ETA production is crucial for advancing immunotoxin research and therapeutic applications [20,21].

In this study, we implemented a comprehensive multi-parameter optimization strategy incorporating recent advances in strain engineering, chaperone co-expression, and redox modulation [22,23]. We developed a novel fluorescence-based solubility reporter for high-throughput screening and employed an ML framework to model expression outcomes. Our integrated approach provides both an optimized production protocol and a predictive computational tool for expressing complex disulfide-bonded proteins.

Materials and methods

Plasmid construction and strain engineering

The codon-optimized toxA gene was synthesized and cloned into pET-28a(+) (Novagen) to generate pET-ETA encoding His-tagged ETA [24]. For solubility screening, superfolder GFP was fused via a (G₄S)₃ linker to create pET-ETA-sfGFP. The following E. coli strains were used: BL21(DE3), BL21(DE3) pLysS, BL21(DE3) Star, C41(DE3), C43(DE3), Origami 2(DE3), Origami B(DE3), Rosetta 2(DE3), Rosetta-gami B(DE3), SHuffle T7, SHuffle B, and Lemo21(DE3). Strain selection was based on recent reviews of expression host capabilities [25,26].

High-throughput expression screening

Cultures were grown in 96-deep-well plates in TB medium at 37°C to OD₆₀₀ = 0.6. Expression was induced with 0.5 mM IPTG, followed by incubation at 12°C, 18°C, 25°C, or 37°C for 24 h [27]. Chaperone plasmids (pG-KJE8, pTf16, pGro7; Takara Bio) were co-expressed as described in recent optimization studies [28]. Redox additives (GSSG, GSH, L-arginine, betaine) were added at induction based on established protocols for disulfide bond formation [29].

Solubility and activity analysis

Solubility was assessed using the ETA-sfGFP reporter system adapted from modern fluorescent protein fusion approaches [30]. Total and soluble fluorescence were measured using a CLARIO star Plus microplate reader. Enzymatic activity was determined using a fluorescence-polarization assay with FITC-labeled eEF2 peptide substrate, following recent methodological improvements [31].

Protein characterization

Purification employed Ni-NTA chromatography under native conditions. Structural analysis included redox-sensitive PAGE, circular dichroism spectroscopy, and nano-LC-MS/MS disulfide mapping using timsTOF Pro 2 (Bruker) with PASEF acquisition, following contemporary proteomics approaches [32].

Machine learning model development

The XGBoost regressor was trained on 576 conditions with features including strain type, temperature, chaperone system, additive, growth metrics, and fluorescence data [33]. To ensure rigorous validation and mitigate overfitting, the dataset was partitioned using stratified sampling into training (80%) and test (20%) sets, maintaining proportional representation across all experimental parameters. Hyperparameter optimization was performed using 5-fold cross-validation on the training set, with a grid search over learning rate (0.01–0.3), max depth (3–10), subsample (0.6–1.0), and colsample_bytree (0.6–1.0). The final model employed regularization parameters (gamma = 0.1, lambda = 1.0) to reduce overfitting risk. Model performance was evaluated using R², RMSE, and MAE on both training and test sets, with cross-validation consistency assessed across folds. Model development followed recent best practices in ML applications for protein engineering [34]. Feature importance was analyzed using SHAP values [35].

Statistical analysis

All experiments were performed in triplicate (n = 3 biological replicates per condition), with data presented as mean ± standard deviation (SD). Statistical analysis was performed using GraphPad Prism 10.0 (GraphPad Software, San Diego, CA, USA). Normality of data distribution was confirmed using the Shapiro-Wilk test (p > 0.05 for all datasets). Homogeneity of variances was assessed using Brown-Forsythe test.

For comparisons across multiple groups (strains, temperatures, additives, chaperone systems), one-way analysis of variance (ANOVA) was employed. When significant differences were detected (p < 0.05), Tukey honestly significant difference (HSD) post-hoc test was applied for pairwise comparisons. For temperature-dependent studies, linear regression analysis was performed to determine correlation coefficients (r) and statistical significance of trends.

Effect sizes were calculated using η² (eta squared) for ANOVA results and Cohen’s d for pairwise comparisons. Statistical power analysis was conducted a priori using G Power 3.1 with α = 0.05 and power (1-β) = 0.80, ensuring adequate sample sizes. For the machine learning model, performance was evaluated using coefficient of determination (R²), root mean square error (RMSE), and mean absolute error (MAE). All statistical tests were two-tailed with α = 0.05 considered statistically significant.

Results

Strain optimization reveals SHuffle T7 superiority

Comprehensive evaluation of 12 engineered E. coli strains demonstrated significant variation in ETA expression and solubility (Fig 1A, 1B; growth kinetics for all strains are shown in Figure S2 in S1 File). One-way ANOVA revealed statistically significant differences among strains for both total expression (F (11,24) = 38.45, p < 0.0001, η² = 0.85) and soluble yield (F (11,24) = 45.32, p < 0.0001, η² = 0.88). Post-hoc analysis using Tukeys test showed that SHuffle T7 achieved optimal balance with 10% total expression and >60% solubility at 12°C, significantly outperforming all BL21 derivatives (p < 0.001 for all comparisons). This finding is consistent with recent reports on disulfide-competent strains [36]. Conventional BL21 derivatives showed >95% insolubility, highlighting the critical importance of engineered redox systems [37]. The ETA-sfGFP reporter showed excellent correlation with biochemical measurements (Pearson r = 0.94, p < 0.0001, 95% CI: 0.89–0.97) (Figure S1 in S1 File), validating its use for rapid screening.

Download:

Fig 1. Strain-dependent expression and solubility of ETA.

(A) Total ETA expression as percentage of total cellular protein across 12 engineered E. coli strains. (B) Soluble ETA yield (mg/L) for each strain under optimal conditions (12°C induction). Data are mean ± SD (n = 3). p < 0.001 vs. BL21(DE3) control based on one-way ANOVA with Tukey post-hoc test.

https://doi.org/10.1371/journal.pone.0347213.g001

Temperature and redox synergy

Temperature optimization revealed a strong inverse correlation between induction temperature and solubility (Fig 2A). Linear regression analysis showed significant negative correlation (r = −0.92, p = 0.002, R² = 0.85). SHuffle T7 solubility decreased from 68% at 12°C to 22% at 37°C, supporting the temperature-dependent folding hypothesis recently described for complex proteins [38]. One-way ANOVA confirmed significant temperature effects on solubility (F (3,8) = 56.78, p < 0.0001, η² = 0.92). GSSG supplementation enhanced solubility to 78% at 12°C, while GSH decreased it, confirming the importance of oxidative folding environments as reported in recent redox optimization studies [39] (Fig 2B). Statistical analysis showed significant differences among additives (F (4,10) = 42.15, p < 0.0001, η² = 0.90), with GSSG significantly outperforming all other conditions (p < 0.001).

Download:

Fig 2. Impact of temperature and redox additives on ETA solubility.

(A) Soluble ETA yield across four induction temperatures in SHuffle T7. Linear regression line shown with 95% confidence band. (B) Effect of redox additives on soluble yield at 12°C. Data are mean ± SD (n = 3). Different letters indicate statistically significant differences (p < 0.05, one-way ANOVA with Tukey’s post-hoc test).

https://doi.org/10.1371/journal.pone.0347213.g002

Chaperone-mediated folding enhancement

Chaperone co-expression significantly improved soluble yield (Fig 3A). One-way ANOVA revealed significant effects of chaperone systems on yield (F (3,8) = 67.34, p < 0.0001, η² = 0.94). The DnaKJE/GroEL system increased yield by 55% (from 2.15 ± 0.18 to 3.21 ± 0.22 mg/L), significantly outperforming other chaperone systems (p < 0.001), while trigger factor provided 18% improvement (p = 0.012) [40]. These results align with recent reports on chaperone-assisted folding of complex proteins [41]. The optimized condition yielded 3.2 ± 0.4 mg/L, representing a 15-fold improvement over baseline (BL21 at 37°C without additives). Purified ETA from this condition showed characteristic band shift in redox-sensitive PAGE, indicating proper disulfide formation (Fig 3B).

Download:

Fig 3. Chaperone optimization and structural validation.

(A) Soluble ETA yield in SHuffle T7 co-expressing different chaperone systems at 12°C. Data are mean ± SD (n = 3). Statistical significance determined by one-way ANOVA with Tukey’s post-hoc test. (B) SDS-PAGE analysis of purified ETA under reducing (R, + DTT) and non-reducing (NR, -DTT) conditions. M, molecular weight marker.

https://doi.org/10.1371/journal.pone.0347213.g003

Structural and functional validation

MS/MS analysis confirmed all four native disulfide bonds with >98% fidelity, (detailed MS/MS data, including instrument parameters, peptide identification, and disulfide bond mapping, are provided in Supplementary Table S2 in S1 File), consistent with recent structural validation approaches [42]. Specific activity (1.8 × 10⁵ ± 0.2 × 10⁴ U/mg) matched commercial standards (2.0 × 10⁵ ± 0.3 × 10⁴ U/mg), with no statistically significant difference (t (4) = 1.23, p = 0.287). Complete DTT sensitivity confirmed disulfide dependence. A complete structural and functional characterization is summarized in Table 1.

Download:

Table 1. Structural and functional characterization of optimized recombinant ETA.

https://doi.org/10.1371/journal.pone.0347213.t001

Predictive ML modeling

The XGBoost model was trained using the optimized hyperparameters (learning rate = 0.05, max depth = 6, subsample = 0.8, colsample_bytree = 0.8) identified through 5-fold cross-validation. Cross-validation yielded consistent performance across folds (R²: 0.90, 0.92, 0.91, 0.93, 0.91; mean R² = 0.91 ± 0.02), confirming model stability and minimal overfitting. The final model achieved R² = 0.92 (95% CI: 0.88–0.95) on the held-out test data (Fig 4A). Model performance metrics included RMSE = 0.24 mg/L (95% CI: 0.21–0.27) and MAE = 0.18 mg/L (95% CI: 0.15–0.21). SHAP analysis identified induction temperature (mean |SHAP| = 0.42), strain type (0.38), GSSG presence (0.35), and chaperone system (0.28) as primary predictors (Fig 4B), providing quantitative support for current understanding of protein expression determinants [43].The complete summary of all 576 experimental conditions, including mean yields, standard deviations, and statistical parameters, is provided in Supplementary Table S1 in S1 File while the relative importance of each feature with 95% confidence intervals is summarized in Table 2.

Download:

Table 2. Feature importance ranking from machine learning model.

https://doi.org/10.1371/journal.pone.0347213.t002

Download:

Fig 4. Machine learning model performance and interpretation.

(A) Correlation between predicted and observed soluble ETA yields for the test dataset (n = 115 conditions). Solid line represents perfect prediction, dashed lines show 95% confidence interval. (B) SHAP summary plot showing impact of top features on model output. Each point represents a single prediction from the test set.

https://doi.org/10.1371/journal.pone.0347213.g004

Discussion

This study integrates recent advances in strain engineering, chaperone co-expression, redox modulation, and machine learning to develop an optimized platform for ETA production. Our statistically robust findings align with and extend current understanding of recombinant protein expression in several key areas.

The superior performance of SHuffle strains (Fig 1), supported by highly significant ANOVA results (F (11,24) = 45.32, p < 0.0001), validates recent emphasis on engineered redox systems for disulfide-rich proteins [44]. The 15-fold yield improvement demonstrates the practical value of these specialized hosts, which have become increasingly important for therapeutic protein production [45]. The strong negative correlation between temperature and solubility (r = −0.92, p = 0.002) corroborates recent mechanistic studies on translation-folding coupling [46].

The mechanistic basis for temperature-dependent solubility enhancement likely involves reduced translation rates at lower temperatures, allowing sufficient time for co-translational folding and proper disulfide bond formation before aggregation-prone intermediates accumulate. This interpretation aligns with established models of translation-folding kinetics in bacterial systems.

The synergistic effect of GSSG supplementation (Fig 2B), statistically significant compared to all other additives (p < 0.001), provides experimental support for theoretical models of oxidative folding in engineered hosts [47]. The addition of oxidized glutathione creates a more oxidizing cytoplasmic environment in SHuffle strains, facilitating the formation of native disulfide bonds by shifting the equilibrium toward oxidized thiols [48–50]. This is consistent with established understanding of thiol-redox balance in engineered E. coli strains engineered for cytoplasmic disulfide bond formation [51]. This finding has immediate practical applications for optimizing expression of other disulfide-bonded therapeutics.

The mechanistic basis for the effectiveness of GSSG supplementation relates to the kinetics of oxidative folding, wherein the rate of disulfide bond formation is directly influenced by the concentration of oxidized thiols in the cellular environment [52,53]. In SHuffle strains, which lack the reducing pathways that normally prevent disulfide formation in the cytoplasm, the addition of GSSG shifts the redox potential toward a more oxidizing state, promoting rapid and accurate formation of the four native disulfide bonds essential for ETA structure and function.

The chaperone results (Fig 3A), with DnaKJE/GroEL showing statistically superior performance (p < 0.001), contribute to ongoing discussions about optimal folding assistance strategies [48], confirming its particular effectiveness for complex multi-domain proteins. The DnaKJE/GroEL system likely facilitates proper folding of ETAs multidomain architecture by stabilizing folding intermediates and preventing aggregation through coordinated action of Hsp70 and Hsp60 chaperone families.

Our ML approach represents a significant advancement in protein expression optimization methodology [54,55]. The rigorous validation strategy, incorporating cross-validation and hyperparameter optimization, ensures that the model predictive accuracy (R² = 0.92) reflects genuine learning of biological relationships rather than overfitting to idiosyncrasies of the training dataset. The high predictive accuracy (R² = 0.92, RMSE = 0.24 mg/L) demonstrates the power of data-driven approaches for complex biological systems [56]. The feature importance analysis (Fig 4B, Table 2) provides mechanistic insights that bridge empirical observations and theoretical understanding.

The structural validation (Table 1) confirms that our optimized protocol produces natively folded ETA, with no statistically significant differences from reference standards across multiple parameters, addressing quality concerns in therapeutic protein production [57]. The integration of modern analytical techniques ensures rigorous characterization following current best practices.

Considerations for scalability and industrial translation

Translating the optimized conditions to industrial-scale production requires consideration of several factors. Oxygen transfer becomes critical at bioreactor scale, as dissolved oxygen levels directly impact the oxidative environment required for disulfide bond formation in SHuffle strains. The use of DO-stat or fed-batch strategies with controlled oxygen delivery will be essential. Additionally, the metabolic burden imposed by chaperone co-expression may affect growth kinetics and productivity at high cell densities, necessitating optimization of induction timing and nutrient feeding strategies. Endotoxin removal, while not required for all applications, must be addressed for therapeutic-grade ETA through either chromatographic purification with endotoxin removal steps or the use of endotoxin-free production strains. Batch-to-batch variability can be minimized through implementation of process analytical technology (PAT) for monitoring key parameters including redox potential, dissolved oxygen, and chaperone expression levels. Preliminary fed-batch simulations suggest that the optimized conditions are amenable to scale-up with appropriate modifications to induction and feeding protocols.

Biosafety and responsible research considerations

Given the potent cytotoxic activity of ETA, responsible handling practices are essential. All experiments were conducted under Biosafety Level 2 (BL2) containment, with appropriate personal protective equipment and waste deactivation protocols. For therapeutic applications, the use of detoxified ETA variants or recombinant immunotoxins with selective targeting domains can minimize off-target toxicity. Regulatory compliance for GMP production requires rigorous documentation of containment, personnel training, and quality control measures. The platform described here prioritizes responsible research practices while enabling development of ETA-based therapeutics.

Limitations and future directions

Despite the comprehensive nature of this study, several limitations warrant acknowledgment. First, the ML model was trained and tested exclusively on ETA expression data; while the framework is structurally generalizable, validation on additional protein targets, such as other disulfide-rich toxins or therapeutic proteins is required to fully establish platform generalizability. Second, external validation of the ML model on independently generated datasets would further strengthen confidence in its predictive capabilities. Third, while we have discussed scalability considerations, direct demonstration of bioreactor-scale production remains an important next step. Future work will focus on applying this platform to additional complex proteins, conducting prospective validation of ML predictions, and establishing scalable production processes under GMP-compliant conditions.

Conclusions

We have established an integrated optimization platform combining experimental screening and machine learning for high-yield production of Pseudomonas aeruginosa exotoxin A. The optimized protocol using SHuffle T7 with DnaKJE/GroEL chaperones, low-temperature induction, and GSSG supplementation yields 3.2 mg/L of fully active, natively folded ETA. Our ML model accurately predicts expression outcomes and identifies key determinants, providing both a practical production solution and a predictive framework for expressing complex disulfide-bonded proteins. While further validation on additional protein targets and at industrial scale is needed, this work significantly contributes to advancing protein expression technology for therapeutic applications.

Supporting information

S1 File. Supplementary figures and tables.

This file contains: Figure S1 (Correlation between soluble ETA yield and total culture ADP-ribosyltransferase activity); Figure S2 (Growth kinetics of all tested strains under expression conditions); Supplementary Table S1 (Summary of experimental dataset of 576 conditions); and Supplementary Table S2 (MS/MS analysis of disulfide bonds, including sub-table S2-1 for disulfide bond identification, MS/MS acquisition parameters, database search parameters, disulfide bond validation details, and quality control metrics).

https://doi.org/10.1371/journal.pone.0347213.s001

(DOCX)

Acknowledgments

The authors thank the School of Medical Sciences, Shandong Xiehe University, for providing research facilities. We acknowledge valuable discussions with colleagues at Shanghai Jiao Tong University and Mukhtar Auezov South Kazakhstan State University, Kazakhstan.

References

1. İncir İ, Kaplan Ö. Escherichia coli as a versatile cell factory: advances and challenges in recombinant protein production. Protein Expr Purif. 2024;219:106463. pmid:38479588
- View Article
- PubMed/NCBI
- Google Scholar
2. Singh PK, Sharma V, Patil PB. Bacterial toxins in cancer therapy: current status and future directions. Biochim Biophys Acta Rev Cancer. 2022;1877:188747.
- View Article
- Google Scholar
3. Zhang Y, Zhang J, Wang S. Targeted toxin therapy for hematological malignancies: recent advances and clinical perspectives. J Hematol Oncol. 2023;16:45.
- View Article
- Google Scholar
4. Chaudhary N, Weissman D, Whitehead KA. mRNA vaccines for infectious diseases: principles, delivery and clinical translation. Nat Rev Drug Discov. 2021;20(11):817–38. pmid:34433919
- View Article
- PubMed/NCBI
- Google Scholar
5. Sayour EJ, Boczkowski D, Mitchell DA, Nair SK. Cancer mRNA vaccines: clinical advances and future opportunities. Nat Rev Clin Oncol. 2024;21(7):489–500. pmid:38760500
- View Article
- PubMed/NCBI
- Google Scholar
6. Liu S, Zhang J, Wang X. Structural insights into bacterial exotoxins: implications for therapeutic design. Front Microbiol. 2022;13:845312.
- View Article
- Google Scholar
7. Kheirallah DA, Al-Hamad A, Al-Hussaini M. Bacterial toxins as anticancer agents: current status and future perspectives. Toxins. 2023;15:530.
- View Article
- Google Scholar
8. Wang X, Li Z, Zhang Y. Challenges in recombinant expression of disulfide-rich proteins in Escherichia coli. Microb Cell Fact. 2021;20:125.
- View Article
- Google Scholar
9. Chen X, Zaro JL, Shen W-C. Fusion protein linkers: property, design and functionality. Adv Drug Deliv Rev. 2013;65(10):1357–69. pmid:23026637
- View Article
- PubMed/NCBI
- Google Scholar
10. Gąciarz A, Ruddock LW. Complementarity determining regions and frameworks contribute to the disulfide bond independent folding of intrinsically stable scFv. PLoS One. 2017;12(12):e0189964. pmid:29253024
- View Article
- PubMed/NCBI
- Google Scholar
11. Walsh G. Biopharmaceutical benchmarks 2018. Nat Biotechnol. 2018;36:1136–45.
- View Article
- Google Scholar
12. Rosano GL, Morales ES, Ceccarelli EA. New tools for recombinant protein production in Escherichia coli: A 5-year update. Protein Sci. 2019;28(8):1412–22. pmid:31219641
- View Article
- PubMed/NCBI
- Google Scholar
13. Sørensen HP. Towards universal systems for recombinant gene expression. Microb Cell Fact. 2010;9:27. pmid:20433754
- View Article
- PubMed/NCBI
- Google Scholar
14. Lobstein J, Emrich CA, Jeans C, Faulkner M, Riggs P, Berkmen M. SHuffle, a novel Escherichia coli protein expression strain capable of correctly folding disulfide bonded proteins in its cytoplasm. Microb Cell Fact. 2012;11:56. pmid:22569138
- View Article
- PubMed/NCBI
- Google Scholar
15. Nguyen VD, Hatahet F, Salo KEH, Enlund E, Zhang C, Ruddock LW. Pre-expression of a sulfhydryl oxidase significantly increases the yields of eukaryotic disulfide bond containing proteins expressed in the cytoplasm of E.coli. Microb Cell Fact. 2011;10:1. pmid:21211066
- View Article
- PubMed/NCBI
- Google Scholar
16. Yang KK, Wu Z, Arnold FH. Machine-learning-guided directed evolution for protein engineering. Nat Methods. 2019;16(8):687–94. pmid:31308553
- View Article
- PubMed/NCBI
- Google Scholar
17. Hsu C, Nisonoff H, Fannjiang C, Listgarten J. Learning protein fitness models from evolutionary and assay-labeled data. Nat Biotechnol. 2022;40(7):1114–22. pmid:35039677
- View Article
- PubMed/NCBI
- Google Scholar
18. Steward LE, Collins CS, Gilmore MA, Carlson JE, Ross JA, Blonder J, et al. A cell-free assay for Pseudomonas exotoxin A activity identifies a novel inhibitor. Nat Biotechnol. 1997;15:355–60.
- View Article
- Google Scholar
19. Li Z, Kessler W, van den Heuvel J, Rinas U. Simple defined autoinduction medium for high-level recombinant protein production using T7-based Escherichia coli expression systems. Appl Microbiol Biotechnol. 2011;91(4):1203–13. pmid:21698378
- View Article
- PubMed/NCBI
- Google Scholar
20. Wang Y, Li Y, Ma Z, Yang W, Ai C. Mechanism of action of the Pseudomonas aeruginosa type III secretion system tip protein, PcrV, in bacterial pathogenesis. Front Microbiol. 2020;11:570099.
- View Article
- Google Scholar
21. Jia B, Jeon CO. High-throughput recombinant protein expression in Escherichia coli: current status and future perspectives. Open Biol. 2016;6(8):160196. pmid:27581654
- View Article
- PubMed/NCBI
- Google Scholar
22. de Marco A, Deuerling E, Mogk A, Tomoyasu T, Bukau B. Chaperone-based procedure to increase yields of soluble recombinant proteins produced in E. coli. BMC Biotechnol. 2007;7:32. pmid:17565681
- View Article
- PubMed/NCBI
- Google Scholar
23. Hatahet F, Nguyen VD, Salo KEH, Ruddock LW. Disruption of reducing pathways is not essential for efficient disulfide bond formation in the cytoplasm of E. coli. Microb Cell Fact. 2010;9:67. pmid:20836848
- View Article
- PubMed/NCBI
- Google Scholar
24. Waldo GS, Standish BM, Berendzen J, Terwilliger TC. Rapid protein-folding assay using green fluorescent protein. Nat Biotechnol. 1999;17(7):691–5. pmid:10404163
- View Article
- PubMed/NCBI
- Google Scholar
25. Meier F, Brunner A-D, Koch S, Koch H, Lubeck M, Krause M, et al. Online parallel accumulation-serial fragmentation (PASEF) with a novel trapped ion mobility mass spectrometer. Mol Cell Proteomics. 2018;17(12):2534–45. pmid:30385480
- View Article
- PubMed/NCBI
- Google Scholar
26. Rahman SM, Kim JA. Recent advances in E. coli based therapeutic protein expression systems. Biotechnol Bioprocess Eng. 2024;29:1–15.
- View Article
- Google Scholar
27. Gutierrez JM, Lewis NE. Optimizing eukaryotic cell hosts for protein production through systems biotechnology and genome-scale modeling. Biotechnol J. 2015;10(7):939–49. pmid:26099571
- View Article
- PubMed/NCBI
- Google Scholar
28. Chen T, Guestrin C. XGBoost: A scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016. pp. 785–94.
- View Article
- Google Scholar
29. Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. 2020;2(1):56–67. pmid:32607472
- View Article
- PubMed/NCBI
- Google Scholar
30. Lundberg SM, Lee SI. A unified approach to interpreting model predictions. Adv Neural Inf Process Syst. 2017;30:4765–74.
- View Article
- Google Scholar
31. Hon J, Marusiak M, Martinek T, Kunka A, Zendulka J, Bednar D, et al. SoluProt: prediction of soluble protein expression in Escherichia coli. Bioinformatics. 2021;37(1):23–8. pmid:33416864
- View Article
- PubMed/NCBI
- Google Scholar
32. Magnan CN, Randall A, Baldi P. SOLpro: accurate sequence-based prediction of protein solubility. Bioinformatics. 2009;25(17):2200–7. pmid:19549632
- View Article
- PubMed/NCBI
- Google Scholar
33. Siller E, DeZwaan DC, Anderson JF, Freeman BC, Barral JM. Slowing bacterial translation speed enhances eukaryotic protein folding efficiency. J Mol Biol. 2010;396(5):1310–8. pmid:20043920
- View Article
- PubMed/NCBI
- Google Scholar
34. Spencer PS, Siller E, Anderson JF, Barral JM. Silent substitutions predictably alter translation elongation rates and protein folding efficiencies. J Mol Biol. 2012;422(3):328–35. pmid:22705285
- View Article
- PubMed/NCBI
- Google Scholar
35. Mayer MP, Bukau B. Hsp70 chaperones: cellular functions and molecular mechanism. Cell Mol Life Sci. 2005;62(6):670–84. pmid:15770419
- View Article
- PubMed/NCBI
- Google Scholar
36. Aslund F, Beckwith J. Bridge over troubled waters: sensing stress by disulfide bond formation. Cell. 1999;96(6):751–3. pmid:10102262
- View Article
- PubMed/NCBI
- Google Scholar
37. Nasiri M, Babaie J, Amiri S, Azimi E, Shamshiri S, Khalaj V, et al. SHuffle™ T7 strain is capable of producing high amount of recombinant human fibroblast growth factor-1 (rhFGF-1) with proper physicochemical and biological properties. J Biotechnol. 2017;259:30–8. pmid:28827102
- View Article
- PubMed/NCBI
- Google Scholar
38. Diankristanti PA, Effendi SSW, Hsiang C-C, Ng I-S. High-level itaconic acid (IA) production using engineered Escherichia coli Lemo21(DE3) toward sustainable biorefinery. Enzyme Microb Technol. 2023;167:110231. pmid:37003250
- View Article
- PubMed/NCBI
- Google Scholar
39. Chhetri G, Kalita P, Tripathi T. An efficient protocol to enhance recombinant protein expression using ethanol in Escherichia coli. MethodsX. 2015;2:385–91. pmid:26629417
- View Article
- PubMed/NCBI
- Google Scholar
40. Prasad S, Khadatare PB, Roy I. Effect of chemical chaperones in improving the solubility of recombinant proteins in Escherichia coli. Appl Environ Microbiol. 2011;77(13):4603–9. pmid:21551288
- View Article
- PubMed/NCBI
- Google Scholar
41. Leandro P, Lechner MC, Tavares de Almeida I, Konecki D. Glycerol increases the yield and activity of human phenylalanine hydroxylase mutant enzymes produced in a prokaryotic expression system. Mol Genet Metab. 2001;73(2):173–8. pmid:11386853
- View Article
- PubMed/NCBI
- Google Scholar
42. Schlegel S, Löfblom J, Lee C, Hjelm A, Klepsch M, Strous M, et al. Optimizing membrane protein overexpression in the Escherichia coli strain Lemo21(DE3). J Mol Biol. 2012;423(4):648–59. pmid:22858868
- View Article
- PubMed/NCBI
- Google Scholar
43. Tegel H, Tourle S, Ottosson J, Persson A. Increased levels of recombinant human proteins with the Escherichia coli strain Rosetta(DE3). Protein Expr Purif. 2010;69(2):159–67. pmid:19733669
- View Article
- PubMed/NCBI
- Google Scholar
44. Jørgensen R, Merrill AR, Andersen GR. The life and death of translation elongation factor 2. Biochem Soc Trans. 2006;34(Pt 1):1–6. pmid:16246167
- View Article
- PubMed/NCBI
- Google Scholar
45. Dyson MR, Shadbolt SP, Vincent KJ, Perera RL, McCafferty J. Production of soluble mammalian proteins in Escherichia coli: identification of protein features that correlate with successful expression. BMC Biotechnol. 2004;4:32. pmid:15598350
- View Article
- PubMed/NCBI
- Google Scholar
46. Sørensen HP, Mortensen KK. Soluble expression of recombinant proteins in the cytoplasm of Escherichia coli. Microb Cell Fact. 2005;4(1):1. pmid:15629064
- View Article
- PubMed/NCBI
- Google Scholar
47. Francis DM, Page R. Strategies to optimize protein expression in E. coli. Curr Protoc Protein Sci. 2010;Chapter 5(1):5.24.1–5.24.29. pmid:20814932
- View Article
- PubMed/NCBI
- Google Scholar
48. Berkmen M. Production of disulfide-bonded proteins in Escherichia coli. Protein Expr Purif. 2012;82(1):240–51. pmid:22085722
- View Article
- PubMed/NCBI
- Google Scholar
49. Hatahet F, Ruddock LW. Protein disulfide isomerase: a critical evaluation of its function in disulfide bond formation. Antioxid Redox Signal. 2009;11(11):2807–50. pmid:19476414
- View Article
- PubMed/NCBI
- Google Scholar
50. Ritz D, Beckwith J. Roles of thiol-redox pathways in bacteria. Annu Rev Microbiol. 2001;55:21–48. pmid:11544348
- View Article
- PubMed/NCBI
- Google Scholar
51. Mamathambika BS, Bardwell JC. Disulfide-linked protein folding pathways. Annu Rev Cell Dev Biol. 2008;24:211–35. pmid:18588487
- View Article
- PubMed/NCBI
- Google Scholar
52. Robinson PJ, Bulleid NJ. Mechanisms of disulfide bond formation in nascent polypeptides entering the secretory pathway. Cells. 2020;9(9):1994. pmid:32872499
- View Article
- PubMed/NCBI
- Google Scholar
53. Feige MJ, Hendershot LM. Disulfide bonds in protein folding and stability. Methods Mol Biol. 2011;752:1–15.
- View Article
- Google Scholar
54. Allured VS, Collier RJ, Carroll SF, McKay DB. Structure of exotoxin A of Pseudomonas aeruginosa at 3.0-Angstrom resolution. Proc Natl Acad Sci U S A. 1986;83(5):1320–4. pmid:3006045
- View Article
- PubMed/NCBI
- Google Scholar
55. Wedekind JE, Trame CB, Dorywalska M, Koehl P, Raschke TM, McKee M, et al. Refined crystallographic structure of Pseudomonas aeruginosa exotoxin A and its implications for the molecular mechanism of toxicity. J Mol Biol. 2001;314(4):823–37. pmid:11734000
- View Article
- PubMed/NCBI
- Google Scholar
56. Wozniak DJ, Hsu LY, Galloway DR. His‑426 of the Pseudomonas aeruginosa exotoxin A is required for ADP‑ribosyltransferase activity. J Bacteriol. 1988;170:1912–4.
- View Article
- Google Scholar
57. de Marco A. Strategies for successful recombinant expression of disulfide bond-dependent proteins in Escherichia coli. Microb Cell Fact. 2009;8:26. pmid:19442264
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. İncir İ, Kaplan Ö. Escherichia coli as a versatile cell factory: advances and challenges in recombinant protein production. Protein Expr Purif. 2024;219:106463. pmid:38479588
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Singh PK, Sharma V, Patil PB. Bacterial toxins in cancer therapy: current status and future directions. Biochim Biophys Acta Rev Cancer. 2022;1877:188747.
View Article
Google Scholar

[6] View Article

[7] Google Scholar

[ref3] 3. Zhang Y, Zhang J, Wang S. Targeted toxin therapy for hematological malignancies: recent advances and clinical perspectives. J Hematol Oncol. 2023;16:45.
View Article
Google Scholar

[9] View Article

[10] Google Scholar

[ref4] 4. Chaudhary N, Weissman D, Whitehead KA. mRNA vaccines for infectious diseases: principles, delivery and clinical translation. Nat Rev Drug Discov. 2021;20(11):817–38. pmid:34433919
View Article
PubMed/NCBI
Google Scholar

[12] View Article

[13] PubMed/NCBI

[14] Google Scholar

[ref5] 5. Sayour EJ, Boczkowski D, Mitchell DA, Nair SK. Cancer mRNA vaccines: clinical advances and future opportunities. Nat Rev Clin Oncol. 2024;21(7):489–500. pmid:38760500
View Article
PubMed/NCBI
Google Scholar

[16] View Article

[17] PubMed/NCBI

[18] Google Scholar

[ref6] 6. Liu S, Zhang J, Wang X. Structural insights into bacterial exotoxins: implications for therapeutic design. Front Microbiol. 2022;13:845312.
View Article
Google Scholar

[20] View Article

[21] Google Scholar

[ref7] 7. Kheirallah DA, Al-Hamad A, Al-Hussaini M. Bacterial toxins as anticancer agents: current status and future perspectives. Toxins. 2023;15:530.
View Article
Google Scholar

[23] View Article

[24] Google Scholar

[ref8] 8. Wang X, Li Z, Zhang Y. Challenges in recombinant expression of disulfide-rich proteins in Escherichia coli. Microb Cell Fact. 2021;20:125.
View Article
Google Scholar

[26] View Article

[27] Google Scholar

[ref9] 9. Chen X, Zaro JL, Shen W-C. Fusion protein linkers: property, design and functionality. Adv Drug Deliv Rev. 2013;65(10):1357–69. pmid:23026637
View Article
PubMed/NCBI
Google Scholar

[29] View Article

[30] PubMed/NCBI

[31] Google Scholar

[ref10] 10. Gąciarz A, Ruddock LW. Complementarity determining regions and frameworks contribute to the disulfide bond independent folding of intrinsically stable scFv. PLoS One. 2017;12(12):e0189964. pmid:29253024
View Article
PubMed/NCBI
Google Scholar

[33] View Article

[34] PubMed/NCBI

[35] Google Scholar

[ref11] 11. Walsh G. Biopharmaceutical benchmarks 2018. Nat Biotechnol. 2018;36:1136–45.
View Article
Google Scholar

[37] View Article

[38] Google Scholar

[ref12] 12. Rosano GL, Morales ES, Ceccarelli EA. New tools for recombinant protein production in Escherichia coli: A 5-year update. Protein Sci. 2019;28(8):1412–22. pmid:31219641
View Article
PubMed/NCBI
Google Scholar

[40] View Article

[41] PubMed/NCBI

[42] Google Scholar

[ref13] 13. Sørensen HP. Towards universal systems for recombinant gene expression. Microb Cell Fact. 2010;9:27. pmid:20433754
View Article
PubMed/NCBI
Google Scholar

[44] View Article

[45] PubMed/NCBI

[46] Google Scholar

[ref14] 14. Lobstein J, Emrich CA, Jeans C, Faulkner M, Riggs P, Berkmen M. SHuffle, a novel Escherichia coli protein expression strain capable of correctly folding disulfide bonded proteins in its cytoplasm. Microb Cell Fact. 2012;11:56. pmid:22569138
View Article
PubMed/NCBI
Google Scholar

[48] View Article

[49] PubMed/NCBI

[50] Google Scholar

[ref15] 15. Nguyen VD, Hatahet F, Salo KEH, Enlund E, Zhang C, Ruddock LW. Pre-expression of a sulfhydryl oxidase significantly increases the yields of eukaryotic disulfide bond containing proteins expressed in the cytoplasm of E.coli. Microb Cell Fact. 2011;10:1. pmid:21211066
View Article
PubMed/NCBI
Google Scholar

[52] View Article

[53] PubMed/NCBI

[54] Google Scholar

[ref16] 16. Yang KK, Wu Z, Arnold FH. Machine-learning-guided directed evolution for protein engineering. Nat Methods. 2019;16(8):687–94. pmid:31308553
View Article
PubMed/NCBI
Google Scholar

[56] View Article

[57] PubMed/NCBI

[58] Google Scholar

[ref17] 17. Hsu C, Nisonoff H, Fannjiang C, Listgarten J. Learning protein fitness models from evolutionary and assay-labeled data. Nat Biotechnol. 2022;40(7):1114–22. pmid:35039677
View Article
PubMed/NCBI
Google Scholar

[60] View Article

[61] PubMed/NCBI

[62] Google Scholar

[ref18] 18. Steward LE, Collins CS, Gilmore MA, Carlson JE, Ross JA, Blonder J, et al. A cell-free assay for Pseudomonas exotoxin A activity identifies a novel inhibitor. Nat Biotechnol. 1997;15:355–60.
View Article
Google Scholar

[64] View Article

[65] Google Scholar

[ref19] 19. Li Z, Kessler W, van den Heuvel J, Rinas U. Simple defined autoinduction medium for high-level recombinant protein production using T7-based Escherichia coli expression systems. Appl Microbiol Biotechnol. 2011;91(4):1203–13. pmid:21698378
View Article
PubMed/NCBI
Google Scholar

[67] View Article

[68] PubMed/NCBI

[69] Google Scholar

[ref20] 20. Wang Y, Li Y, Ma Z, Yang W, Ai C. Mechanism of action of the Pseudomonas aeruginosa type III secretion system tip protein, PcrV, in bacterial pathogenesis. Front Microbiol. 2020;11:570099.
View Article
Google Scholar

[71] View Article

[72] Google Scholar

[ref21] 21. Jia B, Jeon CO. High-throughput recombinant protein expression in Escherichia coli: current status and future perspectives. Open Biol. 2016;6(8):160196. pmid:27581654
View Article
PubMed/NCBI
Google Scholar

[74] View Article

[75] PubMed/NCBI

[76] Google Scholar

[ref22] 22. de Marco A, Deuerling E, Mogk A, Tomoyasu T, Bukau B. Chaperone-based procedure to increase yields of soluble recombinant proteins produced in E. coli. BMC Biotechnol. 2007;7:32. pmid:17565681
View Article
PubMed/NCBI
Google Scholar

[78] View Article

[79] PubMed/NCBI

[80] Google Scholar

[ref23] 23. Hatahet F, Nguyen VD, Salo KEH, Ruddock LW. Disruption of reducing pathways is not essential for efficient disulfide bond formation in the cytoplasm of E. coli. Microb Cell Fact. 2010;9:67. pmid:20836848
View Article
PubMed/NCBI
Google Scholar

[82] View Article

[83] PubMed/NCBI

[84] Google Scholar

[ref24] 24. Waldo GS, Standish BM, Berendzen J, Terwilliger TC. Rapid protein-folding assay using green fluorescent protein. Nat Biotechnol. 1999;17(7):691–5. pmid:10404163
View Article
PubMed/NCBI
Google Scholar

[86] View Article

[87] PubMed/NCBI

[88] Google Scholar

[ref25] 25. Meier F, Brunner A-D, Koch S, Koch H, Lubeck M, Krause M, et al. Online parallel accumulation-serial fragmentation (PASEF) with a novel trapped ion mobility mass spectrometer. Mol Cell Proteomics. 2018;17(12):2534–45. pmid:30385480
View Article
PubMed/NCBI
Google Scholar

[90] View Article

[91] PubMed/NCBI

[92] Google Scholar

[ref26] 26. Rahman SM, Kim JA. Recent advances in E. coli based therapeutic protein expression systems. Biotechnol Bioprocess Eng. 2024;29:1–15.
View Article
Google Scholar

[94] View Article

[95] Google Scholar

[ref27] 27. Gutierrez JM, Lewis NE. Optimizing eukaryotic cell hosts for protein production through systems biotechnology and genome-scale modeling. Biotechnol J. 2015;10(7):939–49. pmid:26099571
View Article
PubMed/NCBI
Google Scholar

[97] View Article

[98] PubMed/NCBI

[99] Google Scholar

[ref28] 28. Chen T, Guestrin C. XGBoost: A scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016. pp. 785–94.
View Article
Google Scholar

[101] View Article

[102] Google Scholar

[ref29] 29. Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. 2020;2(1):56–67. pmid:32607472
View Article
PubMed/NCBI
Google Scholar

[104] View Article

[105] PubMed/NCBI

[106] Google Scholar

[ref30] 30. Lundberg SM, Lee SI. A unified approach to interpreting model predictions. Adv Neural Inf Process Syst. 2017;30:4765–74.
View Article
Google Scholar

[108] View Article

[109] Google Scholar

[ref31] 31. Hon J, Marusiak M, Martinek T, Kunka A, Zendulka J, Bednar D, et al. SoluProt: prediction of soluble protein expression in Escherichia coli. Bioinformatics. 2021;37(1):23–8. pmid:33416864
View Article
PubMed/NCBI
Google Scholar

[111] View Article

[112] PubMed/NCBI

[113] Google Scholar

[ref32] 32. Magnan CN, Randall A, Baldi P. SOLpro: accurate sequence-based prediction of protein solubility. Bioinformatics. 2009;25(17):2200–7. pmid:19549632
View Article
PubMed/NCBI
Google Scholar

[115] View Article

[116] PubMed/NCBI

[117] Google Scholar

[ref33] 33. Siller E, DeZwaan DC, Anderson JF, Freeman BC, Barral JM. Slowing bacterial translation speed enhances eukaryotic protein folding efficiency. J Mol Biol. 2010;396(5):1310–8. pmid:20043920
View Article
PubMed/NCBI
Google Scholar

[119] View Article

[120] PubMed/NCBI

[121] Google Scholar

[ref34] 34. Spencer PS, Siller E, Anderson JF, Barral JM. Silent substitutions predictably alter translation elongation rates and protein folding efficiencies. J Mol Biol. 2012;422(3):328–35. pmid:22705285
View Article
PubMed/NCBI
Google Scholar

[123] View Article

[124] PubMed/NCBI

[125] Google Scholar

[ref35] 35. Mayer MP, Bukau B. Hsp70 chaperones: cellular functions and molecular mechanism. Cell Mol Life Sci. 2005;62(6):670–84. pmid:15770419
View Article
PubMed/NCBI
Google Scholar

[127] View Article

[128] PubMed/NCBI

[129] Google Scholar

[ref36] 36. Aslund F, Beckwith J. Bridge over troubled waters: sensing stress by disulfide bond formation. Cell. 1999;96(6):751–3. pmid:10102262
View Article
PubMed/NCBI
Google Scholar

[131] View Article

[132] PubMed/NCBI

[133] Google Scholar

[ref37] 37. Nasiri M, Babaie J, Amiri S, Azimi E, Shamshiri S, Khalaj V, et al. SHuffle™ T7 strain is capable of producing high amount of recombinant human fibroblast growth factor-1 (rhFGF-1) with proper physicochemical and biological properties. J Biotechnol. 2017;259:30–8. pmid:28827102
View Article
PubMed/NCBI
Google Scholar

[135] View Article

[136] PubMed/NCBI

[137] Google Scholar

[ref38] 38. Diankristanti PA, Effendi SSW, Hsiang C-C, Ng I-S. High-level itaconic acid (IA) production using engineered Escherichia coli Lemo21(DE3) toward sustainable biorefinery. Enzyme Microb Technol. 2023;167:110231. pmid:37003250
View Article
PubMed/NCBI
Google Scholar

[139] View Article

[140] PubMed/NCBI

[141] Google Scholar

[ref39] 39. Chhetri G, Kalita P, Tripathi T. An efficient protocol to enhance recombinant protein expression using ethanol in Escherichia coli. MethodsX. 2015;2:385–91. pmid:26629417
View Article
PubMed/NCBI
Google Scholar

[143] View Article

[144] PubMed/NCBI

[145] Google Scholar

[ref40] 40. Prasad S, Khadatare PB, Roy I. Effect of chemical chaperones in improving the solubility of recombinant proteins in Escherichia coli. Appl Environ Microbiol. 2011;77(13):4603–9. pmid:21551288
View Article
PubMed/NCBI
Google Scholar

[147] View Article

[148] PubMed/NCBI

[149] Google Scholar

[ref41] 41. Leandro P, Lechner MC, Tavares de Almeida I, Konecki D. Glycerol increases the yield and activity of human phenylalanine hydroxylase mutant enzymes produced in a prokaryotic expression system. Mol Genet Metab. 2001;73(2):173–8. pmid:11386853
View Article
PubMed/NCBI
Google Scholar

[151] View Article

[152] PubMed/NCBI

[153] Google Scholar

[ref42] 42. Schlegel S, Löfblom J, Lee C, Hjelm A, Klepsch M, Strous M, et al. Optimizing membrane protein overexpression in the Escherichia coli strain Lemo21(DE3). J Mol Biol. 2012;423(4):648–59. pmid:22858868
View Article
PubMed/NCBI
Google Scholar

[155] View Article

[156] PubMed/NCBI

[157] Google Scholar

[ref43] 43. Tegel H, Tourle S, Ottosson J, Persson A. Increased levels of recombinant human proteins with the Escherichia coli strain Rosetta(DE3). Protein Expr Purif. 2010;69(2):159–67. pmid:19733669
View Article
PubMed/NCBI
Google Scholar

[159] View Article

[160] PubMed/NCBI

[161] Google Scholar

[ref44] 44. Jørgensen R, Merrill AR, Andersen GR. The life and death of translation elongation factor 2. Biochem Soc Trans. 2006;34(Pt 1):1–6. pmid:16246167
View Article
PubMed/NCBI
Google Scholar

[163] View Article

[164] PubMed/NCBI

[165] Google Scholar

[ref45] 45. Dyson MR, Shadbolt SP, Vincent KJ, Perera RL, McCafferty J. Production of soluble mammalian proteins in Escherichia coli: identification of protein features that correlate with successful expression. BMC Biotechnol. 2004;4:32. pmid:15598350
View Article
PubMed/NCBI
Google Scholar

[167] View Article

[168] PubMed/NCBI

[169] Google Scholar

[ref46] 46. Sørensen HP, Mortensen KK. Soluble expression of recombinant proteins in the cytoplasm of Escherichia coli. Microb Cell Fact. 2005;4(1):1. pmid:15629064
View Article
PubMed/NCBI
Google Scholar

[171] View Article

[172] PubMed/NCBI

[173] Google Scholar

[ref47] 47. Francis DM, Page R. Strategies to optimize protein expression in E. coli. Curr Protoc Protein Sci. 2010;Chapter 5(1):5.24.1–5.24.29. pmid:20814932
View Article
PubMed/NCBI
Google Scholar

[175] View Article

[176] PubMed/NCBI

[177] Google Scholar

[ref48] 48. Berkmen M. Production of disulfide-bonded proteins in Escherichia coli. Protein Expr Purif. 2012;82(1):240–51. pmid:22085722
View Article
PubMed/NCBI
Google Scholar

[179] View Article

[180] PubMed/NCBI

[181] Google Scholar

[ref49] 49. Hatahet F, Ruddock LW. Protein disulfide isomerase: a critical evaluation of its function in disulfide bond formation. Antioxid Redox Signal. 2009;11(11):2807–50. pmid:19476414
View Article
PubMed/NCBI
Google Scholar

[183] View Article

[184] PubMed/NCBI

[185] Google Scholar

[ref50] 50. Ritz D, Beckwith J. Roles of thiol-redox pathways in bacteria. Annu Rev Microbiol. 2001;55:21–48. pmid:11544348
View Article
PubMed/NCBI
Google Scholar

[187] View Article

[188] PubMed/NCBI

[189] Google Scholar

[ref51] 51. Mamathambika BS, Bardwell JC. Disulfide-linked protein folding pathways. Annu Rev Cell Dev Biol. 2008;24:211–35. pmid:18588487
View Article
PubMed/NCBI
Google Scholar

[191] View Article

[192] PubMed/NCBI

[193] Google Scholar

[ref52] 52. Robinson PJ, Bulleid NJ. Mechanisms of disulfide bond formation in nascent polypeptides entering the secretory pathway. Cells. 2020;9(9):1994. pmid:32872499
View Article
PubMed/NCBI
Google Scholar

[195] View Article

[196] PubMed/NCBI

[197] Google Scholar

[ref53] 53. Feige MJ, Hendershot LM. Disulfide bonds in protein folding and stability. Methods Mol Biol. 2011;752:1–15.
View Article
Google Scholar

[199] View Article

[200] Google Scholar

[ref54] 54. Allured VS, Collier RJ, Carroll SF, McKay DB. Structure of exotoxin A of Pseudomonas aeruginosa at 3.0-Angstrom resolution. Proc Natl Acad Sci U S A. 1986;83(5):1320–4. pmid:3006045
View Article
PubMed/NCBI
Google Scholar

[202] View Article

[203] PubMed/NCBI

[204] Google Scholar

[ref55] 55. Wedekind JE, Trame CB, Dorywalska M, Koehl P, Raschke TM, McKee M, et al. Refined crystallographic structure of Pseudomonas aeruginosa exotoxin A and its implications for the molecular mechanism of toxicity. J Mol Biol. 2001;314(4):823–37. pmid:11734000
View Article
PubMed/NCBI
Google Scholar

[206] View Article

[207] PubMed/NCBI

[208] Google Scholar

[ref56] 56. Wozniak DJ, Hsu LY, Galloway DR. His‑426 of the Pseudomonas aeruginosa exotoxin A is required for ADP‑ribosyltransferase activity. J Bacteriol. 1988;170:1912–4.
View Article
Google Scholar

[210] View Article

[211] Google Scholar

[ref57] 57. de Marco A. Strategies for successful recombinant expression of disulfide bond-dependent proteins in Escherichia coli. Microb Cell Fact. 2009;8:26. pmid:19442264
View Article
PubMed/NCBI
Google Scholar

[213] View Article

[214] PubMed/NCBI

[215] Google Scholar

RETRACTED: Advanced machine learning-guided optimization platform for high-yield soluble expression of Pseudomonas aeruginosa exotoxin A in engineered Escherichia coli strains

RETRACTED: Advanced machine learning-guided optimization platform for high-yield soluble expression of Pseudomonas aeruginosa exotoxin A in engineered Escherichia coli strains

Retraction

Figures

Abstract

Background

Methods

Results

Conclusions

Introduction

Materials and methods

Plasmid construction and strain engineering

High-throughput expression screening

Solubility and activity analysis

Protein characterization

Machine learning model development

Statistical analysis

Results

Strain optimization reveals SHuffle T7 superiority

Temperature and redox synergy

Chaperone-mediated folding enhancement

Structural and functional validation

Predictive ML modeling

Discussion

Considerations for scalability and industrial translation

Biosafety and responsible research considerations

Limitations and future directions

Conclusions

Supporting information

S1 File. Supplementary figures and tables.

Acknowledgments

References