Figures
Abstract
The NADPH oxidase (NOX) family comprises integral membrane-bound enzymes responsible for generating reactive oxygen species (ROS), with critical roles in immune defense, vascular regulation, and cellular signaling. However, their intrinsic hydrophobicity and membrane association created longstanding challenges for extensive research. In this study, we applied the QTY code – a simple protein design strategy that replaces hydrophobic residues leucine (L), isoleucine (I), valine (V), and phenylalanine (F) with hydrophilic yet structurally compatible residues glutamine (Q), threonine (T) and tyrosine (Y) – to generate QTY analogs of NOX1, NOX2, NOX3, NOX4, NOX5, DUOX1, DUOXA1 and CYBA with reduced hydrophobicity. Using AlphaFold 3, we predicted and superposed the structures of native and QTY-engineered analog proteins. Our results show strong structural resemblance between each pair, with root mean square deviation (RMSD) values below 1Å for six out of eight proteins examined. In addition, QTY substitution significantly reduced surface hydrophobicity, indicating improved water-solubility while preserving 3D structural fold integrity. Our findings demonstrate the potential of QTY-designed NOX variants with reduced hydrophobicity as surrogates for use in structural biology, monoclonal antibody discoveries, drug discovery, and other applications where native membrane proteins present experimental limitations.
Citation: Hu T, Cheng R, Chen E, Zhang S (2026) Structural bioinformatic studies of eight integral transmembrane NADPH oxidases and their AlphaFold 3 predicted QTY analogs with reduced hydrophobicity. PLoS One 21(6): e0347525. https://doi.org/10.1371/journal.pone.0347525
Editor: Hyun Ho Park, Chung-Ang University, KOREA, REPUBLIC OF
Received: August 17, 2025; Accepted: April 3, 2026; Published: June 10, 2026
Copyright: © 2026 Hu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data is provided in the manuscript and the Supporting Information.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Transmembrane proteins are abundant (~26–30%) and are essential for mediating metabolism, cell signaling, transport, and other vital cellular functions [1]. Despite their importance, they remain poorly studied due to experimental challenges. For instance, while transmembrane proteins are essential in mediating the metastatic process, they are inadequately characterized [2].
Traditional protein structure determination relies on X-ray crystallography and NMR spectroscopy. Recently, cryo-electron microscopy (CryoEM) has emerged as the mainstream technique for studying protein structures at higher resolutions [3]. In this study, we incorporated CryoEM-determined structures for those proteins for which such data were available, with resolutions ranging from 2.30Å to 3.20Å; structural information for the remaining proteins was not yet available at the time of writing. Crystallization of integral proteins is particularly challenging due to their inherent hydrophobicity [4]. Detergents are often required to solubilize these proteins, but their use is expensive, especially in the large quantities required for structural studies [5]. Therefore, we present a structural bioinformatics approach to study transmembrane proteins.
We focus on six integral membrane proteins from the NOX (NADPH oxidase 1–5) family/ DUOX (dual oxidase) family, one protein from the DUOXA (dual oxidase maturation factor) family, and one protein from the p22phox (cytochrome b-245 light chain) family. These protein families are involved in electron transport across plasma membranes and serve as a source of reactive oxygen species (ROS) across diverse cell and tissue types [6], playing a crucial role in biological processes. Specifically, the DUOXA family facilitates DUOX maturation, while p22phox functions as a scaffold protein that assembles subunits necessary for NOX-mediated ROS generation [7,8].
The NOX/DUOX family comprises complex, multi-domain proteins that require the assembly of other proteins to function. Characterized for their role in producing ROS for microbial killing in innate immunity, NOX/DUOX proteins participate in diverse physiological functions including host defense, post-translational protein processing, and cell signaling [7,9]. The NOX/DUOX family includes seven catalytic homologs: NOX1, NOX2, NOX3, NOX4, NOX5, and the dual oxidases DUOX1 and DUOX2. Each homolog acts as a catalytic subunit of the NADPH oxidase complex responsible for ROS production.
The structure of a prototypical NOX enzyme, particularly NOX2 found in phagocytes, includes both membrane-bound and cytosolic components. The complete, active enzyme is assembled from several key components: three cytosolic proteins (p40phox, p47phox and p67phox), a membrane bound complex known as cytochrome b558 (composed of the catalytic NOX2 subunit gp91phox and the auxiliary p22phox subunit), and a small GTP-binding protein Rac1, which acts as a molecular switch to initiate activation [10]. Upon activation, the cytosolic components translocate to the membrane and interact with cytochrome b558, forming the active enzyme complex capable of producing superoxide by transferring electrons from NADPH to molecular oxygen [10].
All NADPH oxidases have a homologous catalytic subunit responsible for transferring electrons across the cell membrane and subsequent ROS production [10]. However, the requirement of subunits varies among family members: NOX1–3 require p22phox and the recruitment of cytosolic subunits p47, p67, p40, and Rac1, whereas NOX4 only requires p22phox [10].
While generally similar in structure, these different isoforms are found in different cell types and tissues throughout the human body, serving a diverse range of functions.
NOX1 is primarily expressed in the colon but can also be found in the stomach, uterus, prostate, and vascular smooth muscle cells [11,12]. While the exact physiological function of NOX1 remains unclear, it is hypothesized to play a role in host defense through bacterial killing by ROS production, as well as in signaling pathways where ROS functions as a signaling molecule [11,12]. Defects in NOX1 have been linked to very early onset inflammatory bowel disease (VEOIBD), which encompasses Crohn’s disease and ulcerative colitis [13].
NOX2, the most well-studied NADPH oxidase, is expressed in phagocytic cells such as neutrophils and macrophages and plays a vital role in the human immune system. Like most other NADPH oxidases, NOX2 remains largely dormant under normal circumstances, becoming only activated during respiratory burst–a rapid release of ROS aimed at destroying microbial pathogens. Mutations in NOX2 have been linked to chronic granulomatous disease (CGD) [10].
NOX3 is expressed almost exclusively in the inner ear, where it functions in both the auditory and vestibular systems and is linked to the biosynthesis of otoconia [14].
NOX4 is primarily expressed in the kidney, although it is widely expressed and can also be found in many cell types throughout the body, including the heart, pancreas, osteoclasts, and vasculature [10]. It is thought to be constitutively active, with studies reporting that it produces hydrogen peroxide rather than superoxide as its main ROS product [15,16]. Unlike other isoforms, NOX4 does not require cytosolic activators beyond p22 [10].
The NOX5 protein, a member of the NOX/DUOX family, is expressed in thyroid, spermatocytes of testis, B- and T-lymphocyte-rich areas of spleen, and lymph nodes [16]. NOX5 contains four EF hand motifs with four Ca2+ binding sites at its N-terminus, regulated by intracellular Ca2+ concentration [17,18]. NOX5 has a NADPH binding site at the C-terminus [19]. Increase in NOX5 is associated with different oxidative stress-related pathologies, including cancer and cardiovascular diseases [20]. However, because rodents lack the genes encoding NOX5 and human NOX proteins have yet to be crystallized, NOX5’s functionality remains poorly studied [19,20]. Upon Ca2+ activation, NOX5 functions as a proton channel [21].
DUOX1, another member of the NOX/DUOX family, is a peroxidase that catalyzes the hydrogen peroxide production [22]. DUOX1 is found in tissues including the thyroid and respiratory tract, and plays a crucial role in thyroid hormone synthesis [22]. DUOX1 expression is often associated with disease pathophysiology [23]. DUOX1 also contains an N-terminal extracellular peroxidase homologous domain with two Ca++ EF-hand binding sites [24]. DUOX1 produces H2O2 extracellularly, which is essential for thyroid hormone synthesis [24].
DUOXA1, a member of the DUOXA family, is a maturation factor for DUOX1, together forming an active enzyme complex [22]. Structurally, the DUOXA1 protein comprises five helical transmembrane regions [23]. DUOXA1 facilitates DUOX1 maturation by binding to immature DUOX1 in the endoplasmic reticulum to assist in proper folding and glycosylation [23]. Together with DUOX1, this enzyme complex produces H2O2, a key component in thyroid hormone synthesis [24].
The p22phox protein family, when coupled with NOX2, forms the phagocyte NADPH oxidase protein complex [25]. This complex produces superoxide anions, which are essential for innate immunity [25]. p22phox, a phosphorylated protein, directly promotes NADPH oxidase activity [26]. Moreover, NOX3 biosynthesis requires p22phox, which promotes NOX3 glycosylation and maturation [27].
CYBA (Cytochrome b-245 light chain) protein, a member of the p22phox family, is a subcomponent of the NOX2 enzyme [28,29]. CYBA is expressed in phagocytes, where it is responsible for producing superoxide in response to bacterial and fungal infections [30].
Given their diverse biological roles and implications in a wide range of diseases, the NADPH oxidase enzymes are of considerable biomedical interest. While there have been many advancements in protein structure technology, such as CryoEM, detailed structural studies of NADPH remain limited due to the inherent properties of membrane proteins, notably their partially hydrophobic surfaces. Consequently, detergents are needed for protein solubilization during experimentation, which are expensive [4].
To address this challenge, our current study applies the QTY code design, a simple protein engineering method that replaces hydrophobic residues without dramatically altering the protein structure. L, I, V, and F are replaced with the hydrophilic but structurally similar amino acids: Q, T, and Y, respectively [31,32]. A previous study has shown that applying the QTY code to several chemokine receptors does not significantly impact their thermostabilities, α-helical structures, or ligand-binding activities [31].
AlphaFold 3 is the latest generation of deep learning models developed by Google DeepMind for predicting protein structures, interactions, and complexes with unprecedented accuracy. Unlike its predecessors, AlphaFold 3 extends beyond predicting individual protein structures to model ligands, ions, nucleic acids, and modified residues [33]. This enhanced capability enables high-confidence structural predictions of both native and QTY-variant forms of the NOX enzymes in our study–proteins that are challenging to study experimentally due to their membrane-embedded nature. However, the AlphaFold 3 model has limitations, underscoring the need for experimental validation of computational results.
The application of AlphaFold 3 to QTY code has expanded across various protein families. Previous studies used AlphaFold 2 to predict QTY protein analog folding in native chemokine receptors [34], glucose transporters [35], human solute carrier transporters [36], ABC transporters [32], serotonin, dopamine, and norepinephrine transporters [37], and glutamate transporters [38]. Additionally, reverse QTY code has been applied to human albumin to improve its alpha helices’ hydrophobicity [39]. The CXCR4QTY construct has found practical application in biometric sensor building [40].
The advancement of AlphaFold 3 has significantly improved prediction accuracy while expanding its capability to predict proteins’ interactions with ligands, RNA, and other molecules [41]. Recent application of the QTY code with AlphaFold 3 predictions has been successfully used in predicting the binding mechanisms of FACE1 and STEA4 [42].
Results & Discussion
NADPH oxidase protein sequence alignments and other characteristics
The protein sequences for eight NADPH oxidases were aligned with their QTY variants (Fig 1, Figure S1, Figure S2, Figure S3 in S1 File). Overall amino acid sequence changes ranged from 4.97% to 16.49% (Fig 1, Table 1), while transmembrane region change ranges from 40.00% to 52.38% (Fig 1, Table 1). Notably, the protein isoelectric point (pI) remains roughly unchanged, as the substituted amino acids Q (glutamine), T (threonine), and Y (tyrosine) are all neutral amino acids, exerting no effect on the protein acidity or basicity.
The symbols | and * indicate whether amino acids are identical or different, respectively. Q replaces L, T replaces V and I, and Y replaces F. The alpha helices (colored in blue) are shown above the protein sequences. The characteristics of native and QTY variants listed are isoelectric point (pI), molecular weight (MW), total variation % and transmembrane variation %. The alignments are A) NOX1 vs NOX1QTY, B) NOX2 vs NOX2QTY, C) NOX3 vs NOX3QTY, D) NOX4 vs NOX4QTY, E) NOX5 vs NOX5QTY, F) DUOX1 vs DUOX1QTY, G) DUOXA1 vs DUOXA1QTY, H) CYBA vs CYBAQTY. Compared to the native, the QTY variants show significant changes, ranging from 40.00% to 52.38%, in the TM region without significant changes in pI and MW.
The molecular weight (MW) changes were minimal due to the similar masses of substituted amino acid pairs. Leucine (MW: 131.17 Da) is lighter than Q (MW: 146.14 Da), isoleucine (MW 131.17 Da) is slightly heavier than T (MW: 119.12 Da), valine (MW: 117.15 Da) is slightly lighter than T (MW: 119.12 Da), and phenylalanine (MW: 165.19 Da) is slightly lighter than Y (MW: 181.19 Da). These effects result in negligible overall molecular weight variations.
Accuracy assessment in AlphaFold 3 predictions
AlphaFold 3 uses several metrics to measure its protein folding accuracy: predicted local distance difference test (pLDDT), predicted aligned matrix (PAE), and predicted template modeling score (pTM).
pLDDT metric quantifies confidence in folding at individual residue positions [41]. The dark blue regions, where the pLDDT score > 90, represent highly confident folding. Blue regions, with pLDDT score < 90 and > 70, signify confident regions. The yellow regions, pLDDT < 70 and > 50, demonstrate moderate confidence in the prediction, and orange regions, pLDDT < 50, show low confidence. Our results reveal that transmembrane regions in QTY variants achieve high confidence predictions, with only few terminal ribbon-like unstructured loops exhibiting lower confidence scores. These unstructured loops were removed from the final models to enhance clarity (Fig 2).
Predicted aligned error (PAE) matrices show predominantly low error (dark green), indicating high confidence in residue positioning, with few regions of increased error. Transmembrane regions were predicted with predominantly high to very high confidence (blue and dark blue shaded regions), while some regions show lower confidence (yellow to orange). In contrast to the others, NOX5 and DUOX1 display extended regions of reduced confidence, indicating increased structural uncertainty. Overall high pTM scores indicate high prediction accuracy and strong resemblance to known experimentally determined protein structures. A) NOX1AlphaFold 3, B) NOX2AlphaFold 3, C) NOX3AlphaFold 3, D) NOX4AlphaFold 3, E) NOX5AlphaFold 3, F) DUOX1AlphaFold 3, G) DUOXA1AlphaFold 3, H) CYBAAlphaFold 3.
PAE measures the error between each pair of amino acid residues [43] represented as a symmetrical matrix where rows and columns correspond to residue pairs. Diagonal elements reflect local confidence in residue placement. Darker green regions in the PAE matrix indicate low PAE values. Our results predominantly consist of darker green regions, with the exception occurring in unstructured loop regions (Fig 2).
pTM measures the overall protein folding quality by assessing the similarity between the true structure and the predicted structure [43]. pTM ranges from 0 to 1, in which 1 indicates a perfect structural alignment. Our results achieved an average pTM score of 0.84 (Fig 2), indicating high accuracy and strong resemblance to known experimentally determined protein structures.
Superpositions of native NADPH oxidase structures and their AlphaFold 3 predicted QTY variants with reduced hydrophobicity
We superposed five experimentally determined CryoEM NADPH oxidase structures with their AlphaFold 3-predicted QTY analogs: NOX2 (PDB: 7U8G) [10], NOX5 (PDB: 8U85) [21], DUOX1 (PDB: 7D3E) [44], DUOXA1 (PDB: 7D3F) [44], CYBA (PDB: 8WEJ) [25]. NOX1, NOX3, and NOX4 were excluded due to the absence of experimentally determined structures at the time of writing. The superpositions of the transmembrane enzymes and their respective QTY variants are: NOX2CryoEM vs NOX2QTY, NOX5CryoEM vs NOX5QTY, DUOX1CryoEM vs DUOX1QTY, DUOXA1CryoEM vs DUOXA1QTY, and CYBACryoEM vs CYBAQTY (Fig 3).
NADPH AlphaFold 3 predicted reduced hydrophobicity QTY variants were compared to their CryoEM determined native structures, when available. NOX1, NOX3, and NOX4 were superposed with their AlphaFold 3 predicted native structures because they lack experimentally determined CryoEm structures at the time of writing. AlphaFold 3 predicted native structures are shown in green, and CryoEM determined structures are shown in magenta, while their AlphaFold predicted QTY variants are displayed in cyan. All CryoEM structures were obtained from the Protein Data Bank (PDB). The similarities between structures as seen in the superpositions show that QTY variants and the native proteins have comparable structures. Unstructured loops in the AlphaFold 3 predicted variants have been removed. A) NOX1AlphaFold 3 vs NOX1QTY, B) NOX2CryoEM vs NOX2QTY, C) NOX3AlphaFold 3 vs NOX3QTY, D) NOX4AlphaFold 3 vs NOX4QTY, E) NOX5CryoEM vs NOX5QTY, F) DUOX1CryoEM vs DUOX1QTY, G) DUOXA1CryoEM vs DUOXA1QTY, H) CYBACryoEM vs CYBAQTY.
The QTY variants generally superposed well with their respective native counterparts. The RMSDs range from 0.36Å to 4.96Å, with all pairs except NOX5 vs NOX5QTY and DUOX1 vs DUOX1QTY achieving RMSD below 1Å (Table 1, Fig 1). Despite experiencing a significant percentage of amino acid substitutions (40.00%−52.38%) in the transmembrane region, the proteins maintained similar structures (Fig 3), confirming notable structural conservation between native NADPH oxidases and their predicted QTY variants with reduced hydrophobicity.
The elevated RMSD score for NOX5CryoEM vs NOX5QTY pair likely reflects reduced confidence in the AlphaFold 3 prediction. Specifically, residues R196-R222 and S678-S705 contain alpha helices with confidence scores between the moderate (50 < pLDDT < 70) and low (pLDDT < 50) ranges (Fig 2). Therefore, the high RMSD of NOX5 may be prone to an inaccurate AlphaFold 3 result.
To assess proteins that lacked experimental structures (NOX1, NOX3, NOX4), we superposed the AlphaFold 3 predicted native structures with their AlphaFold 3 predicted QTY variants with reduced hydrophobicity (Fig 3). As shown in Fig 3, the AlphaFold 3 predicted structures superpose very well. The visual evaluation is also supported by RMSD values: a) NOX1Native vs NOX1QTY (RMSD = 0.53 Å), c) NOX3Native vs NOX3QTY (RMSD = 0.59 Å), d) NOX4Native vs NOX4QTY (RMSD = 0.46 Å). These results confirm high structural similarity between native NADPH oxidases and their predicted QTY variants with reduced hydrophobicity.
Superpositions of CryoEM structures with AlphaFold 3 predicted native structures and their QTY analogs with reduced hydrophobicity
We performed comprehensive superpositions incorporating i) the experimentally determined CryoEM native NADPH oxidases with ii) AlphaFold 3 predicted native NADPH oxidases and iii) AlphaFold 3 predicted QTY variants with reduced hydrophobicity. Excellent superpositions are demonstrated across all proteins except for NOX5, demonstrating both AlphaFold 3’s accurate predictions and the possible utility of QTY analogs with reduced hydrophobicity of NADPH oxidases in medicine and drug design (Fig 4). The low quality of the NOX5 alignment likely arises from the low folding confidence of the AlphaFold 3 model, highlighting the limitation that protein prediction models may not accurately reflect true protein structures. Accordingly, the QTY analog of NOX5 should be interpreted with caution, serving as a warrant case for future QTY analog investigation.
Superpositions of i) the experimentally determined CryoEM native structures (magenta), ii) AlphaFold 3 predicted native structures (green), iii) AlphaFold 3 predicted QTY variants (cyan). These three different kinds of structures are superposed very well. Differences and variation are largely insignificant. The quality of superpositions demonstrates both the accuracy of AlphaFold 3 as well as the feasibility of utilizing reduced hydrophobicity QTY variants. NOX1, NOX3, and NOX4 are not displayed as they lack experimentally determined CryoEM structures at the time of writing. A) NOX2CryoEM vs NOX2QTY, B) NOX5CryoEM vs NOX5QTY, C) DUOX1CryoEM vs DUOX1QTY, D) DUOXA1CryoEM vs DUOXA1QTY, E) CYBACryoEM vs CYBAQTY.
Analysis of the hydrophobic surface of native NADPH oxidases and their QTY analogs with reduced hydrophobicity
The eight transmembrane NADPH oxidases included in our study are hydrophobic and insoluble in water. Experimental characterization requires detergent-mediated isolation from lipid layers, in which the detergent disrupts the hydrophobic bonds that link the enzymes to the lipid bilayer. Without the proper detergents, proteins aggregate and precipitate, losing their biological functionality. Yellow-colored hydrophobic surfaces represent areas in the transmembrane domain that traverse the hydrophobic lipid bilayer (Fig 5). This region contains hydrophobic, nonpolar amino acids including Leucine (L), Isoleucine (I), Valine (V), Phenylalanine (F), Methionine (M), Tryptophan (W), and Alanine (A), which exclude water through lipid interactions.
The native NADPH oxidases have hydrophobic residues L, I, V, and F in the transmembrane regions. After applying the QTY code, the variant proteins have had these hydrophobic residues substituted for hydrophilic ones. Q replaces L, T replaces V and I, and Y replaces F. Hydrophobic regions (yellow) become hydrophilic (cyan) as a result of applying the QTY code. Unstructured loops have been removed for clarity of direct comparisons. A) NOX1AlphaFold 3 vs NOX1QTY, B) NOX2CryoEM vs NOX2QTY, C) NOX3Alpha Fold 3 vs NOX3QTY, D) NOX4AlphaFold 3 vs NOX4QTY, E) NOX5CryoEM vs NOX5QTY, F) DUOX1CryoEM vs DUOX1QTY, G) DUOXA1CryoEM vs DUOXA1QTY, H) CYBACryoEM vs CYBAQTY.
QTY code, replacing hydrophobic residues (L, I/V, F) with hydrophilic, polar counterparts (Q, T, Y), significantly reduces hydrophobic surface areas. This is seen in the decreased yellow regions coupled and increase in blue regions exhibited in the QTY variants compared to their native structures (Fig 5). Importantly, the QTY code substitution also did not compromise the alpha-helical structures of the NADPH oxidases. This is consistent with findings from previous studies, notably one on QTY variants of chemokine and cytokine receptors that showed QTY analogs keeping their thermostabilities, ligand binding activities, and alpha-helical structures despite becoming hydrophilic [31].
At each residue, CamSol intrinsic solubility profiles show a significant reduction in extended aggregation-prone regions across all eight proteins’ QTY analogs, as shown by fewer and shorter stretches of red sections with strongly negative CamSol scores and more stretches of blue sections with strongly positive CamSol scores (Fig 6). All eight proteins’ QTY analogs showed an increase in overall solubility score, suggesting reduced hydrophobicity. This shift towards positive intrinsic solubility values demonstrates a reduced propensity for hydrophobic-driven aggregation and improved compatibility with aqueous environments.
The red regions represent highly insoluble aggregation-prone regions of proteins, while blue regions represent highly soluble and aggregation-resistant regions. All eight proteins showed reduced red and increased blue regions after application of the QTY code, suggesting reduced hydrophobicity. A) NOX1Native vs NOX1QTY, B) NOX2Native vs NOX2QTY, C) NOX3Native vs NOX3QTY, D) NOX4Native vs NOX4QTY, E) NOX5Native vs NOX5QTY, F) DUOX1Native vs DUOX1QTY, G) DUOXA1Native vs DUOXA1QTY, H) CYBANative vs CYBAQTY.
AlphaFold 3 predictions
This study relied extensively on AlphaFold 3, released by DeepMind in May 2024. Unlike AlphaFold 2, this advanced model predicts diverse biomolecules besides proteins including ligands, ions, modified residues and nucleic acids [33]. AlphaFold 3 outperforms traditional docking methods such as Vina and RoseTTAFold All-Atom [33].
The AlphaFold 3 server (https://alphafoldserver.com) provides free, fast, and user-friendly access with comprehensive guidance for protein folding prediction and analysis. All our QTY analogs were predicted using this platform.
Despite being a powerful tool, AlphaFold 3 exhibits limitations. The model occasionally suffers from hallucinations–plausible but nonexistent structures [33]. Specifically in this study, the low confidence of NOX5QTY Alpha Fold 3 folding made proceeding superposition analysis challenging. While hallucination is less frequent compared to AlphaFold 2, the issue pertains. Additionally, AlphaFold 3 struggles with intrinsically disordered regions and multi-state conformations [45]. Many proteins exist in multiple conformation or lack stable forms under specific physiological conditions [46,47]. We look for the AlphaFold 4 release in near future that may address the limitations.
In our result, AlphaFold 3 produces generally high pLDDT values, suggesting confident predictions. However, since pLDDT derives from training data, applicability under actual biological or experimental conditions may vary [48]. As a result, while AlphaFold 3 provides useful structural insights, experimental validation remains crucial for confirming computational predictions.
Transmembrane NADPH oxidases investigated in this study
We investigated QTY analogs across the NADPH oxidase family and proteins supporting the oxidases’ structure and function. Results demonstrate QTY code efficacy through excellent structural superpositions between native structures and their QTY analogs. The eight studied proteins (NOX1, NOX2, NOX3, NOX4, NOX5, DUOX1, DUOXA1, and CYBA) all show medical relevance. While each homologue has a unique biological role, the NADPH oxidase family collectively regulates ROS production and impacts cancer progression.
The QTY analogs predicted in this study with reduced hydrophobicity offer several potential applications. Firstly, the QTY analogs can be purified in large quantities, facilitating drug design. Additionally, our results show a generalized pipeline for analyzing and implementing QTY code across diverse transmembrane proteins.
Conclusion
Natural proteins can be divided into two distinct classes: hydrophilic proteins (Class I) and hydrophobic proteins (Class II) [49]. Their constituent helices divide into three chemically distinct yet structurally similar types: i) the type I alpha helix is hydrophilic and highly water-soluble, containing hydrophilic amino acids D, E, N, Q, K, R, S, T, and Y, typically found in water-soluble proteins, ii) the type II alpha helix is hydrophobic and often spans across membranes, containing hydrophobic amino acids L, I, V, F, M, P, W, and A, and is often found in the transmembrane segments of membrane proteins, iii) the type III alpha helix is composed of nearly equal amounts of hydrophobic and hydrophilic amino acids, which are often separated into hydrophobic and hydrophilic faces [50].
In this study, we successfully applied the QTY code to eight NADPH oxidase transmembrane proteins, generating QTY analogs with reduced hydrophobicity. We employed AlphaFold 3 to predict the QTY variant structure, performed structural superposition with the native NADPH oxidase structure, and conducted bioinformatic analysis of structural properties and hydrophobicity.
Our analyses suggest that QTY analogs exhibit overall structural similarity to their native proteins in AlphaFold 3 predicted models despite various amino acid substitutions. Application of the QTY code to NADPH oxidases markedly reduced predicted hydrophobic surface regions while largely preserving the global fold. These results indicate that hydrophilic variants of NADPH oxidases could potentially serve as soluble surrogates for structural and functional studies, particularly in aqueous environments where native membrane proteins present experimental challenges. However, because these observations rely on computational structure predictions, experimental validation will be required to determine whether the QTY variants maintain structural stability and functional activity.
Beyond basic research applications, reduced hydrophobicity QTY variants may offer a potential approach for generating soluble models of membrane proteins that could assist future studies in biotechnology or drug discovery. If experimentally validated, such hydrophilic variants could potentially aid efforts such as antibody discovery or the design of inhibitors targeting disease-associated NOX activity, helping to circumvent some experimental challenges associated with native membrane-bound forms.
Future studies should examine whether QTY analogs can maintain their interaction with other proteins successfully despite reduced hydrophobicity. For instance, NOX2’s interaction with p22 relies on hydrophobic electrostatic forces [10]. The balance between preserving the original structure and interaction, and reducing hydrophobicity for experimental purposes, will be of interest in future studies. Additionally, different protein folding tools can be used to validate AlphaFold 3’s prediction of the QTY analog to mitigate its hallucination effects. For instance, D-I-TASSER, a deep learning model building on top of the threading model I-TASSER, has been shown to outperform AlphaFold 3 and AlphaFold 2 in CASP protein prediction [49].
Overall, our findings show that QTY protein analogs maintain high structural similarity to the native protein structures. The QTY code successfully reduced the hydrophobicity of transmembrane proteins while maintaining their original structure. These NADPH oxidase QTY analogs may serve useful for cancer treatment monitoring via ROS control.
Methods
The rationale of the QTY code
Purifying transmembrane proteins presents significant challenges, requiring detergents and extensive time investment. The QTY code stems from the structural similarity in the density maps of amino acids Q and L, T and I/V, and Y and F [31]. This code significantly reduces the hydrophobicity of transmembrane proteins through substantial amino acid substitutions.
The QTY code aims to engineer transmembrane proteins into their analogs with reduced hydrophobicity by substituting hydrophobic amino acids with hydrophilic ones: L with Q, I and V with T, and F with Y [32]. Despite extensive sequence modification in the eight transmembrane proteins, their QTY variants maintain similar pI and molecular weights (MW) (Table 2).
Protein sequences and other characteristics
The native protein sequences for the studied NOX enzymes NOX1, NOX2, NOX3, NOX4, NOX5, DUOX1, DUOXA1, and CYBA were obtained from UniProt (https://www.uniprot.org). The QTY code was applied to the native proteins using the Protein Solubilizing Server (https://pss.sjtu.edu.cn/). The MWs, pI values, TM variation, overall variation, and sequence alignments for the proteins are computed via Expasy (https://web.expasy.org/compute_pi/).
AlphaFold 3
The protein structures NOX1QTY, NOX2QTY, NOX3QTY, NOX4QTY, NOX5QTY, DUOX1QTY, DUOXA1QTY, and CYBAQTY were predicted using AlphaFold 3 (https://alphafoldserver.com). First, the QTY sequence was generated from the Protein Solubilizing Server described above and then entered into AlphaFold 3 to generate the predicted structure. QTY protein analog structures are predicted using the AlphaFold 3 Server v3.0.1. Each prediction is run in the monomer mode, using the server’s default MSA generation and recycle settings. Each prediction is run with one random seed (NOX1 = 740489916; NOX2 = 153345029; NOX3 = 1088284846; NOX4 = 2115711105; NOX5 = 1970948703, DUOX1 = 1666590488; DUOXA1 = 393896960; CYBA = 1060114576). Structure templates are enabled, and no additional user-defined parameters are applied. We have uploaded our QTY-designed FASTA sequences as well as our AlphaFold 3 output data and predicted structure files to a GitHub repository for reproducibility purposes. The sequences and files can be accessed at this link: https://github.com/rickhcheng/qty_proteins.
Superposed structures
Native structures for proteins lacking an experimentally determined structure in this study (NOX1, NOX3, NOX4) were generated using the AlphaFold 3 model (https://alphafoldserver.com). Native structures for proteins with experimentally determined structures were obtained from the PDB. NOX2 (PDB: 7U8G), NOX5 (PDB:8U85), DUOX1 (PDB:7D3E), DUOXA1 (PDB:7D3F), and CYBA (PDB:8WEJ). QTY variant structures were also predicted using AlphaFold 3 (https://alphafoldserver.com). The structures were then superposed with the structure of the native protein and the RMSDs were calculated using PyMOL (https://pymol.org/).
Structure visualization
Superpositions between the QTY variants and their native protein structures were done using PyMOL (https://pymol.org/). Protein hydrophobicity figures were generated using UCSF Chimera (https://www.rbvi.ucsf.edu/chimera/).
Hydrophobicity Analysis
CamSol Intrinsic prediction (https://www-cohsoftware.ch.cam.ac.uk/index.php/camsolintrinsic) is used to assess proteins’ intrinsic solubility and aggregated propensity. One score will be assigned to each residue, creating a solubility profile where scores higher than 1 denote highly soluble regions; scores lower than –1 denote poorly soluble ones. An overall solubility score will be assigned, where higher scores indicate better solubility.
Supporting information
S1 File. The Support information provides: 1) enlarged individual alignment of each membrane proteins from figure 1 so readers can see clearly the sequence alignment, 2) the 2D topology of each membrane protein and the transmembrane domain of alpha-helices, 3) bioinformatics of membrane protein hydrophobicity before and after application of the QTY code.
The Support figures further provides information on how QTY code works and why is works. These figures provide readers better understanding the QTY code. S1 Fig. The enlarged protein sequence alignments of eight native NADPH oxidases with their water-soluble QTY variants from Figure 1. The symbols | and * indicate whether amino acids are identical or different, respectively. Q replaces L, T replaces V and I, and Y replaces F. The alpha helices (colored in blue) are shown above the protein sequences. The characteristics of native and QTY variants listed are isoelectric focusing (pI), molecular weight (MW), total variation % and transmembrane variation %. The alignments are a) NOX1 vs NOX1QTY, b) NOX2 vs NOX2QTY, c) NOX3 vs NOX3QTY, d) NOX4 vs NOX4QTY, e) NOX5 vs NOX5QTY, f) DUOX1 vs DUOX1QTY, g) DUOXA1 vs DUOXA1QTY, h) CYBA vs CYBAQTY. Compared to the native, the QTY variants show significant changes, ranging from 40.00% to 52.38%, in the TM region without significant changes in pI and MW. S2 Fig. Membrane topology of eight NADPH oxidases. Topological structures were generated via Protter. The cell membrane is colored orange boarded by black lines. Topologies include: a) NOX1, b) NOX2, c) NOX3, d) NOX4, e) NOX5, f) DUOX1, g) DUOXA1, h) CYBA. S3 Fig. Transmembrane helix predictions of native and QTY analogs. Predictions were generated using DeepTMHMM. Predictions include a) NOX1Native vs NOX1QTY, b) NOX2Native vs NOX2QTY, c) NOX3Native vs NOX3QTY, d) NOX4Native vs NOX4QTY, e) NOX5Native vs NOX5QTY, f) DUOX1Native vs DUOX1QTY, g) DUOXA1Native vs DUOXA1QTY, h) CYBANative vs CYBAQTY.
https://doi.org/10.1371/journal.pone.0347525.s001
(PDF)
References
- 1. Levental I, Lyman E. Regulation of membrane protein structure and function by their lipid nano-environment. Nat Rev Mol Cell Biol. 2023;24(2):107–22. pmid:36056103
- 2. Marx S, Dal Maso T, Chen J-W, Bury M, Wouters J, Michiels C, et al. Transmembrane (TMEM) protein family members: Poorly characterized even if essential for the metastatic process. Semin Cancer Biol. 2020;60:96–106. pmid:31454669
- 3. Bai X, McMullan G, Scheres SHW. How cryo-EM is revolutionizing structural biology. Trends Biochem Sci. 2015;40(1):49–57. pmid:25544475
- 4. Carpenter EP, Beis K, Cameron AD, Iwata S. Overcoming the challenges of membrane protein crystallography. Curr Opin Struct Biol. 2008;18(5):581–6. pmid:18674618
- 5. Anandan A, Vrielink A. Detergents in Membrane Protein Purification and Crystallisation. Adv Exp Med Biol. 2016;922:13–28. pmid:27553232
- 6. Sirokmány G, Donkó Á, Geiszt M. Nox/Duox Family of NADPH Oxidases: Lessons from Knockout Mouse Models. Trends Pharmacol Sci. 2016;37(4):318–27. pmid:26861575
- 7. Vermot A, Petit-Härtlein I, Smith SME, Fieschi F. NADPH Oxidases (NOX): An Overview from Discovery, Molecular Mechanisms to Physiology and Pathology. Antioxidants (Basel). 2021;10(6):890. pmid:34205998
- 8. Morand S, Ueyama T, Tsujibe S, Saito N, Korzeniowska A, Leto TL. Duox maturation factors form cell surface complexes with Duox affecting the specificity of reactive oxygen species generation. FASEB J. 2009;23(4):1205–18. pmid:19074510
- 9. Rada B, Leto TL. Oxidative innate immune defenses by Nox/Duox family NADPH oxidases. Contrib Microbiol. 2008;15:164–87. pmid:18511861
- 10. Noreng S, Ota N, Sun Y, Ho H, Johnson M, Arthur CP, et al. Structure of the core human NADPH oxidase NOX2. Nat Commun. 2022;13(1):6079. pmid:36241643
- 11. Krause K-H. Tissue distribution and putative physiological function of NOX family NADPH oxidases. Jpn J Infect Dis. 2004;57(5):S28-9. pmid:15507765
- 12. Yin W, Voit EO. Function and design of the Nox1 system in vascular smooth muscle cells. BMC Syst Biol. 2013;7:20. pmid:23497394
- 13. Hayes P, Dhillon S, O’Neill K, Thoeni C, Hui KY, Elkadri A, et al. Defects in NADPH Oxidase Genes NOX1 and DUOX2 in Very Early Onset Inflammatory Bowel Disease. Cell Mol Gastroenterol Hepatol. 2015;1(5):489–502. pmid:26301257
- 14. Bánfi B, Malgrange B, Knisz J, Steger K, Dubois-Dauphin M, Krause K-H. NOX3, a superoxide-generating NADPH oxidase of the inner ear. J Biol Chem. 2004;279(44):46065–72. pmid:15326186
- 15. Nisimoto Y, Diebold BA, Cosentino-Gomes D, Lambeth JD. Nox4: a hydrogen peroxide-generating oxygen sensor. Biochemistry. 2014;53(31):5111–20. pmid:25062272
- 16. Takac I, Schröder K, Zhang L, Lardy B, Anilkumar N, Lambeth JD, et al. The E-loop is involved in hydrogen peroxide formation by the NADPH oxidase Nox4. J Biol Chem. 2011;286(15):13304–13. pmid:21343298
- 17. Bánfi B, Molnár G, Maturana A, Steger K, Hegedûs B, Demaurex N, et al. A Ca(2+)-activated NADPH oxidase in testis, spleen, and lymph nodes. J Biol Chem. 2001;276(40):37594–601. pmid:11483596
- 18. Wei C-C, Hay E, Smith D, Lloyd L, Acharya G, Ngo R. Binding of Nox5’s EF-Hand domain to the peptides corresponding to the phosphorylatable region and regulatory inhibitory loop in its dehydrogenase domain. Biophys Chem. 2020;262:106379. pmid:32339785
- 19. Touyz RM, Anagnostopoulou A, Rios F, Montezano AC, Camargo LL. NOX5: Molecular biology and pathophysiology. Exp Physiol. 2019;104(5):605–16. pmid:30801870
- 20. García JG, Ansorena E, Izal I, Zalba G, de Miguel C, Milagro FI. Structure, regulation, and physiological functions of NADPH oxidase 5 (NOX5). J Physiol Biochem. 2023;79(2):383–95. pmid:36905456
- 21. Cui C, Jiang M, Jain N, Das S, Lo Y-H, Kermani AA, et al. Structural basis of human NOX5 activation. Nat Commun. 2024;15(1):3994. pmid:38734761
- 22. Sun J. Structures of mouse DUOX1-DUOXA1 provide mechanistic insights into enzyme activation and regulation. Nat Struct Mol Biol. 2020;27(11):1086–93. pmid:32929281
- 23. Ashtiwi NM, Sarr D, Rada B. DUOX1 in mammalian disease pathophysiology. J Mol Med (Berl). 2021;99(6):743–54. pmid:33704512
- 24. Miot F, De Deken X. DUOX1 and DUOX2, DUOXA1 and DUOXA2. NADPH Oxidases Revisited: From Function to Structure. Springer International Publishing. 2023. p. 229–45.
- 25. Liu X, Shi Y, Liu R, Song K, Chen L. Structure of human phagocyte NADPH oxidase in the activated state. Nature. 2024;627(8002):189–95. pmid:38355798
- 26. Lewis EM, Sergeant S, Ledford B, Stull N, Dinauer MC, McPhail LC. Phosphorylation of p22phox on threonine 147 enhances NADPH oxidase activity by promoting p47phox binding. J Biol Chem. 2010;285(5):2959–67. pmid:19948736
- 27. Nakano Y, Banfi B, Jesaitis AJ, Dinauer MC, Allen L-AH, Nauseef WM. Critical roles for p22phox in the structural maturation and subcellular targeting of Nox3. Biochem J. 2007;403(1):97–108. pmid:17140397
- 28. Kerepesi C, Daróczy B, Sturm Á, Vellai T, Benczúr A. Prediction and characterization of human ageing-related proteins by using machine learning. Sci Rep. 2018;8(1):4094. pmid:29511309
- 29. Woo J, Williams SM, Markillie LM, Feng S, Tsai C-F, Aguilera-Vazquez V, et al. High-throughput and high-efficiency sample preparation for single-cell proteomics using a nested nanowell chip. Nat Commun. 2021;12(1):6246. pmid:34716329
- 30. Stasia MJ. CYBA encoding p22(phox), the cytochrome b558 alpha polypeptide: gene structure, expression, role and physiopathology. Gene. 2016;586(1):27–35. pmid:27048830
- 31. Zhang S, Tao F, Qing R, Tang H, Skuhersky M, Corin K, et al. QTY code enables design of detergent-free chemokine receptors that retain ligand-binding activities. Proc Natl Acad Sci U S A. 2018;115(37):E8652–9. pmid:30154163
- 32. Pan E, Tao F, Smorodina E, Zhang S. Structural bioinformatics studies of six human ABC transporters and their AlphaFold2-predicted water-soluble QTY variants. QRB Discov. 2024;5:e1. pmid:38577032
- 33. Abramson J, Adler J, Dunger J, Evans R, Green T, Pritzel A, et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature. 2024;630(8016):493–500. pmid:38718835
- 34. Skuhersky MA, Tao F, Qing R, Smorodina E, Jin D, Zhang S. Comparing Native Crystal Structures and AlphaFold2 Predicted Water-Soluble G Protein-Coupled Receptor QTY Variants. Life (Basel). 2021;11(12):1285.
- 35. Smorodina E, Tao F, Qing R, Jin D, Yang S, Zhang S. Comparing 2 crystal structures and 12 AlphaFold2-predicted human membrane glucose transporters and their water-soluble glutamine, threonine and tyrosine variants. QRB Discov. 2022;3:e5. pmid:37529287
- 36. Smorodina E, Diankin I, Tao F, Qing R, Yang S, Zhang S. Structural informatic study of determined and AlphaFold2 predicted molecular structures of 13 human solute carrier transporters and their water-soluble QTY variants. Sci Rep. 2022;12(1):20103. pmid:36418372
- 37. Karagöl T, Karagöl A, Zhang S. Structural bioinformatics studies of serotonin, dopamine and norepinephrine transporters and their AlphaFold2 predicted water-soluble QTY variants and uncovering the natural mutations of L->Q, I->T, F->Y and Q->L, T->I and Y->F. PLoS One. 2024;19(3):e0300340.
- 38. Karagöl A, Karagöl T, Smorodina E, Zhang S. Structural bioinformatics studies of glutamate transporters and their AlphaFold2 predicted water-soluble QTY variants and uncovering the natural mutations of L->Q, I->T, F->Y and Q->L, T->I and Y->F. PLoS One. 2024;19(4):e0289644.
- 39. Meng R, Hao S, Sun C, Hou Z, Hou Y, Wang L, et al. Reverse-QTY code design of active human serum albumin self-assembled amphiphilic nanoparticles for effective anti-tumor drug doxorubicin release in mice. Proc Natl Acad Sci U S A. 2023;120(21):e2220173120. pmid:37186820
- 40. Qing R, Xue M, Zhao J, Wu L, Breitwieser A, Smorodina E, et al. Scalable biomimetic sensing system with membrane receptor dual-monolayer probe and graphene transistor arrays. Sci Adv. 2023;9(29):eadf1402. pmid:37478177
- 41. Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596(7873):583–9. pmid:34265844
- 42. Chen E, Pan E, Zhang S. Structure Bioinformatics of Six Human Integral Transmembrane Enzymes and their AlphaFold3 Predicted Water-Soluble QTY Analogs: Insights into FACE1 and STEA4 Binding Mechanisms. Pharm Res. 2025;42(2):291–305. pmid:39966220
- 43. Guo H-B, Perminov A, Bekele S, Kedziora G, Farajollahi S, Varaljay V, et al. AlphaFold2 models indicate that protein sequence determines both structure and dynamics. Sci Rep. 2022;12(1):10696. pmid:35739160
- 44. Wu J-X, Liu R, Song K, Chen L. Structures of human dual oxidase 1 complex in low-calcium and high-calcium states. Nat Commun. 2021;12(1):155. pmid:33420071
- 45. Krokidis MG, Koumadorakis DE, Lazaros K, Ivantsik O, Exarchos TP, Vrahatis AG, et al. AlphaFold3: An Overview of Applications and Performance Insights. Int J Mol Sci. 2025;26(8):3671. pmid:40332289
- 46. Trivedi R, Nagarajaram HA. Intrinsically Disordered Proteins: An Overview. Int J Mol Sci. 2022;23(22):14050. pmid:36430530
- 47. Fox RO, Evans PA, Dobson CM. Multiple conformations of a protein demonstrated by magnetization transfer NMR spectroscopy. Nature. 1986;320(6058):192–4. pmid:3951556
- 48. Mariani V, Biasini M, Barbato A, Schwede T. lDDT: a local superposition-free score for comparing protein structures and models using distance difference tests. Bioinformatics. 2013;29(21):2722–8. pmid:23986568
- 49. Zheng W, Wuyun Q, Li Y, Liu Q, Zhou X, Peng C, et al. Deep-learning-based single-domain and multidomain protein structure prediction with D-I-TASSER. Nat Biotechnol. 2026;44(4):641–53. pmid:40410405
- 50. Zhang S, Egli M. Hiding in plain sight: three chemically distinct α-helix types. Q Rev Biophys. 2022;55:e7. pmid:35722863