Figures
Abstract
Nuclear receptor binding SET domain protein 2 (NSD2) significantly contributes to the development of cancer, making it a promising target for cancer drug discovery. This research explores natural compounds as potential selective inhibitors for NSD2 in cancer treatment. Employing a comprehensive in silico approach, the study utilized pharmacophore modeling, molecular docking, pharmacokinetic profiling, and molecular dynamics simulations. An e-pharmacophore model-based screening using the first selective and potent ligand bound to NSD2 identified 49,248 natural compounds from the SuperNatural 3.0 database (containing 449,008 molecules) with acceptable alignment with the developed pharmacophore hypotheses. Subsequently, molecular docking was executed to assess the standout compounds which led to the selection of ten candidates that surpassed the reference inhibitor in accordance w the binding affinity expressed as a G score. Ligand-residue interaction analyses of the top three hits (SN0450102, SN0410255, and SN0142336) revealed diverse crucial interactions with the NSD2 active site, including hydrogen bonds, pi-pi stacking, and hydrophobic contacts with key amino acid residues in the NSD2-PWWP1 domain. Pharmacokinetic profiling confirmed the drug-likability for the refined hits, indicating good cellular permeability and minimal blood-brain barrier penetration. Molecular dynamics simulations for 200 nanoseconds affirmed the stability of protein-ligand complexes, with minimal fluctuations in root mean square deviation and root mean square fluctuation analyses. Overall, this study identified promising natural compounds as potential pharmaceutical agents in the treatment of NSD2-associated cancers.
Citation: Mohamed GA, Abdallah HM, Sindi IA, Ibrahim SRM, Alzain AA (2024) Unveiling the potential of phytochemicals to inhibit nuclear receptor binding SET domain protein 2 for cancer: Pharmacophore screening, molecular docking, ADME properties, and molecular dynamics simulation investigations. PLoS ONE 19(8): e0308913. https://doi.org/10.1371/journal.pone.0308913
Editor: Chandrabose Selvaraj, Saveetha University - Poonamallee Campus: SIMATS Deemed University, INDIA
Received: March 13, 2024; Accepted: August 2, 2024; Published: August 20, 2024
Copyright: © 2024 Mohamed et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper.
Funding: The Deanship of Scientific Research (DSR) at King Abdulaziz University (KAU), Jeddah, Saudi Arabia has funded this project, under grant no. (RG-43-166-43). The funders had no role in study design, data collection and analysis, decision to publish.
Competing interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Introduction
Cancer is a major global health concern, with millions of new cases and deaths reported annually. In 2018, there were approximately 18.1 M newly diagnosed cases in addition to 9.6 M cancer-related deaths worldwide [1]. These numbers are anticipated to repeatedly increase, to culminate in an annual count of nearly 22 million new cases by 2030 [2, 3]. Both genetic and epigenetic alterations contribute to the initiation and progression of cancers. Epigenetic aberrations, unlike genetic alterations, can be reversed, allowing malignant cells to return to a lesser aggressive state. Epigenetic therapy i.e., targeting critical regulators of gene expression and chromatin integrity that are dysregulated in various human cancers, has emerged as an effective approach for both chemotherapy and chemoprevention of cancer [4].
The nuclear receptor-binding SET domain (NSD), a family of histone lysine methyltransferases, including NSD1, NSD2/WHSC1/MMSET, and NSD3/WHSC1L1, have been identified as potential therapeutic targets for cancer [5]. These enzymes perform vital functions in chromatin regulation and have been implicated in various cancers [6–10]. NSDs activation, including NSD1, NSD2, and NSD3 has been found to be tightly associated with the occurrence and progression of different cancer types [8]. Beside cancers, NSD2 is also linked with other diseases, such as inflammatory and autoimmune diseases. In this research, we focus in discovering NSD2 inhibitors for cancer, as it is the most directly related and extensively investigated disease linked with NSD2 abnormalities compared to other diseases.
NSD2, specifically, regulates chromatin through methylations of H3K36 and H4K20, although it also shows substrate specificity for other histone marks (H3K4, H3K9, H3K27, H3K79, and H4K20), indicating a complex and not fully understood regulatory mechanism [6, 11, 12]. NSD2 overexpression enhances cellular proliferation, affects chromosomal accessibility, and alters gene expression regulation [11]. It is involved in DNA damage repair, being recruited to double-strand breaks, and increasing H4K20 methylation levels at these sites [12]. The short isoform of NSD2, REIIBP, increases H3K27 methylation and grabs histone deacetylases [13]. REIIBP can additionally methylate H3K79, which was previously assumed to be exclusively methylated by DOT1L [14].
NSD2 overexpression is linked to the aggressiveness of tumors and is involved in over 20 types of cancer [14, 15]. It is frequently mutated in pediatric cancers and mantle cell lymphoma and it is highly expressed in various other cancer types [16–20]. NSD2 overexpression is a characteristic feature of the translocation in multiple myeloma. This translocation is linked with a poor prognosis for patients. Multiple myeloma itself accounts for 20% of deaths related to hematological malignancies which is currently with no cure [21]. Targeting NSD2, especially in multiple myeloma, holds potential for cancer therapy [15, 22]. Despite considering NSD2 as a promising target in cancer therapy, the development of selective small molecule inhibitors for NSD2 has been challenging [23, 24]. While some inhibitors have shown activity against NSD2, their selectivity against other NSD enzymes (NSD1 and NSD3) is uncertain [25, 26]. Existing inhibitors, such as MCTP39, Sinefungin, and PTD2, demonstrate inhibitory effects on NSD2 but lack selectivity for NSD1 and NSD3 [22, 27]. Inhibitors targeting G9a/G9a-like protein (GLP) H3K9 HMTase, like BIX-01294 and BIX-01338, also exhibit inhibitory activity against NSD1, NSD2, and NSD3 [28]. KTX-1001 as a new inhibitor NSD2 for the SET catalytic site, was advanced to a phase I clinical trial for the treatment of patients with recurrent and refractory MM. There is no detailed information on KTX-1001, including its structure and preclinical data [29]. Taken together, the development of selective inhibitors for each NSD enzyme, particularly NSD2, has yet to be accomplished and is urgently needed [30, 31].
The scientific literature demonstrates a strong interest in screening natural compounds for the development of medications, particularly in the field of cancer treatment [32]. Over 60% of current anticancer drugs are derived from natural sources or their derivatives [33]. Natural compounds are favored for their affordability, minimal side effects, and high tolerability by the body, making them potentially effective against medication resistance in cancer [34].
Advancements in technology have revolutionized the application of computational strategies in drug development. Currently, these approaches are frequently used at several phases of the drug development process, including hit recognition, lead optimization, and formulation design [35–41]. In the specific context of developing NSD2 inhibitors, a ligand-based virtual screening study was conducted using various computational techniques. The study employed a combination of computational methods to find fundamentally new NSD2 inhibitors including, pharmacophore modeling, molecular docking, and MD simulations.
Methods
All computational studies were carried out using maestro v 12.8 of Schrodinger and the academic Desmond v6.5 by D.E. Shaw Research for molecular dynamics.
Preparation of protein
The X-ray crystallographic structure of the NSD2 protein (PDB ID: 6G2O) was obtained from the open-access Protein Data Bank website and prepared using the protein preparation wizard provided by Schrödinger software [42]. The PDB file initially contains heavy atoms and may include water molecules, cofactors, activators, ligands, metal ions, and protein subunits. However, it lacks assigned ionization and tautomeric states, and there may be missing side chains. Additionally, information about bond order and atomic charges is absent. To address these structural issues, the protein preparation wizard was utilized. The protein structure was minimized using the OPLS4 force field, and various modifications were made, including adding hydrogen atoms, fixing residue charges, assigning bond orders, and creating disulfide bonds [43]. The flip-states of histidine residues were also adjusted. In the final step, the entire structure underwent energy minimization using the impref utility, which employs minimization cycles based on the OPLS4 force field and the impact molecular mechanics engine. The restrained minimization process continued until the RMSD value of heavy atoms reached a threshold of 0.3 Å relative to the starting X-ray structure. The resulting modified protein structure, obtained after the preparation steps, was then used for the computational studies.
E-pharmacophore generation
Structure-based pharmacophore design is an approach used to create a pharmacophore model based on the structural characteristics of the protein target and the bioactive conformation of a ligand that is co-crystallized with the protein. This method relies on the analysis of the stable active site of the protein target and the specific conformation and binding interactions observed in the X-ray crystallographic structure of the ligand-protein complex [44]. The pharmacophore modeling in this study was performed using the PHASE module of the Schrödinger suite [45]. The PHASE module generates pharmacophore hypotheses based on the features observed in the co-crystallized ligand. The pharmacophore skeleton utilized in this study consisted of six features: HB-donor (D), HB-acceptor (A), hydrophobic region (H), negatively charged region (N), positively charged region (P), and aromatic rings (R) [46]. The generated pharmacophore hypotheses were then ranked using scoring parameters and aligned against the co-crystallized ligand. This ranking process helps identify the most relevant pharmacophore hypotheses. Finally, the generated pharmacophore hypothesis was applied to a database to retrieve hits with higher potency than the reference ligand.
Screening of the SN.3 database
SN.3 natural compound database, consisting of approximately 449,008 compounds, was screened to hunt potential inhibitors for the NSD2 receptor. The compounds in the database were first minimized using the MacroModel module available in the Schrödinger suite. This minimization process ensures optimized molecular structures and conformations of the natural compounds. Following that, the minimized NPs database was screened utilizing the PHASE module’s sophisticated pharmacophore-screening function, based on the verified pharmacophore hypotheses. The screening process aims to find compounds that match the specified pharmacophore features and exhibit potential inhibitory activity against the NSD2 receptor. In this screening step, four out of the four pharmacophore features were designated as "must match," meaning that compounds must possess these features to be considered as hits.
Molecular docking
Molecular docking is a prevalent computational approach for predicting the affinity of binding for the protein and ligand complexes. In this study, the Glide module of the Schrödinger Suite was utilized for molecular docking to identify potential "hit" molecules obtained from the pharmacophore-based screening [47]. The prepared NSD2 protein structure served as the receptor for the docking process. A receptor grid was generated based on the conformation of the co-crystal ligand, which helped in the definition of the active amino acids within the grid for accurate docking [47]. The docking was done in a step-by-step manner using two modes: HTVS and XP. HTVS is a fast-screening process used for the preliminary docking of large databases. The top 100 HTVS scoring ligands were chosen and underwent XP docking, which provides more detailed and accurate results by considering precise interaction modes between the ligand and the protein. The docking parameters and settings involved adding the receptor grid file to the receptor grid box and the prepared ligands to the ligands tab. The first docking stage used HTVS, followed by XP precision in the subsequent stage [48]. Default parameters were employed for other parameters. The docking outcomes were analyzed and tabulated based on the G-scores, which indicate the binding affinity and represent the quality of the protein-ligand interactions.
Pharmacokinetic profile prediction
QikProp, a module within the Maestro program, is a tool used to calculate unique ADME-relevant descriptors [49]. It offers a novel approach for optimizing the pharmacokinetic profile of pharmaceutical compounds. Recognizing the importance of favorable pharmacokinetic features in successful drug discovery, ADME assessments have been integrated earlier into drug design strategies. In this study, QikProp was employed to establish correlations between 3D molecular structures and physicochemical and pharmacokinetic properties. The top 3 molecules were subjected to energy minimization to obtain their minimum energy conformations. ADME properties were then calculated based on these conformations.
Molecular dynamics
To refine the NSD2 receptor complexes with the top three lead molecules and the co-crystal ligand, MDs were performed using the Desmond software [50–52]. The consistency of their interactions was studied through these simulations. The process began by using the system builder application of the Desmond module. Default parameters were used, then the system was solvated with a TIP3P water model in an orthorhombic periodic boundary box. The box buffer size dimensions were set to 10 Å in each direction (a:10 × b:10 × c:10). To neutralize the system, Cl− ions were added based on the total charge of the model, along with a salt concentration of 0.15 M. The next step involved minimizing the model obtained from the system builder. The minimization application was used, with the maximum iterations set to 2000, and the remaining parameters were kept at their default values. The then system was subjected to the MD production phase. This phase of the MD simulation is divided into seven different stages as follow:
stage 1 –task.
stage 2—simulate, Brownian Dynamics NVT, T = 10 K, small timesteps, and restraints on solute heavy atoms, 100ps.
stage 3—simulate, NVT, T = 10 K, small timesteps, and restraints on solute heavy atoms, 12ps.
stage 4—simulate, NPT, T = 10 K, and restraints on solute heavy atoms, 12ps.
stage 5—simulate, NPT and restraints on solute heavy atoms, 12ps.
stage 6—simulate, NPT and no restraints, 24ps.
stage 7 –simulate.
The first six stages include the equilibrium phase and consist of short simulation steps. Step 7 is a last, long simulation stage. A total of 200 ns production stage was carried out.
Results and discussion
The workflow of this study is summarized in Fig 1.
Pharmacophore modeling
Depend on the targeted domain, NSD2 small molecules inhibitors are classified into three classes: catalytic SET domain of NSD2 (NSD2-SET) inhibitors, PHD domain of NSD2 (NSD2-PHD) inhibitors, and PWWP1 domain of NSD2 (NSD2-PWWP1) inhibitors. Drug discovery projects targeting the catalytic SET domain yielded little success. PPI domains, including PHD domains and PWWP domains (PWWP1 and PWWP2) could be clinically relevant and selectively blocked by compounds [29]. In 2021, to identify a selective NSD2 inhibitors targeting the NSD2-PWWP1 domain, molecular docking and experimental validation were performed [29]. A new scaffold was identified and provided the basis for further SAR studies. This research employed NSD2 structure for the generation of an e-pharmacophore model with a specific focus on the PWWP1 domain. The pharmacophore model was created based on the distinctive structural properties observed in the co-crystallized inhibitor (Fig 2). Consequently, this led to the identification of four pharmacophoric features (three-ring structures and a hydrogen bond acceptor). Accordingly, a library containing 449,008 natural compounds retrieved from the SN.3 database was screened against the generated pharmacophoric features. As a result, a total of 49,248 candidate molecules that possessed three-ring structures in addition to a hydrogen-accepting moiety have been filtered in as matching the pharmacophore hypotheses. Subsequently, to validate the results of the match, molecular docking was performed on the active site of the NSD2 domain. Fig 3 depicts the pharmacophoric features of the PWW1 binding domain.
(R8), (R9), (R11) doughnut-shaped circles denoting the aromatic ring structures and A1 denoting for the hydrogen bond acceptor.
Molecular docking analysis
Molecular docking is an integral part of computational drug discovery. Basically, it is introduced to study the interactions between a target entity and a binder in terms of bonding and non-bonding interactions. While molecular docking has a limitation of not accounting for the flexibility of receptors, it continues to be a valid method for providing initial insights [53, 54].
In Glide, the HTVS procedure was done on 49,248 compounds that matched the pharmacophore hypotheses. Afterward, the top 100 molecules of the pool were taken for subsequent extra-precision (XP) docking (docking scores: -9.061 to -7.164 kcal/mol, S1 Fig in S1 File), since the remaining molecules showed low docking scores < -6.00 kcal/mol. The XP mode docking is recognized for its increased accuracy and precision compared to other modes. This mode was employed for the initial screening of the compounds and consequent filtering that relies on their XP docking scores. The scoring function integrated into Glide combines empirical and force-field-based factors to compute binding energy, facilitating the filtration of the most optimal docking poses [47, 55]. In reference to Table 1, it can be clearly noticed that there are 10 compounds having docking scores better than that of the co-crystallized reference inhibitor. In this regard, the quantitative assessment of the binding is taken via docking score. The docking score reflects the binding affinity of the docked entities. Whereby, the more negative the docking score, the better the binding affinity and the merrier binding stability.
The co-crystallized inhibitor was identified as the first potent and selective NSD3-PWWP1 antagonist (IC50 = 0.2 μM), which can disrupt the interactions between NSD3-PWWP1 and H3K36me2, leading to a significant reduction of MYC mRNA levels and suppressing the proliferation of leukemia cell lines [56]. As in Table 1, it can be seen that this potent inhibitor presented a docking score of -9.355 kcal/mol. This in turn stands as a reference point for the docked library of compounds. Ten out of the top hundred molecules showed outstanding scores of dockings that surpass that of the potent inhibitors. The docking scores of the ten compounds ranged from -12.041 kcal/mol to -9.519 kcal/mol. As long as these top-ranked hits have superior binding affinities, they are anticipated to possess a better activity in further experiments. However, a more detailed explanation of their precise binding mechanism is warranted.
Ligand-residue interaction pattern analysis
According to the previously reported work of Ma et al. [29], in the crystal structure of the active domain of NSD2-PWWP1 (PDB ID: 6G2O), the recognized key amino acid residues are SER-314 and GLU-318. Encouragingly, the co-crystallized inhibitor that was docked in the active site domain reproduced the same reported interactions with the aforementioned key amino acid residues. This is consistent with the previously reported binding mode [29]. For the purpose of taking the compounds for further analysis, the top-scoring three compounds will be discussed and taken as representatives due to the high computational cost of handling a large number of compounds. The focus on the details of the interacting amino acid residues and their corresponding parts in contact with the ligands demonstrates the nature of the interactions. To attain this, the ligand interactions diagram panel of the Maestro interface was utilized to display the two-dimensional interactions in Fig 4. Likewise, Fig 5 depicts the three-dimensional interactions of the docked compounds on the active site pocket showing the electrostatic potential map.
According to Fig 3, the co-crystallized reference inhibitor showed the ability to constitute direct hydrogen bonds to SER-314 (1.95 Å), the key amino acid of NSD2, and a slat bridge with the other key amino acid GLU-318 (3.26 Å). Addedly, a form of hydrophobic interaction known as pi-pi stacking was also presented between the ring structures of the reference compound and the amino acid residues TRP-284, TYR-281, and PHE-312.
The first hit, SN0450102 which is extracted from the plant Curculigo sinensis [57] and is chemically known as 8-(3,4-dihydroxybenzoyl)naphthalene-2,3,6-triol showcased interactions of different types. Particularly, it presented a direct hydrogen bond to the amino acid residue SER-367 (1.88 Å) together with indirect water-mediated hydrogen bonds to GLU-364 (3.93 Å) and GLN-316 (4.17 Å). Additionally, it can be seen that pi-pi stackings are also occurring with TRP-284.
The second hit, SN0410255 is reported to be produced by Molineria crassifolia and is chemically identified as 2,5-bis(3,4-dihydroxyphenyl)furan-3-carbaldehyde [57]. This molecule also exhibited a direct hydrogen bond to the essential amino acid residue SER-314 (2.14 Å) and another hydrogen bond to GLN-367 (1.8 Å). Water-mediated hydrogen bonds were also prevalent with GLN-316 (4 Å) and ARG-319 (3.56 Å). The pi-pi stacking was apparent with TRP-284 and PHE-312 as a sort of interaction that strengthened the binding to the active pocket.
The third hit, SN0142336, an extract of the plant Aquilegia ecalcarata [58] that is chemically recognized as 7-hydroxy-4-[5-(hydroxymethyl)furan-2-yl]-1,2-dihydroquinolin-2-one, constituted a direct hydrogen bonding to the key amino acid residue SER-314 (1.96 Å). Two more hydrogen bonds to GLN-367 (1.7 Å) and GLN-316 (1.59 Å) were also noticed. In addition, water-bridged hydrogen bonds were also seen with GLU-364 (4.4 Å). Besides, multiple pi-pi stackings with TYR-281, TRP-284, and PHE-312 were also seen.
In summary, as the docking process is known to provide meaningful glimpses into the binding mode, the affinity of binding, and the receptor-ligand interaction pattern, the docking results of these three hits disclosed evaluable information about their binding affinity and how they behave with respect to the constitutional amino acid residues inside NSD2 receptor cavity.
Pharmacokinetic profiling
Prior to subjecting the highest-ranked hits to additional computational evaluation, it was imperative to address the critical issue of pharmacokinetic profiling. Consequently, the QikProp tool embedded in Maestro was employed to assess the key pharmacokinetic parameters that play a fundamental role in determining the drug-like characteristics of a molecular entity. This goal is achieved through the comparison of the molecular properties against the criteria set by Lipinski’s rule of five. In accordance with Lipinski’s rule, a molecule is considered pharmaceutically viable if it adheres to the following criteria: a molecular weight not surpassing 500 Da, a count of hydrogen bond donors limited to five (donorHB ≤ 5), a restriction of hydrogen bond acceptors to not more than ten (acceptorHB ≤ 10), and a predicted octanol/water partition coefficient (QPlogPo/w) below 5 [59, 60]. In Table 2, it can be noted that all the hit molecules satisfied the rule of five criteria with no violation. Moreover, the solubility parameter (QPlogS) was within the accepted reference range (-6.5˗0.5) for the three hit compounds. Furthermore, two parameters associated with the potential for cellular membrane penetration underwent a thorough examination. The first is QPlogBB, which serves as an indicator of blood-brain barrier (BBB) permeability, and the second is QPPCaco-2, reflecting cell membrane permeability. It is noteworthy that all the hit compounds under investigation exhibited favorable characteristics in terms of good cellular permeability while concurrently demonstrating limited capacity to traverse the BBB. Besides, a crucial parameter, QPlog HERG, responsible for predicting IC50 values associated with the inhibition of HERG K+ channels, as outlined in Table 2, indicated that none of the compounds showed signs of cardiotoxicity. Collectively, the examination of all the pharmacokinetic parameters under study suggests that the candidate molecules hold promise as potential pharmaceutical agents.
Molecular dynamics (MD) simulation analysis
Molecular dynamics simulations evaluate the time-based stability of the simulated protein, considering the impact of bound ligands throughout the duration of the simulation. This approach aids in the refinement and validation of docking outcomes by assessing the dependability of predicted binding modes and discerning compounds that are unable to sustain stable interactions. By accounting for variables such as temperature and solvent effects, MD simulations offer valuable insights into the stability, flexibility, and conformational changes of the protein-ligand complexes. Herein, the complexes of the top three docked hits and the co-crystallized reference inhibitor were simulated for a duration of 200 nanoseconds on the Desmond package. The aim of the simulation was to further consolidate the matter of the suitability of the docked hits as inhibitors of the NSD2 pathway and by default assess their reliability as new anticancer candidates. For this purpose, the trajectories of MD were taken for extensive analysis by virtue of the essential statistical parameters.
Importantly, root mean square deviation (RMSD) stands as a crucial metric, employed to evaluate the stability and integrity of the protein’s structure. Upon binding to compounds at its active site, a protein may experience structural alterations that impact its conformational stability. RMSD fundamentally quantifies the square root of the average deviations from the mean distances within the complex formed by the protein and ligand. The examination of molecular dynamics trajectories, coupled with the analysis of RMSD plots, offers an insightful preliminary assessment of the stability of the simulated protein-ligand complexes as previously highlighted [61]. In the case of a globular protein, ensuring stability within a simulated system generally involves RMSD fluctuations within a span of 1 to 3 Å. Notably, as long as the RMSD value attains a consistent level of fluctuation, it can be reasonably considered indicative of a converged system [61]. The co-crystallized reference inhibitor presented a relatively stable plot of fluctuations throughout the duration of the simulation as illustrated in Fig 6. Comparatively, for the hit molecules and the reference under study, the plots of RMSD throughout the 200 ns seemed to be equilibrated after 80 ns. They showed similar fluctuation patterns with an average protein RMSD of 4.81 Å. The hits SN0450102, SN0410255, SN0142336 and the reference displayed ligand RMSD values of 4.54, 5.06, 5.60, and 4.08 Å, respectively. SN0450102 and SN0410255 exhibited comparable fluctuations with the reference. This strongly suggests that they possess an equivalent binding pattern and stability. Another crucial metric for the evaluation of the flexibility of proteins during molecular dynamics simulations is the root mean square fluctuation (RMSF). This property offers valuable insights into the variations observed in specific amino acid residues, facilitating an assessment of how ligand binding influences the overall stability of the protein structure [62]. For the hit molecules in this study, the average RMSF value of the co-crystallized reference inhibitor and the hits, SN0450102, SN0410255, and SN0142336, was 1.37 Å. In general, as observed in Fig 6, the patterns observed in root mean square fluctuation indicate that the simulated protein-ligand complexes exhibit a degree of stability, characterized by minimal fluctuations in the crucial amino acid residues as in Fig 7. The protein-ligand contact histogram in Fig 8 indicates the types of different interactions that were formed throughout the simulation time. Taken as a reference, the co-crystallized inhibitor demonstrated hydrogen bonds to GLU-318 (76%) and ARG-319 (11%) alongside a water bridge with ARG-319 7%. Hydrophobic contacts were also seen with VAL-278 (12%), and ARG-363 (26%).
As per the histogram of SN0450102, it is noticed to engage in water-mediated hydrogen bonds with multiple amino acids: TYR-281 (15%), ARG-360 (5%), GLU-364 (40%), and GLN-367 (10%). Additionally, it did form hydrophobic contacts and ionic interactions with TYR-281 for 4% of the simulation time. The hit SN0410255 formed direct hydrogen bonds with the key amino acids SER-314 (4%) and GLN-367 (19%). Moreover, it also formed an indirect water-bridged hydrogen bond for 5% of the simulation time with LYS-277, GLY-279, ASN-315, and ARG-319. Likewise, it presented a water bridge with SER-314 (21%) and GLN-316 (29%). The hit SN0142336 formed direct hydrogen bonds with GLN-316 (74%) and GLU-318 (98%) together with bridged hydrogen bonds with GLN-316, GLU-318, ARG-363, and PRO-317.
In comparing the whole contacts, the hits SN0450102, SN0410255, and SN0142336 showcased diverse binding mechanisms. Hit SN0450102 relies on water-mediated hydrogen bonds and hydrophobic contacts, while hit SN0410255 demonstrates specificity through direct and water-bridged hydrogen bonds. Hit SN0142336 emphasizes direct hydrogen bonds with a pronounced affinity for GLN-316 and GLU-318. The reference inhibitor shares similarities with the hit SN0142336 but displays a broader hydrophobic interaction profile. Understanding these binding modes is critical for designing targeted inhibitors with optimal therapeutic efficacy.
Several ligand properties were studied, including Ligand RMSD, rGyr, MolSA, and SASA, as depicted in Fig 9. The rGyr (radius of gyration) is considered a significant marker of protein compactness. The average rGyr values of SN0450102, SN0410255, SN0142336, and the reference were 4.52 ± 0.036, 4.97 ± 0.04, 4.31 ± 0.04, and 5.15 ± 0.04 Å, respectively. To study the protein’s exposure to solvent molecules and its structural stability, three surface areas were examined, including MolSA (molecular surface area), SASA (solvent-accessible surface area), and PSA (polar surface area). For SN0450102, the MolSA, SASA, and PSA average values were 289.87 ± 1.3199, 416.95 ± 18.63, 280.028 ± 2.47 Å2, respectively. In the case of SN0410255, the corresponding average values were 326.13 ± 1.24 Å2 for MolSA, 400.60 ± 18.08 Å2 for SASA, and 261.93 ± 2.30 Å2 for PSA. For SN0142336, the averages were 276.35 ± 1.33 Å2 for MolSA, 239.81 ± 28.64 Å2 for SASA, and 190.45 ± 2.95 Å2 for PSA. Finally, for the reference, the averages were 423.51 ± 1.61 Å2 for MolSA, 487.75 ± 35.35 Å2 for SASA, and 92.27 ± 1.86 Å2 for PSA.
It can be recapped from these observations that these three compounds offer unique opportunity for NSD2 repression since they tackle influential amino acid residues in the PWWP1 domain with outstanding binding affinities, interaction stability, and oral bioavailability. Moreover, being natural promises a good safety profile.
The binding affinity and binding mode of the top compounds with NDS2 can further be investigated experimentally to ascertain its biological fitness using differential static light scattering (DSLS), surface plasmon resonance (SPR) assay and reverse ITC titration. Ligand based drug design such as QSAR studies is a very convenient approach to accelerate the development of compounds by studying the large number of molecules that interact with the biological target of interest [63]. These studies can be done to further explore the structures activity relationship of these NSD2 inhibitors. The selectivity of these compounds toward other NDS isoforms (NDS1 and NDS3) which also linked to cancer can be explored for therapeutic development against cancer. Combinatorial studies of these potential NSD2 inhibitors with other therapies such as EZH2 inhibitors can be studied for effective cancer treatment.
Conclusion
NSD2 mal-expression has been linked with cancer development and proliferation, therefore it gained tremendous attention as a drug target for cancer management, especially in cases of AML. This study explored SuperNatural 3.0 utilizing a multi-faceted approach that included pharmacophore modeling, molecular docking, and molecular dynamics simulations to identify potential inhibitors for NSD2. After rigorous computational analysis, three leads promised superior docking scores i.e., binding the reference inhibitor. Ligand-residue interaction analysis highlighted diverse binding mechanisms, while pharmacokinetic profiling indicated drug-like characteristics. Molecular dynamics simulations over 200 nanoseconds confirmed the stability of the observed interactions between NSD2 and the selected hits. Overall, these compounds show potential as anti-cancer agents targeting NSD2, Nonetheless, further experimental validation is warranted to affirm these conclusions.
Supporting information
S1 File. The online version contains supplementary material available at xxx.
S1 Fig: The histogram shows the distribution of HTVS docking scores for the top 100 compounds selected for the XP molecular docking.; S1 Table: HTVS dockings scores for the top 100 compounds selected for the XP molecular docking.
https://doi.org/10.1371/journal.pone.0308913.s001
(DOCX)
References
- 1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J Clin. 2021;71(3):209–49. pmid:33538338
- 2. Bray F, Jemal A, Torre LA, Forman D, Vineis P. Long-term Realism and Cost-effectiveness: Primary Prevention in Combatting Cancer and Associated Inequalities Worldwide. J Natl Cancer Inst. 2015 Dec;107(12):djv273. pmid:26424777
- 3.
Bray F, Soerjomataram I. The Changing Global Burden of Cancer: Transitions in Human Development and Implications for Cancer Prevention and Control. In: Gelband H, Jha P, Sankaranarayanan R, Horton S, editors. Disease Control Priorities, Third Edition (Volume 3): Cancer. Washington (DC); 2015. p. 23–44.
- 4. Ferreira de Freitas R, Liu Y, Szewczyk MM, Mehta N, Li F, McLeod D, et al. Discovery of Small-Molecule Antagonists of the PWWP Domain of NSD2. J Med Chem. 2021 Feb;64(3):1584–92. pmid:33522809
- 5. Bennett RL, Swaroop A, Troche C, Licht JD. The Role of Nuclear Receptor-Binding SET Domain Family Histone Lysine Methyltransferases in Cancer. Cold Spring Harb Perspect Med. 2017 Jun;7(6). pmid:28193767
- 6. Morishita M, Mevius D, di Luccio E. In vitro histone lysine methylation by NSD1, NSD2/MMSET/WHSC1 and NSD3/WHSC1L. BMC Struct Biol. 2014;14(1):25. pmid:25494638
- 7. di Luccio E, Koehl P. The H-factor as a novel quality metric for homology modeling. J Clin Bioinforma. 2012;2(1):18. pmid:23121764
- 8. Morishita M, di Luccio E. Structural insights into the regulation and the recognition of histone marks by the SET domain of NSD1. Biochem Biophys Res Commun. 2011;412(2):214–9. pmid:21806967
- 9. Morishita M, di Luccio E. Cancers and the NSD family of histone lysine methyltransferases. Biochim Biophys Acta—Rev Cancer. 2011;1816(2):158–63. pmid:21664949
- 10. di Luccio E. Inhibition of Nuclear Receptor Binding SET Domain 2/Multiple Myeloma SET Domain by LEM-06 Implication for Epigenetic Cancer Therapies. J cancer Prev. 2015 Jun;20(2):113–20. pmid:26151044
- 11. Martinez-Garcia E, Popovic R, Min DJ, Sweet SMM, Thomas PM, Zamdborg L, et al. The MMSET histone methyl transferase switches global histone methylation and alters gene expression in t(4;14) multiple myeloma cells. Blood. 2011;117(1):211–20. pmid:20974671
- 12. Pei H, Zhang L, Luo K, Qin Y, Chesi M, Fei F, et al. MMSET regulates histone H4K20 methylation and 53BP1 accumulation at DNA damage sites. Nature. 2011;470(7332):124–8. pmid:21293379
- 13. Kim JY, Kee HJ, Choe NW, Kim SM, Eom GH, Baek HJ, et al. Multiple-myeloma-related WHSC1/MMSET isoform RE-IIBP is a histone methyltransferase with transcriptional repression activity. Mol Cell Biol. 2008 Mar;28(6):2023–34. pmid:18172012
- 14. Woo Park J, Kim KB, Kim JY, Chae YC, Jeong OS, Seo SB. RE-IIBP Methylates H3K79 and Induces MEIS1-mediated Apoptosis via H2BK120 Ubiquitination by RNF20. Sci Rep. 2015 Jul;5:12485. pmid:26206755
- 15. Vougiouklakis T, Hamamoto R, Nakamura Y, Saloura V. The NSD family of protein methyltransferases in human cancer. Epigenomics. 2015 Aug;7(5):863–74. pmid:25942451
- 16. Hudlebusch HR, Santoni-Rugiu E, Simon R, Ralfkiær E, Rossing HH, Johansen JV, et al. The histone methyltransferase and putative oncoprotein MMSET is overexpressed in a large variety of human tumors. Clin cancer Res an Off J Am Assoc Cancer Res. 2011 May;17(9):2919–33.
- 17. Saloura V, Cho HS, Kiyotani K, Alachkar H, Zuo Z, Nakakido M, et al. WHSC1 promotes oncogenesis through regulation of NIMA-related kinase-7 in squamous cell carcinoma of the head and neck. Mol Cancer Res. 2015 Feb;13(2):293–304. pmid:25280969
- 18. Li J, Yin C, Okamoto H, Mushlin H, Balgley BM, Lee CS, et al. Identification of a novel proliferation-related protein, WHSC1 4a, in human gliomas. Neuro Oncol. 2008 Feb;10(1):45–51. pmid:18182627
- 19. Kassambara A, Klein B, Moreaux J. MMSET is overexpressed in cancers: link with tumor aggressiveness. Biochem Biophys Res Commun. 2009 Feb;379(4):840–5. pmid:19121287
- 20. Huether R, Dong L, Chen X, Wu G, Parker M, Wei L, et al. The landscape of somatic mutations in epigenetic regulators across 1,000 paediatric cancer genomes. Nat Commun. 2014 Apr;5:3630. pmid:24710217
- 21. Beà S, Valdés-Mas R, Navarro A, Salaverria I, Martín-Garcia D, Jares P, et al. Landscape of somatic mutations and clonal evolution in mantle cell lymphoma. Proc Natl Acad Sci U S A. 2013 Nov;110(45):18250–5. pmid:24145436
- 22. Morrison MJ, Boriack-Sjodin PA, Swinger KK, Wigle TJ, Sadalge D, Kuntz KW, et al. Identification of a peptide inhibitor for the histone methyltransferase WHSC1. PLoS One. 2018;13(5):e0197082. pmid:29742153
- 23. McGrath J, Trojer P. Targeting histone lysine methylation in cancer. Pharmacol Ther. 2015 Jun;150:1–22. pmid:25578037
- 24. Liu Q, Wang MW. Histone lysine methyltransferases as anti-cancer targets for drug discovery. Acta Pharmacol Sin. 2016 Sep;37(10):1273–80. pmid:27397541
- 25. Chesi M, Nardini E, Lim RS, Smith KD, Kuehl WM, Bergsagel PL. The t(4;14) translocation in myeloma dysregulates both FGFR3 and a novel gene, MMSET, resulting in IgH/MMSET hybrid transcripts. Blood. 1998 Nov;92(9):3025–34. pmid:9787135
- 26. Arrowsmith CH, Audia JE, Austin C, Baell J, Bennett J, Blagg J, et al. The promise and peril of chemical probes. Nat Chem Biol. 2015 Aug;11(8):536–41. pmid:26196764
- 27. Tisi D, Chiarparin E, Tamanini E, Pathuri P, Coyle JE, Hold A, et al. Structure of the Epigenetic Oncogene MMSET and Inhibition by N-Alkyl Sinefungin Derivatives. ACS Chem Biol. 2016 Nov;11(11):3093–105. pmid:27571355
- 28. Chang Y, Zhang X, Horton JR, Upadhyay AK, Spannhoff A, Liu J, et al. Structural basis for G9a-like protein lysine methyltransferase inhibition by BIX-01294. Nat Struct Mol Biol. 2009 Mar;16(3):312–7. pmid:19219047
- 29.
Ma Z, Bolinger AA, Chen H, Zhou J. Drug Discovery Targeting Nuclear Receptor Binding SET Domain Protein 2 (NSD2). Vol. 66, Journal of Medicinal Chemistry. American Chemical Society; 2023. p. 10991–1026.
- 30. Cain C. NSD2 momentum. Sci Exch. 2013;6(39):1083.
- 31. Coussens NP, Kales SC, Henderson MJ, Lee OW, Horiuchi KY, et al. High-throughput screening with nucleosome substrate identifies small-molecule inhibitors of the human histone lysine methyltransferase NSD2. J Biol Chem. 2018 Aug;293(35):13750–65. pmid:29945974
- 32. Yan XJ, Gong LH, Zheng FY, Cheng KJ, Chen ZS, Shi Z. Triterpenoids as reversal agents for anticancer drug resistance treatment. Drug Discov Today. 2014 Apr;19(4):482–8. pmid:23954181
- 33. Carocho M, Ferreira ICFR. The role of phenolic compounds in the fight against cancer—a review. Anticancer Agents Med Chem. 2013 Oct;13(8):1236–58. pmid:23796249
- 34. Vahedi F, Fathi Najafi M, Bozari K. Evaluation of inhibitory effect and apoptosis induction of Zyzyphus Jujube on tumor cell lines, an in vitro preliminary study. Cytotechnology. 2008 Feb;56(2):105–11. pmid:19002848
- 35. Arora R, Issar U, Kakkar R. Identification of novel urease inhibitors: pharmacophore modeling, virtual screening and molecular docking studies. J Biomol Struct Dyn. 2019 Oct;37(16):4312–26. pmid:30580662
- 36. Iqbal S, Anantha Krishnan D, Gunasekaran K. Identification of potential PKC inhibitors through pharmacophore designing, 3D-QSAR and molecular dynamics simulations targeting Alzheimer’s disease. J Biomol Struct Dyn. 2018 Nov;36(15):4029–44. pmid:29182053
- 37. Hu Y, Zhou L, Zhu X, Dai D, Bao Y, Qiu Y. Pharmacophore modeling, multiple docking, and molecular dynamics studies on Wee1 kinase inhibitors. J Biomol Struct Dyn. 2019 Jul;37(10):2703–15. pmid:30052133
- 38. Ramakrishnan C, Mary Thangakani A, Velmurugan D, Anantha Krishnan D, Sekijima M, Akiyama Y, et al. Identification of type I and type II inhibitors of c-Yes kinase using in silico and experimental techniques. J Biomol Struct Dyn. 2018 May;36(6):1566–76. pmid:28589758
- 39. Sakthivel S, Habeeb SKM. Combined pharmacophore, virtual screening and molecular dynamics studies to identify Bruton’s tyrosine kinase inhibitors. J Biomol Struct Dyn. 2018 Dec;36(16):4320–37. pmid:29293382
- 40.
Monteagudo MC, González-Díaz H. New Experimental and Computational Tools for Drug Discovery: Medicinal Chemistry, Molecular Docking, and Machine Learning—Part-VI. Vol. 18, Current topics in medicinal chemistry. United Arab Emirates; 2018. p. 2325–6.
- 41. Mehta CH, Narayan R, Nayak UY. Computational modeling for formulation design. Drug Discov Today. 2019;24(3):781–8. pmid:30502513
- 42. Madhavi Sastry G, Adzhigirey M, Day T, Annabhimoju R, Sherman W. Protein and ligand preparation: Parameters, protocols, and influence on virtual screening enrichments. J Comput Aided Mol Des. 2013;27(3):221–34. pmid:23579614
- 43. Jorgensen WL, Tirado-Rives J. The OPLS [optimized potentials for liquid simulations] potential functions for proteins, energy minimizations for crystals of cyclic peptides and crambin. J Am Chem Soc. 1988 Mar;110(6):1657–66. pmid:27557051
- 44. Rella M, Rushworth CA, Guy JL, Turner AJ, Langer T, Jackson RM. Structure-based pharmacophore design and virtual screening for novel angiotensin converting enzyme 2 inhibitors. J Chem Inf Model. 2006;46(2):708–16. pmid:16563001
- 45. Salam NK, Nuti R, Sherman W. Novel method for generating structure-based pharmacophores using energetic analysis. J Chem Inf Model. 2009 Oct;49(10):2356–68. pmid:19761201
- 46. Dixon SL, Smondyrev AM, Rao SN. PHASE: A novel approach to pharmacophore modeling and 3D database searching. Chem Biol Drug Des. 2006 May;67(5):370–2. pmid:16784462
- 47. Halgren TA, Murphy RB, Friesner RA, Beard HS, Frye LL, Pollard WT, et al. Glide: A New Approach for Rapid, Accurate Docking and Scoring. 2. Enrichment Factors in Database Screening. J Med Chem. 2004;47(7):1750–9. pmid:15027866
- 48. Friesner RA, Banks JL, Murphy RB, Halgren TA, Klicic JJ, Mainz DT, et al. Glide: A New Approach for Rapid, Accurate Docking and Scoring. 1. Method and Assessment of Docking Accuracy. J Med Chem. 2004 Mar;47(7):1739–49. pmid:15027865
- 49.
QikProp, Schrödinger, LLC, New York, NY, 2020.
- 50. Alzain AA, Elbadwi FA, Alsamani FO. Discovery of novel TMPRSS2 inhibitors for COVID-19 using in silico fragment-based drug design, molecular docking, molecular dynamics, and quantum mechanics studies. Informatics Med Unlocked. 2022 Jan 1;29:100870. pmid:35128036
- 51. Elbadwi FA, Khairy EA, Alsamani FO, Mahadi MA, Abdalrahman SE, Alsharf Z, et al. Identification of novel transmembrane Protease Serine Type 2 drug candidates for COVID-19 using computational studies. Informatics Med Unlocked. 2021;26:100725. pmid:34514079
- 52. Ash J, Fourches D. Characterizing the Chemical Space of ERK2 Kinase Inhibitors Using Descriptors Computed from Molecular Dynamics Trajectories. J Chem Inf Model. 2017;57(6):1286–99. pmid:28471171
- 53. Meng XY, Zhang HX, Mezei M, Cui M. Molecular Docking: A Powerful Approach for Structure-Based Drug Discovery. Curr Comput Aided-Drug Des. 2012;7(2):146–57.
- 54. Romano T. Kroemer . Structure-Based Drug Design: Docking and Scoring. Curr Protein Pept Sci. 2007 Jul;8(4):312–28. pmid:17696866
- 55. Friesner RA, Murphy RB, Repasky MP, Frye LL, Greenwood JR, Halgren TA, et al. Extra precision glide: Docking and scoring incorporating a model of hydrophobic enclosure for protein-ligand complexes. J Med Chem. 2006;49(21):6177–96. pmid:17034125
- 56. Böttcher J, Dilworth D, Reiser U, Neumüller RA, Schleicher M, Petronczki M, et al. Fragment-based discovery of a chemical probe for the PWWP1 domain of NSD3. Nat Chem Biol. 2019 Aug;15(8):822–9. pmid:31285596
- 57. Li N, Chen JJ, Zhou J. Four New Phenolic Compounds from Curculigo crassifolia (Hypoxidaceae). Helv Chim Acta. 2004 Apr;87(4):845–50.
- 58. Chen SB, Gao GY, Li YS, Yu SC, Xiao PG. Cytotoxic constituents from Aquilegia ecalcarata. Planta Med. 2002 Jun;68(6):554–6. pmid:12094305
- 59. Lipinski CA, Lombardo F, Dominy BW, Feeney PJ. Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev. 2001 Mar;46(1–3):3–26. pmid:11259830
- 60. Jorgensen WL, Duffy EM. Prediction of drug solubility from structure. Adv Drug Deliv Rev. 2002 Mar;54(3):355–66. pmid:11922952
- 61. Sargsyan K, Grauffel C, Lim C. How Molecular Size Impacts RMSD Applications in Molecular Dynamics Simulations. J Chem Theory Comput. 2017 Apr;13(4):1518–24. pmid:28267328
- 62. Shoaib TH, Ibraheem W, Abdelrahman M, Osman W, Sherif AE, Ashour A, et al. Exploring the potential of approved drugs for triple-negative breast cancer treatment by targeting casein kinase 2: Insights from computational studies. Eldehna WM, editor. PLoS One. 2023 Aug;18(8):e0289887. pmid:37578958
- 63. Shashank SM, Venkatachalam T, Chandra SS, Hemendra PS, Sourav K, Neeraj K. Computational investigation of binding mechanism of substituted pyrazinones targeting corticotropin releasing factor-1 receptor deliberated for anti-depressant drug design. J Biomol Struct Dyn. 2019.