Skip to main content
Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Identification of a peptide inhibitor for the histone methyltransferase WHSC1

  • Michael J. Morrison ,

    Roles Conceptualization, Formal analysis, Investigation, Writing – original draft, Writing – review & editing

    Affiliation Epizyme Inc., Cambridge, Massachusetts, United States of America

  • P. Ann Boriack-Sjodin,

    Roles Conceptualization, Formal analysis, Investigation, Writing – original draft

    Affiliation Epizyme Inc., Cambridge, Massachusetts, United States of America

  • Kerren K. Swinger,

    Roles Conceptualization, Data curation, Formal analysis, Writing – original draft

    Affiliation Epizyme Inc., Cambridge, Massachusetts, United States of America

  • Tim J. Wigle,

    Roles Conceptualization, Investigation, Supervision

    Affiliation Epizyme Inc., Cambridge, Massachusetts, United States of America

  • Dipti Sadalge,

    Roles Data curation, Formal analysis, Investigation, Visualization

    Affiliation Epizyme Inc., Cambridge, Massachusetts, United States of America

  • Kevin W. Kuntz,

    Roles Conceptualization, Supervision

    Affiliation Epizyme Inc., Cambridge, Massachusetts, United States of America

  • Margaret Porter Scott,

    Roles Supervision

    Affiliation Epizyme Inc., Cambridge, Massachusetts, United States of America

  • William P. Janzen,

    Roles Supervision

    Affiliation Epizyme Inc., Cambridge, Massachusetts, United States of America

  • Richard Chesworth,

    Roles Supervision

    Affiliation Epizyme Inc., Cambridge, Massachusetts, United States of America

  • Kenneth W. Duncan,

    Roles Formal analysis, Investigation, Supervision

    Affiliation Epizyme Inc., Cambridge, Massachusetts, United States of America

  • Darren M. Harvey,

    Roles Investigation, Supervision, Visualization, Writing – review & editing

    Affiliation Epizyme Inc., Cambridge, Massachusetts, United States of America

  • John W. Lampe,

    Roles Formal analysis, Investigation, Visualization

    Affiliation Epizyme Inc., Cambridge, Massachusetts, United States of America

  • Lorna H. Mitchell,

    Roles Supervision

    Affiliation Epizyme Inc., Cambridge, Massachusetts, United States of America

  • Robert A. Copeland

    Roles Supervision

    Affiliation Epizyme Inc., Cambridge, Massachusetts, United States of America


WHSC1 is a histone methyltransferase that is responsible for mono- and dimethylation of lysine 36 on histone H3 and has been implicated as a driver in a variety of hematological and solid tumors. Currently, there is a complete lack of validated chemical matter for this important drug discovery target. Herein we report on the first fully validated WHSC1 inhibitor, PTD2, a norleucine-containing peptide derived from the histone H4 sequence. This peptide exhibits micromolar affinity towards WHSC1 in biochemical and biophysical assays. Furthermore, a crystal structure was solved with the peptide in complex with SAM and the SET domain of WHSC1L1. This inhibitor is an important first step in creating potent, selective WHSC1 tool compounds for the purposes of understanding the complex biology in relation to human disease.


Epigenetic mechanisms, including post-translational modifications of histones within chromatin, regulate gene transcription in normal cells [1]. It is therefore unsurprising that dysregulation of epigenetic pathways can lead to a variety of human diseases, including cancer, metabolic disease, neurological disorders, and inflammation [2]. Protein methyltransferases are enzymes that add covalent methyl groups to arginine or lysine side chains of proteins employing S-adenosyl-L-methionine (SAM) as a cofactor. The transfer of this methyl group can result in a mono-, di-, or trimethylated lysine or conversely, a mono- or dimethylated (symmetric or asymmetric) arginine. Methylation of histones has been shown to act as both a transcriptional activator and repressor making methyltransferases attractive targets for drug discovery [3].

The NSD (Nuclear receptor SET Domain containing) family of histone methyltransferases consists of three members: NSD1, WHSC1 (also known as NSD2 or MMSET), and WHSC1L1 (also known as NSD3). These three enzymes mono- and dimethylate lysine 36 on histone H3 (H3K36) [4, 5]. The NSD family members are large, multi-domain enzymes that, in addition to the catalytic SET (Su(var), E(z) and Trithorax) domain, include known epigenetic reader domains such as PHD and PWWP domains, and these reader domains likely contribute to chromatin binding [6].

Fusion proteins with the NSD family enzymes have been implicated in several cancers. NUP98-NSD1, the fusion protein resulting from the t(5;11)(q35;p15.5) translocation, has been seen in acute myeloid leukemias [7, 8]. The t(4;14)(p16;q32) translocation puts the WHSC1 gene under the control of the IgH intronic Eu (mu) enhancer resulting in elevated WHSC1 protein expression and increased dimethylation at H3K36 [911]. Translocations at t(8;11)(p11.2;p15) create a NUP98-WHSC1L1 fusion seen in acute myeloid leukemia [12]. Additionally, upregulation of WHSC1 has been linked to several cancers including neuroblastoma [13], gliomas [14], and several others including bladder cancer [13] and has also been linked to increased tumor aggressiveness [15]. Furthermore, a gain-of-function WHSC1 point mutation (E1099K) has also been identified in pediatric acute lymphoblastic leukemia cell lines resulting in a dependency on mutant WHSC1 activity [16, 17]. Amplification of WHSC1L1 has been seen in breast cancer where knockdown modulated the growth and survival of these cells [18].

Given the evidence linking NSD family members and oncology, there is great interest in developing potent and selective inhibitors for one or more members of the NSD family. Structure-based methods have been extremely useful in the development of several inhibitors to histone methyltransferases, such as DOT1L [1923], EHMT1/2 (G9a/GLP) [2430], and SMYD2/3 [3137] and would be valuable for the NSD family as well. Autoinhibited structures of the NSD1 [38] and WHSC1 [39] SET domains have been solved along with a structure of the WHSC1L1 SET domain in an open loop conformation (PDB code = 5UPD, Structural Genomics Consortium). All of these structures have been solved in the presence of the substrate SAM, but to date no structure of an NSD protein has been solved in the presence of an enzyme inhibitor. Inhibitors based on sinefungin, a substrate analog of the cofactor SAM, have been reported for WHSC1 but structures of these compounds were reported in a closely related enzyme, SETD2, rather than in an NSD family protein [39]. Additional studies with two known inhibitors of G9a, UNC0638 [39] and BIX-01294 [40], have not shown experimental evidence of specific binding of these compounds to the active site of the NSD family members. A specific inhibitor for any member of the NSD family would be a valuable addition to the chemical library for protein methyltransferases.

In this report, the first fully validated inhibitor of WHSC1 is presented. The norleucine-containing peptide is based on the histone protein H4 sequence surrounding residue K44 (H4K44). This peptide inhibits WHSC1 in both biochemical and biophysical assays with similar potencies. Additionally, the structure of the WHSC1L1 SET domain (residues 1054–1285) complexed with PTD2 and SAM has been solved. This structure, the first NSD family member complexed with an inhibitor, reveals the importance of the autoinhibitory loop in binding the protein substrate and further reveals how crucial loop dynamics are in the catalytic cycle of the enzyme. Additionally, this result provides further documentation of the general utility of norleucine containing peptides as inhibitors of lysine methyltransferases.

Materials and methods

Protein production

GST-tagged WHSC1 941–1240 (#51026) and Avi-tagged WHSC1 941–1240 (#111207) (Genbank #: NM_133330 were purchased from BPS Bioscience. Purity of the purchased protein was at least 80% as measured with the Agilent Bioanalyzer.

A WHSC1L1 construct containing amino acids 1054–1285 (Genbank #: NM_001042424.2) was cloned into a vector containing an N-terminal His-TEV tag to facilitate purification. His-TEV-WHSC1L1 (1054–1285) was produced in E. coli using IPTG to induce expression. After harvest by centrifugation, the resulting cell pellets were resuspended in Buffer A (20mM Tris, 250 mM NaCl, 5% glycerol, 2mM β-ME, pH 8.0) with 1 mM PMSF added, passed through a high pressure homogenizer, and centrifuged to remove cell debris. The supernatant was added to nickel affinity resin (Qiagen) pre-equilibrated with Buffer A and incubated for 2 hours at 4°C. The solution was then packed into a column and washed with Buffer A with 20 mM imidazole until baseline was achieved. Target protein was eluted with Buffer A with 250 mM imidazole. After overnight incubation with TEV protease at 4°C, the protein was passed over nickel affinity resin pre-equilibrated with Buffer B (20mM Tris, 50mM NaCl, 5% glycerol, 2mM β-ME, pH 8.0) and washed in the same buffer; the target protein was eluted in the flow-through and wash steps. The protein was then loaded onto a SP Sepharose FF column (GE) pre-equilibrated with Buffer B and washed with Buffer B until baseline was reached. The protein was eluted using 50mM Tris, 100mM NaCl, 1mM TCEP, pH 8.5. After concentration, the target protein was loaded onto a Superdex 200 gel filtration column pre-equilibrated with 50mM Tris, 20mM NaCl, 1mM TCEP, pH 8.5. Eluted fractions were combined, concentrated and frozen for future use. The final purity of the protein was >85% by SDS-page gel and the correct mass for the cleaved protein was obtained by mass spectrometry.

Peptide synthesis

Peptide synthesis for PTD1 and PTD2 was performed at Biopeptide with a purity > 95% by HPLC.

Biochemical assay

The WHSC1 941–1240 biochemical assay was performed using a radiometric format involving the transfer of a 3[H] methyl group from SAM to an N-terminally biotinylated histone H4 peptide (RLARRGGVKRISGLI). An optimized 1× assay buffer containing 20 mM Tris-HCl (pH 8.0), 1 mM DTT, 0.005% Tween-20, and 0.01% bovine skin gelatin was utilized. The assay was performed in a 384-well V-bottom polypropylene microplate (Greiner Bio-One) at 25°C at 10 nM enzyme under balanced substrate conditions (0.15 μM 3[H]-SAM, 0.85 μM unlabeled SAM and 0.5 μM peptide). The reaction was quenched after 120 minutes by adding excess (1 mM final concentration) unlabeled SAM. Quenched reactions were transferred onto streptavidin coated Flashplates (PerkinElmer), incubated for 2 hours, washed 1x with 0.1% Tween 20, and read on a Topcount instrument (PerkinElmer).

The WHSC1L1 1054–1285 biochemical assay was performed as above with 40 nM enzyme under balanced conditions (0.4 μM [3H]–SAM, 5.6 μM unlabeled SAM, and 0.8 μM peptide).

For determining the potency of PTD1 and PTD2, 1 μL of serially diluted peptide inhibitor (in 100% DMSO) was pre-incubated with 40 μL of enzyme for 30 minutes. The reaction was initiated by adding 10 μL substrate mix containing peptide and SAM. Concentration–response curves in units of percent inhibition were plotted as a function of inhibitor concentration and fitted in GraphPad Prism to determine the IC50 values using a four-parameter logistic equation. The 100% inhibition control consisted of 200 μM final concentration of PTD1, whereas the 0% inhibition control consisted of 2% DMSO.


The SPR binding assay for Avi-tagged WHSC1 941–1240 was performed at 15°C using the Biacore T200 system (GE Healthcare, Marlborough, MA). The running buffer contained 25 mM Tris-HCl pH 8.0, 150 mM NaCl, 5 mM DTT, 0.05% Tween-20, 10 μM ZnCl2, 50 μM SAM, and 2% DMSO. Biotinylated WHSC1 was immobilized on a streptavidin-coated SA chip up to 3700 RU (response units). The reference cell was blocked with PEG-biotin. Peptide inhibitor was injected at a flow rate of 50 μL/min with a 3-fold, 5-point dilution series ending at a 20 μM top concentration. Association and dissociation times were 240 and 2400 seconds, respectively. Double-referenced data were analyzed globally using a 1:1 binding model in BIAevaluation software.


PTD2 (2 mM) and SAM (2mM) were incubated with WHSC1L1 1054–1285 (10 mg/ml) on ice prior to crystallization. Vapor diffusion methods utilizing hanging drops with a 0.5 mL reservoir were used. 1 μL protein was added to 1 μL of reservoir solution containing 0.1 M Bis-Tris pH 6.5, 20% w/v PEG5000MME; 0.2 μL 0.1 M Sarcosine and microseeds were added to facilitate crystallization. Crystals were frozen in liquid nitrogen after incubation in a solution containing 80% reservoir/20% glycerol. Data reduction and scaling were performed with XDS [41]. Structure determination was performed using the WHSC1L1 SET domain structure (PDB code = 5UPD) and visual inspection of electron density maps. Structure refinement was completed using iterative cycles of refinement and model building using REFMAC [42] and COOT [43], respectively. Data collection and refinement statistics are shown in Table 1. The WHSC1L1 1054-1285-PTD2 structure has been deposited into the Protein Data Bank (PDB code = 6CEN).

Table 1. Crystallographic data collection and refinement statistics for WHSC1L1 1054–1285 with PTD2 and SAM.


NSD family members are large, multi-domain containing enzymes that, in addition to the catalytic SET domain, include known epigenetic reader domains such as PHD and PWWP domains [6, 44]. A truncated version of WHSC1 containing amino acids 941–1240, which spans the entire catalytic SET domain and additional residues on both the N- and C-termini but does not include reader domains, was used in the assay. WHSC1 is known to perform mono- and di-methylation of H3K36 [4, 5]. However, an internal peptide library covering histone protein sequences was tested to identify potential WHSC1 substrates. It was determined that peptides that included H4K44 were superior substrates for WHSC1 941–1240 than the physiological H3K36 sequence. This is consistent with experiments performed with native and recombinant histone octamers which showed H4K44 to be a superior substrate for WHSC1 in the absence of DNA [4].

Previous work has shown that the mutation of lysine to methionine in histone proteins can act as inhibitors for protein methyltransferases such as EZH2, SUV39H1, and G9a [45], and recent work has shown hydrophobic moieties in place of lysine can inhibit SETD2 [46, 47] and SETD8 [48]. The unnatural amino acid norleucine (Nle) can mimic the alkyl chain of lysine but is deficient in the terminal primary amine. As methylation of side chains by lysine methyltransferases requires the basic amine moiety, it was hypothesized that norleucine substituted for lysine residues would also result in a peptide that would be an inhibitor of the enzyme rather than a substrate. Two peptides were synthesized based on the H4K44 substrate sequence used in the assay, PTD1 (15 residues) and PTD2 (5 residues) (Table 2), where K44 has been replaced with Nle. These peptides were tested for inhibitory activity against a truncated construct of WHSC1 containing amino acids 941–1240 in a radiometric, biochemical assay following the transfer of 3H from SAM to an H4K44 peptide substrate. While the shorter 5 residue Nle peptide exhibited micromolar affinity towards WHSC1, the longer 15 residue peptide was 250-fold more potent. These data show the importance of exo site binding within the WHSC1 active site to achieve sub-micromolar potency (Fig 1 and Table 2).

Fig 1. Norleucine-containing peptides can inhibit WHSC1 and WHSC1L1 activity in vitro.

Representative peptide inhibitor biochemical dose-response curves for (A) WHSC1 941–1240 and (B) WHSC1L1 1054–1285. Error bars represent the standard deviation of three independent replicates. Resulting IC50 values are reported in Table 2.

Table 2. Biochemical and biophysical peptide inhibitor potency values for WHSC1 941–1240 and WHSC1L1 1054–1285a.

To determine the contribution of binding to WHSC1, a series of peptides were synthesized that varied the flanking position relative to the norleucine in PTD2 (Table 3). Predictably, replacement of the norleucine with alanine (PTD3) resulted in a 30-fold decrease in potency emphasizing the importance of this residue. A loss in WHSC1 potency is also observed when varying the flanking residues with respect to norleucine (Table 3) illustrating the importance of the surrounding amino acids and position of norleucine. Additionally, a series of H3K36 norleucine peptides were synthesized to determine the relative affinities with respect to the non-physiological H4K44 peptide inhibitors. As observed in Table 3, the longer 15 residue H3K36 peptide (PTD8) exhibited a substantial decrease in potency when compared with the H4K44 15 residue peptide (PTD1). This 150-fold loss in potency is extremely surprising on account of PTD8 being the physiologically relevant WHSC1 substrate sequence. The shorter 5 residue H3K36 peptide (PTD9) exhibited a more modest 6-fold decrease relative to the analogous H4K44 peptide (PTD2). This loss in potency can most likely be attributed to the replacement of isoleucine for proline resulting in a significant change in this shorter peptide sequence.

Table 3. Biochemical IC50 values for norleucine peptide inhibitors derived from the H4K44 and H3K36 sequencesa.

Inhibitor selectivity plays a crucial role in determining the molecular mechanism of action in a specific cell line. In order to understand the selectivity of this peptide series, the potency of PTD2 was determined across a panel of histone methyltransferases. While this peptide inhibitor exhibited micromolar affinity towards WHSC1, it demonstrated no enzymatic inhibition against the entire panel of methyltransferases (Table 4).

Table 4. PTD2 biochemical IC50 values for a panel of HMT enzymes.

Biochemical enzymatic inhibition can be the result of a non-specific interaction rather than direct binding to the protein target. Therefore, an orthogonal method to measure the binding of peptides to WHSC1 was necessary. An SPR assay using an Avi-tagged version of WHSC1 941–1240 was developed and validated using cofactor SAM (S1 Fig). PTD1 exhibited non-specific matrix interactions with the SPR surface at concentrations greater than 1 μM, resulting in the inability to accurately determine peptide affinity to WHSC1 (S2 Fig). This non-specific interaction can likely be attributed to the charged state of PTD1 and its subsequent binding to the matrix on the SPR sensor chip. An increase in salt and detergent concentration was unable to mitigate this phenomenon. In contrast, sensorgrams for PTD2 showed dose dependent binding and saturation of the enzyme in single-cycle kinetics mode at affinities comparable to potency values obtained in the biochemical enzyme assay (Fig 2). To validate these results and confirm that immobilization of WHSC1 to the SPR sensor chip did not interfere with inhibitor binding, an isothermal titration calorimetry (ITC) assay was utilized. As expected, this orthogonal biophysical technique was able to corroborate the potency of PTD2 (ITC KD = 3.4 μM) (Fig 3). To determine the influence of cofactor on peptide binding, PTD2 was co-injected in the presence of either S-adenosyl-L-homocysteine (SAH) or sinefungin (SFG) and in the absence of SAM (Fig 4). Importantly, the presence of SAM is essential for PTD2 potency as no binding is observed in the absence of cofactor and weak binding (KD > 100 μM) in the presence of either SAH or SFG. These results suggested that the inhibition of PTD2 was the result of specific binding of the peptide in the presence of SAM to the WHSC1 protein.

Fig 2. Representative sensorgram for PTD2 binding to Avi-tagged WHSC1 941–1240 from single-cycle kinetic SPR measurements.

WHSC1 was immobilized on a streptavidin-coated chip and peptide inhibitor was co-injected with SAM utilizing a 3-fold, 5-point dilution series ending at a 20 μM top concentration. Data reported in Table 2 is presented as the standard deviation of three independent experiments.

Fig 3. Isothermal titration calorimetry of PTD2 binding to WHSC1 941–1240.

Upper panel, calorimetric trace for ligand titration; lower panel, binding isotherm from calorimetric trace. WHSC1 concentration in the cell was 10 μM supplemented with 100 μM SAM. PTD2 concentration in the syringe was 100 μM.

Fig 4. Representative SPR sensorgrams for PTD2 binding to Avi-tagged WHSC1 941–1240 in the absence or presence of SAM analogs.

WHSC1 was immobilized on a streptavidin-coated chip and peptide inhibitor was either injected in the absence of cofactor (left panel), co-injected with SAH (middle panel), or co-injected with SFG (right panel) utilizing a 2-fold, 10-point dilution series ending at a 100 μM top concentration.

In order to understand the binding mode of the peptide to the protein and to further validate these inhibitors, the structure of PTD2 was solved with a SET domain construct of WHSC1L1 comprising amino acids 1054–1285. WHSC1 and WHSC1L1 are highly homologous proteins with 45% identity (53% similarity) overall and are 78% identical (84% similar) in the sequence corresponding to the construct utilized for structure determination. Additionally, WHSC1 (PDB code = 5LSU) and WHSC1L1 (PDB code = 5UPD) [39] are extremely similar in their secondary and tertiary structures (Fig 5A) with an RMSD of 0.95 Å2 with comparable biochemical IC50 values (Table 2). Therefore, WHSC1L1 is an appropriate choice to study the binding of the PTD2.

Fig 5. WHSC1L1 1054–1285 has a similar overall structure in relation to other NSD family proteins and can form a ternary complex with SAM and PTD2.

(A) Superposition of NSD family proteins (green/yellow = WHSC1L1-PTD2 (PDB code = 6CEN); cyan = WHSC1L1 (PDB code = 5UPD); magenta = WHSC1 (PDB code = 5LSU); purple = NSD1 (PDB code = 3OOI). All protein chains are shown as ribbons; SAM and PTD2 are depicted in stick representation. (B) Structure of WHSC1L1-PTD2-SAM ternary complex. Hydrogen bonds are indicated with dashed lines. (C) Superposition of WHSC1L1-PTD2 and SETD2-H3.3 K36M (grey; PDB code = 5JJY).

The 1.61 Å ternary structure of WHSC1L1 1054–1285 with PTD2 and SAM confirms the inhibitor peptide binds in the histone binding pocket with the norleucine residue occupying the substrate lysine channel (Fig 5B). Significant reorganization of the autoinhibitory loop is required for peptide binding, as the peptide overlaps with the loop in both the WHSC1 and NSD1 structures, both of which are in autoinhibitory conformations. In the complexed structure, the autoinhibitory loop moves towards the C-terminus of the construct to open the binding site for the peptide. The interactions of the peptide with the protein are dominated by hydrogen bonds; with the exception of the N-terminal acetyl group and NH of the glycine residue, all backbone amide and carbonyl moieties are involved in hydrogen bonds. Met 1201 and Thr 1203 of the autoinhibitory loop, Thr 1323 and residues 1260–1264 make direct and/or water mediated interactions with peptide (Fig 5B). As previously noted, the norleucine residue points towards SAM and the terminal methyl of the unnatural amino acid is 4.1 Å from the SAM methyl group. The alkyl chain also makes hydrophobic interactions with Tyr 1262 and Phe 1259. However, the norleucine residue is extremely important for the potency of PTD2, as mutation of this residue to alanine (PTD3) ablates binding of the peptide to WHSC1 (Table 3). The binding of PTD2 to WHSC1L1 is analogous to what was seen in peptide structures with SETD2, the closest protein to the NSD family on the lysine methyltransferase phylogenic tree [49]. Despite differences in length and sequences between the histone 3.3 peptide for SETD2 (PDB code = 5JJY) and PTD2 coupled with the variances in sequence and structure between the proteins, the location of the peptide in the binding site and the conformation of the autoinhibitory loop in the WHSC1L1 structure are extremely similar in the structure of SETD2 (Fig 5C).


The structures currently available for the NSD family show the autoinhibitory loop can adopt multiple conformations and that these conformational changes are likely important for catalytic activity. The importance of dynamics on enzymatic activity was noted in the first report of the autoinhibited NSD1 structure and molecular dynamic simulations were performed which showed an ensemble of possible conformations [38]. More extensive work exploring loop dynamics and the effect on catalytic activity has been performed with ASH1L, another closely related methyltransferase to the NSD family [49]. Mutations to residues within the ASH1L autoinhibitory loop were previously shown to affect catalytic activity [50] and more recent experiments showed the autoinhibitory loop movements also affected the protein structure in SET-I domain, indicating the loop rearrangement may be a more complex regulation of catalytic activity [51]. Importantly, these experiments with ASH1L were performed with truncated constructs rather than with the full length proteins. Like ASH1L, the NSD proteins are large, multidomain proteins, and it is possible that regulation of the autoinhibitory loop and the conformational states available to the protein are different in the full length protein when compared to truncated constructs. Of note, the binding site for PTD2 is less than 9 Å from the C-terminus of the WHSC1L1 construct used in the crystal structure, which is not the natural C-terminus. Therefore, studies investigating the effect of construct design on catalytic activity and inhibition by peptides or small molecule compounds targeting NSD family members would be desirable.

Although the link between WHSC1 and the t(4;14)(p16;q32) translocation in multiple myeloma has been known for years, potent and selective chemical matter for WHSC1 or other NSD family members has not yet been documented. This is in contrast to other members in the protein methyltransferase superfamily, many of which have in vitro and/or in vivo tool compounds available [52] and three protein methyltransferases have compounds that have reached the clinic (EZH2, DOT1L, PRMT5; Recently, two structurally related G9a inhibitors, UNC0638 [39] and BIX-01294 [40], have been reported to bind or inhibit one or more NSD family proteins. However, computational models of binding of BIX-01294 to the NSD co-enzyme site [40] are in direct contrast to the NMR data with UNC0638 indicating the compound interacted with the N-terminus of the SET domain construct [39]. It is possible that these two compounds differ in their interactions with WHSC1 and other NSD family members. However, a more complete understanding of these compounds and how they inhibit the enzymatic activity of NSD family members is needed before they can be considered validated chemical matter. In contrast, the work presented here shows PTD2 is the first validated inhibitor of the catalytic activity of WHSC1.

Several examples now exist of peptide inhibitors of protein methyltransferases utilizing hydrophobic moieties in place of the substrate lysine. Leucine and methionine containing peptides have been solved in SETD2 [46, 47]. Studies of inhibitory peptides for SETD8 [48] revealed a wide range of potencies could be measured by modifying the hydrophobic moiety in the lysine channel, with potencies ranging from 43 μM to ~100 nM. PTD2 is another example of this application of rational design to develop inhibitors and this approach may be useful in the design of inhibitors for other methyltransferase enzyme or other lysine binding proteins, particularly those for which small molecule drug discovery has been challenging.

Although PTD2 is a valuable asset in the toolkit for WHSC1 drug discovery efforts, additional work is needed to develop potent, selective and cell active inhibitors for WHSC1, WHSC1L1 and NSD. In SETD8, potency differences were seen by varying peptide length and the composition of residues N- and C-terminal to the core lysine channel binder [48]. The increased potency of PTD1 toward WHSC1 when compared to PTD2 may indicate that optimization of PTD2 is also possible. Although peptides typically have poor stability and cell permeability [53], advances in the field of cyclic peptides [54] and modifications such as N-methylation [55] have been successfully utilized to increase bioactivity of peptides, and this could be a potential avenue for optimization of PTD2. The structure of WHSC1L1-PTD2 may also inspire structure-based approaches for drug discovery efforts towards the NSD family, for which PTD2 will be a valuable tool for in vitro assay development and validation. Discovery and optimization of potent and selective inhibitors of WHSC1, WHSC1L1 and NSD1 would greatly help to understand the role of these enzymes in both normal and oncogenic cell lines and, potentially, human disease.

Supporting information

S1 Fig. Representative SPR sensorgram for SAM binding to Avi-tagged WHSC1 941–1240.


S2 Fig. Sensorgram for PTD1 showing non-specific matrix interactions with the WHSC1 941–1240 SPR surface.



We would like to thank Dr. Dennis Wegener for SPR data generation and analysis, to Wentao Wei and Fan Jiang for the crystallization and structure determination, to Dr. Colin Robinson for ITC data generation and analysis, and Dr. Neil Farrow for uploading the final structures to the PDB.


  1. 1. Kouzarides T. Chromatin modifications and their function. Cell. 2007;128(4):693–705. Epub 2007/02/27. pmid:17320507.
  2. 2. Arrowsmith CH, Bountra C, Fish PV, Lee K, Schapira M. Epigenetic protein families: a new frontier for drug discovery. Nat Rev Drug Discov. 2012;11(5):384–400. Epub 2012/04/14. pmid:22498752.
  3. 3. Ribich S, Harvey D, Copeland RA. Drug Discovery and Chemical Biology of Cancer Epigenetics. Cell Chem Biol. 2017;24(9):1120–47. Epub 2017/09/25. pmid:28938089.
  4. 4. Li Y, Trojer P, Xu CF, Cheung P, Kuo A, Drury WJ 3rd, et al. The target of the NSD family of histone lysine methyltransferases depends on the nature of the substrate. J Biol Chem. 2009;284(49):34283–95. Epub 2009/10/08. pmid:19808676; PubMed Central PMCID: PMC2797197.
  5. 5. Kuo AJ, Cheung P, Chen K, Zee BM, Kioi M, Lauring J, et al. NSD2 links dimethylation of histone H3 at lysine 36 to oncogenic programming. Mol Cell. 2011;44(4):609–20. Epub 2011/11/22. pmid:22099308; PubMed Central PMCID: PMC3222870.
  6. 6. Huang Z, Wu H, Chuai S, Xu F, Yan F, Englund N, et al. NSD2 is recruited through its PHD domain to oncogenic gene loci to drive multiple myeloma. Cancer Res. 2013;73(20):6277–88. Epub 2013/08/28. pmid:23980095.
  7. 7. Cerveira N, Correia C, Doria S, Bizarro S, Rocha P, Gomes P, et al. Frequency of NUP98-NSD1 fusion transcript in childhood acute myeloid leukaemia. Leukemia. 2003;17(11):2244–7. Epub 2003/08/22. pmid:12931227.
  8. 8. Wang GG, Cai L, Pasillas MP, Kamps MP. NUP98-NSD1 links H3K36 methylation to Hox-A gene activation and leukaemogenesis. Nat Cell Biol. 2007;9(7):804–12. Epub 2007/06/26. pmid:17589499.
  9. 9. Stec I, Wright TJ, van Ommen GJ, de Boer PA, van Haeringen A, Moorman AF, et al. WHSC1, a 90 kb SET domain-containing gene, expressed in early development and homologous to a Drosophila dysmorphy gene maps in the Wolf-Hirschhorn syndrome critical region and is fused to IgH in t(4;14) multiple myeloma. Hum Mol Genet. 1998;7(7):1071–82. Epub 1998/06/09. pmid:9618163.
  10. 10. Chesi M, Nardini E, Lim RS, Smith KD, Kuehl WM, Bergsagel PL. The t(4;14) translocation in myeloma dysregulates both FGFR3 and a novel gene, MMSET, resulting in IgH/MMSET hybrid transcripts. Blood. 1998;92(9):3025–34. Epub 1998/10/27. pmid:9787135.
  11. 11. Martinez-Garcia E, Popovic R, Min DJ, Sweet SM, Thomas PM, Zamdborg L, et al. The MMSET histone methyl transferase switches global histone methylation and alters gene expression in t(4;14) multiple myeloma cells. Blood. 2011;117(1):211–20. Epub 2010/10/27. pmid:20974671; PubMed Central PMCID: PMC3037745.
  12. 12. Rosati R, La Starza R, Veronese A, Aventin A, Schwienbacher C, Vallespi T, et al. NUP98 is fused to the NSD3 gene in acute myeloid leukemia associated with t(8;11)(p11.2;p15). Blood. 2002;99:3857–60. pmid:11986249
  13. 13. Hudlebusch HR, Santoni-Rugiu E, Simon R, Ralfkiaer E, Rossing HH, Johansen JV, et al. The histone methyltransferase and putative oncoprotein MMSET is overexpressed in a large variety of human tumors. Clin Cancer Res. 2011;17(9):2919–33. Epub 2011/03/10. pmid:21385930.
  14. 14. Li J, Yin C, Okamoto H, Mushlin H, Balgley BM, Lee CS, et al. Identification of a novel proliferation-related protein, WHSC1 4a, in human gliomas. Neuro Oncol. 2008;10(1):45–51. Epub 2008/01/10. pmid:18182627; PubMed Central PMCID: PMC2600837.
  15. 15. Kassambara A, Klein B, Moreaux J. MMSET is overexpressed in cancers: link with tumor aggressiveness. Biochem Biophys Res Commun. 2009;379(4):840–5. Epub 2009/01/06. pmid:19121287.
  16. 16. Jaffe JD, Wang Y, Chan HM, Zhang J, Huether R, Kryukov GV, et al. Global chromatin profiling reveals NSD2 mutations in pediatric acute lymphoblastic leukemia. Nat Genet. 2013;45(11):1386–91. Epub 2013/10/01. pmid:24076604; PubMed Central PMCID: PMCPMC4262138.
  17. 17. Oyer JA, Huang X, Zheng Y, Shim J, Ezponda T, Carpenter Z, et al. Point mutation E1099K in MMSET/NSD2 enhances its methyltranferase activity and leads to altered global chromatin methylation in lymphoid malignancies. Leukemia. 2014;28(1):198–201. Epub 2013/07/05. pmid:23823660; PubMed Central PMCID: PMCPMC3888226.
  18. 18. Yang ZQ, Liu G, Bollig-Fischer A, Giroux CN, Ethier SP. Transforming properties of 8p11-12 amplified genes in human breast cancer. Cancer Res. 2010;70(21):8487–97. Epub 2010/10/14. pmid:20940404; PubMed Central PMCID: PMC3089754.
  19. 19. Basavapathruni A, Jin L, Daigle SR, Majer CR, Therkelsen CA, Wigle TJ, et al. Conformational adaptation drives potent, selective and durable inhibition of the human protein methyltransferase DOT1L. Chem Biol Drug Des. 2012;80(6):971–80. Epub 2012/09/18. pmid:22978415.
  20. 20. Yu W, Chory EJ, Wernimont AK, Tempel W, Scopton A, Federation A, et al. Catalytic site remodelling of the DOT1L methyltransferase by selective inhibitors. Nat Commun. 2012;3:1288. Epub 2012/12/20. pmid:23250418.
  21. 21. Daigle SR, Olhava EJ, Therkelsen CA, Basavapathruni A, Jin L, Boriack-Sjodin PA, et al. Potent inhibition of DOT1L as treatment of MLL-fusion leukemia. Blood. 2013;122(6):1017–25. Epub 2013/06/27. pmid:23801631; PubMed Central PMCID: PMC3739029.
  22. 22. Yu W, Smil D, Li F, Tempel W, Fedorov O, Nguyen KT, et al. Bromo-deaza-SAH: a potent and selective DOT1L inhibitor. Bioorg Med Chem. 2013;21(7):1787–94. Epub 2013/02/26. pmid:23433670; PubMed Central PMCID: PMC4780262.
  23. 23. Yao Y, Chen P, Diao J, Cheng G, Deng L, Anglin JL, et al. Selective inhibitors of histone methyltransferase DOT1L: design, synthesis, and crystallographic studies. J Am Chem Soc. 2011;133(42):16746–9. Epub 2011/09/23. pmid:21936531; PubMed Central PMCID: PMC3492951.
  24. 24. Chang Y, Zhang X, Horton JR, Upadhyay AK, Spannhoff A, Liu J, et al. Structural basis for G9a-like protein lysine methyltransferase inhibition by BIX-01294. Nat Struct Mol Biol. 2009;16(3):312–7. Epub 2009/02/17. pmid:19219047; PubMed Central PMCID: PMC2676930.
  25. 25. Liu F, Chen X, Allali-Hassani A, Quinn AM, Wasney GA, Dong A, et al. Discovery of a 2,4-diamino-7-aminoalkoxyquinazoline as a potent and selective inhibitor of histone lysine methyltransferase G9a. J Med Chem. 2009;52(24):7950–3. Epub 2009/11/07. pmid:19891491; PubMed Central PMCID: PMC2825141.
  26. 26. Liu F, Chen X, Allali-Hassani A, Quinn AM, Wigle TJ, Wasney GA, et al. Protein lysine methyltransferase G9a inhibitors: design, synthesis, and structure activity relationships of 2,4-diamino-7-aminoalkoxy-quinazolines. J Med Chem. 2010;53(15):5844–57. Epub 2010/07/10. pmid:20614940; PubMed Central PMCID: PMC2920043.
  27. 27. Liu F, Barsyte-Lovejoy D, Li F, Xiong Y, Korboukh V, Huang XP, et al. Discovery of an in vivo chemical probe of the lysine methyltransferases G9a and GLP. J Med Chem. 2013;56(21):8931–42. Epub 2013/10/10. pmid:24102134; PubMed Central PMCID: PMC3880643.
  28. 28. Sweis RF, Pliushchev M, Brown PJ, Guo J, Li F, Maag D, et al. Discovery and development of potent and selective inhibitors of histone methyltransferase g9a. ACS Med Chem Lett. 2014;5(2):205–9. Epub 2014/06/06. pmid:24900801; PubMed Central PMCID: PMC4027767.
  29. 29. Xiong Y, Li F, Babault N, Dong A, Zeng H, Wu H, et al. Discovery of Potent and Selective Inhibitors for G9a-Like Protein (GLP) Lysine Methyltransferase. J Med Chem. 2017;60(5):1876–91. Epub 2017/01/31. pmid:28135087; PubMed Central PMCID: PMC5352984.
  30. 30. Vedadi M, Barsyte-Lovejoy D, Liu F, Rival-Gervier S, Allali-Hassani A, Labrie V, et al. A chemical probe selectively inhibits G9a and GLP methyltransferase activity in cells. Nat Chem Biol. 2011;7(8):566–74. Epub 2011/07/12. pmid:21743462; PubMed Central PMCID: PMCPMC3184254.
  31. 31. Ferguson AD, Larsen NA, Howard T, Pollard H, Green I, Grande C, et al. Structural basis of substrate methylation and inhibition of SMYD2. Structure. 2011;19(9):1262–73. Epub 2011/07/26. pmid:21782458.
  32. 32. Sweis RF, Wang Z, Algire M, Arrowsmith CH, Brown PJ, Chiang GG, et al. Discovery of A-893, A New Cell-Active Benzoxazinone Inhibitor of Lysine Methyltransferase SMYD2. ACS Med Chem Lett. 2015;6(6):695–700. Epub 2015/06/24. pmid:26101576; PubMed Central PMCID: PMC4468393.
  33. 33. Nguyen H, Allali-Hassani A, Antonysamy S, Chang S, Chen LH, Curtis C, et al. LLY-507, a Cell-active, Potent, and Selective Inhibitor of Protein-lysine Methyltransferase SMYD2. J Biol Chem. 2015;290(22):13641–53. Epub 2015/04/01. pmid:25825497; PubMed Central PMCID: PMC4447944.
  34. 34. Cowen SD, Russell D, Dakin LA, Chen H, Larsen NA, Godin R, et al. Design, Synthesis, and Biological Activity of Substrate Competitive SMYD2 Inhibitors. J Med Chem. 2016;59(24):11079–97. Epub 2016/12/23. pmid:28002961.
  35. 35. Eggert E, Hillig RC, Koehr S, Stockigt D, Weiske J, Barak N, et al. Discovery and Characterization of a Highly Potent and Selective Aminopyrazoline-Based in Vivo Probe (BAY-598) for the Protein Lysine Methyltransferase SMYD2. J Med Chem. 2016;59(10):4578–600. Epub 2016/04/15. pmid:27075367; PubMed Central PMCID: PMC4917279.
  36. 36. Mitchell LH, Boriack-Sjodin PA, Smith S, Thomenius M, Rioux N, Munchhof M, et al. Novel Oxindole Sulfonamides and Sulfamides: EPZ031686, the First Orally Bioavailable Small Molecule SMYD3 Inhibitor. ACS Med Chem Lett. 2016;7(2):134–8. Epub 2016/03/18. pmid:26985287; PubMed Central PMCID: PMCPMC4753551.
  37. 37. Mitchell LH, Drew AE, Ribich SA, Rioux N, Swinger KK, Jacques SL, et al. Aryl Pyrazoles as Potent Inhibitors of Arginine Methyltransferases: Identification of the First PRMT6 Tool Compound. ACS Med Chem Lett. 2015;6(6):655–9. Epub 2015/06/24. pmid:26101569; PubMed Central PMCID: PMCPMC4468411.
  38. 38. Qiao Q, Li Y, Chen Z, Wang M, Reinberg D, Xu RM. The structure of NSD1 reveals an autoregulatory mechanism underlying histone H3K36 methylation. J Biol Chem. 2011;286(10):8361–8. Epub 2011/01/05. pmid:21196496; PubMed Central PMCID: PMC3048720.
  39. 39. Tisi D, Chiarparin E, Tamanini E, Pathuri P, Coyle JE, Hold A, et al. Structure of the Epigenetic Oncogene MMSET and Inhibition by N-Alkyl Sinefungin Derivatives. ACS Chem Biol. 2016;11(11):3093–105. Epub 2016/08/30. pmid:27571355.
  40. 40. Morishita M, Mevius DEHF, Shen Y, Zhao S, di Luccio E. BIX-01294 inhibits oncoproteins NSD1, NSD2 and NSD3. Med Chem Res. 2017;26(9):2038–47.
  41. 41. Kabsch W. Xds. Acta Crystallogr D Biol Crystallogr. 2010;66(Pt 2):125–32. Epub 2010/02/04. pmid:20124692; PubMed Central PMCID: PMC2815665.
  42. 42. Murshudov GN, Vagin AA, Dodson EJ. Refinement of macromolecular structures by the maximum-likelihood method. Acta Crystallogr D Biol Crystallogr. 1997;53(Pt 3):240–55. Epub 1997/05/01. pmid:15299926.
  43. 43. Emsley P, Lohkamp B, Scott WG, Cowtan K. Features and development of Coot. Acta Crystallogr D Biol Crystallogr. 2010;66(Pt 4):486–501. Epub 2010/04/13. pmid:20383002; PubMed Central PMCID: PMC2852313.
  44. 44. Sankaran SM, Wilkinson AW, Elias JE, Gozani O. A PWWP Domain of Histone-Lysine N-Methyltransferase NSD2 Binds to Dimethylated Lys-36 of Histone H3 and Regulates NSD2 Function at Chromatin. J Biol Chem. 2016;291(16):8465–74. Epub 2016/02/26. pmid:26912663; PubMed Central PMCID: PMCPMC4861420.
  45. 45. Lewis PW, Muller MM, Koletsky MS, Cordero F, Lin S, Banaszynski LA, et al. Inhibition of PRC2 activity by a gain-of-function H3 mutation found in pediatric glioblastoma. Science. 2013;340(6134):857–61. Epub 2013/03/30. pmid:23539183; PubMed Central PMCID: PMC3951439.
  46. 46. Yang S, Zheng X, Lu C, Li GM, Allis CD, Li H. Molecular basis for oncohistone H3 recognition by SETD2 methyltransferase. Genes Dev. 2016;30(14):1611–6. Epub 2016/07/31. pmid:27474439; PubMed Central PMCID: PMC4973290.
  47. 47. Zhang Y, Shan CM, Wang J, Bao K, Tong L, Jia S. Molecular basis for the role of oncogenic histone mutations in modulating H3K36 methylation. Sci Rep. 2017;7:43906. Epub 2017/03/04. pmid:28256625; PubMed Central PMCID: PMC5335568.
  48. 48. Judge RA, Zhu H, Upadhyay AK, Bodelle PM, Hutchins CW, Torrent M, et al. Turning a Substrate Peptide into a Potent Inhibitor for the Histone Methyltransferase SETD8. ACS Med Chem Lett. 2016;7(12):1102–6. Epub 2016/12/21. pmid:27994746; PubMed Central PMCID: PMC5150682.
  49. 49. Richon VM, Johnston D, Sneeringer CJ, Jin L, Majer CR, Elliston K, et al. Chemogenetic analysis of human protein methyltransferases. Chem Biol Drug Des. 2011;78(2):199–210. Epub 2011/05/14. pmid:21564555.
  50. 50. An S, Yeo KJ, Jeon YH, Song JJ. Crystal structure of the human histone methyltransferase ASH1L catalytic domain and its implications for the regulatory mechanism. J Biol Chem. 2011;286(10):8369–74. Epub 2011/01/18. pmid:21239497; PubMed Central PMCID: PMC3048721.
  51. 51. Rogawski DS, Ndoj J, Cho HJ, Maillard I, Grembecka J, Cierpicki T. Two Loops Undergoing Concerted Dynamics Regulate the Activity of the ASH1L Histone Methyltransferase. Biochemistry. 2015;54(35):5401–13. Epub 2015/08/21. pmid:26292256; PubMed Central PMCID: PMC4664444.
  52. 52. Schapira M. Chemical Inhibition of Protein Methyltransferases. Cell Chem Biol. 2016;23(9):1067–76. Epub 2016/08/30. pmid:27569753.
  53. 53. Rubin S, Qvit N. Cyclic Peptides for Protein-Protein Interaction Targets: Applications to Human Disease. Crit Rev Eukaryot Gene Expr. 2016;26(3):199–221. Epub 2016/09/22. pmid:27650985.
  54. 54. White AM, Craik DJ. Discovery and optimization of peptide macrocycles. Expert Opin Drug Discov. 2016;11(12):1151–63. Epub 2016/10/11. pmid:27718641.
  55. 55. Rader AFB, Reichart F, Weinmuller M, Kessler H. Improving oral bioavailability of cyclic peptides by N-methylation. Bioorg Med Chem. 2017. Epub 2017/09/10. pmid:28886995.