The Polycomb group (PcG) proteins are a family of important developmental regulators. Members of the family function as components of large protein complexes involved in the establishment and maintenance of transcriptional repression of developmental control genes. MBTD1 (Malignant Brain Tumor domain-containing protein 1) is one such PcG protein and contains four MBT repeats.
We have determined the crystal structure of MBTD1 (residues 130–566, covering the four MBT repeats) at 2.5 Å resolution by X-ray crystallography. The structure reveals its similarity to another four-MBT-repeat protein, L3MBTL2, which binds histones carrying lower methylation states of lysine. Fluorescence polarization experiments confirmed that MBTD1, like L3MBTL1 and L3MBTL2, preferentially binds mono- and di-methyllysine histone peptides. None of the MBT-peptide complex structures characterized to date exhibits strong histone peptide sequence selectivity; all use a “cavity insertion recognition mode” to recognize the methylated lysine, in which the deeply buried methyl-lysine forms extensive interactions with the protein while the peptide residues flanking the methyl-lysine form very few contacts. Nevertheless, our mutagenesis data based on L3MBTL1 suggest that histone peptides cannot bind to MBT repeats in an arbitrary orientation.
The four MBT repeats in MBTD1 exhibit an asymmetric rhomboid architecture. Like other MBT repeat proteins characterized so far, MBTD1 binds mono- or dimethylated lysine histones through one of its four MBT repeats, utilizing a semi-aromatic cage.
This article can also be viewed as an enhanced version in which the text of the article is integrated with interactive 3D representations and animated transitions. Please note that a web plugin is required to access this enhanced functionality. Instructions for the installation and use of the web plugin are available in Text S1.
Citation: Eryilmaz J, Pan P, Amaya MF, Allali-Hassani A, Dong A, et al. (2009) Structural Studies of a Four-MBT Repeat Protein MBTD1. PLoS ONE 4(10): e7274. doi:10.1371/journal.pone.0007274
Editor: Nick Gay, University of Cambridge, United Kingdom
Received: August 7, 2009; Accepted: September 8, 2009; Published: October 20, 2009
Copyright: © 2009 Eryilmaz et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: The Structural Genomics Consortium is a registered charity (number 1097737) that receives funds from the Canadian Institutes for Health Research, the Canadian Foundation for Innovation, Genome Canada through the Ontario Genomics Institute, GlaxoSmithKline, Karolinska Institutet, the Knut and Alice Wallenberg Foundation, the Ontario Innovation Trust, the Ontario Ministry for Research and Innovation, Merck & Co Inc., the Novartis Research Foundation, the Swedish Agency for Innovation Systems, the Swedish Foundation for Strategic Research and the Wellcome Trust. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
The nucleosome is the fundamental repeating unit of chromatin. The nucleosome core particle consists of approximately 147 base pairs of DNA wrapped around a histone octamer consisting of two copies each of the core histones H2A, H2B, H3, and H4. Each of the four core histones is composed of a globular domain and an unstructured tail. The unstructured tails protrude from the nucleosome and are subject to a number of post-translational modifications, including acetylation, methylation, phosphorylation and ubiquitylation. Methylation of lysine and arginine residues on the histones is an important regulator of eukaryotic transcription and genome integrity. These post-translational modifications are thought to act as markers that recruit proteins to specific regions of chromatin.
The MBT (Malignant Brain Tumor) repeat is a structural motif of ~100 amino acids that is conserved from C. elegans to humans and exists as tandem repeats. In the human genome, there are at least nine MBT repeat proteins, each containing two, three or four MBT repeats. The MBT repeat was originally identified in the Drosophila tumor-suppressor protein L(3)MBT, and mutations of the L(3)MBT gene cause malignant transformation of the optic neuroblasts. Like other members of the ‘Royal Family’, MBT repeat proteins have been shown to recognize methylated lysine residues on histones, and functional studies have suggested important connections between MBT domain-containing proteins and the transcriptional state of chromatin regions.
Structural studies of MBT repeat proteins show that, unlike chromodomains, MBT repeat proteins use a semi-aromatic cage to accommodate the methyllysine. The difference in the residue composition of the binding pockets of these histone code effectors allows them to discriminate between different lysine methylation states. The chromodomain proteins preferentially bind trimethylated lysine histones, whereas MBT repeats show specificity towards the lower methylation states of lysine. To gain insight into the conservation of the semi-aromatic cage and its function in methyl recognition, we have determined the structure of the four-MBT-repeat domain of human MBTD1 and characterized its binding specificity against different histone peptides.
Results and Discussion
MBTD1 is a four-MBT-repeat protein comprising 628 amino acids. It contains an FCS-type zinc finger with a putative regulatory function at the N-terminus and four MBT repeats at the C-terminus. To determine the crystal structure of the four-MBT-repeat fragment of MBTD1 and characterize its histone binding specificity, we cloned and purified a human MBTD1 fragment composed of all four MBT repeats (residues 130–566). MBTD1(130–566) crystallized in the orthorhombic space group P2₁2₁2₁ (a = 70.31 Å, b = 100.90 Å, c = 135.30 Å) with two molecules in the asymmetric unit (Figure 1). Crystal diffraction data and refinement statistics for the MBTD1 structure are given in Table 1. In the MBTD1 structure, each molecule contains four MBT repeats that exhibit an irregular rhomboid architecture. A narrow channel runs through the middle of the structure and is filled with water molecules. Consistent with previously reported MBT repeat structures, each MBT repeat contains an extended “arm” that packs against the globular β-subunit core of the preceding repeat. Structural and sequence alignments show that the β-barrel subunit core region has the highest conservation between MBTD1 and other MBT repeat proteins. The residues involved in intermolecular hydrogen bond interactions between the two molecules in the asymmetric unit are conserved throughout the whole MBT repeat family, possibly implying that MBTD1 functions as a dimer (Figure 2). Nonetheless, dynamic light scattering (DLS) and size exclusion experiments demonstrate that MBTD1 is a monomer in solution at the working concentrations (~5 mg/ml, data not shown). Structurally, the last three MBT repeats in MBTD1 and L3MBTL2 form a three-bladed propeller architecture. Superposition of the three MBT repeats of L3MBTL1 with those of MBTD1 and L3MBTL2 shows a good structural alignment of Cα positions, with root-mean-square deviations (RMSD) of 2.345 Å and 2.168 Å, respectively. Furthermore, MBTD1 superimposes well on L3MBTL2, with an RMSD of 0.697 Å.
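The RMSD values quoted above compare equivalent Cα positions after superposition. As a minimal sketch of that metric only (the text does not name the superposition software used, and no superposition step is performed here), the function below computes the RMSD between two equal-length lists of coordinates that are assumed to be already superposed. The function name `ca_rmsd` and the toy coordinates are illustrative assumptions, not values from the MBTD1 or L3MBTL2 models.

```python
import math


def ca_rmsd(coords_a, coords_b):
    """RMSD (in the input units, e.g. Å) between paired, already-superposed points."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    squared = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        # Sum of squared distances between equivalent atoms
        squared += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared / len(coords_a))


# Toy coordinates (hypothetical): three points shifted by 1 Å along y
model_a = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
model_b = [(0.0, 1.0, 0.0), (3.8, 1.0, 0.0), (7.6, 1.0, 0.0)]
toy_rmsd = ca_rmsd(model_a, model_b)
```

In practice the pairing of equivalent Cα atoms comes from a structure-based alignment, so the reported value depends on which residues are included in the comparison.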
Figure 1. There are two MBTD1 molecules in the asymmetric unit, and each molecule contains four MBT repeats (MBT1, MBT2, MBT3, MBT4), which exhibit an irregular rhomboid architecture.
Figure 2. a) One molecule is shown in surface representation and the other in ribbon representation, with residues involved in the intermolecular interactions colored in green. b) Sequence alignment among different MBT modules. All four MBT repeats in MBTD1 were aligned with the methyllysine-binding MBT repeats in other MBT proteins (the fourth MBT repeat in L3MBTL2, the second MBT repeat in L3MBTL1, the second MBT repeat in SCMH1 and the second MBT repeat in SCML2). Key residues participating in binding pocket formation are colored in blue, and residues involved in the intermolecular interactions are colored in green.
To date, L3MBTL1, L3MBTL2, dScm, dSFMBT and SCML2 have been shown to selectively bind lower methylated histone tails. To explore whether MBTD1 also possesses histone binding ability, we carried out binding studies of the MBTD1 4MBT fragment against a set of fluorescently labeled peptides derived from the N-terminal tails of histones H3 and H4 using fluorescence polarization (Figure 3). The data revealed that MBTD1 specifically binds mono- and dimethylated lysine at H4K20 and exhibits negligible binding to the unmodified or trimethylated peptides. The binding results also show that MBTD1 interacts weakly with mono- or dimethylated H3K9, H3K4 and H3K27 peptides. This indicates that MBTD1 has a modest preference for lower methylated H4K20 peptides over other histone lysine methylation sites.
Figure 3. The data show that MBTD1 specifically binds mono- and dimethylated lysine at H4K20 and exhibits only negligible binding to the unmodified or trimethylated peptides. In addition, MBTD1 was shown to bind weakly to mono- or dimethylated H3K9, H3K4 and H3K27 peptides.
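Fluorescence polarization titrations of this kind are commonly summarized with a dissociation constant. The text does not state which binding model was fitted, so the following is only a sketch under two stated assumptions: a single binding site, and protein in large excess over the labeled peptide so that free protein is approximately total protein. The signal is also treated as a simple linear mix of free and bound states. The function names and all numbers are hypothetical and are not measurements from this study.

```python
def fraction_bound(protein_conc, kd):
    """Fraction of labeled peptide bound (one site, protein in large excess)."""
    if protein_conc < 0 or kd <= 0:
        raise ValueError("protein_conc must be >= 0 and kd must be > 0")
    return protein_conc / (kd + protein_conc)


def polarization(protein_conc, kd, p_free, p_bound):
    """Observed signal as a linear mix of the free and bound peptide signals."""
    f = fraction_bound(protein_conc, kd)
    return p_free + (p_bound - p_free) * f


# Hypothetical values for illustration only (concentrations in µM, signal in mP)
kd_um = 50.0
half_point = polarization(50.0, kd_um, p_free=40.0, p_bound=140.0)
```

Under this model, a peptide that binds weakly simply has a larger `kd`, so its curve reaches the half-maximal signal at a higher protein concentration.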
Although the four MBT repeats of MBTD1 have similar three-dimensional structures and high sequence identity, only the fourth MBT repeat (MBT4) contains the semi-aromatic cage. As in L3MBTL2, this cage is formed by the loops between the first and second strands and between the third and fourth strands of the β-barrel core domain, and it constitutes the binding site for the methyllysine residue. The binding pocket of MBTD1 MBT4 is shown in Figure 4, where an open cage is formed by the aromatic residues Phe526, Trp529 and Tyr533, the negatively charged Asp502, and Leu508. The three highly conserved aromatic residues Phe526, Trp529 and Tyr533 form the base and the walls of the hydrophobic pocket. Interestingly, Arg325 from a symmetry-related molecule is inserted into the binding pocket, mimicking the methyl-lysine binding in L3MBTL2. Arg325 is stabilized by a salt bridge with the pocket residue Asp502 and by cation-π and van der Waals interactions with the aromatic cage residues.
Figure 4. The MBTD1 binding pocket is formed by the aromatic residues Phe526, Trp529 and Tyr533, the negatively charged Asp502, and Leu508. The binding pocket is occupied by Arg325 from a symmetry-related molecule.
The utilization of a single MBT repeat for histone binding, despite the structural and sequence similarity shared by all four MBT repeats, is reminiscent of the histone binding mode employed by other MBT-containing proteins, including L3MBTL1. This was explored in L3MBTL1 in parallel with the MBTD1 work: of the three MBT repeats in L3MBTL1, only the second (MBT2) was bound to histone peptide. Based on sequence and structural alignment, we attributed this phenomenon to the steric hindrance generated by long or bulky side chains (phenylalanine in MBT1 and arginine in MBT3) in the potential pockets, in place of the cysteine found in MBT2. We also suggested that electrostatic repulsion between the methyllysine and the arginine in MBT3 might be an additional factor preventing histone peptide binding. To test our hypothesis and to explore the possibility of converting a naturally “non-functional” methyllysine-binding MBT repeat to a “functional” state, we generated an L3MBTL1 Arg467Cys mutant, which according to our prediction should eliminate both the steric hindrance and the electrostatic repulsion in MBT3. Crystals of the L3MBTL1 R467C mutant were successfully grown in the presence of an excess of H4K20me2 peptide. However, no electron density was observed in the predicted binding site in the mutated MBT3. An ITC experiment in which this mutant was titrated with H4K20me2 also suggests that there is only one binding site (data not shown). Upon closer examination of the mutant crystal structure, another residue, Arg461, was identified that potentially clashes with Arg17 of the histone peptide and therefore prevents the methylated lysine histone from binding in this pocket (Figure 5). In MBT2, the same position is occupied by Met357.
None of the MBT-peptide complex structures characterized to date exhibits strong histone peptide sequence selectivity; all use a “cavity insertion recognition mode” to recognize the methylated lysine, in which the deeply buried methyl-lysine forms extensive interactions with the protein while the peptide residues flanking the methyl-lysine form very few contacts. Although a “functional” MBT3 was not obtained, this mutagenesis study revealed that the histone peptide cannot bind to MBT repeats in an arbitrary orientation.
Figure 5. MBT3 residues are shown in magenta and MBT2 residues in cyan. In the mutant crystal structure, Arg461 would potentially clash with Arg17 of the histone peptide and prevent the methylated lysine histone from binding in this pocket.
Materials and Methods
Protein expression and purification
The human MBTD1 protein (residues 130–566) was subcloned into the pET28a-MHL vector and transformed into Escherichia coli BL21 (DE3)-V2R-pRARE2. Cells were grown in Luria-Bertani medium at 37°C until they reached an absorbance at 600 nm of approximately 3.0, then cooled to 14°C and induced overnight with 1 mM isopropyl-β-D-thiogalactoside (IPTG). Cells were harvested by centrifugation at 7500 rpm for 15 minutes, resuspended in a buffer containing 20 mM Tris (pH 8.0), 250 mM NaCl and 10% glycerol, and lysed by sonication. The supernatant fraction was obtained by centrifugation at 16,000 rpm for 1 hour and passed through Ni-NTA Superflow resin (QIAGEN) that had been pre-equilibrated in 20 mM Tris-HCl (pH 8.0), 250 mM NaCl, 10% glycerol. The resin was then washed, and the protein was eluted with 20 mM Tris-HCl (pH 8.0), 250 mM NaCl, 10% glycerol, 500 mM imidazole. The protein was further purified on a HiTrap Q HP column (GE Healthcare, Piscataway, NJ) and a Superdex 75 gel-filtration column (GE Healthcare, Piscataway, NJ). The protein was concentrated to 10 mg/ml in a buffer containing 20 mM Tris-HCl (pH 8.0), 0.2 M NaCl, 10% glycerol, 1 mM EDTA and 1 mM DTT. The human L3MBTL1 R467C mutant containing three MBT repeats (residues 200–522, 3MBT) was generated by Stratagene's QuikChange method. The mutant protein was expressed and purified as previously described. Fluorescence polarization assays were performed as previously described.
Protein crystallization and structure determination
Crystals of MBTD1 were obtained by macroseeding at 18°C using hanging-drop vapor diffusion, with drops containing 5 µl of 10 mg/ml protein solution mixed with 5 µl of reservoir solution. The reservoir solution contained 20% PEG 3350 and 0.2 M CaOAc. For cryoprotection, crystals were soaked for a few seconds in reservoir solution containing 20% (wt/vol) glycerol. The crystals were mounted in a cryoloop and flash-frozen in liquid nitrogen. X-ray data were collected at 100 K on beamline 23ID-B of the Advanced Photon Source (APS) at Argonne National Laboratory. A native data set was collected to 2.5 Å resolution. The crystal belongs to space group P2₁2₁2₁, with unit cell parameters a = 70.31 Å, b = 100.90 Å, and c = 135.30 Å. There are two molecules in the asymmetric unit, giving a Matthews coefficient (VM) of 2.42 Å³ Da⁻¹ and a solvent content of 48.2%. The structure of MBTD1 was determined by molecular replacement with PHASER, using the crystal structure of L3MBTL2 (PDB code 3f70) as a search model. Automated model building was done with ARP/wARP, followed by manual correction. Refinement was carried out using REFMAC in CCP4. The progress of refinement was monitored with Rfree and by inspection of 2|Fo|−|Fc| and |Fo|−|Fc| maps in COOT. When Rfree reached 29.8%, TLS refinement was applied and Rfree dropped to 27.4%. Stereochemical analysis was done with Molprobity. The human L3MBTL1 R467C mutant crystals were grown, and the structure was solved, using the same methods that we previously reported for the wild-type protein.
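The Matthews coefficient and solvent content quoted above follow from the unit cell, the space group and the mass of the crystallized construct. The sketch below shows the arithmetic under stated assumptions: the cell edges and the two molecules per asymmetric unit come from the text, the four symmetry operators are those of space group P2₁2₁2₁, and the molecular mass `ASSUMED_MW_DA` is a hypothetical value chosen for illustration, since the mass of the construct is not given in the text. The solvent estimate uses the common approximation 1 − 1.23/VM, which assumes a typical protein density.

```python
def orthorhombic_cell_volume(a, b, c):
    """Unit-cell volume in Å^3 for a cell whose angles are all 90 degrees."""
    return a * b * c


def matthews_coefficient(cell_volume, molecular_weight_da, sym_ops, mols_per_asu):
    """V_M in Å^3/Da: cell volume divided by the total protein mass in the cell."""
    return cell_volume / (molecular_weight_da * sym_ops * mols_per_asu)


def solvent_fraction(v_m, protein_term=1.23):
    """Approximate solvent fraction, 1 - 1.23/V_M (typical protein density assumed)."""
    return 1.0 - protein_term / v_m


# Cell edges (Å) as reported in the text
volume = orthorhombic_cell_volume(70.31, 100.90, 135.30)

# Hypothetical construct mass in Da; NOT stated in the text
ASSUMED_MW_DA = 49_600.0

# P2(1)2(1)2(1) has 4 symmetry operators; 2 molecules per asymmetric unit
v_m = matthews_coefficient(volume, ASSUMED_MW_DA, sym_ops=4, mols_per_asu=2)
solvent = solvent_fraction(v_m)
```

With the assumed mass, `v_m` comes out close to the reported 2.42 Å³ Da⁻¹. The solvent estimate depends on the protein density term, so it need not reproduce the reported 48.2% exactly.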
Standalone iSee datapack - contains the enhanced version of this article for use offline. This file can be opened using free software available for download at http://www.molsoft.com/icm_browser.html.
Instructions for installation and use of the required web plugin (to access the online enhanced version of this article).
We would like to thank Ivona Kozieradzki, Peter Loppnau, Lissete Crombet, Angela Mok and Matthieu Schapira for advice and technical assistance. Diffraction data were collected at GM/CA-CAT (NCI Y1-CO-1020, NIGMS Y1-GM-1104) and Structural Biology Center at the Advanced Photon Source. Use of the Advanced Photon Source was supported by the U.S. Department of Energy, Basic Energy Sciences, Office of Science, under contract No. DE-AC02-06CH11357.
Conceived and designed the experiments: JM. Performed the experiments: JE PP MFA AAH AD MAAC FM. Analyzed the data: JE PP MFA AAH MV JM. Contributed reagents/materials/analysis tools: JE PP MFA. Wrote the paper: JE PP MFA.
- 1. Guo Y, Nady N, Qi C, Allali-Hassani A, Zhu H, et al. (2009) Methylation-state-specific recognition of histones by the MBT repeat protein L3MBTL2. Nucleic Acids Res 37: 2204–2210.
- 2. Kouzarides T (2007) Chromatin modifications and their function. Cell 128: 693–705.
- 3. Martin C, Zhang Y (2005) The diverse functions of histone lysine methylation. Nat Rev Mol Cell Biol 6: 838–849.
- 4. Wismar J (2001) Molecular characterization of h-l(3)mbt-like: a new member of the human mbt family. FEBS Lett 507: 119–121.
- 5. Wismar J, Loffler T, Habtemichael N, Vef O, Geissen M, et al. (1995) The Drosophila melanogaster tumor suppressor gene lethal(3)malignant brain tumor encodes a proline-rich protein with a novel zinc finger. Mech Dev 53: 141–154.
- 6. Maurer-Stroh S, Dickens NJ, Hughes-Davies L, Kouzarides T, Eisenhaber F, et al. (2003) The Tudor domain ‘Royal Family’: Tudor, plant Agenet, Chromo, PWWP and MBT domains. Trends Biochem Sci 28: 69–74.
- 7. Kim J, Daniel J, Espejo A, Lake A, Krishna M, et al. (2006) Tudor, MBT and chromo domains gauge the degree of lysine methylation. EMBO Rep 7: 397–403.
- 8. Klymenko T, Papp B, Fischle W, Kocher T, Schelder M, et al. (2006) A Polycomb group protein complex with sequence-specific DNA-binding and selective methyl-lysine-binding activities. Genes Dev 20: 1110–1122.
- 9. Grimm C, de Ayala Alonso AG, Rybin V, Steuerwald U, Ly-Hartig N, et al. (2007) Structural and functional analyses of methyl-lysine binding by the malignant brain tumour repeat protein Sex comb on midleg. EMBO Rep 8: 1031–1037.
- 10. Boccuni P, MacGrogan D, Scandura JM, Nimer SD (2003) The human L(3)MBT polycomb group protein is a transcriptional repressor and interacts physically and functionally with TEL (ETV6). J Biol Chem 278: 15412–15420.
- 11. Trojer P, Li G, Sims RJ 3rd, Vaquero A, Kalakonda N, et al. (2007) L3MBTL1, a histone-methylation-dependent chromatin lock. Cell 129: 915–928.
- 12. Wu S, Trievel RC, Rice JC (2007) Human SFMBT is a transcriptional repressor protein that selectively binds the N-terminal tail of histone H3. FEBS Lett 581: 3289–3296.
- 13. Kalakonda N, Fischle W, Boccuni P, Gurvich N, Hoya-Arias R, et al. (2008) Histone H4 lysine 20 monomethylation promotes transcriptional repression by L3MBTL1. Oncogene.
- 14. Nielsen PR, Nietlispach D, Mott HR, Callaghan J, Bannister A, et al. (2002) Structure of the HP1 chromodomain bound to histone H3 methylated at lysine 9. Nature 416: 103–107.
- 15. Min J, Zhang Y, Xu RM (2003) Structural basis for specific binding of Polycomb chromodomain to histone H3 methylated at Lys 27. Genes Dev 17: 1823–1828.
- 16. Jacobs SA, Khorasanizadeh S (2002) Structure of HP1 chromodomain bound to a lysine 9-methylated histone H3 tail. Science 295: 2080–2083.
- 17. Fischle W, Wang Y, Jacobs SA, Kim Y, Allis CD, et al. (2003) Molecular basis for the discrimination of repressive methyl-lysine marks in histone H3 by Polycomb and HP1 chromodomains. Genes Dev 17: 1870–1881.
- 18. Li H, Fischle W, Wang W, Duncan EM, Liang L, et al. (2007) Structural basis for lower lysine methylation state-specific readout by MBT repeats of L3MBTL1 and an engineered PHD finger. Mol Cell 28: 677–691.
- 19. Adams-Cioaba MA, Min J (2009) Structure and function of histone methylation binding proteins. Biochem Cell Biol 87: 93–105.
- 20. Min J, Allali-Hassani A, Nady N, Qi C, Ouyang H, et al. (2007) L3MBTL1 recognition of mono- and dimethylated histones. Nat Struct Mol Biol 14: 1229–1230.
- 21. Santiveri CM, Lechtenberg BC, Allen MD, Sathyamurthy A, Jaulent AM, et al. (2008) The malignant brain tumor repeats of human SCML2 bind to peptides containing monomethylated lysine. J Mol Biol 382: 1107–1112.
- 22. Lechtenberg BC, Allen MD, Rutherford TJ, Freund SM, Bycroft M (2009) Solution structure of the FCS zinc finger domain of the human polycomb group protein L(3)mbt-like 2. Protein Sci 18: 657–661.
- 23. Nady N, Min J, Kareta MS, Chedin F, Arrowsmith CH (2008) A SPOT on the chromatin landscape? Histone peptide arrays as a tool for epigenetic research. Trends Biochem Sci.
- 24. Schuetz A, Allali-Hassani A, Martin F, Loppnau P, Vedadi M, et al. (2006) Structural basis for molecular recognition and presentation of histone H3 by WDR5. EMBO J 25: 4245–4252.
- 25. Trojer P, Zhang J, Yonezawa M, Schmidt A, Zheng H, et al. (2009) Dynamic Histone H1 Isotype 4 Methylation and Demethylation by Histone Lysine Methyltransferase G9a/KMT1C and the Jumonji Domain-containing JMJD2/KDM4 Proteins. J Biol Chem 284: 8395–8405.
- 26. Perrakis A, Harkiolaki M, Wilson KS, Lamzin VS (2001) ARP/wARP and molecular replacement. Acta Crystallogr D Biol Crystallogr 57: 1445–1450.
- 27. Murshudov GN, Vagin AA, Dodson EJ (1997) Refinement of macromolecular structures by the maximum-likelihood method. Acta Crystallogr D Biol Crystallogr 53: 240–255.
- 28. Emsley P, Cowtan K (2004) Coot: model-building tools for molecular graphics. Acta Crystallogr D Biol Crystallogr 60: 2126–2132.