Unlike canonical pre-mRNAs, animal replication-dependent histone pre-mRNAs lack introns and are processed at the 3’-end by a mechanism distinct from cleavage and polyadenylation. They have a 3’ stem loop and histone downstream element (HDE) that are recognized by stem-loop binding protein (SLBP) and U7 snRNP, respectively. The N-terminal domain (NTD) of Lsm11, a component of U7 snRNP, interacts with FLASH NTD and these two proteins recruit the histone cleavage complex containing the CPSF-73 endonuclease for the cleavage reaction. Here, we determined crystal structures of FLASH NTD and found that it forms a coiled-coil dimer. Using solution light scattering, we characterized the stoichiometry of the FLASH NTD-Lsm11 NTD complex and found that it is a 2:1 heterotrimer, which is supported by observations from analytical ultracentrifugation and crosslinking.
Citation: Aik WS, Lin M-H, Tan D, Tripathy A, Marzluff WF, Dominski Z, et al. (2017) The N-terminal domains of FLASH and Lsm11 form a 2:1 heterotrimer for histone pre-mRNA 3’-end processing. PLoS ONE 12(10): e0186034. https://doi.org/10.1371/journal.pone.0186034
Editor: Yoon Ki Kim, Korea University, REPUBLIC OF KOREA
Received: June 25, 2017; Accepted: September 23, 2017; Published: October 11, 2017
Copyright: © 2017 Aik et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are available from the Protein Data Bank via the following accession numbers: 6ANO, 6AOZ, 6AP0.
Funding: This research is supported by NIH grants R35GM118093 and S10OD012018 (to LT) and GM29832 (to WFM and ZD) and Taiwan MOST grant 105-2320-B-010-012 and 106-2320-B-010-013 (to CYC). This work is based upon research conducted at the Northeastern Collaborative Access Team beamlines (NIH P41 GM103403), using the Pilatus 6M detector (NIH-ORIP HEI grant S10 RR029205).
Competing interests: The authors have declared that no competing interests exist.
In eukaryotic cells, histones play important roles in genomic DNA packaging as well as epigenetic regulation of gene expression. The levels of histone mRNAs are carefully controlled throughout the cell cycle and they dramatically increase during S phase to meet the growing demand for packaging the newly replicated DNA from the replicating genome . In metazoans, histone proteins for packaging of newly synthesized DNA are encoded by the replication-dependent histone genes. They are distinct from the replication-independent histone genes which are expressed constitutively . Unlike canonical pre-mRNAs, replication-dependent histone pre-mRNAs lack introns and undergo 3’-end processing that differs from cleavage coupled to polyadenylation. Histone pre-mRNAs contain two sequence elements essential for their 3’-end processing: a highly-conserved stem-loop structure and a purine-rich histone downstream element (HDE). Cleavage occurs between these two sequence elements and the polyadenylation step is omitted, giving rise to mature histone mRNAs that end with the stem-loop followed by a 4–5 nucleotide tail.
Biochemical studies of the 3’-end processing machinery that cleaves replication-dependent histone pre-mRNAs have shown that it is comprised of the stem-loop binding protein (SLBP), U7 small nuclear ribonucleoprotein (U7 snRNP), FLASH, and the histone pre-mRNA cleavage complex (HCC) [1, 3–6]. SLBP binds the 3’ stem-loop in the pre-mRNA and remains bound after mRNA maturation, and functions in translation [7, 8]. The 3’ stem-loop also recruits the 3’-5’ exoribonuclease 3’hExo [9, 10], which is not essential for processing  but trims the processed histone mRNAs and initiates degradation of histone mRNAs in the cytoplasm . The core U7 snRNP consists of two integral and stably associated components: ~60-nucleotide U7 snRNA and a unique Sm ring, which contains Lsm10 and Lsm11 in place of the spliceosomal SmD1 and SmD2 [13, 14]. The U7 snRNP recognizes the pre-mRNA through base-pairing between the 5’-end of U7 snRNA and the HDE [15, 16]. SLBP bound to the upstream stem-loop stabilizes this interaction, likely by directly or indirectly contacting a subunit(s) of U7 snRNP .
Lsm11 has an extended N-terminal domain (Fig 1A) that is unique among members of the functionally characterized Sm proteins. Through yeast two-hybrid and pull-down studies, this region was found to interact with the N-terminal region of FLASH (Fig 1B) . FLASH, Flice-associated huge protein, was originally discovered as a protein involved in Fas-mediated apoptosis  and later in regulation of expression of several genes, including oncogenes [19, 20]. Subsequent studies showed that FLASH localizes to Histone Locus Bodies in the nucleus, suggesting a role in expression of histone genes , and that it is essential for histone pre-mRNA processing .
(A) Lsm11 N-terminal domain. (B) FLASH N-terminal domain. Alignment was carried out with Clustal Omega [60, 61] and the results displayed with ESPript . Secondary structure for FLASH is based on the structure of human FLASH NTD, while that for Lsm11 is based on Psipred  secondary structure prediction of human Lsm11. Conserved residues are highlighted in red with white fonts, semi-conserved residues in red fonts, and other residues in black fonts. Blue dots indicate residues at the FLASH dimer interface. Gaps are indicated by dotted lines. Species abbreviations: Hs, Homo sapiens (human); Dr, Danio rerio (zebrafish); Dm, Drosophila melanogaster (fruit fly).
Biochemical studies revealed that the interacting N-terminal regions of Lsm11 and FLASH form a docking platform that recruits the HCC to the U7 snRNP . The HCC is composed of a specific subset of proteins that also participate in cleavage and polyadenylation [23, 24], including the endonuclease CPSF-73, CPSF-100, symplekin and CstF-64 [1, 25–27]. Mutational studies on FLASH identified an LDLY motif (residues 55–58 in human FLASH, Fig 1B) as essential for binding the HCC, while residues 100–139 are involved in Lsm11 binding [22, 28].
The molecular details of how FLASH acts as a mediator between Lsm11 and HCC are still unclear. To shed some light on the essential role of FLASH in 3’-end processing of replication-dependent histone pre-mRNA processing , we carried out structural studies on the human FLASH N-terminal domain (NTD) encompassing residues 51–137 using X-ray crystallography. We also performed biophysical studies on the FLASH NTD and the FLASH NTD-Lsm11 NTD complex to characterize their oligomeric states and the stoichiometry of their complex.
FLASH NTD forms a coiled-coil dimer
We determined a structure of the wild-type human FLASH NTD at 2.6 Å resolution using X-ray crystallography (Table 1). The initial phases were obtained by the single anomalous dispersion (SAD) method using crystals of selenomethionyl FLASH NTD. The structure showed that FLASH NTD forms a coiled-coil dimer consisting of two parallel α-helices, one from each protomer (Fig 2A). However, only residues 71 to 137 were observed in this structure, even though the expression construct contained residues 51–137. Residues 51–70, which include the LDLY motif previously shown to be essential for histone pre-mRNA processing  and for binding the HCC , are disordered in this crystal. The first 30 residues of FLASH are poorly conserved among homologs (Fig 1B), although there is substantial conservation from Drosophila to mammals for residues 55–137 in the N-terminal segment.
(A) Two views of the FLASH NTD forming a coiled-coil dimer with respective protomers colored in green and yellow orange. The hexahistidine-tag was observed in protomer 1. Side chains of residues involved in the dimer interface are shown as sticks, while the other side chains are shown as thin sticks. Colors of atoms: red, oxygen; blue, nitrogen; yellow, sulfur/selenium. (B) Hydrophilic interactions (black dashes) formed by Gln100/Asn101 and Glu107/Asn108 respectively. (C) 2Fo–Fc electron density for the Cys83 side chains, contoured at 1σ (blue).
Residues 71–137 observed in the structure form a single α-helix. The length of the coiled-coil FLASH NTD dimer is approximately 100 Å, excluding the C-terminal hexahistidine tag observed for one of the protomers. The protomers do not superimpose perfectly onto each other (r.m.s.d. ~1.5 Å), and one of them appears to adopt a straighter conformation (S1 Fig). For each protomer, the buried surface area is ~1800 Å2 (~25% of its total surface area, calculated using the program PISA ). The majority of the FLASH dimer interface residues are leucines and isoleucines, forming the bulk of the hydrophobic interactions (Fig 2A). The leucines and isoleucines are interspersed with other residues that form either polar or non-polar interactions. Other hydrophobic interactions are formed by bulky residues such as Tyr73, Tyr80, and Phe94, as well as Met87 (selenomethionine in this structure). Hydrophilic interactions include residues Gln100/Asn101 and Glu107/Asn108 near the mid-section of the structure (Fig 2B), and an ion pair between Lys129 and Asp130 at the C-terminal end of the coiled-coil (Fig 2A).
FLASH NTD double cysteine mutant forms a similar dimer
We also observed that Cys83 is situated in the dimer interface with the thiol side chains from the two protomers positioned near one another (Fig 2C). While the electron density did not provide conclusive evidence for the existence of a disulfide bond, and the two sulfur atoms are separated by 3.4 Å distance in the current model, the structure raises the possibility that the observed FLASH NTD dimer might be mediated by a disulfide connection, which is unlikely to occur in the reducing environment in the nucleoplasm, the site of 3’-end processing.
To rule out the possibility that the observed FLASH NTD dimer is a crystallographic artifact caused by oxidized cysteine residues, we determined the structures of the FLASH NTD C54S/C83A double mutant in two different crystal forms at 2.1 and 2.6 Å resolution, respectively (Table 1). In addition to Cys83, we also mutated the other Cys residue in the FLASH NTD, Cys54, in case it formed a disulfide as well even though the residue was disordered in the structure. The two structures adopt the same coiled-coil dimer (Fig 3A and 3B) as the wild-type FLASH NTD (Fig 3C), confirming that FLASH NTD dimer formation does not require the Cys83 disulfide. The individual protomers of these two mutant dimers also show differences, similar to those observed for the wild-type dimer (S1 Fig).
(A) Structure of FLASH NTD C54S/C83A crystal form 1 (resolution 2.1 Å) showing FLASH dimer without the presence of a disulfide bond. (B) Structure of FLASH NTD C54S/C83A crystal form 2 (resolution: 2.6 Å) showing observable residues 52–70 that adopt a helical structure on protomer 1, and are less ordered in protomer 2. The LDLY motif essential for binding the HCC is shown as sticks. (C) Superimposition of the structures of FLASH NTD C54S/C83A crystal forms 1 (cyan) and 2 (gray) with wild-type FLASH NTD dimers (green).
Interestingly, residues 53–70 from one of the protomers in the structure at 2.6 Å resolution are stabilized by crystal packing, showing strong electron density corresponding to an α-helix (Fig 3B). This protomer appears to form a single long α-helix from residues 53 to 137. The other protomer showed weak electron density for residues 56–62, while residues 51–55 and 63–66 are not observed (Fig 3B). These N-terminal residues are disordered in the other two structures (Fig 3C).
Overall, our structural data suggest that FLASH NTD alone forms a stable coiled-coil dimer from residues 71–137 while residues 53–70 can form another helix. It appears that the helix for residues 53–70 does not dimerize, and it may also be structurally independent of residues 71–137, even though a single long helix (residues 53–137) is observed in one crystal form. The LDLY motif (residues 55–58) essential for binding the HCC is situated in the N-terminal helix (Fig 3B). Whether the flexibility of this helix is a feature needed for binding the HCC will need to await further investigation.
FLASH NTD mutations can affect Lsm11 NTD binding but not dimerization
The region of FLASH NTD that interacts with Lsm11 has been previously mapped to residues 100–137 [3, 13, 28]. Because our structural data indicated that this region forms a dimer, we investigated the role of FLASH NTD dimerization in Lsm11 binding. Previous pull-down studies showed that substituting Leu118 and Ile119 with alanines abolished the ability of FLASH to bind Lsm11 . According to our FLASH NTD structures, both Leu118 and Ile119 are situated at the dimer interface (Fig 4A). It was possible that these mutations disrupt FLASH NTD dimerization, thereby affecting Lsm11 binding. To test this possibility, we generated the FLASH NTD L118A/I119A mutant in the background of C54S/C83A mutations and investigated its oligomeric state by analytical gel filtration (Fig 4B). Our results showed that this mutant had a similar migration behavior as the C54S/C83A mutant control, indicating that the deleterious effect of L118A/I119A mutation on binding Lsm11 is not due to the disruption of FLASH NTD dimerization.
(A) Close-up view of Leu118 and Ile119 with 2Fo–Fc electron density of the side chains (sticks) contoured at 1σ (blue). Coordinates and the electron density are derived from structure of FLASH NTD C54S/C83A crystal form 1 at 2.1 Å resolution. (B) Superose-12 analytical gel filtration profiles of FLASH NTD proteins. Peak heights are scaled to an arbitrary unit of 100. (C) SDS-PAGE analysis of co-purification of His-tagged Lsm11 NTD and wild-type and mutant FLASH NTD. All FLASH mutants have additional C54S/C83A mutations. Pellet and supernatant are the insoluble and soluble fractions, respectively, of the cell lysate.
We next investigated the oligomeric states of the mutants Y73A/L76A/Y80A and N101A/L104A/N108A, each containing substitutions of three consecutive residues in the dimer interface (Fig 2A). The first mutated cluster is located closer to the N-terminal end, while the second cluster is located near the middle of the NTD. In addition, since Lys129 and Asp130 form an ion pair in the dimer interface, we also replaced these two charged residues as well as the preceding Arg128 with alanines, generating the R128A/K129A/D130A mutant. Our gel filtration results showed that the Y73A/L76A/Y80A mutant had essentially the same migration behavior as the C54S/C83A control, while the N101A/L104A/N108A and R128A/K129A/D130A mutants actually migrated faster (Fig 4B). As a control, we made the K88A/K92A/K95A mutant, changing three residues located outside of the dimer interface. As expected, the migration behavior of this mutant was nearly the same as that for the C54S/C83A protein (Fig 4B). Our analytical ultracentrifugation studies on the N101A/L104A/N108A mutant suggested that it might be trimeric in solution (Table 2, see below), suggesting that the mutation has perturbed the structure of the NTD. Therefore, the N101A/L104A/N108A and R128A/K129A/D130A mutants will not be described further.
While the mutations were not able to disrupt the FLASH NTD dimer, we tested whether they affect the interactions with Lsm11 NTD. For these experiments, we co-expressed His-tagged Lsm11 NTD with un-tagged FLASH NTD in E. coli and monitored whether FLASH NTD could be co-purified by the nickel-NTA agarose beads. The results showed that the Y73A/L76A/Y80A and K88A/K92A/K95A mutants still interacted with Lsm11 NTD, while the L118A/I119A mutant could no longer interact with Lsm11 NTD (Fig 4C), consistent with earlier data . These experiments further demonstrated that the loss of binding between the FLASH NTD mutant and Lsm11 NTD is not linked to the dissociation of FLASH dimer.
FLASH NTD-Lsm11 NTD complex is a 2:1 heterotrimer
Given the ability of the FLASH NTD to dimerize, we next characterized the stoichiometry of the FLASH-Lsm11 complex. We co-expressed FLASH NTD C54S/C83A double mutant and His-tagged Lsm11 NTD (residues 23–130) in E. coli and purified their complex, demonstrating a stable interaction between the two proteins (S2 Fig). Extensive efforts at producing diffraction quality crystals of the FLASH NTD-Lsm11 NTD complex have so far been unsuccessful. To obtain estimates for the molar masses of the FLASH NTD-Lsm11 NTD complex as well as the FLASH NTD C54S/C83A mutant alone, we performed size exclusion chromatography multi-angle light scattering (SEC-MALS) experiments using buffers with high (500 mM) and low (250 mM) NaCl concentrations (Fig 5, S3 Fig). At high-salt concentration, both the FLASH NTD-Lsm11 NTD complex and FLASH NTD alone eluted in single peaks. However, the FLASH NTD-Lsm11 NTD complex peak had a trailing edge, suggesting some dissociation of the complex during chromatography. The weight-averaged molar masses of the samples eluting in the peaks are 31 kDa for the FLASH NTD-Lsm11 NTD complex (with a Stokes radius of 3.9 nm) and 21 kDa for FLASH NTD C54S/C83A mutant alone (with a Stokes radius of 3.4 nm). The molar mass of the FLASH NTD-Lsm11 NTD complex decreased gradually from 34.4 kDa (leading edge of peak) to 26.0 kDa (trailing edge of peak). For FLASH NTD C54S/C83A, the molar mass decreased slightly from 21.7 kDa (leading edge) to 20.8 kDa (trailing edge), indicating that it formed a stable dimer.
SEC-MALS traces (with superimposed calculated molar mass traces) of FLASH NTD-Lsm11 NTD complex at 500 mM NaCl (blue), FLASH NTD C54S/C83A at 500 mM NaCl (purple), FLASH NTD-Lsm11 NTD complex at 250 mM NaCl (red), and FLASH NTD C54S/C83A at 250 mM NaCl (green). Left axis, molar mass; right axis, light scattering signal; bottom axis, elution volume.
In the low-salt buffer, the positions of the peaks for both the FLASH NTD-Lsm11 NTD complex and FLASH NTD C54S/C83A mutant were slightly shifted to the left, suggesting a more extended structure for both. A small amount of higher order structures was present for the FLASH NTD-Lsm11 NTD complex suggesting the formation of aggregates in low-salt buffer condition. As in the high-salt buffer, the FLASH NTD-Lsm11 NTD complex peak had a trailing edge. The weight-averaged molar masses of the samples eluting in the peaks are 34.3 kDa for FLASH NTD-Lsm11 NTD complex (with a Stokes radius of 4.3 nm) and 21.8 kDa for FLASH NTD C54S/C83A (with a Stokes radius of 3.5 nm). Due to the presence of higher order structures, the molar mass of the FLASH NTD-Lsm11 NTD complex decreased gradually from 55 kDa and higher (leading edge of peak) to 26 kDa (trailing edge of peak). The FLASH NTD C54S/C83A molar mass decreased from 23.5 kDa (leading edge) to 19 kDa (trailing edge). Overall, the polydispersity of the FLASH NTD-Lsm11 NTD complex was slightly higher in low-salt buffer.
Based on the calculated molecular weights for Lsm11 NTD and FLASH NTD, the results from SEC-MALS showed that the FLASH NTD-Lsm11 NTD complex is a heterotrimer consisting of 2 molecules (a dimer) of FLASH and 1 molecule of Lsm11, while FLASH NTD C54S/C83A is a dimer (Table 2).
Analytical ultracentrifugation studies
We also performed analytical ultracentrifugation (AUC) sedimentation velocity and sedimentation equilibrium experiments on the FLASH NTD-Lsm11 NTD complex, as well as on wild-type FLASH NTD, FLASH NTD C54S/C83A double mutant, and Lsm11 NTD alone as controls in the low-salt condition (Fig 6, S4 Fig). Our AUC results showed that wild-type FLASH NTD and the FLASH NTD C54S/C83A double mutant have similar sedimentation coefficients of 1.61 and 1.59 and frictional ratios of 1.81 and 1.86, consistent with them forming dimers in solution (Table 3). Both proteins have Stokes radius of 3.2–3.4 nm, while the long-axis radius is about 6.5 nm, based on the crystal structures. The Kd values for the wild-type and C54S/C83A NTD dimers are 0.18 and 0.05 μM, respectively (Table 3). Wild-type FLASH NTD showed serious aggregation below the concentration of 0.1 mg/ml, where some of the proteins apparently dissociated into a monomeric but more elongated form (sedimentation coefficients of 0.89, frictional ratio of 3.09 and Stokes radius of 5.5 nm). Lsm11 NTD alone showed a sedimentation coefficient of 1.22 with a frictional ratio of 1.81, suggesting that it exists as a monomer in solution.
(A) Typical traces of absorbance at 280 nm of the protein in 20 mM Tris (pH 7.5) buffer during the sedimentation velocity experiment. The protein concentration was 1 mg/ml. For clarity, only every fifth scan is shown. The symbols represent experimental data and the lines are the results obtained after being fitted to the Lamm equation using the SEDFIT program . (B-E) Continuous c(s) distribution of FLASH NTD wild-type, C54S/C83A mutant, FLASH NTD-Lsm11 NTD complex and Lsm11 NTD. The distributions of the proteins at concentrations of 1 mg/ml (B-D) and 0.8 mg/ml (E) are shown by solid lines and those at concentrations of 0.1 mg/ml (B) and 0.05 mg/ml (D) are shown by dashed lines and that at 8 mg/ml (D) are showed by dotted line. The y-axis on the right is for the protein at a concentration of 0.1 mg/ml (B) and 8 mg/ml (D). The vertical dashed lines on the left and right indicate the monomer position of Lsm11 NTD and the dimer position of FLASH NTD, respectively. The residual bitmap of the raw data and the best-fit results are shown in the insets. The data are summarized in Table 2.
The FLASH NTD-Lsm11 NTD complex showed a shift in its sedimentation coefficient value from 1.36 to 3.2 with increasing concentration of the complex (Table 3, Fig 6D). This observation indicates that the dissociation and association of the complex is a rapid event  and the exact molar mass of the complex cannot be accurately estimated. The broad range of observed molecular weight by AUC (17.8 to 49.7 kDa, Table 3) is similar to that seen in the SEC-MALS experiment. Nevertheless, we were able to obtain a Kd value of 2.4 μM for the complex (Table 3).
We used glutaraldehyde to crosslink the FLASH NTD-Lsm11 NTD complex, FLASH NTD C54S/C83A, and FLASH NTD and analyzed it on SDS-PAGE (S5 Fig). We observed the strong presence of dimers for both FLASH wild-type and C54S/C83A double mutant and weaker presence of higher oligomers (trimer, tetramer etc.). The higher oligomers became less apparent at lower concentration (0.01 mg/mL) of the protein, suggesting that they are probably due to random collisions of monomer/dimer in solution. Dimer and trimer species were also observed for the FLASH NTD-Lsm11 NTD complex but Lsm11 NTD did not appear to be substantially crosslinked to FLASH or to itself, possibly due to the fact that it has only one lysine residue. Therefore, we conclude that the dimer and trimer species for the complex were probably crosslinked FLASH, as observed for the FLASH alone samples, and the cross-linking experiments by themselves did not provide conclusive information about the stoichiometry of FLASH NTD-Lsm11 NTD complex.
Human FLASH is a protein of 220 kDa that has been implicated in a broad spectrum of cellular processes. In spite of these diverse and important functions, the structural organization of FLASH remains largely unknown. Although FLASH consists of nearly 2,000 amino acid residues, the functions of only three small regions of the protein are understood: a 100 residue segment in the N-terminus required for histone pre-mRNA processing, the C-terminal segment which forms a SANT/Myb-like domain, interacts with the C-terminal region of NPAT, and is required for localization to the histone locus body ; and a small central region which binds Ars2  is essential for cell cycle progression.
Our crystallographic studies of FLASH NTD demonstrate that residues 71–137 adopt a continuous and stable α-helical fold and mediate the formation of a coiled-coil dimer between two FLASH molecules. This α-helical fold might also extend to residues 53–70, encompassing the LDLY motif, but this region is unlikely to contribute to the dimerization interface. Our data are consistent with recent H/D exchange studies , which showed that residues 75–136 underwent slow H/D exchange, indicative of extensive secondary structure in this region. Residues 58–62 exchanged significantly faster than the 75–136 region but slower than the directly surrounding sequences, suggesting the presence of a more dynamic secondary structure in the vicinity of the LDLY motif .
That amino acids in the N-terminal region of FLASH may fold into a coiled-coil domain was first predicted by bioinformatics . In addition, biochemical studies demonstrated that ectopically-expressed FLASH can self-associate in tissue culture cells and that this self-association requires the N-terminal 200 residues . These data, in conjunction with our current crystallographic study, strongly support the notion that the N-terminal domain of FLASH exists in solution as a coiled-coil dimer. We changed up to three consecutive residues in the dimer interface but failed to convert FLASH into monomers. The dimer interface of the FLASH NTD is extensive and local structural disturbances, such as the three consecutive residues that we mutated, are insufficient to prevent dimerization. Interestingly, the L118A/I119A mutation in the interface of the coiled-coil dimer failed to disrupt FLASH dimerization but was sufficient to abolish the ability of FLASH to interact with Lsm11.
The N-terminal α-helical region that mediates FLASH dimerization overlaps substantially with the core Lsm11 binding site in FLASH mapped to amino acids 100–140, prompting the hypothesis that Lsm11 may interact with a FLASH dimer. SEC-MALS experiments on the complex provided strong evidence that the FLASH NTD-Lsm11 NTD complex is a 2:1 heterotrimer. While our AUC data confirmed that FLASH is a dimer, the stoichiometry of FLASH and Lsm11 in the FLASH NTD-Lsm11 NTD complex was less clear, likely due to dissociation of Lsm11 NTD from the FLASH NTD dimer during prolonged ultracentrifugation. Some dissociation of the complex was observed during the short time scale of the SEC-MALS experiment. That Lsm11 interacts with a FLASH dimer is also consistent with the data from H/D exchange experiments. While the region between amino acids 100–120 showed the slowest H/D exchange within the entire FLASH NTD (which we show here can form a dimer), this region underwent slower H/D exchange in the presence of Lsm11 and the reduced rate of exchange extended to amino acid 130 in FLASH . Since H/D exchange occurs when hydrogen bonds are temporarily destabilized, this region of FLASH (residues 100–130) is in a more stable structure in the heterotrimer than in the homodimer.
Additional studies are required to determine the structure of the FLASH-Lsm11 heterotrimer and identify potential mechanisms that may regulate the binding of a FLASH dimer to Lsm11 to form the FLASH-Lsm11 heterotrimer. In animal cells, components of the transcription and 3’-end processing machinery are localized in Histone Locus Bodies (HLBs), nuclear domains that assemble at histone gene loci and are present throughout the cell cycle. Strikingly, histone gene expression is repressed during G1 phase and becomes activated only with the onset of S phase and DNA replication in response to cell cycle signals, including cyclin E/CDK2-mediated phosphorylation of NPAT, a universal coactivator of histone gene expression [35–39]. A growing body of evidence suggests that FLASH is targeted to HLBs as a separate entity rather than a subunit of the U7 snRNP. For example, mutations in either Lsm11 or FLASH that disrupt binding between their N-terminal domains do not affect localization of either FLASH or U7 snRNP to the Drosophila HLB, but abolishes processing in vivo [40, 41].
Our findings suggest that N-terminus of FLASH may be present as a homodimer throughout the cell cycle. In recent studies , we have found that a second region of Lsm11 interacts with the C-terminal region of FLASH, the same region of FLASH that binds to NPAT . This interaction strengthens the overall binding between FLASH and Lsm11 and could be part of an extensive reorganization of the factors in the HLB to activate histone gene expression as a result of phosphorylation of NPAT by cyclin E/CDK2.
Materials and methods
Protein expression and purification of FLASH NTD
C-terminally hexahistidine-tagged FLASH NTD (residues 51–137) and FLASH NTD C54S/C93A mutant constructs were cloned into pET26b vector and over-expressed in Escherichia coli BL21 Star (DE3) strains (Novagen). The cells were induced using 0.4 mM isopropyl β-D-1-thiogalactopyranoside and grown for 18 h at 20°C. The cells were harvested by centrifugation and the pellets were re-suspended in lysis buffer (20 mM Tris (pH 7.5), 500 mM NaCl, 10 mM imidazole, 5% (v/v) glycerol, 17.8 μg/mL phenylmethane sulfonyl fluoride (PMSF) and 10 mM β-mercaptoethanol) and lysed by sonication. Cell lysates were then centrifuged at 25,000 x g for 40 min at 4°C. The supernatant was incubated with nickel beads for 1 h before being loaded onto a gravity flow column (Bio-Rad). The nickel beads were washed with buffer containing 20 mM Tris (pH 7.5), 500 mM NaCl, 40 mM imidazole, and 10 mM β-mercaptoethanol. The proteins were eluted with 20 mM Tris (pH 7.5), 500 mM NaCl, 500 mM imidazole and 10 mM β-mercaptoethanol. The eluted proteins were further purified using size-exclusion chromatography (Sephacryl S-300; GE Healthcare) with a buffer containing 20 mM Tris (pH 8.5), 250 mM NaCl, and 5 mM dithiothreitol (DTT). Relevant fractions from size-exclusion chromatography were pooled and the proteins were concentrated to 9.4 mg/mL (wild-type) and 11 mg/mL (mutant), and stored at -80°C.
The selenomethionyl FLASH NTD protein was prepared using the Escherichia coli B834 methionine-auxotroph strain grown in the LeMaster media supplemented with selenomethionine . The protein was purified using the same protocol as the native protein, concentrated to 10 mg/mL and stored at –80°C.
Selenomethionyl FLASH NTD and native FLASH NTD C54S/C83A mutant were crystallized in a sitting drop by vapor diffusion. The sitting drops were set up by mixing 1 μL of 4 mg/mL selenomethionyl FLASH NTD or 5 mg/mL FLASH NTD C54S/C83A mutant protein with 1 μL of well solution. The well solution for selenomethionyl FLASH NTD crystals contained 100 mM Tris (pH 8.0) and 18% (w/v) PEG 4000; for FLASH NTD C54S/C83A crystal form 1, 4% (w/v) tacsimate pH 7.0, 11% (w/v) PEG 3350; and for FLASH NTD C54S/C83A crystal form 2, 100 mM sodium formate, 15% (w/v) PEG 3350, 3% (v/v) 1,6-hexanediol. The crystals were harvested and soaked in mother liquor supplemented with 15% (v/v) (selenomethionyl FLASH NTD) or 20% (v/v) (FLASH NTD C54S/C83A) ethylene glycol as cryoprotectant before being flash frozen in liquid nitrogen.
Data collection and structure determination
Initial X-ray diffraction data for selenomethionyl FLASH NTD were collected at APS beamline 24 ID-C with wavelength 0.9792 Å. Three datasets from three different crystals were processed using XDS and merged with XSCALE [43, 44]. The structure was solved by SAD using ShelxCDE  and the model built manually with the program Coot . The final structure was then refined using a higher resolution dataset (2.6 Å) collected at ALS beamline 501 (wavelength: 0.9774 Å) and processed with XDS. Data for the structures of FLASH NTD C54S/C83A were collected using single crystals at APS beamline 24-ID-E (0.9792 Å wavelength for both). The datasets were processed using HKL2000  (FLASH C54S/C83A crystal form 1) and XDS (FLASH C54S/C83A crystal form 2). Both structures were solved by molecular replacement using Phaser  with the selenomethionyl FLASH NTD structure as the search model. All three structures were refined using Phenix .
Expression and purification of FLASH NTD-Lsm11 NTD complex and Lsm11 NTD
N-terminally hexahistidine-tagged Lsm11 NTD (residues 23–130) construct was cloned into pET28a vector. FLASH NTD C54S/C83A mutant construct (without tag) was cloned into MCS2 of pCDF Duet vector. Both plasmids were co-transformed into E. coli BL21 (DE3) Star and the genes were co-expressed. The proteins were purified using the same protocol as for the FLASH NTD proteins with the exception of using 20 mM Tris (pH 7.5), 500 mM NaCl, and 5 mM DTT as the size exclusion chromatography buffer. Relevant fractions corresponding to Lsm11 NTD-FLASH NTD C54S/C83A complex and excess Lsm11 NTD alone were pooled separately and concentrated to 5.3 mg/mL and 2.4 mg/mL respectively.
Generation of mutant FLASH NTD constructs
Mutant FLASH NTD constructs were generated using site-directed mutagenesis PCR. Primers (see S1 Table) designed to mutate designated residues were used in PCR reactions to amplify plasmid templates encoding for wild type or mutant FLASH NTD (see S1 Table for templates used). 25 cycles of thermal cycling (98°C for melting, 55°C for annealing, 72°C for elongation) were performed using Phusion polymerase. PCR products were then digested with DpnI for 1 h at 37°C before being transformed into E. coli DH5α. Mutant constructs were confirmed by DNA sequencing.
Analytical gel filtration of FLASH NTD mutants
The C-terminally hexahistidine-tagged FLASH NTD mutant constructs were cloned into pET26b vector and expressed in E. coli BL21 Star (DE3) cells. The mutant proteins were purified by nickel affinity column as detailed in the earlier section. The eluted protein from nickel affinity purification was then injected into Superose 12 analytical gel filtration column, pre-equilibrated with 20 mM Tris (pH 7.5), 250 mM NaCl, and 5 mM DTT. Analytical gel filtration was performed with a flow rate of 0.5 mL/min using 20 mM Tris (pH 7.5), 250 mM NaCl and 5 mM DTT as buffer.
Co-purification of FLASH NTD mutants with His-tagged Lsm11 NTD
N-terminally hexahistidine-tagged Lsm11 NTD (residues 23–130) construct was cloned into pET28a vector. FLASH NTD mutant constructs (without tag) were cloned into MCS2 of pCDFDuet vector. Both plasmids were co-transformed into E. coli BL21 Star (DE3) and the genes were co-expressed in 5 mL LB media. The expressed proteins were co-purified with 15 μL of Ni-NTA agarose beads using the same buffers that were used for large-scale purification of FLASH NTD-Lsm11 NTD complex, and analyzed using SDS-PAGE.
The AUC experiments were performed on a XL-A analytical ultracentrifuge (Beckman Coulter) using an An-50 Ti rotor [50–56]. The sedimentation velocity experiments were performed using a double-sector epon charcoal-filled centerpiece at 20°C with a rotor speed of 42,000 rpm. Protein solutions of 0.05 to 1 mg/ml (330 μl) in a buffer containing 20 mM Tris (pH 8.5), 250 mM NaCl, and 5 mM DTT (low-salt condition) and reference (370 μl) solutions were loaded into the centerpiece, respectively. The absorbance at 280 nm was monitored in a continuous model with a time interval of 300 s and a step size of 0.003 cm. Multiple scans at different time intervals were then fitted to a continuous c(s) distribution model using the SEDFIT program . Additionally, the results with the various different protein concentrations were globally fitted to monomer-dimer self-association or A + B <—> AB hetero-association model using the SEDPHAT program to calculate the dissociation constant (Kd) .
To determine the precise molecular weight of the protein, the sedimentation equilibrium experiment was performed . Three different samples (0.10–0.12 ml) were loaded into the sample channels of six-channel epon charcoal-filled centerpieces, and 0.11–0.13 ml buffers were loaded into the reference channels. The cells were then loaded into the rotor and run at speed of 10,000, 15,000, and 25,000 rpm each for 12 h at 20°C. Ten A280 nm scans with time interval of 8–10 min were measured for every different rotor speed to check the status of sedimentation equilibrium. Global analyses of combined sedimentation equilibrium and sedimentation velocity data were conducted with SEDPHAT using species analysis model .
Size exclusion chromatography multi-angle light scattering (SEC-MALS)
FLASH NTD C54S/C83A-Lsm11 NTD complex and FLASH NTD C54S/C83A were loaded sequentially onto a Superdex 200 size exclusion column (24 mL) pre-equilibrated with 20 mM Tris pH 7.5, 500 mM NaCl, 5 mM DTT (high salt buffer) or 20 mM Tris pH 7.5, 250 mM NaCl, and 5 mM DTT (low salt buffer). The eluted samples first passed through a Wyatt multi-angle light scattering system (DAWN HELEOS-II) and then a Wyatt Trex refractometer. The data were analyzed using ASTRA version 6 software (Wyatt Technology, Santa Barbara, CA). The monomer peak of 3 mg/ml BSA was used for normalization, delay time determination, and band broadening correction using ASTRA.
Glutaraldehyde crosslinking assay
Cross-linking reactions were carried out in 20 mM HEPES (pH 7.5), 500 mM NaCl. A final concentration of 0.1% (w/v) of glutaraldehyde was added to 0.1 mg/mL (total volume ~100 μL) and 0.01 mg/mL (total volume ~ 1 mL) of FLASH NTD C54S/C83A-Lsm11 NTD complex, FLASH NTD C54S/C83A, and FLASH NTD wildtype. Controls with protein concentrations of 0.1 mg/mL without glutaraldehyde were set up for each sample type. All samples were incubated at 37°C for 3 min then chilled on ice. A final concentration of about 100 mM of Tris pH 8.0 was added into each sample to quench the cross-linking reaction. All samples were concentrated to a volume ~20 μL using Sartorius Vivaspin® 500 centrifugal concentrators with a molecular weight cut off of 10 kDa and finally analyzed by SDS-PAGE.
S1 Fig. Superimposition of all protomers from wildtype FLASH NTD (yellow orange and orange), FLASH C54S/C83A crystal form 1 (Mutant 1; cyan and teal), and FLASH C54S/C83A crystal form 2 (Mutant 2; gray and black).
Protomers were superimposed using residues 71–100 from wildtype FLASH NTD protomer 1 as reference coordinates.
S2 Fig. Sephacryl-300 (S-300) and SDS-PAGE analysis of purification of the Lsm11 NTD/FLASH C54S/C83A NTD complex.
A) Sephacryl-300 gel filtration profile shows two peaks: peak 1 corresponds to the Lsm11 NTD/FLASH NTD C54S/C83A complex, peak 2 corresponds to excess Lsm11 NTD. Lsm11 contains the N-terminal hexa-histidine tag. B) SDS-PAGE analysis of nickel affinity eluate (labeled Ni/E), and corresponding fractions from S-300 chromatography. A molecular weight marker is situated on the far-left lane.
S3 Fig. Additional data from SEC-MALS experiments.
Light scattering (solid), refractive index (dotted), and MW information for (A) FLASH NTD in high salt buffer; (B) FLASH NTD in low salt buffer; (C) FLASH NTD-Lsm11 NTD in high salt buffer; and (D) FLAST NTD-Lsm11 NTD in low salt buffer.
S4 Fig. Global analysis of FLASH-Lsm11 proteins at 8 mg/ml by AUC.
The speed of centrifugation for sedimentation equilibrium experiment (A) was 10,000 rpm (squares), 15,000 rpm (circles), and 25,000 rpm (triangles) at 20°C each for 14 h. The velocity experiment (B) was 42,000 rpm (circles) at 20°C for 6 h. The solid lines in two panels are the best fit results from global analysis of the two discrete species models by SEDPHAT (57). The residuals of each fit are shown below the panels. The calculated sedimentation coefficients and Mr from the best fit results are shown in Table 2.
S5 Fig. SDS-PAGE analysis of glutaraldehyde cross-linking of FLASH NTD C54S/C83A-Lsm11 NTD, FLASH NTD C54S/C83A, and FLASH NTD wild-type.
In the FLASH NTD-Lsm11 NTD complex, FLASH NTD is the lower band (as it lacks a His tag compared to FLASH NTD alone), and Lsm11 NTD is the upper band. While the FLASH NTD band disappeared in the presence of glutaraldehyde, the Lsm11 NTD band mostly stayed the same. Therefore, probably very small amount of Lsm11 NTD (if any) got crosslinked in the reaction, consistent with the fact that it has only 1 Lys residue.
We thank Aleksandra Skrajna for helpful discussions and comments on the manuscript; S. Banerjee, K. Perry, R. Rajashankar, J. Schuermann, N. Sukumar for access to NE-CAT 24-ID-C and 24-ID-E beamlines at the Advanced Photon Source; and M. Allaire, N. Smith, and S. Ortega for access to Beamline 5.0.1 at the Advanced Light Source; and J. Seetharaman for assistance in X-ray data collection. This research is supported by NIH grants R35GM118093 and S10OD012018 (to LT) and GM29832 (to WFM and ZD) and Taiwan MOST grant 105-2320-B-010-012 and 106-2320-B-010-013 (to CYC). This work is based upon research conducted at the Northeastern Collaborative Access Team beamlines (NIH P41 GM103403), using the Pilatus 6M detector (NIH-ORIP HEI grant S10 RR029205). This research used resources of the Advanced Photon Source, a U.S. Department of Energy (DOE) Office of Science User Facility operated by Argonne National Laboratory under Contract No. DE-AC02-06CH11357. The Berkeley Center for Structural Biology is supported in part by NIH, NIGMS, and HHMI. The Advanced Light Source is supported by the U.S. Department of Energy under Contract No. DE-AC02-05CH11231.
- 1. Marzluff WF, Wagner EJ, Duronio RJ. Metabolism and regulation of canonical histone mRNAs: life without a poly(A) tail. Nat Rev Genet. 2008;9:843–54. pmid:18927579
- 2. Marzluff WF. Metazoan replication-dependent histone mRNAs: a distinct set of RNA polymerase II transcripts. Curr Opin Cell Biol. 2005;17:274–80. pmid:15901497
- 3. Yang XC, Burch BD, Yan Y, Marzluff WF, Dominski Z. FLASH, a proapoptotic protein involved in activation of caspase-8, is essential for 3' end processing of histone pre-mRNAs. Mol Cell. 2009;36:267–78. pmid:19854135
- 4. Dominski Z, Carpousis AJ, Clouet-d'Orval B. Emergence of the beta-CASP ribonucleases: highly conserved and ubiquitous metallo-enzymes involved in messenger RNA maturation and degradation. Biochim Biophys Acta. 2013;1829:532–51. pmid:23403287
- 5. Azzouz TN, Gruber A, Schumperli D. U7 snRNP-specific Lsm11 protein: dual binding contacts with the 100 kDa zinc finger processing factor (ZFP100) and a ZFP100-independent function in histone RNA 3' end processing. Nucleic Acids Res. 2005;33:2106–17. pmid:15824063
- 6. Dominski Z, Erkmann JA, Yang XC, Sanchez R, Marzluff WF. A novel zinc finger protein is associated with U7 snRNP an interacts with the stem-loop binding protein in the histone pre-mRNP to stimulate 3 '-end processing. Genes Dev. 2002;16:58–71. pmid:11782445
- 7. Battle DJ, Doudna JA. The stem-loop binding protein forms a highly stable and specific complex with the 3' stem-loop of histone mRNAs. RNA. 2001;7:123–32. pmid:11214174
- 8. Wang ZF, Whitfield ML, Ingledue TC 3rd, Dominski Z, Marzluff WF. The protein that binds the 3' end of histone mRNA: a novel RNA-binding protein required for histone pre-mRNA processing. Genes Dev. 1996;10:3028–40. pmid:8957003
- 9. Dominski Z, Yang XC, Kaygun H, Dadlez M, Marzluff WF. A 3' exonuclease that specifically interacts with the 3' end of histone mRNA. Mol Cell. 2003;12:295–305. pmid:14536070
- 10. Tan D, Marzluff WF, Dominski Z, Tong L. Structure of histone mRNA stem-loop, human stem-loop binding protein, and 3'hExo ternary complex. Science. 2013;339:318–21. pmid:23329046
- 11. Yang XC, Torres MP, Marzluff WF, Dominski Z. Three Proteins of the U7-Specific Sm Ring Function as the Molecular Ruler To Determine the Site of 3 '-End Processing in Mammalian Histone Pre-mRNA. Mol Cell Biol. 2009;29:4045–56. pmid:19470752
- 12. Hoefig KP, Rath N, Heinz GA, Wolf C, Dameris J, Schepers A, et al. Eri1 degrades the stem-loop of oligouridylated histone mRNAs to induce replication-dependent decay. Nat Struct Mol Biol. 2013;20:73–81. pmid:23202588
- 13. Pillai RS, Grimmler M, Meister G, Will CL, Luhrmann R, Fischer U, et al. Unique Sm core structure of U7 snRNPs: assembly by a specialized SMN complex and the role of a new component, Lsm11, in histone RNA processing. Genes Dev. 2003;17:2321–33. pmid:12975319
- 14. Pillai RS, Will CL, Luhrmann R, Schumperli D, Muller B. Purified U7 snRNPs lack the Sm proteins D1 and D2 but contain Lsm10, a new 14 kDa Sm D1-like protein. EMBO J. 2001;20:5470–9. pmid:11574479
- 15. Mowry KL, Steitz JA. Identification of the human U7 snRNP as one of several factors involved in the 3' end maturation of histone premessenger RNA's. Science. 1987;238:1682–7. pmid:2825355
- 16. Strub K, Birnstiel ML. Genetic complementation in the Xenopus oocyte: co-expression of sea urchin histone and U7 RNAs restores 3' processing of H3 pre-mRNA in the oocyte. EMBO J. 1986;5:1675–82. pmid:2943587
- 17. Skrajna A, Yang XC, Bucholc K, Zhang J, Hall TM, Dadlez M, et al. U7 snRNP is recruited to histone pre-mRNA in a FLASH-dependent manner by two separate regions of the Stem-Loop Binding Protein. RNA. 2017;23:938–951. pmid:28289156
- 18. Imai Y, Kimura T, Murakami A, Yajima N, Sakamaki K, Yonehara S. The CED-4-homologous protein FLASH is involved in Fas-mediated activation of caspase-8 during apoptosis. Nature. 1999;398:777–85. pmid:10235259
- 19. Alm-Kristiansen AH, Saether T, Matre V, Gilfillan S, Dahle O, Gabrielsen OS. FLASH acts as a co-activator of the transcription factor c-Myb and localizes to active RNA polymerase II foci. Oncogene. 2008;27:4644–56. pmid:18408764
- 20. Krieghoff E, Milovic-Holm K, Hofmann TG. FLASH meets nuclear bodies: CD95 receptor signals via a nuclear pathway. Cell Cycle. 2007;6:771–5. pmid:17377497
- 21. Barcaroli D, Bongiorno-Borbone L, Terrinoni A, Hofmann TG, Rossi M, Knight RA, et al. FLASH is required for histone transcription and S-phase progression. P Natl Acad Sci USA. 2006;103:14808–12.
- 22. Yang XC, Sabath I, Debski J, Kaus-Drobek M, Dadlez M, Marzluff WF, et al. A complex containing the CPSF73 endonuclease and other polyadenylation factors associates with U7 snRNP and is recruited to histone pre-mRNA for 3'-end processing. Mol Cell Biol. 2013;33:28–37. pmid:23071092
- 23. Shi Y, Di Giammartino DC, Taylor D, Sarkeshik A, Rice WJ, Yates JR 3rd, et al. Molecular architecture of the human pre-mRNA 3' processing complex. Mol Cell. 2009;33:365–76. pmid:19217410
- 24. Takagaki Y, Ryner LC, Manley JL. Four factors are required for 3'-end cleavage of pre-mRNAs. Genes Dev. 1989;3:1711–24. pmid:2558045
- 25. Dominski Z, Yang XC, Marzluff WF. The polyadenylation factor CPSF-73 is involved in histone-pre-mRNA processing. Cell. 2005;123:37–48. pmid:16213211
- 26. Kolev NG, Steitz JA. Symplekin and multiple other polyadenylation factors participate in 3'-end maturation of histone mRNAs. Genes Dev. 2005;19:2583–92. pmid:16230528
- 27. Wagner EJ, Burch BD, Godfrey AC, Salzler HR, Duronio RJ, Marzluff WF. A genome-wide RNA interference screen reveals that variant histones are necessary for replication-dependent histone pre-mRNA processing. Mol Cell. 2007;28:692–9. pmid:18042462
- 28. Yang Xc, Xu B, Sabath I, Kunduru L, Burch BD, Marzluff WF, et al. FLASH Is Required for the Endonucleolytic Cleavage of Histone Pre-mRNAs but Is Dispensable for the 5' Exonucleolytic Degradation of the Downstream Cleavage Product. Mol Cell Biol. 2011;31:1492–502. pmid:21245389
- 29. Krissinel E, Henrick K. Inference of macromolecular assemblies from crystalline state. J Mol Biol. 2007;372:774–97. pmid:17681537
- 30. Cheng SC, Chang GG, Chou CY. Mutation of Glu-166 blocks the substrate-induced dimerization of SARS coronavirus main protease. Biophys J. 2010;98:1327–36. pmid:20371333
- 31. Yang X-c, Sabath I, Kunduru L, van Wijnen AJ, Marzluff WF, Dominski Z. A Conserved Interaction That Is Essential for the Biogenesis of Histone Locus Bodies. J Biol Chem. 2014;289:33767–82. pmid:25339177
- 32. Kiriyama M, Kobayashi Y, Saito M, Ishikawa F, Yonehara S. Interaction of FLASH with Arsenite Resistance Protein 2 Is Involved in Cell Cycle Progression at S Phase. Molecular and Cellular Biology. 2009;29:4729–41. pmid:19546234
- 33. Skrajna A, Yang XC, Tarnowski K, Fituch K, Marzluff WF, Dominski Z, et al. Mapping the Interaction Network of Key Proteins Involved in Histone mRNA Generation: A Hydrogen/Deuterium Exchange Study. J Mol Biol. 2016;428:1180–96. pmid:26860583
- 34. Koonin EV, Aravind L, Hofmann K, Tschopp J, Dixit VM. Apoptosis. Searching for FLASH domains. Nature. 1999;401:662; discussion -3. pmid:10537104
- 35. Bongiorno-Borbone L, De Cola A, Vernole P, Finos L, Barcaroli D, Knight RA, et al. FLASH and NPAT positive but not Coilin positive Cajal Bodies correlate with cell ploidy. Cell Cycle. 2008;7:2357–67. pmid:18677100
- 36. Ghule PN, Dominski Z, Yang XC, Marzluff WF, Becker KA, Harper JW, et al. Staged assembly of histone gene expression machinery at subnuclear foci in the abbreviated cell cycle of human embryonic stem cells. Proceedings of the National Academy of Sciences of the United States of America. 2008;105:16964–9. pmid:18957539
- 37. Ma T, Van Tine BA, Wei Y, Garrett MD, Nelson D, Adams PD, et al. Cell cycle-regulated phosphorylation of p220(NPAT) by cyclin E/Cdk2 in Cajal bodies promotes histone gene transcription. Genes Dev. 2000;14:2298–313. pmid:10995387
- 38. Ye X, Wei Y, Nalepa G, Harper JW. The cyclin E/Cdk2 substrate p220(NPAT) is required for S-phase entry, histone gene expression, and Cajal body maintenance in human somatic cells. Mol Cell Biol. 2003;23:8586–600. pmid:14612403
- 39. Zhao J, Kennedy BK, Lawrence BD, Barbie DA, Matera AG, Fletcher JA, et al. NPAT links cyclin E-Cdk2 to the regulation of replication-dependent histone gene transcription. Genes Dev. 2000;14:2283–97. pmid:10995386
- 40. Burch BD, Godfrey AC, Gasdaska PY, Salzler HR, Duronio RJ, Marzluff WF, et al. Interaction between FLASH and Lsm11 is essential for histone pre-mRNA processing in vivo in Drosophila. Rna-a Publication of the Rna Society. 2011;17:1132–47.
- 41. Tatomer DC, Terzo E, Curry KP, Salzler H, Sabath I, Zapotoczny G, et al. Concentrating pre-mRNA processing factors in the histone locus body facilitates efficient histone mRNA biogenesis. J Cell Biol. 2016;213:557–70. pmid:27241916
- 42. Hendrickson WA, Horton JR, Lemaster DM. Selenomethionyl Proteins Produced for Analysis by Multiwavelength Anomalous Diffraction (Mad)—a Vehicle for Direct Determination of 3-Dimensional Structure. EMBO J. 1990;9:1665–72. pmid:2184035
- 43. Kabsch W. Xds. Acta Crystallogr D Biol Crystallogr. 2010;66:125–32. pmid:20124692
- 44. Kabsch W. Integration, scaling, space-group assignment and post-refinement. Acta Crystallogr D Biol Crystallogr. 2010;66:133–44. pmid:20124693
- 45. Sheldrick GM. Experimental phasing with SHELXC/D/E: combining chain tracing with density modification. Acta Crystallogr D Biol Crystallogr. 2010;66:479–85. pmid:20383001
- 46. Emsley P, Lohkamp B, Scott WG, Cowtan K. Features and development of Coot. Acta Crystallogr D Biol Crystallogr. 2010;66:486–501. pmid:20383002
- 47. Otwinowski Z, Minor W. Processing of X-ray diffraction data collected in oscillation mode. Method Enzymol. 1997;276:307–26.
- 48. Mccoy AJ, Grosse-Kunstleve RW, Adams PD, Winn MD, Storoni LC, Read RJ. Phaser crystallographic software. J Appl Crystallogr. 2007;40:658–74. pmid:19461840
- 49. Adams PD, Afonine PV, Bunkoczi G, Chen VB, Davis IW, Echols N, et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr D Biol Crystallogr. 2010;66:213–21. pmid:20124702
- 50. Lin MH, Chuang SJ, Chen CC, Cheng SC, Cheng KW, Lin CH, et al. Structural and functional characterization of MERS coronavirus papain-like protease. J Biomed Sci. 2014;21:54. pmid:24898546
- 51. Chou CY, Lai HY, Chen HY, Cheng SC, Cheng KW, Chou YW. Structural basis for catalysis and ubiquitin recognition by the severe acute respiratory syndrome coronavirus papain-like protease. Acta Crystallogr D Biol Crystallogr. 2014;70:572–81. pmid:24531491
- 52. Wu CG, Cheng SC, Chen SC, Li JY, Fang YH, Chen YH, et al. Mechanism for controlling the monomer-dimer conversion of SARS coronavirus main protease. Acta Crystallogr D Biol Crystallogr. 2013;69:747–55. pmid:23633583
- 53. Cheng SC, Chang GG, Chou CY. Mutation of Glu-166 blocks the substrate-induced dimerization of SARS coronavirus main protease. Biophys J. 2009;98:1327–36.
- 54. Hsieh YH, Chou CY. Structural and functional characterization of human apolipoprotein E 72–166 peptides in both aqueous and lipid environments. J Biomed Sci. 2011;18:4. pmid:21219628
- 55. Chou YW, Cheng SC, Lai HY, Chou CY. Differential domain structure stability of the severe acute respiratory syndrome coronavirus papain-like protease. Arch Biochem Biophys. 2012;520:74–80. pmid:22391227
- 56. Ho BL, Cheng SC, Shi L, Wang TY, Ho KI, Chou CY. Critical Assessment of the Important Residues Involved in the Dimerization and Catalysis of MERS Coronavirus Main Protease. PLoS One. 2015;10:e0144865. pmid:26658006
- 57. Schuck P. Size-distribution analysis of macromolecules by sedimentation velocity ultracentrifugation and lamm equation modeling. Biophys J. 2000;78:1606–19. pmid:10692345
- 58. Schuck P. On the analysis of protein self-association by sedimentation velocity analytical ultracentrifugation. Anal Biochem. 2003;320:104–24. pmid:12895474
- 59. Chou CY, Jen WP, Hsieh YH, Shiao MS, Chang GG. Structural and functional variations in human apolipoprotein E3 and E4. J Biol Chem. 2006;281:13333–44. pmid:16540478
- 60. Goujon M, McWilliam H, Li WZ, Valentin F, Squizzato S, Paern J, et al. A new bioinformatics analysis tools framework at EMBL-EBI. Nucleic Acids Res. 2010;38:W695–W9. pmid:20439314
- 61. Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li WZ, et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol. 2011;7.
- 62. Robert X, Gouet P. Deciphering key features in protein structures with the new ENDscript server. Nucleic Acids Res. 2014;42:W320–W4. pmid:24753421
- 63. Jones DT. Protein secondary structure prediction based on position-specific scoring matrices. J Mol Biol. 1999;292:195–202. pmid:10493868