Figures
Abstract
While the basal transcription machinery in archaea is eukaryal-like, transcription factors in archaea and their viruses are usually related to bacterial transcription factors. Nevertheless, some of these organisms show predicted classical zinc fingers motifs of the C2H2 type, which are almost exclusively found in proteins of eukaryotes and most often associated with transcription regulators. In this work, we focused on the protein AFV1p06 from the hyperthermophilic archaeal virus AFV1. The sequence of the protein consists of the classical eukaryotic C2H2 motif with the fourth histidine coordinating zinc missing, as well as of N- and C-terminal extensions. We showed that the protein AFV1p06 binds zinc and solved its solution structure by NMR. AFV1p06 displays a zinc finger fold with a novel structure extension and disordered N- and C-termini. Structure calculations show that a glutamic acid residue that coordinates zinc replaces the fourth histidine of the C2H2 motif. Electromobility gel shift assays indicate that the protein binds to DNA with different affinities depending on the DNA sequence. AFV1p06 is the first experimentally characterised archaeal zinc finger protein with a DNA binding activity. The AFV1p06 protein family has homologues in diverse viruses of hyperthermophilic archaea. A phylogenetic analysis points out a common origin of archaeal and eukaryotic C2H2 zinc fingers.
Citation: Guillière F, Danioux C, Jaubert C, Desnoues N, Delepierre M, Prangishvili D, et al. (2013) Solution Structure of an Archaeal DNA Binding Protein with an Eukaryotic Zinc Finger Fold. PLoS ONE 8(1): e52908. https://doi.org/10.1371/journal.pone.0052908
Editor: Ramón Campos-Olivas, Spanish National Cancer Center, Spain
Received: September 14, 2012; Accepted: November 23, 2012; Published: January 9, 2013
Copyright: © 2013 Guillière et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by the Institut Pasteur and the Centre National de la Recherche Scientifique (CNRS UMR 3528). FG, CD and CJ were supported by the French «Ministère de l'Enseignement Supérieur et de la Recherche». The 600 MHz spectrometer was funded by the Région Ile de France and the Institut Pasteur. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
It is now well established that transcription in archaea, one of the three domains of life, displays characteristics of both eukaryal and bacterial transcription [1], [2]. The minimal basal machinery in archaea consists of an RNA polymerase and the general transcription factors TBP (TATA-box-binding protein) and TFB (transcription factor B), required for transcription initiation. These proteins are homologues of the eukaryal RNA polymerase II (RNAPII), TBP and TFIIB proteins, respectively. In particular, the eukaryal and archaeal RNA polymerases show a striking structural similarity [3], [4]. The archaeal basal machinery is thus homologous structurally and functionally to the core components of the eukaryal RNAPII machinery. In contrast, non-general transcription factors (TF) in archaea are often bacterial-like, and only a few are predicted to be of eukaryal type [1], [2]. For instance, a recent in silico analysis based on 52 archaeal genomes suggested that over 50% of the predicted transcription factors show at least one homologue in bacteria, about 43% are specific to archaea and less than 2% have homologues in eukaryotic organisms [5]. Though some transcription factors in archaea have been analysed in detail [6]–[8], transcription regulation in archaea is still poorly documented.
The presence in archaea of proteins with predicted zinc finger domains of the C2H2 or C2HC type is intriguing as the so-called “classical” zinc finger, hereafter named ZNF, is considered to be an eukaryal-specific motif. Initially discovered in the transcription factor TFIIIA from Xenopus oocytes [9], the ZNF domain has been shown to be very abundant in eukaryotes (e.g. 3% of human genes encode ZNF-containing proteins), practically absent in bacteria with some exceptions as in plant pathogens [10] and scarce, but represented in archaea and their viruses. The classical ZNF motif consists of a short (∼30 residue-long) sequence that uses two or three cysteines and two or one histidines to coordinate a zinc ion (C2H2 or C2HC types, respectively). The ZNF domains fold into a characteristic structure consisting of an α-helix and a β-hairpin held together by the zinc ion and hydrophobic interactions between hydrophobic residues at conserved positions of the sequence. Most of the proteins containing ZNF domains that have been characterised are involved in transcription regulation and bind DNA through their ZNF domains, although ZNF domains can also mediate protein-RNA or protein-protein interactions [11], [12]. ZNFs bind to DNA by inserting the α-helix into the major groove and use three or four exposed residues of the helix to make specific contacts with three or four DNA bases [13], [14]. To recognise their target DNA in a cellular context, ZNFs are usually present in tandem repeats separated by a short linker. Each ZNF repeat binds specifically to DNA using the α-helix and the repeats wrap around the DNA. Some ZNFs like SW15, ADR1 or GAGA, however, are present in only one to three copies and use extensions of the ZNF motif to further contact DNA [15]–[17].
Hyperthermophilic archaea that thrive in hot springs (>80°C) are infected by viruses that show unique morphological and genomic properties that distinguish them from bacteriophages and eukaryal viruses [18]. The majority of the proteins of these viruses does not have detectable homologues in the databases, however, a relatively high proportion is predicted to carry transcription-factor associated folds (up to ∼10% of proteins encoded in genomes with about 50 putative genes) [19]. The abundance of putative TFs in the genomes of hyperthermophilic archaeal viruses probably reflects the importance of transcription regulation in the life cycle of the viruses. As in the case of their hosts, the majority of the predicted TFs are bacterial-like and display a ribbon-helix-helix (RHH) or a helix-turn-helix (HTH) fold. One viral predicted TF, the SvtR protein from virus SIRV1, has been characterised and shown, indeed, to display a RHH structure and to repress transcription of viral genes [20]. Structural analysis of another viral protein (E73) coded by the SSV-like virus SSV-RH, also revealed the presence of a RHH motif involved in DNA recognition [21]. In addition to bacterial-like TFs, archaeal viruses from the Rudiviridae, Lipothrixviridae, Fuselloviridae and the Bicaudaviridae families as well as the unclassified viruses STSV1 and STIV typically present one or two sequences with ZNF motifs.
In this work, we focused on the protein AFV1p06 coded by the gene gp06 of the virus AFV1 (Acidianus filamentous virus 1 [22]), which infects the hyperthermophilic crenarchaeon Acidianus hospitalis. The protein has 59 residues and displays a single ZNF motif with the second zinc-binding histidine of the motif missing. The ZNF motif (28 residues) is flanked by N- and C-terminal regions of unknown structure. AFV1p06 has homologues in crenarchaeal spindle-shaped viruses from the Fuselloviridae family (SSV1, SSV2, SSV4, SSV5, SSV6 and SSVK-1), and is distantly related to eukaryal ZNF containing proteins [19]. Here, we describe the solution structure of AFV1p06 and analyse its DNA binding capabilities.
Materials and Methods
Cloning, Protein Expression and Purification
The gene AFV1p06 of AFV1 (NC_005830.1, also called ORF59a) was amplified by PCR using primers AFV1p06NdeI (5′-ATGCCATATGATTGAGGTTTCTAGTATGG-3′) and AFV1p06XhoI (5′-ATTTCTCGAGTCAGATAATCTTGTTTACAT-3′). The PCR product was digested with NdeI and XhoI and ligated with NdeI and XhoI digested pET-30a (Novagen) plasmid vector.
Recombinant AFV1p06 was expressed without any tag or cloning-derived additional residues using Escherichia coli Rosetta™ (BL21 DE3) pLysS (Novagen) cells. Cultures at 37°C in rich (Luria-Bertani broth) or in minimal M9 media for 15N or 15N/13C labelling, induction with 1 mM isopropyl-β-thio-galactopyranoside, cell harvesting after four hours of induction and cell freezing at −80°C were performed as described [20].
AFV1p06 was purified from inclusion bodies. Frozen cells were thawed, suspended in 50 mM HEPES pH 7.4, 150 mM NaCl, 3 mM dithiothreitol (DTT, buffer A) and lysed with a French press at 4°C adding phenyl-methane-sulphonyl fluoride. The cell lysate was centrifuged 20 min at 7000 g and 4°C and the supernatant was discarded. The cell pellet was suspended in buffer A supplemented with DNAse and RNAse to eliminate nucleic acids and centrifuged at 7000 g for 20 min at 4°C. The cell pellet was then suspended in buffer A containing 1% Triton to eliminate hydrophobic compounds, centrifuged and washed twice with buffer A by means of suspension and centrifugation cycles. The washed pellet was solubilised in buffer A containing 6 M urea (buffer B) and loaded into a size exclusion chromatography column (Sephacryl HR100, GE Healthcare) pre-equilibrated with buffer B. The sample was eluted with buffer B and the AFV1p06 containing fractions were pooled and dialysed at low concentration (∼25 µM) and temperature (4°C) against buffer A containing 500 mM arginine and 50 µM ZnCl2 (buffer C) to renature the unfolded protein. After renaturation, arginine was eliminated by dialysis against buffer C prepared without arginine (buffer D) and loaded on an ion-exchange column (SP Sepharose, GE Healthcare) previously equilibrated with the loading buffer. Proteins were eluted using a linear gradient of NaCl from 150 mM to 1 M in buffer D. The AFV1p06 containing fractions, which eluted at ca. 650 mM NaCl, were pooled, dialysed against the desired buffer (typically 50 mM HEPES pH 7.4, 100 or 150 mM NaCl, 50 µM ZnCl2, 3 mM DTT) and concentrated by centrifugation using Vivaspin (Sartorius) tubes with a 3 kDa cut-off. Protein preparations were aliquoted and kept at −80°C or used directly for NMR experiments.
Protein preparations were homogeneous as assessed by SDS-PAGE and NMR; protein integrity and identity were checked by SELDI-TOF mass spectrometry (Jacques d'Alayer, Microsequencing Facilities, Institut Pasteur). The concentration of the protein was determined using a molar extinction coefficient of 5960 M−1.cm−1 calculated from its sequence [23].
Flame Atomic Emission Spectrophotometry
Experiments were carried out at the Ecole Polytechnique (Palaiseau, France) on a Varian AA220 spectrophotometer equipped with an air-acetylene burner. Readings were performed at 213.9 nm in the peak height mode. Two samples in buffer A were analysed: one was prepared as described above and extensively dialysed to eliminate free zinc ions from the sample; the second one was obtained without adding ZnCl2 during renaturation or the following purification steps and adding a forty fold excess of NaEDTA relative to the protein during the refolding step.
Oligonucleotides
Oligonucleotides were purchased from Proligo (Sigma-Aldrich). Double-stranded DNA was obtained by annealing the corresponding single strand oligonucleotides following standard techniques. For PAGE experiments, oligonucleotides were 32P radiolabelled using the T4-polynucleotide kinase (Fermentas).
DNA Binding
Two 25-bp duplex DNA oligonucleotides, called dsATcomb (top strand sequence 5′-AATGATTCTAAGTATCTTAGAAACA-3′) and dsGCcomb (top strand sequence 5′-AGGGTGGCAGCGTCGGAGCCTCGCA-3′) were obtained by annealing the corresponding single strand complementary oligonucleotides. Prior to annealing, one strand of each oligonucleotide was 32P-radiolabelled. Each double-stranded labelled oligonucleotide (75 nM) was incubated with increasing amounts of AFV1p06 (from 0 to 2 µM) for 15 min at 48°C in 20 µl of binding buffer: 50 mM HEPES, 10 µM ZnCl2, 150 mM NaCl, 5% (v/v) glycerol, 0.02% Tween, 3 mM DTT, pH 7.4. The binding buffer was supplemented with 50 ng/µL of unspecific salmon sperm DNA. The DNA-protein mixtures were deposited in a non-denaturing 6% 37.5:1 acrylamide/bisacrylamide gel. PAGE was run in TBE buffer (89 mM Tris-borate, 2 mM NaEDTA, pH 8.3). After migration, the gel was vacuum-dried, exposed with Amersham Biosciences Hyperfilm™ MP and developed with a Kodak X-OMAT 2000 processor.
Binding of dsATcomb and dsGCcomb to AFV1p06 was also tested by competition experiments in which labelled dsATcomb or dsGCcomb at a fixed concentration (75 nM) were used as probes in electromobility gel shift assays (EMSA) and unlabelled dsGCcomb or dsATcomb at varying concentrations (0, 0.5, 1, 2 and 5-fold molar ratio of unlabelled/labelled oligonucleotide), were used as competitors. Oligonucleotides and AFV1p06 (0.5 µM) were incubated 15 min at 48°C in 20 µL of binding buffer. PAGE, gel drying and development were performed as described above.
NMR Samples
Samples were prepared in buffer E: 50 mM HEPES pH 7.4, 100 mM NaCl, 44 µM ZnCl2, 4.5 mM DTT, 12% D2O. Protein concentration typically ranged between 0.4 and 1.0 mM for 15N labelled and 13C/15N labelled samples.
NMR
Experiments were performed on a Varian NMR System 600 spectrometer (Agilent Technologies, Santa Clara) with a proton resonating frequency of 599.4 MHz. The spectrometer was equipped with a cryogenic probe. Spectra were recorded at 25°C and referenced to sodium 4,4-dimethyl-4-silapentane-1-sulphonate following IUPAC recommendations. Data were collected using VnmrJ 2.3A (Agilent Technologies), processed with NMRPipe [24] and analysed with NMRView 5.2.2 [25].
Standard two- and three-dimensional experiments were recorded to assign chemical shifts to the protein 1H, 13C and 15N nuclei: 13C or 15N HSQC (heteronuclear single quantum coherence) [26]), HNCO, HNCACB, CBCA(CO)NH [27], H(CC-TOCSY)NNH, C(CC-TOCSY)NNH [28], [29], (HB)CB(CGCD)HD and (HB)CB(CGCDCE)HE [30].
AFV1p06 backbone dynamics analysis was based on 15N relaxation experiments [31] used to calculate the longitudinal (R1) and transverse 15N (R2) relaxation rates.
NMR and structure calculations– Distance constraints for structure calculations were obtained from 3D 13C-edited (aromatic and aliphatic regions) and 15N-edited NOESY-HSQC (nuclear Overhauser effect spectroscopy - HSQC) experiments recorded with 120 ms mixing times [26], [32]. Proton JHN-HA scalar couplings were calculated from a HNHA experiment [33], [34] and transformed into dihedral φ angle constraints as follows: −120°±25° for 3JHN-Hα ≥ 8.0 Hz, −65°±25° for 3JHN-Hα ≤ 5.5 Hz. Further dihedral φ and ψ constraints were obtained with Talos [35]. A backbone hydrogen bond in regions of secondary structure was added as distance constraint if the chemical shift data, the nOe pattern and the amide hydrogen exchange data were in agreement with a hydrogen bond, and if it was present in at least 75% of the structures calculated without any hydrogen bond. Hydrogen exchange was analysed using the HET-SOFAST experiment [36]: two spectra with (saturation) or without (reference) inversion of the water signal were acquired to evaluate the protection against exchange from the saturation transfer between water and amide protons. The residues with a ratio of intensities higher than 0.75 between the saturation and reference experiments were considered to be exchange protected.
NOESY spectra assignments and structure calculations were performed with ARIA 2.2 [37], [38] coupled to CNS 1.2 [39] following ARIA's standard protocols with spin diffusion correction.
Spin diffusion was corrected using an isotropic rotation correlation time of 6.3 ns (±0.6 ns), which was determined from 15N relaxation data as described in [40]. Chemical shift tolerances were set to 0.03 and 0.04 ppm for protons in the direct and indirect dimensions, respectively, 0.5 ppm for 13C and 0.35 ppm for 15N. For structure calculations and nOe (nuclear Overhauser enhancement) assignments, the zinc atom was coordinated with a tetrahedral geometry by the Sγ atoms of cysteines 13 and 16 and the Nε2 atom of histidine 29 (see Results section). Histine was unprotonated. The zinc ion was attached to the Sγ atom of residue 16, and the tetrahedral geometry was maintained by modifying the force field topology and parameter files. Once the nOes were assigned and the distance constraints were obtained, two different final structure ensembles were calculated using either the full-length protein (residues 1–59) or only the structured region (residues 7–51). The final structures were obtained by calculating 200 structures with ARIA 2.2/CNS 1.2 and refining the lower energy 150 structures in explicit water using the PARALLHDG 5.3 force field [41]. The 10 lowest-total-energy structures were selected. The quality of the structures was analysed with Procheck 3.5.4 [42], What_check [43], Molmol 2K.2 [44] and Pymol (Schrödinger LLC).
Phylogenetic Analysis
To gather the amino acid sequences for phylogenetic analysis, we searched the non-redundant protein sequence database (nr) at NCBI for homologues of AFV1 virus p06-ORF59a (GI: 82056192) using the PSI-BLAST algorithm 2.2.26+ [45] in ten iterative steps with default parameters. Whenever the algorithm ran out of new proteins to include in the iteration, the protein with the best E-value and with a conserved C2H2 motif was manually picked. Sequences were aligned using the CLC Sequence Viewer software (CLC Bio, Denmark) with default parameters. The tree was calculated using the Neighbour-Joining method and a 100 replicate bootstrap analysis.
The protein knowledgebase UniProtKB database was questioned using the query “zinc AND finger AND C2H2” to obtain the sequences of proteins with predicted ZNF motifs in the three domains of life.
Accession Codes
The structure and chemical shifts of AFV1p06 have been deposited in the PDB protein data bank (http://www.pdb.org) and the BMRB database (http://www.bmrb.wisc.edu) under the accession numbers 2LVH and 18570, respectively.
Results
Zinc Chelation by AFVIp06
As the homology of AFV1p06 to C2H2 and C2HC zinc fingers suggested that the protein could bind zinc, we performed flame photometry experiments on samples that had been carefully depleted of free zinc. These experiments confirmed that AFV1p06 binds zinc and showed that one mole of protein binds one mole of zinc. In addition, sedimentation-diffusion equilibrium ultracentrifugation experiments performed at a 50 µM concentration (Bertrand Raynal, Plate-forme de Biophysique, Institut Pasteur), and NMR 1H-15N HSQC spectra, which were invariant for AFV1p06 concentrations between 50 µM and 1.0 mM, indicated that the protein is monomeric up to millimolar concentrations. Thus, one AFV1p06 monomer binds one zinc ion.
In ZNF proteins, zinc is tetra-coordinated by two Cys and two His residues (C2H2) or three Cys and one His residue (C2HC). AFV1p06 contains two Cys residues (C13 and C16) and a single His residue (H29) that are part of the ZNF motif and that could be involved in zinc coordination. Nevertheless, the protein lacks the fourth zinc ligand, which could be either a water molecule or the side chain of residue E34 that in the sequence alignments with ZNF proteins is positioned close to the fourth zinc ligand (H or C). We performed a NMR chemical shift analysis to verify if residues C13, C16 and H29 could be involved in metal chelation. On the one hand, the Cβ and Cα chemical shifts of residues C13 and C16 were in agreement with those of metalloproteases [46], indicating that both cysteine Sγ atoms bind zinc; on the other hand, the comparison of the aromatic ring carbon chemical shifts (Cδ2 and Cε1) of H29 with that of histidine residues (deposited in the BMRB database) that bind or do not bind zinc, indicated that H29 binds zinc and that it ligates zinc through its Nε2 atom. This analysis was corroborated by a recently published method to determine the coordination of zinc by His residues based on the difference of the aromatic Cδ2 and Cε1 chemical shifts [47]: in the case of AFV1p06, this difference is 12.96 ppm, which corresponds well to the value observed for Nε2 coordination 12.32±0.86 ppm. Based on this experimental data, we calculated AFV1p06 structures considering that zinc was coordinated by residues C13 (Sγ), C16(Sγ) and H29 (Nε2) and we used the structures to determine the fourth ligand of zinc. Importantly, no bias that could influence the determination of the fourth ligand was introduced in the calculations because the nOe assignments for distance constraints were performed automatically.
Resonance Assignments of AFV1p06
The 1H, 15N and 13C resonance frequencies of most backbone and side chain atoms were assigned (92%). Missing assignments mainly corresponded to exchangeable protons of lysine, arginine, asparagine and glutamine side chains, as well as to the backbone amide protons of residues S6, M7 and K23 (the assigned 15N-1H HSQC spectrum of AFV1p06 is shown in Figure S1).
Structure of AFV1p06
The structure ensemble of AFV1p06 shows a compact and convergent region between residues 8–50 and disordered N (1–7) and C (51–59) termini (Figure 1, Table 1). The structure consists of a three-stranded antiparallel β-sheet (residues 8–13, 19–20 and 45–50) packed against an α-helix (23–32), as well as of a short 310 helix (41–43) located at the end of a long loop between the α-helix and the third strand of the β-sheet. As expected, the region with the ZNF sequence motif (residues 9–35) shows a typical zing finger fold with an antiparallel β-hairpin packed against an α-helix and with the zinc ion sandwiched between the latter structural elements. Indeed, a search for structural homologues in the DALI database (http://www.dali.server.org) with the structure of AFV1p06 between residues 9–35 produces over 150 ZNF structures with statistically significant scores and low root mean square deviations (RMSD≤1.8 Å over ∼25 CA atoms). When the coordinates of the structured region between residues 8 and 50 were used to find structural homologues, only the ZNF region gave significant hits, indicating that AFV1p06 shows a novel extension of the ZNF fold (loop with a 310 helix+3rd strand of ZNF β-sheet).
The backbone superposition of the 10 structures calculated for the full-length protein is shown in two different orientations (A and B) and on a main-chain cartoon representation for residues 7–51 (C). A topology diagram of the structure, the sequence of AFV1p06 in the ZNF region (residues 9–35) and the ZNF sequence motif are shown in (D). Residues in the ZNF region are coloured in red. In (C), the side-chains of the residues that coordinate zinc are displayed in cyan (C13, C16 and H29) or violet (E34) and the zinc atom in blue. In (D), ψ stands for a hydrophobic residue. Helices are represented by rectangles and β-strands by arrows.
The lack of convergence observed for the N- and C-termini of the protein correlates with a very low number of nOes shown by residues 1–7 and 51–59 and more specifically, with the absence of medium or long range nOes. This disorder is due to the dynamics of the protein as assessed by the 15N relaxation characteristics of the backbone amide groups. For instance, most of the N and C-termini amide groups showed low 15N transverse relaxation rates (R2) values relative to those observed for the rest of the protein, indicating high amplitude motions in the nanosecond-picosecond time scale (Figure S2). Also, residue S5 showed a very high R2 rate, and amide resonances of residues S6 and M7 were not observed, presumably due to exchange broadening (high R2 rates), suggesting that the latter residues exchange between different conformations in the microsecond-millisecond time scale. Thus, the N- and C-termini of AFV1p06 are highly dynamic.
The fourth ligand of the zinc ion was identified using the structure ensemble of AFV1p06: in all the structures, a side-chain oxygen atom of the carboxylic group of residue E34 is close to the zinc ion at a distance (1.99±0.04 Å) that is in agreement with those observed for zinc coordinated by a glutamic acid residue [1.95±0.08 Å, [48]]. This observation indicates that residue E34 is the fourth residue implicated in zinc coordination. Although a glutamic acid residue is not commonly observed as a zinc ligand, it coordinates zinc in some proteins in which the latter ion plays a structural role [48]. In AFV1p06, the zinc ion is tightly bound. Indeed, the protein retains zinc in the presence of a 10 fold excess of NaEDTA, as evidenced by the lack of changes in the NMR 1H-15N HSQC spectrum of the protein in the presence of the latter chelating agent.
AFV1p06 Binds Preferentially to GC Rich DNA
The structure and zinc binding properties of AFV1p06 indicate that this protein is a classical zinc finger. As most of the ZNF proteins that have been characterised have been shown to bind double stranded DNA [14], we tested the DNA binding capabilities of AFV1p06. Because the putative binding site for AFV1p06 was not known, we initially performed EMSA experiments in the presence of unspecific DNA and high concentrations of the protein. We chose two DNA oligonucleotides that were extremely different in their nucleotide composition: the oligonucleotides, either single (ssDNA) or double stranded (dsDNA), were 24 nt long and exclusively composed of the succession of AT (polyAT) or CG (polyCG) pairs. The EMSA experiments indicated that AFV1p06 could bind dsDNA at micromolar concentrations and did not bind the corresponding single strand DNAs, and this independently of their DNA composition. Interestingly, the dsDNA polyCG oligonucleotide was clearly better recognised by AFV1p06 than the polyAT one (not shown). At high salt concentration (500 mM), AFV1p06 was also able to bind dsDNA and recognised better the polyCG oligonucleotide, suggesting that the interaction of this protein with DNA is not only based on protein – DNA-backbone electrostatic interactions but involves DNA bases.
Following these observations and in order to better characterise the DNA binding activity of AFV1p06, we designed two additional double strand oligonucleotides of 25 bp called “dsATcomb” (5′-AATGATTCTAAGTATCTTAGAAACA-3′) and “dsGCcomb” (5′-AGGGTGGCAGCGTCGGAGCCTCGCA-3′). The composition of these oligonucleotides was inspired by the crystal structure of the complex of the DNA-binding domain of the transcription factor Zif268 and its binding site. In the latter complex, each of the three ZNFs of Zif268 establishes specific contacts with 3 bases on one strand of the DNA [49]–[51]. Because the protein AFV1p06 has a single ZNF domain we hypothesized that its α-helix would interact with a short 3 nt DNA core site. The oligonucleotides “dsATcomb” and “dsGCcomb” were hence designed to carry regularly interspaced repetitions of different combinations of triplets (6 from 8 possible) composed of either A or/and T for the “dsATcomb” and G or/and C for the dsGCcomb oligonucleotides. With this combinatorial approach we tried to create one or several short DNA sub-regions in the analysed oligonucleotides that could be better recognised by AFV1p06 to test if the protein is able to discriminate between different DNA sequences.
PAGE-EMSA experiments were performed with radioactively labelled dsATcomb and dsGCcomb in the presence of an excess of non-specific unlabelled dsDNA (Figure 2A). Both oligonucleotides show a retard in migration in the presence of AFV1p06, indicating that the protein binds dsDNA on the µM concentration range. The binding of AFV1p06 to the GC-rich dsDNA oligonucleotide is at least twice more efficient than that observed for the AT-rich one. We also compared the efficiency of retardation of each oligonucleotide in the presence of the second one as a “cold” competitor. Even at a 1∶0.5 ratio between 32P labelled dsATcomb and unlabelled dsGCcomb a clear decrease of the signal corresponding to the shifted form of dsATcomb is observed, indicating that dsGCcomb can efficiently displace dsATcomb (Figure 2B). To observe a similar efficiency, a five-fold excess of “cold” dsATcomb has to be added to “hot” dsGCcomb. These results suggest that AFV1p06 shows a preference for GC motifs and thus can sense different bases and display some specificity in dsDNA recognition.
(A) Binding to dsATcomb (left) and dsGCcomb (right) at a fixed concentration (75 nM) with increasing concentrations of AFV1p06 (0 to 2 µM). (B) Competition assays: experiments were performed in the presence of 0.5 µM AFV1p06 using “hot” radiolabelled dsATcomb and increasing amounts of dsGCcomb as a “cold” competitor (top), or radiolabelled dsGCcomb and increasing amounts dsATcomb as a “cold” competitor (bottom). The ratios between “hot” and “cold” oligonucleotides are indicated. Arrows show the position of the shifted DNA band.
In an attempt to identify its presumed DNA target sequence, we followed a target candidate approach testing the binding of the protein to the region of its own promoter, as very often transcription regulators show cis-regulation. However, even if the promoter region of the gp06 gene is unusually GC rich compared to the generally low GC content of the AFV1 genome (36%), under the in vitro conditions used, the efficiency of AFV1p06 binding to this region was not significantly different from that of a “non-specific” AT rich DNA from a non related virus (data not shown).
Phylogenetic Studies
The AFV1p06-related proteins identified by the PSI-BLAST approach are divided into two clearly separated clusters of archaeal and eukaryal proteins that show a common origin (Figure S3). The archaeal proteins are grouped into two sub-clusters representing the two major archaeal phyla, Cren- and Euryarchaeota. No homologue could be identified in the domain of bacteria or in the third phylum of archaea, the Thaumarchaeota.
The group affiliated to Crenarchaeota includes 8 representatives forming the “AFV1p06 family”. All of them are coded either by crenarchaeal viruses (SVS-K1 [52], SSV1 [53], SSV2 ([54], SSV4 and SSV5 [55], SSV6 [56] and AFV1 [22] or by proviruses integrated into the chromosome. In the case of S. islandicus M.14.25 the AFV1p06 homologue (M1425–1829) is annotated as being coded by a chromosomal gene but a more detailed analysis of this region, which shows typical viral att-like sites, clearly indicates the viral origin of the locus. The alignment of the predicted ZNF regions of these proteins indicates the presence of the seven highly conserved amino acids of the ZNF motif (two Cys, two His as well as three hydrophobic amino acids indicated by squares in Figure 3) except in the case of AFV1p06 in which the last His residue is replaced by a Glu residue. Thirteen additional amino acids (Figure 3) are well conserved in crenarchaeal C2H2-like proteins and six of them (indicated by asterisks) are localised in the loop, helix and third β-strand situated downstream to the ZNF fold. The latter six residues are exposed to the solvent in the structure of AFV1p06, suggesting that these residues are conserved because of their functional importance rather than their role in structure maintenance. The conservation pattern of the proteins in the AFV1p06 family and the nature of the amino residues, strongly suggest that the crenarchaeal C2H2-like proteins show the same structure extension of the ZNF fold as AFV1p06.
The figure shows the alignment of 27 hits corresponding to archaeal zinc finger proteins bearing an AFV1p06-like motif. Squares: position of the seven idiosyncratic residues of the ZNF fold; open circles: amino acids conserved in archaea but not in eukaryotes; triangles: amino acids specific to cren- or euryarchaea; open squares: amino acids conserved only in crenarchaea in the ZNF fold extension observed in AFV1p06 (loop+helix+3rd strand of the β-sheet). The horizontal line separates the archaeal viral and cellular proteins.
A distant group of putative ZNF proteins found in the Euryarchaeota (20 representatives) is very similar to the crenarchaeal viral AFV1p06 family in the ZNF motif region but does not show any conservation in the ZNF downstream extension. The origin, cellular or viral, of the genes belonging to this sister of the AFV1p06 group of euryarchaeal ZNF proteins is unclear.
Noteworthy, although AFV1p06 is the only protein in the alignment shown in Figure 3 that displays a Glu residue at the position of the second histidine of the C2H2 motif, it should be mentioned that in eukaryotic ZNFs, the 4th ligand in the motif is also not conserved in a number of variant ZNFs. Conservation of a histidine seems thus less important at the fourth position, an observation that could be explained by the fact the 4th ligand is not crucial to retain zinc binding capabilities as shown in a mutation/folding and stability analysis [57] or by the fact that it can be replaced by a water molecule [58].
Discussion
The results described in this paper indicate that the archaeal virus protein AFV1p06 has a classical ZNF structure composed of an α-helix and a β-hairpin, a novel extension to this fold and disordered N and C terminal ends. In addition, the EMSA experiments show that the protein can bind to DNA at sub-micromolar concentrations and discriminate between different DNA sequences. Although the presumed biological target(s) of AFV1p06 on the AFV1 virus and or its host (Acidianus sp.) genome remains unknown, these results suggest that AFV1p06 could potentially be a transcription regulator.
Classical zinc fingers usually bind to DNA using exposed residues at positions −1, +2, +3 and +6 of the α-helix that make specific contacts with DNA bases and establish other non-specific contacts with DNA as well. The electrostatic potential of AFV1p06 on the α-helix face is positive (Figure 4) and seems thus favourable for interacting with the negatively charged DNA poly-anion. Moreover, residues that occupy the DNA-contacting positions in AFV1p06 [T (−1), K (+2), Q (+3) and L (+6)] have been observed in ZNF–DNA complexes and could in principle establish specific contacts with DNA bases [13]. Manual docking of AFV1p06 into ZNF–DNA complex structures [PDB codes 2JPA and 2GLI, [59], [60]] suggests that AFV1p06 may also interact with DNA using the α-helix: the superimposition of the structure of AFV1p06 with that of ZNFs in complex with DNA indicates that residues at key positions of the helix could indeed make contacts with DNA and that the basic residues R8, R21 (−2) and K23 (+1), would be close to the DNA phosphate groups.
A representative conformer was used. Positive charges are represented in blue and negative ones in red. The left and right views are rotated by 180° on the x-axis. The side chains of the residues that coordinate zinc are shown in cyan and the zinc ion as a blue sphere.
To recognise its cognate DNA in a cellular context, more than three or four specific DNA nucleotide bases/α -helix residue protein contacts must be established. To this end, eukaryal TFs usually show tandem repeats of ZNF motifs, or like in the case of the GAGA protein, make use of another module that also binds specifically to DNA [16]. The manual docking of AFV1p06 shows that the novel extension of the ZNF motif (the loop and third strand of the β-sheet) is far from the ZNF–DNA contact region (not shown), suggesting that this extension cannot directly contribute to the interaction without major conformational changes. Hence, two possibilities can be envisioned for specific binding of AFV1p06 to DNA: (i) the N- and/or C- disordered termini could participate in the interaction; (ii) another DNA binding protein could bind to AFV1p06 on the β-sheet face. Indeed, the hydrophobic character of the exposed β-sheet face and the conservation of its hydrophobicity in AFV1p06 homologues in SSV crenarchaeal viruses, make this face of the protein a good candidate for protein-protein interactions. In this respect, it is interesting to note that Pérez-Rueda and Janga have observed that although bacterial-like, predicted TFs in archaea are statistically smaller (shorter sequence) than in bacteria and specific ligand-binding modules are under-represented [5]. These authors have suggested that protein-protein interactions in archaeal TFs could mediate regulatory feedback. Similarly, it can be hypothesised that archaeal ZNF proteins could also need a protein partner for specific DNA recognition. In this sense, predicted archaeal and archaeal virus ZNFs appear in only one (∼74%) or two copies (∼20%) per protein in relatively short proteins (most often less than 100 residues). This situation is very rare in eukaryotes, in which very often ZNFs are present in tandem repeats. Although we cannot exclude that the ZNF fold in archaea may be preferentially used for protein-protein interactions or RNA-protein interactions, the fact that AFV1p06, which only contains one ZNF motif, does interact with non-specific DNA with relatively high affinities in vitro suggests to us that at least some of these proteins may be TFs and may use either other modules within the same protein or may interact with other proteins to control gene expression. Despite its small size, in the case of the N–terminal GATA-1 ZNF and its FOG ZNF partner, it has been observed that the ZNF fold can cope with simultaneous specific protein–DNA and protein–protein interactions or that two different ZNFs can bind to form heterodimers that bind DNA specifically [61]–[63]. Also, fungal GATA proteins involved in gene regulation display only one ZNF motif, bind to specific DNA sequences and can mediate protein–protein interactions that are important to regulate gene expression [64].
It should be underlined that proteins bearing the C2H2 zinc finger motif are essentially known and characterised in eukaryotes. In this domain of life, ZNFs are predicted to be coded by more than 1% of the genes compared to 0.07% for the archaea (278 examples in the Uniprot database) and only 0.003% (489 examples) for the bacteria. The phylogenetic analysis described here, clearly indicated a common origin of the AFV1p06-like ZNF domain for archaea and eukaryotes, and its absence in bacteria. In crenarchaea all the known genes have a virus related origin.
AFV1p06 is the first archaeal protein with an eukaryal ZNF fold to be characterised experimentally and the first for which the DNA binding and sequence preference capabilities have been demonstrated. It would be interesting in the future to identify its presumed targets on the AFV1 and/or Acidianus genomes and understand its role in the virus infection cycle.
Supporting Information
Figure S1.
Assigned 1H-15N HSQC spectrum of AFV1p06.
https://doi.org/10.1371/journal.pone.0052908.s001
(PDF)
Figure S2.
Structural (number of nOe-derived distance constraints and structure backbone RMSD) and dynamics (amide 15N transverse relaxation rate) data of AFV1p06 on a per residue basis.
https://doi.org/10.1371/journal.pone.0052908.s002
(PDF)
Figure S3.
Phylogenetic tree of the distribution of the AFV1p06-like ZNF fold in Eukarya and Archaea .
https://doi.org/10.1371/journal.pone.0052908.s003
(PDF)
Acknowledgments
We thank Bertrand Raynal (PFBMI, Institut Pasteur) for analytical centrifugation experiments and Michel Fromant for access to and help with the flame-photometer.
Author Contributions
Conceived and designed the experiments: JIG GS DP MD. Performed the experiments: FG CD JIG CJ ND. Analyzed the data: FG CD JIG GS CJ. Contributed reagents/materials/analysis tools: ND. Wrote the paper: JIG GS FG CD CJ DP MD.
References
- 1. Aravind L, Koonin E (1999) DNA-binding proteins and evolution of transcription regulation in the archaea. Nucleic Acids Res 27: 4658–4670.
- 2. Bell SD, Jackson SP (2001) Mechanism and regulation of transcription in archaea. Curr Opin Microbiol 4: 208–213.
- 3. Armache KJ, Mitterweger S, Meinhart A, Cramer P (2005) Structures of complete RNA polymerase II and its subcomplex, Rpb4/7. J Biol Chem 280: 7131–7134.
- 4. Hirata A, Klein BJ, Murakami KS (2008) The X-ray crystal structure of RNA polymerase from Archaea. Nature 451: 851–854.
- 5. Perez-Rueda E, Janga SC (2010) Identification and genomic analysis of transcription factors in archaeal genomes exemplifies their functional architecture and evolutionary origin. Mol Biol Evol 27: 1449–1459.
- 6. Bell SD (2005) Archaeal transcriptional regulation–variation on a bacterial theme? Trends Microbiol 13: 262–265.
- 7. Kessler A, Sezonov G, Guijarro JI, Desnoues N, Rose T, et al. (2006) A novel archaeal regulatory protein, Sta1, activates transcription from viral promoters. Nuc Ac Res 34: 4837–4845.
- 8. Abella M, Rodriguez S, Paytubi S, Campoy S, White MF, et al. (2007) The Sulfolobus solfataricus radA paralogue sso0777 is DNA damage inducible and positively regulated by the Sta1 protein. Nucleic Acids Res 35: 6788–6797.
- 9. Miller J, McLachlan AD, Klug A (1985) Repetitive zinc-binding domains in the protein transcription factor IIIA from Xenopus oocytes. Embo J 4: 1609–1614.
- 10. Bouhouche N, Syvanen M, Kado CI (2000) The origin of prokaryotic C2H2 zinc finger regulators. Trends Microbiol 8: 77–81.
- 11. Brown RS (2005) Zinc finger proteins: getting a grip on RNA. Curr Opin Struct Biol 15: 94–98.
- 12. Gamsjaeger R, Liew CK, Loughlin FE, Crossley M, Mackay JP (2007) Sticky fingers: zinc-fingers as protein-recognition motifs. Trends Biochem Sci 32: 63–70.
- 13. Wolfe SA, Nekludova L, Pabo CO (2000) DNA recognition by Cys2His2 zinc finger proteins. Annu Rev Biophys Biomol Struct 29: 183–212.
- 14. Klug A (2010) The discovery of zinc fingers and their applications in gene regulation and genome manipulation. Annu Rev Biochem 79: 213–231.
- 15. Dutnall RN, Neuhaus D, Rhodes D (1996) The solution structure of the first zinc finger domain of SWI5: a novel structural extension to a common fold. Structure 4: 599–611.
- 16. Omichinski JG, Pedone PV, Felsenfeld G, Gronenborn AM, Clore GM (1997) The solution structure of a specific GAGA factor-DNA complex reveals a modular binding mode. Nat Struct Biol 4: 122–132.
- 17. Bowers PM, Schaufler LE, Klevit RE (1999) A folding transition and novel zinc finger accessory domain in the transcription factor ADR1. Nat Struct Biol 6: 478–485.
- 18. Prangishvili D, Garrett RA (2004) Exceptionally diverse morphotypes and genomes of crenarchaeal hyperthermophilic viruses. Biochem Soc Trans 32: 204–208.
- 19. Prangishvili D, Garrett R, Koonin E (2006) Evolutionary genomics of archaeal viruses: unique viral genomes in the third domain of life. Virus Res 117: 52–67.
- 20. Guilliere F, Peixeiro N, Kessler A, Raynal B, Desnoues N, et al. (2009) Structure, function, and targets of the transcriptional regulator SvtR from the hyperthermophilic archaeal virus SIRV1. J Biol Chem 284: 22222–22237.
- 21. Schlenker C, Goel A, Tripet BP, Menon S, Willi T, et al. (2012) Structural studies of E73 from a hyperthermophilic archaeal virus identify the “RH3” domain, an elaborated ribbon-helix-helix motif involved in DNA recognition. Biochemistry 51: 2899–2910.
- 22. Bettstetter M, Peng X, Garrett RA, Prangishvili D (2003) AFV1, a novel virus infecting hyperthermophilic archaea of the genus acidianus. Virology 315: 68–79.
- 23. Pace CN, Vajdos F, Fee L, Grimsley G, Gray T (1995) How to measure and predict the molar absorption coefficient of a protein. Protein Sci 4: 2411–2423.
- 24. Delaglio F, Grzesiek S, Vuister GW, Zhu G, Pfeifer J, et al. (1995) NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J Biomol NMR 6: 277–293.
- 25. Johnson BA, Blevins RA (1994) NMRView: a computer program for the visualisation and analysis of NMR data. J Biomol NMR 4: 603–614.
- 26. Kay LE, Keifer P, Saarinen T (1992) Pure absorption gradient enhanced heteronuclear single quantum correlation spectroscopy with improved sensitivity. J Am Chem Soc 114: 10663–10665.
- 27. Muhandiram DR, Kay LE (1994) Gradient-enhanced triple-resonance three-dimensional NMR experiments with improved sensitivity. J Magn Reson Series B 103: 203–216.
- 28. Grzesiek S, Anglister J, Bax A (1993) Correlation of backbone amide and aliphatic side-chain resonances in 13C/15N-enriched proteins by isotropic mixing of 13C magnetization. J Magn Reson Series B 101: 114–119.
- 29. Logan TM, Olejniczak ET, Xu RX, Fesik SW (1993) A general method for assigning NMR spectra of denatured proteins using 3D HC(CO)NH-TOCSY triple resonance experiments. J Biomol NMR 3: 225–231.
- 30. Yamazaki T, Forman-Kay JD, Kay LE (1993) Two-dimensional NMR experiments for correlating 13Cβ and 1Hδ/ε chemical shifts of aromatic residues in 13C-labeled proteins via scalar couplings. J Am Chem Soc 115: 11054–11055.
- 31. Farrow NA, Muhandiram R, Singer AU, Pascal SM, Kay CM, et al. (1994) Backbone dynamics of a free and phosphopeptide-complexed Src homology 2 domain studied by 15N NMR relaxation. Biochemistry 33: 5984–6003.
- 32. Zhang O, Kay LE, Olivier JP, Forman-Kay D (1994) Backbone 1H and 15N resonance assignments of the N-terminal SH3 domain of drk in folded and unfolded states using enhanced-sensitivity pulsed field gradient NMR techniques. J Biomol NMR 4: 845–858.
- 33. Vuister GW, Bax A (1993) Quantitative J correlation: a new approach for measuring homonuclear three-bond J HNHα coupling constants in 15N enriched proteins. J Am Chem Soc 115: 7772–7777.
- 34. Grzesiek S, Kuboniwa H, Hinck AP, Bax A (1995) Multiple-quantum line narrowing for measurement of Hα-Hβ J couplings in isotopically enriched proteins. J Am Chem Soc 117: 5312–5315.
- 35. Cornilescu G, Delaglio F, Bax A (1999) Protein backbone angle restraints from searching a database for chemical shift and sequence homology. J Biomol NMR 13: 289–302.
- 36. Schanda P, Forge V, Brutscher B (2006) HET-SOFAST NMR for fast detection of structural compactness and heterogeneity along polypeptide chains. Magn Reson Chem 44 Spec No: S177–184.
- 37. Nilges M, Macias MJ, O'Donoghue SI, Oschkinat H (1997) Automated NOESY interpretation with ambiguous distance restraints: the refined NMR solution structure of the pleckstrin homology domain from β-spectrin. J Mol Biol 269: 408–422.
- 38. Rieping W, Habeck M, Bardiaux B, Bernard A, Malliavin TE, et al. (2007) ARIA2: automated NOE assignment and data integration in NMR structure calculation. Bioinformatics 23: 381–382.
- 39. Brünger AT, Adams PD, Clore GM, DeLano WL, Gros P, et al. (1998) Crystallography & NMR system: a new software suite for macromolecular structure determination. Acta Cryst D54: 905–921.
- 40. Mandel AM, Akke M, Palmer AG (1995) Backbone dynamics of Escherichia coli ribonuclease HI: correlations with structure and function in an active enzyme. J Mol Biol 246: 144–163.
- 41. Linge JP, Habeck M, Rieping M, Nilges M (2003) Refinement of protein structures in explicit solvent. Prot: Struct Funct & Genet 50: 496–506.
- 42. Laskowski RA, MacArthur MW, Moss DS, Thornton JM (1993) PROCHECK: a program to check the stereochemical quality of protein structures. J Appl Crystallogr 26: 283–291.
- 43. Hooft RW, Vriend G, Sander C, Abola EE (1996) Errors in protein structures. Nature 381: 272.
- 44. Koradi R, Billeter M, Wüthrich K (1996) MOLMOL: a program for display and analysis of macromolecular structures. J Mol Graphics 14: 51–55.
- 45. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–3402.
- 46. Sharma D, Rajarathnam K (2000) 13C NMR chemical shifts can predict disulfide bond formation. J Biomol NMR 18: 165–171.
- 47. Barraud P, Schubert M, Allain FH (2012) A strong (1)(3)C chemical shift signature provides the coordination mode of histidines in zinc-binding proteins. J Biomol NMR 53: 93–101.
- 48. Alberts IL, Nadassy K, Wodak SJ (1998) Analysis of zinc binding sites in protein crystal structures. Protein Sci 7: 1700–1716.
- 49. Pavletich NP, Pabo CO (1991) Zinc finger-DNA recognition: crystal structure of a Zif268-DNA complex at 2.1 A. Science 252: 809–817.
- 50. Isalan M, Choo Y, Klug A (1997) Synergy between adjacent zinc fingers in sequence-specific DNA recognition. Proc Natl Acad Sci U S A 94: 5617–5621.
- 51. Isalan M, Klug A, Choo Y (1998) Comprehensive DNA recognition through concerted interactions from adjacent zinc fingers. Biochemistry 37: 12026–12033.
- 52. Wiedenheft B, Stedman K, Roberto F, Willits D, Gleske AK, et al. (2004) Comparative genomic analysis of hyperthermophilic archaeal Fuselloviridae viruses. J Virol 78: 1954–1961.
- 53. Palm P, Schleper C, Grampp B, Yeats S, McWilliam P, et al. (1991) Complete nucleotide sequence of the virus SSV1 of the archaebacterium Sulfolobus shibatae. Virology 185: 242–250.
- 54. Stedman KM, She Q, Phan H, Arnold HP, Holz I, et al. (2003) Relationships between fuselloviruses infecting the extremely thermophilic archaeon Sulfolobus: SSV1 and SSV2. Res Microbiol 154: 295–302.
- 55. Peng X (2008) Evidence for the horizontal transfer of an integrase gene from a fusellovirus to a pRN-like plasmid within a single strain of Sulfolobus and the implications for plasmid survival. Microbiology 154: 383–391.
- 56. Redder P, Peng X, Brugger K, Shah SA, Roesch F, et al. (2009) Four newly isolated fuselloviruses from extreme geothermal environments reveal unusual morphologies and a possible interviral recombination mechanism. Environ Microbiol 11: 2849–2862.
- 57. Simpson RJ, Cram ED, Czolij R, Matthews JM, Crossley M, et al. (2003) CCHX zinc finger derivatives retain the ability to bind Zn(II) and mediate protein-DNA interactions. J Biol Chem 278: 28011–28018.
- 58. Cordier F, Vinolo E, Veron M, Delepierre M, Agou F (2008) Solution structure of NEMO zinc finger and impact of an anhidrotic ectodermal dysplasia with immunodeficiency-related point mutation. J Mol Biol 377: 1419–1432.
- 59. Pavletich NP, Pabo CO (1993) Crystal structure of a five-finger GLI-DNA complex: new perspectives on zinc fingers. Science 261: 1701–1707.
- 60. Stoll R, Lee BM, Debler EW, Laity JH, Wilson IA, et al. (2007) Structure of the Wilms tumor suppressor protein zinc finger domain bound to DNA. J Mol Biol 372: 1227–1245.
- 61. Mackay JP, Kowalski K, Fox AH, Czolij R, King GF, et al. (1998) Involvement of the N-finger in the self-association of GATA-1. J Biol Chem 273: 30560–30567.
- 62. Newton A, Mackay J, Crossley M (2001) The N-terminal zinc finger of the erythroid transcription factor GATA-1 binds GATC motifs in DNA. J Biol Chem 276: 35794–35801.
- 63. Liew CK, Simpson RJ, Kwan AH, Crofts LA, Loughlin FE, et al. (2005) Zinc fingers as protein recognition motifs: structural basis for the GATA-1/friend of GATA interaction. Proc Natl Acad Sci U S A 102: 583–588.
- 64. Feng B, Marzluf GA (1998) Interaction between major nitrogen regulatory protein NIT2 and pathway-specific regulatory factor NIT4 is required for their synergistic activation of gene expression in Neurospora crassa. Mol Cell Biol 18: 3983–3990.