A Phylogenetic Study of SPBP and RAI1: Evolutionary Conservation of Chromatin Binding Modules

Our genome is assembled into and array of highly dynamic nucleosome structures allowing spatial and temporal access to DNA. The nucleosomes are subject to a wide array of post-translational modifications, altering the DNA-histone interaction and serving as docking sites for proteins exhibiting effector or “reader” modules. The nuclear proteins SPBP and RAI1 are composed of several putative “reader” modules which may have ability to recognise a set of histone modification marks. Here we have performed a phylogenetic study of their putative reader modules, the C-terminal ePHD/ADD like domain, a novel nucleosome binding region and an AT-hook motif. Interactions studies in vitro and in yeast cells suggested that despite the extraordinary long loop region in their ePHD/ADD-like chromatin binding domains, the C-terminal region of both proteins seem to adopt a cross-braced topology of zinc finger interactions similar to other structurally determined ePHD/ADD structures. Both their ePHD/ADD-like domain and their novel nucleosome binding domain are highly conserved in vertebrate evolution, and construction of a phylogenetic tree displayed two well supported clusters representing SPBP and RAI1, respectively. Their genome and domain organisation suggest that SPBP and RAI1 have occurred from a gene duplication event. The phylogenetic tree suggests that this duplication has happened early in vertebrate evolution, since only one gene was identified in insects and lancelet. Finally, experimental data confirm that the conserved novel nucleosome binding region of RAI1 has the ability to bind the nucleosome core and histones. However, an adjacent conserved AT-hook motif as identified in SPBP is not present in RAI1, and deletion of the novel nucleosome binding region of RAI1 did not significantly affect its nuclear localisation.


Introduction
Packaging genetic information into nucleosome structures represents barrier to any cellular process that needs to access DNA, including replication, repair, recombination, transcription, gene silencing and RNA maturation. There exist two main regulatory mechanisms, chromatin modifying activity and chromatin remodeling activity, which increase accessibility of genome to DNA binding proteins. Chromatin modifying activities lead post-translational modifications (PTMs) on histone tails and histone core, whereas chromatin remodeling activities modify non-covalent interactions between DNA and histones (reviewed in 1). The PTMs of histones and other chromatin proteins, form dynamic platforms that are able to assemble the machineries involved in DNA metabolism and to recruit chromatin modifying activities. Important players are chromatin binding proteins containing so-called "reader" modules recognizing specific PTMs on the histones. The "reader" proteins are often inherent in large multi-protein complexes classified as transcriptional co-activators [2][3][4]. Coactivator complexes generally exhibit several "reader" and effector modules, linking in a combinatorial way particular PTM marks to defined downstream events. A number of histone "reader" modules with distinct structural folds have been identified and characterized [reviewed in 5], such as bromo, chromo, Tudor, PWWP, MBT, plant Agenet and plant homeodomain (PHD) fingers. Recently several structural studies highlighting the combinatorial read out of multiple histone marks by single or tandemly arranged reader modules are reported [6][7][8][9][10][11][12][13][14][15][16][17].
Stromelysin-1 PDGF (platelet-derived growth factor)responsive element binding protein (SPBP; also named as TCF20) and Retinoic Acid Induced protein (RAI1) are two nuclear chromatin binding multidomain proteins containing seven regions with strong sequence similarity, in addition to similar domain organization (Figure 1 A). Their genome structures suggest that they probably have evolved by gene duplication [18]. Both proteins are involved in modulation of transcriptional activity. However, in our studies SPBP seems to act as a transcriptional activator, while RAI1 acts more as a transcriptional repressor [19]. Notably, SPBP exhibit several regions with ability to bind DNA and/or nucleosomes: i) an ePHD/ADD domain (consisting of an atypical PHD domain and a putative GATA-1 like zinc-finger both with the potential to bind chromatin) ii) a novel nucleosome binding region [20] and iii) a DNA binding domain with an AT-hook motif [21] ( Figure  1A). Additionally, the ePHD/ADD domain contributes in proteinprotein interactions [19]. Also the ePHD/ADD domain of RAI1 has the ability to bind nucleosomes [20]. However, other nucleosome or DNA binding activities in RAI1 has so far not been characterized. Interestingly, the ePHD/ADD domain of SPBP and RAI1 contains a long loop region between zinc ligands 2 and 3. Such long loop region is not found in other proteins containing ePHD/ADD domains, makes it a hallmark of SPBP and RAI1.
SPBP is expressed in most tissues ( [21] and www.proteinatlas.com), and was originally found to be involved in transcriptional activation of Matrix Metalloproteinase 3 promoter via a specific DNA sequence [22]. Later, SPBP is shown to act as a transcriptional coactivator enhancing the transcriptional activity of certain transcription factors and cofactors [21,23]. SPBP is also shown to repress the transcriptional activity of estrogen receptor α (ERα) [24]. Overexpression of SPBP inhibited the proliferation of an ERαdependent breast cancer cell line [24]. RAI1 shows a more restricted expression pattern than SPBP, mainly detected in kidney, endocrine glands, liver and neuronal cells (www.proteinatlas.com). RAI1 is clinically associated with Smith-Magenis Syndrome [25], Potocki-Lupaski Syndrome [26] and schizophrenia [27]. RAI1 is also described to be related with non-syndromic autism [28] and spinocerebellar ataxia type 2 [29]. So far, no clinically related disease has been connected to mutations or deletions in SPBP.
In this study, we utilized in vitro and yeast two-hybrid interaction studies to investigate whether the putative cysteinerich motif preceding the atypical PHD domain of SPBP has the ability to form a GATA-1 like structure. On the basis of the predicted structural organization of the SPBP and RAI1 ePHD/ADD domains, the conservation of SPBP and RAI1 like sequences were analyzed using phylogenetic constructions. Our findings indicate the existence of both genes in vertebrates, one gene in lancelet and insects, while no genes were identified in plants, fungi or lower eukaryotes. Furthermore, the novel nucleosome binding region with the adjacent AT-hook motif was highly conserved within the SPBP proteins, while the AT-hook motif was absent in RAI1. However, in vitro pull down assays suggested that the region of RAI1 similar to the novel nucleosome binding region of SPBP, had the ability to bind nucleosomes and histones. Hence, organization of two independent chromatin binding regions in the transcriptional coregulators SPBP and RAI1 seems to be conserved in evolution.

Plasmid constructs
Plasmids used in this work are listed in Supporting information, Table S1, and specific PCR primers used for amplification of cDNAs are listed in Supporting Information, Table S2. All PCR reactions were performed with the Pfu polymerase according to the instructions from the manufacturer (Stratagene). Plasmid constructs were verified by DNA sequencing using the BigDye sequencing kit (Applied Biosystems). Most of the expression plasmids were generated using the Gateway Cloning System (Invitrogen). The cDNA constructs were either subcloned using restriction enzymes, or transferred via the BP reaction of attB-flanked PCR products, into Gateway entry/donor vectors. Expression clones were made as described in the Gateway cloning technology instruction manual (Invitrogen). Mutagenesis of plasmid DNA was performed using the QuikChange site-directed mutagenesis kit (Stratagene, La Jolla, CA). Oligonucleotides for PCR, DNA sequencing and mutagenesis reactions were obtained from Invitrogen, Sigma Aldrich and Eurogentec. All other constructs including full length SPBP and RAI1 constructs are described previously [19][20][21]

Yeast two-hybrid screening and interaction assays
The yeast two-hybrid screening and interaction assays were carried out as described previously [19,30].

Bioinformatics
The Blastp program was used to search with the ePHD/ADD domain (aa1690-1939) of SPBP for similar regions in other species (http://www.uniprot.org/). The alignment of the deduced amino acid sequences were carried out using ClustalW multiple alignment software. The online server of PhyML 3.0 was used to study phylogenetic evolution of ePHD/ADD domain of SPBP (http://www.atgc-montpellier.fr/ phyml/). The phylogenetic and evolutionary trees were constructed by maximum likelihood method with the help of Mega5 software (http://www.megasoftware.net/).
The Blastp program was also used to search for putative nucleosome-binding regions in other species with similarity to the novel nucleosome binding region in SPBP and RAI1 (http:// www.uniprot.org/). The alignment of the deduced amino acid sequences were carried out using ClustalW.

Evolutionary Conservation of SPBP and RAI1
Cell imaging 6000 cells/well were seeded in eight-well cover glass slides (Nunc) and transfected with 100-200 ng of plasmid expression vectors using TransIT-LT1 (Mirus, MIR2300) reagent. One day post-transfection cells were analyzed by live-cell confocal microscopy and images were examined using Zeiss Axiovert 200 microscope with a 40X, 1.2W C-Apochroma objective, equipped with an LSM510-META confocal module. shown. Asterisks mark Cys and His residues that may serve as zinc ligands. The F box is indicated in dark grey. The N-and Cterminal amino acid positions demarcating the SPBP ePHD/ADD domain are shown. Truncated versions of the ePHD/ADD domains of SPBP and RAI1, containing zinc-ligands 3-12 interact with their own and the others F box, as revealed by two-hybrid analysis. The F boxes of both proteins were fused to GAL4-AD and tested towards GAL4-DBD constructs of SPBP(ePHD), SPBP(ePHD3-12), RAI1(ePHD3-12), MLL1(ePHD), MLL2(ePHD) and MLL3(ePHD). Interactions are indicated by growth on QDO medium. Yeast cell suspensions were spotted onto plates and allowed to grow at 30°C for 3 days. (D) Deletion-and pointmutational analysis of the ePHD/ADD domain of SPBP using the two-hybrid interaction system. SPBP (ePHD/ADD) (black), SPBP (ePHD3-12) (grey) and the PHD finger, named SPBP (ePHD5-12) (white) were fused to GAL4-DBD and tested for interaction with the F box of SPBP fused to GAL4-AD. In addition, three constructs obtained by site-directed mutagenesis, generating mutations in the putative zinc-ligands 1 and 2 in the F box of SPBP(ePHD/ADD) were fused to GAL4-DBD, and tested for interaction to the SPBP F box fused to GAL4-AD. Cysteine(s) are mutated to alanine(s) as indicated in (C). doi: 10.1371/journal.pone.0078907.g001

Isolation of soluble mono-and di-nucleosomes and GST-nucleosome pull-down assay
Soluble mono-and di-nucleosomes from HeLa cells were prepared as described [20]. GST-nucleosome pull-down assay is described previously [20].

GST pull-down assay
Expression and purification of GST, GST-fusion proteins and GST pull-down assay with in vitro translated 35 S-labeled proteins were carried out as described [19].

Experimental validation of a zinc finger module Nterminal of the atypical PHD domain of SPBP
Alignment of the putative ePHD/ADD domains of SPBP and RAI1 with structurally known ePHD/ADD domains, suggests that an approximately 150 amino acid region N-terminal to the atypical PHD module may form a zinc finger structure similar to the ATRX/DNMT3 ePHD/ADD domains [20]. However, SPBP and RAI1 contain an extraordinary long loop region between the putative zinc ligands 2 and 3. The loop consists of more than 100 amino acids, and hence may form into a structure restricting the formation of a GATA-1 like zinc finger as reported for ATRX [31] and DNMT3 [32]. To experimentally evaluate the predicted model of SPBP/RAI1 ePHD/ADD domain ( Figure 1B), we first experimentally validated interaction between the F-boxes of SPBP/RAI1, containing zinc ligands 1 and 2, and the C-terminal part of the SPBP/RAI1 ePHD/ADD domains starting from zinc ligand 3 (ePHD3-12). Fragments containing the F box region of SPBP and RAI1 were fused in frame with the GAL4 activation domain (GAL4-AD), and tested against different ePHD domain constructs of RAI1 and SPBP, fused to the GAL4 DNA binding domain (GAL4-DBD), in the yeast two-hybrid interaction system. Interestingly, the F boxes of both SPBP and RAI1 associated with the truncated ePHD domain (ePHD3-12) of both proteins ( Figure  1C), but not with the ePHD/ADD domains (ePHD/ADD) of the trithorax family proteins; the histone methyltransferases mixedlineage leukemia genes MLL1, MLL2 and MLL3 ( Figure 1C). In addition, they also failed to interact with the complete ePHD domains (ePHD1-12) of SPBP and RAI1 in this assay system ( Figure 1C and data not shown). The cross-reaction between the F-boxes of SPBP and RAI1 with their ePHD (3)(4)(5)(6)(7)(8)(9)(10)(11)(12) are not surprising, since approximately 45% of the amino acids, including the two cysteine residues that are proposed to function as zinc-ligands, are conserved between their F boxes.
Since the complete ePHD/ADD domain (ePHD/ADD) of SPBP failed to interact with the F box in the two-hybrid system, we asked if such an interaction could be observed if we disrupted the putative zinc finger structure formed by ligands 1-4, by mutating ligands 1 and 2. The idea being that in the yeast two-hybrid system, the intramolecular binding between the F box and the core PHD zinc finger would prevent any intermolecular interactions to take place. Thus, by mutating zinc-ligands 1 and 2, the interactions surface on the core PHD would be available for interactions in trans. As seen in Figure  1D, mutation of both putative zinc ligands (1 and 2) in the F box, or only ligand 2 to alanine(s) enabled interactions in trans in the two-hybrid system. As a control, we mutated the cysteine residue between ligands 1 and 2, which is not thought to be a zinc-ligand. This mutation did not enable an interaction in trans.
Taken together, the results presented above indicate a binding mode where the F box in one molecule interacts with the ePHD domain in the other. However, the assays used measure intermolecular-and not intramolecular interactions. We suggest that within the ePHD/ADD domains of both SPBP and RAI1 the F box may bind to part of the ePHD3-12 region in the same molecule. To determine whether the PHD module, containing zinc-ligands 5 to 12 of the SPBP ePHD domain, displays an interaction surface used by the F box, the PHD of SPBP was fused to GAL4-DBD, generating GAL4-DBD-SPBP (ePHD5-12). Subsequently, this construct was tested for binding to the F box of SPBP in the yeast two-hybrid system. Interestingly, the SPBP PHD module associated with the F box, showing that an interaction surface sufficient for binding the F box residues is present in the PHD module ( Figure 1D). To conclude, our results are compatible with the model shown in Figure 1B where the F box region interacts with the PHD module. This, together with the sequence homology between SPBP ePHD/ADD and other ePHD/ADD domains, suggest that SPBP ePHD/ADD share the cross-braced topology of zincbinding interactions demonstrated for the ePHD/ADD domains of ATRX and DNMT3 [31,32].

Both SPBP and RAI1 are present in vertebrates, while only one gene is found in insects and lancelet
The PHD finger represents one of the most abundant modules present in nuclear and chromatin binding proteins [recently reviewed in 33,34]. This structurally conserved motif consists of a two-strand antiparallel β-sheet often followed by a C-terminal α-helix, stabilized by two zinc atoms contacted by Cys4-His-Cys3 ligands forming a cross-branch topology ( Figure 1B) [35]. PHD fingers are generally accepted as "readers" of the histone code, with ability to read the N-terminal tail of histone H3, even some PHD fingers are found not to bind histones or nucleosomes. Based on their sequence homology and structural plasticity, the PHD fingers may be divided into five subgroups each "reading" specific sets of histone marks [33,34]. PHD fingers are frequently paired with other "reader" modules, the PHD finger itself, and the GATA-1 finger that may both be a DNA binding module or a "reader" module [6], or with other DNA binding modules such as the AT-hook motif. Above and previously, we show that the PHD fingers of SPBP and RAI1 is closely linked to an N-terminal zinc finger structure with similarity to the ePHD/ADD fingers found in eight other groups of chromatin binding proteins [20]. However, since the extra zinc fingers of SPBP and RAI1 contain an extraordinary long loop between zinc ligands 2 and 3, they constitute a specific subgroup of ePHD/ADD modules. This hallmark of SPBP and RAI1 was used as criteria in BLAST search to retrieve putative SPBP and RAI1 sequences in genomes beside humans. In addition, selection was based on protein size and two conserved tryptophan residues, one two positions N-terminal to zinc-ligand 3 and one three positions C-terminal to zinc-ligand 4, in the GATA-1 like finger (Figure 2A). This approach yielded Evolutionary Conservation of SPBP and RAI1 64 SPBP and RAI1-like sequences in vertebrates spanning from bony fishes to mammals, in addition to 17 SPBP/RAI1-like sequence in lancelets and insects ( Figure 2B/C). No SPBP or RAI1 like sequences were identified in fungi and plants, suggesting that SPBP/RAI1 like genes have occurred in the common ancestor of vertebrate and insects, and branched out in evolution. In figure 2, a representative selection of SPBP and RAI1 homologues and their distribution in a phylogenetic tree are presented. Both SPBP and RAI1proteins are found in vertebrate species like Western clawed frog (Amphibia), different mammals, birds, reptiles, zebra fish (Actinopterygii), and West Indian Ocean coelacanth (Sarcopterygii). A putative uncharacterized protein in invertebrate species such as Florida lancelet (Leptocardii) shows 38% sequence similarity to human RAI1 protein and various insect groups such as Drosophila, Mosquito and Ants species also show varying similarities with RAI1. From the phylogenetic and evolutionary trees we speculate that the RAI1 gene is the ancestral gene. A gene duplication event [36] may have taken place early in the vertebrate evolution, just after branching from insects, giving rise to SPBP. Alignment of the dataset from the retrieved SPBP and RAI1 ePHD/ADD domains resulted in a region with a length of approximately 150 amino acids (Supporting information, Figure S1), after removing the loop region between zinc ligand 2 and 3. Phylogenetic and evolutionary trees were constructed using the maximum likelihood method revealing two well supported clusters corresponding to SPBP and RAI1 in vertebrates ( Figure 2B). The SPBP ePHD/ADD domains from zebrafish to human are highly conserved, with an overall sequence identity of 48%. Similarly, conservation of the RAI1 ePHD/ADD sequences are 33%. Interestingly, the conservation of all RAI1 and SPBP ePHD/ADD domains are high, especially the residues constituting the GATA-1 like finger (Figure 2A). Structural and biochemical analysis of the GATA-1 finger in the ATRX ePHD/ADD domain has revealed that it is a "reader" module, recognizing H3K9me2/3. The residues involved (Tyr203, Ile209, and Ala224) are localized in the region between zinc-ligand 4 of the GATA-1 finger, and zinc-ligand 3 in the ATRX PHD finger, which forms a polar K9me3 reader pocket [37]. This region between zinc ligands 4 and 6 is highly conserved between the SPBP and RAI1 proteins, containing several conserved hydrophobic residues (gly, val, leu, ala, tyr, trp), in addition to conserved basic and acidic residues, similarly as ATRX. The positions corresponding to the three amino acids involved in H3K9me3 recognition (Tyr203, Ile209, and Ala224) in ATRX, are conserved and occupied by amino acids with similar properties (Trp, Leu, Gln respectively) in SPBP and RAI1. These similarities suggest that the GATA-1 like fingers of SPBP and RAI1 may constitute histone reader modules, recognizing specific histone marks such as the ATRX GATA-1 like finger. This hypothesis, suggesting the GATA-1 like finger to be a histone binding module, is further supported by the structural analysis of the PHD finger of UHRF1 [38]. Here the PHD finger including an additional N-terminal zincfinger motif together forms a large surface cavity that interacts with the N-terminal tail of Histone H3. Notably, the GATA-1 like fingers of SPBP and RAI1 exhibit this long loop region between zinc ligand 2 and 3. The loop region is poorly conserved and not included in the phylogenetic data sets. Comparison with the structure of ATRX suggests that this long loop region may interact with the long C-terminal α-helix often present in ePHD/ADD domains [6]. In SPBP, several serine and threonine residues within the long loop region is reported to be phosphorylated (www.phosphosite.org). Interestingly, a missense mutation of serine 1808 in the long loop of RAI1 (S1808N) is connected to Smith-Magenis Syndrome [39] indicating that the long loop is important for proper function of RAI1. Furthermore, phosphorylation of a residue in a loop region between the ePHD/ADD finger and the Tudor domain of UHRF1, can modulate the relative position of the reader modules and thereby alter its histone binding mode [9]. This indicates that the long loop regions between zinc ligand 2 and 3 in the GATA-1 like zinc finger may have regulatory roles modulating the SPBP/RAI1-nucleosome interaction.
The PHD fingers of SPBP and RAI1 are atypical, displaying "Cys4HisCys2His" fingers lacking the conserved tryptophan residue characteristic of the conventional PHD finger [20]. Figure 2A reveals that the two loop regions, Loop1 and Loop3, in the PHD are highly conserved. Loop1 is glycine rich containing glutamate and a conserved Threonine residue. Hence, this seems to be an acidic region, and phosphorylation of the threonine residue will make Loop1 even more acidic. Specific tyrosine and methionine residues in Loop1 are involved in recognizing H3K4me2/3 marks in certain PHD subgroups [34]. These residues are not present in the RAI1/ SPBP PHDs, confirming our previous finding that the SPBP PHD does not recognize the H3K4me2/3 marks. For both proteins, Loop3 is rich in acidic residues, in addition to a conserved serine residue and a bulge hydrophobic Phenylalanine residue. Interestingly, mutation of one of the charged amino acids in Loop3 of RAI1 (D1885N) may be involved in familial Smith-Magenis syndrome [40]. Loop3 in the PHDs of PYGO1 and MLL proteins, are involved in proteinprotein interactions with non-histone partners. We have previously reported the ePHD/ADD domain of SPBP to be important for specific protein-protein interactions [19]. Hence, Loop3 in the PHD of SPBP and RAI1 may constitute a negatively charged interaction surface important for recruitment of proteins to specific chromatin sites.

The novel chromatin binding region including the adjacent AT-hook motif is conserved in evolution
Previously we have identified a novel nucleosome binding region adjacent to the DNA binding domain with an AT-hook motif in human SPBP [20]. Sequence alignments predicted a similar region in human RAI1, however a complete AT hook motif could not be detected and a RAI1 DNA binding region has not been described in the literature. To investigate whether the nucleosome-binding region following an AT-hook motif is a conserved feature of the SPBP and RAI1 proteins, blast searches and manual inspection of the corresponding sequences were carried out. Alignment of the dataset from the novel nucleosome binding regions resulted in a region of approximately 130 amino acids ( Figure 3A). Alignments and phylogenetic studies divided the sequences into two clusters corresponding to SPBP-like proteins and RAI1 like proteins. Evolutionary Conservation of SPBP and RAI1 Cysteine and histidine residues that may serve as zinc-ligands are indicated by numbers above the alignment. The long loop region between zinc ligands 2 and 3 in all species is excluded from the alignment, but indicated above. The regions encompassing the putative GATA-1 like finger and the PHD finger are indicated above, while the Loop1, Loop2 and Loop3 indicated below are loop structures generally found in PHD fingers. The conserved Tryptophan units used to select blasted sequences are indicated by arrows, while amino acids probably involved in Smith-Magenis syndrome are encircled. Two of the RAI1 sequences (A.carolinensis and L.chalumnae) are truncated in the C-terminal end probably due to sequencing errors. The threshold for shading in manually refined Clustal W sequence alignment was set to 50% using BLOSUM 62 scoring matrix. (B) Phylogenetic tree of the ePHD/ADD domains of SPBP and RAI1 proteins. The phylogenetic tree was constructed by Mega5 software using the maximum likelihood method. Numbers in the branch represents the bootstrap values. Background color coding is used to represent species which possess SPBP (blue), RAI1 (pink) and uncharacterized SPBP/RAI1 like proteins (green). The phylogenetic tree is based on selected protein sequences from representative vertebrate organisms (e.g. Homo sapiens represent Mammalia). (C) The evolutionary tree of the ePHD/ADD domains of SPBP and RAI1 proteins. The alignment of all sequences is shown in Supporting information, Figure S1 and the protein sequence accession numbers are provided in the Supporting information, Table S3. The average sequence identity between the novel nucleosome binding regions in SPBP proteins was 65%, while the identity in the RAI1 cluster was 33%. Interestingly, the AT-hook motif is completely conserved in the SPBP proteins, indicating that DNA binding activity is an important feature ( Figure 3A). In RAI1 however, no AT-hook motif is present and the conservation of this region is low. Only in mammalian RAI proteins, the core GRP motif of the AT-hook is present. If RAI1 is the ancient protein, this GRP motif may be a remnant of a DNA binding activity lost in evolution. The C-terminal part of the novel nucleosome binding region is strongly conserved between SPBP and RAI1 in all species. This region exhibit the E-box ( Figure 1A), previously reported to be similar between SPBP and RAI1 in humans [21].

The putative nucleosome binding region of RAI1 displays nucleosome and histone binding activity in vitro
Based on the conservation of the region encompassing the novel SPBP nucleosome binding region; we wanted to determine whether this region of RAI1 has the ability to bind nucleosomes and histones. The corresponding region of RAI1 was expressed as a GST-fusion protein and exposed to nucleosomes isolated from HeLa cells in a GST-nucleosome binding assay. The results presented in Figure 4A show that RAI1 (1523-1627) binds to HeLa nucleosomes in vitro equally well as SPBP (1551-1666). This binding was not dependent on the histone tails, since partial Trypsin digestion of the nucleosomes did not reduce binding ( Figure 4A, right panel). The RAI(ePHD) and RAI(PHD), on the other hand, did not bind to Trypsin treated nucleosomes, indicating that the nucleosome binding activity of the RAI(ePHD) is histone tail dependent, as previously found for the SPBP(ePHD) domain [20]. Furthermore, GST-pulldown assays using various Histone H3 and Histone H2 variants, showed strong interaction between RAI1(1523-1627) and histones (Figure 4b). This is in line with our previous FRAP analysis of the RAI1 mobility in HeLa cells, which indicated strong association to nuclear structures [20]. Deletion of the region encompassing the novel nucleosome binding region in SPBP resulted in redistribution into round nuclear dots instead of the characteristic speckled distribution due to enrichment on chromatin [20]. To determine whether  Evolutionary Conservation of SPBP and RAI1 this corresponding nucleosome binding region of RAI1 was important for its nuclear distribution, a deletion construct (Δ1523-1627) of RAI1 fused to EGFP was transfected into HeLa cells and its localization analyzed by confocal microscopy ( Figure 4C). The nuclear distribution of RAI1(Δ1523-1627) was similar to full length RAI1 in all cells, indicating that also other regions of RAI1 are important for its localization to chromatin. A strong candidate here is the ePHD/ADD domain. However, a deletion construct expressing RAI1 without the ePHD/ADD domain displayed similar localization as full length RAI1. This strongly indicate that both the ePHD/ADD and the 1523-1627 region of RAI1 contributes to its nuclear distribution pattern. The presence of several regions in a protein that contribute to chromatin localization is shown for other proteins [6]. To summarize, the novel nucleosome binding region previously identified in SPBP is evolutionary conserved and present also in RAI1 proteins. However, in contrast to SPBP, RAI1 proteins are generally lacking the adjacent AT-hook motif, and the novel nucleosome binding region does not contribute alone to the enrichment of RAI1 on chromatin.

Conclusion
Here we have shown that SPBP and RAI1 are evolutionary conserved nuclear proteins containing several motifs for chromatin interactions. They exist as two different genes in vertebrates, while only one gene was identified in insects and lancelet. Two conserved regions for chromatin interaction were identified in RAI1, a novel nucleosome binding region located between amino acids 1523 and 1627, and the C-terminal atypical PHD finger connected to a GATA-1 like zinc finger structure (ePHD/ADD domain). SPBP on the other hand, seems to exhibit several conserved regions involved in DNA and chromatin binding; the novel nucleosome binding region, the AT-hook motif, and the ePHD/ADD domain. Interestingly, SPBP also contains a putative HP1 binding region (PxVxL) at position 1329 (PAVTL). All these regions are localized in the Cterminal half of the protein, while most of the N-terminal part has been identified as a transcriptional activation domain, suggesting that it has strong capability to interact with other proteins involved in transcription. Interaction of SPBP with the ligand activated transcription factors Androgen receptor and Estrogen receptor is known [23,24]. Interestingly, a yeast twohybrid screen acknowledged the RNA polymerase II subunit Rpb7, the RNA binding proteins hnRNPH/F, the SWI/SNF complex protein ARID1B and the chromatin modulator G9a as putative interaction partners of SPBP (our unpublished results). This suggests that SPBP and RAI1 may act as scaffold proteins, mediating interaction between specific sites on chromatin with chromatin modulation complexes, transcriptional regulators and the general transcription machinery simultaneously. Hence, SPBP and RAI1 may have an important role in integration of signaling pathways and regulation of gene transcription. Notably, the interaction between SPBP/RAI1 and chromatin seems to be strictly regulated, since several sites within the ePHD/ADD domain and the novel nucleosome binding domain is targeted by posttranslational modifications [www.phososite.org, [20]]. This putative ability to mediate regulated cross talk between the "histone code" and diverse cellular pathways that the nonhistone partners participate in, seems to be connected to several other multidomain proteins containing chromatin "reader" domains [6][7][8][9][10][11][12][13][14][15][16][17]33,41].
Evolutionary Conservation of SPBP and RAI1