In animals, circadian rhythms are driven by oscillations in transcription, translation, and proteasomal degradation of highly conserved genes, resulting in diel cycles in the expression of numerous clock-regulated genes. Transcription is largely regulated through the binding of transcription factors to cis-regulatory elements within accessible regions of the chromatin. Chromatin remodeling is linked to circadian regulation in mammals, but it is unknown whether cycles in chromatin accessibility are a general feature of clock-regulated genes throughout evolution. To assess this, we applied an ATAC-seq approach using Nematostella vectensis, grown under two separate light regimes (light:dark (LD) and constant darkness (DD)). Based on previously identified N. vectensis circadian genes, our results show the coupling of chromatin accessibility and circadian transcription rhythmicity under LD conditions. Out of 180 known circadian genes, we were able to list 139 gene promoters that were highly accessible compared to common promoters. Furthermore, under LD conditions, we identified 259 active enhancers as opposed to 333 active enhancers under DD conditions, with 171 enhancers shared between the two treatments. The development of a highly reproducible ATAC-seq protocol integrated with published RNA-seq and ChIP-seq databases revealed the enrichment of transcription factor binding sites (such as C/EBP, homeobox, and MYB), which have not been previously associated with circadian signaling in cnidarians. These results provide new insight into the regulation of cnidarian circadian machinery. Broadly speaking, this supports the notion that the association between chromatin remodeling and circadian regulation arose early in animal evolution as reflected in this non-bilaterian lineage.
The phenotype of an organism cannot be fully explained by its genome; it is now clear that other factors contribute to the ecology and evolution of animals. The DNA molecule is exceptionally long and has to fit into a small nucleus; therefore, the DNA is wound around proteins called histones in a unique architectural complex called a nucleosome (that further forms the chromatin). This DNA structure plays a crucial role in gene regulation, as nucleosomes alternate accessibility to regulatory sites, that in turn navigate expression patterns that allow gene expression or suppression. It is well known that chromatin remodeling is associated with circadian regulation in mammals, but it was uncertain whether cycles in chromatin accessibility are a conserved feature of clock-regulated genes throughout the animal kingdom. We used ATAC-seq and RNA-seq data to show that the association between chromatin remodeling and circadian regulation arose early in animal evolution as reflected in the sea anemone, Nematostella vectensis, model organism.
Citation: Weizman EN, Tannenbaum M, Tarrant AM, Hakim O, Levy O (2019) Chromatin dynamics enable transcriptional rhythms in the cnidarian Nematostella vectensis. PLoS Genet 15(11): e1008397. https://doi.org/10.1371/journal.pgen.1008397
Editor: Peter Sarkies, MRC Clinical Sciences Centre, UNITED KINGDOM
Received: December 5, 2018; Accepted: September 2, 2019; Published: November 6, 2019
Copyright: © 2019 Weizman et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: ATAC-seq data of this work can be found under bioproject PRJNA471067, URL: https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA471067. RNA-seq data that was used in this work are under bioproject PRJNA246707, URL: https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJNA246707.
Funding: The research leading for this paper was funded by the Moore Foundation (https://www.moore.org), “Unwinding the Circadian Clock in a Sea Anemone” (Grant #4598) to A.T and O.L. The founders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Circadian clocks are present in most organisms and have evolved to help organisms anticipate daily and seasonal rhythms and adjust their biochemical, physiological, and behavioral processes accordingly. The molecular basis of the endogenous clock apparatus is manifested by transcriptional machinery that is controlled by regulatory factors and organized in auto-regulatory feedback loops . In mammals, these temporal oscillations in gene expression are paralleled by genome-wide chromatin remodeling events that provide flexibility to circadian regulation . In mammals, several chromatin remodelers are involved in circadian regulation, including the core circadian protein CLOCK, which can operate as an acetyltransferase on histone H3 at K9 and K14; this acetylation is associated with a permissive chromatin state for transcription . The CLOCK histone acetyltransferase (HAT) domain was previously shown to be conserved across species [4,5], and similar mechanisms of the circadian epigenome have been suggested in Drosophila [6,7]. However, no study to date has investigated chromatin dynamics concerning the biological clock in non-bilaterian animals.
Eukaryotic DNA is wound around histone proteins in a complex called a nucleosome, which helps to compress the molecule into the cell nucleus. This complex is regulated as histones are removed to expose regulatory sites, such as cis-regulatory elements (CREs) and promoters, to allow the binding of transcription factors (TFs) and other regulatory proteins. Identification of enriched motifs with these active CREs can, therefore, reveal genes associated with the regulation of the transcriptional network . Genome-wide mapping of TFs binding to chromatin is frequently done by chromatin immunoprecipitation (ChIP) based methods, such as ChIP-seq . However, these techniques are expensive and require a significant amount of tissue and extensive processing of the sample. An assay for Transposase-Accessible Chromatin with high-throughput sequencing (ATAC-seq) is a technology that favors the sequencing of accessible chromatin loci  and has the potential to overcome these limitations. While ATAC-seq is a powerful and promising approach for epigenetic regulation research, it has primarily been applied within well-characterized model systems.
In this study, we set out to expand the current knowledge of metazoan circadian gene expression regulation by understanding the interplay between chromatin accessibility and circadian gene expression dynamics. We focused on the phylum Cnidaria, the sister-lineage to bilaterian animals, and specifically chose the sea anemone, Nematostella vectensis, which has emerged as a model for studying development, differentiation, and more recently, circadian regulation [11,12]. N. vectensis is widely distributed in shallow brackish environments and unsurpassed regarding the ease with which its entire lifecycle can be maintained in the laboratory [13,14]. Studies of N. vectensis locomotor activity and rhythmic gene expression, including previous work by our group [15–18], have provided a first glance into the evolution of the metazoan circadian clock (S1 Fig). In this study, developing an optimized protocol and applying ATAC-seq enabled us to detect accessible chromatin regions under light-dark (LD) and constant darkness (DD) conditions, and refine our lab’s previous findings . These findings led us to hypothesize that circadian modulation of chromatin remodeling occurs on a greater scale than previously shown by gene expression profile only. Integrating chromatin accessibility profiles with transcription profiles (RNA-seq) revealed that the majority of cyclic genes were associated with ATAC-seq peaks within their promoters. This work opens a path into the evolution of basal metazoans’ circadian transcription and regulation, showing the association of gene activity with chromatin accessibility. Therefore, chromatin structure may play an important role in regulating circadian gene expression in N. vectensis.
Nuclear isolation and ATAC-seq analysis from whole animal samples
ATAC-seq was applied to measure high-resolution chromatin accessibility in N. vectensis under two light regimes, resembling maximum and minimum temporal behavioural activity. Nuclear integrity  was achieved by (i) dissolving animal tissue into single cell suspensions and (ii) separating released nuclei from other organelles and cytoplasmic debris to reduce non-nuclear DNA contamination, as presented in Fig 1A. Each ATAC-seq library was prepared using a transposition reaction from ~400,000 cells that were sampled from one individual animal, from circadian time 13 (CT13) to circadian time 45 (CT45), in 8 hour intervals. Libraries were sequenced from two independent biological replicates from the LD treatment and compared to two independent biological replicates collected at the same time, under DD. After filtering the PCR duplicates, generated from the library amplification process and irregular reads between the biological replicates, our ATAC-seq libraries showed a median depth of 10 million unique, high-quality mapped reads per sample (Tables 1, 2 and Fig 1). As presented in S1 Table, the biological replicates were highly similar (average R2 = 0.96 SD ±0.014 and p-value<0.01), demonstrating highly reproducible data from N.vectensis whole animal sampling. Furthermore, the significant peaks (8366–36582 peaks with an irreproducible discovery rate (IDR) cut-off of 0.05) from both LD and DD treatments were clustered around transcriptional start sites (TSSs, Fig 1B and 1C and S2 Fig).
(A) A scheme describing the ATAC-seq process from sampling of whole animals to ATAC-seq analysis. (B and C) Example histograms from CT13 LD and DD, showing the peak distribution around TSS. The blue bars indicate the approximate place of the next nearest TSS (~8,600bp).
ATAC-seq provides a glimpse into N. vectensis genome regulatory regions
Many ATAC-seq peaks, from both the LD and DD samples, were mapped within 1500 bp upstream to the TSS (S2 Fig), marking the accessible chromatin of extended promoters. ATAC-seq libraries (LD and DD) were enriched, on average, with 20.87% (SD ±1.76) promoter regions, 20.35% (SD ±1.9) intron regions, and 43.1% intergenic regions (Proximal– 16.6% SD ±1.05, Distal– 26.5% SD ±2.65) that may act as distant regulatory elements (Fig 2A and 2B). Comparing the DD and LD libraries revealed time based clustering, which shows that chromatin accessibility is maintained in N. vectensis under constant conditions (Fig 2C). Moreover, gene ontology (GO) enrichment analysis of genes with accessible promoters showed that nucleic acid binding and transcription regulation activities remain similar between the two light regimes. Interestingly, DD-specific accessible gene promoters were highly enriched with rhodopsin-like, G-protein-coupled receptors (GPCRs), which are related to external signal transduction  (Fig 2D).
(A) Boxplot representing the percentage of genomic features of LD-treated Nematostella vectensis, calculated from biological replicates across experimental sampling points (n = 5). (B) Boxplot representing the percentage of genomic features of DD-treated Nematostella vectensis calculated from biological replicates across experimental sampling points (n = 4). (C) ATAC-seq signal within consensus ATAC-seq peaks was compared between all samples, using Spearman’s ρ to cluster samples. (D) Comparison of GO annotations associated with accessible promoters identified from each library. The degree of enrichment is indicated using normalized Z-scores.
Chromatin accessibility at rhythmic genes
Previously, we showed an association between circadian locomotor activity rhythm and transcriptional profile in N. vectensis. Using Fourier analysis, diel rhythmicity (i.e., 24-h periodicity) was identified in many genes. From these, 180 transcripts exhibiting significant oscillations (G-factor >0.5) were selected for further analysis. Through K-means clustering, these transcripts were divided into five clusters, each representing a different peak time of chronological expression (S1C Fig) . To further study the relationship between promoter accessibility and gene expression, 139 genes were selected, with promoter regions (within 1500 bp upstream of TSS) showing higher accessibility at CT13 (the time point with the strongest change in gene expression between the five clusters) than the average promoter accessibility, genome-wide. To test the association between all the data, we conducted a Pearson correlation test, finding a significant correlation between expression and accessibility (R2 = 0.343 and P-value<0.01 –see S2 Table). Moreover, comparing accessibility and expression within each time point showed that promoters’ accessible sites oscillated during a 24h cycle and correlated to the syn-expression pattern of circadian genes, as shown in Fig 3A and 3B and S2 Table (Note: at CT 37, the correlation is not significant, although the overall pattern is visible). However, we have to remain skeptical, as not all genes in the presented list (S4 Table) correspond to the rule of accessibility and expression correlation. For example, a subset of genes that share a common TF site (CEB/P) show no significant correlation between expression and accessibility. These genes show relatively high accessibility throughout the experiment, with an average–log2(FPKM) rate of 2.5 and a SE of ±0.3. These genes are related to core clock mechanism and development, as we show later.
(A—B) 139 out of 180 genes that were found to be rhythmic in a previous RNA-seq experiment (Oren et al., ) clustered by 5 expression groups and aligned to their accessibility score. RNA expression score is in -log2(fold-change) colored in blue in the range of -1 to 1. ATAC-seq accessibility score is in -log2(FPKM), colored in orange in the range of 0 to 3. (C) Browser view of peaks from two treatments (LD CT13 –red, DD CT13 –blue). LOGO graph indicates the NvC/EBP motif found within marked peaks. Arrows under the peaks track indicate true peaks with IDR cut-off of 0.05. (D) Enriched terms across 15 rhythmic genes, regulated by NvC/EBP in their promoter.
Accessible sites containing motifs and binding sites of TFs
Selective activation of functional regulatory DNA elements defines where TFs may bind and act. Therefore, to predict the identity of active TFs in treatment-specific peaks, the enrichment of sequence motifs was computed using the HOMER motif analysis tool . The identified motifs were divided into three groups: (i) common motifs—motifs that were enriched in both treatments, relative to their abundance within the genome (see Table 3), (ii) LD-enriched (see Table 4), and (iii) DD-enriched (see Table 5). Many of the identified motifs correspond to binding sites of TFs with previously-identified roles in regulating rhythmic processes [22,23]. For example, the MYB motif, enriched in LD-specific peaks, was shown to have an essential role in circadian rhythm maintenance in Arabidopsis . Interestingly MYB motif was enriched around the promoters of differentially expressed (DE) genes under LD conditions, particularly within clusters 1, 2, and 5. Another example is the homeobox motifs, enriched in both the LD and DD treatments, as well as around genes from clusters 1, 3, 4 and 5. Homeobox factors, in particular members of the NK homeobox gene family, with motifs enriched in both treatments, have been shown to contribute to rhythmic regulatory processes in mammals . Within cluster 5, 16 cyclic genes, including NvClock (a core component of the circadian clock machinery), contain the binding motif of C/EBP in their accessible promoter region (Fig 3C and S3 Table). C/EBP acts as an enhancer of promoter activation [26,27], and its association to NvClock promoter was predicted previously . Functional analysis of these 16 genes reveals that they are related to the GO term’s developmental growth (GO:0048589) and cellular response to hormone stimulus (GO:0032870) (Fig 3D) . These results should be treated with skeptical eyes, as it cannot be excluded that other, unidentified, TFs might be involved in circadian rhythm regulation in N. vectensis.
ATAC-seq accessibility and RNA-seq expression patterns
Promoter accessibility is essential for gene expression, so the proportion of promoters in the accessible chromatin loci is non-random. Many expression patterns found in our ATAC-seq data were visible in the RNA-seq data as well. For example, NvClock exhibits an expression peak from late day to early night (CT9-CT13), which overlaps our ATAC-seq results from CT13 that shows its promoter to be accessible (Fig 3C). In contrast, two cryptochromes (key components of the biological clock mechanism), NvCry1 and NvCry2, exhibit transcriptional rhythms, with peak expression during the day (CT4-CT11 for NvCry1 and CT0-CT4 for NvCry2) , prior to the ATAC-seq sampling at CT13. Concordantly, we did not find their promoters to be accessible in either treatment. Overall, the patterns observed for NvClock, NvCry1, and NvCry2 are aligned with the correlation between chromatin accessibility measured by ATAC-seq and expression profiles measured by RNA-seq (Fig 4 and S4 Table). Furthermore, the RNA data identified four minicollagen genes with strong rhythmicity, while three of these four gene promoters were identified as significantly accessible at CT13 (p-value < 0.05). Minicollagen is an important feature of the nematocyst structure and is expressed from the early stages of nematocyst morphogenesis until capsule maturation . By identifying enriched TF motifs within the peak sequences, potential gene regulators can be revealed. Within the peaks, at these gene promoters, we have identified motifs for C/EBP, Sox1, Pax-4, and Pax-6, all of which have been shown to act as clock-controlled gene regulators in mammals .
The high expression period of NvClock overlaps with the ATAC-seq time point CT13, and peaks occur within the NvClock promoter (gray rectangle). In contrast, the high expression periods of NvCry1 and NvCry2 do not overlap the ATAC-seq sampling time point, and no ATAC-seq peaks occur within their promotors (grey rectangles). ATAC-seq LD peaks in red, ATAC-seq DD peaks in blue, compared to Chip-seq published data peaks –H3K4me3 in dark green, H3K4me2 in green and H3K27ac in turquoise.
ATAC-seq identifies distal regulatory regions in adult Nematostella vectensis
Enhancers are distinct genomic regions containing binding site sequences for TFs that can regulate the transcription of a target gene. Along the linear genomic DNA sequence, active enhancers, marked with H3K27ac histon modification marker, can be located at a great distance from their target genes. About 70% of H3K27ac-marked enhancers in mammals are active and positively affect transcription in vivo [30,31]. Comparing our ATAC-seq profiles with previously published H3K27ac ChIP-seq  revealed that ~50% of ATAC-seq peaks overlap with H3K27ac sites in both treatments (S5 Table), indicating their potential enhancer activity. Out of ~5000 previously ChIP-seq-predicted enhancer elements in N. vectensis during different early development stages, our analysis identified 259 LD-treated and 333 DD-treated enhancers that overlap with the H3K27ac histone mark, and a total of 174 enhancers shared between the two treatments (S3 Fig).
The ATAC-seq technology was applied using a small quantity of nuclei, proved to be effective and produced ATAC-seq libraries with low, non-nucleic DNA contamination (less than 5% mitochondrial DNA per sample) and median depth of 10 million high-quality unique reads that represent the accessible chromatin of N. vectensis under the different treatments. Moreover, ATAC-seq can be utilized to increase the proportion of regulatory genomic features, such as promoters to ~20% within sequenced libraries. Therefore, ATAC-seq can act as a powerful tool in the N. vectensis genome research, and in other cnidarians, and can be applied to study epigenetics downstream to DNA methylation, RNA-seq and more.
In this work, we aim at illuminating the landscape of accessible chromatin, within the N. vectensis genome, to uncover valuable information about the active CREs and the TFs that bind them, and further enable us to examine the relationship between CREs and gene expression. Among the two light regimens surveyed, 8366–36582 peaks with IDR cut-off of 0.05 were identified, enabling the prediction of TFs binding sites within the accessible genome, specifically around rhythmic genes. The overlaps of ATAC-seq libraries with rhythmic genes lead us to conclude that there is an association between gene expression and DNA accessibility in N. vectensis, as presented in our time point sampling.
To identify groups of genes based on promoter accessibility, we conducted a GO enrichment analysis, comparing genes with accessible promoters to the full N. vectensis gene set. Our results showed a strong enrichment of rhodopsin-like GPCRs, particularly in the DD treatment. The rhodopsin-like GPCRs represent a diverse protein family that includes hormone receptors, neurotransmitters and photoreceptors, all of which transduce extracellular signals through interactions with nucleotide-binding proteins . Remarkably, the rhodopsin-like GPCRs in the DD-treated samples were enriched with SOX gene family binding sites, relative to LD-treated samples. The SOX TF family is found throughout the animal kingdom and is important in a variety of homeostasis and regeneration contexts. Most of the SOX genes found in mammals have homologs in invertebrates, including non-bilaterian lineages such as sponges [34,35]. There is also a direct connection between the SOX family and the circadian clock, as many SOX genes have been shown to be clock-controlled . the increase in SOX binding sites in DD-treated samples and not in LD-treated samples can be explained by the loss of coupling between the biological clock and the cell cycle .
We found that gene expression rhythmicity corresponded with changes in DNA accessibility within promoters of 139 out of 180 previously-identified rhythmic genes. However, this general rule does not apply to all circadian genes, and there was a subset of genes that showed relatively high and continues accessibility rate throughout the experiment (S3 Table and S4 Table). This state of constantly accessible genes suggests that the expression rhythmicity has more than one general regulatory mechanism controlling this process . When rhythmic genes were sorted according to their expression patterns, the enriched motifs within each cluster revealed a complex picture of regulatory activity. Analysis of accessible regions of individual promoters enabled predictions of TFs that are likely to bind to and regulate the associated genes. For example, the identification of a C/EBP motif within a peak on the NvClock promoter (Fig 3C) is important as C/EBP association to NvClock promoter was previously predicted . Moreover, C/EBP is overrepresented in promoters of clock-controlled mammal genes  and acts as an enhancer of promoter activation. The C/EBP motif is found in 15 more rhythmic gene promoters theat are characterized in this work (S3 Table). Another interesting finding is the presence (or absence) of homeobox-related TFs in rhythmic genes regulation. The identified motifs were enriched with NK Homeobox and HOX family motifs–some of which contribute to rhythmic regulatory processes  including CUX1, LIN-39 (HOX3A), caudal and CDX2 (Caudal-Type Homeobox 2). Interestingly, caudal has also been identified (in gene cluster 3) and previously reported as a synchronizer of locomotor activity in crayfish . Our observation that different motifs are more enriched in LD and not in DD, or vice versa, indicates that the light regimen affects the regulatory network that is activated around rhythmic genes. To investigate this issue, further work needs to be conducted, including a high-resolution sampling of these genes’ regulatory landscapes, to elucidate if these changes are linked to the rhythmic cycle or due to light deprivation impact. Surprisingly, many of the 41 rhythmic genes that do not have accessible promoters exhibited peak transcript expression at other times of day. For example, NvCry genes have expression peaks that did not overlap with the CT13 sampling point; (NvCry1 expression peak is at CT4-CT11 and NvCry2 expression peak is at CT0-CT4). As cryptochromes are photoreceptors, their expression profile peaks at mid-day (light time) [12,15], it is not surprising that at CT13 (dark time) we did not observe accessibility nor expression. Nonetheless, NVcry1 becoms accessible at sampling point CT29, This could indicate that its proximate chromatin region is more packed or inhibited, and further investigation is needed to validate this conclusion.
The genomic sequence in the immediate vicinity of the TSS, which is also known as the core promoter, is sufficient to assemble the Pol II complex with its associated proteins. However, transcription is often weak in the absence of regulatory DNA regions, such as enhancers, that are more distant from the TSS. Enhancers are key regulators of temporal and tissue-specific gene expression that display important and conserved functions and can be found at thousands of base pairs upstream or downstream to their target promoters . By comparing ATAC-seq to a known list of enhancers found in N. vectensis, we could identify enhancers that can serve as potential targets for rhythmic gene regulation (see S3 Fig).
Finally, the work presented here shows the association between gene expression and DNA accessibility by integrating two sequencing methods. This improves our understanding of the N. vectensis regulatory landscape, exposing the regulatory elements that participate in gene regulation genome-wide, which can be important for chronobiology and evolutionary investigations and future studies in epigenetics.
Materials and methods
Adult N. vectensis were kept in a plastic container filled with two liters of artificial seawater at a salinity of 12 PSU (Nematostella medium), under natural light and at a constant temperature of 18°C. Between 50 and 100 individuals were kept in each container in a recirculating water system. Animals were fed 5 times a week with freshly hatched brine shrimp (Artemia nauplii).
Female N. vectensis were incubated under two different light regimens: LD (12h light:12h dark) or DD (constant darkness), for 45 hours at a constant temperature of 18°C. Biological duplicates were sampled every 8 hours at CT13, CT21, CT29, CT37 and CT45 from both conditions and were processed as described in the “ATAC-seq nuclear isolation and library preparation” section. We did not sequence sample DD CT29 due to technical problems. These time points were chosen based on gene enrichment data from RNA-seq experiments previously published. The ATAC-seq sampling interval was due to the timing requirements of the ATAC-seq protocol (approximately 6–7 hours).
ATAC-seq nuclear isolation and library preparation
Nuclei were isolated from adult Nematostella that were incubated in different lighting treatments. From each sample, tissue was suspended in 500 μL PBS-NAC 2% (N-acetyl-cysteine, sigma) by pipetting in a 1.5 mL tube . The suspension was centrifuged at 1500 xg for 5 minutes, at 4°C. The pellet was re-suspended in 500 μL PBS and cells were counted. 400,000 cells were then re-suspended in 500 μL PBS and centrifuged at 1500 xg for 5 minutes, at 4°C. The pellet was suspended in 50 μL of ATAC-seq lysis buffer (10mM TRIS-Cl pH 7.4, 10mM NaCl, 3mM MgCl2, 0.1% IGEPAL CA630) and centrifuged at 300 xg for 10 minutes, at 4°C. The supernatant was collected and kept in a 1.5 mL tube on ice. The pellet was re-suspended in 50 μL and centrifuged at 300 xg for 10 minutes, at 4°C. The supernatant was combined with the supernatant from the previous step. Then 9 μL of isolated nuclei were stained with DAPI to verify the isolation of intact nuclei. The isolated nuclei were then centrifuged at 1500 xg for 10 minutes, at 4°C. Immediately following this centrifuge step, the pellet was re-suspended in the transposase reaction mix (25 μL 2× TD buffer, 2.5 μL transposase (Illumina REF: 15028212) and 22.5 μL nuclease-free water). The transposition reaction was carried out for 30 minutes, at 37°C. Directly following transposition, the sample was purified using an Invitrogen PureLink PCR purification kit (REF: K310001). Following purification, library fragments were amplified using 1× NEBnext PCR master mix (#M0541S) and 1.25 μM of custom Nextera PCR primers, forward and reverse, using the following PCR conditions: 72°C for 5 minutes, 98°C for 30 seconds and a variable number of cycles as needed (we added 4–9 cycles) at 98°C for 10 seconds, 63°C for 30 seconds and 72°C for 1 minute. To reduce GC and size bias in our PCR, we monitored the PCR reactions using qPCR to stop amplification before saturation. To do this, we amplified the full libraries for 5 cycles, after which we took a 4-μl aliquot of the PCR reaction and added 6 μl of the PCR cocktail with Sybr Green (Promega, REF: A6001), at a final concentration of 0.6×. We ran this reaction for 20 cycles to determine the additional number of cycles needed for the remaining 46-μl reaction. The libraries were purified using Agencourt AMPure XP beads (cat. No. 63881) and analyzed on a TapeStation. Primers used to amplify ATAC-seq libraries see S6 Table (Note: Ad1_noMX is a global primer used in all libraries).
Samples of whole N. vectensis were prepared using single-end 50bp reads from a single Illumina HiSeq run. Treatments were run on one lane of Illumina HiSeq2000. On average, ~50 million single-end reads were obtained for each sample.
Sequenced reads were aligned to the Nemve1 Nematostella vectensis genome using bowtie . Only unique mapped reads were used. Peaks were called by applying MACS2  with the following parameters: -g 450000000—nomodel—extsize 75—shift -30. TF-binding motifs enrichment were identified within the peaks using scripts within HOMER : findMotifsGenome.pl and annotatePeaks.pl were used with default parameters and the Nemve1 genome was used as background .
To compare chromatin accessibility with circadian patterns in gene expression, we evaluated a set of 180 transcripts that were previously shown to exhibit a diel expression pattern in N.vectensis . The genes had been sorted into 5 clusters with similar temporal expression patterns, using a K-means clustering, implemented in MatLab as described by Oren et al., . Pearson correlation tests were apllied to assess correlation between gene expression and DNA accessibility. We have compared all genes’ expressions to promoter accessibility within a single time point (for example, gene expression at CT13 was compared to promoter accessibility at CT13). In addition, we compared the total RNA-seq data from our 139 gene list to the total ATAC-seq data at all time points (see S2 Table). To identify possible distal enhancer sites in the ATAC-seq data, ChIP-seq data was used from a previous study of histone markers in N.vectensis . Reads were downloaded from NCBI GEO (accession number: GSE46488) and aligned to the genome using bowtie2. Only uniquely mapped reads were used. Peaks were called by applying MACS2  with default parameters. Further analysis was performed using the BamTools and BEDTools suites [46,47]. Gene promoters found within treatment specific peaks were defined and subsequently, analyzed for enriched GO terms using the metascape suit .
(A) Nematostella vectensis locomotor activity under a 12-hr light: 12-hr dark cycle (LD). (B) Nematostella vectensis locomotor activity under constant dark (DD). Yellow indicates light hours, grey indicates dark hours and pale blue indicates dark during subjective day. (C) 139 out of 180 genes that were found to be rhythmic in a previous RNA-seq experiment, clustered by expression groups. Above the heatmap is the aligned distance moved, as measured in LD. Data from Oren et al., .
S2 Fig. Histograms from all time points, sampled from LD and DD, showing the peak distribution around TSS.
The red bars indicate the approximate place of the nearest next TSS (~8,600bp).
(A-B) We determined the overlap of ATAC-seq peaks with H3K27ac ChIP-seq peaks in LD-treated and DD-treated N.vectensis. (C-D) We determined the overlap of ATAC-seq peaks with H3K4me2 peaks in LD-treated and DD-treated N.vectensis. (E-F) We determined the overlap of ATAC-seq peaks with H3K4me3 peaks in LD-treated and DD-treated N.vectensis. (G) Comparison of H3K27ac peak position to Planula and Gastrula enhancer list from previously published Chip-seq data (Schwaiger et al., ).
S1 Table. Correlation comparison between biological replicates of ATAC-seq libraries reads after filtering PCR duplicates.
P-value < 0.01 for all results.
S2 Table. Pearson R2 correlation of gene expression measured in–log2(fold-change) and promoter accessibility measures in–log2(FPKM) of 139 rhythmic genes.
S3 Table. List of 16 rhythmic genes, and their annotations, found to have a C/EBP accessible motif within their promoter region.
S4 Table. Annotation, expression and accessibility patterns of 139 Nematostella transcripts exhibiting circadian-like periodicity in expression.
Pie charts of genomic annotations showing histone acetylation marks identified from a previous ChIP-seq study, performed on adult female Nematostella by Schwaiger et al.,  (left column). Center and right columns show genomic annotations of ATAC-seq peaks that overlap with the ChIP-seq data. Promoters-TSS: -1500 bp to TSS, proximal Intergenic: -5000 to -1501 or TTS to 5000bp.
We would like to thank Ms. Adi Zweifler and Dr. Noa Simon Blecher for their help with Nematostella cultures and assistance. This study represents partial fulfillment of the requirements for a Ph.D. thesis for E. Weizman at the Faculty of Life Sciences Bar-Ilan University, Israel. Correspondence and requests for materials should be addressed to email@example.com and firstname.lastname@example.org
- 1. Golombek DA, Rosenstein RE. Physiology of circadian entrainment. Physiol Rev. 2010;90: 1063–1102. pmid:20664079
- 2. Masri S, Sassone-Corsi P. Plasticity and specificity of the circadian epigenome. Nat Neurosci. 2010;13: 1324. pmid:20975756
- 3. Doi M, Hirayama J, Sassone-Corsi P. Circadian regulator CLOCK is a histone acetyltransferase. Cell. 2006;125: 497–508. pmid:16678094
- 4. Nakahata Y, Grimaldi B, Sahar S, Hirayama J, Sassone-Corsi P. Signaling to the circadian clock: plasticity by chromatin remodeling. Curr Opin Cell Biol. 2007;19: 230–237. pmid:17317138
- 5. Kwok RS, Lam VH, Chiu JC. Understanding the role of chromatin remodeling in the regulation of circadian transcription in Drosophila. Fly (Austin). 2015;9: 145–154. pmid:26926115
- 6. Hardin PE, Yu W. Circadian Transcription: Passing the HAT to CLOCK. Cell. 2006;125: 424–426. pmid:16678086
- 7. Kwok RS, Li YH, Lei AJ, Edery I, Chiu JC. The Catalytic and Non-catalytic Functions of the Brahma Chromatin-Remodeling Protein Collaborate to Fine-Tune Circadian Transcription in Drosophila. PLOS Genet. 2015;11: e1005307. pmid:26132408
- 8. Tsompana M, Buck MJ. Chromatin accessibility: a window into the genome. Epigenetics Chromatin. 2014;7: 33. pmid:25473421
- 9. Furey TS. ChIP-seq and Beyond: new and improved methodologies to detect and characterize protein-DNA interactions. Nat Rev Genet. 2012;13: 840–852. pmid:23090257
- 10. Buenrostro JD, Giresi PG, Zaba LC, Chang HY, Greenleaf WJ. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat Methods. 2013;10: 1213. pmid:24097267
- 11. Hendricks WD, Byrum CA, Meyer-Bernstein EL. Characterization of Circadian Behavior in the Starlet Sea Anemone, Nematostella vectensis. PLOS ONE. 2012;7: e46843. pmid:23056482
- 12. Reitzel AM, Behrendt L, Tarrant AM. Light Entrained Rhythmic Gene Expression in the Sea Anemone Nematostella vectensis: The Evolution of the Animal Circadian Clock. PLOS ONE. 2010;5: e12805. pmid:20877728
- 13. Hand C, Uhlinger KR. The Culture, Sexual and Asexual Reproduction, and Growth of the Sea Anemone Nematostella vectensis. Biol Bull. 1992;182: 169–176. pmid:29303672
- 14. Stefanik DJ, Friedman LE, Finnerty JR. Collecting, rearing, spawning and inducing regeneration of the starlet sea anemone, Nematostella vectensis. Nat Protoc. 2013;8: 916. pmid:23579780
- 15. Oren M, Tarrant AM, Alon S, Simon-Blecher N, Elbaz I, Appelbaum L, et al. Profiling molecular and behavioral circadian rhythms in the non-symbiotic sea anemone Nematostella vectensis. Sci Rep. 2015;5: 11418. pmid:26081482
- 16. Reitzel AM, Behrendt L, Tarrant AM. Light Entrained Rhythmic Gene Expression in the Sea Anemone Nematostella vectensis: The Evolution of the Animal Circadian Clock. PLOS ONE. 2010;5: e12805. pmid:20877728
- 17. Reitzel AM, Tarrant AM, Levy O. Circadian Clocks in the Cnidaria: Environmental Entrainment, Molecular Regulation, and Organismal Outputs. Integr Comp Biol. 2013;53: 118–130. pmid:23620252
- 18. Reitzel AM, Tarrant AM, Levy O. Circadian Clocks in the Cnidaria: Environmental Entrainment, Molecular Regulation, and Organismal Outputs. Integr Comp Biol. 2013;53: 118–130. pmid:23620252
- 19. Buenrostro JD, Wu B, Chang HY, Greenleaf WJ. ATAC-seq: A Method for Assaying Chromatin Accessibility Genome-Wide. Curr Protoc Mol Biol. 2015;109: 21.29.1–9. pmid:25559105
- 20. Birnbaumer L. G Proteins in Signal Transduction. Annu Rev Pharmacol Toxicol. 1990;30: 675–705. pmid:2111655
- 21. Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell. 2010;38: 576–589. pmid:20513432
- 22. Rath MF, Rohde K, Klein DC, Møller M. Homeobox genes in the rodent pineal gland: roles in development and phenotype maintenance. Neurochem Res. 2013;38: 1100–1112. pmid:23076630
- 23. Rohde K, Møller M, Rath MF. Homeobox genes and melatonin synthesis: regulatory roles of the cone-rod homeobox transcription factor in the rodent pineal gland. BioMed Res Int. 2014;2014: 946075. pmid:24877149
- 24. Nguyen NH, Lee H. MYB-related transcription factors function as regulators of the circadian clock and anthocyanin biosynthesis in Arabidopsis. Plant Signal Behav. 2016;11. pmid:26905954
- 25. Malt EA, Juhasz K, Malt UF, Naumann T. A Role for the Transcription Factor Nk2 Homeobox 1 in Schizophrenia: Convergent Evidence from Animal and Human Studies. Front Behav Neurosci. 2016;10. pmid:27064909
- 26. Korenčič A, Košir R, Bordyugov G, Lehmann R, Rozman D, Herzel H. Timing of circadian genes in mammalian tissues. Sci Rep. 2014;4: 5782. pmid:25048020
- 27. Bozek K, Relógio A, Kielbasa SM, Heine M, Dame C, Kramer A, et al. Regulation of Clock-Controlled Genes in Mammals. PLOS ONE. 2009;4: e4882. pmid:19287494
- 28. Zhou Y, Zhou B, Pache L, Chang M, Khodabakhshi AH, Tanaseichuk O, et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun. 2019;10: 1523. pmid:30944313
- 29. Beckmann A, Özbek S. The nematocyst: a molecular map of the cnidarian stinging organelle. Int J Dev Biol. 2012;56: 577–582. pmid:22689365
- 30. Rada-Iglesias A, Bajpai R, Swigut T, Brugmann SA, Flynn RA, Wysocka J. A unique chromatin signature uncovers early developmental enhancers in humans. Nature. 2011;470: 279–283. pmid:21160473
- 31. Nord AS, Blow MJ, Attanasio C, Akiyama JA, Holt A, Hosseini R, et al. Rapid and Pervasive Changes in Genome-Wide Enhancer Usage During Mammalian Development. Cell. 2013;155: 1521–1531. pmid:24360275
- 32. Schwaiger M, Schönauer A, Rendeiro AF, Pribitzer C, Schauer A, Gilles AF, et al. Evolutionary conservation of the eumetazoan gene regulatory landscape. Genome Res. 2014;24: 639–650. pmid:24642862
- 33. Vassilatis DK, Hohmann JG, Zeng H, Li F, Ranchalis JE, Mortrud MT, et al. The G protein-coupled receptor repertoires of human and mouse. Proc Natl Acad Sci U S A. 2003;100: 4903–4908. pmid:12679517
- 34. Müller WEG, Schröder HC, Pisignano D, Markl JS, Wang X. Metazoan Circadian Rhythm: Toward an Understanding of a Light-Based Zeitgeber in Sponges. Integr Comp Biol. 2013;53: 103–117. pmid:23474951
- 35. Koopman P, Schepers G, Brenner S, Venkatesh B. Origin and diversity of the Sox transcription factor gene family: genome-wide analysis in Fugu rubripes. Gene AmsterdamGene Amst. 2004;328: 177–186.
- 36. Feillet C, Horst VD, J GT, Levi F, Rand DA, Delaunay F. Coupling between the Circadian Clock and Cell Cycle Oscillators: Implication for Healthy Cells and Malignant Growth. Front Neurol. 2015;6. pmid:26029155
- 37. Fu L, Kettner NM. The circadian clock in cancer development and therapy. Prog Mol Biol Transl Sci. 2013;119: 221–282. pmid:23899600
- 38. Hor CN, Yeung J, Jan M, Emmenegger Y, Hubbard J, Xenarios I, et al. Simple and complex interactions between sleep-wake driven and circadian processes shape daily genome regulatory dynamics in the mouse. bioRxiv. 2019; 677807.
- 39. Fuentes-Pardo B, Inclán-Rubio V. Caudal photoreceptors synchronize the circadian rhythms in crayfish—I. Synchronization of ERG and locomotor circadian rhythms. Comp Biochem Physiol A Physiol. 1987;86: 523–527.
- 40. Shlyueva D, Stampfel G, Stark A. Transcriptional enhancers: from properties to genome-wide predictions. Nat Rev Genet. 2014;15: 272–286. pmid:24614317
- 41. Rabinowitz C, Moiseeva E, Rinkevich B. In vitro cultures of ectodermal monolayers from the model sea anemone Nematostella vectensis. Cell Tissue Res. 2016;366: 693–705. pmid:27623804
- 42. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9: 357–359. pmid:22388286
- 43. Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, et al. Model-based Analysis of ChIP-Seq (MACS). Genome Biol. 2008;9: R137. pmid:18798982
- 44. Putnam NH, Srivastava M, Hellsten U, Dirks B, Chapman J, Salamov A, et al. Sea Anemone Genome Reveals Ancestral Eumetazoan Gene Repertoire and Genomic Organization. Science. 2007;317: 86–94. pmid:17615350
- 45. Levy O, Kaniewska P, Alon S, Eisenberg E, Karako-Lampert S, Bay LK, et al. Complex diel cycles of gene expression in coral-algal symbiosis. Science. 2011;331: 175. pmid:21233378
- 46. Barnett DW, Garrison EK, Quinlan AR, Strömberg MP, Marth GT. BamTools: a C++ API and toolkit for analyzing and managing BAM files. Bioinforma Oxf Engl. 2011;27: 1691–1692. pmid:21493652
- 47. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinforma Oxf Engl. 2010;26: 841–842. pmid:20110278