Novel estrogen-responsive genes (ERGs) for the evaluation of estrogenic activity

Estrogen action is mediated by various genes, including estrogen-responsive genes (ERGs). ERGs have been used as reporter-genes and markers for gene expression. Gene expression profiling using a set of ERGs has been used to examine statistically reliable transcriptomic assays such as DNA microarray assays and RNA sequencing (RNA-seq). However, the quality of ERGs has not been extensively examined. Here, we obtained a set of 300 ERGs that were newly identified by six sets of RNA-seq data from estrogen-treated and control human breast cancer MCF-7 cells. The ERGs exhibited statistical stability, which was based on the coefficient of variation (CV) analysis, correlation analysis, and examination of the functional association with estrogen action using database searches. A set of the top 30 genes based on CV ranking were further evaluated quantitatively by RT-PCR and qualitatively by a functional analysis using the GO and KEGG databases and by a mechanistic analysis to classify ERα/β-dependent or ER-independent types of transcriptional regulation. The 30 ERGs were characterized according to (1) the enzymes, such as metabolic enzymes, proteases, and protein kinases, (2) the genes with specific cell functions, such as cell-signaling mediators, tumor-suppressors, and the roles in breast cancer, (3) the association with transcriptional regulation, and (4) estrogen-responsiveness. Therefore, the ERGs identified here represent various cell functions and cell signaling pathways, including estrogen signaling, and thus, may be useful to evaluate estrogenic activity.


Introduction
Estrogenic chemicals are found abundantly in natural and industrial materials, and they exhibit a variety of cellular and physiological activities [1]. Along with the increasing interest in the estrogenic activity of these chemicals and their beneficial applications, there is growing need to develop new technologies for their detection and characterization. Thus, a variety of assays have been developed, such as ligand-binding assays, reporter-gene assays, yeast twohybrid assays, transcription assays, protein assays, cell assays, and animal tests, which are based on the mechanisms of estrogen action at the levels of molecules, cells, tissues, and the whole body [2]. Among the assays, gene expression profiling, such as the DNA microarray assay, is a technology used to detect the alterations of gene expression by monitoring the

Sulforhodamine B (SRB) assay
MCF-7 cells were cultured in a phenol red-free RPMI 1640 medium (Gibco, Thermo Fisher Scientific, Waltham, MA, USA) supplemented with 10% fetal bovine serum (FBS) (Gibco, Thermo Fisher Scientific) and maintained at 37˚C with 5% CO 2 in an incubator (Thermo Fisher Scientific). The sulforhodamine B (SRB) assay was performed as described by Dong et al. [33,34]. Before stimulation of MCF-7 cells with chemicals, the cells were cultured at a density of 1.5×10 4 cells per well in RPMI 1640 containing 10% (v/v) dextran-coated charcoaltreated FBS (DCC-FBS) (Gibco, Thermo Fisher Scientific) for 3 days in 24-well plates. After treatment with 10 nM 17β-estradiol (E 2 ; Sigma-Aldrich, St. Louis, MO, USA), 1 μM ICI 182,780 (ICI; Sigma-Aldrich), or 0.1% dimethyl sulfoxide (vehicle) (DMSO; FUJIFILM Wako Pure Chemical, Osaka, Japan) for 3 more days, MCF-7 cells were fixed with trichloroacetic acid (Sigma-Aldrich) at 4˚C for 30 min and washed with ultrapure water. After washing, samples were stained with acetic acid containing 0.4% SRB (Sigma-Aldrich) at room temperature for 20 min and then washed with acetic acid. The bound protein was dissolved with 10 mM unbuffered Tris-base (pH = 10.5) at room temperature for 10 min and transferred into 96-well plates to measure the absorbance at 490 nm using the Modular-Designed Multimode Reader SH-9000 (Corona Electric, Ibaraki, Japan). Three independent assays were performed for each chemical, and the data were analyzed by t-test.

Western blotting
Western blotting was performed as described by Dong et al. [33,34]. MCF-7 cells at a density of 1.0×10 5 cell per well were cultured for 2 days in DCC-FBS in 6-well plates and then one more day in serum-free medium. The cells were pretreated with ICI for 1 h and then treated with 10 nM E 2 or vehicle (0.1% v/v DMSO). The total protein was extracted from the cells and examined by SDS-PAGE using e-PAGEL (ATTO, Tokyo, Japan) and electro-transferred onto nitrocellulose membranes (Millipore, Billerica, MA, USA) using a semi-dry transfer cell (Bio-Rad Laboratories, Benicia, CA, USA). The membranes were blocked with EzBlock BSA (ATTO) and incubated with the antibodies against signal proteins (T-Erk, P-Erk, T-Akt or P-Akt) overnight at 4˚C after appropriate dilution (1:500 or 1:1000). The antibody-antigen complexes were detected with a horseradish peroxidase-coupled goat antibody against rabbit IgG (Cell Signaling Technology) after dilution (1:1000) and visualized using WSE-6100 Lumi-noGraph I (ATTO). , and the gene expression was analyzed as described previously [15]. After RNA-seq was repeated 6 times for E 2 , the standard deviation (SD) and the average (Av) for the 203 genes, which includes ERGs and control genes [15], were determined and used for calculating the log 2 of FPKMs. These genes were evaluated based on the absolute value of SD/Av and used in the correlation analysis with the correlation coefficient (R value) between one of the assays (E 2 -1) and each of the other five assays (E 2 -2Ẽ 2 -6).

Real-time RT-PCR
The FPKM values obtained with RNA-seq were log 2 -transformed and a total of 300 genes were selected based on the log 2 value (gene expression ratio). Among the 300 genes, 30 genes were selected by ranking the SD/Av values and further analyzed by real-time RT-PCR [8]. Real-time RT-PCR was conducted with the CFX Connect real-time PCR detection system using the iTaq Universal SYBR Green One-Step Kit (Bio-Rad). The conditions for real-time RT-PCR were as follows: reverse transcription for cDNA at 42˚C for 10 min and denaturation at 95˚C for 1 min, followed by 46 cycles of denaturation at 94˚C for 10 sec, annealing at 57˚C for 30 sec, and extension at 72˚C for 20 sec. After PCR, a melting curve was constructed by increasing the temperature from 65 to 95˚C to confirm the validity of the reaction. Real-time RT-PCR was repeated 3 times for each gene and the Av and SD were calculated using CFX Maestro Software (Bio-Rad). The nucleotide sequences of PCR primers (S1 Table) were obtained from the references for EGR3 [8],

Gene expression profiling based on RNA-seq
Gene expression profiling is a method to evaluate the estrogenic activity of chemicals, and a variety of assays have been developed based on transcriptomic and proteomic technologies [1].
Here, we screened ERGs for evaluating estrogenic activity by transcriptomic assays such as RNA-seq and RT-PCR. To obtain a set of ERGs, we adopted a two-step process involving a correlation analysis and a coefficient of variation (CV) analysis ( Fig 1A). MCF-7 cells were used as a cell system for the evaluation, which was performed first by two assays, a cell proliferation assay (SRB assay; Fig 1B) and protein assay (Western blotting to detect the phosphorylation of two marker proteins, Erk1/2 and Akt; Fig 1C), which have frequently been used to evaluate estrogenic activity [42][43][44]. The cells showed the response to estrogen (17βestradiol; E 2 ), and the response was inhibited by treatment with an ER antagonist ICI in both assays in the same way as previously reported [15], suggesting that the cells responded to E 2 through the ER. Then, we performed the RNA-seq analysis to evaluate the cell response at the transcription level ( Fig 1D). A total of six sets of gene expression profiles for MCF-7 cells treated with E 2 were obtained by RNA-seq, where a set of the 120 ERGs, which were previously used in DNA microarray assays [15], were used to compare the profiles (first five panels in Fig  1D). The six combinations of profiles showed R-values of 0.83 to 0.95, indicating strong and statistically significant correlations among the profiles (see the data for all combinations in S2 Table). In contrast, there was no correlation between the profiles for the E 2 treatment and E 2 +ICI treatment (the last panel in Fig 1D), suggesting the response to be mediated by the ER. Therefore, the ERGs that we obtained using gene expression profiling methods and by RNAseq techniques can be used as markers for the evaluation of estrogenic chemicals.

Evaluation of RNA-seq-based gene expression profiling
Next, we examined the quality of the obtained ERGs by CV analysis (Fig 2). Among the six sets of ERGs obtained by RNA-seq (Fig 1D), the sets of 203 ERGs that were used to evaluate estrogenic activity by DNA microarray assays [15] were again examined by CV analysis (indicated by the bar graph in Fig 2), where the 203 genes included expression standards and 174 original ERGs, and the sets of 150, 120, 90, 60, or 30 genes were selected from the top of the list of genes ranked by CV values (mean of SD/Av). The top-ranked ERGs exhibited low CV values, 0.17 +/-0.04 for the top 30 ERGs, indicating that gene expression profiling by RNA-seq can provide mean CV values that are lower than those by DNA microarray assays (mean CV value of 0.22; [15]). Furthermore, the correlation coefficients (R-values) for the top 30 ERGs in the five data sets obtained by RNA-seq were 0.97 to 0.99, which are also more reliable than those with nine data sets obtained by DNA microarray assays (0.93 to 0.97; [15]), confirming that the statistical stability of data is greater in RNA-seq than in DNA microarray assays.

Selection of new ERGs for gene expression profiling
We selected new ERGs for gene expression profiling based on the approach shown in the previous section. A total of 300 ERGs were selected from the ranks created by CV analysis, from all human genes available for RNA-seq (a total of approx. 26,000 genes), which consist of 150 up-regulated and 150 down-regulated genes (see S3 Table for the complete list). Among the 300 ERGs, the top 30 ERGs (Table 1) were examined by real-time RT-PCR (Fig 3A), and the data were compared with those obtained by RNA-seq ( Fig 3B). Both datasets were identical, except for minor differences due to slight variation in the locations of the transcripts that were detected by these assays. Notably, the statistical significance levels are lower for some genes, probably due to the lack of expression levels in the statistical consideration.

Functional analysis of ERGs
The newly identified ERGs were examined by signaling pathway analysis using the GO and KEGG databases (Fig 4). The characteristics of ERGs were analyzed first by the statistical evaluation of the relationship of the significance (p-values; Fig 4A) or stability (CV values; Fig 4B) of expressional alterations with the degrees of the alterations (log 2 -transformed fold changes) and then by the functional searches in the GO (Fig 4C and 4D) and KEGG (Fig 4E and 4F) databases. We confirmed that 30 ERGs belong to the groups of genes showing statistically significant alterations of their expression due to the treatment with estrogen (red or green circles in Fig 4A and 4B). From the database searches, significant levels of association were found between the genes of significant alterations of their expression and specific cell functions and   Table). Furthermore, significant numbers of the 30 and 300 ERGs that were characterized in Fig 2 were included in the groups of genes found by the database searches (S4 Table), suggesting the association of these ERGs with specific cell functions.

ER subtype-specific interaction of ERGs
Since ERGs are likely controlled by ERs through the interaction at their promoter and enhancer regions, we examined the ChIP-seq data that are available in the Gene Expression  Omnibus database to determine whether the regions containing these ERGs were identified as ER subtype-specific binding sites (

ERGs for the evaluation of estrogenic activity
Estrogen is a hormone that regulates the female reproductive system and the physiological, neurological, and behavioral activities of females and, to some extent, males. Estrogen action is regulated and mediated by various genes and their protein products, including ERGs. Thus, ERGs can represent various estrogenic cell functions, and therefore, have been used as markers for estrogenic activity (reviewed in Kiyama & Zhu, 2014 [4]). Previously, specific ERGs were used as markers for gene expression or used as reporter-genes, which were later replaced by a set of ERGs in more statistically reliable transcriptomic assays such as DNA microarray assays and RNA-seq. A small number of ERGs, tens to hundreds, were enough for statistical stability, and including the genes without any response would increase the noise level. However, the quality of ERGs has not been extensively explored. Here, we found a set of 300 ERGs newly identified by six sets of RNA-seq data obtained from E 2 -treated and control MCF-7 cells (Figs  1 and 2). A set of the 30 genes with best statistical stability were further evaluated quantitatively by RT-PCR (Fig 3), and then qualitatively by a functional analysis using the GO and KEGG databases (Fig 4) and by a mechanistic analysis using ChIP-seq data (Fig 5). Since the 30 ERGs identified here exhibited alterations in gene expression at significant levels, they are associated with various cell functions and cell signaling pathways. How the genes are functionally associated with estrogen action and how they respond to estrogen are the questions that need to be examined further.   [59], and cancers [60][61][62]; PKIB encoding a competitive inhibitor of cAMP-dependent protein kinase involved in breast cancer development by enhancing ERα action [63]; RAB26 encoding a RAB-family small GTPase that regulates intercellular vesicle trafficking, including exocytosis, endocytosis, and recycling [64]; RAP1GAP encoding a GTPase-activating protein (GAP) that promotes the hydrolysis of GTP in RAP1-GTP and inactivates RAP1, which plays important roles in the regulation of cell adhesion and migration and in the progression and metastasis of several types of cancer [65]; RAPGEFL1 encoding a protein with predicted guanyl-nucleotide exchange factor (GEF) activity, which promotes RAP1 to load GTP and acquire the active GTP-bound status [66]; RAPGEFL1, a gene found in several types of cancer, acting as an estrogen-regulated gene in breast cancer cells [67]; SRGAP3 encoding a protein highly expressed in the brain, with structural characteristics of BAR/Rac1 GAP/SH3 domains and considered to be a tumor suppressor in breast cancer [68]. Secondly, the genes that are related to specific cell functions and cancer are included in the 30 ERGs. CRISP3 protein has been known for its androgen-responsive gene expression [69], which can help to avoid apoptosis induced by estrogen deficiency in epithelial cells in the salivary gland [70]. Cathepsins are lysosomal cysteine proteases, and their aberrant expression is related to the malignancy of tumors, and thus, the inhibition of cathepsins by CSTA expression has been considered to be a target for cancer treatment [71]. Down-regulation of CSTA expression after the treatment of E 2 in ERα-positive breast cancer cells were reported [71,72]. CTSD is an ERG and has been used as a marker of estrogen response [12]. The inhibition of CTSD activity is one of the targets for cancer treatment by controlling the expression of CSTA [71]. CTSD is also considered to be a target in antibody-based therapy for the treatment of triple-negative breast cancer due to its correlation with poor prognosis of breast cancer and providing a tumor-specific extracellular target [73,74]. DOK7 is associated with congenital myasthenic syndromes, a group of neuromuscular disorders caused by impaired neuromuscular transmission, and encodes an adaptor protein which mediates tyrosine kinase signaling to form the neuromuscular synapse [75,76]. DOK7 is also associated with malignancy of several types of cancer, including breast cancer, where DOK7 can inhibit cell proliferation, migration, and invasion of breast cancer by mediating signals through the PI3K/PTEN/AKT pathway, and thus, may act as a tumor suppressor gene [77]. IGSF1 encodes a transmembrane immunoglobulin superfamily glycoprotein highly expressed in the hypothalamus and in pituitary cells, and has roles in thyroid hormone biosynthesis [78], follicle-stimulating hormone biosynthesis [79], and growth hormone secretion [80]. Although its cellular function has not been clarified, IGSF1 mutations affecting the symptoms and disease pathologies revealed that extracellular ligands are crucial for its function [81]. IL20 encodes a proinflammatory cytokine involved in the pathogenesis of inflammatory diseases, such as psoriasis, rheumatoid arthritis, and atherosclerosis, by inducing factors such as TNF-α, IL-1β, MMP-1/13, and MCP-1, and plays important roles in regulating angiogenesis, osteoblastogenesis, and osteoclastogenesis [82,83]. IL20 was found to be an ERG [84], and its expression at the mRNA and protein levels in breast cancer cells was up-regulated by estrogen treatment [85]. INSYN1 encodes an inhibitory postsynaptic density protein involved in postsynaptic inhibition by decreasing inhibitory currents and increasing excitability in the hippocampus [86]. MATN2 encodes matrilin-2, a member of the matrilin subfamily of extracellular matrix proteins containing von Willebrand factor A and epidermal growth factor-like domains [87]. MATN2 is localized downstream of several growth factor receptors, such as IGF-1R, GPER, and TGFβ1, and involved in tumor progression by stimulating the growth, chemotaxis, and migration of cancer cells [88,89]. PDLIM3 encodes a protein containing a PDZ and LIM domain, which are used to bind to the actin cytoskeleton, and localizing at the Z-line of mature muscle fibers, and thus, the gene is likely involved in the development and maintenance of muscle cells [90,91]. SUSD3 encodes a cell surface protein with the sushi domain, a structure with a characteristic sandwich arrangement of βstrands, and has been known as an ERG [92]. The possibility of SUSD3 as a prognostic and diagnostic marker of breast cancer was investigated in several studies, including a DNA microarray analysis [93], a pathological study [94] and an epidemiological study [95].

Functional significance of ERGs
Additionally, several genes are considered to be tumor suppressor genes. CCDC68 is a putative tumor suppressor gene in several types of cancer [96], partly through regulating the cell cycle by controlling the degradation of CDK4 [97]. FRY encodes an evolutionary conserved protein with diverse functions, such as those controlling cell growth and morphogenesis through FRY-NDR kinase signaling, in invertebrates and vertebrates [98]. In mammals, FRY suppresses nuclear localization of YAP, a transcriptional coactivator with a role in promoting cell proliferation, and thus, may act as a tumor suppressor [99]. FRY is required for mammary gland development and may have a role in the suppression of breast cancer cell growth and proliferation [100].
Moreover, the genes associated with breast cancer are included in the 30 ERGs. The expression of B4GALT1 has been known to quickly respond to estrogen treatment [6,101], and the ERα-dependent activation of cell proliferation through membrane B4GALT1 implicated its crucial role in breast cancer development [102]. CYP1A1 is involved in estradiol metabolism by the hydroxylation of estradiol to 2-hydroxyestradiol, which may contribute to the antiestrogenic effects of aryl hydrocarbon receptor ligands such as TCDD [103]. Due to the connection of cholesterol metabolism to cancer, FDFT1 has been considered as a cancer prognostic marker and a target for anticancer therapy [104]. The involvement of FDFT1 in breast cancer progression through interaction with PGRMC1, a key protein in lipid metabolism, is considered to be a potential basis for breast cancer treatment [105]. LOXL2 stimulates the proliferation of breast cancer cells and contributes to their oncogenic transformation and pathogenesis, suggesting that the development of LOXL2 inhibitors is an effective therapeutic strategy [106]. SYNE1 encodes nesprin-1, a member of the spectrin family of structural proteins that links the plasma membrane to the actin cytoskeleton [107]. Since SYNE1 is located 19 kb downstream of ESR1 (the ERα gene), its association with estrogen action in ovarian cancer [108], endometrial cancer [109], and endometriosis [110] has been reported. TMPRSS3 was identified as an ERG by DNA microarray analysis, and its transcription level was increased after treatment with E 2 in MCF-7 cells with the help of cohesin, a protein complex that may connect distant transcriptional regulatory elements with gene promoters [111]. Furthermore, among 150 breast cancer risk regions identified by genome-wide association studies [112], several ERGs identified in this study (BARX2, CBX6, KCNN4, MYEOV, PDZK1 in 300 ERGs) were included as the targets for cancer drivers, transcription factors, and signaling proteins. We also found that some of the ERGs identified here are included in the genes listed in cancer genomics databases such as The Cancer Genome Atlas (TCGA; portal.gdc.cancer.gov) and International Cancer Genome Consortium (ICGC; dcc.icgc.org) (12 genes found in TCGA and 29 genes in ICGC; see S5 Table).
Thirdly, several ERGs are associated with transcriptional regulation. BARX2 was identified as a homeodomain transcription factor that binds to the transcriptional regulatory regions in the genes encoding cell adhesion molecules (CAMs) [113] and may have important roles in skeletal muscle development, such as digit chondrogenesis, by controlling the expression of CAMs [114]. The coordination of BARX2 with ER was suggested to regulate the growth, survival, and invasion of breast cancer cells [115]. EGR3 encodes a transcription factor belonging to a family (the early growth response family) of proteins characterized by conserved zinc finger motifs, which are associated with various roles such as the development of muscle spindles [116], the suppression of excessive immune response [117], and the progression of breast cancer [118]. EGR3, known as an ERG [119], has been used for evaluation of estrogenic activity of the constituents in foods and medicines [120]. GATA4 encodes a transcription factor that binds to the GATA motif as a cis-element and transactivates the genes for various cell functions such as the development of heart, liver, lung, and urogenital cells [121,122]. Due to its suppressive effects against ovarian and colorectal cancer cell proliferation, GATA4 is considered to act as a potential tumor suppressor gene [123,124]. GATA4 is also associated with bone differentiation through the signaling pathways such as the ERα and TGFβ pathways [125,126]. SPOCD1 encodes a TFIIS-family transcription factor and has been reported to promote the progression and metastasis of several types of cancer, including bladder and ovarian cancers, and osteosarcoma and glioblastoma, by activating signaling pathways such as the EGFR and VEGFR pathways [127][128][129][130]. ZNF521 encodes a zinc-finger transcription factor that has roles in various biological processes involving stem cells, such as erythroid/myeloid/ adipocyte/osteoblast differentiation, neural development, and bone formation, and is also implicated in diseases such as pancreatic, hepatic, gastric, bladder, breast, and ovarian cancers [131]. The role of ZNF521 in ER-positive breast cancer was indicated to be the expressional regulation of ZNF423, a homolog of ZNF521, which forms a heterodimer with ZNF521, through the transcriptional regulation involving ERGs [132].

Functions of ERGs through ER-binding
A genome-wide analysis of ERs and RNA polymerase II binding sites by ChIP-seq analysis has revealed a new paradigm of estrogen-mediated gene expression. For example, ChIP-seq analysis revealed the master transcriptional regulator, which may be the best cancer therapeutic target [133]. The mechanisms identified by ChIP-seq include the transcriptions of the genes with E 2 -regulated promoters and enhancers, miRNAs, enhancer RNAs, long-ncRNAs, and anti-sense RNAs [134]. The cistrome, or the genome-wide binding landscape, identified by ChIP-seq using ERα phosphorylated at serine 118 revealed the involvement of this site in a genome-wide transcriptional regulation, such as that by the transcription factor GRHL2, in estrogen action [135]. Such cistromic studies can be applied to develop breast cancer treatment [136].
About one third of the ERGs contained functional ERα-binding sites near their transcription start sites [137]. ChIP-seq analysis along with functional annotations of the genes in the vicinity of ERα-binding sites using databases, such as the GO, KEGG, and GeneGo databases, revealed that the genes with specific functions, such as biological regulation, cellular processes, and developmental processes, are the targets of ERα [138]. Furthermore, specific regulatory networks consisting of transcription factors, modulators, and targets, such as GRHL2, RUNX1, SRC1/SRC2/SRC3, and ERβ, have been identified as potential ER signaling regulators [135,[139][140][141]. In our study, the ER-subtype specificity among 29 ERGs was 10 ERα-specific, 2 ERβ-specific, and 14 ERα/β-specific genes, along with 3 genes classified as other types (Table 2). Thus, more than 80% of the identified ERGs are ERα-dependent. Interestingly, the ERα non-genomic network is more reliable than the genomic network [141], and thus, may be predominant in MCF-7 cells.
Three ERGs identified here (ZNF521, IGSF1, and CRISP3) were found to be ERα/β-independent, suggesting that they are involved in other signaling pathways. The expression of ZNF521 is known to potentially be controlled by ZNF423, an ERα-dependent ERG [132], and thus, its regulation may be indirect, as observed here. On the other hand, the expressions of IGSF1 and CRISP3 are known to be androgen receptor (AR)-dependent [142] and characteristic in prostate cancer cells [143]. The expression and transcriptional regulation of the AR in breast cancer cells, including MCF-7 cells, has been known [144,145], and the crosstalk between ERα and the AR in breast cancer progression [146,147] is a likely mechanism of transcriptional regulation of these two genes.
Based on ChIP-seq data, a total of 80,000 potential ERα-binding sites were classified into four groups based on the variations of the presence or absence of both estrogen-responsive elements and the treatment with E 2 [148], which indicates that some ERα-binding sites are not E 2 -dependent. Evidently, ERα is present prior to ligand activation in the majority of the binding sites [148]. An independent study also revealed the presence of estrogen-independent ER activity such as enhancer activity and the associated activity of histone modifications and enhancer RNA transcription [149]. Additionally, there are differences in ChIP-seq data for ERα-binding sites detected after the treatment with different ligands, E 2 , tamoxifen, and ICI 182,780, suggesting the mechanistic differences of ligand actions [150]. Thus, the ERα/β-independent ERGs may represent the types of transcriptional regulation with more complex mechanisms of estrogen action.

Conclusions
In this paper, we identified the ERGs that show statistical stability based on the CV values. We evaluated their usefulness as markers for estrogenic activity by examining the stability of the data when compared with correlation analysis, RT-PCR, and functional association with estrogen action through database searches. The ERGs identified here have been known for (1) the enzymes, such as metabolic enzymes, proteases, and protein kinases, (2) the genes with specific cell functions, such as cell-signaling mediators, tumor-suppressors, and roles in breast cancer, (3) the association with transcriptional regulation, and (4) estrogen-responsiveness. The top 30 ERGs were further classified as ERα-binding, ERβ-binding, ERα/β-binding, or ER-independent types based on ChIP-seq data. Therefore, the ERGs identified here represent various cell functions and cell signaling pathways, including estrogen signaling, and thus, may be useful for evaluations of estrogenic activity.
Supporting information S1