Despite improved outcomes in the past 30 years, less than half of all women diagnosed with epithelial ovarian cancer live five years beyond their diagnosis. Although typically treated as a single disease, epithelial ovarian cancer includes several distinct histological subtypes, such as papillary serous and endometrioid carcinomas. To address whether the morphological differences seen in these carcinomas represent distinct characteristics at the molecular level we analyzed DNA methylation patterns in 11 papillary serous tumors, 9 endometrioid ovarian tumors, 4 normal fallopian tube samples and 6 normal endometrial tissues, plus 8 normal fallopian tube and 4 serous samples from TCGA. For comparison within the endometrioid subtype we added 6 primary uterine endometrioid tumors and 5 endometrioid metastases from uterus to ovary. Data was obtained from 27,578 CpG dinucleotides occurring in or near promoter regions of 14,495 genes. We identified 36 locations with significant increases or decreases in methylation in comparisons of serous tumors and normal fallopian tube samples. Moreover, unsupervised clustering techniques applied to all samples showed three major profiles comprising mostly normal samples, serous tumors, and endometrioid tumors including ovarian, uterine and metastatic origins. The clustering analysis identified 60 differentially methylated sites between the serous group and the normal group. An unrelated set of 25 serous tumors validated the reproducibility of the methylation patterns. In contrast, >1,000 genes were differentially methylated between endometrioid tumors and normal samples. This finding is consistent with a generalized regulatory disruption caused by a methylator phenotype. Through DNA methylation analyses we have identified genes with known roles in ovarian carcinoma etiology, whereas pathway analyses provided biological insight to the role of novel genes. Our finding of differences between serous and endometrioid ovarian tumors indicates that intervention strategies could be developed to specifically address subtypes of epithelial ovarian cancer.
Citation: Kolbe DL, DeLoia JA, Porter-Gill P, Strange M, Petrykowska HM, Guirguis A, et al. (2012) Differential Analysis of Ovarian and Endometrial Cancers Identifies a Methylator Phenotype. PLoS ONE 7(3): e32941. https://doi.org/10.1371/journal.pone.0032941
Editor: Joseph Califano, John Hopkins Medical School, United States of America
Received: September 29, 2011; Accepted: February 2, 2012; Published: March 5, 2012
This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Funding: Funding is provided by The Intramural Research Program of the National Human Genome Research Institute, National Institutes of Health (LE and LB), DoD GOC# 04-124 and Magee Womens Research Institute/Scaife Grant (TCK). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Ovarian cancer has an incidence of 21,500 cases per year in the United States and 204,000 worldwide, with an estimated annual mortality of 125,000 women. The condition ranks as the 5th leading cause of cancer-related deaths for women in the United States; the high mortality rate is a consequence of the asymptomatic nature of early-stage disease and the absence of a reliable screening test. The majority of cases (75%) are diagnosed at an advanced stage (III or IV) wherein the 5-year survival rate is less than 30% .
Of the four major histopathologic subtypes, serous is the most common, followed by endometrioid, mucinous and clear cell types. These subtypes have distinctive gene expression profiles  and are classified by virtue of their morphologic resemblance to normal fallopian tube, endometrium, endocervix and endometrial clear cells, respectively . The resemblance between tumor subtypes and distant tissues is consistent with models that propose migration of precursor lesions from disparate origins, such as the fallopian tube  or the mesothelial covering of the peritoneal cavity . For this reason, ovarian serous tumors (which resemble Mullerian epithelia) can be legitimately compared to normal fallopian tube (which is derived from Mullerian epithelia). Nevertheless, the ovarian surface epithelial (OSE) layer  shares its origin with epithelia of the endometrium (known as celomic epithelium ) and remains a plausible alternative explanation for “de novo” tumorigenesis.
Like serous tumors, the origin of endometrioid tumors is controversial , and progenitor cells have been proposed to originate from non-ovarian sources, such as endometriosis . Tumors with endometrioid histopathology are diagnosed in both the uterus and ovary. They frequently co-occur, as synchronous primary tumors or metastases from uterus to ovary . Whereas molecular differences have been reported for dual primary tumors, metastatic tumors are clonally identical .
Gene expression studies of ovarian tumors intended to detect cancer-specific profiles have yielded modest success and limited reproducibility , , , . However, mutational profiling studies have yielded more consistent results, showing that both serous and endometrioid tumors have aggressive, high-grade subtypes , ,  with mutations in the TP53 gene , ,  despite their obvious histological differences , , . Low-grade subtypes are most common in endometrioid tumors with mutations predominantly occurring in WNT and PIK3CA pathways .
In concert with gene expression and mutational profiles, delineating the epigenome of tumor cells should reveal relationships among samples reflecting common embryological origins, similar histopathological outcomes, or shared mutational events. In ovarian tumors, DNA methylation silences expression of critical genes , , and creates genetic haploinsufficiency , while hypomethylation at other sites enables expression of normally silenced genes. As proof of principle, site-specific patterns of DNA methylation were recently used to distinguish four subtypes of epithelial ovarian cancers, using a total of 1,505 target CpG loci , .
We hypothesized that DNA methylation patterns in ovarian tumors would resemble cells from their putative tissue of origin, with a small number of changes representing events associated with malignancy, that uniquely represent each tumor subtype. Moreover, we also hypothesized that uterine and ovarian endometrioid tumors were related by pathogenic mechanisms, which would be observed in DNA methylation patterns. To address these ideas, we examined methylation profiles of 27,578 target CpG sites representing 14,495 genes in the human genome, using DNA derived from serous and endometrioid ovarian tumors, normal fallopian tube and normal endometrium, and primary and metastatic endometrioid endometrial tumors. This large dataset was analyzed using a supervised analysis followed by de novo classification using unsupervised computational clustering. To improve the strength of the epigenetic profiling technique, we included raw methylation data from serous tumors and normal fallopian tube generated through The Cancer Genome Atlas (TCGA). These samples were analyzed using the same methylation platform, and performed by independent research laboratories using independent tumor specimens.
Experimental assay and design
We analyzed the DNA methylation status of genomic samples using the Illumina Infinium platform. DNA was treated with bisulfite to convert unmethylated cytosines to uracil, leaving methylated cytosines unchanged. The hybridization reaction on the HumanMethylation27 Illumina BeadChip provided signal specific to the methylated and unmethylated states, using the Illumina single base extension assay protocol . The differential hybridization of probes to methylated and unmethylated target sites was tabulated as the fraction of the total signal that corresponded to the methylated state. The initial sample set represented various tissue and tumor types including normal fallopian tube, normal endometrium, ovarian papillary serous carcinoma, ovarian endometrioid carcinoma, and primary and metastatic endometrial endometrioid carcinoma from 42 patients (Table 1). Technical replicates indicated highly reproducible results for the assay (Figure S1). In addition, we included data from the same Illumina methylation platform for 12 additional samples from the public database of The Cancer Genome Atlas project (TCGA). This set comprised 8 control samples (normal fallopian tube) and 4 tumor samples (ovarian serous), contributed in a single batch of samples examined under consistent experimental conditions from one data provider.
To assess gross changes in degree of methylation, we examined aggregate methylation levels of all samples. Assay values are reported as the proportion of fluorescence arising from the probe for the methylated state, from 0 (all DNA unmethylated) to 1 (all DNA methylated). Comparing all samples, the vast majority showed consistent methylation profiles. Across the genome, most assayed sites had methylation levels between 0 and 0.2, small numbers of sites had levels between 0.2 and 0.8, and a slightly larger number had levels of 0.8 to 1 (Figure 1).
Frequency of methylation at all loci for a given level of methylation (range 0 to 1). Each biological sample is represented as a single line; all non-metastatic samples were plotted. Respectively, panels from left to right contain all loci (N = 27,561), X chromosome only (N = 1038), or chromosome 10 only (N = 1044).
In contrast, considering only the X-chromosome, single-copy silencing by random X-inactivation was expected to produce a methylation level of 0.5. The observed pattern showed a broad peak centered at 0.5 for most samples, even though the tumor samples had more extensive heterogeneity. Five serous tumor samples showed a distinct profile, with many loci being unmethylated (i.e., methylation level <0.2), indicating either a failure to maintain X-inactivation or copy number alterations with relative excess of the active X . To assess whether this observed variation was simply due to the small number of loci on the X-chromosome, we also considered methylation levels on chromosome 10, which had a similar number of probes. The pattern for chromosome 10 resembled the pattern across all autosomal loci, with high levels of similarity from sample to sample.
We first considered whether the tumor subtypes (defined by histopathology) corresponded to specific methylation profiles. With the inclusion of the TCGA samples, 12 normal fallopian tube samples and 16 ovarian serous tumors yielded sufficient statistical power for a direct comparison. Probes with poor quality control, high variability in the controls, or located on X or Y-chromosomes were not considered (see Methods). Using a Wilcoxon summed-rank test to identify sites that consistently associated with prior classification, 36 were significant at p<0.05 after multiple-testing correction (14 at p<0.01; Table 2). Three genes were identified as members of the canonical pathway for ovarian cancer in an Ingenuity Pathway Analysis (IPA) (Table 2;    ). Supervised analysis of the ovarian endometrioid tumors against the fallopian tube or endometrial controls was not performed due to small numbers of samples.
To address whether other sample divisions with shared molecular phenotypes existed and to gain a broader picture of the relationships between the sample types, we moved to unsupervised clustering. Utilizing the complete set of 31 primary tumor samples and 18 normal tissues (and excluding the 5 metastatic samples used for secondary analysis), we limited this analysis to probes that were in the top 500 when ranked by variance, as reported by Houseman et al.  to reduce the dimensionality of the data. Results of multiple clustering algorithms converged on the same interpretation of distinct phenotypic groups (Figure S2). K-means clustering and partitioning using a β-mixture model designed for the data from this platform  both strongly supported the existence of 3 primary groups, roughly corresponding to control-type samples, serous tumor-type samples, and endometrioid samples. Additional analysis with hierarchical clustering (across multiple distance metrics and linkage methods) strongly supported the control-type and endometrial type clusters, and indicated the serous tumor-type samples were an outgroup from the control samples, but was inconsistent as to whether these samples formed a distinct subgroup (Figure S2).
The consensus groupings, as shown in Figure 2, are marked with a colored bar to indicate the normal-type, serous-type or endometrioid-type. Notably, the control group contained normal fallopian tube and normal endometrium, indicating a consistent phenotype across both tissues within the set of featured probes. The TCGA samples clustered with their identified sample types, confirming the reproducibility and robustness of the results as these tumors were obtained and classified at various institutions. Exclusion of the TCGA samples from the analysis had relatively small impact on the results, in which a marked division remained between the endometrioid and serous or control samples; however, only weak support remained for a subdivision of the serous-type tumors from the control samples.
Heatmap of methylation levels; blue = 0.0, black = 0.5, yellow = 1.0. At left, a representative sample of hierarchical clustering, and color blocks giving the consensus groups. Six columns on the right give sample characteristics: number; tumor (T) or normal (N); location in ovary (O), fallopian tube (F), or endometrium (E); histology of serous (S) or endometrioid (O); grade (1–3; where symbols used are ‘-’ for normals and ‘NA’ for information not available.) and at far right, dots for public TCGA data. Five samples at bottom are ovarian metastases from endometrial endometrioid tumors, excluded from the initial analysis and clustering.
The cluster of endometrioid samples, including tumors from both ovarian and uterine sites, displays a remarkably altered profile, with methylation at numerous sites that are normally unmethylated CpG islands, and a loss of methylation at sites that are normally methylated. The extent and reproducibility of these changes is strongly reminiscent of the methylator phenotypes noted in other cancers  . A methylator phenotype has previously been proposed for endometrial endometrioid carcinoma, based on methylation of promoters of a few target genes , but has not to our knowledge been described in a genome-level survey or in ovarian cancer.
These data confirm the hypothesis that endometrioid type tumors, whether at ovarian or uterine sites, share similarities at the molecular level. To further address this finding, we analyzed five ovarian metastases derived from primary endometrial tumors using a nested log-likelihood-ratio test. This test addressed consistency of clustering with the primary endometrioid samples versus the combined serous tumors and controls, and secondarily enabled classification within the serous or control groups when necessary. Four of the five samples were strongly identified as endometrioid-type, whereas the fifth was more similar to the control samples (sample 54; Figure 2). The fifth sample does not appear grossly altered from the methylation of normal endometrial tissue, indicating that the majority tumor phenotype is not universal. This outlier may represent an uncommon subdivision of endometrioid tumors, resulting from different underlying pathology, however it does not represent a low-grade tumor (Figure 2).
Assessment of primary tumors of all histopathologies also identified infrequent outliers. Examples included endometrial endometrioid samples 47 and 49, which clustered with normal tissues and contributed to the 15% of samples that showed discordant placement relative to their assigned histopathology (grade 1 and grade not available, respectively; Figure 2). Four ovarian serous tumors also clustered with normal tissues, showing very limited changes in methylation relative to controls (grades 2 and 3). Given the phenotypic similarity of these samples to normal controls, the biological underpinning of this tumor subset requires further investigation. Additionally, one ovarian endometrioid sample grouped with the serous tumors (sample 36, grade 2), suggesting either a rare endometrioid subtype with a more aggressive, serous-like profile, or mistyping of a poorly differentiated sample.
Differential analysis of clusters
Given the three primary groupings provided by the unsupervised clustering analysis, we wished to identify methylation loci most predictive of membership in a particular class. We repeated the Wilcoxon summed-rank test used in the supervised analysis, after removing from consideration the 500 probes used in clustering. Comparing the serous-type cluster with the control-type cluster, 35 probes remained significant at p<0.01 after stringent Bonferroni correction for multiple tests and 60 remained at p<0.05 (Table 3). The results showed a mixture of hyper- and hypomethylation relative to the controls (Figure S3). An IPA analysis identified known biomarkers for ovarian cancer among this list (Figures S4 and S5). Two genes with recorded relevance to DNA methylation were identified, including DNMT3A, a DNA methyltransferase gene and RB1. APC (from the beta-catenin pathway), RBAK1 (an RB1 interaction partner), MAPK15 and MAP2K2 kinases, and histone deacetylase HDAC1 were also on the list (Table 3). Although the supervised and unsupervised analyses utilized different comparator sets, the gene lists contained 10 overlapping entries (Table 3).
Considering the endometrioid-cluster versus the control cluster, we determined that the number and degree of differentially methylated sites increased by more than an order of magnitude. For example, 954 probes were significant at p<0.01. The sheer number of hypermethylated sites suggests an underlying defect in DNA methylation pathways, and limits the utility of considering altered methylation of individual genes.
Validation of differential methylation
To explore whether our set of 60 differentially methylated sites in serous tumors was reproducible, we assessed the methylation of 25 additional samples that were independent of the original analysis set. All were typed as ovarian serous carcinoma and analyzed independently (independent in ascertainment and in time of analysis) from the originals. For each site and for each validation sample, we assessed whether the methylation more closely resembled the normal sample cluster or the serous tumor sample cluster. Of the 25 tumor samples, 4 closely resembled the methylation pattern of normal samples, seven were altered at 21 or more of the 60 loci, and 14 samples showed the altered pattern at 40 or more of the 60 loci (Figure 3).
Methylation at 60 loci was used to evaluate an independent set of serous tumors. Each row represents an individual sample; each column corresponds to one of the previously identified differentially methylated sites. An empty box indicates methylation more similar to the control cluster (not necessarily unmethylated); a filled black box indicates methylation more similar to the cluster of ovarian serous tumors. Samples are ordered by the number of sites having methylation resembling the serous tumor group.
Evaluation of published methylation events
Our analysis of differential methylation focused on changes that defined characteristics of each group and were shared among all or nearly all samples. Many important changes in methylation state, previously reported in the literature, have lower prevalence and are not directly identified by our approach. When we assessed our samples for patterns of known methylation, our data were consistent with published results. For instance, BRCA1 was hypermethylated in 2 of 16 (12.5%) of ovarian serous samples. The tumor suppressor RASSF1A showed evidence of complete methylation in 11 tumors, and single-copy methylation in 4 more (31% of serous, and 60% of endometrioid). These changes, and others, are likely to be important transformative events, but are restricted to smaller subsets of the samples.
Gene ontology & pathway analyses
To tie these results to the literature on ovarian cancer, we performed a gene ontology (GO) analysis and an IPA analysis of the differentially methylated genes from the cluster-based comparison of serous tumors versus controls. The genes corresponding to the methylated loci in our list showed a statistically significant enrichment for GO terms involving regulation of cell cycle (Table 4). The top two networks identified by IPA included “Cell Cycle and Cell Morphology” and “Inflammatory Response” with network scores of 24 and 23, respectively. The network score is based on a hypergeometric distribution and is calculated with the right-tailed Fisher's Exact Test, implying that there is a 1 in 1023 or 1024 probability of either network occurring from a random list of genes (Table 4, Figure S4,S5). Notably, differentially methylated genes had a large presence in these networks, interacting with important genes in ovarian tumor development including AKT, PI3K, VEGF, and estrogen receptor.
This work represents one of the largest studies of methylation using several normal and tumor subtypes of gynecologic cancers. We initially examined the methylation status of 27,578 sites for 49 samples including normal fallopian tube and endometrium, serous ovarian cancer, endometrioid ovarian cancer, primary endometrioid endometrial cancer, and ovarian metastasis of endometrial cancer. Regardless of tumor or normal status, all samples showed similar profiles in the overall distribution of methylated sites. Although we did not find global shifts toward hyper- or hypomethylation across the assayed samples, a subset of samples showed drastically altered methylation for the X chromosome, consistent with loss of the inactive X chromosome, amplification of the remaining active X, or both . Examples of aneuploidy, including the autosomes, are common in high-grade serous ovarian cancer and are not directly ascertained by this analysis, but influence the proportional methylation levels at each locus. Therefore we removed all probes on the X-chromosome from our dataset.
Our data confirm that different histological subtypes have distinct patterns of methylation. Moreover, ovarian serous tumors are more similar to normal ovarian and endometrial tissues than to ovarian or endometrial endometrioid tumors, which are highly similar to each other and display drastic and consistent changes in their methylation. This result is consistent with a methylator phenotype and in agreement with a model of ovarian endometrioid tumors arising from endometriosis, where the cells ultimately derive from a uterine lineage. Endometrioid tumors from the ovary and uterus share several common somatic mutations , and these data support a similar pathogenic mechanism. The marked differences in methylation profiles between histological subtypes underscore the importance of characterizing tumors at the molecular level in order to develop tailored treatment strategies.
For the identification of differentially methylated loci, we used known labels and blinded (data-directed) subgroups. Known labels identified a few dozen genes, some with characterized roles in ovarian cancer. However, given the stringent bar for statistical significance in testing very large numbers of sites, we found that a few outliers within a group could obscure important patterns. By clustering data in an unbiased approach, we found similar methylation patterns among normal samples and some tumor outliers, indicating that current histologic subtyping strategies may miss important molecular distinctions between tumors. This point was further supported in metastatic endometrioid tumors, which also contained an outlier that looked like a normal sample in its methylation patterns. Our clustering approach clearly identified a set of 500 genes that could separate the majority of serous samples from endometrioid samples and normal controls. Although the clustering was distinctive for the three main classes of samples, its use precluded a statistical evaluation of the significance of genes within the set. Nevertheless, the increased power of clustering 49 samples identified an additional 60 loci that were independent of the clustering set and segregated samples into normal or serous subtypes with statistical significance. Several of these genes correspond to networks implicated in the development of ovarian cancer (Figures S4 and S5). We investigated the overlap between gene lists of statistically significant genes identified in the supervised and unsupervised approaches and found 10 genes. Notably, the kinase PDPK1 is in the PI3K signaling pathway involved in serous ovarian cancer . PDPK1 and PLEKHF1 share a pleckstrin homology domain, capable of binding inositol polyphosphates. PARP3 is involved in DNA repair and genome stability. Given the reproducible signal from these genes regardless of method, we conclude that uncharacterized genes in this list are strongly implicated in ovarian tumor development and require additional characterization.
A limitation of our analysis is that we did not screen the tumor DNA for gene mutations or ascertain gene expression levels; nevertheless, we found that RB1 and RBAK are differentially methylated between the papillary serous and normal fallopian tube samples. RB1 was recently reported by TCGA to be involved in serous tumor etiology, through mutation or deletion in 67% of tumors . The involvement of the RB1 pathway is consistent with concurrent Rb1 and Tp53 mutation in mice, which simulates characteristics of aggressive serous ovarian cancers, including formation of ascites and metastasis . Although we did not find significant overlap with the list of methylated genes in serous tumors published by TCGA, this discordance may be due to methodological issues. For example, we do not limit the gene list to candidates that become hypermethylated and found many that lose methylation. Furthermore, we required that scoring be consistent among all tumors. TCGA limits scoring to the top 10% of tumors. Additionally, we did not limit results to genes that become silenced, as methylation has been shown to cause both positive and negative regulatory outcomes .
Our analysis of methylation profiles in ovarian and endometrial tumors indicates value in characterizing tumors at the molecular level. The methylator phenotype indicates an aberration in the molecular function of enzymes regulating DNA methylation levels and suggests that a molecule acting upstream of the candidate genes is responsible for the cascade of events leading to tumor development. Studies in hepatocellular carcinoma have identified mutations in the beta-catenin gene in association with a methylator phenotype. Mutations in beta-catenin are also common in endometrial tumors , and suggest follow-up experiments to assess a direct relationship to DNA methylation in endometrioid tumors. Moreover, therapeutic strategies aimed at preventing extensive methylation (such as 5-aza-2′-deoxycytidine) should be evaluated in the context of tumors with a methylator phenotype.
The consistency of the methylation profiles, despite independent sample preparation and data collection for TCGA samples, was used to validate and extend our results. These data show that sample batch effects are minimal and do not disrupt data consistency. Our data provide a foundation for future genomic and genetic analyses of endometrial and serous tumors for diagnostic and treatment applications. Notably, our results show that methylation levels in serous tumors are less consistent than endometrioid tumors, but increase and decrease in a target-dependent way. In contrast endometrioid tumors show extensive changes that are likely linked to a common upstream mechanism gone awry.
Materials and Methods
Ovarian, endometrial and fallopian tube tissues were received from the Magee-Womens Hospital Tissue Procurement Program (Pittsburgh, PA). The tissues were snap frozen after surgery and stored at −80°C. Genomic DNA was isolated using the Puregene Blood Kit (Qiagen) following the manufacturer's instructions. DNA quality was assessed using a SmartSpec Plus spectrophotometer (BioRad, Hercules, CA).
Endometrial normal samples
Tissue samples were provided by the Cooperative Human Tissue Network, which is funded by the National Cancer Institute. Samples are from post-menopausal individuals with atrophic endometrium and were obtained from routine hysterectomy or pelvic resection for non-endometrial cancers. DNA was isolated following the protocol of Trizol reagent (Invitrogen).
The use of human subject material was approved by the University of Pittsburgh and the Office of Human Subjects Research at the NIH.
TCGA data were downloaded at the time of this analysis from the data portal (http://tcga-data.nci.nih.gov/tcga/dataAccessMatrix.htm). Data from the same batch contained unmatched serous tumors and normal fallopian tube samples. Normals: TCGA-01-0639-11A-01D-0383-05, TCGA-01-0631-11A-01D-0383-05, TCGA-01-0642-11A-02D-0383-05, TCGA-01-0628-11A-01D-0383-05, TCGA-01-0637-11A-01D-0383-05, TCGA-01-0633-11A-01D-0383-05, TCGA-01-0630-11A-01D-0383-05, TCGA-01-0636-11A-01D-0383-05. Tumors: TCGA-09-0367-01A-01D-0359-05, TCGA-09-0365-01A-02D-0359-05, TCGA-09-0366-01A-01D-0359-05, TCGA-09-0369-01A-01D-0359-05.
DNA was treated with bisulfite according to the protocol of Zymo Research (Irvine, CA), with slight modification. One half microgram of DNA was used for each conversion reaction. The hybridization reaction was performed according to the HumanMethylation27 Illumina BeadChip protocol and scanned using an Illumina iScan System.
Experimental confidence levels were recorded as p-value estimates for each methylation ratio measurement; all readings with a corresponding p-value >0.05 were censored. These low-confidence values were not uniformly distributed in the data, therefore a few loci had an unusually large number of exclusions; we chose to completely eliminate from consideration any probe location at which values for ten or more samples were unavailable. This step eliminated 61 loci.
We performed a comparison between normal fallopian tube (control) and ovarian serous tumors. We excluded probes with poor quality control metrics from the Illumina analysis software (61 probes), and probes that had high variability within the control samples (those with variance in the top 5%, 1,379 probes). We also censored all data from the X and Y chromosomes (1,092 probes). Some overlap in these sets resulted in eliminating a total of 2,489 loci. Differential analysis by Wilcoxon summed-rank test was performed with the R function wilcox.test, followed by Bonferroni correction for the 25,102 loci tested.
Unsupervised clustering was performed in R. To select a subset of loci to use, all primary samples were pooled, and the loci were ranked by sample variance. The number of probes considered was determined empirically, based on bootstrap support for clustering results obtained for data sets of 50, 100, 250, or 500 probes. Based on apparent stability of results with 250 or more probes, the top 500 probes were used for all clustering analyses. Model-based top-down clustering was conducted with the β-mixture model described in Houseman, 2008 . K-means analysis was done with the kmeans function, using within-group sum-of-squares to select the number of clusters. Hierarchical clustering analysis used the pvclust package, with the average and complete linkage methods, and the Euclidean, Manhattan, and correlation distance metrics.
To create a classifier from the clustering results, a beta distribution was estimated for each of the 500 loci used for clustering. These distributions were estimated separately for each of four groups of samples: the endometrioid-type tumor cluster, the serous-type tumor cluster, the control-type cluster, and a cluster including both the control-type and serous-type tumor samples. These groupings allowed a nested binary decision first between endometrioid-type and all other samples, and then a second division between the serous tumor and control clusters. The test metric was calculated as the log of the ratio of probability densities at the observed methylation level for the new sample, summed over all 500 loci.
In a pairwise comparison of a tumor cluster versus the control-type cluster, we selected cases showing consistent signal in the controls and alteration in the tumors. We again excluded probes with poor quality, high variability in controls, and sex chromosome location; we also excluded all probes used in clustering for the definition of classes (total: 2812). Differential analysis was again done with Wilcoxon summed-rank test, correction for 24,766 independent tests.
Validation of differential methylation
For each probe in the set of differentially methylated sites, a threshold was chosen that maximized the discrimination between the previously identified control and serous clusters, minimizing the total number of classification errors (false positives and false negatives). 25 additional samples from ovarian serous tumors were analyzed for DNA methylation as described above, and were assessed for which group they were classified with at each of the differential sites.
Comparison of methylation intensity plots from independent replicates and samples. Technical replicates of methylation signals in normal fallopian tube, endometrial tumors or ovarian metastases from primary endometrial samples, with best linear fit.
Clustering results for hierarchical and non-hierarchical methods. At top, the tree shows the result of top-down partitioning under a beta-mixture model. Below, the six trees show results of hierarchical clustering under either complete linkage (middle row) or average linkage (bottom row), for each of 3 distance metrics. For each method, color bocks beneath the tree show the correspondence to the consensus clusters, with the control-type cluster in blue, the serous-type cluster in black, and the endometrioid cluster in red. For hierarchical methods, black dots on tree nodes indicate ≥95% confidence in that grouping under bootstrap analysis.
Box plot graph of differentially methylated loci. Each differentially methylated gene, listed from top to bottom by p-value is represented by a box plot for the methylation in the control cluster (black) and the methylation in the serous tumor cluster (red).
IPA network for Cell Cycle and Cell Morphology. Differentially methylated genes participating in the network are colored red with increasing intensity representing smaller p-values. Shapes of molecules indicate distinct molecular functions. Arrows represent direct and indirect interactions. Designations for biomarkers are highlighted in green.
IPA network for Inflammatory Response. Differentially methylated genes participating in the network are colored red with increasing intensity representing smaller p-values. Shapes of molecules indicate distinct molecular functions. Arrows represent direct and indirect interactions. Designations for biomarkers are highlighted in green.
Prepared the samples: PPG AG MS HMP. Provided biological consultation and manuscript revision: JAD LCB TCK. Conceived and designed the experiments: JAD LCB LE DLK. Performed the experiments: DLK. Analyzed the data: JAD TCK LCB LE DLK. Contributed reagents/materials/analysis tools: PPG AG MS. Wrote the paper: DLK LE.
- 1. Lengyel E (2010) Ovarian cancer development and metastasis. Am J Pathol 177: 1053–1064.
- 2. Wu R, Hendrix-Lucas N, Kuick R, Zhai Y, Schwartz DR, et al. (2007) Mouse model of human ovarian endometrioid adenocarcinoma based on somatic defects in the Wnt/beta-catenin and PI3K/Pten signaling pathways. Cancer Cell 11: 321–333.
- 3. Marquez RT, Baggerly KA, Patterson AP, Liu J, Broaddus R, et al. (2005) Patterns of gene expression in different histotypes of epithelial ovarian cancer correlate with those in normal fallopian tube, endometrium, and colon. Clin Cancer Res 11: 6116–6126.
- 4. Folkins AK, Jarboe EA, Roh MH, Crum CP (2009) Precursors to pelvic serous carcinoma and their clinical implications. Gynecol Oncol 113: 391–396.
- 5. Shih Ie M, Kurman RJ (2004) Ovarian tumorigenesis: a proposed model based on morphological and molecular genetic analysis. Am J Pathol 164: 1511–1518.
- 6. Cho KR (2009) Ovarian cancer update: lessons from morphology, molecules, and mice. Arch Pathol Lab Med 133: 1775–1781.
- 7. Wong AS, Auersperg N (2003) Ovarian surface epithelium: family history and early events in ovarian cancer. Reprod Biol Endocrinol 1: 70.
- 8. Vigano P, Somigliana E, Parazzini F, Vercellini P (2007) Bias versus causality: interpreting recent evidence of association between endometriosis and ovarian cancer. Fertil Steril 88: 588–593.
- 9. Mandai M, Yamaguchi K, Matsumura N, Baba T, Konishi I (2009) Ovarian cancer in endometriosis: molecular biology, pathology, and clinical management. Int J Clin Oncol 14: 383–391.
- 10. van Niekerk CC, Bulten J, Vooijs GP, Verbeek AL (2010) The Association between Primary Endometrioid Carcinoma of the Ovary and Synchronous Malignancy of the Endometrium. Obstet Gynecol Int 2010: 465162.
- 11. Ricci R, Komminoth P, Bannwart F, Torhorst J, Wight E, et al. (2003) PTEN as a molecular marker to distinguish metastatic from primary synchronous endometrioid carcinomas of the ovary and uterus. Diagn Mol Pathol 12: 71–78.
- 12. Crijns AP, Fehrmann RS, de Jong S, Gerbens F, Meersma GJ, et al. (2009) Survival-related profile, pathways, and transcription factors in ovarian cancer. PLoS Med 6: e24.
- 13. Adib TR, Henderson S, Perrett C, Hewitt D, Bourmpoulia D, et al. (2004) Predicting biomarkers for ovarian cancer using gene-expression microarrays. Br J Cancer 90: 686–692.
- 14. Fehrmann RS, Li XY, van der Zee AG, de Jong S, Te Meerman GJ, et al. (2007) Profiling studies in ovarian cancer: a review. Oncologist 12: 960–966.
- 15. Guirguis A, Elishaev E, Oh SH, Tseng GC, Zorn K, et al. (2008) Use of gene expression profiles to stage concurrent endometrioid tumors of the endometrium and ovary. Gynecol Oncol 108: 370–376.
- 16. Vang R, Shih Ie M, Kurman RJ (2009) Ovarian low-grade and high-grade serous carcinoma: pathogenesis, clinicopathologic and molecular biologic features, and diagnostic problems. Adv Anat Pathol 16: 267–282.
- 17. McCluggage WG (2008) My approach to and thoughts on the typing of ovarian carcinomas. J Clin Pathol 61: 152–163.
- 18. Singer G, Kurman RJ, Chang HW, Cho SK, Shih Ie M (2002) Diverse tumorigenic pathways in ovarian serous carcinoma. Am J Pathol 160: 1223–1228.
- 19. Kolasa IK, Rembiszewska A, Janiec-Jankowska A, Dansonka-Mieszkowska A, Lewandowska AM, et al. (2006) PTEN mutation, expression and LOH at its locus in ovarian carcinomas. Relation to TP53, K-RAS and BRCA1 mutations. Gynecol Oncol 103: 692–697.
- 20. Petitjean A, Achatz MI, Borresen-Dale AL, Hainaut P, Olivier M (2007) TP53 mutations in human cancers: functional selection and impact on cancer prognosis and outcomes. Oncogene 26: 2157–2165.
- 21. Singer G, Stohr R, Cope L, Dehari R, Hartmann A, et al. (2005) Patterns of p53 mutations separate ovarian serous borderline tumors and low- and high-grade carcinomas and provide support for a new model of ovarian carcinogenesis: a mutational analysis with immunohistochemical correlation. Am J Surg Pathol 29: 218–224.
- 22. Senturk E, Cohen S, Dottino PR, Martignetti JA (2010) A critical re-appraisal of BRCA1 methylation studies in ovarian cancer. Gynecol Oncol 119: 376–383.
- 23. Balch C, Fang F, Matei DE, Huang TH, Nephew KP (2009) Minireview: epigenetic changes in ovarian cancer. Endocrinology 150: 4003–4011.
- 24. D'Andrea AD (2010) Susceptibility pathways in Fanconi's anemia and breast cancer. N Engl J Med 362: 1909–1919.
- 25. Houshdaran S, Hawley S, Palmer C, Campan M, Olsen MN, et al. (2010) DNA methylation profiles of ovarian epithelial carcinoma tumors and cell lines. PLoS One 5: e9359.
- 26. Shih Ie M, Chen L, Wang CC, Gu J, Davidson B, et al. (2010) Distinct DNA methylation profiles in ovarian serous neoplasms and their implications in ovarian carcinogenesis. Am J Obstet Gynecol 203: 584.e581–522.
- 27. Steemers FJ, Chang W, Lee G, Barker DL, Shen R, et al. (2006) Whole-genome genotyping with the single-base extension assay. Nat Methods 3: 31–33.
- 28. Kawakami T, Zhang C, Taniguchi T, Kim CJ, Okada Y, et al. (2004) Characterization of loss-of-inactive X in Klinefelter syndrome and female-derived cancer cells. Oncogene 23: 6163–6169.
- 29. Donninger H, Bonome T, Radonovich M, Pise-Masison CA, Brady J, et al. (2004) Whole genome expression profiling of advance stage papillary serous ovarian cancer reveals activated pathways. Oncogene 23: 8065–8077.
- 30. Plumb JA, Finn PW, Williams RJ, Bandara MJ, Romero MR, et al. (2003) Pharmacodynamic response and inhibition of growth of human tumor xenografts by the novel histone deacetylase inhibitor PXD101. Mol Cancer Ther 2: 721–728.
- 31. Rask K, Zhu Y, Wang W, Hedin L, Sundfeldt K (2006) Ovarian epithelial cancer: a role for PGE2-synthesis and signalling in malignant transformation and progression. Mol Cancer 5: 62.
- 32. Ni X, Huang K-C, Ng W-C, Ng S-K, Mok SC, et al. (2005) Transcriptional and proteomic profiling of secreted and membrane proteins in ovarian cancer. 102-c p.
- 33. Houseman EA, Christensen BC, Yeh RF, Marsit CJ, Karagas MR, et al. (2008) Model-based clustering of DNA methylation array data: a recursive-partitioning algorithm for high-dimensional data arising as a mixture of beta distributions. BMC Bioinformatics 9: 365.
- 34. Issa JP (2004) CpG island methylator phenotype in cancer. Nat Rev Cancer 4: 988–993.
- 35. Noushmehr H, Weisenberger DJ, Diefes K, Phillips HS, Pujara K, et al. (2010) Identification of a CpG island methylator phenotype that defines a distinct subgroup of glioma. Cancer Cell 17: 510–522.
- 36. Whitcomb BP, Mutch DG, Herzog TJ, Rader JS, Gibb RK, et al. (2003) Frequent HOXA11 and THBS2 promoter methylation, and a methylator phenotype in endometrial adenocarcinoma. Clin Cancer Res 9: 2277–2287.
- 37. Huang J, Zhang L, Greshock J, Colligon TA, Wang Y, et al. (2011) Frequent genetic abnormalities of the PI3K/AKT pathway in primary ovarian cancer predict patient outcome. Genes Chromosomes Cancer 50: 606–618. LID - 610.1002/gcc.20883 [doi].
- 38. Cancer Genome Atlas Research Network (2011) Integrated genomic analyses of ovarian carcinoma. Nature 474: 609–615. LID - 610.1038/nature10166 [doi].
- 39. Flesken-Nikitin A, Choi KC, Eng JP, Shmidt EN, Nikitin AY (2003) Induction of carcinogenesis by concurrent inactivation of p53 and Rb1 in the mouse ovarian surface epithelium. Cancer Res 63: 3459–3463.
- 40. Rishi V, Bhattacharya P, Chatterjee R, Rozenberg J, Zhao J, et al. (2010) CpG methylation of half-CRE sequences creates C/EBPalpha binding sites that activate some tissue-specific genes. Proc Natl Acad Sci U S A 107: 20311–20316.