Barrett's esophagus (BE) is a metaplastic precursor lesion of esophageal adenocarcinoma (EA), the most rapidly increasing cancer in western societies. While the prevalence of BE is increasing, the vast majority of EA occurs in patients with undiagnosed BE. Thus, we sought to identify genes that are altered in BE compared to the normal mucosa of the esophagus, and which may be potential biomarkers for the development or diagnosis of BE.
We performed gene expression analysis using HG-U133A Affymetrix chips on fresh frozen tissue samples of Barrett's metaplasia and matched normal mucosa from squamous esophagus (NE) and gastric cardia (NC) in 40 BE patients.
Using a cut off of 2-fold and P<1.12E-06 (0.05 with Bonferroni correction), we identified 1324 differentially-expressed genes comparing BE vs NE and 649 differentially-expressed genes comparing BE vs NC. Except for individual genes such as the SOXs and PROM1 that were dysregulated only in BE vs NE, we found a subset of genes (n = 205) whose expression was significantly altered in both BE vs NE and BE vs NC. These genes were overrepresented in different pathways, including TGF-β and Notch.
Our findings provide additional data on the global transcriptome in BE tissues compared to matched NE and NC tissues which should promote further understanding of the functions and regulatory mechanisms of genes involved in BE development, as well as insight into novel genes that may be useful as potential biomarkers for the diagnosis of BE in the future.
Citation: Hyland PL, Hu N, Rotunno M, Su H, Wang C, Wang L, et al. (2014) Global Changes in Gene Expression of Barrett's Esophagus Compared to Normal Squamous Esophagus and Gastric Cardia Tissues. PLoS ONE 9(4): e93219. https://doi.org/10.1371/journal.pone.0093219
Editor: Wayne A. Phillips, Peter MacCallum Cancer Centre, Australia
Received: January 17, 2013; Accepted: March 3, 2014; Published: April 8, 2014
This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Funding: This research was supported by Intramural Research Program of the National Institutes of Health, National Cancer Institute, the Division of Cancer Epidemiology and Genetics, the Cancer Prevention Fellowship Program, Division of Cancer Prevention, NCI (to P.L.H); and Health and Social Care (HSC), Northern Ireland, UK (to P.L.H). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have the following interests: Co-author Carol Giffen is employed by Information Management Services, Inc., a biomedical information management company that has contractually served as a biomedical and computing support resource for the National Cancer Institute for more than 30 years. Co-author Barbara Gherman is employed by Westat, a company that has contractually supported clinical research for the NCI for more than 30 years. There are no patents, products in development or marketed products to declare. This does not alter the authors' adherence to all the PLoS ONE policies on sharing data and materials.
Esophageal cancer is the sixth leading cause of cancer-related deaths worldwide and exhibits dramatic geographic differences in distribution in incidence and histological subtype . Over the last 35 years in the United States (USA), the incidence of esophageal adenocarcinoma (EA) has increased from 0.4 to more than 3 per 100,000 person-years, a 650% increase –. When gastroesophageal reflux disease (GERD) induces inflammation in the normal esophageal (NE) squamous epithelium, the damaged squamous cells are usually replaced by regeneration of more squamous cells. In some individuals, however, the reflux-damaged NE heals through a metaplastic process in which intestinal-type columnar cells (specialized intestinal metaplasia, SIM) replaces the reflux-damaged squamous epithelium . This metaplasia results in Barrett's esophagus (BE), which is the recognized precursor lesion to EA –, and increases the risk of EA by 11 times compared to that of the general population .
In addition to male sex and Caucasian race, the most well-documented risk factors for BE include increasing age, cigarette use, obesity, and a lack of Helicobacter pylori (H. pylori) infection , . However, the strongest risk factor for BE is GERD , which is also a primary risk factor for EA . A population-based study from Sweden  and a computer simulation using the Surveillance Epidemiology and End Results (SEER) data , suggest that 1.5–5.6% of the general population of western societies have BE. However, recent data indicate that the incidence of BE has increased and is rising , . Coleman et al. reported that average annual incidence rate of BE increased by 159% in the UK from 1993–2005, with a marked increase in individuals over 60 years, and particularly amongst males under 40 years . Although, less than 5% of patients with BE will go on to develop EA, it is generally accepted that most persons with BE are undiagnosed and the vast majority of EA occurs in patients with undiagnosed BE. For example, in the US a high prevalence of BE (7–25% for segments of any length) was reported in asymptomatic patients who agreed to have an upper gastrointestinal endoscopy screening when attending for colonoscopy . Furthermore, the most commonly used means of early detection of EA is endoscopic examination ; however at a population level this approach for screening is neither feasible nor cost-effective.
Previous biochemical studies have shown that BE has features in common with gastric mucosa, including mucus secretory capacity, mucus granules, and expression of columnar cell cytokeratins , but it also shares features with squamous esophageal cells, its presumed precursor, including expression of squamous cell cytokeratins , . In an attempt to advance our understanding of the etiology of BE and its progression to EA at the molecular level, as well as to identify potential gene targets for evaluation as diagnostic markers, numerous studies have reported on the differential expression of genes between BE and EA tissues and also between BE and normal squamous esophagus (NE) tissues –. However, only two studies to date have included a comparison of matched BE, NE, and normal gastric cardia (NC) tissues from the same patient. In 2002, Barrett et al.  compared the gene expression profiles from pooled BE, NE, and NC tissues using the HU6800 microarray, while a subsequent study  compared transciptomes from each of these tissues using serial analysis of gene expression (SAGE). Both of these techniques for quantitating gene expression have their limitations; the HU6800 microarray is specific, but contains probes for only 7000 genes in the genome, while SAGE, which is highly specific and very reproducible for abundant transcripts, can be prone to sequencing errors.
In this study we used Affymetrix HG-U133A microarrays on fresh frozen matched BE, NE, and NC tissues from 40 BE patients to evaluate differentially-expressed genes between these tissues. Specifically, we compared gene expression between BE and NE tissues and between the BE and NC tissues and identified major genes that were differentially expressed, using stringent criteria for gene selection (fold-change >2.0 and P<1.12E-06). We then performed Gene Ontology classification and pathway-based analyses to identify functional groups, pathways, and key regulators among the identified genes.
Ethics statement, study population and tissue collection
This study was approved by the Institutional Review Boards of the Walter Reed National Military Medical Center (WRNMMC) and the National Cancer Institute (NCI), USA. BE patients were recruited as part of the Barrett's Esophagus Early Detection Study (BEEDS), a case-control study conducted among patients presenting to the Gastroenterology Department at the WRNMMC in Bethesda, Maryland, USA. After obtaining written informed consent, patients were interviewed to obtain information on demographic, lifestyle, and clinical factors, other clinical data were retrieved from the medical record, a blood sample was taken, and tissue samples were obtained during endoscopy.
Clinical data were collected for each patient by an attending nurse and a GERD questionnaire (modified from Manterola et al. ) was administered to all participants. Esophagogastroduodenoscopy (EGD) was performed on all patients, using a GIF-Q180 gastroscope (Olympus). During endoscopy, a gastroenterologist used disposable forceps to obtain multiple mucosal biopsies of normal gastric cardia (NC; within one cm distal to the gastroesophageal junction or the top of the gastric folds), Barrett's esophagus (BE; four quadrant biopsies in accord with surveillance guidelines; if present), and normal esophageal (NE) squamous tissue (at 30 cm from the gums) from the same patient. The gastroenterologist first obtained a clinical biopsy that was placed in formalin for use in determining histological diagnosis. A second research biopsy was then taken as close as visually/endoscopically possible to the clinical biopsy. The research biopsy was snap frozen in liquid nitrogen and stored at −130°C until required for RNA extraction.
RNA preparation and microarray methods
RNA was extracted from whole frozen tissue biopsies using Trizol (Life Technologies) according to manufacturers' instructions; RNA purity and quantity were determined using an RNA 6000 Labchip/Agilent 2100 Bioanalyzer (Agilent Technology, Inc.). Each microarray experiment was carried out using 15 μg of total RNA and probes were prepared as described previously . Twenty micrograms of biotinylated cDNA were applied to each GeneChip Human Genome U133A 2.0 hybridization array and all tissue RNAs from the same BE patient were processed on separate arrays in the same batch. After hybridization at 45°C overnight, arrays were developed with phycoerythrin-conjugated streptavidin by using a fluidics station (Genechip Fluidics Station 450) and scanned (Genechip Scanner 3000) to obtain quantitative gene expression levels . Paired BE and normal tissue specimens from each patient were processed simultaneously throughout the RNA extractions and hybridizations.
Validation of microarray results by real-time RT-PCR
A total of nine genes (Table S6) with a >2-fold expression difference either in BE vs NE and/or BE vs NC were selected for technical validation and independent replication. We carried out a technical validation of two random genes (CD36 and SLC6A14) as well as an independent validation of seven other genes (ABP1, ATP2C2, CALML4, HOXB7 KRT7, MSLN, and TFF3), six of which have previously been reported to be differentially expressed in BE and implicated in its development. Increased expression of the remaining gene MSLN has been reported in other cancers and was significantly upregulated in both comparison groups from our array data. For each gene target, real-time quantitative PCR (qPCR) was carried out using cDNA from BE-, NE-, and NC-matched RNAs. Amplification conditions yielded efficiencies >90% and linear regression coefficients >0.990 for all assays which were carried out as previously described in http://docs.appliedbiosystems.com/search. All reactions were performed in triplicate using commercially available kits (Applied Biosystems Inc.). GAPDH was used as the internal control and PCRs were carried out on an ABI Prism 7000 Sequence Detection System. The average CT was calculated for each gene evaluated and GAPDH, and the ΔCT was determined as the mean of the triplicate CT values for the evaluated gene minus the mean of the triplicate CT values for GAPDH . The N-fold differential expression of the evaluated gene for a BE sample compared with its normal epithelial counterpart was expressed as 2−ΔΔCT (formula ΔΔCT = ΔCT of BE−ΔCT of its normal epithelial counterpart), which represents the fold change in the target gene expression in BE normalized to an internal control gene (GAPDH) and relative to the normal comparator tissues (NE and NC epithelial tissues, respectively).
Statistical analyses were carried out using R program language (http://www.r-project.org/). Gene expression data were processed and normalized using the Bioconductor Affy package, based on the Robust Multichip Average (RMA) method for single-channel Affymetrix chips. The GEO accession number for this array set is GSE39491. All 22,277 probe sets based on the RMA summary measures were used in class comparison analyses. For analyses including paired tissues (BE vs NE and BE vs NC from the same subjects), a linear mixed effects model was used to account for intra-person correlation. For comparative purposes at the individual probe level, we focused on gene probes with P-value <1.12E-06 (0.05/44,554 probes, Bonferroni corrected two-sided) and fold-change ≥2. Fold-change (fc) was defined as 2β, where log2 expression = α+β×metaplasia status. Principal component analysis (PCA) was conducted to explore global differences in gene expression profiles.
Ontology group classification and molecular pathway construction using Pathway Studio 9.0
Gene ontology (GO)  was used for functional classifications of genes significantly differentially expressed in both BE vs NE and BE vs NC comparative sets, regardless of direction. We further analyzed this gene set using Pathway Studio software (version 9.0) (Ariadne Genomic, Rockville, MD) (http://www.ariadnegenomics.com/products/pathway-studio/) and the Fisher's exact test. Pathway Studio 9.0 is a text-mining tool that detects relationships among genes, proteins, cell processes, and diseases as recorded in the PubMed database. Pathway Studio 9.0 constructs common regulatory networks by searching the Medscan Database for reported interactions. These analyses allowed us to identify signaling pathways as well as potential regulatory transcription factors and nuclear receptors enriched in our data set.
Characteristics of BEED study population
We analyzed 120 tissue samples (BE, NE, and NC) from 40 BE patients. A single pathologist expert in gastrointestinal pathology confirmed all histologic diagnoses of BE from paired adjacent biopsies that were formalin fixed, paraffin-embedded, and H&E stained. BE cases were predominantly male (74%), overweight (median BMI 27.9), and had a mean age of 54.5 years. In addition, alcohol drinking (85%) and smoking (58%) were common amongst BE patients (Table 1). Eighty three percent of BE patients reported having symptoms of GERD (Table 1). Eighty-eight percent (35/40) of BE patients were taking acid suppressants at the time of endoscopic biopsy. Detailed demographic and risk factor information for BE cases are shown in Table S1. Also, endoscopy findings together with pathology findings for BE, NE, and NC biopsies from BE cases (N = 40 cases) are shown in Tables S2 and S3. BE biopsies in cases were all non-dysplastic except for one with high-grade dysplasia and a second that was indeterminate for dysplasia.
Microarray experimental quality control
In the present study, we used the HG_U133A 2.0 hybridization array which contains 22,227 probes sets representing 14,500 genes with multiple probe sets for the same genes (Affymetrix). We assayed hybridization quality by using the Affymetrix GCOS software. The average MAS5 Present call of the 129 HG_U133A chips from the 40 BE patients was 49.9% (range 34.8%–61.8%). The following Affymetrix quality assessment metrics were carried out for all chips: scaling factors; 3′ to 5′ ratio in Beta-Actin and GAPDH; RNA degradation; relative log expression plot; normalized unscaled standard error plot; and chip pseudo-images based on a probe level model (PLM) fit. All chips passed the quality check tests and were used in these analyses. The samples were processed and normalized with the Robust Multichip Average (RMA) method, including background adjustment, quantile normalization, and median polish summarization. No data filtering was applied and all probe sets based on RMA summary measures were used in the analyses.
Global gene expression signatures
Gene probes showed significantly different expression levels among the three tissue types and for each of the comparative analyses (i.e., BE vs NE, BE vs NC). Using a 2 fc cutoff and a Bonferroni correction threshold of 1.12E-06 (0.05/44,554 probe comparisons), a total of 2427 gene probes showed significant differential expression between BE and NE, and BE and NC tissue types, of which, 1645 (967 upregulated and 678 downregulated probes) were between BE and NE tissues, and 782 (491 upregulated and 290 downregulated probes) were between BE and NC tissues (Tables S4 and S5, respectively). PCA analyses of the differentially-expressed probe sets resulted in separation of the samples into their respective tissue type groups. Separation appeared greater for differentially-expressed probes in the BE vs NE comparison than for BE vs NC (Figure 1). Compared to BE vs NE, BE vs NC showed fewer differentially-expressed probes (Table S5); these probes corresponded to 649 differentially-expressed genes (408 upregulated and 241 downregulated) (Figure 2). In contrast, the 1645 differentially-expressed probes identified between BE vs NE tissues corresponded to 1324 genes or gene regions (785 upregulated and 539 downregulated). Summaries showing the top 50 differentially-expressed genes in BE vs NE and BE vs NC tissues are listed in Tables 2 and 3, respectively. Sensitivity analyses that compared these results with results that excluded the single case diagnosed with high-grade dysplasia showed virtually identical results (data not shown).
PCA was applied to each set of differentially-expressed probes to reduce the dimensionality of the microarray data with respect to individual samples. Phenotypic subgroups or tissues (BE, NE, and NC) can be differentiated from each other in BE patients, although there is some mixing of the phenotypes, particularly between BE and NC, which are more similar in terms of gene expression profiles. Color key: Blue = NC, Yellow = NE, Red = BE.
A. The left Venn diagram represents the total number of genes with significant differential expression between BE and NE (≥2 fc and P<1.12E-06), whereas the right Venn diagram represents the total number with genes significant differential expression between BE and NC (≥2 fc and P<1.12E-06). The overlap between the two differentially-expressed sets contained 205 genes used for functional classification and pathway-based analyses. B. Enrichment of the 205 genes in known biological processes (GO, Ariadne Genomics, MD). The columns entitled: ‘Total no of genes in process’ and ‘Number of genes involved in process (% overlap)’ refer to the total number of genes currently known in each process according to the database (Pathway Studio 9.00), and the % overlap of the 205 genes with the total gene number in each process, respectively. Enrichment of the 205 genes identified in each process relative to the whole gene set is also shown. The largest proportion of genes (7.3%) out of the 205 genes were involved in response to drugs and cell adhesion; however, relative to the overall number of genes in the identified process, cellular response to insulin showed the greatest enrichment.
Between BE vs NE tissues, mucin 5AC (MUC5AC), carbonicanhydrase II (CA2), and claudin 18 (CLDN18) genes had the most significant differentially-expressed levels (P<1.5E-15, Table 2), while MUC5AC (69 fc), serine peptidase inhibitor, Kazal type 1 (SPINK1, 58 fc), and CLDN18 (54 fc) had the greatest fold-change in expression levels (Table 2 and Table S4). Other significantly differentially-expressed genes in BE vs NE included gastrokine 1 (GKN1, 34 fc, P = 2.72E-15), TFF2 (37 fc, P = 5.47E-15), SULT1C2 (24 fc, P = 4.86E-15) prominin 1 (PROM1, 27 fc, P = 7.10E-16), and trefoil factor 1 (TFF1, 40.0 fc, P = 6.17E-14) (Table 2). Likewise, statistical significance was greatest in the BE vs NC tissue expression comparisons for diacylglycerol kinase, alpha 80 kDa (DGKA), sulfotransferase family cytosolic, 2A (SULT2A1), homobox C6 (HOXC6), S100 calcium binding protein A10 (S100A10) (all P<1.0E-12) (Table 3); while the fold changes of the highest magnitude were observed for ATPase, H+/K+ exchanging, beta polypeptide (ATP4B, 0.04 fc); chitinase, acidic (CHIA, 0.04 fc); and keratin 13 (KRT13, 31.0 fc) (Table S5). Other significantly differentially-expressed genes in BE vs NC tissues included P21 protein-activated kinase 3 (PAK3, 0.4 fc, P = 1.05E-12), regulator of G-protein signaling 7 (RGS7) (IGKC, 20 fc, P = 1.19E-13), and carcinoembryonic antigen-related cell adhesion molecule 6 (CEACAM6, 21 fc, P = 6.64E-12) (Table S5).
Validation of microarray results by qRT-PCR
We validated the microarray expression of nine genes from matched BE, NE, and NC tissues using TaqMan qRT-PCR. Six of these genes, which were validated in independent tissue triplets (120 samples total) in the current study, have previously been reported in the development of BE (Table S6). In summary, validations were in agreement with microarray results and details of probes, kits, tissue numbers and significant expression levels are presented in Table S6.
Pathway and regulatory network analysis of the common differentially expressed genes in BE vs NE and BE vs NC
We determined the overlap of differentially expressed genes between the BE vs NE and BE vs NC sets and identified 205 genes (Figure 2 and Table S7). We then used Pathway Studio 9.0 to determine the functional groups, processes, upstream regulators, and pathways that were overrepresented in this common gene set. A number of functional categories were found in the 205 genes, including genes encoding nucleotide binding proteins, hydrolase activity, and GTP binding (Table S8). Evaluation of biological processes related to the 205 differentially expressed genes indicated that the greatest number of gene subsets were involved in response to drugs, cell adhesion, small GTPase-mediated signaling and lipid metabolic processes (Figure 2 and Table S8). However, the proportion of genes identified in relation to the total number of genes involved in each process according to the database was greatest for cellular response to insulin stimulus, followed by epidermis development, and negative regulation of endopeptidase activity. Analyses of the 205 genes revealed numerous relationships that could be mediated through a number of key upstream regulatory proteins, including TGF-β, IL1B, TP53, and INS (Figure 3 and Table S8); the direction and/or magnitude of expression were not the same for a number of genes in the specific networks between the BE vs NE and the BE vs NC comparative groups (Figure 3). We also aimed to determine if the 205 common differentially expressed genes correlated with specific signaling pathways, and found that the greatest number of these genes (n = 67) mapped to the Atlas of Signaling pathway, a single overview pathway depicting the main cellular signal transduction channels (from receptors to transcription factors). Other common pathways included cell cycle regulation (n = 24 genes), Notch (n = 19 genes), and the guanylate cyclase pathway (n = 18 genes); these latter two pathways represent those containing the greatest proportion of - genes (Table S8) relative to the total number of genes in the respective pathway (Pathway Studio 9.0). The top ten hits for all pathway-based analyses are shown in Table S8.
Two hundred and five overlapping genes were identified between BE vs. NE and BE vs. NC tissues after Bonferroni correction (P<1.12-E06) and with a 2-fold or greater differential expression. Diagram illustrates the TGF-β1 signaling pathway in BE vs NE and BE vs NC comparative groups. The direction and/or magnitude of expression is inverse for thirty two genes in the TGF-β1 network between the comparative groups. Genes include: SERPINB2, TMOD3, EPHA2, KRT15, ADCY7, SST, FMOD, CD36, CA9, ALCAM, IL1RN, TIMP2, CAST, MITF, SPRR1A, BHLHE40, MCL1, IL13RA1, ACPP, SRD5A1, NR2F2, ALDH1A1, DST, TJP1, CD9, TGFA, MUC4, PMAIP1, ZFP36, CCNG2, IL18 and CHGA. Data source: Signaling Pathways, Ariadne Pathways. Primary red and blue colors and shading indicate the direction and degree of differential expression with pink to red indicating degrees of increasing upregulation and light blue to darker blue indicating increasing downregulation. Grey indicates a gene product that is part of the pathway, but is absent in the experimental list tested. Abbreviations: BE, Barrett's; NE, normal squamous epithelium; NC, normal gastric cardia epithelium.
There is no accepted hypothesis for the molecular mechanism underlying the development of BE. However, identifying genes that are differentially expressed in BE compared to normal esophageal (NE) squamous epithelium and normal gastric cardia (NC) epithelium should improve our understanding of the biology of BE development and may also identify genes whose expression may be useful in the diagnosis and clinical management of BE. The results of the present study extend previous findings indicating that BE shares phenotypic elements with normal epithelia from both the squamous esophagus and the gastric cardia , . Importantly, the analyses of matched tissue samples from a large number of BE patients using a high-density microarray allowed us to evaluate the expression levels of 14,500 well-annotated genes in BE, NE, and NC tissues from the same patients.
In relation to individual gene expression changes between paired BE vs NE, and BE vs NC tissues from BE patients, we identified more significant genes than have been reported previously , . PCA analysis of these dysregulated genes suggests that there are more similarities in gene expression profiles between BE and NC than between BE and NE.
In support of previous data, we identified many genes previously reported to be associated with BE metaplasia. For example, in BE vs NE, intestinal markers such as trefoil factors (TFF) 1, 2, and 3 were upregulated as were mucins , , particularly MUC5AC, which has been associated with wound healing. We observed increased expression of lysozyme, a potent non-immunological antibacterial enzyme previously shown to be upregulated in BE . We also observed increased expression of CLDN18 and CLDN10 in BE vs NE . Similarly, in BE vs NC, the expression of claudin and mucin genes were altered. In agreement with other data –, , keratin expression profiles revealed numerous changes in BE metaplasia compared to NE (e.g., KRTs 4, 8, 15, and 20) and NC (KRTs 5, 6, 13, 14, and 15) tissues.
Current evidence suggests the conversion of NE to BE metaplasia can arise from four potential mechanisms. The first mechanism is transdifferentiation which involves the irreversible switch of mature squamous esophageal epithelial cells to another differentiated cell , . The intestinal epithelial-associated claudel-type homeobox (CDX) transcription factors CDX1 and CDX2 have been implicated in the pathogenesis of BE and in the transdifferentiation of stratified squamous epithelia into columnar intestinal epithelia . In the present study, we did not observe significant expression changes in CDX2 mRNA in either BE vs NE (1.1 fc, P = 0.36) or in BE vs NC (1.1 fc, P = 0.90) tissue comparisons. In addition, at the individual or BE patient level, only 11 of 40 BE tissue comparisons had a ≥1.5 fc increase in CDX2 mRNA compared to matched NE tissues. While data from smaller qRT-PCR studies ,  previously suggested an increase in CDX2 mRNA in BE, current data from array profiling suggests that CDX2 mRNA is not dramatically upregulated in BE vs NE –, , , but paradoxically, CDX2 protein is overexpressed in most BEs , . However, we did observe a significant overexpression of CDX1 mRNA in BE vs NE (1.38 fc, P = 0.008) and BE vs NC (1.28 fc, P = 0.02) comparisons as well as other more significant classical gene markers of BE (e.g., Villin, MUC2, MUC5B, KRT20, and CLDN18) , , . A second mechanism  for the development of BE involves opportunistic cell lineage, whereby unique embryonic progenitor cells existing in the squamocolumnar junction physically migrate to replace damaged p63-deficient squamous cells in the adult esophagus. Several studies have shown that p63 expression, which is critical for the development and differentiation of normal esophageal epithelia , is lost in BE compared to NE . We also observed significantly decreased expression of p63 in BE vs NE, while p63 expression was significantly increased in BE vs NC. Interestingly, Wang et al.  also reported that CDX2 was not upregulated in murine esophageal cells lacking p63, in spite of the columnar phenotype of the cells. More recent evidence suggests a third and potential epigenetic mechanism which may be at work in the development of BE and involves alteration of HOX gene expression . In agreement with di Pietro et al. , we also observed significant expression changes in the HOX genes in BE vs NE and BE vs NC. In particular, we observed increased expression of HOXB5, HOXB6, and HOXB7 and significant activation of the downstream intestinal markers KRT8, KRT18, and KRT20 in BE vs NE.
The final proposed mechanism suggests that BE may develop from the conversion of a tissue-specific stem or pluripotential cell in the esophagus (e.g., a bone marrow-derived pluripotential stem cell), which has the capacity for unlimited or prolonged self-renewal . We detected significantly increased expression of prominin-1 (PROM1) in BE vs NE but not in BE vs NC. PROM1 (also known as CD133) is a suggested marker for intestinal stem cells that are susceptible to neoplastic transformation and is recognized as a stem cell marker in several tissues and in many cancers . The sex-determining region Y (Sry) box-containing (SOX) factors are a family of transcription factors that are emerging as potent regulators of stem cell maintenance and cell fate decisions in multiple organ systems . While SOX2 is essential for the maintenance of embryonic stem cells ,  its expression is also essential for the normal development of the NE . Also, expression of SOX9 in NE cells is sufficient to drive columnar differentiation of squamous epithelium and expression of an intestinal differentiation marker, reminiscent of BE . An increased expression of SOX9 and SOX2 protein has also been described for EA tumor cell lines compared to BE cells . We found that SOX9 (3.37 fc and P = 4.05E-12) and SOX4 mRNAs (2.88 fc and P = 3.89E-11) were upregulated, while SOX2 (0.46 fc and P = 9.36E-07) and SOX15 mRNAs (0.34 fc and P = 3.12E-09) were significantly downregulated in BE tissues compared to NE, but not in BE vs NC. Interestingly, downregulation of SOX2 can lead to an intestinal phenotype in gastric epithelial cells via downregulation of MUC and CDX expression .
Besides individual genes in each tissue comparison, we also determined which genes were dysregulated in both tissue comparison groups. These 205 common differentially expressed genes were overrepresented by genes involved with nucleotide binding and GTP binding/activity as well as peptidase inhibitor activity. Precursors of pepsinogen A and pepsinogen C (progastricsin) have been demonstrated in BE epithelium . We observed an 18-fold upregulation of pepsinogen C in BE vs NE and a 5.3 fold downregulation in BE vs NC, results that may reflect the prevalence of GERD (64%) in the BE patients in the study. An overrepresentation of peptidase genes in BE compared to NE (and in EA compared to NE) was previously reported by Greenawalt et al. . Recent but limited data suggests that the involvement of insulin signaling may be important for BE development, particularly in the progression from BE to EA via increased expression of insulin-like growth factor 1 receptor (IGF1R) . While we did not observe increased expression of IGFR1 in either tissue comparison in this study, cellular response to insulin showed the greatest enrichment of differentially-expressed genes; further, we observed a significant upregulation of insulin receptor mRNA levels in BE vs NE (4.6 fc) and BE vs NC (1.8 fc).
Performing pathway-based analysis of the expression of the 205 genes commonly dysregulated in both comparison groups allowed us to evaluate relationships between genes and their encoding proteins as well as the associated pathways in which these proteins are involved. Analyses of the 205 genes differentially expressed in both comparison groups and potential upstream regulators revealed connectivity between many of the genes associated with BE status. However, for the majority of the gene relationships this connectivity appeared to be mediated through a number of key upstream regulatory factors that included TGF-β1 and INS. Interestingly, the direction of differential expression of each gene downstream of the regulator was different between the two comparison groups and in some cases was the inverse (Figure 3). In particular, more genes were downregulated in the TGF-β1 network in BE vs NE compared to BE vs NC (Figure 3), suggesting a loss of TGF-β signaling in the former. Several investigators have reported impaired TGF-β signaling in the BE metaplasia-dysplasia-adenocarincoma sequence , , . Mendelson et al.  recently evaluated TGF-β and Notch signaling in NE and BE tissues using immunohistochemistry. They found further evidence of loss of TGF-β in BE (and BE-associated EA) as well as activation of Notch in BE-associated EA compared to normal squamous epithelium as characterized by increased expression of HES-1 and JAG1 proteins. We also observed enrichment of 19 genes in the Notch signaling pathway (Table S8). The majority of Notch pathway genes (23 of 32) were downregulated in BE vs NE. Previous evidence suggests that in esophageal squamous cells, Notch signaling is growth repressive . Also, in contrast, 23 of 32 Notch-associated genes were significantly upregulated in BE vs NC. Thus, in agreement with other studies, the present results suggest that a loss of TGF-β signaling as well as Notch signaling may be important in the development of BE metaplasia, possibly by disrupting the ability of cells to differentiate or maintain the differentiated state. However, depending on whether BE is compared to NE or NC, the direction and/or magnitude of disruption in these pathways may appear very different, a finding which may have implications for targeted therapy for BE metaplasia.
The use of whole tissue biopsies is a limitation in this study, as an admixture of epithelium with inflammatory and stromal cells could affect the genes identified. However, stromal cells can have a significant impact on adjacent epithelia and, considering the implicated role of TGF-β signaling in the development of BE, epithelial-mesenchymal interactions are likely to be an important contributory factor to the development of BE , .
In conclusion, the results of the present study extend previous findings indicating that BE shares phenotypic elements with normal epithelia of the squamous esophagus (NE) and the gastric cardia (NC). The analyses of BE, NE, and NC tissues from the same patient provides a robust picture of differential gene expression of BE compared to other studies. The results of this study provide a rich source of data for the analysis of specific genes and pathways in relation to the development of BE metaplasia and for the identification of potential new biomarkers and/or treatment targets.
Demographic and risk factor information for BE cases.
Pathology findings for cardia, BE, and squamous esophagus biopsies from BE cases.
Summary of 1645 probe sets in BE versus NE (P<1.12E-06 with 2-fold or greater change).
Summary of 782 probe sets in BE versus NC (P<1.12E-06 with 2-fold or greater change).
Microarray-based and qRT-PCR-based expression results for genes selected for validation in discovery samples and independent replication in new samples.
Common genes significantly differentially expressed (P<1.12E-06 with 2-fold or greater change in expression) in BE vs NE and BE vs NC tissues comparisons.
Conceived and designed the experiments: PLH SMD CCA PEY RDA BDC PRT. Performed the experiments: NH HS CW LW KMJ. Analyzed the data: PLH RM RMP KMJ. Contributed reagents/materials/analysis tools: PLH BG CG CD NH HS CW LW RMP MR KMJ. Wrote the manuscript: PLH. Tissue and data collection and management: CD BG CG.
- 1. Parkin DM, Bray F, Ferlay J, Pisani P (2005) Global cancer statistics, 2002. CA Cancer J Clin 55: 74–108.
- 2. Ferlay J, Shin HR, Bray F, Forman D, Mathers C, et al. (2010) Estimates of worldwide burden of cancer in 2008: GLOBOCAN 2008. Int J Cancer 127: 2893–2917.
- 3. (2010) SEER*Stat Database: Incidence -SEER9 Regs Research Data, Nov 2009 Sub (1973–2007) Attributes-Total U.S., 1969–2007 Counties, NCI, DCCPS, Surveillance Research Program April 2010 ed: Surveillance, Epidemiology, and End Results (SEER) Program
- 4. Cook MB, Chow WH, Devesa SS (2009) Oesophageal cancer incidence in the United States by race, sex, and histologic type, 1977–2005. Br J Cancer 101: 855–859.
- 5. Reid BJ, Kostadinov R, Maley CC (2011) New strategies in Barrett's esophagus: integrating clonal evolutionary theory with clinical management. Clin Cancer Res 17: 3512–3519.
- 6. Conio M, Cameron AJ, Romero Y, Branch CD, Schleck CD, et al. (2001) Secular trends in the epidemiology and outcome of Barrett's oesophagus in Olmsted County, Minnesota. Gut 48: 304–309.
- 7. Solaymani-Dodaran M, Logan RF, West J, Card T, Coupland C (2004) Risk of oesophageal cancer in Barrett's oesophagus and gastro-oesophageal reflux. Gut 53: 1070–1074.
- 8. Cook MB, Wild CP, Everett SM, Hardie LJ, Bani-Hani KE, et al. (2007) Risk of mortality and cancer incidence in Barrett's esophagus. Cancer Epidemiol Biomarkers Prev 16: 2090–2096.
- 9. Hvid-Jensen F, Pedersen L, Drewes AM, Sorensen HT, Funch-Jensen P (2011) Incidence of adenocarcinoma among patients with Barrett's esophagus. N Engl J Med 365: 1375–1383.
- 10. Edelstein ZR, Farrow DC, Bronner MP, Rosen SN, Vaughan TL (2007) Central adiposity and risk of Barrett's esophagus. Gastroenterology 133: 403–411.
- 11. Edelstein ZR, Bronner MP, Rosen SN, Vaughan TL (2009) Risk factors for Barrett's esophagus among patients with gastroesophageal reflux disease: a community clinic-based case-control study. Am J Gastroenterol 104: 834–842.
- 12. Lagergren J, Bergstrom R, Lindgren A, Nyren O (1999) Symptomatic gastroesophageal reflux as a risk factor for esophageal adenocarcinoma. N Engl J Med 340: 825–831.
- 13. Ronkainen J, Aro P, Storskrubb T, Johansson SE, Lind T, et al. (2005) Prevalence of Barrett's esophagus in the general population: an endoscopic study. Gastroenterology 129: 1825–1831.
- 14. Hayeck TJ, Kong CY, Spechler SJ, Gazelle GS, Hur C (2010) The prevalence of Barrett's esophagus in the US: estimates from a simulation model confirmed by SEER data. Dis Esophagus 23: 451–457.
- 15. van Soest EM, Dieleman JP, Siersema PD, Sturkenboom MC, Kuipers EJ (2005) Increasing incidence of Barrett's oesophagus in the general population. Gut 54: 1062–1066.
- 16. Coleman HG, Bhat S, Murray LJ, McManus D, Gavin AT, et al. (2011) Increasing incidence of Barrett's oesophagus: a population-based study. Eur J Epidemiol 26: 739–745.
- 17. Gerson LB, Shetler K, Triadafilopoulos G (2002) Prevalence of Barrett's Esophagus in asymptomatic individuals. Gastroenterology 123: 461–467.
- 18. Pera M, Manterola C, Vidal O, Grande L (2005) Epidemiology of esophageal adenocarcinoma. Journal of Surgical Oncology 92: 151–159.
- 19. Levine DS, Rubin CE, Reid BJ, Haggitt RC (1989) Specialized Metaplastic Columnar Epithelium in Barretts Esophagus - a Comparative Transmission Electron-Microscopic Study. Laboratory Investigation 60: 418–432.
- 20. Salo JA, Kivilaakso EO, Kiviluoto TA, Virtanen IO (1996) Cytokeratin profile suggests metaplastic epithelial transformation in Barrett's oesophagus. Annals of Medicine 28: 305–309.
- 21. Barrett MT, Yeung KY, Ruzzo WL, Hsu L, Blount PL, et al. (2002) Transcriptional analyses of Barrett's metaplasia and normal upper GI mucosae. Neoplasia 4: 121–128.
- 22. van Baal JW, Milano F, Rygiel AM, Bergman JJ, Rosmolen WD, et al. (2005) A comparative analysis by SAGE of gene expression profiles of Barrett's esophagus, normal squamous esophagus, and gastric cardia. Gastroenterology 129: 1274–1281.
- 23. Greenawalt DM, Duong C, Smyth GK, Ciavarella ML, Thompson NJ, et al. (2007) Gene expression profiling of esophageal cancer: Comparative analysis of Barrett's esophagus, adenocarcinoma, and squamous cell carcinoma. International Journal of Cancer 120: 1914–1921.
- 24. Hao Y, Triadafilopoulos G, Sahbaie P, Young HS, Omary MB, et al. (2006) Gene expression profiling reveals stromal genes expressed in common between Barrett's esophagus and adenocarcinoma. Gastroenterology 131: 925–933.
- 25. Stairs DB, Nakagawa H, Klein-Szanto A, Mitchell SD, Silberg DG, et al. (2008) Cdx1 and c-Myc Foster the Initiation of Transdifferentiation of the Normal Esophageal Squamous Epithelium toward Barrett's Esophagus. Plos One 3.
- 26. Wang S, Zhan M, Yin J, Abraham JM, Mori Y, et al. (2006) Transcriptional profiling suggests that Barrett's metaplasia is an early intermediate stage in esophageal adenocarcinogenesis. Oncogene 25: 3346–3356.
- 27. Manterola C, Munoz S, Grande L, Bustos L (2002) Initial validation of a questionnaire for detecting gastroesophageal reflux disease in epidemiological settings. J Clin Epidemiol 55: 1041–1045.
- 28. Su H, Hu N, Yang HH, Wang CY, Takikita M, et al. (2011) Global Gene Expression Profiling and Validation in Esophageal Squamous Cell Carcinoma and Its Association with Clinical Phenotypes. Clinical Cancer Research 17: 2955–2966.
- 29. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, et al. (2000) Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 25: 25–29.
- 30. Rubio CA, Lorinc E (2011) Lysozyme is up-regulated in Barrett's mucosa. Histopathology 58: 796–799.
- 31. van Baal JW, Bozikas A, Pronk R, Ten Kate FJ, Milano F, et al. (2008) Cytokeratin and CDX-2 expression in Barrett's esophagus. Scand J Gastroenterol 43: 132–140.
- 32. Fitzgerald RC (2006) Molecular basis of Barrett's oesophagus and oesophageal adenocarcinoma. Gut 55: 1810–1820.
- 33. Lord RVN, Brabender J, Wickramasinghe K, DeMeester SR, Holscher A, et al. (2005) Increased CDX2 and decreased PITX1 homeobox gene expression in Barrett's esophagus and Barrett's-associated adenocarcinoma. Surgery 138: 924–931.
- 34. Vallbohmer D, DeMeester SR, Peters JH, Oh DS, Kuramochi H, et al. (2006) Cdx-2 expression in squamous and metaplastic columnar epithelia of the esophagus. Diseases of the Esophagus 19: 260–266.
- 35. Wang J, Qin R, Ma Y, Wu H, Peters H, et al. (2009) Differential gene expression in normal esophagus and Barrett's esophagus. J Gastroenterol 44: 897–911.
- 36. van Baal JWPM, Bozikas A, Pronk R, Ten Kate FJW, Milano F, et al. (2008) Cytokeratin and CDX-2 expression in Barrett's esophagus. Scandinavian Journal of Gastroenterology 43: 132–140.
- 37. Eda A, Osawa H, Satoh K, Yanaka I, Kihira K, et al. (2003) Aberrant expression of CDX2 in Barrett's epithelium and inflammatory esophageal mucosa. J Gastroenterol 38: 14–22.
- 38. Wang X, Ouyang H, Yamamoto Y, Kumar PA, Wei TS, et al. (2011) Residual embryonic cells as precursors of a Barrett's-like metaplasia. Cell 145: 1023–1035.
- 39. Daniely Y, Liao G, Dixon D, Linnoila RI, Lori A, et al. (2004) Critical role of p63 in the development of a normal esophageal and tracheobronchial epithelium. Am J Physiol Cell Physiol 287: C171–181.
- 40. Lefort K, Dotto GP (2011) p63 and epithelial metaplasia: a gutsy choice. Cell 145: 1003–1005.
- 41. di Pietro M, Lao-Sirieix P, Boyle S, Cassidy A, Saadi A, et al. (2012) Epigenetically Regulated Hoxb Cluster Genes Have a Functional Role in Barrett's Esophagus Development. Gastroenterology 142: S127–S127.
- 42. Schier S, Wright NA (2005) Stem cell relationships and the origin of gastrointestinal cancer. Oncology 69 Suppl 1: 9–13.
- 43. Zhu L, Gibson P, Currle DS, Tong Y, Richardson RJ, et al. (2009) Prominin 1 marks intestinal stem cells that are susceptible to neoplastic transformation. Nature 457: 603–607.
- 44. Sarkar A, Hochedlinger K (2013) The sox family of transcription factors: versatile regulators of stem and progenitor cell fate. Cell Stem Cell 12: 15–30.
- 45. Mendelson J, Song S, Li Y, Maru DM, Mishra B, et al. (2011) Dysfunctional transforming growth factor-beta signaling with constitutively active Notch signaling in Barrett's esophageal adenocarcinoma. Cancer 117: 3691–3702.
- 46. Que J, Okubo T, Goldenring JR, Nam KT, Kurotani R, et al. (2007) Multiple dose-dependent roles for Sox2 in the patterning and differentiation of anterior foregut endoderm. Development 134: 2521–2531.
- 47. Clemons NJ, Wang DH, Croagh D, Tikoo A, Fennell CM, et al. (2012) Sox9 drives columnar differentiation of esophageal squamous epithelium: a possible role in the pathogenesis of Barrett's esophagus. Am J Physiol Gastrointest Liver Physiol 303: G1335–1346.
- 48. Asonuma S, Imatani A, Abe Y, Koike T, Asano N, et al. (2008) The down-regulation of a HMG box gene Sox2 by exposure to acid and bile induces the progression of Barrett's esophagus. Gastroenterology 134: A437–A437.
- 49. Westerveld BD, Pals G, Bosma A, Defize J, Pronk JC, et al. (1987) Gastric proteases in Barrett's esophagus. Gastroenterology 93: 774–778.
- 50. Agarwal R, Jin Z, Yang J, Mori Y, Song JH, et al. (2012) Epigenomic program of Barrett's-associated neoplastic progression reveals possible involvement of insulin signaling pathways. Endocr Relat Cancer 19: L5–9.
- 51. Onwuegbusi BA, Aitchison A, Chin SF, Kranjac T, Mills I, et al. (2006) Impaired transforming growth factor beta signalling in Barrett's carcinogenesis due to frequent SMAD4 inactivation. Gut 55: 764–774.
- 52. Ohashi S, Natsuizaka M, Naganuma S, Kagawa S, Kimura S, et al. (2011) A NOTCH3-mediated squamous cell differentiation program limits expansion of EMT-competent cells that express the ZEB transcription factors. Cancer Res 71: 6836–6847.
- 53. Bhowmick NA, Chytil A, Plieth D, Gorska AE, Dumont N, et al. (2004) TGF-beta signaling in fibroblasts modulates the oncogenic potential of adjacent epithelia. Science 303: 848–851.