Oral and oropharyngeal squamous cell carcinomas (OSCC) are among the most common cancers worldwide, with approximately 60% 5-yr survival rate. To identify potential markers for disease progression, we used Affymetrix U133 plus 2.0 arrays to examine the gene expression profiles of 167 primary tumor samples from OSCC patients, 58 uninvolved oral mucosae from OSCC patients and 45 normal oral mucosae from patients without oral cancer, all enrolled at one of the three University of Washington-affiliated medical centers between 2003 to 2008. We found 2,596 probe sets differentially expressed between 167 tumor samples and 45 normal samples. Among 2,596 probe sets, 71 were significantly and consistently up- or down-regulated in the comparison between normal samples and uninvolved oral samples and between uninvolved oral samples and tumor samples. Cox regression analyses showed that 20 of the 71 probe sets were significantly associated with progression-free survival. The risk score for each patient was calculated from coefficients of a Cox model incorporating these 20 probe sets. The hazard ratio (HR) associated with each unit change in the risk score adjusting for age, gender, tumor stage, and high-risk HPV status was 2.7 (95% CI: 2.0–3.8, p = 8.8E-10). The risk scores in an independent dataset of 74 OSCC patients from the MD Anderson Cancer Center was also significantly associated with progression-free survival independent of age, gender, and tumor stage (HR 1.6, 95% CI: 1.1–2.2, p = 0.008). Gene Set Enrichment Analysis showed that the most prominent biological pathway represented by the 71 probe sets was the Integrin cell surface interactions pathway. In conclusion, we identified 71 probe sets in which dysregulation occurred in both uninvolved oral mucosal and cancer samples. Dysregulation of 20 of the 71 probe sets was associated with progression-free survival and was validated in an independent dataset.
Citation: Lohavanichbutr P, Houck J, Doody DR, Wang P, Mendez E, et al. (2012) Gene Expression in Uninvolved Oral Mucosa of OSCC Patients Facilitates Identification of Markers Predictive of OSCC Outcomes. PLoS ONE 7(9): e46575. doi:10.1371/journal.pone.0046575
Editor: Qing-Yi Wei, The University of Texas MD Anderson Cancer Center, United States of America
Received: May 25, 2012; Accepted: August 31, 2012; Published: September 28, 2012
Copyright: © Lohavanichbutr et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: The study was supported by grants from the National Institutes of Health, National Cancer Institute (NIH/NCI 5R01 CA095419), and by institutional funds from the Fred Hutchinson Cancer Research Center. The study at the MD Anderson Cancer Center was supported by Cancer Center Support Core Grant CA16672 (Affymetrix Microarray Core Facility; the Bioinformatics Core), Specialized Program of Research Excellence in Head and Neck Cancer Grant P50 CA97007 from the National Cancer Institute, “Clinician Investigator Program in Translational Research” K12 CA88084, NIH Loan Repayment Program, Clinical Research Program 2 L30 CA117652-02A1, and THANC Foundation Young Investigator Award in Head and Neck Cancer. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: Dr. Neal Futran is a consultant with the Stryker conmpany, though it has no bearing on this manuscript; and Dr. Eduardo Mendez has a consulting/honorarium agreement with Intuitive Surgical, Inc. This does not alter the authors' adherence to all the PLOS ONE policies on sharing data and materials.
Oral and oropharyngeal squamous cell carcinomas (OSCC) are among the most common cancers, with approximately 400,000 new cases and 200,000 deaths worldwide in 2008 (http://www-dep.iarc.fr/). Approximately 40,000 new cases and almost 8,000 deaths from OSCC are estimated to occur in the United States in 2012 . The overall 5-yr survival rate of OSCC patients is approximately 60% . The prognosis of OSCC patients is adversely influenced by the development of recurrent cancer, which occurs in 5–50% of patients –. Better prediction of which patients are most at risk for recurrence or disease progression is needed. Several factors have been found to be predictive of the development of recurrent OSCC, including tumor stage, tumor depth, nodal status, lymphovascular or perineural invasion, positive surgical margins, and extracapsular spread –. However, further improvement in the prediction of risk for recurrence or disease progression could help physicians identify patients who need more aggressive treatment or more frequent follow-up. Genes that play roles in the progression of normal tissue to cancer may serve as markers to predict recurrence or disease progression of OSCC patients.
Based on the field cancerization concept proposed by Slaughter et al in 1953 , the changes in the mucosa of the entire upper aerodigestive tract may be the result of long term exposure to carcinogens and may explain the occurrence of local recurrence or second primary disease. The field cancerization concept was supported by subsequent studies which found abnormal histologic and molecular features in the uninvolved, clinically normal, oral mucosae of OSCC patients –. A number of studies have shown alterations at a molecular level, such as loss of heterozygosity (LOH) at 3p, 9p, and 17p , gain of chromosome region 20q13.33, 7p22.2-pter, 11p15.5-pter, and 16p13.3-pter , and p53 mutation ,  in the uninvolved oral mucosae, either adjacent to or distant from the tumor of OSCC patients. There is also evidence of increased expression of some genes such as epidermal growth factor receptors , cyclin D1 and mindbomb E3 ubiquitin protein ligase 1 , and cytokeratins  in the uninvolved oral mucosae of OSCC patients.
In addition to the studies of molecular changes in the uninvolved oral mucosae of OSCC patients, there have been hundreds of studies reporting on the molecular changes in the oral cancer tissues, either at an individual gene level or a genome-wide level. For example, there have been reports of LOH on Chromosome 1p31, 3p25-p26, 4q25, 5q21-22, 8p21-23, 9p21-22, 10 at D10S202 and DD10S217, 11q, 14q, 17p, 20q12-13.1, and 21q11.1 in OSCC samples –. Studies using array comparative genomic hybridization (CGH) further expand the knowledge of gains and losses of chromosomal regions across the genome. Gains at chromosomal regions 1q23, 3q23, 3q26, 5p15.2, 5p15.33, 7p11, 7p12.3-13, 7p22.3, 7q21.2, 7q35, 8q21.1-24.3, 8q24, 9q34.3, 11q13, 14q23,16p13.3, 19q12, 19q13, 20q13, and losses at 2p15, 3p21-3p12, 3p22, 3p14, 4q34.3, 4q35.2, 8p32,10p12, 16q23.2, 18q21-q23 in OSCC samples have been detected using array CGH –. Several researchers have used proteomics to identify diagnostic – or prognostic – biomarkers for OSCC; however, these studies were either small or had no external validation.
With the advent of a high-throughput microarray technology, investigation of gene expression on a genome-wide level has become feasible and routine. Microarray studies usually result in a list of many genes; further definition of the functions of or pathways involving these genes could provide additional knowledge about OSCC. A gene expression profile, if validated in multiple, well-designed, independent studies, could also be developed as a useful clinical test as demonstrated for breast cancer . Prior studies, including ours, have reported many genes differentially expressed between OSCC and normal tissue –. However, these studies compared OSCC to either uninvolved oral mucosae of OSCC patients or normal oral mucosae from people without cancer. To the best of our knowledge, no study has compared genome-wide gene expression profiles of normal oral mucosae from non-cancerous patients, uninvolved oral mucosae from OSCC patients, and OSCC samples in the same study. As proposed by Braakhuis , oral carcinogenesis can be viewed as a multistep process; from normal tissue to a patch, which progresses to a field, and finally to an invasive carcinoma with additional genetic alterations in each step. Identifying the changes in gene expression in these steps may help advance our understanding of the disease progression process and lead to discovery of markers to predict disease progression. The purpose of the current study is to identify genes that are dysregulated in uninvolved oral mucosae from OSCC patients compared with normal oral mucosae from patients without cancer, and show further dysregulation in cancer tissue. We believe that these genes may play an important role in the progression of OSCC, and we tested our hypothesis by determining whether the expressions of dysregulated genes are associated with disease progression or OSCC-specific mortality.
This study was conducted with written informed consent of the study participants and the approval of the Institutional Review Boards of the Fred Hutchinson Cancer Research Center, the Veterans Affairs Puget Sound Health Care System, and the MD Anderson Cancer Center.
Eligible cases are patients with first primary OSCC treated at one of the three University of Washington-affiliated medical centers in Seattle, WA from December 2003 to March 2010. Eligible controls are patients without OSCC who had oral surgery, such as tonsillectomy or uvulopalatopharyngoplasty, at the same institutions and during the same time period in which the OSCC cases were treated.
Data and Tissue Collection
Each patient was interviewed using a structured questionnaire regarding his or her demographic, medical, and lifestyle history, including tobacco and alcohol use. Data on tumor characteristics were obtained from medical records. The data on tumor recurrence were obtained from telephone interview and confirmed by medical record abstraction if patients reported having a tumor recurrence. If patients were not followed at one of the three University of Washington-affiliated medical centers, we attempted to obtain medical record from their physicians. Vital status was obtained from Social Security Death Index (SSDI) and Fred Hutchinson Cancer Research Center's Cancer Surveillance System (CSS), which is part of the Surveillance, Epidemiology, and End Results (SEER) program of the National Cancer Institute. Death certificates and medical records, if available, were reviewed by otolaryngologists to determine the cause of death. The last search for vital status of all patients was in September 2011.
The tumor samples and uninvolved oral mucosa were obtained from OSCC patients at the time of resection prior to chemo/radiation therapy, if any. The uninvolved oral mucosa was collected either from the opposite side of the tumor or from the same side but far from the tumor margin. From controls we obtained normal mucosa from buccal, uvula or anterior tonsillar pillar, the latter with effort to avoid surrounding lymphoid tissues. Between December 2003 and September 2008, the tissue samples were soaked in RNAlater™ immediately after surgical removal and transferred to long term storage at −80°C prior to use. After September 2008, the tissue samples were flash frozen in liquid nitrogen immediately after surgical removal.
From December 2003 to March 2010, we recruited 291 cases and 58 controls. Gene expression data from tumor samples of 167 cases and normal oral mucosae of 45 controls were generated in our previous study  using samples treated with RNAlater™. For comparability, in the current study we only used uninvolved oral mucosa samples that had been treated with RNAlater™. As we had limited funds to measure the gene expression in uninvolved oral mucosa, we could only examine a subset of these samples. We suspected that the gene expression of uninvolved oral samples from patients with and without local recurrence may be different. Uninvolved oral mucosa of patients who later developed local recurrence may be more likely to contain genes that are associated with disease progression, and thus oversampling of uninvolved oral mucosa from patients with local recurrence might enhance the opportunity to detect genes that are associated with disease progression. We therefore included uninvolved oral samples from all patients who had recurrent/second primary OSCC as of March 2010 (n = 29). We then used stratified sampling to select another 29 OSCC patients who had not had a recurrent/second primary OSCC, but otherwise had a similar follow-up time distribution as the 29 patients with recurrent/second primary OSCC. Forty-nine of the 58 selected OSCC patients also provided tumor samples that had already been processed in the previous study (part of the 167 tumor samples).
The DNA and RNA from each specimen were simultaneously extracted using the TRIzol method (Invitrogen, Carlsbad, CA). RNA was further purified using RNeasy mini kit (Qiagen, Valencia, California) and then converted to double-stranded complementary DNA (cDNA) using a GeneChip Expression 3′-Amplification One-cycle DNA Synthesis Kit (Affymetrix). The cRNA was produced from cDNA and was hybridized to a U133 2.0 Plus GeneChip (Affymetrix) as previously described . HPV DNA was tested using a nested PCR based protocol and confirmed by LINEAR ARRAY HPV Genotyping Test (Roche, Indianapolis, IN) under a research use only agreement as described in Lohavanichbutr et al .
For quality control, we re-extracted and processed two tumor samples, whose genome-wide gene expression had been assessed in our previous study, along with the uninvolved oral samples. We used Pearson correlation to determine whether the previous and new gene expression were comparable. We found a good correlation between samples previously processed and samples processed along with uninvolved oral samples. The Pearson's correlation coefficients for all probe sets of the two pair were 0.96 and 0.97.
The quality of the hybridized arrays was evaluated using the “affyQCReport” and “affyPLM” software in the Bioconductor package (http://bioconductor.org/). This included evaluation of RNA degradation and detection for possible outlier array. We examined 58 arrays of uninvolved oral samples separately and also together with 212 arrays (167 from tumor samples and 45 from normal oral samples) previously processed in order to detect a batch effect. All 58 arrays for uninvolved oral samples passed quality control and no batch effect was observed.
Assessment of Differential Gene Expression.
All 270 CEL files were normalized using the RMA algorithm in Partek® Genomics Suite™ software. Figure 1 shows steps of statistical analyses. We first identified “OSCC-related genes” by comparing gene expression profiles of normal oral samples from 45 controls to tumor samples from 167 OSCC cases using ANOVA implemented in Partek® Genomics Suite™ software, adjusting for age (continuous variable), sex (male vs. female), cigarette-smoking (current smoker vs. never/former smoker), alcohol use (current vs. never/former alcohol use) and HPV status (high risk vs. negative/low risk). We set the false discovery rate at 0.05 and required at least a 2-fold difference in gene expression as criteria for differential expression. The purpose of this first step is to reduce the number of genes for further comparison. The next step is to identify genes, among the “OSCC-related genes”, that show dysregulation in a field of carcinogenic exposure (uninvolved oral mucosa) and increased level of dysregulation in cancer stage. To identify these genes, we used linear regression to compare the gene expression level between 45 normal oral samples and 58 uninvolved oral samples, and to compare the gene expression level of 58 uninvolved oral samples to that of 167 tumor samples. We used three criteria to select the gene list: 1) the Bonferroni adjusted p-value must be less than 0.05 in both comparisons; 2) the magnitude of the difference in expression level must be greater than one standard deviation of the expression in the uninvolved oral samples; 3) the direction of the coefficients of each gene must be the same in both comparisons, i.e. the coefficients must be positive in both comparisons for up-regulated genes, and must be negative for both comparisons for down-regulated genes. The analyses were performed using STATA 11.1 (StataCorp, College Station, TX).
Step 1, compared normal mucosae from non-cancerous patients to cancer tissues from OSCC patients to identify genes associated with OSCC and to reduce the number of genes for further comparison. Step 2 had two comparisons: 1) compared normal mucosae from non-cancerous patients to uninvolved mucosae from OSCC patients, 2) compared uninvolved mucosae from OSCC patients to cancer tissues. We then selected the genes that overlapped between the two comparisons and passed three selection criteria (i. the Bonferroni adjusted p-value must be less than 0.05 in both comparisons; ii. the magnitude of the difference in expression level must be greater than one standard deviation of the expression in the uninvolved oral samples; iii. the direction of the coefficients of each gene must be the same in both comparisons, i.e. the coefficients must be positive in both comparisons for up-regulated genes, and must be negative for both comparisons for down-regulated genes. Step 3 among the genes selected from step 2, we identified those that were associated with progression-free survival. In Step 4, we validated the genes identified in Step 3 using an independent external dataset.
Evaluation of Gene Expression Profile in relation to Disease Outcome.
To determine whether the selected genes are associated with disease progression or death due to OSCC, we performed a Cox regression with robust standard error on each selected gene adjusting for age, gender, high-risk HPV status, and tumor stage (stage I/II vs. stage III/IV). In this study, disease progression is defined as a persistence or recurrence of squamous cell carcinoma in oral cavity, oropharynx, or in the head and neck area. Patients who were alive as of September 2011 or patients who died with other causes were censored at the time of last known disease status either at the last follow-up interview or at the last clinic visit. We used a Bonferroni adjusted p-value of 0.05 as a criterion to select genes that are associated with disease progression/OSCC-related death. We then built a Cox regression model with the genes associated with disease progression/OSCC-related death and used coefficients from this model to calculate a risk score for each patient.
Validation using External Dataset.
An independent dataset of 74 frozen tumor samples from OSCC patients treated at the MD Anderson Cancer Center (MDACC) was used for validation. The 74 tumor samples were hybridized at the MDACC to the same type of Affymetrix array as used in our study. We normalized the CEL files using RMA algorithm in Partek® Genomics Suite™ software. A risk score for each patient was calculated using coefficients from a Cox regression model from our study. We then investigated the association between risk score and disease progression/OSCC-related death using a Cox regression analysis adjusting for age (continuous variable), gender (male vs. female), and tumor stage (I/II vs, III/IV). We compared the model with tumor stage alone and tumor stage plus risk score using a log likelihood ratio test. The patients were divided into three equal size groups based on the risk scores (low, medium, and high). We then used a Kaplan-Meier method to compare progression free survival for patients in each group.
Validation of gene expression using qRT-PCR.
To affirm our findings based on the Affymetrix array, we conducted qRT-PCR to assess the gene expression of the top five genes (OCC1, DSE, ACTN1, RRAS2, and ITGA3). In brief, qRT-PCR was performed using 7.5 ng purified total RNA from each of 48 samples (a subset of 270 samples) using the QuantiTect SYBR Green RT-PCR kit (Qiagen, Valencia, CA) and bioinformatically validated QuantiTect primers (Qiagen, Valencia, CA). Each sample was run in triplicate on a 7900 HT Sequence Detection System (ABI, Foster City, CA). The cycling conditions were: 30 min, 50°C; 15 min, 95°C; 40 cycles of 15 sec at 94°C, 30 sec at 55°C, and 30 sec at 72°C. Beta-Actin (ACTB) was used as a reference gene for normalization. We used Pearson's correlation to determine the correlation between the Affymetrix gene expression values and the Ct (cycle threshold) values from qRT-PCR.
We used Gene Set Enrichment Analysis (GSEA)  to investigate pathways of the genes dysregulated in uninvolved oral samples and tumor samples. GSEA computes the overlap between genes of interest and the gene sets in the Molecular Signatures Database (MSigDB). The gene sets of the pathways in the MSigDB are derived from three pathway databases: the Biocarta pathway database (www.biocarta.com), the KEGG pathway database (www.genome.jp/keg), and the Reactome pathway database (www.reactome.org). The p-value indicating the significance of the overlap was calculated based on the hypergeometric distribution (identical to the corresponding one-tailed version of Fisher's exact test). We used 0.05 for a p-value cutoff. We also used GSEA to compare the pathways of the genes that we found in this study to the 131 genes that we previously reported to be associated with survival of OSCC patients .
Selected characteristics of the study participants are showed in Table 1. Compared to controls, OSCC patients were more likely to be older, white, and current smokers. Approximately two-thirds of the cases had an advanced stage tumor.
In the ANOVA analysis to identify “OSCC-related genes”, we found 2,596 probe sets differentially expressed between 167 tumor samples and 45 normal oral samples from controls, using the criteria of a FDR of 0.05 and at least a two-fold difference in the expression level. The result of linear regression comparing gene expression level of 2,596 probe sets between 45 normal oral samples from controls and 58 uninvolved oral samples from OSCC cases, and between 58 uninvolved oral samples and 167 tumor samples (both from OSCC cases), showed that 60 probe sets were significantly and consistently up-regulated and 11 probe sets were significantly and consistently down-regulated in both comparisons, using the three criteria described in the Methods section. The list of the 71 probe sets is presented in Table S1.
We excluded nine of 167 cases who died within 30 days of surgery (due to complication of surgery) or who had been followed for less than 30 days. Among 158 patients included in the survival analyses, 70 had disease progression/OSCC-related death. The follow-up time for patients without progression/OSCC-related death ranged from 3.6 to 83.9 months, with a median follow-up time of 43.3 months. The result of Cox regression analyses of each of the 71 probe sets adjusting for age, sex, tumor stage, and high-risk HPV status showed 20 of 71 probe sets significantly associated with disease progression/OSCC-related death with a p-value<0.0007 (Table 2). We then built a prediction model based on the Cox regression model incorporating the 20 probe sets. A coefficient of each probe set (Table 2) was multiplied with the expression of that probe set and summed to be a risk score for each patient. The risk score ranged from 11.0 to 18.6 (mean 14.9, standard deviation 1.2). In our study, each unit increase in risk score was associated with a hazard ratio of 2.7 for disease progression/OSCC-related death after adjusting for age, gender, tumor stage, and high-risk HPV status (95% CI: 2.0–3.8, p-value<0.001).
Analyses of 20 probe sets in an independent dataset
Among 74 OSCC patients from the MDACC, five patients had follow-up time less than 30 days and were excluded from the survival analyses. The age range of the patients was 22 to 84 years, with an average age of 58.2 years. The majority of patients had stage III or IV disease (73.9%). Twenty-five of 69 patients had disease progression/OSCC-related death. The follow-up time for 69 patients ranged from one month to 92.7 months. The median follow-up time for patients without events was 22.7 months (range 1.6 to 92.7 months). A risk score for each patient was calculated using coefficients of the prediction model from the University of Washington data and the expression values from each MDACC patients as described above. The risk score ranged from 14.8 to 20.4 (mean 17.8, standard deviation 1.4). The crude hazard ratio (HR) for each unit increase in the risk score was 1.63 (95% CI: 1.16–2.29, p-value 0.004). The hazard ratio associated with a risk score after adjusting for age, gender, and tumor stage was 1.59 (95% CI: 1.13–2.23, p-value 0.008). The hazard ratio for each variable in the model is shown in Table 3. Data on HPV status were not available for the MDACC: therefore we could not adjust for HPV status. Higher tumor stage (stage III/IV) was associated with a higher risk of disease progression/OSCC-specific mortality; however, it did not reach statistical significance in the MDACC (crude HR 2.3, 95% CI: 0.79–6.75, p-value 0.13, HR adjusted for age, gender, and risk score 1.65, 95%CI: 0.53–5.19, p-value 0.39). The prediction model incorporating the risk score and tumor stage provided a better fit to the data than the model with tumor stage alone (log likelihood ratio test p-value = 0.006); however, it was not better than the model with risk score alone (log likelihood ratio test p-value = 0.3). The HR adjusted for age, gender, and tumor stage for patients with medium risk score and high risk score compared to patients with low risk score was 1.79 (95% CI: 0.54–5.91, p-value 0.34) and 3.67 (95% CI:1.2–11.2, p-value 0.02), respectively. Kaplan-Meier curves provided additional illustration of the progression-free survival of patients in each group (Figure 2). Patients with high risk score had poorer progression free survival than patients with medium and low risk score, with a Log-rank p-value of 0.019.
Kaplan-Meier curves comparing progression free survival of 69 OSCC patients in the MDACC dataset with low, medium, and high risk score.
Correlation between Affymetrix expression values and qRT-PCR
We found good correlations between Affymetrix expression values and qRT-PCR for the five genes we tested. The Pearson's correlations for OCC1, DSE, ATCN1, RRAS2, and ITGA3 were 0.90, 0.73, 0.80, 0.67, and 0.87, respectively. The p-values for the correlations were <0.0001 for all five genes.
Results of GSEA of the 71 probe sets show that the most prominent biologic pathway belongs to the integrin cell surface interactions pathway. The complete list of the pathways is presented in Table S2. When we compared the 71 probe sets to the 131 probe sets that we previously reported to be associated with survival of OSCC patients , we found eight genes (KLF7, OSMR, PDPN, PADI1, CLEC3B, COL7A1, COL27A1, and NETO2) that overlapped between the two gene lists. GSEA showed three pathways (Integrin cell surface interactions, ECM-receptor interaction, and focal adhesion) common to both gene lists.
OSCC has a high mortality rate, with a 5-year survival rate of 30–50% for late stage cancer (www.cancer.org). Fortunately, the 5-year survival rate exceeds 80% in early stage cancer. Thus, early detection or prevention of disease progression may help improve survival of OSCC patients. Identifying the key genes that play an important role in the progression of the carcinogenesis process may have potential clinical implications, e.g., as targets for OSCC prevention or treatment, or as biomarkers for early detection or prediction of disease progression. Our study is unique in that we collected not only normal oral mucosae from controls and tumor samples from OSCC, but we also collected uninvolved oral samples from OSCC patients. This design provided an opportunity to study the effect of field cancerization by comparing gene expression between normal oral mucosae from controls and uninvolved oral mucosae from OSCC patients, a comparison which may help identify genes that play a role in a very early stage of carcinogenesis. In addition, by comparing gene expression of uninvolved oral samples to that of tumor samples, we can further select genes that not only play a role at an early stage but that also play a role in the later stage of neoplastic development. We believe that the genes found through both of these comparisons are important in the progression of normal mucosa to cancer.
A limitation of this study is that the uninvolved oral samples from OSCC patients were not processed at the same time as normal oral samples and tumor samples: thus a batch effect is a potential issue to consider. We attempted to investigate and minimize the batch effect in several ways. First, we re-processed some tumor samples along with uninvolved oral samples and compared the gene expression to that of the previously processed tumor samples from the same patients. The results showed a good correlation of the gene expression between the re-processed samples and the samples previously processed. Second, we examined the quality of all arrays together to detect batch effects, and we then normalized all arrays together. Third, we performed a first step analysis to compare gene expression between tumor samples and normal oral samples from controls. The benefit from this first step analysis was that it minimized the penalty from multiple comparisons for the next step by reducing the number of genes to be further studied from more than 50,000 probe sets to approximately 2,500 “OSCC-related” probe sets. In addition, since tumor samples and normal oral samples were processed simultaneously with each batch containing both tumor sample and normal oral sample, the identification of these 2,596 probe sets from which the 20 final probe sets were determined were not affected by batch effect. It is possible that, by oversampling the uninvolved oral mucosae from patients who later developed local recurrence, we may have detected more differentially expressed genes between normal mucosa and uninvolved mucosa than had we selected the latter at random. However, a separate analysis to investigate whether the gene expression of uninvolved mucosa from patients with local recurrence differ from that of patients without local recurrence showed no significant difference in gene expression between the two groups (data not shown).
We identified 71 probe sets that showed dysregulation in the uninvolved oral samples and that showed even higher level of dysregulation in tumor samples. As mentioned earlier, one potential clinical implication of these genes is to predict disease progression. Thus we further tested whether some of the 71 probe sets were associated with disease progression or death due to OSCC and built a prediction model based on these genes. The results that 20 genes were associated with disease progression/OSCC-related death and can be used to predict disease progression independent of age, sex, tumor staging, and high-risk HPV status support our hypothesis.
We validated our results using data from 74 OSCC patients recruited at the MDACC. In the MDACC cohort, we found a significant association between disease progression/OSCC-related death and the risk score calculated from the prediction model based on the Cox model incorporating 20 probe sets. This association was independent of age, gender, and tumor stage. Moreover, the prediction model with risk score plus stage was better than the model with stage alone, suggesting that adding risk score to tumor stage improves the prediction of disease progression or OSCC-related death. To the best of our knowledge, this is the first gene signature using microarray data to predict disease progression/OSCC-related death that has been validated in an independent dataset from a different institution. One limitation in the MDACC is the lack of information on HPV status. Patients with HPV-positive tumors are more likely to have better survival, and HPV-positive tumors are more commonly found in the oropharynx , . Among the 69 MDACC tumor samples, only three tumors were from the oropharynx. Thus, it is unlikely that HPV status would confound the association between risk scores and disease progression/OSCC-related death in the MDACC dataset. The fact that the tumor samples from the MDACC were frozen samples suggested that the use of this risk score is not limited to the RNAlater™-treated samples only.
Another potential use of the 20 or the 71 probe sets is to predict which premalignant lesions are likely to progress to cancer. Future study is needed to address this potential. In addition to prediction of disease progression, some of the 71 probe sets may serve as targets for detection or treatment of OSCC. For instance, SART2 (squamous cell carcinoma antigen recognized by T-cells 2) protein was found to be overexpressed in several types of cancer but not in normal cells . One potential study would be to investigate the level of SART2 protein in saliva or oral rinse to determine whether it could help improve detection of OSCC, especially those that are located in areas that are difficult to visualize. Since SART2 is a tumor antigen recognized by cytotoxic T cells, it could potentially serve as a target for cancer immunotherapy as well. SART2-derived peptide has been shown to be immunogenic in hepatocellular carcinoma patients . Further investigation is needed to explore the potential use of SART2 as a tumor marker or as a target for cancer immunotherapy for OSCC patients. Another gene that has been investigated as a potential anti-cancer target is Integrin α3β1 . Genes in the Integrin family have functional roles in migration/invasion of tumor cells –. Among the 71 probe sets, four were genes in the Integrin family (ITGA3, ITGAV, ITGB6, and ITGA6). The most prominent pathway for the 71 probe sets is Integrin cell surface interaction pathway. Our results lend support to the important role of Integrins in cancer.
Previously, we reported that a 131 gene expression signature provided high discrimination between OSCC samples and normal oral mucosae from controls, and it was associated with OSCC-specific mortality . Most of the 131 probe sets were in the list of 2,596 probe sets differentially expressed between tumor samples and normal oral samples in the current analyses. However, the difference in selection criteria provided different gene lists. The 131 probe sets were selected based on the most significant differences in gene expression between tumor and normal sample but the 71 probe sets were selected by emphasizing their potential involvement in both early and late stage of carcinogenesis.
In conclusion, we found dysregulation of gene expression of 71 probe sets, corresponding to 61 known genes, occurring early in uninvolved oral mucosae from OSCC patients, and the level of dysregulation was even higher in tumor samples. Dysregulation in the expression of 20 of the 71 probe sets was associated with disease free survival of OSCC patients. The result was validated in an independent dataset from the MDACC. If further confirmed in future studies, the expression of these genes has the potential to be developed into a clinical test. Such a test could help physicians to identify patients who need more aggressive treatment or frequent follow-up.
Seventy-one probe sets dysregulated in uninvolved oral samples and tumor samples of OSCC patients compared to normal oral mucosa from non-cancerous patients.
Gene set enrichment analysis identified pathways of 71 probe sets dysregulated in both uninvolved oral samples and cancer samples from OSCC patients.
We wish to express our deepest appreciation to the study participants and their families for their contribution to this study. The study is supported by resources and the use of facilities at the University of Washington Medical Center, Harborview medical Center and the Veterans Affairs Puget Sound Health Care System. We also thank Kevin R. Coombes, PhD of the MD Anderson Cancer Center for his mentorship to FCH and related contributions to this work.
Conceived and designed the experiments: CC PL SS PW. Performed the experiments: PL JH. Analyzed the data: DD PL PW. Contributed reagents/materials/analysis tools: EM NF MU FCH. Wrote the paper: PL.
- 1. Siegel R, Naishadham D, Jemal A (2012) Cancer statistics, 2012. CA Cancer J Clin 62: 10–29. doi: 10.3322/caac.20138
- 2. Gonzalez-Garcia R, Naval-Gias L, Roman-Romero L, Sastre-Perez J, Rodriguez-Campo FJ (2009) Local recurrences and second primary tumors from squamous cell carcinoma of the oral cavity: a retrospective analytic study of 500 patients. Head Neck 31: 1168–80. doi: 10.1002/hed.21088
- 3. Hockel M, Dornhofer N (2005) The hydra phenomenon of cancer: why tumors recur locally after microscopically complete resection. Cancer Res 65: 2997–3002.
- 4. Mucke T, Wagenpfeil S, Kesting MR, Holzle F, Wolff K (2009) Recurrence interval affects survival after local relapse of oral cancer. Oral Oncol 45: 687–91. doi: 10.1016/j.oraloncology.2008.10.011
- 5. Mishra RC, Parida G, Mishra TK, Mohanty S (1999) Tumour thickness and relationship to locoregional failure in cancer of the buccal mucosa. Eur J Surg Oncol 25: 186–9. doi: 10.1053/ejso.1998.0624
- 6. Larsen SR, Johansen J, Sorensen JA, Krogdahl A (2009) The prognostic significance of histological features in oral squamous cell carcinoma. J Oral Pathol Med 38: 657–62. doi: 10.1111/j.1600-0714.2009.00797.x
- 7. Woolgar JA, Rogers S, West CR, Errington RD, Brown JS, et al. (1999) Survival and patterns of recurrence in 200 oral cancer patients treated by radical surgery and neck dissection. Oral Oncol 35: 257–65. doi: 10.1016/s1368-8375(98)00113-4
- 8. Brandwein-Gensler M, Teixeira MS, Lewis CM, Lee B, Rolnitzky L, et al. (2005) Oral squamous cell carcinoma: histologic risk assessment, but not margin status, is strongly predictive of local disease-free and overall survival. Am J Surg Pathol 29: 167–78. doi: 10.1097/01.pas.0000149687.90710.21
- 9. Slaughter DP, Southwick HW, Smejkal W (1953) “Field Cancerization” in Oral Stratified Squamous Epithelium: Clinical Implications of Multicentric Origin. Cancer 6: 963–8. doi: 10.1002/1097-0142(195309)6:5<963::aid-cncr2820060515>3.0.co;2-q
- 10. Tabor MP, Brakenhoff RH, van Houten VM, Kummer JA, Snel MH, et al. (2001) Persistence of genetically altered fields in head and neck cancer patients: biological and clinical implications. Clin Cancer Res 7: 1523–32.
- 11. Giaretti W, Maffei M, Pentenero M, Scaruffi P, Donadini A, et al. (2012) Genomic aberrations in normal apperaing mucosa fields distal from oral potentially malignant lesions. Cell Oncol 35: 43–52. doi: 10.1007/s13402-011-0064-2
- 12. Nees M, Homann N, Discher H, Andi T, Enders C, et al. (1993) Expression of mutated p53 occurs in tumor-distant epithelia of head and neck cancer patients: A possible molecular basis for the development of multiple tumors. Cancer Res 53: 4189–96.
- 13. Bergler W, Bier H (1989) Ganzer (1989) The expression of epidermal growth factor receptors in the oral mucosa of patients with oral cancer. Arch Otorhinolaryngol 246: 121–5. doi: 10.1007/bf00456651
- 14. Bloching MM, Soulsby H, Naumann A, Aust W, Merkel D, et al. (2008) Tumor risk assessment by means of immunocytochemical detection of early pre-malignant changes in buccal smears. Oncol Rep 19: 1373–1379. doi: 10.3892/or.19.6.1373
- 15. Kale AD, Mane DR, Babji D, Gupta K (2012) Establishment of field cahgne by expression of cytokeratin 8/18, 19, and MMP-9 in an apparently normal oral mucosa adjacent to squamouse cell carcinoma: A immunohistochemical study. J Oral Maxillofac Pathol 16: 10–5. doi: 10.4103/0973-029x.92966
- 16. Kannan S, Kartha CC, Balaram P, Chandran GJ, Pillai MR, et al. (1996) Ultrastructural analysis of the adjacent epithelium of oral squamous cell carcinoma. Br J Oral Maxillofac Surg 34: 51–7. doi: 10.1016/s0266-4356(96)90136-9
- 17. Waridel F, Estreicher A, Bron L, Flaman JM, Fontolliet C, et al. (1997) Field cancerization and polyclonal p53 mutation in the upper aerodigestive tract. Oncogene 14: 163–9. doi: 10.1038/sj.onc.1200812
- 18. Thomson PJ (2002) Field change and oral cancer: new evidence for widespread carcinogenesis? Int J Oral Maxillofac Surg 31: 262–266. doi: 10.1054/ijom.2002.0220
- 19. Araki D, Uzawa K, Watanabe T, Shiiba M, Miyakawa A, et al. (2002) Frequent allelic losses on the short arm of chromosome 1 and decreased expression of the p73 gene at 1p36.3 in squamous cell carcinoma of the oral cavity. Int J Oncol 20: 355–360. doi: 10.3892/ijo.20.2.355
- 20. Kayahara H, Yamagata H, Tanioka H, Miki T, Hamakawa H (2001) Frequent loss of heterozygosity at 3p25-p26 is associated with invasive oral squamous cell carcinoma. J Hum Genet 46: 335–341. doi: 10.1007/s100380170069
- 21. Wang XL, Uzawa K, Imai FL, Tanzawa H (1999) Localization of a novel tumor suppressor gene associated with human oral cancer on chromosome 4q25. Oncogene 18: 823–825. doi: 10.1038/sj.onc.1202318
- 22. Mao EJ, Schwartz SM, Daling JR, Beckmann AM (1998) Loss of heterozygosity at 5q21-22 (adenomatous polyposis coli gene region) in oral squamous cell carcinoma is common and correlated with advanced disease. J Oral Pathol Med 27: 297–302. doi: 10.1111/j.1600-0714.1998.tb01960.x
- 23. El-Naggar AK, Coombes MM, Batsakis JG, Hong WK, Goepfert H, et al. (1998) Localization of chromosome 8p regions involved in early tumorigenesis of oral and laryngeal squamous carcinoma. Oncogene 16: 2983–2987. doi: 10.1038/sj.onc.1201808
- 24. Mao L, Lee JS, Fan YH, Ro JY, Batsakis JG, et al. (1996) Frequent microsatellite alterations at chromosomes 9p21 and 3p14 in oral premalignant lesions and their value in cancer risk assessment. Nat Med 2: 682–685. doi: 10.1038/nm0696-682
- 25. Yamashita Y, Miyakawa A, Mochida Y, Aisaki K, Yama M, et al. (2002) Genetic aberration on chromosome 10 in human oral squamous cell carcinoma. Int J Oncol 20: 595–598. doi: 10.3892/ijo.20.3.595
- 26. Lazar AD, Winter MR, Nogueira CP, Larson PS, Finnemore EM, et al. (1998) Loss of heterozygosity at 11q23 in squamous cell carcinoma of the head and neck is associated with recurrent disease. Clin Cancer Res 4: 2787–2793.
- 27. Lee DJ, Koch WM, Yoo G, Lango M, Reed A, et al. (1997) Impact of chromosome 14q loss on survival in primary head and neck squamous cell carcinoma. Clin Cancer Res 3: 501–505.
- 28. Adamson R, Jones AS, Field JK (1994) Loss of heterozygosity studies on chromosome 17 in head and neck cancer using microsatellite markers. Oncogene 9: 2077–2082.
- 29. Imai FL, Uzawa K, Miyakawa A, Shiiba M, Tanzawa H (2001) A detailed deletion map of chromosome 20 in human oral squamous cell carcinoma. Int J Mol Med 7: 43–47. doi: 10.3892/ijmm.7.1.43
- 30. Chen L, Wong MP, Cheung LK, Samaranayake LP, Baum L, et al. (2005) Frequent allelic loss of 21q11.1 approximately q21.1 region in advanced stage oral squamous cell carcinoma. Cancer Genet Cytogenet 159: 37–43. doi: 10.1016/j.cancergencyto.2004.09.011
- 31. Yamamoto N, Uzawa K, Yakushiji T, Shibahara T, Noma H, et al. (2001) Analysis of the ANA gene as a candidate for the chromosome 21q oral cancer susceptibility locus. Br J Cancer 84: 754–759.
- 32. Freier K, Knoepfle K, Flechtenmacher C, Pungs S, Devens F, et al. (2010) Recurrent copy number gain of transcription factor SOX2 and corresponding high protein expression in oral squamous cell carcinoma. Genes Chromosom Cancer 49: 9–16. doi: 10.1002/gcc.20714
- 33. Baldwin C, Garnis C, Zhang L, Rosin MP, Lam WL (2005) Multiple microalterations detected at high frequency in oral cancer. Cancer Res 65: 7561–7567.
- 34. Uchida K, Oga A, Nakao M, Mano T, Mihara M, et al. (2011) Loss of 3p26.3 is an independent prognostic factor in patients with oral squamous cell carcinoma. Oncol Rep 26: 463–469.
- 35. Kooren JA, Rhodus NL, Tang C, Jagtap PD, Horrigan BJ, et al. (2011) Evaluating the potential of a novel oral lesion exudate collection method coupled with mass spectrometry-based proteomics for oral cancer biomarker discovery. Clinical Proteomics 8: 13. doi: 10.1186/1559-0275-8-13
- 36. Remmerbach TW, Maurer K, Janke S, Schellenberger W, Eschrich K, et al. (2011) Oral brush biopsy analysis by matrix assisted laser desorption/ionisation-time of flight mass spectrometry profiling--a pilot study. Oral Oncol 47: 278–281. doi: 10.1016/j.oraloncology.2011.02.005
- 37. Jou YJ, Lin CD, Lai CH, Chen CH, Kao JY, et al. (2010) Proteomic identification of salivary transferrin as a biomarker for early detection of oral cancer. Anal Chim Acta 681: 41–48. doi: 10.1016/j.aca.2010.09.030
- 38. Wang Z, Feng X, Liu X, Jiang L, Zeng X, et al. (2009) Involvement of potential pathways in malignant transformation from oral leukoplakia to oral squamous cell carcinoma revealed by proteomic analysis. BMC Genomics 10: 383. doi: 10.1186/1471-2164-10-383
- 39. Sun G, He H, Ping F, Zhang F (2009) Proteomic diagnosis models from serum for early detection of oral squamous cell carcinoma. Artif Cells Blood Substit Immobil Biotechnol 37: 125–129. doi: 10.1080/10731190902913759
- 40. Hu S, Arellano M, Boontheung P, Wang J, Zhou H, et al. (2008) Salivary proteomics for oral cancer biomarker discovery. Clin Cancer Res 14: 6246–6252. doi: 10.1158/1078-0432.ccr-07-5037
- 41. Lo WY, Tsai MH, Tsai Y, Hua CH, Tsai FJ, et al. (2007) Identification of over-expressed proteins in oral squamous cell carcinoma (OSCC) patients by clinical proteomic analysis. Clin Chim Acta 376: 101–107. doi: 10.1016/j.cca.2006.06.030
- 42. Chang KP, Yu JS, Chien KY, Lee CW, Liang Y, et al. (2011) Identification of PRDX4 and P4HA2 as metastasis-associated proteins in oral cavity squamous cell carcinoma by comparative tissue proteomics of microdissected specimens using iTRAQ technology. Journal of Proteome Research 10: 4935–4947. doi: 10.1021/pr200311p
- 43. Chiang WF, Ho HC, Chang HY, Chiu CC, Chen YL, et al. (2011) Overexpression of Rho GDP-dissociation inhibitor alpha predicts poor survival in oral squamous cell carcinoma. Oral Oncol 47: 452–458. doi: 10.1016/j.oraloncology.2011.03.025
- 44. Liao KA, Tsay YG, Huang LC, Huang HY, Li CF, et al. (2011) Search for the tumor-associated proteins of oral squamous cell carcinoma collected in Taiwan using proteomics strategy. Journal of Proteome Research 10: 2347–2358. doi: 10.1021/pr101146w
- 45. Cardoso F, van't Veer L, Rutgers E, Loi S, Mook S, et al. (2008) Clinical application of the 70-gene profile: the MINDACT trial. J Clin Oncol 26: 729–735. doi: 10.1200/jco.2007.14.3222
- 46. Chen C, Mendez E, Houck J, Fan W, Lohavanichbutr P, et al. (2008) Gene expression profiling identifies genes predictive of oral squamous cell carcinoma. Cancer Epidemiol Biomarkers Prev 17: 2152–62. doi: 10.1158/1055-9965.epi-07-2893
- 47. Lo Muzio L, Santarelli A, Emanuelli M, Pierella F, Sartini D, et al. (2006) Genetic analysis of oral squamous cell carcinoma by cDNA microarrays focused apoptotic pathway. International Journal of Immunopathology and Pharmacology 19: 675–682.
- 48. Ziober AF, Patel KR, Alawi F, Gimotty P, Weber RS, et al. (2006) Identification of a gene signature for rapid screening of oral squamous cell carcinoma. Clin Cancer Res 12: 5960–5971. doi: 10.1158/1078-0432.ccr-06-0535
- 49. Cheong SC, Chandramouli GV, Saleh A, Zain RB, Lau SH, et al. (2009) Gene expression in human oral squamous cell carcinoma is influenced by risk factor exposure. Oral Oncol 45: 712–719. doi: 10.1016/j.oraloncology.2008.11.002
- 50. Kuriakose MA, Chen WT, He ZM, Sikora AG, Zhang P, et al. (2004) Selection and validation of differentially expressed genes in head and neck cancer. Cell Mol Life Sci 61: 1372–1383. doi: 10.1007/s00018-004-4069-0
- 51. Belbin TJ, Singh B, Smith RV, Socci ND, Wreesmann VB, et al. (2005) Molecular profiling of tumor progression in head and neck cancer. Arch Otolaryngol Head Neck Surg 131: 10–18. doi: 10.1001/archotol.131.1.10
- 52. Braakhuis BJM, Tabor MP, Kummer JA, Leemans CR, Brakenhoff RH (2003) A genetic explanation of Slaughter's concept of field cancerization: evidence and clinical implications. Cancer Res 63: 1727–30.
- 53. Lohavanichbutr P, Houck J, Fan W, Yueh B, Mendez E, et al. (2009) Genomewide gene expression profiles of HPV-positive and HPV-negative oropharyngeal cancer: potential implications for treatment choices. Arch Otolaryngol Head Neck Surg 135: 180–8. doi: 10.1001/archoto.2008.540
- 54. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, et al. (2005) Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 102: 15545–50. doi: 10.1073/pnas.0506580102
- 55. Mendez E, Houck JR, Doody DR, Fan W, Lohavanichbutr P, et al. (2009) A genetic expression profile associated with oral cancer identifies a group of patients at high risk of poor survival. Clin Cancer Res 15: 1353–61. doi: 10.1158/1078-0432.ccr-08-1816
- 56. Gillison ML, Koch WM, Capone RB, Spafford M, Westra WH, et al. (2000) Evidence for a causal association between human papillomavirus and a subset of head and neck cancers. J Natl Cancer Inst 92: 709–20. doi: 10.1093/jnci/92.9.709
- 57. Schwartz SR, Yueh B, McDougall JK, Daling JR, Schwartz SM (2001) Human papillomavirus infection and survival in oral squamous cell carcinoma: a population based study. Otolaryngol Head Neck Surg 125: 1–9. doi: 10.1067/mhn.2001.116979
- 58. Nakao M, Shichijo S, Imaizumi T, Inoue Y, Matsunaga K, et al. (2000) Identification of a gene coding for a new squamous cell carcinoma antigen recognized by the CTL. J Immunol 164: 2565–74.
- 59. Mizukoshi E, Nakamoto Y, Arai K, Yamashita T, Sakai A, et al. (2011) Comparative analysis of various tumor-associated antigen-specific t-cell responses in patients with hepatocellular carcinoma. Hepatology 53: 1206–16. doi: 10.1002/hep.24149
- 60. Subbaram S, Dipersio CM (2011) Integrin alpha3beta1 as a breast cancer target. Expert Opinion on Therapeutic Targets 15: 1197–210. doi: 10.1517/14728222.2011.609557
- 61. Lee CY, Huang CY, Chen MY, Lin CY, Hsu HC, et al. (2011) IL-8 increases integrin expression and cell motility in human chondrosarcoma cells. J Cell Biochem 112: 2549–57. doi: 10.1002/jcb.23179
- 62. Wang Y, Shenouda S, Baranwal S, Rathinam R, Jain P, et al. (2011) Integrin subunits alpha5 and alpha6 regulate cell cycle by modulating the chk1 and Rb/E2F pathways to affect breast cancer metastasis. Molecular Cancer 10: 84. doi: 10.1186/1476-4598-10-84
- 63. Morgan MR, Jazayeri M, Ramsay AG, Thomas GJ, Boulanger MJ, et al. (2011) Psoriasin (S100A7) associates with integrin beta6 subunit and is required for alphavbeta6-dependent carcinoma cell invasion. Oncogene 30: 1422–35. doi: 10.1038/onc.2010.535