SOX2 is an embryonic developmental transcription factor, which is important in the development of the respiratory tract. SOX2 overexpression is associated with aggressive disease in several tumor types. However, SOX2 overexpression and gene amplification associates with favorable outcome in lung squamous cell carcinomas (SCC) and dissimilar results have been reported in lung adenocarcinomas (ADC). The aim of the present study was to evaluate SOX2 expression in NSCLC and determine the relationship with clinico-pathological variables and outcome.
SOX2 protein levels were measured in tissue microarrays (TMAs) containing FFPE samples from two independent lung cancer cohorts (n = 340 & 307) using automated quantitative immunofluorescence (QIF). Assay validation was performed using FFPE preparations of cell lines with known SOX2 expression. Associations of SOX2 levels with main clinico-pathological characteristics and with overall survival were studied using uni-and multivariate analysis.
SOX2 levels were higher in patients with SCC than in ADC in both cohorts (p value<0.0001). In the training cohort, NSCLC patients whose tumors showed high SOX2 (n = 245) had longer survival than those with low SOX2 levels (log rank p = 0.0002). Comparable results were observed in the second independent validation cohort, log rank p = 0.0113. SOX2 positive cases showed a 58% reduction in risk of death in Cox univariate analysis (hazards ratio-HR = 0.42 confidence interval-CI (0.36,0.73), p = 0.0002). SOX2 was associated with significantly longer survival independent of histology in multivariate analysis (hazards ratio-HR = 0.429 confidence interval-CI (0.295, 0.663), p = <0.001).
Citation: Velcheti V, Schalper K, Yao X, Cheng H, Kocoglu M, Dhodapkar K, et al. (2013) High SOX2 Levels Predict Better Outcome in Non-Small Cell Lung Carcinomas. PLoS ONE 8(4): e61427. https://doi.org/10.1371/journal.pone.0061427
Editor: Srikumar P. Chellappan, H. Lee Moffitt Cancer Center & Research Institute, United States of America
Received: December 27, 2012; Accepted: March 8, 2013; Published: April 19, 2013
Copyright: © 2013 Velcheti et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: The authors have no support or funding to report.
Competing interests: The authors have read the journal's policy and have the following conflicts: Dr. Rimm is a consultant for Genoptix/Novartis, the exclusive licensee of the Yale-owned AQUA patent. This does not alter the authors' adherence to all the PLOS ONE policies on sharing data and materials. The rest of the authors declared no competing interests.
SOX2 belongs to the SRY-related HMG-box (SOX) family of embryonic developmental transcription factors . SOX2 plays a critical role in lineage determination and embryonic development of the respiratory tract and the central nervous system , . During development SOX2 is mainly expressed in the central/non-branching airways and appears to play a critical role in maintaining the stem cell-like phenotype in cancer cells , , . SOX2 is amplified and overexpressed in several malignancies including lung, head and neck, esophageal, breast, gastric, and colon carcinomas –. Moreover, SOX2 expression is associated with aggressive phenotype and poor prognosis in several tumor types –. In lung neoplasms SOX2 is frequently upregulated and its gene amplification correlates with protein overexpression in NSCLC , –. SOX2 amplification and overexpression is more common in squamous cell carcinomas (SCC) than in lung adenocarcinomas (ADC)  and the prognostic impact of SOX2 overexpression in NSCLC appears to be dependent on the histologic subtype , . Indeed. SOX2 amplification and overexpression were recently reported to be associated with better outcome in SCC , but with poor outcome in early stage lung ADC (n = 104)  The latter findings support the notion that SOX2 overexpression serves as positive prognostic indicator only in lung SCC and points towards a complex and dissimilar role of this transcription factor in the biology of the two major lung cancer subtypes. However and to our knowledge, these findings have not been clearly reproduced by other groups and validated in independent lung cancer cohorts  In addition, most of the studies evaluating SOX2 protein in lung tumors have used qualitative chromogenic immunohistochemistry with various antibodies and diverse scoring criteria.
In this study we investigate the prognostic role of SOX2 using automated quantitative immunofluorescence (QIF) in two independent lung cancer cohorts and analyzed the relationship between SOX2 levels and the main clinicopathologic features of patients with lung cancer. Our results show that increased tumor SOX2 levels predict better outcome in NSCLC and the effect is independent of histologic type.
Patient cohorts and Tissue Microarrays
Primary NSCLC tumor in the form of formalin-fixed paraffin-embedded tissue from patients at Yale University/New Haven Hospital between January 1980 and October 2003 were obtained from the Yale Pathology Tissue Services. The study was approved by the Human studies committee at Yale University. The data were analyzed anonymously from preexisting patient databases and hence exempt from consent by the human studies committee. In addition to our institutional cohort we assessed an independent cohort of 340 patients with NSCLC diagnosed between 1991 and 2001, obtained from the Sotiria General Hospital and Patras University General Hospital in Greece. In the Yale University lung cancer cohort, the median age of the patients was 66, with 147(48) % male and 160(52) % female patients. All the patients were treatment-naïve at the time of tumor resection or biopsy. The average follow-up period was 51 months (median 31 months Range (0,278)). In the Greek cohort the median age of the patients was 64, with 300(88) % male and 40(12) % female patients. The patient characteristics of this cohort are described in Table 1. The average follow-up period was 24 months (median 20 Range (0, 60)). The patient characteristics for both training cohort and validation cohort were described in Table 1.
Tissue specimens were prepared in a tissue microarray format: representative tumor areas were obtained from formalin-fixed, paraffin-embedded specimens of the primary tumor, and two 0.6-mm cores from each tumor block were arrayed in a recipient block. Control cell lines were formalin-fixed, paraffin-embedded and used as controls: HT29, MB435, BT474, SKBR3, H1299, A549, SW-480, H1666, H1335, MCF-7, HC15, A431, HCC2279, H2882, H1819, HC193, and H2126 were purchased from the American Type Culture Collection (Manassas, VA). Culture conditions and cell-line tissue microarray construction have been published in detail elsewhere .
Antibodies and Immunohistochemistry
The arrays were deparaffinized in heat oven for 20 minutes at 55 C followed by serial xylene washes. They were rehydrated in graded alcohols, and subjected to antigen retrieval using citrate buffer (pH 6) PT module set for 20 min at 97 C. Slides were preincubated with 0.3% bovine serum albumin in 0.1 mol/L TBS for 30 min at room temperature. Slides were then incubated with a cocktail of the SOX2 primary antibody (rabbit monoclonal, clone D6D9; Cell Signaling Technology) diluted 1/100 and a mouse monoclonal antihuman cytokeratin antibody (clone AE1/AE3, M3515; Dako) diluted 1∶100 in bovine serum albumin/TBS overnight at 4°C. Following this the TMAs were incubated for 1 hour with Alexa 546-conjugated goat antimouse secondary antibody (A11003; Molecular Probes) diluted 1∶100 in rabbit EnVision reagent (K4003, Dako). Cyanine 5 (Cy5) directly conjugated to tyramide (FP1117; Perkin-Elmer) at a 1∶50 dilution was used as the fluorescent chromagen for target detection. Prolong mounting medium (ProLong Gold, P36931; Molecular Probes) with 4′,6-diamidino-2-phenylindole (DAPI) was used to stain nuclei in the histospot. Serial sections of a smaller “index” NSCLC array were stained aside both cohorts to confirm assay reproducibility.
To evaluate for run to run variability both the Yale cohort and the Greek cohort were stained and analyzed on different days using the same conditions. The linear regressions (R2) between the two experiments were >0.7 for all the arrays (see figure 1). The specificity of the antibody was demonstrated with Western blot (using MCF7 and NTERA cell lysates as positive control and HELA, A431, BT20 and MB453 cell lysates as negative control).
The upper panels show SOX2 fluorescence (red channel) with characteristic nuclear staining pattern (left) and lack of SOX2 positivity (right). The lower panel shows cytokeratin immunoreactivity in the tumor compartment (AE1/AE3, green channel) and nuclear staining with DAPI (blue channel).
Automated quantitative immunofluorescence (QIF)
Automated quantitative analysis (AQUA™) enables objective measurement of protein concentration within user defined cellular compartments, as described in detail elsewhere . Briefly, a series of monochromatic high-resolution images were captured using an epifluorescent microscope and an algorithm for image collection. For each histospot images were acquired using the signals detected from the SOX2-Cy5 channel, 4′,6-diamidino-2-phenylindole (DAPI) and cytokeratin-Alexa 546 channel. Tumor was distinguished from stromal and nonstromal elements by creating an epithelial tumor “mask” from the cytokeratin-Alexa 546 channel signals. A binary mask is created with each pixel being either “on” or “off” on the basis of an intensity threshold set by visual inspection of histospots. The AQUA score of SOX2 in each subcellular compartment was calculated by dividing the SOX2 compartment pixel intensities by the area of the compartment within which they were measured. (Figure 1) AQUA scores were normalized to the exposure time and bit depth at which the images were captured, allowing scores collected at different exposure times to be directly comparable.
Using linear regressions, R2 greater than 0.4 was indicative of good inter- and intra-array reproducibility and thus the average values for SOX2 AQUA scores from duplicate samples were calculated and treated as independent continuous variables. X-tile software  was used to select the optimal SOX2 concentration cut point for the Greek lung cancer cohort (training set); this cut point was subsequently validated in the Yale University Lung Cancer cohort (validation set).
Patient characteristics were compared between training cohort and validation cohort by using t-test for continuous variables and chi-square test for categorical variables. The associations between SOX2 and characteristics variables were assessed by using t-test or analysis of variance (ANOVA). Overall survival (OS) functions between patients with high and low SOX2 expression were compared using log-rank test. Multivariate COX models were built to examine the effect of SOX2 on overall survival adjusted by the effect of age, gender, stage and histology.
Optimal cut point selection for SOX2 levels.
Since this study was done using a method that provides continuous data, and we had no biological evidence to support a specific cut-point for cases stratification, we used a statistical-based strategy through X-tile software  to determine the optimal cutpoint of the SOX2 scores. Using this method every point of continuous variables is divided into two classes and a standard Monte Carlo simulation is performed to produce chi squared values which can be maximized to find the optimal cut point. Since the optimal separation in survival was obtained using a score of 193 in the Greek cohort, a second independent cohort was required for cutoff validation. Of note, the X-tile generated cutpoint also is the signal detection threshold for the assay in both the cohorts. Both the cohorts were stained in the same experiment using similar conditions and hence the scores were directly comparable without needing normalization.
Assay validation and reproducibility in assessment of SOX2 measurements
To validate the QIF-based assay in FFPE specimens, cultured cell lines with known SOX2 expression were included together with tumor samples into TMAs, serially sectioned and stained. As expected, SOX2 signal was identified only in the tumor (CK-positive) compartment of lung carcinoma samples and showed a predominant nuclear staining pattern (Figure 1). Cell lines with known elevated SOX2 levels (MCF7 and BT474 ) showed higher SOX2 AQUA scores than the negative control cells with known low SOX2 expression (MB435, HT29 and A431, p<0.01, Figure 2).Of note, the AQUA scores obtained in the cell lines were considerably lower than in the SOX2-positive tumors.
SOX2 positive cell lines (MCF7 and BT474) show significantly higher SOX2 AQUA scores compared to cells with known low SOX2 expression (MB435, HT29 and A431 cell lines) (P value = <0.01). Representative fluorescence microphotographs of SOX2 immunoreactivity (red fluorescence channel) in MCF7 and HT29 cells. Insets shows cytokeratin immunostain(AE1/AE3, green channel) and nuclear DAPI fluorescence (blue channel). in the corresponding TMA spots.
The levels of SOX2 in staining from serial sections of the lung cohort arrays showed a high linear regression coefficient (R2 = 0.86, Figure 3), indicating assay reproducibility. Intra-array reproducibility was also demonstrated by comparison of scores in spots present in 2 fold redundancy (R2 = 0.7158, not shown).
Relationship between SOX2 levels and clinic-pathological variables
A description of the two lung cohorts is presented in Table 1.In the Greek cohort (training set), SOX2 levels were measured in 340 lung carcinomas, including 167 (49.12%) SCC,133 (39.12%) ADC, and 40 (11.76%) cases corresponding to other carcinoma subtypes. In the Yale cohort (validation set), SOX2 was measured in 335 lung tumors, including 76(22.69%) SCC, 212(63.28%) ADC and 47(14.03%) other histotypes.
The associations between SOX2 levels and the main clinical and pathology variables of the cohorts are presented in Table 2. In both lung cohorts SOX2 levels were ∼4 fold higher in patients with SCC as compared to ADC (p value<0.0001). In the Yale cohort only, SOX2 expression was higher in men than in female patients (p value 0.002). The overall SOX2 scores were lower in the Yale cohort than in the Greek array, possibly due to the considerable lower proportion of SCC in the former. No significant associations were noted with age or disease stage at diagnosis.
Difference in the overall survival (OS) between patients with high and low SOX2 levels in the Greek cohort
In the Greek cohort, NSCLC patients with high SOX2 tumor levels (n = 245, scores >193) had longer overall survival than subjects with low tumor SOX2 (Figure 4, median survival 42 vs. 21 months; log rank p = 0.0002,). Moreover, high SOX2 levels resulted in 48% risk reduction in NSCLC patients at Cox proportional univariate analysis (hazards ratio-HR = 0.52 confidence interval-CI (0.36,0.73), p = 0.0002).
Validation of the cutpoint and results in the Yale cohort
The cutpoint generated from the training set was applied in the validation cohort after normalization of scores to assess the potential of SOX2 to predict survival. In the Yale cohort, cases with high tumor SOX2 levels (n = 173) showed higher median survival compared to the low SOX2 expressers (Figure 4, median survival, not reached vs. 39 months respectively, log rank p = 0.0113). In addition, elevated tumor SOX2 resulted in 36% reduction in risk for all NSCLC patients at Cox univariate analysis (hazards ratio-HR = 0.64 confidence interval-CI (0.46,0.91), p = 0.01). Taken together, these results validated the X-tile defined cutpoint for SOX2 score stratification in lung carcinoma samples.
Assessment of the independent potential of SOX2 to predict survival
Using multivariate cox proportional hazards regression model, risk estimates related to survival for gender, stage, histological type and SOX2 levels were estimated (Table 3); cases with missing values were excluded. As in the Greek training cohort, SOX2 is an independent prognostic marker in NSCLC patients in the Yale cohort (hazards ratio-HR = 0.429 confidence interval-CI (0.295, 0.663), p = <0.001).
SOX2 expression has been correlated with aggressive phenotype and poor prognosis in several tumor types , , , . However, in patients with squamous cell NSCLC SOX2 expression appears to predict good outcomes . In patients with stage1 lung adenocarcinoma, Sholl et al found the SOX2 expression predicts poor survival . These findings have not been validated and are limited by small sample size and semi-quantitative methods of detection of SOX2 expression.
In this study, we measured the tissue SOX2 levels in two large independent cohorts of patients with NSCLC. We found that SOX2 levels are significantly higher in lung SCC relative to ADC. In addition, high SOX2 predicts better outcome in patients with NSCLC. In SCC subtype the association with survival was significant in the Greek cohort and had a trend towards better survival in the Yale cohort. This is likely because of small number of patients with SCC in the Yale cohort, reflecting the distribution of NSCLC histological subtypes in the US population.
These findings are consistent with previous findings from Bass et al and Wilbertz et al. ,  and confirm the association between SOX2 and SCC subtype; and the independent positive prognostic value of SOX2 in NSCLC. Interestingly, a high SOX2 level in ADC in both studied lung cohorts was also associated with longer survival. This is contrary to the findings reported by Sholl et al. where SOX2 expression correlated with poor outcomes in patients with stage I adenocarcinomas . Wilbertz et al evaluated SOX2 expression by IHC in two cohorts of patients comprising of 315 and 240 patients. They reported no association between SOX2 protein expression and survival in lung adenocarcinomas in either cohort. However, patients with low-level of SOX2 amplifications had poor survival in one of the cohorts (p = 0.009). SOX2 expression was associated with higher tumor grade . The findings from Sholl et al. however were in a relatively small cohort (n = 104) and included only early stage lung ADC. In addition, both studies by Scholl and Wilbertz used qualitative IHC for SOX2 detection and results were not validated in independent external cohorts. The aforementioned factors may, at least in part, account for the striking differences between the reported studies and our findings. Further work including larger series of lung ADC and comparable/multiple SOX2 detection assays will be required to clarify this apparent contradiction.
The mechanisms of regulation and functional role of SOX2 are not clearly understood. SOX2 levels correlate frequently with SOX2 gene amplification. However, SOX2 amplification is a less common event in adenocarcinoma relative to squamous cell carcinoma , . There may be mechanisms other than SOX2 gene amplification driving increased SOX2 protein, particularly in lung adenocarcinomas. Despite the molecular differences and differences in the copy number changes of SOX2 in squamous cell carcinoma and adenocarcinoma, SOX2 expression is an independent predictor of prolonged survival in both squamous cell carcinoma and adenocarcinoma of the lung.
While these results show that SOX2 expression is associated with better outcome in both SCC and ADC, our study has a number of limitations. First, this is a retrospective study and should be interpreted with caution. A second limitation is that the work was performed on TMAs, which may underestimate the heterogeneity of SOX2. Furthermore, TMAs are not used as a standard diagnostic method. Previous studies in other tumor types have shown that TMAs can be representative of tumor samples although the level of redundancy required for adequate representativity appears to vary with the antigen being measured . The level of heterogeneity for SOX2 has not yet been defined, but the observation of association with outcome in two unrelated cohorts increases our confidence in the use of TMAs for this study. In addition, the linear regression coefficient for the SOX2 measurements in samples studied in two fold redundancy within the Yale cohort further confirm this notion (R2 = 0.7158, not shown).
The measurement of SOX2 has potential to help risk stratification of NSCLC patients. While this work is hypothesis generating and the measurement of SOX2 in the clinic would be premature, this work reinforces the importance of SOX2 in NSCLC biology and raises the possibility of using this transcription factor in future NSCLC classification models.
Conceived and designed the experiments: VV DR SG. Performed the experiments: VV HC MK. Analyzed the data: VV XY YD KS HC KD DR SG. Contributed reagents/materials/analysis tools: KD DR XY YD. Wrote the paper: VV DR KS.
- 1. Lefebvre V, Dumitriu B, Penzo-Mendez A, Han Y, Pallavi B (2007) Control of cell fate and differentiation by Sry-related high-mobility-group box (Sox) transcription factors. Int J Biochem Cell Biol 39: 2195–2214.
- 2. Gontan C, de Munck A, Vermeij M, Grosveld F, Tibboel D, et al. (2008) Sox2 is important for two crucial processes in lung development: branching morphogenesis and epithelial cell differentiation. Dev Biol 317: 296–309.
- 3. Graham V, Khudyakov J, Ellis P, Pevny L (2003) SOX2 functions to maintain neural progenitor identity. Neuron 39: 749–765.
- 4. Yuan P, Kadara H, Behrens C, Tang X, Woods D, et al. (2010) Sex determining region Y-Box 2 (SOX2) is a potential cell-lineage gene highly expressed in the pathogenesis of squamous cell carcinomas of the lung. PLoS One 5: e9112.
- 5. Brahmer JR, Tykodi SS, Chow LQ, Hwu WJ, Topalian SL, et al. (2012) Safety and activity of anti-PD-L1 antibody in patients with advanced cancer. N Engl J Med 366: 2455–2465.
- 6. Freier K, Knoepfle K, Flechtenmacher C, Pungs S, Devens F, et al. (2010) Recurrent copy number gain of transcription factor SOX2 and corresponding high protein expression in oral squamous cell carcinoma. Genes Chromosomes Cancer 49: 9–16.
- 7. Long KB, Hornick JL (2009) SOX2 is highly expressed in squamous cell carcinomas of the gastrointestinal tract. Hum Pathol 40: 1768–1773.
- 8. Maier S, Wilbertz T, Braun M, Scheble V, Reischl M, et al. (2011) SOX2 amplification is a common event in squamous cell carcinomas of different organ sites. Hum Pathol 42: 1078–1088.
- 9. Lengerke C, Fehm T, Kurth R, Neubauer H, Scheble V, et al. (2011) Expression of the embryonic stem cell marker SOX2 in early-stage breast carcinoma. BMC Cancer 11: 42.
- 10. Zhang X, Yu H, Yang Y, Zhu R, Bai J, et al. (2010) SOX2 in gastric carcinoma, but not Hath1, is related to patients' clinicopathological features and prognosis. J Gastrointest Surg 14: 1220–1226.
- 11. Neumann J, Bahr F, Horst D, Kriegl L, Engel J, et al. (2011) SOX2 expression correlates with lymph-node metastases and distant spread in right-sided colon cancer. BMC Cancer 11: 518.
- 12. Wang X, Liang Y, Chen Q, Xu HM, Ge N, et al. (2012) Prognostic significance of SOX2 expression in nasopharyngeal carcinoma. Cancer Invest 30: 79–85.
- 13. Saigusa S, Mohri Y, Ohi M, Toiyama Y, Ishino Y, et al. (2011) Podoplanin and SOX2 expression in esophageal squamous cell carcinoma after neoadjuvant chemo-radiotherapy. Oncol Rep 26: 1069–1074.
- 14. Matsuoka J, Yashiro M, Sakurai K, Kubo N, Tanaka H, et al. (2012) Role of the stemness factors sox2, oct3/4, and nanog in gastric carcinoma. J Surg Res 174: 130–135.
- 15. Ge N, Lin HX, Xiao XS, Guo L, Xu HM, et al. (2010) Prognostic significance of Oct4 and Sox2 expression in hypopharyngeal squamous cell carcinoma. J Transl Med 8: 94.
- 16. Huang P, Qiu J, Li B, Hong J, Lu C, et al. (2011) Role of Sox2 and Oct4 in predicting survival of hepatocellular carcinoma patients after hepatectomy. Clin Biochem 44: 582–589.
- 17. Wilbertz T, Wagner P, Petersen K, Stiedl AC, Scheble VJ, et al. (2011) SOX2 gene amplification and protein overexpression are associated with better outcome in squamous cell lung cancer. Mod Pathol 24: 944–953.
- 18. Sholl LM, Barletta JA, Yeap BY, Chirieac LR, Hornick JL (2010) Sox2 protein expression is an independent poor prognostic indicator in stage I lung adenocarcinoma. Am J Surg Pathol 34: 1193–1198.
- 19. Bass AJ, Watanabe H, Mermel CH, Yu S, Perner S, et al. (2009) SOX2 is an amplified lineage-survival oncogene in lung and esophageal squamous cell carcinomas. Nat Genet 41: 1238–1242.
- 20. Hussenet T, Dali S, Exinger J, Monga B, Jost B, et al. (2010) SOX2 is an oncogene activated by recurrent 3q26.3 amplifications in human lung squamous cell carcinomas. PLoS One 5: e8960.
- 21. McCabe A, Dolled-Filhart M, Camp RL, Rimm DL (2005) Automated quantitative analysis (AQUA) of in situ protein expression, antibody concentration, and prognosis. J Natl Cancer Inst 97: 1808–1815.
- 22. Camp RL, Chung GG, Rimm DL (2002) Automated subcellular localization and quantification of protein expression in tissue microarrays. Nat Med 8: 1323–1327.
- 23. Camp RL, Dolled-Filhart M, Rimm DL (2004) X-tile: a new bio-informatics tool for biomarker assessment and outcome-based cut-point optimization. Clin Cancer Res 10: 7252–7259.
- 24. Kao J, Salari K, Bocanegra M, Choi Y-L, Girard L, et al. (2009) Molecular Profiling of Breast Cancer Cell Lines Defines Relevant Tumor Models and Provides a Resource for Cancer Gene Discovery. PLoS One 4: e6146.
- 25. Tolles J, Bai Y, Baquero M, Harris LN, Rimm DL, et al. (2011) Optimal tumor sampling for immunostaining of biomarkers in breast carcinoma. Breast Cancer Res 13: R51.