Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Copy number gains at chr3p25 and chr11p11 are associated with lymph node involvement and survival in muscle-invasive bladder tumors

  • Karla J. Lindquist ,

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Writing – original draft

    karla.lindquist@ucsf.edu

    Affiliation Department of Urology, University of California San Francisco, San Francisco, CA, United States of America

  • Thomas Sanford,

    Roles Conceptualization, Data curation, Writing – review & editing

    Affiliation Department of Urology, University of California San Francisco, San Francisco, CA, United States of America

  • Terence W. Friedlander,

    Roles Conceptualization, Writing – review & editing

    Affiliation Department of Medicine, Division of Hematology & Medical Oncology, University of California San Francisco, San Francisco, CA, United States of America

  • Pamela L. Paris ,

    Contributed equally to this work with: Pamela L. Paris, Sima P. Porten

    Roles Conceptualization, Funding acquisition, Supervision, Writing – review & editing

    Affiliations Department of Urology, University of California San Francisco, San Francisco, CA, United States of America, Department of Medicine, Division of Hematology & Medical Oncology, University of California San Francisco, San Francisco, CA, United States of America

  • Sima P. Porten

    Contributed equally to this work with: Pamela L. Paris, Sima P. Porten

    Roles Conceptualization, Data curation, Supervision, Writing – review & editing

    Affiliation Department of Urology, University of California San Francisco, San Francisco, CA, United States of America

Abstract

Patients with muscle-invasive bladder cancer (MIBC) have poorer prognoses if cancer has metastasized to the lymph nodes. Genomic markers of lymph node involvement (LNI) would be useful for treatment planning, especially if measured at the biopsy stage, but large-scale studies of tumor tissue at any stage are needed to discover robust markers of LNI. We performed a genome-wide query of copy number alterations (CNA) in 237 MIBC surgical tumor specimens from patients in The Cancer Genome Atlas who had radical cystectomy and lymphadenectomy without neoadjuvant treatment. Pathology reports were independently reviewed to confirm LNI, and copy number data was analyzed to confirm gene-level gains and losses while adjusting for tumor purity and ploidy. Using logistic regression and elastic net models, we identified the CNA most significantly associated with LNI. Multivariable logistic regression was used to describe these CNA associations while adjusting for clinical variables. Kaplan-Meier and Cox regression were used to describe their association with overall survival. Gains in 26 genes were identified as having strong associations with LNI. After adjusting for age, gender, race, pathological tumor stage, histology, and number of nodes examined, gains in 22 genes on chr3p25 or chr11p11 remained significantly associated with LNI (p<0.01) and improved model discrimination over clinical variables alone (p = 0.04). They were also associated with shorter overall survival (adjusted p = 0.02). These results suggest that a simple genomic test for gains in chr3p25 and chr11p11 could inform adjuvant treatment or clinical trial decisions if validated in external cohorts. Additional studies will also be needed to determine if these CNA are detectible in biopsy tissue and can inform clinical decisions at the preoperative stage.

Introduction

Bladder cancer is the sixth most common cancer in the US. Metastatic bladder cancer is estimated to have caused 16,390 deaths in 2016 in the US, and almost all of these cases will have originated as muscle-invasive bladder cancer (MIBC) [1].

MIBC patients have a wide spectrum of clinical characteristics, but have markedly worse prognosis if cancer has metastasized with lymph node involvement (LNI). Though radical cystectomy with lymphadenectomy can improve outcomes, studies have shown that 25% of patients are clinically under-staged prior to cystectomy [2], and patients with LNI have higher recurrence and mortality rates compared to patients without LNI [35]. To illustrate this, 5-year overall and disease-free survival rates in patients with LNI are 35% and 31%, respectively, versus 89% and 69% for patients without LNI [3]. For patients with LNI, prognoses can vary, despite efforts to improve the TNM staging system to better reflect prognosis using lymph node information [611]. Better methods for determining prognosis in these patients are required to identify those at highest risk of recurrence who might benefit from the additional chemotherapy, radiotherapy, or a clinical trial.

Several RNA-based tests measuring gene expression in MIBC tumor samples taken at cystectomy have been associated with LNI [1214] and outcomes [1518], but are not yet in routine clinical use because they have not been validated in external cohorts or do not add significant value over clinical information alone. DNA-based markers, such as gene copy number alterations (CNA) that are associated with expression levels, may be more robust than RNA-based tests alone. The association of CNA in primary MIBC tumors and LNI has not previously been described. In this study, we aimed to discover robust genomic markers of LNI. We also tested for association of these markers with overall survival as a secondary outcome, and with expression and other biological relationships as forms of internal validation. We analyzed microarray-based copy number (CN) data from cystectomy-derived tumor samples from 237 MIBC patients represented in The Cancer Genome Atlas (TCGA) [19]. We independently reviewed pathology reports and re-processed raw CN data to confirm LNI and gene-level CNA in the tumor samples. Our goal was to test the hypothesis that there are CNA in MIBC tumor genomes that are associated with LNI and survival and that, along with other clinical and molecular markers, may be clinically useful in determining prognosis for MIBC patients.

Materials and methods

Patient and biological specimen selection

All sites contributing samples to TCGA obtained local IRB approval and informed consent prior to submission. The Human Research Protection Program of the Office of Ethics and Compliance at the University of California, San Francisco does not consider this study subject to IRB review because it is research conducted with de-identified or coded private information obtained from IRB-approved repositories that do not include any protected health identifiers.

In this retrospective study that follows REMARK criteria [20], we tested for CNA associated with LNI at the time of cystectomy and also short-term survival for up to two years post-cystectomy, since TCGA had follow-up data for this duration on all patients. TCGA selection criteria required that patients be diagnosed with MIBC (TNM stages T2-T4) with no prior malignancies or neoadjuvant treatment prior to cystectomy. Tumor and matched normal samples from peripheral blood or adjacent histologically normal tissue (>2cm from tumor) were immediately frozen after cystectomy and shipped to the Broad Institute for genomic testing by each treating institution. Samples were included only if >50% were urothelial carcinoma histology (versus variant histology such as squamous differentiation). Pathology review was carried out by four independent pathologists with training in genitourinary pathology. Additional information on patient selection, sample preservation, assay protocols, quality control, reproducibility have been reported previously [19].

For this study, it was required that pathology reports be available in cBioPortal [2122] for our independent review, and that each patient underwent a radical cystectomy and lymphadenectomy, with a minimum of ten nodes examined, since it has been shown that patients with <10 nodes removed have significantly higher recurrence rates and shorter survival [23].

Clinical and genomic measures considered

Two urologic oncologists (TS and SPP) performed a manual review of pathology reports in cBioPortal for each patient in the TCGA bladder cancer cohort to confirm pathological staging, tumor histology, and the number of involved and uninvolved nodes. Pathologic stage was classified according to the TNM classification of the American Joint Committee on Cancer (AJCC) cancer staging manual 7th edition [24]. We categorized histology as pure urothelial, urothelial with squamous differentiation, or other. Lymph node density was defined as the ratio of positive to total nodes examined. Extracapsular extension (ECE) was categorized as present or absent. Age at diagnosis, gender and race data were obtained from TCGA. Race was self-identified and categorized as white or non-white.

As previously reported [19], genomic assays for tumor and normal samples were conducted at the Broad Institute (BI). CN was measured on Affymetrix Genome-Wide Human 6.0 microarrays. The BI used circular binary segmentation (CBS) [25] in processing this data for making CNA calls. We independently processed the data—hereafter referred to as the University of California, San Francisco (UCSF) pipeline—using Allele-Specific Copy Number Analysis of Tumors (ASCAT) to adjust for tumor purity and ploidy [26]. CNA for each RefSeq gene [27] in chromosomes 1–22 and X were called by applying the Genomic Identification of Significant Targets in Cancer 2.0 (GISTIC2.0) algorithm [28] to both the BI and UCSF-processed data. The same GISTIC2.0 parameters were used for the BI and UCSF processing pipelines, with loss thresholds of log2(CN/2) < −0.1 and gain thresholds of log2(CN/2) > 0.1. The GISTIC2.0 algorithm empirically derives amplification and deletion thresholds. “BI-UCSF Calls” were created for each patient, categorized as concordant gains, losses, non-CNA, or discordant calls (S1 Fig). We also examined mRNA expression, somatic mutations, and methylation levels in tumor samples for the genes that had CNA significantly associated with LNI. Additional methodological details can be found in the S1 Text and on the TCGA website (http://cancergenome.nih.gov/).

Statistical and biological analysis

For bivariable tests comparing clinical and pathological characteristics by LNI, two-sided Wilcoxon rank-sum tests [29] for continuous variables and χ2 tests for categorical variables were used. To identify genes with CNA associated with LNI, a unique analysis pipeline consisting of four main stages was developed (S1 Fig). The goal with the first two stages was to screen genes for their association with LNI for the final two stages. In the first stage, all 20,333 RefSeq genes were liberally screened for CNA-LNI associations using bivariable logistic regression models with a multiple comparison-corrected false discovery rate (FDR) q-value of <0.2. The purpose of this first stage was to make the next stage more computationally feasible while focusing on the genes most likely to have robust LNI associations. In the second more stringent screening stage, all genes filtered through the first stage were entered into a multivariable elastic net (EN) regression model [30]. The EN method was chosen because it can simultaneously handle a large number of variables (genes) relative to the number of observations (patients), and can aid in selecting the subset of variables, or groups of correlated variables, that have the strongest association with the outcome. We identified the optimal EN model parameters and most associated subset of genes by using 5-fold cross-validation to assess prediction error in separate samples from those used to fit the model. This process was repeated 100 times, and we selected genes that were the most associated in at least 95% of these replications.

In the third stage of analysis, genes filtered by EN models were grouped into sets based on genomic position and EN model coefficient similarity. In the fourth stage, gene sets were entered into multivariable logistic regression models to test for association with LNI, with and without adjusting for clinical covariates (age, gender, race, pathological tumor stage, histology, and the number of nodes examined). C-statistics for models containing the gene sets or clinical data alone versus all combined were compared using DeLong’s test [31]. Logistic regression model fit was assessed using the Hosmer-Lemeshow test [32], and accuracy was further assessed using plots of actual versus predicted probabilities of LNI.

For overall and disease-free survival analyses, we calculated the time in months from the date of cystectomy to the date of each event, 24 months, loss-to-follow-up, or death for disease-free analyses (whichever came first). We also calculated overall survival up to 12 months post-cystectomy. Log-rank statistics and Kaplan-Meier [33] curves were used to compare overall and disease-free survival by LNI and gene set CNA. Cox proportional hazards models [34] were used to adjust for clinical covariates, with proportional hazards assumptions tested on the basis of Schoenfeld residuals [35], and accuracy was assessed using plots of actual versus predicted survival times.

For biological analyses, the Molecular Signatures Database (MSigDB) [36] was used to determine whether the final gene list had significant overlap with any functional gene sets curated from online pathway databases and PubMed. An FDR q-value of 0.05 was selected as the threshold for significance. Wilcoxon rank-sum tests were used to test for differences in average tumor expression and methylation levels by gene set CNA. cBioPortal was used for visualizing other alterations including amplifications and deletions and somatic mutations by LNI, and for performing network analysis to identify functionally related genes and drug interactions. Finally, other cancer types with the same CNA were searched for in both TCGA and the Catalog of Somatic Mutations in Cancer (COSMIC) [37].

All statistical tests were two-sided, and all 95% confidence intervals (CI) and p-values were estimated by 1,000 bootstrap repetitions. Analyses were conducted with R software version 3.3 [38] or Stata version 11.2 [39]. See S1 Text for additional details.

Results

Patient and biological specimens

There were 412 patients in the TCGA bladder cancer cohort with MIBC. We excluded one patient without a pathology report, 111 patients for whom pathology data was insufficient for LNI verification, three for whom CN data was unavailable, and 60 patients with less than ten nodes examined. This left 237 patients for study, all of whom were diagnosed between 1999 and 2013. One-hundred patients (42%) had LNI and 137 patients (58%) did not. Age at diagnosis, gender, race, tumor histology, and the number of nodes examined did not differ by LNI, but LNI patients were significantly more likely to have higher pathological tumor stages (p<0.01) (Table 1).

thumbnail
Table 1. Patient and clinical characteristics by lymph node involvement.

https://doi.org/10.1371/journal.pone.0187975.t001

Median overall survival time was 14.3 months for the 109 patients (46%) who died at any time during follow-up, and 24.0 months for those who were censored. Within one year after cystectomy 44 (19%) had died, and within two years 88 (37%) had died. Recurrence could not be determined for 55 patients, leaving 182 patients for disease-free survival analyses. Median disease-free survival time was 19.0 months for the 84 patients (46%) who recurred at any time, and 39.2 months for those who were censored. Within two years after cystectomy 19 (10%) had recurred.

Copy number associations

In the first analysis stage to liberally screen for genes associated with LNI, bivariable logistic regression models identified 8,794 of 20,333 RefSeq genes with CNA (based on BI-UCSF calls) that were associated with LN positivity at the q<0.2 level. After a second more stringent screening with multivariable EN models, 100 of these genes were associated with LNI. When these two stages were repeated using the BI and UCSF calls separately, 26 of these 100 genes were significantly associated with LNI. The 26 genes were therefore considered to have the most robust associations, and were retained for further analyses. They were then grouped into six sets based on genomic proximity and similarity in the degree of LNI associations. Twelve contiguous genes in a 0.3Mb section of chromosome 3, cytoband p25.3 (hg19 chr3:9691117–9958084) comprised one set (the “chr3p25” set), ten contiguous genes in a 2.7Mb section of chromosome 11, cytoband p11.2 (chr11:43702143–46402104) comprised another set (the “chr11p11” set), and the remaining four genes comprised single-gene sets on other chromosomes/chromosomal arms (EPHA3, ATE1, HIPK3, AATF). Odds ratios (OR) from bivariable analyses of each of the individual genes and gene sets are shown in Table 2.

thumbnail
Table 2. Bivariable logistic regression results for the 26 genes and 6 gene sets.

https://doi.org/10.1371/journal.pone.0187975.t002

For the final analyses, the chr3p25 and chr11p11 gene sets were combined since gains often co-occurred (χ2 p = 0.07) and were more strongly associated with LNI together (OR = 3.85, 95% CI 2.19–6.78, Table 3). Overall, 86 patients (36.3%) had gains in the chr3p25 or chr11p11 genes, of which 54 (62.8%) were LN-positive. Of the 151 patients without gains in these genes, 46 (30.5%) were LN-positive (χ2 p<0.01). In a multivariable logistic regression model including the combined chr3p25, chr11p11 and four single-gene sets, the single-gene sets were no longer associated with LNI. Since these four genes are also in different genomic regions than the rest, they were excluded from further analyses. After adjusting for clinical covariates (age, gender, race, pathological tumor stage, histology, and the number of nodes examined), the chr3p25, chr11p11 gene set remained significantly associated (p<0.01) with LN-positivity (OR = 3.76, 95% CI 1.96–7.23, Table 3). Model discrimination measured by c-statistics was significantly improved when including the gene sets with clinical data, compared to a model with clinical data alone (c = 0.744 versus c = 0.691 respectively, p = 0.04). The overall fit of the multivariable model with gene sets and clinical variables was good according to the Hosmer-Lemeshow test and predicted versus actual LNI (S2 Fig).

thumbnail
Table 3. Logistic regression results describing gene set association with lymph node involvement, with and without adjusting for clinical variables.

https://doi.org/10.1371/journal.pone.0187975.t003

In Kaplan-Meier analyses, LNI was significantly associated with two-year overall survival (p<0.01), but gains in the chr3p25, chr11p11 gene set were not associated with two-year overall survival (p = 0.09) (Fig 1). However, both LNI and gains in the chr3p25, chr11p11 gene set were significantly associated with one-year overall survival (p<0.01 for both). Stratified Kaplan-Meier curves revealed that the association between chr3p25, chr11p11 and one-year survival was only significant in patients with LNI (p = 0.02), not in those without LNI (p = 0.89) (Fig 1). Neither LNI nor chr3p25, chr11p11 were associated with two-year disease-free survival.

thumbnail
Fig 1. Kaplan-Meier survival curves for 24-month survival.

A. By lymph node involvement, B. By chr3p25, chr11p11 gain. C. By lymph node involvement and chr3p25, chr11p11 gain.

https://doi.org/10.1371/journal.pone.0187975.g001

In Cox models adjusting for age, gender, race, pathological tumor stage, histology and number of nodes examined, chr3p25, chr11p11 gains were not associated with two-year overall survival, but they were associated with one-year survival (HR = 2.20, 95% CI 1.11–4.36). LNI and chr3p25, chr11p11 gains did not have a significant interaction term (p = 0.12) in the multivariable model for one-year survival, but this model revealed that patients with LNI had significantly shorter survival times associated with gains in chr3p25, chr11p11 (HR = 2.66, 95% CI 2.09–3.46), and patients without LNI did not (HR = 1.04, 95% CI 0.80–1.48). Proportional hazards assumptions were met in this Cox model, and predictions were accurate over time patients with and without LNI (S3 Fig).

Biological analyses

MSigDB (36) was used to test the 26 genes for significant overlap with functional gene sets. A previously defined TP53 target gene set [40] significantly overlapped the 26 genes (FDR q<0.01, Table 2). Two of the seven overlapping TP53 target genes were in chr3p25, four were in chr11p11, and one (HIPK3) was in chr11p13. The combined chr3p25, chr11p11 set contained six TP53 target genes and 16 non-TP53 target genes.

Using mRNA measurements, expression levels were higher in tumors with gains versus no gains in the chr3p25 genes (difference in median z-scores = 0.87, Wilcoxon rank-sum p<0.01) and in the chr11p11 genes (difference in median z-scores = 0.34, Wilcoxon rank-sum p<0.01) (S4 Fig). We also tested for associations between LNI and expression in 27 genes from other regions of the genome described by Seiler et al. [13] as part of their expression-based signature of LNI. Adjusting the multivariable logistic regression model for mRNA levels in the Seiler genes did not change the significance of the association between gains in the chr3p25, chr11p11 set and LNI (p<0.01).

Lower methylation levels were associated with gains in chr3p25 (p<0.01), but adjusting for this also had no impact on the CNA associations with LNI. Mutations in the 26 genes were uncommon and not associated with CNA or LNI (S5 Fig). However, TP53 mutations were more likely to occur in patients with gains versus no gains in the chr3p25, chr11p11 set (50% vs. 28% respectively, χ2 p = 0.03), but were not associated with LNI.

In a network analysis of the 26 genes performed using cBioPortal [2122], ten were functionally connected to other genes via state change or expression control, being in the same complex, or by drug interactions (S6 Fig). Five genes in this network (E2F1, E2F3, EGFR, RAF1, and RB1) significantly overlapped genes in the bladder cancer pathway from the Kyoto Encyclopedia of Genes and Genomes (KEGG) [4142] database (FDR q<0.01). Six genes in the network are oncogenes (AKT2, EGFR, EPHA3, PIK3CA, RAF1, TRIO) and one is a tumor suppressor (RB1). Two of the 22 genes in chr3p25 or chr11p11 are targets of experimental small molecule therapeutics (DrugBank number DB03214 targeting ACCS, and DB08235 and DB08236 targeting ARPC4).

CNA in the same direction in our 26 genes were noted in TCGA or COSMIC for bone, breast, intestine, ovary, prostate, S1 Table).

Sensitivity analyses

Genome-wide CNA occurred in similar regions for BI and UCSF processing pipelines (S7 Fig). Cohen’s kappa statistics [43] indicated significant agreement in the direction of CNA calls for 97% of all genes (FDR q<0.05). The median proportion of RefSeq genes with perfect agreement was 69%, which was not significantly associated with ASCAT-estimated tumor ploidy (median 3.0, range 1.5–5.3) or purity (median 60%, range 23–100%), or with LNI. Bivariable logistic regression results were compared for all RefSeq genes using the BI versus the UCSF calls. Coefficients and–log10(p-values) were positively correlated (p<0.01), although UCSF ASCAT-based calls produced more conservative results as expected (median gain/loss OR = 1.57 for BI versus OR = 1.22 for UCSF).

No genes had discordant BI versus UCSF calls associated with LNI. Adding discordant calls to the reference group (rather than just concordant non-CNA calls) resulted in the same 26 genes with CNA associations, plus an additional two genes that are in the chr11p11 region (API5 and TTC17, both having gains associated with LNI).

In multivariable models adjusting for age, gender, race, pathological tumor stage, histology, and number of nodes examined, amplifications (extreme gains of log2(CN/2)>0.9), empirically determined by GISTIC2.0) in the chr3p25 or chr11p11 genes were significantly associated with LNI (OR = 4.08, 95% CI 1.44–11.57). However, only 27 patients (11%) had amplifications in these genes.

Discussion

We have identified genes in chr3p25 and chr11p11 with CN gains that are associated with LNI and survival in patients with MIBC. These associations remained significant after adjusting for clinical characteristics, and discrimination between patients with and without LNI was significantly improved over clinical variables alone. To our knowledge this is the first study to describe tumor-based CN associations with LNI in MIBC patients.

After identifying the genes with CNA that were most associated with LNI, we used various methods to understand the biological mechanisms behind these associations. Over a quarter of the genes in our final list of 26 genes significantly overlapped a TP53 target gene set [40], suggesting that these could be the drivers of the associations we have observed. Network analyses revealed that some of the other genes may also have biological relevance. A total of nine chr3p25 and chr11p11 genes were functionally connected to genes in the KEGG bladder cancer pathway (E2F1, E2F3, EGFR, RAF1, RB1) or to other cancer-related genes (AKT2, PIK3CA, RB1, TRIO). The oncogenes RAF1 and PPARG are located in chr3p25.2, about 2.5Mb upstream from our genes in chr3p25.3. Gains in RAF1 and PPARG occurred in 45% and 46% of patients, respectively (these were the same patients except for two with gains in only PPARG). These gains were significantly associated (p<0.05) with LNI in bivariable analyses, but did not remain associated in multivariable analyses including the genes we identified. The fact that higher expression, lower methylation levels, and more TP53 gene mutations were all associated with gains in chr3p25 and chr11p11 also supports the biological relevance of our findings.

In addition to reviewing pathology reports, we independently processed CN data using a different method than what the BI used, since different methods can affect results. The BI used CBS [25] segmentation and did not account for tumor ploidy or purity. We used ASCAT [26] which adjusts copy number levels for these factors. Differences between the results were likely due to the fact that over half of the tumors in this cohort were estimated to be non-diploid and/or containing a mixture of tumor and normal cells. By using the CNA calls that were associated with LNI under both CBS and ASCAT, we may have excluded some associations that only one method captured, but are confident that our consensus system has resulted in associations that are robust to measurement bias and are therefore unlikely to be false positive findings. We also used cross-validation and reproducibility testing during our screening for CNA associations in order to reduce the risk of overfitting and internally validate our findings.

Another strength of our study is that our analyses were not confined to the regions of the genome with the most frequent or high-amplitude CNA. We considered all gains and losses as potentially associated with LNI. In fact, many of the frequently altered genes were not associated with LNI, including CDKN2A and others noted in the TCGA Research Network bladder cancer study [19]. Amplifications (extreme gains according to the GISTIC2.0 algorithm) [28] in chr3p25 and chr11p11 were strongly associated with LNI in this study, but the LN associations were primarily driven by lower-amplitude changes. In most patients, the gains in chr3p25 and chr11p11 were covered by just one or two segments of distinct copy number levels, so we believe these are the minimal regions associated with LNI. However, it is possible that more specific regions could be identified with different algorithms. All CNA that we identified were robust to two different CN processing pipelines and were detected using the reliable and stable DNA-based Affymetrix Genome-Wide Human Array 6.0 for measuring CN levels.

Although we have identified CNA in biologically plausible genes that have strong statistical associations with LNI and survival, our study does have some limitations. Despite our internal validations through pathology report reviews and statistical cross-validation, this does not guarantee successful validation in external cohorts. In addition, the gene sets were not associated with two-year disease-free survival (nor was LNI). This may be due to the fact that this study was underpowered to detect this because of incomplete overall survival and treatment information after cystectomy in the TCGA data. This also limited our ability to study overall survival beyond two years post-cystectomy, which would have been heavily censored and confounded. Both overall and disease-free survival should be further evaluated in appropriately powered prospective studies with more complete post-cystectomy clinical data.

We used cystectomy tissue for our analyses, whereas biopsy specimens (TURBT) prior to cystectomy are used for clinical staging and for pre-surgical treatment planning. Previous studies have compared classifiers and validated genomic signatures between different tissue sources, with the assumption that in the absence of neoadjuvant chemotherapy, the concordance of TURBT and cystectomy tissue may be high [14,16]. No direct comparisons between TURBT and cystectomy tissues in the literature in the same patient have been made. This is another limitation of our study. Only if validated in a prospective cohort using biopsy tissue from TURBT specimens could these biomarkers be associated with LNI prior to surgical extirpation and thereby provide guidance for neoadjuvant based treatments.

Using cystectomy-derived tumor samples, we detected CN gains in chr3p25 and chr11p11 genes that are associated with LNI and one-year survival over clinical measures alone. Prospective validation studies in external cohorts will be required to determine if a simple genomic test for these CNA can be used to identify patients with LNI that would not have been detected otherwise.

Supporting information

S1 Fig. Analysis pipeline used for identifying the 26 genes considered in final multivariable models.

BI = Broad Institute, UCSF = University of California, San Francisco. CBS = circular binary segmentation. ASCAT = Allele-specific copy number analysis of tumors. CNA = copy number alteration. Mb = Megabase, SD = standard deviation.

https://doi.org/10.1371/journal.pone.0187975.s001

(PDF)

S2 Fig. Accuracy of multivariable logistic regression model predicting LNI with the combined gene set chr3p25, chr11p11 and clinical variables (age, gender, race, pathological tumor stage and race, and number of lymph nodes examined).

95% confidence intervals for the proportions are bootstrapped with 1,000 repetitions. Points are labeled with the number of LN-positive cases / total cases in the predicted probability group. The green line indicates perfect agreement between actual proportion of LN-positive cases and the predicted probability midpoint.

https://doi.org/10.1371/journal.pone.0187975.s002

(PDF)

S3 Fig. Accuracy of multivariable Cox regression model predicting one-year survival with the combined chr3p25, chr11p11 gene set, adjusted for clinical variables (age, gender, race, pathological tumor stage and race, and number of lymph nodes examined).

A. For LN-negative patients. B. For LN-positive patients.

https://doi.org/10.1371/journal.pone.0187975.s003

(PDF)

S4 Fig. Normalized mRNA expression levels in tumor samples by copy number change in each of the 6 gene sets.

Expression is expected to be higher in genes/sets with copy number gains, and lower in genes/sets with losses. Wilcoxon rank-sum tests were two-sided.

https://doi.org/10.1371/journal.pone.0187975.s004

(PDF)

S5 Fig. Copy number alterations (amplifications, deletions, gains and losses), differential expression, and somatic mutations in the 26 genes considered in final multivariable models, by lymph node involvement.

https://doi.org/10.1371/journal.pone.0187975.s005

(PDF)

S6 Fig. Biological network analysis of the 26 genes with gains or losses associated with lymph node-positivity.

Performed using cBioPortal. Alterations include copy number, differential expression, or somatic mutations in the TCGA bladder cancer cohort. Functional connections are via state change or expression control, being in the same protein complex, or having drug interactions.

https://doi.org/10.1371/journal.pone.0187975.s006

(PDF)

S7 Fig. Gain (red) and loss (blue) peaks across the genome, as determined by Genomic Identification of Significant Targets in Cancer (GISTIC2.0) for copy number levels from the Broad Institute (BI) and our (UCSF) copy number processing pipelines.

The amplitude of each peak (y-axis for panels B and C) reflects both the magnitude and frequency of gains and losses across all tumor samples from the 237 patients in the cohort. The same GISTIC parameters were used to determine peaks for both processing pipelines. A. Chromosome. B. GISTIC peaks based on the BI pipeline data. C. GISTIC peaks based on the UCSF pipeline data.

https://doi.org/10.1371/journal.pone.0187975.s007

(PDF)

S1 Table. Tumors in The Cancer Genome Atlas (TCGA) and other studies in the Catalog of Somatic Mutations In Cancer (COSMIC version 78, 37) with the same copy number alterations in the 26 genes associated with lymph node-positivity in the TCGA bladder cancer cohort.

https://doi.org/10.1371/journal.pone.0187975.s008

(DOCX)

S1 Text. Supplemental methods and references.

https://doi.org/10.1371/journal.pone.0187975.s009

(DOCX)

References

  1. 1. American Cancer Society: Cancer Facts and Figures 2016. Atlanta, GA: American Cancer Society, 2016. Available from: http://www.cancer.org/research/cancerfactsstatistics/cancerfactsfigures2016.
  2. 2. Maurer T, Souvatzoglou M, Kübler H, Opercan K, Schmidt S, Herrmann K, et al. Diagnostic efficacy of [11C]choline positron emission tomography/computed tomography compared with conventional computed tomography in lymph node staging of patients with bladder cancer prior to radical cystectomy. Eur Urol. 2012;61(5):1031–8. pmid:22196847.
  3. 3. Stein JP, Lieskovsky G, Cote R, Groshen S, Feng AC, Boyd S, et al. Radical cystectomy in the treatment of invasive bladder cancer: long-term results in 1,054 patients. J Clin Oncol. 2001;19(3):666–75. PubMed pmid:11157016.
  4. 4. Herr HW, Donat SM. Outcome of patients with grossly node positive bladder cancer after pelvic lymph node dissection and radical cystectomy. J Urol. 2001;165(1):62–4; discussion 4. pmid:11125364.
  5. 5. Kassouf W, Agarwal PK, Grossman HB, Leibovici D, Munsell MF, Siefker-Radtke A, et al. Outcome of patients with bladder cancer with pN+ disease after preoperative chemotherapy and radical cystectomy. Urology. 2009;73(1):147–52. pmid:18848348; PubMed Central PMCID: PMCPMC2674246.
  6. 6. Bruins HM, Huang GJ, Cai J, Skinner DG, Stein JP, Penson DF. Clinical outcomes and recurrence predictors of lymph node positive urothelial cancer after cystectomy. J Urol. 2009;182(5):2182–7. pmid:19758623.
  7. 7. Park J, Park S, Song C, Doo C, Cho YM, Ahn H, et al. Effectiveness of adjuvant chemotherapy in transitional cell carcinoma of the urinary bladder with lymph node involvement and/or lymphovascular invasion treated by radical cystectomy. Urology. 2007;70(2):257–62. pmid:17826485.
  8. 8. Hurle R, Naspro R. Pelvic lymphadenectomy during radical cystectomy: a review of the literature. Surg Oncol. 2010;19(4):208–20. pmid:19500973.
  9. 9. Bruins HM, Dorin RP, Rubino B, Miranda G, Cai J, Daneshmand S, et al. Critical evaluation of the American Joint Committee on Cancer TNM nodal staging system in patients with lymph node-positive disease after radical cystectomy. Eur Urol. 2012;62(4):671–6. pmid:22575915.
  10. 10. Vieweg J, Gschwend JE, Herr HW, Fair WR. The impact of primary stage on survival in patients with lymph node positive bladder cancer. J Urol. 1999;161(1):72–6. pmid:10037372.
  11. 11. Zehnder P, Studer UE, Daneshmand S, Birkhäuser FD, Skinner EC, Roth B, et al. Outcomes of radical cystectomy with extended lymphadenectomy alone in patients with lymph node-positive bladder cancer who are unfit for or who decline adjuvant chemotherapy. BJU Int. 2014;113(4):554–60. pmid:24131453.
  12. 12. Dancik G, Aisner D, Theodorescu D. A 20 gene model for predicting nodal involvement in bladder cancer patients with muscle invasive tumors. PLoS Curr. 2011;3:RRN1248. pmid:21858252; PubMed Central PMCID: PMCPMC3157080.
  13. 13. Seiler R, Lam LL, Erho N, Takhar M, Mitra AP, Buerki C, et al. Prediction of Lymph Node Metastasis in Patients with Bladder Cancer Using Whole Transcriptome Gene Expression Signatures. J Urol. 2016;196(4):1036–41. pmid:27105761.
  14. 14. Smith SC, Baras AS, Dancik G, Ru Y, Ding KF, Moskaluk CA, et al. A 20-gene model for molecular nodal staging of bladder cancer: development and prospective assessment. Lancet Oncol. 2011;12(2):137–43. pmid:21256081; PubMed Central PMCID: PMCPMC3613042.
  15. 15. Laurberg JR, Jensen JB, Schepeler T, Borre M, Ørntoft TF, Dyrskjøt L. High expression of GEM and EDNRA is associated with metastasis and poor outcome in patients with advanced bladder cancer. BMC Cancer. 2014;14:638. pmid:25175477; PubMed Central PMCID: PMCPMC4164753.
  16. 16. Lindgren D, Frigyesi A, Gudjonsson S, Sjödahl G, Hallden C, Chebil G, et al. Combined gene expression and genomic profiling define two intrinsic molecular subtypes of urothelial carcinoma and gene signatures for molecular grading and outcome. Cancer Res. 2010;70(9):3463–72. pmid:20406976.
  17. 17. Mitra AP, Lam LL, Ghadessi M, Erho N, Vergara IA, Alshalalfa M, et al. Discovery and validation of novel expression signature for postcystectomy recurrence in high-risk bladder cancer. J Natl Cancer Inst. 2014;106(11). pmid:25344601; PubMed Central PMCID: PMCPMC4241889.
  18. 18. Riester M, Taylor JM, Feifer A, Koppie T, Rosenberg JE, Downey RJ, et al. Combination of a novel gene expression signature with a clinical nomogram improves the prediction of survival in high-risk bladder cancer. Clin Cancer Res. 2012;18(5):1323–33. pmid:22228636; PubMed Central PMCID: PMCPMC3569085.
  19. 19. Network CGAR. Comprehensive molecular characterization of urothelial bladder carcinoma. Nature. 2014;507(7492):315–22. pmid:24476821; PubMed Central PMCID: PMCPMC3962515.
  20. 20. McShane LM, Altman DG, Sauerbrei W, Taube SE, Gion M, Clark GM, et al. REporting recommendations for tumor MARKer prognostic studies (REMARK). Nat Clin Pract Urol. 2005;2(8):416–22. pmid:16482653.
  21. 21. Cerami E, Gao J, Dogrusoz U, Gross BE, Sumer SO, Aksoy BA, et al. The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2012;2(5):401–4. pmid:22588877; PubMed Central PMCID: PMCPMC3956037.
  22. 22. Gao J, Aksoy BA, Dogrusoz U, Dresdner G, Gross B, Sumer SO, et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci Signal. 2013;6(269):pl1. pmid:23550210; PubMed Central PMCID: PMCPMC4160307.
  23. 23. Herr HW, Faulkner JR, Grossman HB, Natale RB, deVere White R, Sarosdy MF, et al. Surgical factors influence bladder cancer outcomes: a cooperative group report. J Clin Oncol. 2004;22(14):2781–9. Epub 2004/06/15. pmid:15199091.
  24. 24. Edge SB, Compton CC. The American Joint Committee on Cancer: the 7th edition of the AJCC cancer staging manual and the future of TNM. Ann Surg Oncol. 2010;17(6):1471–4. pmid:20180029.
  25. 25. Olshen AB, Venkatraman ES, Lucito R, Wigler M. Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics. 2004;5(4):557–72. pmid:15475419.
  26. 26. Van Loo P, Nordgard SH, Lingjærde OC, Russnes HG, Rye IH, Sun W, et al. Allele-specific copy number analysis of tumors. Proc Natl Acad Sci U S A. 2010;107(39):16910–5. pmid:20837533; PubMed Central PMCID: PMCPMC2947907.
  27. 27. Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, et al. RefSeq: an update on mammalian reference sequences. Nucleic Acids Res. 2014;42(Database issue):D756–63. pmid:24259432; PubMed Central PMCID: PMCPMC3965018.
  28. 28. Mermel CH, Schumacher SE, Hill B, Meyerson ML, Beroukhim R, Getz G. GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers. Genome Biol. 2011;12(4):R41. pmid:21527027; PubMed Central PMCID: PMCPMC3218867.
  29. 29. Wilcoxon F. Individual comparisons by ranking methods. Biometrics Bulletin 1945;1(6):80–3.
  30. 30. Zou H, Hastie T. Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society Series B, Statistical methodology. 2005;67(2):19.
  31. 31. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44(3):837–45. pmid:3203132.
  32. 32. Hosmer DW Jr, Lemeshow S. Applied Logistic Regression, 2nd ed. New York: John Wiley & Sons; 2000.
  33. 33. Kaplan EL, Meier P. Nonparametric estimation from incomplete observations. J Am Stat Assoc 1958;53(282):457–81.
  34. 34. Cox DR, Oakes D. Analysis of Survival Data. Monographs on Statistics and Applied Probability. New York, NY: Chapman and Hall; 1984.
  35. 35. Grambsch P, Therneau T. Proportional hazards tests and diagnostics based on weighted residuals. Biometrika 1994;81:515–26.
  36. 36. Liberzon A, Subramanian A, Pinchback R, Thorvaldsdóttir H, Tamayo P, Mesirov JP. Molecular signatures database (MSigDB) 3.0. Bioinformatics. 2011;27(12):1739–40. pmid:21546393; PubMed Central PMCID: PMCPMC3106198.
  37. 37. Forbes SA, Bindal N, Bamford S, Cole C, Kok CY, Beare D, et al. COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer. Nucleic Acids Res. 2011;39(Database issue):D945–50. pmid:20952405; PubMed Central PMCID: PMC3013785.
  38. 38. R Development Core Team. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2008.
  39. 39. StataCorp. 2009. Stata Statistical Software: Release 11. College Station, TX: StataCorp LP.
  40. 40. Perez CA, Ott J, Mays DJ, Pietenpol JA. p63 consensus DNA-binding site: identification, analysis and application into a p63MH algorithm. Oncogene. 2007;26(52):7363–70. pmid:17563751.
  41. 41. Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28(1):27–30. pmid:10592173; PubMed Central PMCID: PMCPMC102409.
  42. 42. Kanehisa M, Goto S, Sato Y, Furumichi M, Tanabe M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 2012;40(Database issue):D109–14. pmid:22080510; PubMed Central PMCID: PMC3245020.
  43. 43. Cohen J. A coefficient of agreement for nominal scales. Educ Psychol Meas 1960;20(1):37–46.