Table 1.
Clinical information statistics of TCGA training datasets, TCGA validation datasets and GSE20685.
Fig 1.
A: Significantly amplified fragments in the breast cancer genome. B: Significantly deleted fragments in the breast cancer genome. C: The distribution of the 50 most significant P genes in breast cancer patients, the top histogram shows the total number of synonymous and non-synonymous mutations in 50 genes per patient, and the right histogram shows the number of mutations in all the 50 genes. Different colors in the heat map indicate the type of mutation, and the gray color indicates no mutation.
Fig 2.
A: The 1021 genes with copy number variation and mutation are involved in the KEGG pathway. B: The 1021 genes with copy number variation and mutation are involved in the GO bp pathway.
Fig 3.
A: The relationship between the error rate and the number of classification trees. B: The importance of out-of-bag of the first 6 genes. C: KM survival curve distribution of the 6-gene signature in the TCGA training set. D: The ROC curve and AUC of the 6-gene signature classification. E: Risk score, survival time, survival status and expression of the 6 genes in TCGA training.
Table 2.
The 6 genes were significantly associated with the overall survival in the training-set patients.
Fig 4.
A: KM survival curve of the distribution of the 6-gene signature in the TCGA test set. B: The ROC curve and AUC of the 6-gene signature classification in the TCGA test set. C: KM survival curve distribution of the 6-gene signature in GSE20685 dataset. D: The ROC curve and AUC of the 6-gene signature classification in GSE20685 dataset.
Table 3.
Univariate and multivariate COX regression analysis was used to identify prognostic clinical factors and clinical independence in the TCGA training set, TCGA test set and GSE20685.
Fig 5.
KEGG_BASE_EXCISION_REPAIR, KEGG_MELANOMA, KEGG_PYRIMIDINE_METABOLISM, KEGG_NUCLEOTIDE_EXCISION_REPAIR, KEGG_SPLICEOSOME, KEGG_CALCIUM_SIGNALING_PATHWAY, KEGG_NEUROACTIVE_LIGAND_RECEPTOR_INTERACTION and KEGG_CYTOSOLIC_DNA_SENSING_PATHWAY GSEA enrichment results for eight pathways.
Fig 6.
A: KM survival curves of the distribution of the 6-gene signature in breast cancer subtypes Basal. B: KM survival curves of the distribution of the 6-gene signature in breast cancer subtypes Luminal A. C: KM survival curves of the distribution of the 6-gene signature in breast cancer subtypes Luminal B. D: KM survival curves of the distribution of the 6-gene signature in breast cancer subtypes Her2-enriched. E: KM survival curves of the distribution of the 6-gene signature in breast cancer subtypes triple negative breast cancer.