Table 1.
All called SNVs and indels.
Fig 1.
Recurrently mutated genes in 6 basal-like cell lines.
The 17 genes with identified somatic mutations in two or more cancer cell lines are shown, colored according to mutation type. Heterozygous non-synonymous single nucleotide variants (SNV) are in yellow; homozygous non-synonymous SNVs are in red; heterozygous stopgain/short indels mutations are in light blue; homozygous stopgain/short indels mutations are in blue. 1Non-synonymous SNVs, stopgain SNVs or short indels also identified in a breast cancer study from TCGA (basal-like tumors) [14]. 2Non-synonymous SNVs, stopgain SNVs or short indels also identified in a breast cancer study of triple-negative tumors [28]. The subtype classification and receptor status have been described earlier [17, 18, 24].
Table 2.
Sanger sequencing validation of novel SNVs and indels.
Table 3.
Sanger sequencing validation of SNVs and indels in COSMIC v68, not detected by GATK.
Fig 2.
Copy number estimation using targeted sequencing data.
(a) Whole genome plot of HCC1395 copy number variations derived from targeted sequencing data analyzed with CONTRA in comparison to segmented Affymetrix 6.0 copy number data for the same cell line. For the sequencing data, the black datapoints are the CONTRA adjusted mean log2 ratios, and red datapoints are the CONTRA/GLAD segmented copy log2 ratios. For the Affymetrix 6.0 data, blue datapoints are the segmented copy number data. Zoomed-in plots of three selected genes, (b) CDKN2A and (c) PTEN in HCC1395, and (d) ERBB2 in HCC1954 (color codes are as above). (e) Representative correlation plot for one cell line, HCC1395, of segmented CONTRA copy number data versus Affymetrix 6.0 segmented copy number data (Pearson r = 0.89). Correlation plots for all cell lines are presented in S2 Fig.
Fig 3.
Integrated recurrent somatic mutations and copy number variations.
The 34 genes affected by mutation and/or copy number gain or deletion (|log2 ratio|>2) in two or more cell lines are shown. Color codes are as in Fig 1, with the addition that large homozygous deletions are in green and amplifications are in purple. 1Non-synonymous SNVs, stopgain SNVs, short indels respective deep deletions and amplifications also identified in a breast cancer study from TCGA (basal-like tumors) [14]. 2Non-synonymous SNVs, stopgain SNVs, short indels respective homozygous deletions and amplifications also identified in a breast cancer study of triple negative tumors [28]. The subtype classification and receptor status have been described earlier [17, 18, 24].
Fig 4.
Base replacements in coding regions versus non-coding regions.
Forest plot indicating odds ratio (marker) and 95% confidence intervals (whiskers) for the frequency of indicated base replacements in coding regions (CDS) versus non-coding (non-CDS) regions. The dimensions of the squares are inversely proportional to the standard error (SE) of ln(odds ratio). * P = 0.024; *** P<0.0001 (Bonferroni adjusted).
Fig 5.
Genomic context of mutations in coding regions versus non-coding regions.
Forest plot indicating the odds ratio (marker) and 95% confidence intervals (whiskers) for the trinucleotide context for base replacements (the center base within square brackets, both strands indicated) in coding regions (CDS) versus non-coding (non-CDS) regions. The dimensions of the squares are inversely proportional to the standard error (SE) of ln(odds ratio). ** P = 0.001; *** P<0.0001 (Bonferroni adjusted). Note that the odds ratio for G[A]C/G[T]C is undefined (2 substitutions in coding regions versus 0 in non-coding regions).