In colorectal cancer (CRC), chromosomal instability (CIN) is typically studied using comparative-genomic hybridization (CGH) arrays. We studied paired (tumor and surrounding healthy) fresh frozen tissue from 86 CRC patients using Illumina's Infinium-based SNP array. This method allowed us to study CIN in CRC, with simultaneous analysis of copy number (CN) and B-allele frequency (BAF) - a representation of allelic composition. These data helped us to detect mono-allelic and bi-allelic amplifications/deletion, copy neutral loss of heterozygosity, and levels of mosaicism for mixed cell populations, some of which can not be assessed with other methods that do not measure BAF. We identified associations between CN abnormalities and different CRC phenotypes (histological diagnosis, location, tumor grade, stage, MSI and presence of lymph node metastasis). We showed commonalities between regions of CN change observed in CRC and the regions reported in previous studies of other solid cancers (e.g. amplifications of 20q, 13q, 8q, 5p and deletions of 18q, 17p and 8p). From Therapeutic Target Database, we identified relevant drugs, targeted to the genes located in these regions with CN changes, approved or in trials for other cancers and common diseases. These drugs may be considered for future therapeutic trials in CRC, based on personalized cytogenetic diagnosis. We also found many regions, harboring genes, which are not currently targeted by any relevant drugs that may be considered for future drug discovery studies. Our study shows the application of high density SNP arrays for cytogenetic study in CRC and its potential utility for personalized treatment.
Citation: Jasmine F, Rahaman R, Dodsworth C, Roy S, Paul R, Raza M, et al. (2012) A Genome-Wide Study of Cytogenetic Changes in Colorectal Cancer Using SNP Microarrays: Opportunities for Future Personalized Treatment. PLoS ONE 7(2): e31968. doi:10.1371/journal.pone.0031968
Editor: Amanda Ewart Toland, Ohio State University Medical Center, United States of America
Received: September 16, 2011; Accepted: January 19, 2012; Published: February 20, 2012
Copyright: © 2012 Jasmine et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by the National Institutes of Health grants U01 CA122171, P30 CA 014599, P42ES010349, R01CA102484, and R01CA107431. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Colorectal cancer (CRC) is a common malignancy in developed countries. In the USA it is the second highest cause of cancer-related deaths, with an estimated 102,900 new cases occurring during 2010 , . CRC is typically much less common in developing countries of the world, including Southeast Asia; however, rates are on the rise, perhaps due to aging populations, smoking, changes in diet and a lack of screening programs . In the South Asian population, patients tend to present with CRC at a younger age and typically at later stage , .
Cancer cells are characterized by cytogenetic abnormalities that can be used to define specific disease entities and their prognostic and predictive markers. In CRC, chromosomal abnormalities occur in a non-random pattern along the pathway from adenoma to carcinoma and then to advanced lesions and the formation of metastasis –. There are three known pathways in CRC pathogenesis: chromosomal instability (CIN), microsatellite instability (MSI), and the CpG island methylator phenotype (CIMP) pathways . These three pathways are closely related and tumors occasionally exhibit features of multiple pathways. Most cases of CRC arise through the CIN pathway: for example, via structural rearrangements, amplifications and deletions , with copy number (CN) variation being a common finding , . Some of the believed consequences of CIN are loss of tumor suppressor genes and amplification of oncogenes in the affected regions. In contrast, MSI is less common and is more likely to be associated with hereditary CRC and a better prognosis , . CIN and MSI are thought to involve two separate pathways in the development of CRC , .
Chromosomal abnormalities in CRC have been studied by multiple groups using either comparative genomic hybridization (CGH) or array comparative genomic hybridization (aCGH) –, –, –. This has led to the discovery of many chromosomal aberrations, including gains and losses, portraying a complex picture of disease progression. Particularly common findings are gains in 20q, 13q, 7p, and 8q and losses in 17p, 18q, 8p, 4q, and 5q –. High-density single nucleotide polymorphism (SNP) arrays are an alternative and advantageous method for the analysis of chromosomal abnormalities. This is because a higher resolution can be achieved alongside simultaneous analysis of loss of heterozygosity (LOH) and CN variation . To our knowledge, there are only a few published cytogenetic studies in CRC performed using relatively low-density SNP arrays , , –, and these studies plead a strong case for their use. In 2007, Andersen et al., using a SNP array (Affymetrix 10 K array), found copy neutral LOH (cnLOH) as a common occurrence in CRC . Middeldorp et al. genotyped FFPE tissues from 19 MUTHY-associated CRC patients using Illumina's Golden Gate genotyping and found similar changes mainly in 17p, 18q, 15q and 6p region . CGH or aCGH are unable to detect the occurrence of cnLOH. The consequences of such lesion during human development and cancer have been recently reviewed . SNP arrays therefore capture more detailed information regarding the underlying possible mechanism of the abnormalities detected. Gaasenbeek et al.  compared SNP array analysis on the Affymetrix platform (10 K Xba131) with CGH on 45 unselected CRC cell lines and found many abnormalities that were apparent using the SNP array but were undetected in CGH, confirming the advantage of using a SNP array platform. Lips et al. compared 4 FFPE CRC tissue using Illumina's GoldenGate SNP array (containing 5,861 probes) with DNA from leukocyte, normal FFPE and fresh frozen tissue from the same cases and found comparable results . Furthermore, Lips et al.  used Affymetrix 10 K SNP array for 78 fresh frozen rectal tumor tissues to show the difference in CIN in adenoma and carcinoma. They found five specific chromosomal aberrations that could discriminate between adenoma and carcinoma. Sheffer et al.  used the Affymetrix 50 K XbaI SNP array to correlate CN variations in CRC with gene expression (using Affymetrix U133A), disease progression and survival outcomes. They tested 130 SNP array profiles from different types of tissue (normal, adenoma, different stages CRC and metastasis).They found deletions of 8p, 4p and 15q to be associated with progression of CRC and poor survival outcome.
In contrast to the previous studies, with paired and much larger sample size from a homogenous group of 86 CRC patients, our present study aimed to detect chromosomal aberrations using Illumina's Infinium based high-density (610 k and 300 k) SNP array. We analyzed the data for correlations between CIN and various histopathological and clinical parameters. Furthermore, we identified genes harbored within the affected regions which may contribute to tumor development or may have a protective effect on tumor metastasis. Using the Therapeutic Targets Database (TTD) we matched the genes in the amplification and deletion regions with drugs that are either already approved or are in stages of drug development for targeting these genes. With this novel interrogation, it is possible to identify appropriate drugs for patients exhibiting these particular aberrations and therefore tailor treatment to the individual. Our study, to our knowledge, is the first to report the findings from the use of a high density SNP array platform on a paired and much larger sample size to analyze chromosomal abnormalities in relation to tumor type, location, grade, stage and sex. Our study is also unique in correlating the altered genes with drugs targeting the genes for potential identification of personalized therapy. Finally, our study was conducted in an understudied Southeast Asian population with CRC.
Patient characteristics and histopathological data are presented in Table 1. Paired CN analysis of the SNP and Copy Number Variation (CNV) probe data from the 86 CRC tissues and their corresponding surrounding healthy colonic tissue samples revealed a total of 10,878 genomic segments (autosomes only) with amplification (1501), deletion (1630) and no CN change (7747). By “amplification” we mean an increase in the number of copies of a specific DNA fragment and by “deletion” we mean a loss of part of the DNA from a chromosome (http://www.ornl.gov/sci/techresources/Human_Genome/glossary/). The chromosomal locations of these amplifications (in red) or deletions (in blue) found in the CRC samples are shown in Figure 1. The width of the bar is proportional to the number of samples showing the CN change. For example, amplification was most frequently found in 20q13.2 region (21 of 86 CRC samples) and deletion was most frequently found in 18q21 region (19 of 86 CRC samples). In comparison to corresponding healthy tissue we found amplification in the following cytoband regions of the CRC tissue: chromosome 20 (q11, q12, q13, p11), chromosome 13 (q12, q13, q14, q21, q22, q24, q32, q33), chromosome 8 (q12, q21, q22, q23, q24), chromosome 5 (p12, p13, p14, p15) and chromosome 9 (p21, p22, p23, p24). Similarly, deletions were found in chromosome 18 (q21, q22, q23, q11, q12, p11), chromosome 8 (p21, p22, p23, p12), chromosome 17 (p11, p12, p13), chromosome 9p21 and chromosome 1p36. Most of these findings are also reported by other authors , , .
Amplifications (mean CN≥2.5) are shown in red, and deletions (mean CN≤1.5) are shown in blue. The horizontal size of the bar representation how many samples (at least 9 samples, 10%) exhibited the amplification/deletion.
Types of identified cytogenetic abnormalities
Unlike CGH, which provides only CN data, SNP chips provide information about both CN and BAF. The BAF refers to a measure of “allelic composition”, i.e., the relative contribution to the signal from the B allele. In other words, BAF for a sample shows the theta value for a SNP corrected for cluster position. Therefore, the BAF values for a particular locus for an individual who is homozygous for the B allele, homozygous for the A allele or heterozygous for AB are 1, 0 or 0.5 respectively. BAF offers two major advantages to the use of high density SNP array platforms in cytogenetic studies, as described below with illustration from relevant findings from our study.
First, mosaicism or proportions of different cell populations (in this case, tumor cells) can be obtained from a back calculation using the formula described in our “Statistical Methods” section. In other words, the cytogenomic data derived from such high density SNP chips can also provide information regarding admixture of normal and chromosomally abnormal cells in the studied DNA sample. Figures 2–6 illustrate this in our study. In each of the figures, the upper panel shows the genomic segmentation in different samples (where each row represents a sample, amplification is shown in red, deletion in blue and normal CN without color); the second panel shows the CN in linear scale for a single sample marked in the first and third panel; the third panel shows the BAF of the same sample; and the bottom or fourth panel shows the corresponding chromosomal location with cytoband information. Figure 2 shows that almost the entire 8q arm of sample C_10 had amplification (CN 2.9) with splitting of heterozygote (AB) pane of BAF at ∼0.34 (representing AAB) and 0.66 (representing ABB) indicating mono-allelic amplification in ∼90% cells (in this case cancer cells) giving rise to a mixture of cells. The formula for calculation of mosaicism is shown in “Material and Method” section. It may be noted that if 100% of the cells had similar change, then theoretically we would expect CN = 3 and BAF to be 0.33 and 0.67 (a typical picture of trisomy). The same sample shows deletion in 8p in ∼45% cells. Figure 3 shows amplification in the same 8q arm in another sample, C_1. In contrast to Figure 2, however, BAF in this case does not show any split at all despite increase in CN suggesting bi-allelic amplification in ∼30% of the cell population. BAF, in this case looks absolutely like normal cell, but the CN data clearly shows amplification. Only the combination of CN and BAF data could suggest the possible pathology. If 100% cells had such change, we would expect CN = 4 and BAF = 0.5 (i.e., no splitting). Figure 4 shows chromosome 8p of sample C_1 and clearly shows mono-allelic deletion (e.g. A_ or _B) in ∼80% cells. If 100% cells had such mono-allelic deletion, we would expect CN = 1 and splitting of BAF to 0.0 and 1.0 (a typical picture of complete LOH due to mono-allelic loss). Figure 5 shows mono-allelic deletion of 8p and mono-allelic amplification of 8q both in one sample (C_21). The CN and the BAF splitting suggest this deletion-amplification in ∼65% cell population. Comparison of the deletion events in Figure 4 and Figure 5 clearly show the advantage of analyzing BAF data and CN data together to obtain the proportion of abnormal cells in the tumor sample. Figure 5 also clearly shows that splitting of AB pane in BAF plot may be found both in deletion and amplification states emphasizing the importance of combined data (CN and BAF) for interpreting the cytogenetic change seen in tumor samples. Figure 6 shows an example of splitting of AB pane in BAF data in chromosome 17p without any change in CN status (may be called cnLOH). This type of change is possible in case of acquired UPD or isodisomy. In this case (sample C_19) ∼50% cells show cnLOH in 17p arm only whereas the entire 17q arm is normal. It may be noted that this type of abnormality can not be detected by aCGH. Figure 7 shows an example of monoallelic amplification superimposed on cnLOH in chromosome 20q. A summary of the combination of changes in CN status and BAF and their possible explanation for the mechanism of cytogenetic abnormality is presented in Table 2.
In 8q (where BAF = 0.34, 0.66 and CN = 2.9), approximately 90% of cells have monoallelic amplifications (AAB or ABB). In 8p (where BAF = 0.35, 0.65 and CN = 1.55), approximately 40% of cells have monoallelic deletions (A_ or B_).
BAF = 0.5 and CN = 2.6. Approximately 30% of cells have biallelic amplification (ABAB).
BAF = 0.17, 0.83 and CN = 1.2. Approximately 80% cells have mono-allelic deletion (A_ or B_).
In 8q (where BAF = 0.38, 0.62 and CN = 2.65), approximately 65% of cells have monoallelic amplifications (AAB or ABB). In 8p (where BAF = 0.26, 0.74 and CN = 1.35), approximately 65% of cells have monoallelic deletions (A_ or B_).
Note that 17p shows the normal heterozygote pattern in all cells (BAF = 0.5 and CN = 2) but approximately 50% of cells show cnLOH (AA or BB) in 17q (BAF = 0.25, 0.75 and CN = 2).
Second, with the use of high density SNP arrays, possible mechanisms such as mono-allelic or bi-allelic amplification/deletion, as well as cnLOH (or acquired UPD-like changes) can be detected, as illustrated by Figures 8–10. We used CN≥2.5 or ≤1.5 to mark a segment as amplification or deletion. So the software would consider any segment with CN between 1.6 and 2.4 as copy neutral, but to be conservative we considered only the regions which had CN 1.9–2.1 as copy neutral. The ideogram in Figure 8 shows the regions (in green) where we detected possible cnLOH in a number of cases. Each vertical line represents a sample. While some of these regions were reported previously , we identified several novel abberations in CRC (chromosome 8p, 10, 11, 12, 13, 14, 16p, 17q, 19p, 21q). The cnLOH can be found in the entire chromosome (Figure 9), in parts of the chromosome (Figure 10, entire p-arm and telomeric end of q-arm) or it may also be found in combination with deletion in another part of the chromosome (Figure 11, cnLOH in part of chromosome 6p and deletion in part of chromosome 6q).
Each vertical green line shows the occurrence copy-neutral LOH in one sample.
The cnLOH occurs in the entire chromosome.
The cnLOH occurs in all of 4p and parts of 4q where BAF = 0.25, 0.75 and CN = 2. The portion of 4q where BAF = 0.5 and CN = 2 shows normal heterozygote patterns in all cells.
Association of cytogenetic abnormalities with tumor characteristics
It is well known that chromosomal abnormalities differ in CRC by presence or absence of MSI. In our study, among the MSI (n = 24) and MSS (n = 62) CRC cases, there was statistical significant difference in frequency of amplification/deletion in a total of 217 genomic regions in mainly four chromosomes: chr 13, chr 17, chr 18 and chr 20 (see Table S1 for details). In MSS cases, amplification was more frequent in chr 13q and chr 20q and deletion was more frequent in chr17p and chr18 compared to the MSI cases (see Figure 12). So while looking at the association of chromosomal abnormalities with other tumor characteristics, we stratified the analyses by MSI status. We also analyzed the MSS and MSI cases combined and the results were very similar to MSS cases only (results are not shown). Detailed comparisons of all the regions with amplification and deletion in 62 MSS CRC samples with respect to different cancer and patient characteristics are presented in Table S2, S3, S4, S5, S6. A summary of these analyses is presented in the Table 3 and Table 4 for MSS and MSI cases respectively. Among the MSS cases (see Table 3), regarding the histological diagnosis, deletions at 18q12.1 and 15q15.3 were more frequently found in adenocarcinoma compared to mucinous adenocarcinoma, which supports the findings of Kazama Y et al. . Regarding tumor location, some CIN were exclusive to the left sided CRC, including amplification at 20q13.2 and 13q34 regions. When tumor grading was considered, some CN changes were almost exclusive to low grade tumors, for example deletion at 18q21. Deleted18q21 region encode a total of 227 transcripts (including multiple transcripts from the same gene). Considering only genes with at least 90% overlap in the regions of aneuploidy, there were three transcripts in the deleted 18q21 region. These transcripts are from the oncogenes RAB27B and CCDC68, suggesting that deletion of this region in low-grade cases may be a protective, reactive change. Similarly, amplified region of 13q31.1 contains transcript from the gene SPRY2, which is associated with microtubules but may function as an antagonist of fibroblast growth factor (FGF) in stimulated cells. The amplification of SPRY2 in low-grade tumors as well as lymph node negative tumors (see below) may play an important role by inhibiting the tumor growth and invasion function of FGF. When we looked at cytogenetic changes associated with staging, we found that amplification of 20q13 and deletion of 1p35 were more frequent in stage 1 tumors, compared to stage 2 and 3; where as amplification of 5p12 was more frequently seen in stage 2. Regarding lymph node metastasis, several amplifications (5p15.2, 5p13.1, 13q31.1 and 20q13.2) were more frequently found in lymph node negative cases compared to lymph node positive cases, possibly suggesting amplification of these regions to be protective against lymph node metastasis. We also validated the CN changes detected from this SNP array data by real-time PCR as well as by running the PCR product on Agilent BioAnalyzer 2100 using DNA 12000 chips with multiple amplicons (data not shown). We selected two genes for this validation experiment – Cyp24A1 from 20q13.2 region, amplification of which was significantly associated with lymph node metastasis and RAB27B from 18q22.2 region deletion of which was significantly associated with tumor grade in combined analyses.
Amplification regions are shown in red; deletion regions are shown in blue; regions with no significant change are shown in green. Each column represents a single sample.
Comparison to cytogenetic findings in other cancer
We compared the genomic regions with CN change identified in our study to the genomic regions reported by other investigators to have CN change in other cancers. These studies used CGH arrays, rather than the SNP arrays used in our study. Previous studies on prostate cancer –, lung cancer – and stomach cancer – found some common regions of amplification and deletion that we also detected in CRC in our study. The Venn diagrams in Figure 13a and 13c show chromosomal abnormalities in CRC and at least in one other cancer. There were 60 common regions of amplification and 33 common regions of deletion respectively, whose locations are shown in the ideogram of Figure 13b and 13d. It may be noted that amplifications of 5p, 8q, 20q and parts of 9p21 and 13q13 were common to multiple cancers. Similarly, deletions of 8p, 17p, a large part of chromosome 18 and portions of 1p and 9p21 were also common to multiple cancers. Therefore drugs targeted to these regions may show promising results in several different cancers.
(a) Regions of amplification in our study (red circle) were also found in previous studies of other solid cancers (green circle); several are also known to be down-regulated by drugs that are approved or in development (blue circle). (b) Locations of the common regions of amplification in our study and previous studies of other solid cancers (intersection of red and green circles in Fig. 13a) are shown. (c) Regions of deletion in our study (red circle) were also found in previous studies of other solid cancers (green circle); several are also known to be up-regulated by drugs that are approved or in development (blue circle). (d) Locations of the common regions of deletion in our study and previous studies of other solid cancers (intersection of red and green circles in Fig. 13d) are shown.
Therapeutic Targets Database matches
The Therapeutic Targets Database (TTD) (Version 4.3.02, July 7, 2011)  is a publicly available database of gene products that are known to be targeted by one or more drugs. We searched for genes that overlapped with the regions of significant amplification and deletion in our study to identify drugs that target the relevant genes. A total of 1,229 genes were identified residing in the 115 regions with amplification (100% overlapping with the amplification region in at least 10% of the CRC cases) and 920 genes were identified residing in the 60 regions with deletion (100% of the gene overlapping with the deleted regions in at least 10% of the CRC cases). The details of these regions are presented in Tables S8 (amplification) and S9 (deletion).
For genes located in amplification regions, we searched for drugs that act as antagonists, antibodies, or inhibitors. We found a total of 29 amplified regions (out of 115, see Figure 13a) harboring 39 genes that are currently being targeted by 248 different drugs of these types. It may be noted that 20 of these regions (out of 29) are also amplified in other cancers. After excluding the drugs for which approval status is not known, or discontinued in any phase of drug trial, there were a total of 69 available drugs (4 antibodies, 18 antagonists and 47 inhibitors) in different phases of trial for different purposes. The approved relevant drugs targeting these amplification regions are shown in Table 5 (for complete list, see Table S7a).
Likewise, for genes located in the deletion regions, we searched for drugs that act as activators, agonists, or inducers. We found a total of 10 regions of deletion (out of 60 identified in the present study, see Figure 13c) harboring 11 known genes that are currently being targeted by 29 different drugs. It may be noted that deletion in all these 10 regions are also found in other cancers. After similar filtering, as we did for regions of amplification, we found a total of 21 available drugs (1 inducer, 6 activator and 14 agonists) in different phases of trial for different purposes (for complete list, see Table S7b). Only the approved relevant drugs targeting these deletion regions are shown in Table 6.
Next, we identified amplified or deleted genes that are not yet targeted by any known drug in the TTD. Figure 13a shows that there are 40 more regions of amplification common to CRC and other cancers, which are not yet explored for drug development. These regions harbor a total of 207 possible gene targets (including a number of micro-RNAs and SNO) [data not shown in this paper and will be addressed in subsequent publication]. Figure 13c also shows that there are 23 more regions of deletion common to CRC and other cancers, which are not yet explored for drug development.
We also looked for agonistic drugs that target genes, amplification of which may protect against metastasis. We found a number of genes more frequently amplified in lymph node negative cases than lymph node positive cases. After searching the TTD for these genes, we found a number of agonist drugs at different stages of development against CHRNA4, EGFL7, GRIN1, HRH3, NTSR1, PTK6 genes. As for example, Varenciline (an approved drug for smoking cessation) or AZD1446 (phase II completed for cancer and Alzheimer's disease) are agonists for CHRNA4 (cholinergic nicotinic receptor) and can be tested whether it would be useful to prevent or delay the lymph node metastasis in CRC.
Our study is one of the first and largest to use such a high density SNP array on paired fresh frozen tissue (tumor and healthy) from CRC patients from a Southeast Asian population to identify CIN and correlate these abnormalities with many clinical and histopathological parameters. For the first time, we have illustrated the interpretation of CN and BAF data obtained from these SNP arrays in CRC tissue to understand the different underlying mechanisms for these CIN. However, it may be noted that SNP arrays can not detect translocations. Chromosomal abnormalities in CRC have been studied mostly by using either CGH or aCGH) –, –, –. In contrast to SNP arrays, these platforms need pure and unmixed cells for the interpretation of genomic abnormalities and do not provide BAF data. However, SNP arrays do not require pure samples, since BAF patterns clearly denote the proportion of the cell mixture (mosaicism) in the tumor itself. We have presented a few examples of this.
Using large number of paired samples, we identified a number of genomic amplifications and deletions in CRC tissue which were also reported in previous studies, with the most frequently reported gains in 20q, 13q, 7p, and 8q and losses in 17p, 18q, 8p, 4q, and 5q –, –, –. Proportion of samples showing CIN in a particular genomic region may vary from study to study depending on patient selection and the resolution of reported cytoband regions. For example, amplification in 20q13.2, in our unselected samples, was found in 24% cases, compared to 43% to 81% in other studies , , , . It may be noted that in the series of Douglas et al. (showing gain of 20q13.3 in 81% samples) included only the CIN+ cases . In our study, deletion in 18q was found in 20% cases, compared to 36% to 60% in other studies , , , . Within our samples also, the proportions of CIN in different genomic regions were higher in MSS cases than in MSI (see Figure 12).
Our study suggests that certain chromosomal aberrations can be correlated to histopathological subtypes for CRC, cell differentiation, tumor location and lymph node metastasis, and so that this may be helpful in indicating appropriate treatments and identifying future gene targets for personalized treatments. Like a previous study , in left sided tumors, we also identified a deletion in 18q12.1 (23% of samples in our series). Additionally, we identified amplifications in 20q13.2 and 13q22 and deletions in 8p21.3. No significant amplifications or deletions were observed in right-sided tumors in these regions, suggesting that these changes are specific to left-sided CRC. Left- and right-sided tumors develop via different pathways: tumors with MSI are more frequently found on the right side of the colon, and tumors with CIN somewhat more frequently found on the left side , , . Our study confirmed this distribution.
In our study, 87% of tumor samples were adenocarcinoma (mostly located in the left colon) and 13% were mucinous adenocarcinoma (mainly located in the right colon). We found that deletions in 15q15.3 and 15q21.1 were associated with adenocarcinoma. Kazama et al (2005) showed that mucinous adenocarcinomas are more likely to show MSI rather than CIN . However, mucinous carcinomas are more likely to develop metastasis in the lymph nodes and peritoneum than adenocarcinoma .
As found in other studies , , , , , we found amplification of 20q13.2 and deletion of 17p12 to be common in early stage I CRC, suggesting that these events occur early in CRC carcinogenesis. Additionally, we found amplification of 5p15.2 and deletion of 1p36.21 to occur in 25% of stage I cases, the latter of which has been found previously to be associated with adenomas and early stage disease . However, other investigators have correlated losses of 18q and 19p with stage I disease . Our analysis identified several amplification events related to status of lymph node metastases in CRC. We found amplifications of 5p15.1 - 5p14.3, 13q21.2–13q21.31, 13q21.31–13q21.32, 20q13.2 and 20q13.33 to be associated with the absence of lymph node metastases. This finding suggests that amplification of these regions (and hence the over expression of the genes residing in these regions) may be a protective phenomenon against the development of lymph node metastases. Therefore, existing drugs that induce or activate these particular genes may be of clinical importance in limiting and preventing lymph node metastases. One study found decreased CN in Xp, 13q, 7q and increased CN at 11q13, 15q26, 1q23 and 8p12 to be associated with lymph node metastases. Ghadimi et al  found gain of 8q23–24 to be indicative of the presence of metastases in the lymph nodes which we did not find. Others have found gains in 1q23, 7p, 17q, 13 and deletions in 4p, 8p to be correlated with formation of lymph node and other metastatic lesions , , .
We found deletion of 18q21.1 and amplification of 13q31 in 25% and 17% of low-grade CRC cases, respectively. These changes were not found in any of the high grade cases suggesting the possibility of these abnormalities to be specific for low-grade CRC. Hermsen et al  found loss of 18q to be correlated with the grade of dysplasia in adenomas along with gain of 20q. Another study found gain in chromosome 1, 13, 20, 7p and 8p and loss of 4, 8p and 18q to be associated with adenoma in the progression to carcinoma .
Important genes in affected cytoband regions
In TTD, we found approved antagonists and inhibitors that target particular genes found to be amplified in CRC. For example, Ranibizumab is used in age-related macular degeneration to target FLT1 and Topetecan, a second line of therapy against small cell lung cancer is an inhibitor of TOP1. Denosumab is an approved drug, targeted against TNFSF11, and is currently being used for rheumatoid arthritis and postmenopausal osteoporosis (Table 5). The effectiveness of these drugs in CRC has not yet been investigated. Similarly, Table S6 also shows a number of inhibitors (Topotecan, Bosutinib, Sorafenib, Sunitinib, Herbimycin A etc), that are currently in trials or in use for other solid cancers, that could also be tried in CRC. Our data also suggests that there are a number of inhibitors (Cladribine, Fludarabine, etc.) currently in trials or in use for hematological oncology that may be of benefit in selective CRC patients. However, we are not in a position to comment on actual effectiveness until a clinical trial is done. It may be noted that in a previous publication, we reported some integrative analysis of CN data and gene expression data in a smaller number of CRC cases and showed that 48 up-regulated genes and 60 down regulated genes reside in genomic amplification and deletion regions respectively . Our previous data also suggested that perhaps CN has a bigger influence than methylation on gene expression in CRC .
Our study identified amplification in 5p12 and deletion in 18p11.21 in CRC. These regions are less commonly reported in the literature. The 5p12 region harbors some important genes, for example GHR (growth hormone receptor), SEPPI, PAIP1, NNT and FGF10 that may be related to cancer (for complete list of genes in amplification regions, see Table S8). The deleted 18p11.21 region contains important tumor inhibitory genes, for example EPB41L3, L3MBTL4, ARHGAP28, PTPRM, and ROCK1 (for complete list of genes in deletion regions, see Table S9). Drugs that are agonists to the genes in these regions may be promising individualized treatments for patients with these specific deletions.
In contrast to the other studies, we found significant amplification of 5p15.1 - 5p14.3, 13q21.2–13q21.31, 13q21.31–13q21.32, 20q13.2 and 20q13.33 cytoband regions in the absence of lymph node metastases. At 20q13.33, there are some important genes like DIDO1 (tumor suppressor, apoptosis inducer), COL9A3 (major component of collagen tissue), OGFR (negative regulator of cell proliferation) and ADRM1 (a plasma membrane protein which promotes cell adhesion and expression and is induced by gamma interferon). Since these particular amplifications are found in lymph node negative cases, the over expression of these gene products may in fact have a protective role against the formation of metastases and thus they may be important genes for the delay or even prevention of tumor spread. Drugs that target these genes and induce expression may help prevent tumor spread if given early enough in disease development. Another important group of genes, CDH12 (at 5p15.1-5p14.4) and PCDH17 (13q21.2), were found to be amplified in lymph node negative patients. CDH12 is responsible for synthesis of cadherins, calcium dependent cell adhesion proteins, which contribute to the sorting of heterogeneous cell types. Cellular adhesion is an important function in tumor localization, or in other words may play a role in prevention of tumor spread and therefore may be another protective gene. Chalmers et al have previously suggested that cadherins are important tumor suppressor genes and are often lost in many cancers . The protein encoded by PCDH17 may play a role in the establishment and function of specific cell-cell connections in the brain, but its role in tumor tissue is not clear. However, recently Giefing et al (2011) identified PCDH17 as a candidate tumor suppressor gene in laryngeal squamous cell carcinoma, as it was found to be significantly down-regulated in cancer cell lines . Our data also supports that this gene may be a tumor suppressor and important in inhibiting tumor spread.
AURKA (aurora kinase A) is an important gene located at 20q13.2–20q13.33 which we found to be significantly amplified in left sided tumors compared to right sided tumors (20/67 vs. 0/19) suggesting Aurora kinase inhibitors may be effective in left sided CRC but may not be suitable for right sided CRC. From another point of view, tumors with lymph node metastasis were more often found to have amplification of the AURKA gene than tumors without metastasis, which suggests this gene may have a role in progression and promotion of tumor spread. In fact, this gene encodes a cell cycle-regulated kinase protein required for normal mitosis that appears to be involved in microtubule formation and stabilization at the spindle pole during chromosome segregation. Other studies also showed the similar finding , . Aurora kinase inhibitors are an intense area of research for the development of anticancer therapies, and as such there are several drugs currently in phase I and phase II development targeting this gene. These drugs may be of future use to treat CRC patients who have this amplification to aid in the prevention of tumor progression. The TMEM189-UBE2V1 gene (has a role in cell cycle control and cell differentiation, as well as DNA repair), TPD52L2 gene (regulator of cell proliferation), TP53RK gene (stimulator of a tumor suppressor) and PSMA7 gene (a regulator of the cell cycle) also resides in this 20q13 region. Their function indicates similar role in delaying lymph node metastasis.
Another important gene in the same region 13q31.1, SPRY2 (protein sprouty homolog2, fibroblast growth factor antagonist), was found to be more frequently amplified in low grade (12/67 vs. 0/19), left sided (13/67 vs. 0/19) and lymph node negative tumor cases (9/38 vs. 4/48) than in high grade, right sided and lymph node positive cases. The amplification of the SPRY2 gene in lymph node negative tumors may play an important role by inhibiting the tumor growth and invasion function of FGF and therefore could be a potential therapeutic target.
In conclusion, our study is one of the first to use a high density SNP array to analyze chromosomal abnormalities between paired tumor and healthy tissue from CRC patients and also the first to investigate this in a Southeast Asian population. We successfully identified many regions of amplification, deletion and cnLOH that are related to particular clinical and histopathological subtypes. We also identified key genes within these regions, some of which are targets for currently available drugs or drugs in the process of development suggesting that identifying chromosomal abnormalities in CRC could help in the planning of personalized treatment in future.
Materials and Methods
The study included a total of 86 consecutive patients (M = 50, F = 36) with histologically confirmed CRC from Bangladesh meeting our study criteria. None of the patients received any radiotherapy or chemotherapy before surgery. Samples were collected from the operating room immediately after the surgical resection. For each patient, one sample was collected from the tumor mass, and another sample was taken from the resected, unaffected part of the colon about 5–10 cm away from the tumor mass. From both sites, the tissue was collected as fresh frozen, in RNAlater (RNA-stabilizing buffer, Ambion Inc.) and in Allprotect (DNA and RNA stabilizing buffer, QIAGEN) and stored at −86°C. For tumor tissue collection, the tumor specimen was sectioned and non-tumor areas were trimmed off if possible. H&E sections from the tumor samples were examined for proportion of tumor cells in each sample. Readings from at least five microscopic fields/sample examined by same histopathologist (MR) were averaged to calculate the proportion of tumor cells in a given sample which was on average 72.1% (SD 6.5%) for all 86 tumor samples with no difference between right and left sided tumors (72.1±6.31 vs. 72.1±6.67, p = 0.99). However, the proportion of tumor cells was slightly higher in MSI samples compared to MSS ones (74.6±6.24% vs. 71.2±6.48%, p = 0.029). The histopathological diagnosis was done independently by two histopathologists at Bangabandhu Sheikh Mujib Medical University (BSMMU), Dhaka, Bangladesh. For each patient we also abstracted key demographic and clinical data and tumor characteristics from hospital medical records. Written informed consent was obtained from all participants.
The research protocol was approved by the ethical review committee of BSMMU, Dhaka, Bangladesh (BSMMU/2010/10096) and by the IRB of the University of Chicago (10-264-E), IL, USA. The samples were shipped on dry ice to the molecular genomics lab at the University of Chicago for subsequent DNA extraction and high density SNP array analysis.
A total of 67 patients (M = 41, F = 34) had left sided tumors and 19 (M = 13, F = 6) had right sided tumors. The right side includes the cecum (n = 4), ascending colon (n = 7), hepatic flexure of transverse colon (n = 4) and transverse colon (n = 4) up to the splenic flexure. The left side includes the descending colon (n = 4), sigmoid colon (n = 6), rectosigmoid junction (n = 3) and the rectum (n = 54). Histopathological diagnosis showed adenocarcinoma in 75 patients and mucinous adenocarcinoma in 11 patients. Mucinous adenocarcinoma was recorded when the mucin content was more than 50% . A total of 19 patients had poorly differentiated carcinoma, whereas the tumors in remaining 67 were well to moderately differentiated. TNM classification (http://www.cancerstaging.org/staging/index.html) was used for tumor staging. T1, T2, T3 and T4 represent the presence of tumor within the submucosa, with extension up to the muscular layer, involvement up to the subserous coat or extension beyond the serous coat of the colon respectively. N0 and N1 represent the tumor without or with lymph node involvement respectively. M0 represents no metastasis and M1 represents distant metastasis. Stage I, stage II and stage III are represented by T1/T2N0M0, T3/T4N0 M0 and T3/T4N1M0 respectively.
DNA was extracted from fresh frozen tissue using the Gentra Puregene kit (QIAGEN). Quality Control (QC) was performed for all samples using the Nanodrop 1000.
Microsatellite Instability (MSI) detection
A high-resolution melting (HRM) analysis method was used for detection of two mononucleotide MSI markers – BAT25 and BAT26 . A tumor was defined as MSI when it showed instability with at least one of these markers (BAT25 and BAT26), and as MSS when it showed no instability for both the markers. In this way, a total of 24 tumor samples showed MSI and all were confirmed by another relatively novel MSI marker CAT25  using HRM analysis. We used published primer sequences , . Thermocycling and melting conditions were optimized for CFX96 instrument and Bio-Rad Precision Melt Analysis software was used to identify MSI by differential melting curve characteristics. A representative example is shown in figure S1.
Illumina SNP array
According to Illumina protocol 250 ng of tissue DNA was genotyped on either 610 Quad chips with 620,901 markers (first 24 pairs) or Cyto12 chips with 299,671 markers (remaining 62 pairs) and read on the BeadArray Reader. Each pair (normal and tumor) was placed in the same chip to avoid chip-to-chip variation. Image data was processed in BeadStudio software to generate genotype calls, B allele frequency and logR ratio. The data discussed in this paper have been deposited in NCBI's GEO and will be accessible through GEO Series accession number GSE34678.
To compare the continuous variables (e.g. age, BMI, proportion of tumor cells, average signal intensity), we used one-way analysis of variance (ANOVA). For the categorical variables we used chi-square tests.
Genome-wide CN analysis
BeadStudio normalized intensity values and BAF data were imported into PARTEK genomic suit . Intensity data from the normal tissue was used as a reference for the corresponding CRC tissue for generating CN data for each marker (paired CN analysis). Standard Principal Component Analysis (PCA) and a sample histogram were generated as part of QC. After obtaining the paired CN data for each locus, to identify the genomic segments with amplification, normal CN or deletion, we used genomic segmentation algorithm of PARTEK. By “genomic segment” we mean a stretch of DNA segment of a particular sample showing amplification or deletion. A genomic segment with CN variation in one sample may or may not fully overlap with genomic segment in another sample. By amplification or deletion “region”, we mean the stretch of amplified or deleted genomic segment that is common to a minimum of 9 CRC samples (∼10% of the 86 samples in this paper). For the data from the 610Quad chips and Cyto12 v2.1 chips, the genomic segmentation was done with a setting of a minimum of 25 markers and 10 markers respectively, p-value threshold of 0.001 for two neighboring regions having significantly differing means. A segment was considered as amplified if the geometric mean CN was ≥2.5 and a deletion if the mean was ≤1.5. After segmentation, the data from the two types of chips were combined to get the total genomic segments from all the 86 CRC samples. Out of total 11,905 genomic segments in 86 samples we restricted the analysis for the autosomes only (10,878 segments – of which 1501 showed amplification, 1630 showed deletion and 7,747 no change in CN). The length of a genomic segment was calculated from the genomic location of the start and end SNP for that genomic segment. A total of 8,912 genomic segments (82%) were ≥500 kb size. While looking at the associations between chromosomal amplification/deletion and different CRC phenotypes findings, we restricted the analysis to these genomic segments. In this paper, we reported a genomic segment in a sample to have amplification or deletion only if it was at least 500 kb size and the geometric mean of the CN within the genomic boundary of the segment for that particular sample was ≥2.5 or ≤1.5 respectively. The phenotype of interest was tested for association with amplification/deletion status of the sample using Pearson's Chi-square test. To keep the list of associations between amplification/deletion regions and histopathological findings short, we report only the regions where the CN abnormality was detected in at least 10% of samples (Tables S8 and S9).
Mosaicism between two cell populations can be estimated from SNP array data using an equation described and experimentally verified using cell lines of known aneuploidy in specially-created admixtures by Conlin et al.  or by Nancarrow et al . In our work, we used an equation equivalent to Conlin's:where Bexp is the expected BAF for a given SNP in admixture of two cell types; b1 and b2 are the number (not frequency) of B alleles for the given SNP in cell populations 1 and 2; n1 and n2 are the CN for the given loci in cell populations 1 and 2; and P1 and P2 are the proportions of cell types 1 and 2 in the admixture.
To estimate P1 of a sample, we made back-calculations of Bexp for all probable values of b, n, and P1 and reported the values for which Bexp matched the observed BAF. In practice, the observed BAF and CN limit the possible values for b and n. The macro-enabled spreadsheet used for these back-calculations is included in our supporting information.
Therapeutic Targets Database Matching
Genes in regions of significant amplification or deletion in at least 9 samples were matched against the Therapeutic Targets Database (TTD [Version 4.3.02, July 7, 2011]) , . The database was rebuilt using official gene symbols as the unique identifier. This was necessary for our purposes because: (a) targets in the TTD are identified by UniProt ID, a protein sequence identifier; (b) genes can be associated with multiple UniProt IDs when multiple transcripts are known. IDs were converted using the DAVID Gene ID Conversion tool , , and the database was rebuilt with our own Python scripts. When a single gene corresponded to multiple Uniprot IDs, and hence multiple TTD entries, information from those entries were combined to avoid information loss.
Normalized melt curves and corresponding differential curves for BAT25, BAT26 and CAT 25 amplicons. Each line represents one sample. MSS samples are shown in red. MSI samples are shown in green.
Association of CN changes with MSI status.
Association of CN changes with histological diagnosis.
Association of CN changes with tumor location.
Association of CN changes with tumor grade.
Association of CN changes with tumor stage.
Association of CN changes with lymph node metastasis.
Therapeutic Target Database matches for: (A) antibody, antagonist, inhibitor targeting genes in the amplification regions; (B) agonist, activator and inducer targeting genes in the deletion regions.
List of genes found to be significantly amplified in tumor tissue in at least 10% of samples.
List of genes found to be significantly deleted in tumor tissue in at least 10% of samples.
The authors are thankful to Prof. Zahidul Haq, who performed most of the surgeries and supplied the biological samples.
Conceived and designed the experiments: FJ MK MGK. Performed the experiments: RP MR RR CD SR RPB MK. Analyzed the data: MGK RR. Contributed reagents/materials/analysis tools: MK HA. Wrote the paper: FJ RR CD HA MGK.
- 1. Jemal A, Bray F, Center MM, Ferlay J, Ward E, et al. (2011) Global cancer statistics. CA Cancer J Clin 61: 69–90.
- 2. Jemal A, Siegel R, Xu J, Ward E (2010) Cancer statistics, 2010. CA Cancer J Clin 60: 277–300.
- 3. Ahmed S, Banerjea A, Hands RE, Bustin S, Dorudi S (2005) Microarray profiling of colorectal cancer in Bangladeshi patients. Colorectal Dis 7: 571–575.
- 4. Moore MA, Ariyaratne Y, Badar F, Bhurgri Y, Datta K, et al. (2009) Cancer epidemiology in South Asia - past, present and future. Asian Pac J Cancer Prev 11: Suppl 249–66.
- 5. Berg M, Agesen TH, Thiis-Evensen E, Merok MA, Teixeira MR, et al. (2010) Distinct high resolution genome profiles of early onset and late onset colorectal cancer integrated with gene expression data identify candidate susceptibility loci. Mol Cancer 9: 100.
- 6. Douglas EJ, Fiegler H, Rowan A, Halford S, Bicknell DC, et al. (2004) Array comparative genomic hybridization analysis of colorectal cancer cell lines and primary carcinomas. Cancer Res 64: 4817–4825.
- 7. Hermsen M, Postma C, Baak J, Weiss M, Rapallo A, et al. (2002) Colorectal adenoma to carcinoma progression follows multiple pathways of chromosomal instability. Gastroenterology 123: 1109–1119.
- 8. Knosel T, Petersen S, Schwabe H, Schluns K, Stein U, et al. (2002) Incidence of chromosomal imbalances in advanced colorectal carcinomas and their metastases. Virchows Arch 440: 187–194.
- 9. Pino MS, Chung DC (2010) The chromosomal instability pathway in colon cancer. Gastroenterology 138: 2059–2072.
- 10. Derks S, Postma C, Carvalho B, van den Bosch SM, Moerkerk PT, et al. (2008) Integrated analysis of chromosomal, microsatellite and epigenetic instability in colorectal cancer identifies specific associations between promoter methylation of pivotal tumour suppressor and DNA repair genes and specific chromosomal alterations. Carcinogenesis 29: 434–439.
- 11. Nakao K, Mehta KR, Fridlyand J, Moore DH, Jain AN, et al. (2004) High-resolution analysis of DNA copy number alterations in colorectal cancer by array-based comparative genomic hybridization. Carcinogenesis 25: 1345–1357.
- 12. Nakao M, Kawauchi S, Furuya T, Uchiyama T, Adachi J, et al. (2009) Identification of DNA copy number aberrations associated with metastases of colorectal cancer using array CGH profiles. Cancer Genet Cytogenet 188: 70–76.
- 13. Watanabe T, Wu TT, Catalano PJ, Ueki T, Satriano R, et al. (2001) Molecular predictors of survival after adjuvant chemotherapy for colon cancer. N Engl J Med 344: 1196–1206.
- 14. Aragane H, Sakakura C, Nakanishi M, Yasuoka R, Fujita Y, et al. (2001) Chromosomal aberrations in colorectal cancers and liver metastases analyzed by comparative genomic hybridization. Int J Cancer 94: 623–629.
- 15. De Angelis PM, Clausen OP, Schjolberg A, Stokke T (1999) Chromosomal gains and losses in primary colorectal carcinomas detected by CGH and their associations with tumour DNA ploidy, genotypes and phenotypes. Br J Cancer 80: 526–535.
- 16. Ghadimi BM, Grade M, Liersch T, Langer C, Siemer A, et al. (2003) Gain of chromosome 8q23–24 is a predictive marker for lymph node positivity in colorectal cancer. Clin Cancer Res 9: 1808–1814.
- 17. Grade M, Becker H, Liersch T, Ried T, Ghadimi BM (2006) Molecular cytogenetics: genomic imbalances in colorectal cancer and their clinical impact. Cell Oncol 28: 71–84.
- 18. Grade M, Gaedcke J, Wangsa D, Varma S, Beckmann J, et al. (2009) Chromosomal copy number changes of locally advanced rectal cancers treated with preoperative chemoradiotherapy. Cancer Genet Cytogenet 193: 19–28.
- 19. Knosel T, Schluns K, Stein U, Schwabe H, Schlag PM, et al. (2003) Genetic imbalances with impact on survival in colorectal cancer patients. Histopathology 43: 323–331.
- 20. Reid JF, Gariboldi M, Sokolova V, Capobianco P, Lampis A, et al. (2009) Integrative approach for prioritizing cancer genes in sporadic colon cancer. Genes Chromosomes Cancer 48: 953–962.
- 21. Ried T, Knutzen R, Steinbeck R, Blegen H, Schrock E, et al. (1996) Comparative genomic hybridization reveals a specific pattern of chromosomal gains and losses during the genesis of colorectal tumors. Genes Chromosomes Cancer 15: 234–245.
- 22. Sandberg AA, Meloni-Ehrig AM (2010) Cytogenetics and genetics of human cancer: methods and accomplishments. Cancer Genet Cytogenet 203: 102–126.
- 23. Andersen CL, Wiuf C, Kruhoffer M, Korsgaard M, Laurberg S, et al. (2007) Frequent occurrence of uniparental disomy in colorectal cancer. Carcinogenesis 28: 38–48.
- 24. Bardi G, Johansson B, Pandis N, Bak-Jensen E, Orndal C, et al. (1993) Cytogenetic aberrations in colorectal adenocarcinomas and their correlation with clinicopathologic features. Cancer 71: 306–314.
- 25. Delattre O, Olschwang S, Law DJ, Melot T, Remvikos Y, et al. (1989) Multiple genetic alterations in distal and proximal colorectal cancer. Lancet 2: 353–356.
- 26. Diep CB, Kleivi K, Ribeiro FR, Teixeira MR, Lindgjaerde OC, et al. (2006) The order of genetic events associated with colorectal cancer progression inferred from meta-analysis of copy number changes. Genes Chromosomes Cancer 45: 31–41.
- 27. Kazama Y, Watanabe T, Kanazawa T, Tada T, Tanaka J, et al. (2005) Mucinous carcinomas of the colon and rectum show higher rates of microsatellite instability and lower rates of chromosomal instability: a study matched for T classification and tumor location. Cancer 103: 2023–2029.
- 28. Lindblom A (2001) Different mechanisms in the tumorigenesis of proximal and distal colon cancers. Curr Opin Oncol 13: 63–69.
- 29. Lips EH, de Graaf EJ, Tollenaar RA, van Eijk R, Oosting J, et al. (2007) Single nucleotide polymorphism array analysis of chromosomal instability patterns discriminates rectal adenomas from carcinomas. J Pathol 212: 269–277.
- 30. Brenner BM, Rosenberg D (2010) High-throughput SNP/CGH approaches for the analysis of genomic instability in colorectal cancer. Mutat Res 693: 46–52.
- 31. Gaasenbeek M, Howarth K, Rowan AJ, Gorman PA, Jones A, et al. (2006) Combined array-comparative genomic hybridization and single-nucleotide polymorphism-loss of heterozygosity analysis reveals complex changes and multiple forms of chromosomal instability in colorectal cancers. Cancer Res 66: 3471–3479.
- 32. Lips EH, Dierssen JW, van Eijk R, Oosting J, Eilers PH, et al. (2005) Reliable high-throughput genotyping and loss-of-heterozygosity detection in formalin-fixed, paraffin-embedded tumors using single nucleotide polymorphism arrays. Cancer Res 65: 10188–10191.
- 33. Middeldorp A, van Eijk R, Oosting J, Forte GI, van Puijenbroek M, et al. (2011) Increased frequency of 20q gain and copy-neutral loss of heterozygosity in mismatch repair proficient familial colorectal carcinomas. Int J Cancer.
- 34. Middeldorp A, van Puijenbroek M, Nielsen M, Corver WE, Jordanova ES, et al. (2008) High frequency of copy-neutral LOH in MUTYH-associated polyposis carcinomas. J Pathol 216: 25–31.
- 35. Sheffer M, Bacolod MD, Zuk O, Giardina SF, Pincas H, et al. (2009) Association of survival and disease progression with chromosomal instability: a genomic exploration of colorectal cancer. Proc Natl Acad Sci U S A 106: 7131–7136.
- 36. Lapunzina P, Monk D (2011) The consequences of uniparental disomy and copy number neutral loss-of-heterozygosity during human development and cancer. Biol Cell 103: 303–317.
- 37. Gu G, Brothman AR (2011) Cytogenomic aberrations associated with prostate cancer. Cancer Genet 204: 57–67.
- 38. Jenkins RB, Qian J, Lieber MM, Bostwick DG (1997) Detection of c-myc oncogene amplification and chromosomal anomalies in metastatic prostatic carcinoma by fluorescence in situ hybridization. Cancer Res 57: 524–531.
- 39. Matsuyama H, Pan Y, Oba K, Yoshihiro S, Matsuda K, et al. (2001) Deletions on chromosome 8p22 may predict disease progression as well as pathological staging in prostate cancer. Clin Cancer Res 7: 3139–3143.
- 40. Pettus JA, Cowley BC, Maxwell T, Milash B, Stephenson RA, et al. (2004) Multiple abnormalities detected by dye reversal genomic microarrays in prostate cancer: a much greater sensitivity than conventional cytogenetics. Cancer Genet Cytogenet 154: 110–118.
- 41. Boelens MC, Kok K, van der Vlies P, van der Vries G, Sietsma H, et al. (2009) Genomic aberrations in squamous cell lung carcinoma related to lymph node or distant metastasis. Lung Cancer 66: 372–378.
- 42. Job B, Bernheim A, Beau-Faller M, Camilleri-Broet S, Girard P, et al. Genomic aberrations in lung adenocarcinoma in never smokers. PLoS One 5: e15145.
- 43. Nymark P, Wikman H, Ruosaari S, Hollmen J, Vanhala E, et al. (2006) Identification of specific gene copy number changes in asbestos-related lung cancer. Cancer Res 66: 5737–5743.
- 44. Moskaluk CA, Hu J, Perlman EJ (1998) Comparative genomic hybridization of esophageal and gastroesophageal adenocarcinomas shows consensus areas of DNA gain and loss. Genes Chromosomes Cancer 22: 305–311.
- 45. Peng DF, Sugihara H, Mukaisho K, Tsubosa Y, Hattori T (2003) Alterations of chromosomal copy number during progression of diffuse-type gastric carcinomas: metaphase- and array-based comparative genomic hybridization analyses of multiple samples from individual tumours. J Pathol 201: 439–450.
- 46. Uchida M, Tsukamoto Y, Uchida T, Ishikawa Y, Nagai T, et al. (2010) Genomic profiling of gastric carcinoma in situ and adenomas by array-based comparative genomic hybridization. J Pathol 221: 96–105.
- 47. Zhu F, Han B, Kumar P, Liu X, Ma X, et al. (2010) Update of TTD: Therapeutic Target Database. Nucleic Acids Res 38: D787–791.
- 48. Lothe RA, Peltomaki P, Meling GI, Aaltonen LA, Nystrom-Lahti M, et al. (1993) Genomic instability in colorectal cancer: relationship to clinicopathological variables and family history. Cancer Res 53: 5849–5852.
- 49. Lanza G, Messerini L, Gafa R, Risio M (2011) Colorectal tumors: the histology report. Dig Liver Dis 43: Suppl 4S344–355.
- 50. Kibriya MG, Raza M, Jasmine F, Roy S, Paul-Brutus R, et al. (2011) A genome-wide DNA methylation study in colorectal carcinoma. BMC Med Genomics 4: 50. doi:10.1186/1755-8794-1184-1150.
- 51. Chalmers IJ, Aubele M, Hartmann E, Braungart E, Werner M, et al. (2001) Mapping the chromosome 16 cadherin gene cluster to a minimal deleted region in ductal breast cancer. Cancer Genet Cytogenet 126: 39–44.
- 52. Giefing M, Zemke N, Brauze D, Kostrzewska-Poczekaj M, Luczak M, et al. (2011) High resolution ArrayCGH and expression profiling identifies PTPRD and PCDH17/PCH68 as tumor suppressor gene candidates in laryngeal squamous cell carcinoma. Genes Chromosomes Cancer 50: 154–166.
- 53. Sillars-Hardebol AH, Carvalho B, de Wit M, Postma C, Delis-van Diemen PM, et al. (2010) Identification of key genes for carcinogenic pathways associated with colorectal adenoma-to-carcinoma progression. Tumour Biol 31: 89–96.
- 54. Zhang C, Fang Z, Xiong Y, Li J, Liu L, et al. (2010) Copy number increase of aurora kinase A in colorectal cancers: a correlation with tumor progression. Acta Biochim Biophys Sin (Shanghai) 42: 834–838.
- 55. Janavicius R, Matiukaite D, Jakubauskas A, Griskevicius L (2010) Microsatellite instability detection by high-resolution melting analysis. Clin Chem 56: 1750–1757.
- 56. Bianchi F, Galizia E, Catalani R, Belvederesi L, Ferretti C, et al. (2009) CAT25 is a mononucleotide marker to identify HNPCC patients. J Mol Diagn 11: 248–252.
- 57. Downey T (2006) Analysis of a multifactor microarray study using Partek genomics solution. Methods Enzymol 411: 256–270.
- 58. Conlin LK, Thiel BD, Bonnemann CG, Medne L, Ernst LM, et al. (2010) Mechanisms of mosaicism, chimerism and uniparental disomy identified by single nucleotide polymorphism array analysis. Hum Mol Genet 19: 1263–1275.
- 59. Nancarrow DJ, Handoko HY, Stark MS, Whiteman DC, Hayward NK (2007) SiDCoN: a tool to aid scoring of DNA copy number changes in SNP chip data. PLoS One 2: e1093.
- 60. Chen X, Ji ZL, Chen YZ (2002) TTD: Therapeutic Target Database. Nucleic Acids Res 30: 412–415.
- 61. Huang da W, Sherman BT, Lempicki RA (2009) Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 4: 44–57.
- 62. Huang da W, Sherman BT, Lempicki RA (2009) Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res 37: 1–13.