Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

High Fidelity Copy Number Analysis of Formalin-Fixed and Paraffin-Embedded Tissues Using Affymetrix Cytoscan HD Chip

  • Yan P. Yu,

    Affiliation Department of Pathology, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, United States of America

  • Amantha Michalopoulos,

    Affiliation Department of Pathology, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, United States of America

  • Ying Ding,

    Affiliation Department of Statistics, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, United States of America

  • George Tseng,

    Affiliation Department of Statistics, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, United States of America

  • Jian-Hua Luo

    Affiliation Department of Pathology, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, United States of America

High Fidelity Copy Number Analysis of Formalin-Fixed and Paraffin-Embedded Tissues Using Affymetrix Cytoscan HD Chip

  • Yan P. Yu, 
  • Amantha Michalopoulos, 
  • Ying Ding, 
  • George Tseng, 
  • Jian-Hua Luo


Detection of human genome copy number variation (CNV) is one of the most important analyses in diagnosing human malignancies. Genome CNV detection in formalin-fixed and paraffin-embedded (FFPE) tissues remains challenging due to suboptimal DNA quality and failure to use appropriate baseline controls for such tissues. Here, we report a modified method in analyzing CNV in FFPE tissues using microarray with Affymetrix Cytoscan HD chips. Gel purification was applied to select DNA with good quality and data of fresh frozen and FFPE tissues from healthy individuals were included as baseline controls in our data analysis. Our analysis showed a 91% overlap between CNV detection by microarray with FFPE tissues and chromosomal abnormality detection by karyotyping with fresh tissues on 8 cases of lymphoma samples. The CNV overlap between matched frozen and FFPE tissues reached 93.8%. When the analyses were restricted to regions containing genes, 87.1% concordance between FFPE and fresh frozen tissues was found. The analysis was further validated by Fluorescence In Situ Hybridization on these samples using probes specific for BRAF and CITED2. The results suggested that the modified method using Affymetrix Cytoscan HD chip gave rise to a significant improvement over most of the previous methods in terms of accuracy in detecting CNV in FFPE tissues. This FFPE microarray methodology may hold promise for broad application of CNV analysis on clinical samples.


Genome abnormalities are the hallmark of human malignancies [1]. These include chromosome deletion, amplification, translocation, inversion and isochromosome formation. Analysis of genome abnormality is critical in making diagnosis of human malignancies, congenital birth defects and a variety of inheritable diseases. Array comparative genome hybridization (aCGH) or Affymetrix SNP array has been frequently applied to clinical samples to examine loss of heterozygosity and to detect amplification or deletion of genome fragments in the chromosomes [2][8]. The current methodologies using aCGH or Affymetrix SNP6.0 require high quality genome DNA from fresh frozen tissues. However, most of the samples for pathological evaluation are formalin-fixed and paraffin-embedded (FFPE) tissue blocks. Suboptimal and high background results are obtained when tissues from FFPE tissue blocks are analyzed due to the fragmenting nature of genome DNA in FFPE tissues. The low quality of genome copy number analysis from FFPE tissues practically precludes the application of whole genome copy number variation (CNV) analyses in clinical setting. Thus, a new method that can reproducibly generate high quality CNV analysis from FFPE tissues is needed to make high throughput genome CNV analysis applicable to clinical setting.

Lymphoma is one of the human malignancies that are frequently associated with large number of structural genome abnormalities. The classification and treatment of lymphomas are based on their genotypes in the tumor cells. Thus, lymphoma is an ideal human malignancy to investigate whether FFPE tissue is suitable for CNV analysis using Affymetrix Cytoscan HD chip. In this report, we describe a method that is adapted to the genomic DNA extracted from FFPE tissues to prepare the DNA cocktail for Affymetrix Cytoscan HD analysis. To validate this method, frozen genome DNA samples from matched lymphoma cases were also analyzed on Affymetrix Cytoscan HD chips. The results showed close overlaps in CNV profiles between FFPE and matched frozen tissues.

Materials and Methods

Tissue samples

Fresh frozen tissues of eight cases of human malignant lymphomas, including 5 diffused large B cell lymphomas, 2 follicular lymphomas and 1 T cell non-Hodgkins lymphoma, were obtained from clinical services. These tissues were dissected to have at least 70% purity of tumor cells. The study was approved by University of Pittsburgh Medical Center Quality Insurance Committee and Institutional Review Board, and exempted from informed-consent. The matched FFPE tissues from the same patients were also frozen sectioned onto slides, fixed and dehydrated with 100% ethanol and similarly micro-dissected to obtain tumor cells. Karyotyping analyses were performed on all these cases to detect chromosome abnormalities.

Affymetrix CytoScan HD chip analysis of copy number variation of tumor cells

For macrodissected frozen tissues, DNA was extracted using QIAamp blood and tissue kit (Qiagen, Valencia, CA). Five hundred nanograms of genome DNA were digested with Nsp1 for 2 hours at 37°C. The digested DNA was purified and ligated with primer/adaptors at 16°C for 12–16 hours. Amplicons were generated by performing PCR using primers provided by the manufacturer (Affymetrix, CA) on the ligation products using the following program: 94°C for 3 min, then 35 cycles of 94°C 30 second, 60°C for 45 sec and 65°C for 1 minute. This was followed by extension at 68°C for 7 min. The PCR products were then purified and digested with DNAseI for 35 min at 37°C to fragment the amplified DNA. The fragmented DNA was then labeled with biotinylated nucleotides through terminal deoxynucleotide transferase for 4 hours at 37°C. Two hundred fifty micrograms of fragmented DNA were hybridized with a pre-equilibrated Affymetrix chip Cytoscan HD chip at 50°C for 18 hours. The procedures of washing and scanning of CytoscanHD chips followed the manuals provided by Affymetrix, Inc. Cel files were generated from AGCC software from Affymetrix, Inc. (Santa Clara, CA). For FFPE tissues, micro-dissected tumor cells were treated with xylene for 12 hours. DNA was then extracted using QiaAmp FFPE DNA extraction kit. Gel purification of DNA sizes ranging from 200 to 1000 bp was performed. Two hundred fifty nanograms of purified DNA was then digested with NSP1, and similarly processed as frozen tissues.

Statistical analysis

Sixteen cel files were analyzed with Genotyping console for quality control analysis. Samples with QC call above 80% were admitted into the analysis. To analyze CNV, cel files were imported into Chromosome Analysis Suite 1.2 (Affymetrix, Inc) to generate copy number from raw intensity. For frozen tissues, Cytoscan HD files from fresh frozen tissues of 380 healthy individuals provided by Affymetrix were used as a baseline. For FFPE tissues, Cytoscan HD files from FFPE tissue of 100 healthy individuals from Affymetrix were used. Deletions or amplifications of genomes were analyzed by first limiting to the regions with p-value less than 0.05/total number of regions detected, i.e. family-wise error rate (FWER) is controlled using Bonferroni's correction [9]. The selected regions were filtered by limiting to the regions with at least 25 markers and 500 kb. For genome fragment gain determination, a mean of >2.3 for autosomal chromosomes or >1.5 for sex chromosomes of male was required, while for genome fragment loss determination, a mean of <1.7 for autosomal chromosomes or <0.5 of sex chromosomes of male was required. Loss of heterozygosity was not analyzed due to lack of matched normal tissues.

Fluorescence In-situ Hybridization (FISH)

Tissue slides (5 microns) were placed in 2×SSC at 37°C for 30 min. Slides were then removed and dehydrated in 70% and 85% ethanol for 2 min each at room temperature, and air dried. The probes for CITED2 and BRAF FISH analysis were obtained from BACPAC Resource Center, Oakland, CA. The DNA from the selected clone was extracted using Nucleobond Ax kit (Macherey-Nagel, Easton, PA). The probe was prepared by combining 7 µl of biotin-labeled genomic sequence containing CITED2 or BRAF (150 Kb)/50% formamide with 1 µl of direct-labeled CEP7 spectrum green for BRAF or CEP6 for CITED2 (Vysis, Downers Grove, IL). The probe was denatured for 5 min at 75°C. Sections of formalin-fixed tissues were denatured in 70% formamide for 3 min, and dehydrated in 70%, 85%, and 100% ethanol for 2 min each at room temperature. The denatured probe was placed on the slide, cover-slipped, sealed with rubber cement, placed in a humidified chamber and hybridized overnight at 37°C. Coverslips were removed and the slides were washed in 2×SSC/0.3% NP-40 at 72°C for 2 min. Slides were then held in phosphate buffered saline (PBS) at room temperature in the dark for 2 min. The biotin label was visualized by conjugation with Avidin-spectrum orange (Zymed, San Francisco, CA), cover-slipping and incubating in a moist chamber in the dark at 37°C for 20 min. Slides were washed 3 times for 2 min each in fresh PBS. Slides were then air-dried in the dark and counterstained with DAPI. Analysis was performed using a Nikon Optiphot-2 and Quips Genetic Workstation equipped with Chroma Technology 83000 filter set with single band exitors for Texas Red/Rhodamine, FITC and DAPI (uv 360 nm). Only individual and well delineated cells with two hybridization signals were scored. Overlapping cells were excluded from the analysis. Fifty to 100 cells per sample were scored to obtain an average of signals.

Karyotyping analysis

Cytogenetic analysis was performed on cell cultures of lymphoma samples. The cell cultures were stimulated with phytohemagglutinin (PHA) for 72 hours, and harvested and treated briefly with a hypotonic solution. The cells were then fixed with Carnoy fixative. GTG-banding was carried out using standard protocols [10].


Affymetrix Cytoscan HD contains 2.6 million markers covering all RefSeq genes and 750K SNPs of human genome, and has >99% genotype accuracy. It has been widely applied to ascertain genome abnormalities of a variety of human diseases. However, it is rarely applied to FFPE tissues. To evaluate the usage of Cytoscan HD on clinical FFPE samples, eight cases of lymphoma FFPE tissues were micro-dissected and analyzed through Affymetrix Cytoscan HD chips. The frozen counterparts of these cases were similarly analyzed. Karyotype analyses were performed on all these cases for validation purpose. As shown in Table 1, karyotype abnormalities in 24 of 51 loci had complete matches with frozen tissue copy number analyses. Twenty-two loci from Karyotype analyses had significant overlapped regions detected by frozen tissue CNV analysis. Overall, this represents 90.2% (46/51) concordance between Karyotype and Cytoscan HD analyses. Five loci of Karyotype analyses, however, did not concord with CNV analysis from both frozen and FFPE tissues analyses. These differences may result from heterogeneity of the tumor samples that some genome abnormalities appear in only a fraction of tumor cells.

Most of the clinical samples are in the form of formalin-fixed and paraffin-embedded tissue blocks. The dependence of array analysis on fresh frozen tissues significantly limits its application in clinical setting. To investigate whether FFPE tissues are suitable for cytoscan HD analysis, DNA from the matched FFPE tissue samples was extracted. Similar Affymetrix Cytoscan HD analyses were performed. Cel files of FFPE tissues from 100 normal individuals were used as baseline to calculate the copy number of genome fragments of these FFPE samples. CNV was determined by p<0.05 with at least 25 markers and a minimal length of 500 Kb. As shown in Tables 12 and figure 1, the concordant rates based on CNV length between arrays from the matched FFPE and frozen tissues ranged from 82% to over 99%. Seven of these FFPE blocks are over 1 year old and one is 3 years. The average concordant rate for 1 year old samples is 94.1%, while the rate for the 3 year old sample is 92%. This suggests that the quality of the assays is stable, and is probably not adversely affected by the age of the fixed tissues for at least 1–3 years. When CNV calling was limited to regions that cover at least one gene, we found that 482 genome segments were either deleted or amplified in at least one of the 16 samples. Among these segments, 68 of 86 segments that were determined as deletion in FFPE samples matched the same callings from fresh frozen tissue counterparts, while 352 of 365 segments that were determined as amplification matched those from the fresh tissues (Table 3). The results confirmed a strong correlation between FFPE and fresh frozen tissues (87.1%, Pearson correlation coefficient = 0.72, p<2.2×10−16). Interestingly, when CNV results of FFPE were matched with Karyotype studies, the results were extremely similar to those between frozen CNV and Karyotype. There is no statistically significant difference in terms of accuracy. In fact, a slight improvement (25/51 complete match versus 24/51 frozen) was seen due to detection of X chromosome gain in a case that was missed in frozen tissue.

Figure 1. Histograms of matched FFPE and Fresh frozen samples on selected chromosomes.

Top panel: FFPE and frozen histograms of chromosome 16 of Case 1; Mid panel: FFPE and frozen histograms of chromosome 17 of Case 2; Lower panel: FFPE and frozen histograms of chromosome 9 of Case 4. Red bar denotes deletion. Blue bar denotes amplification.

Table 2. CNV Overlaps between Frozen tissues and matched formalin-fixed paraffin-embedded tissues.

Table 3. Correlation of CNV callings between FFPE and matched fresh frozen tissues by segment number.

To validate the CNV analysis at individual gene level, FISH assays were performed on all 8 cases of lymphoma using probes specific for BRAF and CITED2. Four samples of lymphoma were found to have gain of BRAF gene by CNV analyses in both frozen and FFPE tissues. Each of these 4 cases was found to have similar amplification of BRAF in the FISH assays, while the other 4 CNV neutral samples were found to have copy number near diploid condition in FISH. The concordant rate between the CNV calls from FFPE or frozen samples and the FISH results from the matched samples reaches 100% (8/8, Table 4 and figure 2). CITED2, a transcription modulator essential for glycolytic metabolism for adult hematopoietic stem cells and potential tumor suppressor [11], [12], was found deleted in 2 cases of lymphoma by CNV analysis in frozen tissues. On the other hand, CNV analysis in the matched FFPE tissues suggests loss of CITED2 in only one sample. FISH assays using a probe specific for CITED2 found loss of one copy of CITED2 gene in both cases where deletions were also indicated by Cytoscan HD chip analyses of frozen tissues (Table 4 and figure 2). However, 55% (117/212) of the counted cells in one of the cases (case 6) in the FISH assay do not contain deletion of CITED2. The large dilution by the diploid cells in this sample may result in a negative result in FFPE CNV analysis.

Figure 2. FISH validation of copy number variation detected by Cytoscan HD analysis.

(A) Representative images of FISH analysis using probes specific for BRAF (7q34, red) and centromere of chromosome 7 (green). Left, normal diploid control, right-case 4 lymphoma (BRAF amplified). (B) Representative images of FISH analysis using probes specific for CITED2 (6q23.3, red) and centromere of chromosome 6 (green). Left, normal diploid control, right-case 2 lymphoma (CITED2 deleted).

Despite the high level statistical CNV concordance between FFPE and frozen tissues, a moderately higher level of fluctuation was readily detected in FFPE array analysis (figure 1). The average of copy number for genome deletion for FFPE tissues is 1.44. This magnitude of deletion is significantly less than that from frozen tissues (1.29, p = 6.7×10−11). As a result, CNV analysis from FFPE tissues could be less sensitive. Despite this mild drawback, excellent CNV concordance between FFPE and frozen samples was evident in most of the CNV regions, particularly those CNV loci with large DNA fragment abnormalities (figure 1).


The current methodologies to detect chromosome abnormalities include Karyotyping using Giemsa staining of chromosomes, high throughput array analysis of single nucleotide polymorphism and DNA copy number, and whole genome sequencing. The first two are the most commonly used methods to determine the copy number of genome fragments, while the last one might be highly precise but expensive. Array genome copy number analysis offers a high resolution alternative to Karyotyping assay. However, its clinical application is limited due to the requirement of high quality DNA. Most clinical specimens, however, are stored in the form of formalin-fixed and paraffin-embedded tissue blocks. The DNA from FFPE is highly fragmented and cross-linked. This produces a significant challenge in using FFPE tissues for high resolution genotyping analysis. Our method using FFPE tissues to analyze chromosomes shows an average of concordance of 93.8% between FFPE and fresh frozen tissues. It suggests that FFPE Cytoscan HD analysis is readily applicable to clinical setting.

Studies using FFPE tissues to analyze copy number variation had been peviously attempted on Affymetrix 10K and SNP6.0 chips, GenomePlex aCGH, Illumina beadArray and Agilent 244K chips (Table 5) [13][19]. However, many of these studies lack direct validation using other methodologies. In one study, high noise level on CNV analysis was found in FFPE samples when using Affymetrix 6.0 chip. Only 53% concordant rate was found between the matched frozen and FFPE samples [15]. In another study, only selected concordance analyses were performed on selected regions of FISH and Agilent 244K chips using FFPE tissues [14]. The study concluded complementary roles of between aCGH and FISH analyses. Both analyses were performed on FFPE tissues. Recently, a FFPE OncoScan service was developed by Affymetrix Inc to use Molecular Inversion Probe technology to detect CNV of oncogenic hot spots [20], [21]. The signal-to-noise ratios were reduced even using FFPE tissues more than 5 years old. However, these studies lacked direct validation comparison between FFPE and matched frozen tissues. Thus, the fidelity of CNV callings was not determined. Another study using optimization of universal linkage system labeling to analyze 3 cases FFPE and matched frozen tissues. They found a good correlation in 2 cases (Pearson correlations 0.54–0.58), but found poor correlation in the other case [22]. To our knowledge, this is the first report that shows high concordance in CNV analysis between FFPE and frozen tissues using Affymetrix Cytoscan HD chip. The CNV results from FFPE not only matched well with those from frozen tissues, but were also largely validated by cytogenetic karyotyping and FISH analyses. The Cytoscan HD FFPE analysis holds promise for being widely used in solid tumor and hematological diseases diagnosis. It may be also useful in differential diagnoses of hereditary diseases.


We thank Song-Yang Zheng and Kathleen Cieply for technical support in the study. Sarah Gibson provided pathology support in identifying lymphoma regions. We also thank Steven Swedlow and George Michalopoulos for frank and constructive comments on the manuscript.

Author Contributions

Conceived and designed the experiments: JHL. Performed the experiments: YPY AM YD GT. Analyzed the data: YPY YD GT. Contributed reagents/materials/analysis tools: YD GT. Wrote the paper: JHL.


  1. 1. Barigozzi C, Cusmano L (1947) Chromosome numbers in cancer cells. Nature 159: 505.
  2. 2. Ren B, Yu G, Tseng GC, Cieply K, Gavel T, et al. (2006) MCM7 amplification and overexpression are associated with prostate cancer progression. Oncogene 25: 1090–1098.
  3. 3. Yu G, Tseng GC, Yu YP, Gavel T, Nelson J, et al. (2006) CSR1 suppresses tumor growth and metastasis of prostate cancer. American Journal of Pathology 168: 597–607.
  4. 4. Luo JH, Ren B, Keryanov S, Tseng GC, Rao UN, et al. (2006) Transcriptomic and genomic analysis of human hepatocellular carcinomas and hepatoblastomas. Hepatology (Baltimore, Md 44: 1012–1024.
  5. 5. Yu YP, Luo JH (2007) Pathological factors evaluating prostate cancer. Histology and histopathology 22: 1291–1300.
  6. 6. Liu W, Laitinen S, Khan S, Vihinen M, Kowalski J, et al. (2009) Copy number analysis indicates monoclonal origin of lethal metastatic prostate cancer. Nature medicine 15: 559–565.
  7. 7. Yu YP, Song C, Tseng G, Ren BG, Laframboise W, et al. (2012) Genome abnormalities precede prostate cancer and predict clinical relapse. The American journal of pathology 180: 2240–2248.
  8. 8. Nalesnik MA, Tseng G, Ding Y, Xiang GS, Zheng ZL, et al. (2012) Gene deletions and amplifications in human hepatocellular carcinomas: correlation with hepatocyte growth regulation. The American journal of pathology 180: 1495–1508.
  9. 9. Strassburger K, Bretz F (2008) Compatible simultaneous lower confidence bounds for the Holm procedure and other Bonferroni-based closed tests. Statistics in medicine 27: 4914–4927.
  10. 10. Hu J, Surti U (1991) Subgroups of uterine leiomyomas based on cytogenetic analysis. Human pathology 22: 1009–1016.
  11. 11. Du J, Li Q, Tang F, Puchowitz MA, Fujioka H, et al. Cited2 Is Required for the Maintenance of Glycolytic Metabolism in Adult Hematopoietic Stem Cells. Stem cells and development
  12. 12. Cheung KF, Zhao J, Hao Y, Li X, Lowe AW, et al. (2013) CITED2 is a novel direct effector of peroxisome proliferator-activated receptor gamma in suppressing hepatocellular carcinoma cell growth. Cancer 119: 1217–1226.
  13. 13. Thompson ER, Herbert SC, Forrest SM, Campbell IG (2005) Whole genome SNP arrays using DNA derived from formalin-fixed, paraffin-embedded ovarian tumor tissue. Human mutation 26: 384–389.
  14. 14. Wang L, Rao M, Fang Y, Hameed M, Viale A, et al. (2013) A Genome-Wide High-Resolution Array-CGH Analysis of Cutaneous Melanoma and Comparison of aCGH to FISH in Diagnostic Evaluation. J Mol Diagn
  15. 15. Tuefferd M, De Bondt A, Van Den Wyngaert I, Talloen W, Verbeke T, et al. (2008) Genome-wide copy number alterations detection in fresh frozen and matched FFPE samples using SNP 6.0 arrays. Genes, chromosomes & cancer 47: 957–964.
  16. 16. Little SE, Vuononvirta R, Reis-Filho JS, Natrajan R, Iravani M, et al. (2006) Array CGH using whole genome amplification of fresh-frozen and formalin-fixed, paraffin-embedded tumor DNA. Genomics 87: 298–306.
  17. 17. Nasri S, Anjomshoaa A, Song S, Guilford P, McNoe L, et al. (2010) Oligonucleotide array outperforms SNP array on formalin-fixed paraffin-embedded clinical samples. Cancer genetics and cytogenetics 198: 1–6.
  18. 18. Oosting J, Lips EH, van Eijk R, Eilers PH, Szuhai K, et al. (2007) High-resolution copy number analysis of paraffin-embedded archival tissue using SNP BeadArrays. Genome research 17: 368–376.
  19. 19. Lips EH, Dierssen JW, van Eijk R, Oosting J, Eilers PH, et al. (2005) Reliable high-throughput genotyping and loss-of-heterozygosity detection in formalin-fixed, paraffin-embedded tumors using single nucleotide polymorphism arrays. Cancer research 65: 10188–10191.
  20. 20. Krijgsman O, Israeli D, Haan JC, van Essen HF, Smeets SJ, et al. (2012) CGH arrays compared for DNA isolated from formalin-fixed, paraffin-embedded material. Genes, chromosomes & cancer 51: 344–352.
  21. 21. Wang Y, Cottman M, Schiffman JD (2012) Molecular inversion probes: a novel microarray technology and its application in cancer research. Cancer genetics 205: 341–355.
  22. 22. Salawu A, Ul-Hassan A, Hammond D, Fernando M, Reed M, et al. (2012) High quality genomic copy number data from archival formalin-fixed paraffin-embedded leiomyosarcoma: optimisation of universal linkage system labelling. PloS one 7: e50415.