
Association of Genes, Pathways, and Haplogroups of the Mitochondrial Genome with the Risk of Colorectal Cancer: The Multiethnic Cohort

  • Yuqing Li,

    Affiliations Cancer Prevention Institute of California, Fremont, California, United States of America; Stanford Cancer Institute, Palo Alto, California, United States of America

  • Kenneth B. Beckman,

    Affiliation University of Minnesota Genomics Center, Minneapolis, Minnesota, United States of America

  • Christian Caberto,

    Affiliation Epidemiology Program, University of Hawaii Cancer Center, University of Hawaii, Honolulu, Hawaii, United States of America

  • Remi Kazma,

    Affiliations Centre National de Génotypage, CEA, Evry, France; Biomarker Development, Novartis Institutes for BioMedical Research, Basel, Switzerland

  • Annette Lum-Jones,

    Affiliation Epidemiology Program, University of Hawaii Cancer Center, University of Hawaii, Honolulu, Hawaii, United States of America

  • Christopher A. Haiman,

    Affiliation Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America

  • Loïc Le Marchand,

    Affiliation Epidemiology Program, University of Hawaii Cancer Center, University of Hawaii, Honolulu, Hawaii, United States of America

  • Daniel O. Stram,

    Affiliation Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America

  • Richa Saxena,

    Affiliations Center for Human Genetic Research, Department of Anesthesia, Critical Care and Pain Medicine, Massachusetts General Hospital, Boston, Massachusetts, United States of America; Program of Medical and Population Genetics, The Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America

  • Iona Cheng

    Affiliations Cancer Prevention Institute of California, Fremont, California, United States of America; Stanford Cancer Institute, Palo Alto, California, United States of America


Abstract

The mitochondrial genome encodes 13 proteins that are essential for the oxidative phosphorylation (OXPHOS) system. Inherited variation in mitochondrial genes may influence cancer development by changing mitochondrial proteins, altering the OXPHOS process, and promoting the production of reactive oxygen species. To investigate the role of the OXPHOS pathway and mitochondrial genes in colorectal cancer (CRC) risk, we tested 185 mitochondrial SNPs (mtSNPs), located in 13 genes that comprise four complexes of the OXPHOS pathway and in mtSNP groupings for rRNA and tRNA, in 2,453 CRC cases and 11,930 controls from the Multiethnic Cohort Study. Using the sequence kernel association test, we examined the collective set of 185 mtSNPs, as well as subsets of mtSNPs grouped by mitochondrial pathways, complexes, and genes, adjusting for age, sex, principal components of global ancestry, and self-reported maternal race/ethnicity. We also tested for haplogroup associations using unconditional logistic regression, adjusting for the same covariates. Stratified analyses were conducted by self-reported maternal race/ethnicity. In European Americans, a global test of all genetic variants of the mitochondrial genome identified an association with CRC risk (P = 0.04). In mtSNP-subset analysis, the NADH dehydrogenase 2 (MT-ND2) gene in Complex I was associated with CRC risk at a P-value of 0.001 (q = 0.015). In addition, haplogroup T was associated with CRC risk (OR = 1.66, 95% CI: 1.19–2.33, P = 0.003). No significant mitochondrial pathway or gene associations were observed in the remaining four racial/ethnic groups (African Americans, Asian Americans, Latinos, and Native Hawaiians). In summary, our findings suggest that variation in the mitochondrial genome, and particularly in the MT-ND2 gene, may play a role in CRC risk among European Americans, but not in other maternal racial/ethnic groups. Further replication is warranted, and future studies should evaluate the contribution of mitochondrial proteins encoded by both the nuclear and mitochondrial genomes to CRC risk.


Introduction

Colorectal cancer (CRC) is the third most common cancer among men and women in the United States. In 2015, an estimated 220,800 new CRCs were diagnosed in the United States [1]. Approximately 35% of the risk of CRC is attributed to inherited factors [2]. Close to fifty risk loci for CRC have been identified by genome-wide association studies, which have focused on common variants of the nuclear genome [3–7]. However, these loci explain only a small proportion of the heritability of CRC, and additional heritable factors remain to be discovered.

Over 70 years ago, Otto Warburg reported an altered metabolism among cancer cells characterized by an increase in glucose uptake and glycolysis despite an adequate oxygen supply for mitochondrial respiration, a phenomenon referred to as ‘aerobic glycolysis’ [8]. Warburg hypothesized that this shift towards ‘aerobic glycolysis’ signified a deficiency in mitochondrial respiration, representing a fundamental cause of cancer [9]. This observation has now been confirmed in many types of cancer cells that exhibit elevated levels of glucose transport and increased rates of glycolysis, referred to as the Warburg effect [10, 11].

The mitochondrial genome is a double-stranded circular DNA molecule of 16,569 base pairs that is highly polymorphic and contains almost no intergenic regions [12, 13]. The proteins it encodes are essential for cellular metabolism and the regulation of cell death [14]. Thirty-seven genes are encoded by the mitochondrial DNA (mtDNA), of which 13 encode proteins involved in the oxidative phosphorylation (OXPHOS) machinery and 24 make up the RNA machinery (2 ribosomal RNAs and 22 transfer RNAs). The primary function of the mitochondrion is the production of the energy molecule, adenosine triphosphate (ATP), through the metabolic OXPHOS pathway.

Variations in mtDNA, including mitochondrial single nucleotide polymorphisms (mtSNPs), have the potential to modify mitochondrial function and lead to increased oxidative stress and cancer risk [15–17]. A Scottish study examined 132 mtSNPs in 2,854 cases and 2,822 controls and found no association with overall CRC risk [18]. To our knowledge, no study to date has comprehensively examined the relationship between mtDNA variants and CRC risk across different racial/ethnic populations. Furthermore, a pathway-based approach, which increases study efficiency for effects of modest size, may help to reveal associations between the mitochondrial genome and cancer risk.

Mitochondrial haplogroups are defined by unique sets of mtSNPs, reflecting specific ancestral populations as a result of the sequential accumulation of mitochondrial mutations through maternal lineages. Mitochondrial haplogroups have been associated with breast, prostate, and nasopharyngeal cancers [19–22]. Three studies have investigated the association between mitochondrial haplogroups and CRC risk in European and Asian populations with inconsistent results [18, 21, 23].

To comprehensively examine the role of the mitochondrial genome in CRC risk across multiple racial/ethnic groups, we genotyped a set of 185 mtSNPs and evaluated the association of genetic variation in the mitochondrial genome, its pathways and genes, as well as single mtSNPs and haplogroups, with CRC risk among 2,453 CRC cases and 11,930 controls of the Multiethnic Cohort (MEC) Study.

Materials and Methods

Study Subjects

The MEC is a large population-based cohort study of more than 215,000 men and women from Hawaii and California. The cohort is predominantly comprised of individuals from five racial/ethnic groups: African Americans, Asian Americans, European Americans, Latinos, and Native Hawaiians. Participants between the ages of 45 and 75 years were recruited from March 1993 through May 1996 and completed a 26-page self-administered questionnaire that included information regarding medical history, family history of cancer, diet, dietary supplements, medication use, and physical activity. Further details about this cohort are provided elsewhere [24].

Incident CRC cases were identified up to December 9, 2010 by cohort linkage to population-based Surveillance, Epidemiology and End Results (SEER) cancer registries covering Hawaii and California. Information on stage of disease at the time of diagnosis was also collected from the cancer registries. Blood samples were collected from incident colorectal, breast, and prostate cancer cases after their diagnosis, as well as a random sample of cohort members to serve as controls from 1996 through 2001, and prospectively from all willing surviving participants from 2002 through 2007. Informed consent was obtained at blood draw. Among the CRC cases used in this analysis, 70.4% had their blood drawn after diagnosis and 29.6% prior to diagnosis. Control subjects were men and women selected to serve as matched controls for nested case-control studies of colorectal, breast and prostate cancer. They were also selected to not have developed CRC before cohort entry or during follow-up as of December 9, 2010. This nested case-control study consisted of 2,453 CRC cases and 11,930 controls.

This study was approved by the institutional review board at the Cancer Prevention Institute of California.

mtSNP selection and genotyping

We abstracted mtSNP information from publicly deposited mtDNA sequencing data (PhyloTree mtDNA build 8, March 21, 2010) for 3,674 individuals comprising 599, 1,401, 1,118 and 556 subjects of African, European, Asian, and Latino ancestry, respectively. In addition, we sequenced the mtDNA of 160 Native Hawaiians using the Affymetrix resequencing array and identified 241 mtSNPs (MAF > 2%) in this population at a density of 1 mtSNP per 64 base pairs with an average call rate of 90.6% [25]. A total of 863 mtSNPs were selected, including 160 mtSNPs identified from the sequencing data, all missense mtSNPs (n = 230), and those previously associated with cancer (n = 37).

The genotyping of mtDNA was carried out in three phases using the Sequenom MassArray platform (Sequenom, San Diego). In phase I, quantitative allelotyping was performed on DNA pools from 75 samples to enable rapid and affordable screening of the entire list of 863 putative mtSNPs. Allelotyping provides a quantitative estimate of allelic frequency in a mixture of DNA [26]; the goal of phase I was to eliminate those mtSNPs with an undetectable minor allele frequency (MAF). A total of 240 mtSNPs were eliminated in phase I. In phase II, 619 of the remaining 623 mtSNPs were genotyped in a multiethnic panel of 376 subjects using the Sequenom iPLEX platform, providing robust MAFs of these mtSNPs across all five major ethnicities. Of the 619 mtSNPs genotyped, 186 mtSNPs had a MAF greater than 0.02. In phase III, these 186 mtSNPs were genotyped in our nested CRC case-control study of 2,498 cases and 12,070 controls, using the Sequenom iPLEX platform. A total of 185 mtSNPs passed our quality control criteria of a 95% call rate and a MAF > 0.001. Stratifying on reported maternal race/ethnicity, 175, 168, 165, and 102 mtSNPs had a MAF > 0.001 in African Americans, European Americans, Latinos, and Asian Americans, respectively; and 50 mtSNPs had a MAF > 0.005 in Native Hawaiians (using a less stringent threshold due to the smaller sample size) (S1 Table). A total of 2,453 CRC cases and 11,930 controls were successfully genotyped with a call rate > 95%. The average individual call rate was 99.6%, and the average concordance rate for the 8% of samples that were replicated was 99.7%.
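The phase III quality-control filter can be illustrated with a minimal Python sketch. It assumes haploid calls coded as 0 (major allele), 1 (minor allele), or None (no call), and it ignores heteroplasmy. The function names, the toy data, and the choice to apply the call-rate threshold as >= and the MAF threshold as > are illustrative assumptions, not the authors' pipeline.

```python
from typing import Optional, Sequence

Call = Optional[int]  # 0 = major allele, 1 = minor allele, None = no call


def call_rate(calls: Sequence[Call]) -> float:
    """Fraction of samples with a non-missing genotype call."""
    return sum(c is not None for c in calls) / len(calls)


def minor_allele_freq(calls: Sequence[Call]) -> float:
    """MAF among called samples; mtDNA is treated as haploid here."""
    called = [c for c in calls if c is not None]
    if not called:
        return 0.0
    freq = sum(called) / len(called)
    return min(freq, 1.0 - freq)


def passes_qc(calls: Sequence[Call],
              min_call_rate: float = 0.95,
              min_maf: float = 0.001) -> bool:
    """Apply the call-rate and MAF thresholds quoted in the text."""
    return call_rate(calls) >= min_call_rate and minor_allele_freq(calls) > min_maf


# Toy data (hypothetical): 1,000 samples per mtSNP.
snps = {
    "snp_ok": [1] * 100 + [0] * 890 + [None] * 10,         # call rate 0.99
    "snp_low_call": [1] * 100 + [0] * 800 + [None] * 100,  # call rate 0.90
    "snp_monomorphic": [0] * 1000,                         # MAF 0
}
kept = [name for name, calls in snps.items() if passes_qc(calls)]
print(kept)
```

Only the first toy mtSNP survives: the second fails the call-rate threshold and the third has no minor allele. For the Native Hawaiian stratum, the same function could be called with `min_maf=0.005`.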

Statistical analysis

To evaluate the cumulative effect of all mtDNA variants and of variants in the OXPHOS pathway, complexes, and genes, we used the sequence kernel association test (SKAT_commonrare) [27–29]. The SKAT_commonrare test is an omnibus procedure allowing for both rare and common variants to contribute to the overall test statistic [29]. To estimate haplogroups, we used the HaploGrep software, based on Phylotree build 16 [30, 31], and categorized individuals based on the major haplogroups. We conducted unconditional logistic regression to examine the association between major haplogroups and CRC risk, using the most common haplogroup as the reference category. To test for single mtSNP associations with CRC risk, we also conducted unconditional logistic regression, estimating p-values using a 1-degree-of-freedom Wald test. The overall analysis was adjusted for age, sex, self-reported maternal race/ethnicity, and the first five principal components of global ancestry. Principal components of genetic ancestry were estimated from genotype data for a panel of 128 ancestry informative markers genotyped in the MEC [32, 33]. Previous work in the Multiethnic Cohort has shown that modest population stratification within simulated nested case-control studies was readily corrected for by adjusting for race/ethnicity or the top principal components of ancestry [34]. Additional adjustment for family history of colorectal cancer, dietary intakes of fiber, calcium, folate, and alcohol, vigorous physical activity, and smoking did not notably alter results. Thus, these covariates were not included in our final multivariate models. We also tested all of these associations stratifying on self-reported maternal race/ethnicity and anatomical subsite. All statistical tests presented are two-sided. A false discovery rate (FDR) was estimated to address p-value inflation due to multiple hypothesis testing, and a q value < 0.1 was used to determine statistical significance.
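The FDR step can be sketched as follows. The text does not state which FDR procedure was used, so the Benjamini-Hochberg adjustment here is an assumption. The first four p-values are the gene-level values reported for European Americans; the remaining eleven (one per remaining gene or RNA grouping, for 15 sets in total) are hypothetical placeholders, so only the smallest q value is meaningfully comparable with the reported q = 0.015.

```python
def bh_qvalues(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    q = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        q[i] = running_min
    return q


reported = [0.001, 0.015, 0.027, 0.036]  # MT-ND2, MT-ND4, MT-CYB, MT-ATP8
placeholders = [0.5] * 11                # hypothetical values for the other sets
q = bh_qvalues(reported + placeholders)
significant = [i for i, qi in enumerate(q) if qi < 0.1]
print(q[0], significant)
```

Under these assumptions only the first set falls below the q < 0.1 threshold, and its q value equals 0.001 × 15 = 0.015 whatever the placeholder values are, as long as they exceed 0.05.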

Single mtSNP analyses were done using PLINK software (version 1.9). mtSNP-set-based analyses were done using the SKAT package in R (version 3.0.3). The mitochondrial solar plot (Fig 1) was drawn using the ggplot2 package in R.

Fig 1. Mitochondrial solar plot for European Americans.

From outside to inside, the three grey circles correspond to P values of 10⁻³, 10⁻², and 10⁻¹. The teal circle represents a P value of 0.05. Each dot represents the association of an mtSNP with CRC, color coded by mitochondrial gene. The size of each dot represents the correlation (r²) between mt4917 and other mtSNPs among European Americans.
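The geometry of such a solar plot can be sketched in Python. The mapping below is an assumption inferred from the caption: the angle is proportional to the position on the 16,569-bp circular genome and the radius is −log10(P), so smaller P values lie further out. Starting at 12 o'clock and running clockwise is an arbitrary illustrative choice, and the function name is hypothetical.

```python
import math

MTDNA_LENGTH = 16569  # base pairs, as given in the text


def solar_coordinates(position, p_value):
    """Map an mtSNP position and p-value to (x, y, radius) on a solar plot."""
    theta = 2.0 * math.pi * (position % MTDNA_LENGTH) / MTDNA_LENGTH
    radius = -math.log10(p_value)
    # Clockwise from the top of the circle (an illustrative convention).
    x = radius * math.sin(theta)
    y = radius * math.cos(theta)
    return x, y, radius


# mt4917 with the European American p-value reported in the text.
x, y, r = solar_coordinates(4917, 0.004)
print(round(x, 3), round(y, 3), round(r, 3))
```

With this mapping the grey reference circles at P = 10⁻¹, 10⁻², and 10⁻³ have radii 1, 2, and 3.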


Results

Study Characteristics

Study characteristics of the 14,383 study subjects (2,453 CRC cases; 11,930 controls) are presented in Table 1. CRC cases were older and had a higher proportion of males than controls. The distribution of self-reported maternal race/ethnicity included Asian Americans (28.69%), African Americans (24.35%), European Americans (21.42%), Latinos (20.45%), and Native Hawaiians (4.90%). Cases were more likely than controls to report a family history of colorectal cancer, a history of polyps, and a history of diabetes. Approximately 76% of cases occurred in the colon, and 50% of cases were localized stage.

Table 1. Study characteristics of 2,453 colorectal cancer cases and 11,930 controls.

Mitochondrial Genome, Pathway, and Gene Associations

A global test of all 168 mtSNPs in the mitochondrial genome (MAF > 0.001) showed a significant association with CRC risk in self-reported maternal European Americans (P = 0.04; Table 2), while no associations were seen in other maternal racial/ethnic groups or in the whole sample (S2 Table). For European Americans, when restricting the mtSNP set to the OXPHOS pathway, comprising 133 mtSNPs, the association with CRC risk had a P = 0.029 (q = 0.054; Table 2). Within the OXPHOS pathway, Complex I (80 mtSNPs; P = 0.025; q = 0.081) and Complex III (15 mtSNPs; P = 0.027; q = 0.081) were associated with CRC risk. To further investigate the Complex I association, we analyzed missense and non-missense mtSNPs separately. Both missense (22 mtSNPs, P = 0.024) and non-missense mtSNPs (58 mtSNPs, P = 0.04) in Complex I were associated with CRC risk at P < 0.05 (S3 Table).
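To make the idea of a set-level test concrete, the sketch below computes a weighted linear-kernel score statistic of the kind that underlies SKAT, Q = Σ_j w_j S_j², where S_j sums the genotype at mtSNP j times the phenotype residual. It is a deliberately simplified illustration and not the SKAT_commonrare procedure used in the study: it has no covariate adjustment, uses flat weights rather than separate rare/common weighting, and replaces the analytic p-value with a label permutation. All names and data are hypothetical.

```python
import random


def kernel_score_statistic(genotypes, phenotype, weights=None):
    """Q = sum_j w_j * S_j**2, with S_j = sum_i G_ij * (y_i - mean(y))."""
    n_subjects = len(phenotype)
    n_snps = len(genotypes[0])
    if weights is None:
        weights = [1.0] * n_snps  # flat weights: an illustrative choice
    y_mean = sum(phenotype) / n_subjects
    residuals = [y - y_mean for y in phenotype]
    q = 0.0
    for j in range(n_snps):
        score_j = sum(genotypes[i][j] * residuals[i] for i in range(n_subjects))
        q += weights[j] * score_j ** 2
    return q


def permutation_pvalue(genotypes, phenotype, n_perm=999, seed=0):
    """Empirical p-value from shuffling case/control labels (no covariates)."""
    rng = random.Random(seed)
    observed = kernel_score_statistic(genotypes, phenotype)
    shuffled = list(phenotype)
    at_least_as_large = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if kernel_score_statistic(genotypes, shuffled) >= observed:
            at_least_as_large += 1
    return observed, (at_least_as_large + 1) / (n_perm + 1)


# Hand-checkable example: 4 subjects, 2 haploid mtSNPs (0/1 allele codes).
tiny_genotypes = [[1, 0], [0, 1], [0, 0], [1, 1]]
tiny_phenotype = [1, 0, 0, 1]  # 1 = case, 0 = control
q_tiny = kernel_score_statistic(tiny_genotypes, tiny_phenotype)  # S = (1.0, 0.0)

# Simulated toy set (hypothetical data): 200 subjects, 5 mtSNPs.
sim = random.Random(1)
genotypes = [[1 if sim.random() < 0.1 else 0 for _ in range(5)] for _ in range(200)]
phenotype = [1 if sim.random() < (0.5 if g[0] else 0.2) else 0 for g in genotypes]
q_obs, p_perm = permutation_pvalue(genotypes, phenotype)
print(q_tiny, q_obs, p_perm)
```

Because the per-SNP scores are squared before summing, variants with effects in opposite directions do not cancel, which is one reason a kernel test can be preferable to a simple burden count for a gene or pathway.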

Table 2. Association between mitochondrial genome, pathways, genes and CRC risk in European Americans.

Of the thirteen genes and the rRNA and tRNA subunits in the mitochondrial genome, four genes were associated with CRC at a P value < 0.05: mitochondrially encoded NADH dehydrogenase 2 (MT-ND2) (P = 1.0 × 10⁻³) and mitochondrially encoded NADH dehydrogenase 4 (MT-ND4) (P = 0.015) in Complex I; mitochondrially encoded cytochrome b (MT-CYB) (P = 0.027) in Complex III; and mitochondrially encoded ATP synthase 8 (MT-ATP8) (P = 0.036) in Complex V. The MT-ND2 gene remained significantly associated with CRC after correction for multiple testing (q = 0.015; Table 2). Both missense and non-missense mtSNPs in MT-ND2 were associated with CRC risk at P < 0.05 (2 missense mtSNPs, P = 0.008; 12 non-missense mtSNPs, P = 0.006; S3 Table). In a stratified analysis by anatomical subsite (S4 Table), a stronger association for MT-ND2 was seen in colon tumors (P = 7.0 × 10⁻⁴) and no association was seen in rectal tumors (P = 0.79).

mtSNP Associations

Overall, 14 of 185 mtSNPs were associated with CRC at P < 0.05 in the total study population (S5 Table). In stratified analysis by maternal race/ethnicity, 7 of 154 mtSNPs, 4 of 97 mtSNPs, 18 of 147 mtSNPs, and 22 of 156 mtSNPs were associated with CRC risk at P < 0.05 in African Americans, Asian Americans, European Americans, and Latinos, respectively (S5 Table). No mtSNP associations were observed in Native Hawaiians. Of the 14 mtSNPs associated with overall CRC risk, the most significant association was seen with the missense mtSNP mt4917, located in MT-ND2 (OR = 1.52; 95% CI: 1.16–2.01; P = 0.0029, q = 0.308). The frequency of the minor allele of mt4917 (G) varies substantially across the five maternal racial/ethnic groups. Specifically, mt4917 was common in European Americans (MAF = 0.10), rare in African Americans (MAF = 0.005) and Latinos (MAF = 0.006), and monomorphic in Asian Americans and Native Hawaiians. In European Americans, three mtSNPs in the MT-ND2 gene were nominally associated with CRC risk at P < 0.05 (Table 3). The strongest association was observed with mt4917 (OR = 1.55; 95% CI: 1.15–2.10; P = 0.004, q = 0.16). Fig 1 presents the mitochondrial solar plot (given the circular nature of mtDNA) of mtSNP associations with CRC risk among European Americans and the correlation between mt4917 and other mtSNPs. Seven other mtSNPs across the mitochondrial genome were highly correlated with mt4917 (r² > 0.75).
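As a consistency check on the reported estimates, the Wald statistic can be approximately recovered from an odds ratio and its 95% confidence interval. The sketch below assumes the interval is symmetric on the log-odds scale (estimate ± 1.96 standard errors) and that the P value comes from a two-sided normal test; the function name is illustrative and rounding in the published numbers limits the precision.

```python
import math


def wald_from_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Recover log-OR, standard error, z, and two-sided p from an OR and 95% CI."""
    beta = math.log(odds_ratio)
    se = (math.log(ci_high) - math.log(ci_low)) / (2.0 * z_crit)
    z = beta / se
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided standard normal tail
    return beta, se, z, p


# mt4917, overall sample and European Americans (values from the text).
_, _, z_all, p_all = wald_from_ci(1.52, 1.16, 2.01)
_, _, z_ea, p_ea = wald_from_ci(1.55, 1.15, 2.10)
print(round(z_all, 2), round(p_all, 4))
print(round(z_ea, 2), round(p_ea, 4))
```

Both recovered P values land close to the reported 0.0029 and 0.004, which suggests the published odds ratios, intervals, and P values are mutually consistent under these assumptions.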

Table 3. Association between mtSNPs in MT-ND2 and CRC risk by self-reported maternal race/ethnicity and overall.

Haplogroup Associations

Haplogroup T, which was common in European ancestry populations (occurring at a frequency of 9.6% in controls) and absent in the other racial/ethnic groups, was significantly associated with CRC risk in European Americans (OR = 1.66, 95% CI: 1.19–2.33, P = 0.003, Pcorrection = 0.015; Table 4). Haplogroup L was associated with CRC risk in Latinos (4.8% in controls; OR = 1.54, 95% CI: 1.02–2.31, P = 0.039, Pcorrection = 0.20). No clear associations with haplogroups were observed among the remaining racial/ethnic groups (S6 Table).

Table 4. Association between haplogroup and CRC risk in European Americans.


Discussion

In this study of 14,383 CRC cases and controls, we comprehensively examined the contribution of the mitochondrial genome to CRC risk. To our knowledge, this is the first study to systematically evaluate the mitochondrial genome and its pathways, gene sets, and haplogroups in relation to CRC across multiple maternal racial/ethnic groups. Pathway analyses suggested that the mitochondrial genome and the oxidative phosphorylation pathway play a role in CRC risk among European Americans. In addition, an association between the MT-ND2 gene and CRC risk was observed among European Americans, with a stronger association seen in colon tumors. Haplogroup T was found to be associated with CRC risk among European Americans independent of global ancestry.

Our analysis of the entire mitochondrial genome demonstrated evidence of an association with CRC risk in European Americans (P = 0.04), in which the OXPHOS pathway may play an important role (P = 0.029; q = 0.054). A byproduct of OXPHOS is the production of reactive oxygen species (ROS), which can generate free radicals and are involved in many cellular processes, including apoptosis, inflammation, and oxidative stress, that may contribute to aging, degenerative diseases, and cancer [15, 35]. Our gene-based analysis further suggested that MT-ND2, a member of the OXPHOS pathway that encodes a subunit of NADH dehydrogenase, is associated with CRC risk in European Americans (P = 0.001; q = 0.015). A recent study reported overexpression of MT-ND2 in CRC tumors vs. normal tissue, which was correlated with lower methylation of the mtDNA D-loop and also significantly associated with stage of disease [36]. These findings support a role for MT-ND2 in CRC development.

The distribution of haplogroups in the MEC was consistent with previously published data on U.S. population-based samples [37]. The frequency of haplogroup T among our European American control subjects (9.57%) is consistent with the Mitomap database (varying between 8% and 11% from West to East Europeans) [38] and with non-Hispanic Whites in the National Health and Nutrition Examination Surveys (NHANES) (9.6%) [37]. Two studies have reported no associations between mtDNA haplogroups and CRC risk among Chinese and Scottish populations [18, 21], while an association between haplogroup B4 and CRC risk was reported in a Korean population [23]. We identified an association between haplogroup T and CRC risk in European Americans independent of global ancestry. Haplogroup T is defined by nine polymorphisms [30, 39], including five RNA variants (G709A, G1888A, T8697A, T10463C, G15928A), three synonymous polymorphisms (G13368A, G14905A, A15607G), and one non-synonymous polymorphism (A4917G). The mtSNP A4917G is the diagnostic mtSNP for haplogroup T and a highly conserved polymorphism in the MT-ND2 gene [22, 30, 39]. The lack of an association with haplogroup T in the Scottish study [18] may be due to the use of different mtSNPs to define haplogroup T (T4217, G10399A and A12309G). Ruiz-Pesini et al. [22] hypothesized that mt4917 has been retained by adaptive selection and played an important role in human migration out of Africa into colder climates, with only the MT-ND2 lineage retained in haplogroup T due to selection pressures. This may explain the higher frequency of mt4917 in European Americans and its relative absence in African Americans, Latinos, Asian Americans, and Native Hawaiians.

The Scottish study found no association between 132 mtSNPs and overall CRC risk, yet suggested that the variant A5657G in tRNA (MAF = 0.01) was associated with colon tumors (P = 0.002) [18]. While we did not genotype this mtSNP, which is located close to the MT-ND2 gene (145 base pair distance), we did observe an association between the MT-ND2 gene and colon tumors (P = 7.0 × 10⁻⁴), which may support the reported association. Given the wide spectrum of rare, low-frequency, and common risk alleles in the mitochondrial genome, our study is strengthened by using the SKAT common/rare approach to collectively test multiple risk alleles that may have modest effects [27, 28]. This approach has improved power compared to single SNP tests in the presence of correlation between SNPs and overcomes the limitation of previous methods that upweight rare variants [29, 40]. Using this approach, we were able to capture the role of the MT-ND2 gene in CRC risk. In addition, our study strengths include the investigation of multiple racial/ethnic populations, the examination of mtSNPs based on sequencing data for all five populations, and a comprehensive evaluation of the mitochondrial genome and CRC risk. Limitations of this study include the modest sample size for each population to detect weak genetic effects, particularly among Native Hawaiians. For a mtSNP with MAF = 0.10 and alpha = 0.05, our study has 80% power to detect a minimum OR of 1.40 in African Americans, 1.32 in Japanese Americans, 1.38 in Latinos, 1.40 in European Americans, and 1.80 in Native Hawaiians. In addition, there is a possibility of false positive results given the number of hypotheses tested, as our findings do not meet a stringent Bonferroni correction.

In summary, our study suggests that variation in the mitochondrial genome may play a role in CRC risk among European Americans. The associations of genetic variants in MT-ND2 and of haplogroup T with CRC risk warrant replication in other European American populations. Future studies should examine the expression of MT-ND2 in colorectal tumors and test mitochondrial genes encoded by both the nuclear and mitochondrial genomes to fully examine their contribution to CRC risk.

Supporting Information

S1 Table. 185 mtSNPs tested in 2,453 colorectal cancer cases and 11,930 controls.


S2 Table. Association between mitochondrial genome, pathways, genes and CRC risk by self-reported maternal race/ethnicity and overall.


S3 Table. mtDNA complex and gene based models of CRC risk in European Americans.


S4 Table. Association between mitochondrial genome, pathways, genes and colon and rectal cancer in European Americans.


S5 Table. Association between 185 mtSNPs and CRC risk overall and by self-reported maternal race/ethnicity.


S6 Table. Association between mitochondrial haplogroups and colorectal cancer risk by self-reported maternal race/ethnicity and overall.



Acknowledgments

We thank Dr. Vanessa Oliveira for providing the Perl script for mtDNA haplogroup calling and Dr. Yonghu Sun for providing the adapted R script for the mitochondrial solar plot. We also thank the participants of the Multiethnic Cohort, who have contributed to a better understanding of the lifestyle and genetic contributions to colorectal cancer.

Web resources

The URLs for data presented herein are as follows:

PLINK (version v1.07) was developed by Shaun Purcell at the Center for Human Genetic Research (CHGR), Massachusetts General Hospital (MGH), and the Broad Institute of Harvard & MIT.

R (version 3.0.3): R Core Team (2014). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria.

ggplot2, an R package, was used for the mitochondrial solar plot.

HaploGrep (version 2.0 Beta) is a collaboration between the Division of Genetic Epidemiology at the Medical University Innsbruck and the Department of Database and Information Systems—Institute of Computer Science at the University of Innsbruck.

MITOMAP: A Human Mitochondrial Genome Database, 2008.

Phylotree (Build 16): van Oven M, Kayser M. 2009. Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation.

Author Contributions

Conceived and designed the experiments: IC KBB DOS RS. Performed the experiments: KBB ALJ. Analyzed the data: YQL CC RK DOS. Contributed reagents/materials/analysis tools: RS. Wrote the paper: YQL IC. Revised the manuscript: YQL KBB CC RK ALJ CAH LLM DOS RS IC. Proofreading: YQL IC.


References

  1. American Cancer Society. Cancer Facts & Figures 2015. Atlanta: American Cancer Society; 2015.
  2. Lichtenstein P, Holm NV, Verkasalo PK, Iliadou A, Kaprio J, Koskenvuo M, et al. Environmental and heritable factors in the causation of cancer—analyses of cohorts of twins from Sweden, Denmark, and Finland. N Engl J Med. 2000 Jul 13;343(2):78–85. pmid:10891514
  3. Broderick P, Carvajal-Carmona L, Pittman AM, Webb E, Howarth K, Rowan A, et al. A genome-wide association study shows that common alleles of SMAD7 influence colorectal cancer risk. Nat Genet. 2007 Nov;39(11):1315–7. pmid:17934461
  4. Zanke BW, Greenwood CM, Rangrej J, Kustra R, Tenesa A, Farrington SM, et al. Genome-wide association scan identifies a colorectal cancer susceptibility locus on chromosome 8q24. Nat Genet. 2007 Aug;39(8):989–94. pmid:17618283
  5. Tomlinson I, Webb E, Carvajal-Carmona L, Broderick P, Kemp Z, Spain S, et al. A genome-wide association scan of tag SNPs identifies a susceptibility variant for colorectal cancer at 8q24.21. Nat Genet. 2007 Aug;39(8):984–8. pmid:17618284
  6. Tomlinson IP, Webb E, Carvajal-Carmona L, Broderick P, Howarth K, Pittman AM, et al. A genome-wide association study identifies colorectal cancer susceptibility loci on chromosomes 10p14 and 8q23.3. Nat Genet. 2008 May;40(5):623–30. pmid:18372905
  7. Jia WH, Zhang B, Matsuo K, Shin A, Xiang YB, Jee SH, et al. Genome-wide association analyses in East Asians identify new susceptibility loci for colorectal cancer. Nat Genet. 2013 Feb;45(2):191–6. pmid:23263487
  8. Warburg O, Wind F., Neglers E (Ed.). Metabolism of Tumors. Constable and Co. 1930;London:254–70.
  9. Warburg O. On respiratory impairment in cancer cells. Science. 1956 Aug 10;124(3215):269–70. pmid:13351639
  10. Pedersen PL. Tumor mitochondria and the bioenergetics of cancer cells. Prog Exp Tumor Res. 1978;22:190–274. pmid:149996
  11. Simonnet H, Alazard N, Pfeiffer K, Gallou C, Beroud C, Demont J, et al. Low mitochondrial respiratory chain content correlates with tumor aggressiveness in renal cell carcinoma. Carcinogenesis. 2002 May;23(5):759–68. pmid:12016148
  12. Crimi M, Sciacco M, Galbiati S, Bordoni A, Malferrari G, Del Bo R, et al. A collection of 33 novel human mtDNA homoplasmic variants. Hum Mutat. 2002 Nov;20(5):409.
  13. Nicholls TJ, Minczuk M. In D-loop: 40 years of mitochondrial 7S DNA. Exp Gerontol. 2014 Aug;56:175–81. pmid:24709344
  14. Carew JS, Huang P. Mitochondrial defects in cancer. Mol Cancer. 2002 Dec 9;1:9. pmid:12513701
  15. Hervouet E, Simonnet H, Godinot C. Mitochondria and reactive oxygen species in renal cancer. Biochimie. 2007 Sep;89(9):1080–8. pmid:17466430
  16. Lee HC, Yin PH, Lin JC, Wu CC, Chen CY, Wu CW, et al. Mitochondrial genome instability and mtDNA depletion in human cancers. Ann N Y Acad Sci. 2005 May;1042:109–22. pmid:15965052
  17. Thyagarajan B, Wang R, Barcelo H, Koh WP, Yuan JM. Mitochondrial copy number is associated with colorectal cancer risk. Cancer Epidemiol Biomarkers Prev. 2012 Sep;21(9):1574–81. pmid:22787200
  18. Webb E, Broderick P, Chandler I, Lubbe S, Penegar S, Tomlinson IP, et al. Comprehensive analysis of common mitochondrial DNA variants and colorectal cancer risk. Br J Cancer. 2008 Dec 16;99(12):2088–93. pmid:19050702
  19. Cano D, Gomez CF, Ospina N, Cajigas JA, Groot H, Andrade RE, et al. Mitochondrial DNA haplogroups and susceptibility to prostate cancer in a Colombian population. ISRN Oncol. 2014;2014:530675. pmid:24616820
  20. Hu SP, Du JP, Li DR, Yao YG. Mitochondrial DNA haplogroup confers genetic susceptibility to nasopharyngeal carcinoma in Chaoshanese from Guangdong, China. PLoS One. 2014;9(1):e87795. pmid:24498198
  21. Fang H, Shen L, Chen T, He J, Ding Z, Wei J, et al. Cancer type-specific modulation of mitochondrial haplogroups in breast, colorectal and thyroid cancer. BMC Cancer. 2010;10:421. pmid:20704735
  22. Ruiz-Pesini E, Mishmar D, Brandon M, Procaccio V, Wallace DC. Effects of purifying and adaptive selection on regional variation in human mtDNA. Science. 2004 Jan 9;303(5655):223–6. pmid:14716012
  23. Lim SW, Kim HR, Kim HY, Huh JW, Kim YJ, Shin JH, et al. High-frequency minisatellite instability of the mitochondrial genome in colorectal cancer tissue associated with clinicopathological values. Int J Cancer. 2012 Sep 15;131(6):1332–41. pmid:22120612
  24. Kolonel LN, Henderson BE, Hankin JH, Nomura AM, Wilkens LR, Pike MC, et al. A multiethnic cohort in Hawaii and Los Angeles: baseline characteristics. Am J Epidemiol. 2000 Feb 15;151(4):346–57. pmid:10695593
  25. Kim SK, Gignoux CR, Wall JD, Lum-Jones A, Wang H, Haiman CA, et al. Population genetic structure and origins of Native Hawaiians in the multiethnic cohort study. PLoS One. 2012;7(11):e47881. pmid:23144833
  26. Bansal A, van den Boom D, Kammerer S, Honisch C, Adam G, Cantor CR, et al. Association testing by DNA pooling: an effective initial screen. Proc Natl Acad Sci U S A. 2002 Dec 24;99(26):16871–4. pmid:12475937
  27. Wu MC, Lee S, Cai T, Li Y, Boehnke M, Lin X. Rare-variant association testing for sequencing data with the sequence kernel association test. Am J Hum Genet. 2011 Jul 15;89(1):82–93. pmid:21737059
  28. Lee S, Wu MC, Lin X. Optimal tests for rare variant effects in sequencing association studies. Biostatistics. 2012 Sep;13(4):762–75. pmid:22699862
  29. Ionita-Laza I, Lee S, Makarov V, Buxbaum JD, Lin X. Sequence kernel association tests for the combined effect of rare and common variants. Am J Hum Genet. 2013 Jun 6;92(6):841–53. pmid:23684009
  30. van Oven M, Kayser M. Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Hum Mutat. 2009 Feb;30(2):E386–94. pmid:18853457
  31. Kloss-Brandstatter A, Pacher D, Schonherr S, Weissensteiner H, Binna R, Specht G, et al. HaploGrep: a fast and reliable algorithm for automatic classification of mitochondrial DNA haplogroups. Hum Mutat. 2011 Jan;32(1):25–32. pmid:20960467
  32. Kosoy R, Nassir R, Tian C, White PA, Butler LM, Silva G, et al. Ancestry informative marker sets for determining continental origin and admixture proportions in common populations in America. Hum Mutat. 2009;30(1):69–78. pmid:18683858
  33. Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006 Aug;38(8):904–9. pmid:16862161
  34. Wang H, Haiman CA, Kolonel LN, Henderson BE, Wilkens LR, Le Marchand L, et al. Self-reported ethnicity, genetic structure and the impact of population stratification in a multiethnic study. Hum Genet. 2010 Aug;128(2):165–77. pmid:20499252
  35. Kujoth GC, Prolla TA. Evolving insight into the role of mitochondrial DNA mutations in aging. Exp Gerontol. 2008 Jan;43(1):20–3. pmid:18054193
  36. Feng S, Xiong L, Ji Z, Cheng W, Yang H. Correlation between increased ND2 expression and demethylated displacement loop of mtDNA in colorectal cancer. Mol Med Rep. 2012 Jul;6(1):125–30. pmid:22505229
  37. Mitchell SL, Goodloe R, Brown-Gentry K, Pendergrass SA, Murdock DG, Crawford DC. Characterization of mitochondrial haplogroups in a large population-based sample from the United States. Hum Genet. 2014 Jul;133(7):861–8. pmid:24488180
  38. Ruiz-Pesini E, Lott MT, Procaccio V, Poole JC, Brandon MC, Mishmar D, et al. An enhanced MITOMAP with a global mtDNA mutational phylogeny. Nucleic Acids Res. 2007 Jan;35(Database issue):D823–8. pmid:17178747
  39. Herrnstadt C, Elson JL, Fahy E, Preston G, Turnbull DM, Anderson C, et al. Reduced-median-network analysis of complete mitochondrial DNA coding-region sequences for the major African, Asian, and European haplogroups. Am J Hum Genet. 2002 May;70(5):1152–71. pmid:11938495
  40. Wu MC, Kraft P, Epstein MP, Taylor DM, Chanock SJ, Hunter DJ, et al. Powerful SNP-set analysis for case-control genome-wide association studies. Am J Hum Genet. 2010 Jun 11;86(6):929–42. pmid:20560208