Human ageing has been predicted to be caused by the accumulation of molecular damage in cells and tissues. Somatic mitochondrial DNA (mtDNA) mutations have been documented in a number of ageing tissues and have been shown to be associated with cellular mitochondrial dysfunction. It is unknown whether there are selective constraints, which have been shown to occur in the germline, on the occurrence and expansion of these mtDNA mutations within individual somatic cells. Here we compared the pattern and spectrum of mutations observed in ageing human colon to those observed in the general population (germline variants) and those associated with primary mtDNA disease. The pathogenicity of the protein encoding mutations was predicted using a computational programme, MutPred, and the scores obtained for the three groups compared. We show that the mutations associated with ageing are randomly distributed throughout the genome, are more frequently non-synonymous or frameshift mutations than the general population, and are significantly more pathogenic than population variants. Mutations associated with primary mtDNA disease were significantly more pathogenic than ageing or population mutations. These data provide little evidence for any selective constraints on the occurrence and expansion of mtDNA mutations in somatic cells of the human colon during human ageing in contrast to germline mutations seen in the general population.
Mitochondrial DNA encodes essential components of the mitochondrial respiratory chain and is strictly maternally inherited, making it vulnerable to the accumulation of deleterious mutations. To avoid this, mtDNA is subjected to a bottleneck phenomenon whereby only a small number of mtDNA molecules are passed on to the oocyte precursor. These are then amplified to the required number of mtDNA molecules in the mature oocyte, meaning that any mutations may be either lost or rapidly fixed. Purifying selection is thought to be an important protective mechanism against pathogenic mtDNA mutations in the germline, as this is essential for mtDNA stability. It is unknown whether there are any such protective mechanisms in the somatic tissues. To investigate this we have compared the spectrum of mutations present in ageing human colonocytes with those population variants passed through the maternal germline and mtDNA mutations responsible for primary mtDNA disease. We show that pathogenic mtDNA mutations are present at a significantly higher frequency in the somatic cells of the human colon in contrast to variants that have passed though the germline, showing little evidence for purifying selection in the somatic tissues studied here, but strong evidence of this selective mechanism in the germline.
Citation:Greaves LC, Elson JL, Nooteboom M, Grady JP, Taylor GA, et al. (2012) Comparison of Mitochondrial Mutation Spectra in Ageing Human Colonic Epithelium and Disease: Absence of Evidence for Purifying Selection in Somatic Mitochondrial DNA Point Mutations. PLoS Genet 8(11): e1003082. doi:10.1371/journal.pgen.1003082
Editor: Nils-Göran Larsson, Max Planck Institute for Biology of Aging, Germany
Received: July 9, 2012; Accepted: September 28, 2012; Published: November 15, 2012
Copyright: © 2012 Greaves et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding:This work was supported by the Newcastle University Centre for Brain Ageing and Vitality supported by BBSRC, EPSRC, ESRC, and MRC as part of the cross-Council Lifelong Health and Wellbeing Initiative (G0700718), the Food Standards Agency [N12015], and the Wellcome Trust Centre for Mitochondrial Research (906919). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Ageing is a stochastic process commonly defined as the progressive decline in the condition of an organism which is accompanied by a reduction in fertility and an increasing risk of death . Ageing is unlikely to be genetically programmed as evolution theory suggests that nature would not select for a process which is generally harmful to the viability of the organism . The disposable soma theory ,  suggests that there are ‘trade-offs’ between somatic cell maintenance, growth and reproduction. Under constant pressure of natural selection to make optimal use of resources, organisms can afford to make only limited investment in the maintenance and repair of somatic tissues. Metabolic processes associated with tissue maintenance are costly and there is no advantage in keeping somatic cells in good condition beyond the typical survival period in the wild, plus a little reserve. This contrasts with the requirement to protect the germline, where constant high maintenance is needed to maximise capacity for reproduction in the current generation, and to minimise the risk of transferring damage. Nowadays most humans survive long past the life expectancy of our distant ancestors, and experience ageing due to the gradual accumulation of unrepaired damage in somatic cells.
Damage to mitochondrial DNA (mtDNA) resulting in dysfunction of the oxidative phosphorylation (OXPHOS) system has been proposed to be an important contributor to the ageing phenotype . The mitochondrial genome is a circular, double-stranded, ~16.5 Kb molecule  encoding 13 essential polypeptides of the OXPHOS system in addition to 24 mt-RNA genes required for their translation. Its organisation is an example of extreme economy; the vast majority of the molecule is coding and there are no introns, so mutation of the mtDNA is likely to have a functional effect. However, given that there are multiple copies of mtDNA within individual cells, the vast majority of mutations are functionally recessive and a detectable OXPHOS defect occurs only when a critical threshold level of mutant mtDNA is exceeded . A commonly used assay to look for cells or tissues with mtDNA mutations is a histochemical test to demonstrate cytochrome c oxidase (COX) activity . Cells lacking functional COX activity commonly contain mtDNA mutations at high levels , , and whilst this assay may not pick up all mutations it is a good initial screen.
MtDNA is inherited in a non-Mendelian manner exclusively through the female line  and bi-parental recombination, if it has ever occurred, has left no footprint on the phylogeny . Asexual inheritance should leave the mitochondrial genome vulnerable to extinction due to the irreversible accumulation of mtDNA mutations, a process known as Muller's Ratchet . In the absence of recombination, the only means to prevent this outcome is a selection process against mutants which is strong enough to substantially abate the relatively high mutation rate experienced by mtDNA. In principle, selection against mtDNA mutations can act at several levels since cells typically contain large numbers of mtDNA molecules which have a replication cycle that is not strictly coupled with that of the cell itself . Selection against mtDNA mutations could therefore arise (i) through direct competition between mtDNA variants within the cell, (ii) through selection at the cell level, if mutants affect cellular survival and/or division, or (iii) through selection at the whole organism level, via effects on fitness of the individual.
A distinctive mechanism that is thought to enhance the power of selection to keep mtDNA mutations in check within the germline is the mitochondrial “bottleneck” . The bottleneck event effectively samples mtDNA molecules ensuring only a small proportion are selected to form the mtDNA pool  in the mature oocyte which is passed to the offspring. The mammalian oocyte contains over 100,000 mtDNA molecules which do not replicate during early embryogenesis . They segregate during cell division events and when the primordial germ cells are formed they contain only 200–400 mtDNA molecules . As primordial germ cells differentiate into mature oocytes which will transmit their mtDNA to the next generation , there is re-initiation of mtDNA replication of a small subset of mtDNA molecules . Currently it is a matter of intense debate as to whether this expansion occurs during embryogenesis or post-natal folliculargenesis , , but regardless of the exact timing, deleterious mutations will either be lost or they will be more strongly exposed to natural selection and, if not eliminated, become fixed more rapidly . Germline selection adds to the purifying selection of mtDNA that had already been shown to occur in human populations by comparison of different lineages , . Together the population studies and experimental studies in the mouse ,  demonstrate a multi-level selective process which can act to remove deleterious and slightly deleterious variants.
While the full repertoire of mechanisms to combat mtDNA mutation accumulation act within the germline, there is less of an imperative to prevent such accumulation within the soma, and there are fewer opportunities to do so. Somatic mtDNA mutations have been shown to accumulate to high levels in a number of human ageing tissues including brain, heart, skeletal muscle, colon, stomach and liver, leading to OXPHOS dysfunction , –. Thus, the accumulation of mtDNA mutations in the soma of ageing humans may contribute to the decline in cellular functionality commonly seen during ageing. In addition mouse models with increased levels of mitochondrial DNA (mtDNA) mutations – show a premature ageing phenotype suggesting that mitochondrial defects are important in the ageing process.
Although the accumulation of age-associated somatic mtDNA mutations is not subject to the full set of selective constraints that operate in the germline, it is possible that selection at the intracellular and intercellular levels could influence the spectrum of mutations that is seen. Experimental support for this hypothesis is limited and information on the pattern of mtDNA mutations which occur with age is likely to inform our understanding of the underlying mechanisms that might be involved. Previous studies have shown that there are differences in the type of mutations detected with ageing in post-mitotic and mitotic tissues. In post-mitotic tissues the predominant mutated species are large-scale mtDNA deletions –, whereas in dividing cells mtDNA point mutations are commonly identified , , , –. As large-scale mtDNA deletions are rarely transmitted through the germline we investigated possible selective constraints only on mtDNA point mutations which occur somatically during human ageing by analysing location, mutation type and predicted mutation pathogenicity using the human colon as a model ageing system. We have previously generated a number of mtDNA sequences from ageing colonocytes , ,  providing an excellent data set of age-related somatic mutations for investigation. We then compared the mutational spectra with germline mtDNA point mutations seen in a geographically-comparable human population and well-characterised pathogenic mtDNA point mutations which are documented causes of primary mtDNA disease phenotypes.
A dataset of age-related mtDNA mutations
In order to ensure that we had a high quality data set of age-related somatic mtDNA mutations to use in the subsequent analyses, 72 individual cells from 9 participants over 70 years of age were sequenced, and the presence of clonally-expanded mtDNA mutations was documented. These data were combined with those which we have published previously on human colon (age range 70–76 years) , , . In our previous studies cells were sequenced based on the absence or presence of COX activity. The majority of cells were COX deficient as these studies aimed to detect the underlying genetic defect in these cells. This strategy increases the likelihood of detecting pathogenic mtDNA mutations. Here we wanted to obtain a more complete picture of the mutational spectrum across the human colon including non-pathogenic mutations and those which do not affect COX activity, therefore we sequenced an additional 48 COX positive cells and 24 COX deficient cells. This gave a total of 156 whole mitochondrial mtDNA sequences (71 sequences are from COX positive cells and 85 from COX deficient cells , , ). These sequences yielded a collection of 129 clonally-expanded mtDNA point mutations which were found in 89 different cells. 67 of the cells in this dataset did not contain any clonally expanded mtDNA mutations. Heteroplasmy levels were estimated based on the ratio of peak heights on the electropherograms. A relatively high proportion of the mutations (39%) were present at homoplasmic levels, however we believe that this is due to the fact that crypts were selected based on their COX activity rather than any selective mechanism. COX deficient cells contained 88% of the homoplasmic mutations. This highlights the fact that the majority of mtDNA mutations require a high level of mutation to be reached before a biochemical defect is observed . Of the cells which contained mtDNA mutations, the majority (61) contained only one mutation, 20 contained 2 mutations, 5 contained 3 mutations and 3 contained 4 or more mutations. Where there were multiple mutations present only 5 cells contained mutations that were present at the same levels of heteroplasmy and in all cases this was 100%, the remaining cells had varying levels of heteroplasmy, indicating that each mutation occurred at different times or on different molecules. All mutations in the data set are listed in Table S1 along with their COX status. Only mutations in the coding region were included in the following analyses, as mutations in these regions are more likely to have deleterious effects. From here on the somatic mutations detected in the human colon will be referred to as ‘somatic’ or ‘age-related’ mutations.
Age-related mtDNA mutations occur randomly within the genome
To investigate the location of the changes on the mtDNA of age-associated mutations they were grouped by base pair number (per 2000 bases; 1–2000, 2001–4000 etc). Based on the assumption that all groups had an equal chance of containing a mutation, chi-squared analysis showed no significant difference in the observed and expected frequencies (p = 0.293) which supports the conclusion that age-associated mutations are located randomly throughout the mitochondrial genome.
To be confident that these mutations truly were random they were then grouped according to gene status (e.g. whether they occurred within mt-tRNA, mt-rRNA or protein encoding genes). Similarly, the difference between the observed and expected frequencies was not significant (p = 0.129) (chi-squared analysis comparing observed to expected frequencies based on the proportion of the genome occupied by each gene type). A comparison was then made with the disease (full mutation list in Table S2) and population variants (full list in Table S3) (Figure 1A). In the disease-associated mutation group, there was a significantly higher proportion of mutations in mt-tRNA genes (53%) compared to other gene types (p<0.0003). This has been reported previously in other studies , . In the rare population variants, the majority of mutations were in the protein encoding genes (85%) (p<0.003), probably because a high proportion of the changes in the protein encoding genes of the population predict synonymous amino acid changes which are non-pathogenic and are therefore unlikely to be selected against in the germline. The nature of changes (e.g. transitions, transversions and insertions/deletions) was then examined in the three groups. In all cases, the overwhelming majority of changes were transitions (90%, 94% and 87% in age-related mutations, population variants and disease mutations respectively). There was a significantly higher frequency of insertion and deletion mutations in the both the age-related mutation group compared to the population variants, and the disease mutation group compared to the population variants (p<0.003 in both cases, contingency analysis with Bonferroni correction for multiple testing, Figure 1B).
A: Gene location of mutations. Data are represented as the percentage of the total coding region mutations. Contingency analysis with Bonferroni correction for multiple testing was carried out on the frequencies of the changes in each gene type (ageing n = 117, population n = 182, disease n = 176). Thresholds for statistical significance are; ***<0.0003, ** 0.003, * = 0.017. B: Types of changes observed in ageing, population and disease. Data are represented as the percentage of the total coding region mutations. Contingency analysis with Bonferroni correction for multiple testing was carried out on the frequencies of the changes in each mutational category. Thresholds for statistical significance are; ***<0.0003, ** 0.003, * = 0.017.
Non-synonymous mtDNA mutations are present more frequently during ageing and mtDNA disease than in populations
Purifying selection models have shown that, in the germline, selection is strongest against mutations in protein-encoding genes which cause an amino acid change , those which predict a frameshift, or those which cause premature translation termination . A comparison of such mutations in the three data sets was made (Figure 2A). This revealed a significantly higher number of non-synonymous (65.4%) and frameshift/premature termination codons (16.5%) in the ageing cells compared with the rare population variant group (34% and 0.6% respectively) (contingency analysis with Bonferroni correction for multiple testing, p<0.0003). This suggests that human somatic cells are not protected from accumulation of age-related deleterious mutations in the same way as has been shown during germline transmission of mtDNA mutations in the mouse . Comparison of age-associated mutations and disease-causing mutations revealed a significantly higher number of frameshift/premature termination codons in the disease group (32%) (p<0.0003).
A: The percentage of changes which predict synonymous, non-synonymous and premature termination codons or frameshifts in protein encoding genes. Contingency analysis with Bonferroni correction for multiple testing was carried out on the frequencies of the changes in each gene type (ageing n = 81, population n = 155, disease n = 76). Thresholds for statistical significance are; ***<0.0003, ** 0.003, * = 0.017. B: Frequency of termination codon and frameshift mutations in inherited and sporadic disease-causing mtDNA mutations. Chi-squared analysis showed a significantly higher frequency such mutations in sporadic than inherited cases (p = 0.003).
To investigate further the apparent selection-free environment, from the stand point of mtDNA mutations, in somatic cells, protein encoding disease-causing mutations were grouped according to whether these had been reported as ‘inherited’ or ‘sporadic’ mtDNA mutations (as defined in the original reports, see Table S2 for details). Whilst inherited mtDNA mutations are passed through the germline, it is proposed that sporadic mutations occur during early embryological development beyond the point of the genetic bottleneck , . Further examination of frameshift and termination codon mutations (which might be predicted to be the most pathogenic) showed that a significantly higher number were sporadic (78%) rather than inherited (22%) (chi-squared analysis, p = 0.003) (Figure 2B), supporting the idea that these arose after the selective check point.
Age-related and disease-causing mutations in protein-encoding genes are more pathogenic than rare population variants
Pathogenicity of the non-synonymous mutations in the three groups was predicted using MutPred software , and the groups compared. The MutPred score is determined by a set of features reflecting protein structure, evolutionary conservation, presence of functional residues and amino acid biases at the substitution site and its neighbours. Amino acid variants are identified by comparison to their counterparts defined by the rCRS sequence . Mutations are assigned a score between 0 and 1, with higher pathogenicity scores corresponding to a greater likelihood that the amino acid change is pathogenic. Only non-synonymous protein encoding gene mutations were used in this analysis because the software is unable to analyse the pathogenicity of frameshift mutations and those which predict a premature termination codon. In addition, pathogenicity scores for all possible amino acid changes in the mitochondrial genome, predicted by the MutPred software (published as a supplementary table in ) were included in these analyses (Figure 3A). There was no significant difference in the mean pathogenicity score between the age-related mutations (0.651±0.203 (sd), n = 52) and all possible mutations (0.642±0.154, n = 24206) (p = 0.94, Wilcoxon rank sum test). This evidence supports the hypothesis that there are no selective constraints on mutations which accumulate in the human colon during ageing. The age-related mtDNA mutations had significantly higher MutPred scores (0.651±0.203, n = 52) than the rare population mtDNA mutations (0.481±0.202, n = 53) (p<0.00017) but significantly lower scores than disease-associated mtDNA mutations (0.761±0.13, n = 54) (p = 0.0016), supporting the hypothesis that these mutations occur at random during the life-course and their expansion is not curtailed by purifying selection.
A: The distribution of pathogenicity scores in ageing, populations, disease and all possible mutations. Results of Wilcoxon rank sum analysis are shown. With six tests, the threshold for significance is 0.05/6 = 0.008, ** = p<0.0017, *** = p<0.00017. B: The distribution of pathogenicity scores including synonymous mtDNA mutations which were assigned a pathogenicity score of 0, in ageing, population and disease. Results of Wilcoxon rank sum analysis are shown. With three tests the threshold for significance is 0.05/3 = 0.017, ** = p<0.003, *** = p<0.0003. C: The distribution of pathogenicity scores in sporadic and inherited disease-causing mtDNA mutations. Results of Wilcoxon rank sum test are shown,* p<0.05.
To obtain a more complete picture of the pathogenicity of the protein encoding mtDNA mutations, the synonymous mutations in the ageing and population groups (there were none in the disease group) were assigned a pathogenicity score of 0 (as they do not change an amino acid and are therefore very unlikely to have a functional effect) and included in the analysis (Figure 3B). There were significant differences between the three groups (Wilcoxon rank sum tests, p<0.0003 in all cases) with pathogenicity scores highest in the disease-associated mutation group (0.761±0.13, n = 54) followed by age-associated mutations (0.513±0.323, n = 66) and population variants (0.171±0.26, n = 149). Given the very high pathogenicity scores predicted in the disease mutation group, mutations in this group were separated into inherited and sporadic mutations and the mean scores compared (Figure 3C). Mean MutPred pathogenicity score was significantly higher in the sporadic mutation group (0.831±0.059, n = 16) compared with the inherited mutations (0.737±0.136, n = 26) (p = 0.02, Wilcoxon rank sum test).
MtDNA mutations in other somatic tissues
All of the above analyses have been based on somatic mtDNA point mutations detected in individual ageing human colonic crypts as this is the most comprehensive dataset of such mutations to date. However it is known that there are differences in mutation occurrence and accumulation between different tissues. For example clonally expanded point mutations and not large scale deletions are commonly found in dividing tissues , whereas large-scale deletions are the predominant species of mtDNA mutation in post-mitotic tissues , . In addition, it has been shown that in mice with two different mitochondrial genotypes, there is tissue specific accumulation of one genotype over another . Therefore a literature search was carried out and a dataset of mtDNA point mutations in normal ageing tissues compiled. Mutations were only included if they had been detected using the same methodology and quality control analyses as the mutations detected in the colonic epithelium, and were in normal, not tumour tissues. A total of 24 mutations were included from stomach epithelium , prostatic epithelium , bladder urothelium , small intestinal epithelium , pancreas  and liver , . All mutations were detected in COX deficient cells. Full details of all mutations are in Table S4. The mutations detected had a similar mutation spectrum as the colon mutations, with 92% of the mutations being transitions, 4% transversions and 4% single base pair deletions. Due to the small size of the dataset, it was not possible to do a detailed comparison of the location of the mutations, but mutations were found in all gene types (protein encoding, mt-rRNA and mt-tRNA genes). There were 8 non-synonymous protein encoding mutations, all in COX encoding genes, which had a mean MutPred score of 0.811±0.03. This was then compared to the MutPred score for COX-encoding gene mutations in COX deficient cells of the colon. To exclude the effects of multiple mtDNA mutations within a cell, mutations were only included if they were either the only mutation in the cell, or the only coding-region mutation in the cell. There was no significant difference in the MutPred score between the mtDNA mutations in the colonic crypts (0.743±0.03, n = 10) and those in the other tissues analysed (p = 0.07, Wilcoxon rank sum test). These data provide evidence that occurrence and clonal expansion of pathogenic protein encoding mutations with age is not restricted to only the colonic epithelium, but is likely to be relevant to other mitotic tissues.
The aim of this study was to compare the selective constraints on the occurrence and expansion of age-related mtDNA mutations in the human colon with those in the general population and those which cause primary mtDNA disease. Purifying selection in the germline has been demonstrated in two studies using two different mouse models. One group utilised a mouse with a homozygous knock-in allele expressing a proof-reading- deficient mtDNA polymerase γ which, as a consequence, accumulates high levels of random mtDNA point mutations . This mouse was backcrossed with mice with wild-type mtDNA polymerase and the offspring showed rapid elimination from the germline of non-synonymous protein encoding mutations. A complementary study generated a mouse model of mtDNA disease with 2 mtDNA mutations, one relatively mild mtDNA mutation and one severe mtDNA mutation. Only the mild mutation was transmitted through the germline to the next generation. . The reduction of mtDNA copy number at the mtDNA bottleneck followed by huge expansion of a small number of mtDNA molecules allows rapid fixation of new mtDNA mutations and the opportunity to select against deleterious mutants by unknown mechanisms which could operate at the DNA, organelle or cellular level (primordial germ cell lineage) .
Our study has shown no evidence for the action of purifying selection on age-related mtDNA mutations in a number of different tissue types, which we observed are distributed randomly throughout the mitochondrial genome. Further, frequency of age-related mtDNA mutations in each gene is equal to the proportion of the mitochondrial genome occupied by that gene suggesting that there is no functional selection. There was a predominance of transition mutations compared with transversions and insertions/deletions, with 65% of those transitions being C:G to T:A mutations. These mutation types are likely to be due to errors during replication of mtDNA and/or spontaneous cytosine deamination , the remaining transitions are likely to be due to replication errors. There was only one C:G to A:T mutation in the dataset, suggesting that there is little contribution of 8-oxo-deoxyguanine mediated mutagenesis, thus casting doubt on the importance of oxidative damage in ageing, at least in this tissue type. The mutational spectrum here probably reflects the initial mechanism of mutation occurrence rather than selection during clonal expansion.
There were differences between the somatic age-related mtDNA mutations and the mutations we observe in the general population. The most prominent differences are the expansion of non-synonymous protein encoding mutations, including frameshift and termination mutations, in somatic cells compared with the germline, and the higher pathogenicity scores of the age-related mutations. These observations provide evidence that there is little or no selection against pathogenic mutations in the soma, whereas the germline is relatively protected from such mutations by the mtDNA bottleneck . Pathogenic mtDNA mutations in both mt-tRNA genes and mt-mRNA genes which cause disease can either be maternally- inherited or sporadic. Our predicted pathogenicity analysis of the mutations in the protein encoding genes shows that those which are sporadic, and therefore occur post-bottleneck, are more pathogenic than those which are inherited. The high prevalence of mt-tRNA mutations which cause primary mtDNA disease suggests that they are passed through the germline more easily than mt-mRNA variants, perhaps because in the majority of cases they produce a less severe phenotype , . The mouse study by Stewart et al., also showed no evidence of purifying selection on mt-tRNA mutations .
A number of the mtDNA mutations detected in the human colon were also associated with loss of COX activity, and COX deficiency is a common observation in patients with mtDNA disease. This suggests that mitochondrial quality control systems such as mitophagy have limited efficiency to remove dysfunctional mitochondria to restore normal function. This is in contrast to a study in which Parkin, an E3 ligase which selectively translocates to depolarised mitochondria and activates their elimination by autophagy, was continually overexpressed in a cybrid cell line containing a mutation in MT-CO1 . Parkin overexpression resulted in selective degradation of dysfunctional mitochondria, an increase in the proportion of wild-type mtDNA, and restoration of COX activity. In our study we see accumulation of pathogenic mtDNA mutations in individual cells. The difference could either be because mtDNA mutations which lower membrane potential to the extent by which Parkin becomes activated are selected against and are therefore not able to accumulate in individual somatic cells, or that physiologically normal levels of Parkin do not have the same effect as Parkin over expression.
These studies have shown that mutations in genes other than MT-CO genes can result in COX deficiency. Disease-associated mutations in mt-tRNA genes are commonly associated with COX defects and mutations in mt-rRNA genes could be causing a translational defect to explain the COX deficiency. We have previously shown that multiple respiratory chain complex deficiency is a common occurrence in the human colon, with approximately 50% of crypts exhibiting COX deficiency also showing a loss of complex I and III subunits immunohistochemically . This multiple respiratory chain complex deficiency could therefore be attributed to mutations in any of the mt-mRNA genes, explaining why we observe COX deficiency associated with mutations in non-MTCO genes. Multiple biochemical defects have been observed in patients with mutations in structural subunits of complex I  and complex III ,  highlighting the fact that complexes of the respiratory chain do not simply exist as simple entities but associate to form ‘supercomplexes’ , . Moreover, it has recently been shown that if one component complex is absent, supercomplexes are not formed  and that the incorporation of a defective structural component, such as a truncated form of COX1, can lead to the rapid degradation of supercomplexes and downregulation of other mitochondrially-encoded respiratory chain subunits 
From an evolutionary perspective, our data are consistent with the predictions of the disposable soma theory of ageing . By comparing the somatic age-related mtDNA mutations which clonally expand within individual cells of the colon with those passed through the germline in populations we have shown considerable evidence supporting the hypothesis that there is selection against pathogenic mtDNA mutations in the human germ-line but little evidence of any such selection in the somatic tissues studied here.
As there are multiple copies of the mitochondrial genome in an individual cell, any mutation occurring in one molecule will be functionally recessive . A functional defect becomes apparent only when a mutation expands clonally to high levels in a cell (typically >70%) . Numerous studies in human tissues have shown an age-dependent increase in the fraction of individual cells which are respiratory-chain deficient , , –,  and that the cause of respiratory chain deficiency is clonally expanded somatic mtDNA mutations , , . Since our data suggest that there is no obvious imprint of selection on the spectrum of age-related mtDNA mutations in the human colon, it remains of interest to consider how such clonal expansion occurs. One leading hypothesis, supported by computational models, is that clonal expansion occurs simply through a process of random genetic drift . Unlike nuclear DNA, mitochondrial DNA is replicated independently of the cell cycle and not all molecules are always replicated (relaxed replication) . The random genetic drift hypothesis suggests that relaxed replication of mtDNA coupled with random degradation of some mtDNA molecules can lead to one mutant genotype becoming the dominant genotype of the cell. A study using heteroplasmic mice containing two different mtDNA genotypes measured the relative contribution of each genotype in individual colonic crypts over time. The study showed that in individual crypts from 4 month old mice there was a mixture of the two genotypes, however in the crypts of the 15 month old mice two distinct crypt populations were observed with one or other of the genotypes predominating. This experimental data fits with a model of neutral drift . In humans, the random genetic drift model predicts that clonal expansion takes decades to occur . Experimental data from a number of tissues has shown that respiratory chain deficient cells are not detectable in humans before approximately age 35 years , , and in the human colon we have shown previously that such defects are present in one third or more of colonocytes from those aged 70+ . This, coupled with the mutational spectrum detected in the human colon is supportive of the ‘early mutation, slow expansion’ theory of mitochondrial ageing .
In conclusion, these investigations have shown significant differences in the spectrum and predicted pathogenicity of mtDNA mutations in the somatic tissues studied here which are associated with ageing, rare population variants which have passed through the germ-line and those pathogenic mtDNA mutations associated with disease. We found little evidence for purifying selection in the somatic tissues investigated, whereas there is strong evidence for purifying selection in the germ-line.
Materials and Methods
Generation of a dataset of age-associated somatic mtDNA mutations
Colorectal mucosal samples were collected from the same anatomical site (10 cm from the anal verge) from subjects (n = 9, age range 71–78 years) undergoing colonoscopy for disturbed bowel function in whom no evidence of bowel disease was identified (BORICC 1 Study). Informed consent was given by all participants in the study and ethical approval was obtained from the Northumbria NHS Trust Local Research Ethics Committee (project reference 04/Q0902/6/).
Single colonocytes were laser-microdissected into sterile 0.5 ml PCR tubes using a Leica Laser Microdissection (AS-LMD) System. Following centrifugation (7000×g for 10 min), the cell was lysed in 14 µl of cell lysis buffer (50 mM Tris-HCl pH 8.5, 1 mM EDTA, 0.5% Tween-20, 200 ng/ml proteinase K) at 55°C for 2 hours and then 95°C for 10 minutes to denature the proteinase K. The entire sequence of the mitochondrial genome from microdissected colonocytes was determined using the single cell lysate as the DNA template. Two rounds of PCR were carried out as previously described . PCR products were cycle sequenced using ABI BigDye chemistries per standard manufacturer's protocols and analysed on an ABI3100 genetic analyser (Applied Biosystems). Sequences obtained were compared with the rCRS  and the homogenate sequence for each patient (phylogenetic quality control), using SeqScape software (Applied Biosystems). Due to potential problems with low mtDNA copy number PCR , all putative mutations were re-amplified from the original DNA lysate and were sequenced in both the forward and reverse directions. This strategy aimed to eliminate any errors due to either amplification of nuclear pseudogenes or errors introduced by the DNA polymerase during PCR amplification. These mutations were added to an existing database of published somatic mtDNA mutations detected in the ageing colon , , . A full list of mutations can be found in Table S1.
Generation of a disease-associated mtDNA mutation data set
Due to the inherent difficulty in assigning pathogenicity to reported mtDNA disease-associated mutations we applied strict criteria when assembling a database of such mutations for the purpose of this study. Protein encoding mutations were included only if they were assigned as ‘probably’ or ‘definitely pathogenic’ using a validated scoring system designed in this laboratory . Our database includes all such mutations previously assigned pathogenicity scores by Wong in 2007 , as well as 4 more recently published mutations which were shown to score in these categories –. We also included tRNA mutations proven to be pathogenic by a similar scoring system . Only two rRNA mutations have been confirmed on mitomap to be pathogenic and therefore only these two were included in the analysis . A full list of mutations can be found in Table S2.
Rare population mtDNA variant dataset
All of the participants whose data were included in the ageing mutation database were of European haplotype therefore the 100 European coding region sequences were chosen. The 100 European sequences were randomly selected reflecting the European haplogroup distribution. The sequences were drawn from the MitoKor database, the changes selected as the rare population variants (Table S3) had been classified as “private” for the purposes of the MitoKor database. In the context of the “Mitokor” database a private change was one that was seen in a single individual only from a continental group i.e. for the European sequences it was seen in only one from 433 individuals. In the time since the generation of the MitoKor database there has been a vast increase in the amount of available sequences but the variants analysed remain a pool of rare variants on European haplogroup backgrounds. In addition, all sequencing to yield data for the Mitokor database was carried out with fourfold redundancy which provides confidence that these are rare variants not sequencing errors.
The pathogenicity of non-synonymous mutations in the 13 protein encoding genes was predicted using the computational software MutPred , which is based on the established SIFT method , . The MutPred scores for all 24,206 possible amino acid variations published by Pereira et al  were also included in these analyses.
Somatic mitochondrial DNA point mutations detected in ageing human colonic crypt cells.
Mitochondrial DNA point mutations which are reported to cause human mitochondrial disease.
Rare population variants in 100 European coding region mitochondrial DNA sequences.
Somatic mitochondrial DNA point mutations reported in ageing human tissues.
Conceived and designed the experiments: LCG JLE DMT JCM TBLK. Performed the experiments: LCG MN GAT. Analyzed the data: LCG JLE JPG RWT. Contributed reagents/materials/analysis tools: JCM. Wrote the paper: LCG JLE MN JPG GAT RWT JCM TBLK DMT. Statistical analysis: JLE JPG.
- 1. Troen BR (2003) The biology of aging. Mt Sinai J Med 70: 3–22.
- 2. Kirkwood TB, Melov S (2011) On the programmed/non-programmed nature of ageing within the life history. Curr Biol 21: R701–707. doi: 10.1016/j.cub.2011.07.020
- 3. Kirkwood TB (1977) Evolution of ageing. Nature 270: 301–304. doi: 10.1038/270301a0
- 4. Kirkwood TB (2005) Understanding the odd science of aging. Cell 120: 437–447. doi: 10.1016/j.cell.2005.01.027
- 5. Linnane AW, Marzuki S, Ozawa T, Tanaka M (1989) Mitochondrial DNA mutations as an important contributor to ageing and degenerative diseases. Lancet 1: 642–645. doi: 10.1016/s0140-6736(89)92145-4
- 6. Nass MM (1966) The circularity of mitochondrial DNA. Proc Natl Acad Sci U S A 56: 1215–1222. doi: 10.1073/pnas.56.4.1215
- 7. Taylor RW, Turnbull DM (2005) Mitochondrial DNA mutations in human disease. Nat Rev Genet 6: 389–402. doi: 10.1038/nrg1606
- 8. Old SL, Johnson MA (1989) Methods of microphotometric assay of succinate dehydrogenase and cytochrome c oxidase activities for use on human skeletal muscle. Histochem J 21: 545–555. doi: 10.1007/bf01753355
- 9. Greaves LC, Preston SL, Tadrous PJ, Taylor RW, Barron MJ, et al. (2006) Mitochondrial DNA mutations are established in human colonic stem cells, and mutated clones expand by crypt fission. Proc Natl Acad Sci U S A 103: 714–719. doi: 10.1073/pnas.0505903103
- 10. Taylor RW, Barron MJ, Borthwick GM, Gospel A, Chinnery PF, et al. (2003) Mitochondrial DNA mutations in human colonic crypt stem cells. J Clin Invest 112: 1351–1360. doi: 10.1172/jci19435
- 11. Shoubridge EA, Wai T (2007) Mitochondrial DNA and the mammalian oocyte. Curr Top Dev Biol 77: 87–111. doi: 10.1016/s0070-2153(06)77004-1
- 12. Elson JL, Andrews RM, Chinnery PF, Lightowlers RN, Turnbull DM, et al. (2001) Analysis of European mtDNAs for recombination. Am J Hum Genet 68: 145–153. doi: 10.1086/316938
- 13. Muller HJ (1964) The relation of recombination to mutational advance. Mutat Res 106: 2–9.
- 14. Bogenhagen D, Clayton DA (1977) Mouse L cell mitochondrial DNA molecules are selected randomly for replication throughout the cell cycle. Cell 11: 719–727. doi: 10.1016/0092-8674(77)90286-0
- 15. Rand DM (2008) Mitigating mutational meltdown in mammalian mitochondria. PLoS Biol 6: e35 doi:10.1371/journal.pbio.0060035.
- 16. Lee HC, Chang CM, Chi CW (2010) Somatic mutations of mitochondrial DNA in aging and cancer progression. Ageing Res Rev 9Suppl 1: S47–58. doi: 10.1016/j.arr.2010.08.009
- 17. Cree LM, Samuels DC, de Sousa Lopes SC, Rajasimha HK, Wonnapinij P, et al. (2008) A reduction of mitochondrial DNA molecules during embryogenesis explains the rapid segregation of genotypes. Nat Genet 40: 249–254. doi: 10.1038/ng.2007.63
- 18. McFarland R, Taylor RW, Turnbull DM (2007) Mitochondrial disease–its impact, etiology, and pathology. Curr Top Dev Biol 77: 113–155. doi: 10.1016/s0070-2153(06)77005-3
- 19. Wai T, Teoli D, Shoubridge EA (2008) The mitochondrial DNA genetic bottleneck results from replication of a subpopulation of genomes. Nat Genet 40: 1484–1488. doi: 10.1038/ng.258
- 20. Shoubridge EA, Wai T (2008) Medicine. Sidestepping mutational meltdown. Science 319: 914–915. doi: 10.1126/science.1154515
- 21. Elson JL, Turnbull DM, Howell N (2004) Comparative genomics and the evolution of human mitochondrial DNA: assessing the effects of selection. Am J Hum Genet 74: 229–238. doi: 10.1086/381505
- 22. Sun C, Kong QP, Zhang YP (2007) The role of climate in human mitochondrial DNA evolution: a reappraisal. Genomics 89: 338–342. doi: 10.1016/j.ygeno.2006.11.005
- 23. Stewart JB, Freyer C, Elson JL, Wredenberg A, Cansu Z, et al. (2008) Strong purifying selection in transmission of mammalian mitochondrial DNA. PLoS Biol 6: e10 doi:10.1371/journal.pbio.0060010.
- 24. Fan W, Waymire KG, Narula N, Li P, Rocher C, et al. (2008) A mouse model of mitochondrial disease reveals germline selection against severe mtDNA mutations. Science 319: 958–962. doi: 10.1126/science.1147786
- 25. Cottrell DA, Blakely EL, Johnson MA, Ince PG, Borthwick GM, et al. (2001) Cytochrome c oxidase deficient cells accumulate in the hippocampus and choroid plexus with age. Neurobiol Aging 22: 265–272. doi: 10.1016/s0197-4580(00)00234-7
- 26. Fellous TG, Islam S, Tadrous PJ, Elia G, Kocher HM, et al. (2009) Locating the stem cell niche and tracing hepatocyte lineages in human liver. Hepatology 49: 1655–1663. doi: 10.1002/hep.22791
- 27. McDonald SA, Greaves LC, Gutierrez-Gonzalez L, Rodriguez-Justo M, Deheragoda M, et al. (2008) Mechanisms of field cancerization in the human stomach: the expansion and spread of mutated gastric stem cells. Gastroenterology 134: 500–510. doi: 10.1053/j.gastro.2007.11.035
- 28. Muller-Hocker J (1989) Cytochrome-c-oxidase deficient cardiomyocytes in the human heart–an age-related phenomenon. A histochemical ultracytochemical study. Am J Pathol 134: 1167–1173.
- 29. Muller-Hocker J (1990) Cytochrome c oxidase deficient fibres in the limb muscle and diaphragm of man without muscular disease: an age-related alteration. J Neurol Sci 100: 14–21. doi: 10.1016/0022-510x(90)90006-9
- 30. Kujoth GC, Hiona A, Pugh TD, Someya S, Panzer K, et al. (2005) Mitochondrial DNA mutations, oxidative stress, and apoptosis in mammalian aging. Science 309: 481–484. doi: 10.1126/science.1112125
- 31. Trifunovic A, Wredenberg A, Falkenberg M, Spelbrink JN, Rovio AT, et al. (2004) Premature ageing in mice expressing defective mitochondrial DNA polymerase. Nature 429: 417–423. doi: 10.1038/nature02517
- 32. Vermulst M, Wanagat J, Kujoth GC, Bielas JH, Rabinovitch PS, et al. (2008) DNA deletions and clonal mutations drive premature aging in mitochondrial mutator mice. Nat Genet 40: 392–394. doi: 10.1038/ng.95
- 33. Bender A, Krishnan KJ, Morris CM, Taylor GA, Reeve AK, et al. (2006) High levels of mitochondrial DNA deletions in substantia nigra neurons in aging and Parkinson disease. Nat Genet 38: 515–517. doi: 10.1038/ng1769
- 34. Cortopassi GA, Arnheim N (1990) Detection of a specific mitochondrial DNA deletion in tissues of older humans. Nucleic Acids Res 18: 6927–6933. doi: 10.1093/nar/18.23.6927
- 35. Cortopassi GA, Shibata D, Soong NW, Arnheim N (1992) A pattern of accumulation of a somatic deletion of mitochondrial DNA in aging human tissues. Proc Natl Acad Sci U S A 89: 7370–7374. doi: 10.1073/pnas.89.16.7370
- 36. Kraytsberg Y, Kudryavtseva E, McKee AC, Geula C, Kowall NW, et al. (2006) Mitochondrial DNA deletions are abundant and cause functional impairment in aged human substantia nigra neurons. Nat Genet 38: 518–520. doi: 10.1038/ng1778
- 37. De Alwis N, Hudson G, Burt AD, Day CP, Chinnery PF (2009) Human liver stem cells originate from the canals of Hering. Hepatology 50: 992–993. doi: 10.1002/hep.23160
- 38. Greaves LC, Barron MJ, Plusa S, Kirkwood TB, Mathers JC, et al. (2010) Defects in multiple complexes of the respiratory chain are present in ageing human colonic crypts. Exp Gerontol 45: 573–579. doi: 10.1016/j.exger.2010.01.013
- 39. Shin MG, Kijigaya S, McCoy JP Jr, Levin BC, Young NS (2004) Marked mitochondrial DNA sequence heterogeneity in single CD34+ cell clones from normal adult bone marrow. Blood 103: 553–561. doi: 10.1182/blood-2003-05-1724
- 40. Elson JL, Swalwell H, Blakely EL, McFarland R, Taylor RW, et al. (2009) Pathogenic mitochondrial tRNA mutations–which mutations are inherited and why? Hum Mutat 30: E984–992. doi: 10.1002/humu.21113
- 41. Li B, Krishnan VG, Mort ME, Xin F, Kamati KK, et al. (2009) Automated inference of molecular mechanisms of disease from amino acid substitutions. Bioinformatics 25: 2744–2750. doi: 10.1093/bioinformatics/btp528
- 42. Andrews RM, Kubacka I, Chinnery PF, Lightowlers RN, Turnbull DM, et al. (1999) Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nat Genet 23: 147. doi: 10.1038/13779
- 43. Pereira L, Soares P, Radivojac P, Li B, Samuels DC (2011) Comparing phylogeny and the predicted pathogenicity of protein variations reveals equal purifying selection across the global human mtDNA diversity. Am J Hum Genet 88: 433–439. doi: 10.1016/j.ajhg.2011.03.006
- 44. Reeve AK, Krishnan KJ, Elson JL, Morris CM, Bender A, et al. (2008) Nature of mitochondrial DNA deletions in substantia nigra neurons. Am J Hum Genet 82: 228–235. doi: 10.1016/j.ajhg.2007.09.018
- 45. Jenuth JP, Peterson AC, Shoubridge EA (1997) Tissue-specific selection for different mtDNA genotypes in heteroplasmic mice. Nat Genet 16: 93–95. doi: 10.1038/ng0597-93
- 46. Blackwood JK, Williamson SC, Greaves LC, Wilson L, Rigas AC, et al. (2011) In situ lineage tracking of human prostatic epithelial stem cell fate reveals a common clonal origin for basal and luminal cells. J Pathol 225: 181–188. doi: 10.1002/path.2965
- 47. Gaisa NT, Graham TA, McDonald SA, Canadillas-Lopez S, Poulsom R, et al. (2011) The human urothelium consists of multiple clonal units, each maintained by a stem cell. J Pathol 225: 163–171. doi: 10.1002/path.2945
- 48. Gutierrez-Gonzalez L, Deheragoda M, Elia G, Leedham SJ, Shankar A, et al. (2009) Analysis of the clonal architecture of the human small intestinal epithelium establishes a common stem cell for all lineages and reveals a mechanism for the fixation and spread of mutations. J Pathol 217: 489–496. doi: 10.1002/path.2502
- 49. Fellous TG, McDonald SA, Burkert J, Humphries A, Islam S, et al. (2009) A methodological approach to tracing cell lineage in human epithelial tissues. Stem Cells 27: 1410–1420. doi: 10.1002/stem.67
- 50. Fliss MS, Usadel H, Caballero OL, Wu L, Buta MR, et al. (2000) Facile detection of mitochondrial DNA mutations in tumors and bodily fluids. Science 287: 2017–2019. doi: 10.1126/science.287.5460.2017
- 51. Vermulst M, Bielas JH, Kujoth GC, Ladiges WC, Rabinovitch PS, et al. (2007) Mitochondrial point mutations do not limit the natural lifespan of mice. Nat Genet 39: 540–543. doi: 10.1038/ng1988
- 52. Hauswirth WW, Laipis PJ (1982) Mitochondrial DNA polymorphism in a maternal lineage of Holstein cows. Proc Natl Acad Sci U S A 79: 4686–4690. doi: 10.1073/pnas.79.15.4686
- 53. Suen DF, Narendra DP, Tanaka A, Manfredi G, Youle RJ (2010) Parkin overexpression selects against a deleterious mtDNA mutation in heteroplasmic cybrid cells. Proc Natl Acad Sci U S A 107: 11835–11840. doi: 10.1073/pnas.0914569107
- 54. Greaves LC, Reeve AK, Taylor RW, Turnbull DM (2012) Mitochondrial DNA and disease. J Pathol 226: 274–286. doi: 10.1002/path.3028
- 55. Hinttala R, Smeets R, Moilanen JS, Ugalde C, Uusimaa J, et al. (2006) Analysis of mitochondrial DNA sequences in patients with isolated or combined oxidative phosphorylation system deficiency. J Med Genet 43: 881–886. doi: 10.1136/jmg.2006.042168
- 56. Bruno C, Santorelli FM, Assereto S, Tonoli E, Tessa A, et al. (2003) Progressive exercise intolerance associated with a new muscle-restricted nonsense mutation (G142X) in the mitochondrial cytochrome b gene. Muscle Nerve 28: 508–511. doi: 10.1002/mus.10429
- 57. Rana M, de Coo I, Diaz F, Smeets H, Moraes CT (2000) An out-of-frame cytochrome b gene deletion from a patient with parkinsonism is associated with impaired complex III assembly and an increase in free radical production. Ann Neurol 48: 774–781. doi: 10.1002/1531-8249(200011)48:5<774::aid-ana11>3.3.co;2-9
- 58. Acin-Perez R, Fernandez-Silva P, Peleato ML, Perez-Martos A, Enriquez JA (2008) Respiratory active mitochondrial supercomplexes. Mol Cell 32: 529–539. doi: 10.1016/j.molcel.2008.10.021
- 59. Schagger H, Pfeiffer K (2000) Supercomplexes in the respiratory chains of yeast and mammalian mitochondria. EMBO 19: 1777–1783. doi: 10.1093/emboj/19.8.1777
- 60. Hornig-Do HT, Tatsuta T, Buckermann A, Bust M, Kollberg G, et al. (2012) Nonsense mutations in the COX1 subunit impair the stability of respiratory chain complexes rather than their assembly. EMBO J 31: 1293–1307. doi: 10.1038/emboj.2011.477
- 61. Larsson NG, Clayton DA (1995) Molecular genetic aspects of human mitochondrial disorders. 151–178. doi: 10.1146/annurev.ge.29.120195.001055
- 62. Brierley EJ (1997) Mitochonrial damage in human ageing. Reveiws in clinical gerontology 7: 95–105.
- 63. Elson JL, Samuels DC, Turnbull DM, Chinnery PF (2001) Random intracellular drift explains the clonal expansion of mitochondrial DNA mutations with age. Am J Hum Genet 68: 802–806. doi: 10.1086/318801
- 64. Khrapko K (2011) The timing of mitochondrial DNA mutations in aging. Nat Genet 43: 726–727. doi: 10.1038/ng.895
- 65. Yao YG, Bandelt HJ, Young NS (2007) External contamination in single cell mtDNA analysis. PLoS ONE 2: e681 doi:10.1371/journal.pone.0000681.
- 66. Mitchell AL, Elson JL, Howell N, Taylor RW, Turnbull DM (2006) Sequence variation in mitochondrial complex I genes: mutation or polymorphism? J Med Genet 43: 175–179.
- 67. Wong LJ (2007) Pathogenic mitochondrial DNA mutations in protein-coding genes. Muscle Nerve 36: 279–293. doi: 10.1002/mus.20807
- 68. Alston CL, Morak M, Reid C, Hargreaves IP, Pope SA, et al. (2010) A novel mitochondrial MTND5 frameshift mutation causing isolated complex I deficiency, renal failure and myopathy. Neuromuscul Disord 20: 131–135. doi: 10.1016/j.nmd.2009.10.010
- 69. Downham E, Winterthun S, Nakkestad HL, Hirth A, Halvorsen T, et al. (2008) A novel mitochondrial ND5 (MTND5) gene mutation giving isolated exercise intolerance. Neuromuscul Disord 18: 310–314. doi: 10.1016/j.nmd.2008.01.003
- 70. Marotta R, Chin J, Kirby DM, Chiotis M, Cook M, et al. (2011) Novel single base pair COX III subunit deletion of mitochondrial DNA associated with rhabdomyolysis. J Clin Neurosci 18: 290–292. doi: 10.1016/j.jocn.2010.06.001
- 71. Fragaki K, Procaccio V, Bannwarth S, Serre V, O'Hearn S, et al. (2009) A neonatal polyvisceral failure linked to a de novo homoplasmic mutation in the mitochondrially encoded cytochrome b gene. Mitochondrion 9: 346–352. doi: 10.1016/j.mito.2009.06.002
- 72. Yarham JW, Al-Dosary M, Blakely EL, Alston CL, Taylor RW, et al. (2011) A comparative analysis approach to determining the pathogenicity of mitochondrial tRNA mutations. Hum Mutat 32: 1319–1325. doi: 10.1002/humu.21575
- 73. MITOMAP: A Human Mitochondrial Genome Database. http://www.mitomap.org (2011) Accessed 2012 May.
- 74. Ng PC, Henikoff S (2001) Predicting deleterious amino acid substitutions. Genome Res 11: 863–874. doi: 10.1101/gr.176601
- 75. Ng PC, Henikoff S (2003) SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res 31: 3812–3814. doi: 10.1093/nar/gkg509