Several human diseases have been associated with mutations in mitochondrial genes comprising a set of confirmed and reported mutations according to the MITOMAP database. An analysis of complete mitogenomes across 139 primate species showed that most confirmed disease-associated mutations occurred in aligned codon positions and gene regions under strong purifying selection resulting in a strong evolutionary conservation. Only two confirmed variants (7.1%), coding for the same amino acids accounting for severe human diseases, were identified without apparent pathogenicity in non-human primates, like the closely related Bornean orangutan. Conversely, reported disease-associated mutations were not especially concentrated in conserved codon positions, and a large fraction of them occurred in highly variable ones. Additionally, 88 (45.8%) of reported mutations showed similar variants in several non-human primates and some of them have been present in extinct species of the genus Homo. Considering that recurrent mutations leading to persistent variants throughout the evolutionary diversification of primates are less likely to be severely damaging to fitness, we suggest that these 88 mutations are less likely to be pathogenic. Conversely, 69 (35.9%) of reported disease-associated mutations occurred in extremely conserved aligned codon positions which makes them more likely to damage the primate mitochondrial physiology.
Citation: Tavares WC, Seuánez HN (2017) Disease-associated mitochondrial mutations and the evolution of primate mitogenomes. PLoS ONE 12(5): e0177403. https://doi.org/10.1371/journal.pone.0177403
Editor: Roscoe Stanyon, University of Florence, ITALY
Received: March 8, 2017; Accepted: April 26, 2017; Published: May 16, 2017
Copyright: © 2017 Tavares, Seuánez. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data are available in GenBank and the accession numbers are listed in the paper.
Funding: Conselho Nacional de Desenvolvimento, CNPq-Brazil (www.cnpq.br) provided funding for computing facilities (grant 303306/2010-6). Coordenação de Aperfeiçoamento de Pessoal de Nível Superior-CAPES-Brazil (www.capes.gov.br) and Fundação de Amparo à Pesquisa do Estado do Rio de Janeiro- Brazil-FAPERJ (www.faperj.br) supprted William Tavares with post-doctoral grant 209101/E_44/2014. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
The mammalian mitogenome comprises a set of 13 genes coding for proteins of the electron transport chain, two genes specifying for 12S- and 16S rRNAs, 22 for tRNAs, and a non-coding D-loop region, altogether encompassing approximately 16,600 bp . The mt-proteins of the electron transport chain together with other 80 proteins encoded by nuclear genes  comprise the oxidative phosphorylation (OXPHOS) machinery consisting of five multimeric protein complexes, one exclusively containing nuclear proteins (complex II) and four other (I, III, IV and V) with nuclear and mitochondrial proteins .
As OXPHOS dysfunction accounts for serious impairments in energy production, mt-genes have been subject to strong evolutionary constraints and a drastically restricted variability under the effect of strong negative selection [4–6] despite punctual cases of positive selection and molecular adaptations to extreme metabolic demands [7–11].
Most pathogenic mt-mutations are eliminated or maintained with significantly reduced frequencies in populations [5,6,12]. Analyses of a large set of human mitochondrial genomes showed that amino acid variants with predicted high pathogenicity were rarely detected, while those with a predicted low pathogenicity emerged several times during human evolutionary diversification . Several mt-mutations are associated with metabolic disorders, human diseases or syndromes, like Leber Hereditary Optic Neuropathy (LHON), Mitochondrial Encephalomyopathy, Lactic Acidosis, and Stroke-like episodes (MELAS), Exercise Intolerance (EXIT), Mitochondrial Myopathy, Dystonia, Leigh Disease (LD), Alzheimer's Disease, Parkinson’s Disease, and Progressive Encephalomyopathy [14–18]. However, only a fraction of these mutations has been confirmed to be etiologically pathogenic by independent clinical studies, robust statistical analyses and well-known biochemical effects while most mutations are only reported to be associated with human pathologies without a clear causal relation with disease .
A phylogenetic approach might be useful for identifying mt-mutations etiologically responsible for OXPHOS damage because protein regions under strong functional and structural constraints are highly conserved . In fact, some methods designed for predicting the pathogenicity and severity of mutations consider evolutionary conservation of amino acid residues, but usually include a large set of distantly related organisms [20,21], likely with disparate physiological demands. On the other hand, we would expect that mutations similar to those associated with pathologic conditions in humans would also reduce fitness in very closely related species, being therefore subject to purifying selection . However, this would not be necessarily true if the physiological characteristics of other species were unaffected by these mutations or other compensatory mutations nullified their pathogenic effects . Conversely, non-deleterious mutations might have subsisted under neutral selection, with a high probability of being maintained across different taxa and, consequently to the lack of severe constraints, higher rates of amino acid substitutions would be expected. In this scenario, mapping the occurrence of disease-associated mutations in a phylogenetic tree of closely related species may shed light on the evolutionary constraints affecting these genes [2,3,22] as well as assessing their effect as causal factors.
In this paper, we analyze the diversity of the mitogenome in 139 primate species to test whether disease-associated mutations in humans were predominantly concentrated in codons under strong evolutionary constraints and to trace these changes within a phylogenetic approach, including ancestry, number of independent events and polarity with respect to their presumed or confirmed pathogenicity in humans.
Material and methods
Primate phylogeny based on mitogenomes
One hundred and forty-eight reference sequences of complete mitogenomes of 139 primate species were downloaded from GenBank (S1 Table). Taxonomic sampling included all living primate families, all hominid living species, three fossil hominids (Homo heidelbergensis, Homo neanderthalensis and the unnamed Homo from Denisova) and two fossil lemurs (Palaeopropithecus ingens and Megaladapis edwardsi).
Separate alignments of the 13 protein-coding mt-genes were run with Muscle  and checked manually with Mega v.6.0 . These alignments were subsequently concatenated in a single dataset with 11,406 bp for Maximum Likelihood (ML) phylogenetic reconstructions. ML was independently performed with PhyML v.3.1 in SeaView v.4.5.4 [25,26] and the GARLI online platform , with estimations of approximate likelihood-ratio tests (aLRT) using the General Time Reversible model with gamma distribution and invariable sites (GTR+Γ+I), which was indicated as the best evolutionary model by ModelGenerator v.0.85 . The GTR+Γ+I model assumes a symmetric substitution matrix, in which each pair of nucleotide substitutions has a different rate, nucleotides occurring at different frequencies, rates varying among sites according a gamma distribution and some sites remaining unchanged . The alignment of the 13 concatenated genes is available in S1 File.
Analyses of evolutionary conservation of codons and amino acid residues
The evolutionary conservation of each aligned codon position and the amino acid residue therein encoded was estimated with three different indices, named Ind1, Ind2, Ind3. The simplest index (Ind1) recorded the number of different amino acids per aligned codon position. Ind2 estimated the total number of missense mutations per aligned codon position along primate radiation using SLAC in DataMonkey server [30,31]. Finally, Ind3 reported the standardized evolutionary rates per aligned residue position with ConSurf server [32,33] using a Rate4Site algorithm under an empirical Bayesian methodology [34,35]. The Rate4Site algorithm uses the tree topology and branch lengths to calculate rates of amino acid substitutions per site. The resulting conservation scores are usually highly correlated with dN/dS (ratio of non-synonymous substitutions per non-synonymous sites to synonymous substitutions per synonymous sites) scores based on the coding DNA sequences .
Additionally, TreeSAAP was used for investigating whether, along primate diversification, non-synonymous substitutions in codon positions with confirmed and reported disease-associated mutations resulted in substitutions of amino acids with radically different physicochemical properties [36,37]. Analysis was carried out testing 31 physicochemical amino acid properties, and only changes of categories 6, 7 and 8 with z-scores > 3.09 (p < 0.001) were considered to be radical changes [36,37]. With TreeSAAP we estimated, for each residue position, the number of independent amino-acid substitutions resulting in changes of radical physicochemical properties, and the maximum number of physicochemical properties radically affected in each residue position. Amino acid substitutions resulting in radically different physicochemical properties would be expected to occur very rarely in residue positions under strong functional constraints while substitutions by amino acids with similar properties would be expected to be frequent.
SLAC was used for identifying codons under negative selection. SLAC reconstructs codon ancestral sequences, estimates the normalized expected and observed numbers of synonymous and non-synonymous substitutions and computes dN/dS ratios. A p-value derived from a two-tailed extended binomial distribution is used to assess significance of dN and dS differences (α = 0.05). Codons are considered to be negatively selected when dN < dS.
Disease-associated mutations per aligned codon position
The disease-associated missense mutations found in 13 protein-coding, human mt-genes were downloaded from UniProt and MITOMAP databases in November 2016 [17,38,39] for analyzing Ind1, Ind2 and Ind3 data. Two kinds of disease-associated mutations were considered according to MITOMAP criteria: (i) “reported” (when one or more publications considered a mutation as possibly pathogenic) and (ii) “confirmed” (when two or more independent laboratories reported the pathogenicity of a specific mutation, and the mitochondrial research community generally accepted it as being pathogenic). This classification will be used throughout this report.
The frequency distribution of each index (Ind1, Ind2 and Ind3) for (i) all aligned codon positions, (ii) aligned codon positions with reported disease-associated mutations, and (iii) aligned codon positions with confirmed disease-associated mutations was plotted to test whether disease-associated mutations were predominantly concentrated in highly conserved positions. The Kolmogorov-Smirnov tests were used for comparing the distributions of i-, ii- and iii- groups of aligned codon positions per index.
A t-test was used for investigating whether predicted pathogenicity scores of confirmed disease-associated mutations, estimated with MutPred  in a previous study , were higher than for reported disease-associated mutations.
ML analyses showed a very similar phylogenetic topology to previous reports [7,40–42] with high node support. A detailed description of this topology is available in S1 Fig. A newick formatted tree is available in S2 File.
Variation and evolution of primate mt-genes
The mt-CO1 gene was the most evolutionarily conserved of all 13 mt-genes (mean GTR+Γ+I distance = 0.253) while mt-ATP8 was the most variable (mean GTR+Γ+I distance = 0.582). A significant, negative correlation was observed between gene size and mean GTR+Γ+I distance (Fig 1A; r2 = 0.461; p = 0.011) but not between gene size and proportion of codons under negative selection (Fig 1B; r2 = 0.136; p = 0.214).
(A) Significant, positive correlation between mitochondrial gene size and mean GTR+Γ+I distance (r2 = 0. 461; p = 0.011). (B) Non-significant, negative correlation between gene size and percentage of codons under negative selection (r2 = 0.136; p = 0.214).
Except for mt-ATP6, mt-CO1, mt-CO2, mt-CO3 and mt-ND4L, where indels were absent, 45 indels occurring as independent evolutionary events were clustered in highly variable gene regions (S2 Table). This was the case of codon 85 of mt-ND3 with one deletion in bushbabies (Galago) and three independent insertions between codon 85 and 86 in mouse lemurs (Microcebus) and baboons (Papio). The mt-ND4 gene showed four independent deletions of codon 47 in the New World monkeys (Platyrrhini), bushbabies (Galago), lorises (Loris) and gibbons (Nomascus), as well as two independent deletions of the adjacent codons 49 and 50 in aye-aye (Daubentonia madagascariensis) and lineage including galagos and lorises (Lorisiformes). Twelve independent deletions and eight independent insertions occurred between codons 105 and 115 of the mt-ND6 gene in 20 taxa.
Analyses of 3,786 aligned codon positions showed that the number of different amino acids coded per position (Ind1) varied from 1 to 14 (mode = 1; mean = 3.020; S.D. = 2.408; Fig 1) while the number of non-synonymous substitutions per aligned codon position (Ind2) varied from 0 to 76 (mode = 0; mean = 6.564; S.D. = 9,318). The evolutionary rate of each amino acid position (Ind3) varied from − 0.719 to 6.529 (mode = − 0.707; median = − 0.479; mean of standardized rates = 0; S.D. = 1). The distributions of Ind1, Ind2 and the transformed Ind3 (for including only positive estimates) fitted well to the exponential distributions for Ind1 (χ2 = 272.931, df = 14, p < 0.001), Ind2 (χ2 = 711.737; df = 7; p < 0.001) and Ind3 (χ2 = 591.790; df = 8; p < 0.001). In 1,443 (38.1%) of aligned codon positions, missense substitutions were not observed across primate diversification because only a single amino acid was coded in all species. On the other hand, 3,381 (88.9%) aligned codon positions appeared to be under negative selection with SLAC.
Aligned codon positions and disease-associated mutations in humans
Missense substitutions associated with human diseases were identified in 220 aligned codon positions (S3 Table). Twenty-eight aligned codon positions showed mutations classified as “confirmed” and 192 as “reported” by MITOMAP and UNIPROT. The predicted pathogenicity scores of the confirmed disease-associated mutations were significantly higher than those of reported disease-associated mutations (t = 2.956; d.f. = 226; p = 0.003; Fig 2). Kolmogorov-Smirnov tests of Ind1, Ind2 and Ind3 consistently showed that aligned codon positions with confirmed disease-associated mutations were not randomly distributed but concentrated among the most evolutionary conserved positions. On the other hand, Ind1, Ind2 and Ind3 estimates showed that the aligned codon positions with reported disease-associated mutations were not significantly concentrated and were not significantly different from the whole set of aligned codon positions (Fig 3; Table 1). All aligned codon positions with confirmed disease-associated mutations were found to be under negative selection with SLAC, contrary to only 85.5% of aligned codon positions with reported disease-associated mutations, similarly to found for all positions of all mt-genes (88.9%).
Significant difference between means of predicted pathogenicity scores between confirmed disease-associated mutations and reported disease-associated mutations (t = 2.956; d.f. = 226; p = 0.003).
Frequency distributions of Ind 1 (left), Ind2 (middle) Ind3 (right) of three groups of aligned codon positions (all positions, positions with confirmed disease-associated mutations and positions with reported disease-associated mutations).
Aligned codon positions with confirmed disease-associated mutations in humans
Of the 28 aligned codon positions with confirmed disease-associated mutations, 78.6% (n = 22) were invariant in all reference sequences of the primate species herein analyzed. Of the six positions with missense mutations, two were found to be highly variable across primate taxa. This was the case of codon positions 34 and 45 of mt-ND3 with missense substitutions m.10158T>C (S34P) and m.10191T>C (S45P) accounting for confirmed disease-associated mutations with Leigh Disease in humans. In these two positions, serine is the only coded residue in normal humans. These positions, however, were found to be highly variable along primate evolution (Ind334 = 1.442; Ind345 = 0.882), with 20 and 15 non-synonymous substitutions, coding for eight and six amino acids, respectively, albeit none encoding proline. Among the 20 non-synonymous substitutions affecting codon 34 of mt-ND3, 10 resulted in radical changes in amino acid physicochemical properties (affecting up to seven different properties), and among the 15 non-synonymous substitutions affecting codon 45 of mt-ND3, six resulted in radical changes in amino acid physicochemical properties (affecting up to 12 different properties).
Confirmed disease-associated mutations in humans were found to occupy two slightly variable aligned codon positions, but with amino acid physicochemical properties highly conserved along primate evolution. This was the case of the m.8528T>C substitution in the partially overlapping codons 1 of mt-ATP6 and 54 of mt-ATP8 resulting in M1T and W54R, respectively, and associated with neuromuscular disorder and infantile cardiomyopathy (Fig 4). Along the primate diversification, the amino acid substitutions in these residue positions did not result in any radical change in physicochemical properties. A phylogenetic reconstruction indicated that ATG was the ancestral codon 1 of mt-ATP6 and TGA the ancestral codon of codon 54 of mt-ATP8. Two independent m.8527A>G substitutions resulted in identical amino acid replacements (M1V) in the first residue of the ATP synthase F0-a polypeptide in the Wallace's tarsier (Tarsius wallacei) and Hubbard's sportive lemur (Lepilemur hubbardorum), while m.8527A>G in codon 53 of mt-ATP8 was a samesense mutation.
Comparison between a confirmed disease-associated missense mutation in humans (m.8528T>C; left) and a mutation in an adjacent region (m.8527A>G, right) in two non-human primates, Tarsius wallacei and Lepilemur hubbardorum. These mutations affected nucleotide positions in overlapping coding regions of mt-ATP6 (in blue, above) and mt-ATP8 (in red, below). m.8528T>C is a missense mutation in both genes, while m.8527A>G is a missense mutation only in mt-ATP6.
Moreover, in the aligned codon position 64 of mt-ND6, three different missense mutations (m.14482C>A; M64I, m.14482C>G; M64I, and m.14484T>C; M64V) resulting in substitutions of isoleucine and valine for methionine, have been confirmed to be associated with LHON in humans. A phylogenetic reconstruction indicated that codon 64 has been variable along primate evolution, with TTG being the most ancestral triplet, coding for leucine, and with all five other leucine codons being present along different lineages. Methionine is therefore a derived trait (L64M) resulting from two different, independent missense mutations, one in the fat-tailed dwarf lemur (Cheirogaleus medius) lineage (CTG→ATG) and another in the ancestor of all anthropoid (simiiforme) primates (TTG→ATG). Within this latter lineage, methionine has been coded by both ATG and ATA. The two non-synonymous mutations affecting codon 64 of mt-ND6 did not result in any radical change in physicochemical properties.
In none of the four above mentioned codon positions, specific amino acids from confirmed disease-associated mutations were found in the non-human primate species herein studied. This, however, was not the case of both mt-ND1 missense mutations m.3635G>A (S110N) and m.3700G>A (A132T), identified as confirmed disease-associated mutations with LHON in humans. Asparagine and threonine, resulting from these mutations were deduced from the reference sequence of the sooty mangabey (Cercocebus atys) and the Bornean orangutan (Pongo pygmaeus), respectively, apparently without evidence of a pathologic condition in these species. Furthermore, in another available mt-ND1 sequence of C. atys (KP090062) asparagine was also found at position 110, showing that this substitution was not a peculiarity of the reference specimen. Similarly, in all four other available mt-ND1 sequences of P. pygmaeus (NC_001646, X97713.1, X97713.1 and X97713.1) threonine was coded by codon 132, while in four other sequences of the Sumatran orangutan (Pongo abelli; X97713.1, X97713.1, X97713.1 and X97713.1) this position coded for alanine (Fig 5A). The substitution of serine by asparagine, the only amino acid change in residue position 110 of NADH1, did not result in radical changes in physicochemical properties. Conversely, two of the three amino acid substitutions affecting residue 132 of NADH1 resulted in radical changes in physicochemical properties (affecting up to three different properties); the substitution of alanine by threonine, as observed in the linage leading to P. pygmaeus, corresponded to a radical change in the tendencies of forming alpha-helices.
A. Confirmed, disease-associated A132T mutation in mt-ND1 showing its occurrence in the Bornean orangutans (Pongo pygmaeus). B. Reported, disease-associated, A64S mutation in mt-ND1, showing that all hominids show serine in residue 64 of NADH1, except modern humans and closely related Neanderthals.
Aligned codon positions with reported disease-associated mutations in the human
Among the 192 aligned codon positions with reported disease-associated mutations in the human, only 35.9% (n = 69) did not show missense mutations, similarly to the proportion of codon positions without missense substitutions of all mt-genes (38.114%). In the other 123 aligned codon positions, 88 (71.5%) mutations were found to code for amino acids associated with human disease in at least one non-human primate species (Table 2; S3 Table and S2 Fig). This was the case of a missense mutation in mt-CO1 codon 4 (m.5913G>A; D4N) substituting asparagine for aspartic acid, and associated with human prostate cancer and hypertension. Asparagine was estimated to be the ancestral primate residue for this position with SLAC, and was also present in 119 other primate species herein analyzed. Similarly, a missense mutation in codon 159 of mt-ND5 (m.12811T>C; Y159H) substituting histidine for tyrosine was reported to be a likely LHON factor . However, the presence of histidine in NADH5 residue position 159 independently appeared 11 times along primate diversification, in 70 different species herein analyzed.
Several amino acids coded for by reported disease-associated mutations in humans were also coded for by 76 aligned codon positions in at least one Old World primate (catarrhine) species. Similarly, this was observed at 36 aligned codon positions, 24 in great ape species and eight in species of the genus Homo (Table 2 and S3 Table). The mean number of independent appearances, per codon position, of amino acids resulting from reported disease-associated mutations in the human equaled 1.7 (min = 0; max = 17; S.D. = 2.844), and the mean number of primate species with amino acid variants resulting from reported disease-associated mutations in the human equaled 11.1 per codon position (min = 0; max = 121; S.D. = 23.780; Fig 6).
Left: Number of independent occurrences, per codon position, of amino acids accounting for reported disease-associated mutations in the human across primate phylogeny. Right: Number of species with amino acids accounting for reported disease-associated mutations in the human.
Sixty-nine aligned codon positions where amino acid replacements resulted in reported disease-associated mutations were found to be completely conserved (with Ind1 = 1 and Ind2 = 0) and under negative selection along primate evolution (Table 3). Most of these mutations in these codons were associated with MELAS (n = 14), LHON (n = 13), Leigh Disease (n = 9), and EXIT (n = 5). Non-synonymous substitutions that did not result in radical changes in amino acid properties were found to occur in other 14 aligned codon positions where amino acid replacements resulted in reported disease-associated mutations in humans (Table 4).
Well-resolved phylogenies are necessary for tracing character evolution in morphological, physiological, ecological, biogeographic, behavioral and molecular studies . This can be achieved by the continuous input of primate genomic data (nuclear and mitochondrial) from which more robust and reliable phylogenetic topologies can be reconstructed [45–47]. The ML topology herein provided (S1 Fig), was largely consistent with previous ones [40–42,46–53] and was valuable for mapping nucleotide and amino acid changes and assessing the evolutionary conservation of codons and amino acids.
Evolutionary constraints on mt-genes
Our findings, based on the frequency of the dN/dS ratio, showed that a large fraction of codon positions (88.9%) had been subjected to negative selection along primate radiation, in agreement with the postulation that mt-genes evolved under a strong purifying selection [54,55]. Codons under negative selection were particularly concentrated in the highly-conserved genes of complex IV (mt-CO1, mt-CO2 and mt-CO3) and mt-ND4L, while mt-ATP8 and mt-ND6 showed the lowest proportions of codons under negative selection. Reports on several mammalian orders, amphibians, birds and reptiles also found Complex IV and mt-ND4L to be highly conserved and mt-ATP8 and mt-ND6 as the most variable mt-genes [8,56,57]. This divergent pattern of molecular conservation and variation has been apparently constant at least from the beginning of tetrapod diversification. Furthermore, gene size and evolutionary conservation were found to be positively correlated as was the case of mt-CO1, in agreement with the proposition that the length of coding region of a gene affects its evolutionary rate [58–60] and is positively correlated with essentiality .
The four genes with the highest concentration of codons under negative selection did not show indels, while, on the other hand, mt-ND6 showed 20 indels that took place as independent evolutionary events. The high conservation of complex IV genes and mt-ND4L indicated that mutations affecting these genes were likely to be more drastically adverse for fitness than mutations in other genes. In fact, complex IV genes and mt-ND4L did not show any mutation corresponding to a confirmed, disease-associated mutation in humans. Altogether, these findings suggested that these missense mutations have been consistently eliminated along the evolution of the non-human primates herein studied, while other missense mutations, corresponding to reported, disease-associated mutations in humans have been maintained.
Disease-associated mt-mutations in humans within an evolutionary context in the primates
Several mt-mutations have been associated with human diseases  mainly with common adult forms of inherited neurological disorders . Most confirmed mt-mutations affect tRNA genes and the translational efficiency of mitochondria impairing the proper functioning of four OXPHOS enzyme complexes . Conversely, the effects of mt-mutations on protein-coding genes are more restricted and more difficult to be clearly identified  and mutations in only 28 codon positions are generally considered pathogenic by the mitochondrial research community (MITOMAP database ).
In this study, confirmed, disease-associated mt-mutations showed higher scores of predicted pathogenicity than reported ones and all of them were located in aligned codon positions under strong negative selection, and almost all in highly conserved positions along primate evolution (Table 3). This finding suggested that similar mutations in other primate species would be likely to be pathogenic and impair fitness. In fact, most of these 28 codons were responsible for critical regions of protein domains directly involved in OXPHOS. This was the case of the mt-ATP6 region encoding a transmembrane domain of the ATP synthase F0 subunit involved in proton translocation . One mutation (m.8993T>C) affecting this region has been etiologically associated with NARP, Leigh Disease, MILS and other diseases in humans . Similarly, m.15579A>G, affecting one mt-CYB region encoding a protein segment exposed to the intermembrane space bearing an ubiquinone binding site has been etiologically associated with multisystem disorder and EXIT in humans [3,66,67].
Only two confirmed disease-associated mutations, both in mt-ND3 (m.10158T>C, S34P; m.10191T>C, S45P), were located in highly variable codon positions, flanking a highly conserved mt-ND3 loop domain (S3 Table and S2 Fig). Previous predictions about their pathogenicity have been contradictory [13,68] while a previous report has indicated that m.10158T>C affected an extremely conserved amino acid position among mammals , a finding that was not corroborated in the primates herein studied. A previous report proposed that a hydrophobic residue like proline at position 45 would disrupt folding of the NADH-ubiquinone oxidoreductase chain 3 and showed that only hydrophilic amino acids were present across different taxa in this position . Missense mt-mutations, however, resulting in residues with manifold physicochemical properties, including hydrophobic amino acids other than proline have been identified in this study. It is therefore likely that mutations at this site have not been severely detrimental for most primates or, alternatively, a proline-45 residue may be specifically disruptive for folding of NADH-ubiquinone oxidoreductase chain 3.
Only two confirmed disease-associated missense mutations in humans, both in mt-ND1 (m.3635G>A; S110N and m.3700G>A; A132T), showed similar counterparts in other primates, in apparently healthy sooty mangabeys (Cercocebus atys) and Bornean orangutans (Pongo pygmaeus), respectively [71,72]. This latter mutation was located in a highly conserved mt-ND1 region, comprising approximately 27 codons, encoding a loop domain exposed to the intermembrane space, a critical region for NADH catalytic function [73,74]. The presence of a polar amino acid like threonine at position 132 in Bornean orangutans was unexpected, since this position appeared to be invariably occupied by alanine, a non-polar amino acid, in several mammals other than primates (mouse, rabbit, horse, cattle, whale, seal, cat and platypus), other animals (nematodes, sea-urchins, fish and chickens), and plants; the only exception being found in two fungi species with the polar amino acid serine . Conversely, m.3635G>A (S110N) occurred in an adjacent region to the loop domain, encoding a slightly more variable transmembrane domain, presumably without a critical relevance for NADH function .
Occurrence of some pathogenic alleles for humans in healthy gorillas, chimpanzees and macaques has been documented for several nuclear genes [75,76]. It is not completely understood how these alleles might be innocuous for non-human primates; it is speculated that compensatory changes in other genome regions or environmental differences might neutralize their effects [22,76]. A third explanation, postulating the existence of different physiological demands between species, might explain why an allele responsible for low ATP production might be severely detrimental to a species with a high metabolic demand, like Homo sapiens, but not to species with low metabolisms like the slow lories (genus Nycticebus; ). Orangutans show an extremely low rate of energy use, even lower relative to body mass, than nearly any eutherian mammal ever studied . This may explain why the S110N substitution in the NADH-ubiquinone oxidoreductase chain 1, affecting a highly conserved polypeptide domain, might be tolerated in orangutans but not in other mammals.
Interestingly, the mt-ATP6 start codon changed from ATG (coding for methionine) to GTG (coding for valine) in two non-anthropoid (strepsirrhine) species, the Wallace’s tarsier (Tarsius wallacei) and the Hubbard's sportive lemur (Lepilemur hubbardorum). In humans, the first, GTG, codon of mt-ATP6 does not completely impair gene transcription and translation, but is reportedly associated with LHON . Unfortunately, it was not possible to access whether the GTG condition might be a common finding in these two species due to dearth of mt-ATP6 data.
Differently from confirmed disease-associated mutations, the reported ones were not especially concentrated in highly conserved codon positions, suggesting that an important fraction of them have not effectively reduced organismal fitness along primate diversification. Additionally, they showed lower predicted pathogenicity scores than confirmed disease-associated mutations and 45% of them also showed similar substitutions in other primates, including very closely related species, like great apes and other species of the genus Homo. This was the case of the m.3496G>T (A64S), affecting mt-ND1 of Japanese families and associated with LHON . Codon 64 was highly variable along primate evolution, but more recently coded for serine in the ancestral hominid and all descendant hominid species except Homo sapiens and H. neanderthalensis in which an S64A mutation occurred. The m.3496G>T (A64S) found in Japanese families can thus be interpreted as an evolutionary reversion (Fig 5B).
In fact, all reported disease-associated mutations in humans with similar substitutions in the reference sequence of Homo species other than Homo sapiens (m.6150G>A, m.8021A>G, m.15077G>A, m.3421G>A, m.3496G>T, m.4659G>A, m.10398A>G, m.11253T>C and m.13528A>G) occurred in highly variable aligned codon positions. These findings indicated that they must have been someway compatible with adequate fitness in our congeneric extinct relatives.
It must be noted that the human reference sequence corresponds to a single individual of European descent with some rare polymorphisms . It is likely that the reference sequences of non-human primate species might contain nucleotide variants that do not code for the most common amino acids in these species but population data on their mitochondrial genome are not presently available. However, a phylogenetic approach might indicate the persistence of variants along phyletic diversification, suggesting that they were probably frequent in ancestral populations . This was the case of most reported, disease-associated amino acid variants with counterparts in non-human primates (n = 68; 75.6%) which were found to be transmitted to more than one species.
Several methods have been used for predicting the pathogenicity and severity of mutations in humans [82–86]; some of them applied to the mitochondrial genome . These frequently considered evolutionary data predicted functional properties and protein structure [20,21], although some of these reports did not include topological analyses of phylogenies like determining ancestry, number and independence of evolutionary events and their polarity. Our findings showed that several reported disease-associated mutations in humans appeared to be recurrent along primate diversification in closely related species that diverged from humans less than 80 million years ago. It is therefore likely that the reported disease-associated mutations in humans in extremely conserved aligned codon positions along primate evolution are more likely to be actually pathogenic. Fourteen associated mutations with MELAS, 13 with LHON, 9 with LD, 5 with EXIT, and 30 with other pathologies fitted this criterion that might be helpful for distinguishing effectively pathogenic mutations from polymorphisms. Similarly, 14 other reported disease-associated mutations in humans occurred in variable residue positions along primate evolution resulting in amino acid changes with similar physicochemical properties. Thirteen of these positions were found to be under negative selection with SLAC (S3 Table), suggesting that they might be critically relevant for mitochondrial function.
S1 Fig. Maximum likelihood phylogenetic reconstruction of primate mitogenomes.
Numbers close to nodes indicate aLRT support estimates. Nodes without numbers showed aLRT estimates = 100.
S2 Fig. Graphic representation of Information on codon positions.
Analyzed aligned codon position respective to Ind1, Ind2 and Ind3 estimates, negative selection, disease-associated mutations, number of species sharing amino acids resulting from disease-associated mutations, and number of independent occurrences of amino acids resulting from disease-associated mutations.
S1 File. Alignment of 13 concatenated mt-genes.
S2 File. Newick tree file resulting from maximum likelihood phylogenetic reconstruction of primate mitogenomes.
S1 Table. Genbank accession of mitogenomes and list of species.
S2 Table. List of independent events of codon indels in mt-genes along primate diversification.
Work supported by Conselho Nacional de Desenvolvimento, CNPq-Brazil, grant 303306/2010-6. WCT was supported by a post-doctoral fellowship from Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES-Brazil) and Fundação de Amparo à Pesquisa do Estado do Rio de Janeiro, Brazil (FAPERJ; 209101/E_44/2014).
- Conceptualization: HNS.
- Data curation: WCT.
- Formal analysis: WCT.
- Funding acquisition: HNS.
- Investigation: HNS WCT.
- Methodology: HNS WCT.
- Project administration: HNS.
- Resources: HNS.
- Software: WCT.
- Supervision: HNS.
- Validation: HNS WCT.
- Visualization: WCT.
- Writing – original draft: HNS WCT.
- Writing – review & editing: HNS WCT.
- 1. Gissi C, Iannelli F, Pesole G. Evolution of the mitochondrial genome of Metazoa as exemplified by comparison of congeneric species. Heredity (Edinb). 2008;101: 301–320. pmid:18612321
- 2. Wallace DC. A Mitochondrial Paradigm of Metabolic and Degenerative Diseases, Aging, and Cancer: A Dawn for Evolutionary Medicine. Annu Rev Genet. 2005;39: 359–407. pmid:16285865
- 3. Lloyd RE, McGeehan JE. Structural Analysis of Mitochondrial Mutations Reveals a Role for Bigenomic Protein Interactions in Human Disease. PLoS One. 2013;8: e69003. pmid:23874847
- 4. Nabholz B, Ellegren H, Wolf JBW. High Levels of Gene Expression Explain the Strong Evolutionary Constraint of Mitochondrial Protein-Coding Genes. Mol Biol Evol. 2013;30: 272–284. pmid:23071102
- 5. Stewart JB, Freyer C, Elson JL, Larsson N- G. Purifying selection of mtDNA and its implications for understanding evolution and mitochondrial disease. Nat Rev Genet. 2008;9: 657–662. pmid:18695671
- 6. Stewart JB, Freyer C, Elson JL, Wredenberg A, Cansu Z, Trifunovic A, et al. Strong purifying selection in transmission of mammalian mitochondrial DNA. PLoS Biol. 2008;6: 63–71.
- 7. Menezes AN, Viana MC, Furtado C, Schrago CG, Seuánez HN. Positive selection along the evolution of primate mitogenomes. Mitochondrion. 2013;13: 846–851. pmid:23756226
- 8. da Fonseca RR, Johnson WE, O’Brien SJ, Ramos M, Antunes A. The adaptive evolution of the mammalian mitochondrial genome. BMC Genomics. 2008;9: 119. pmid:18318906
- 9. Hassanin A, Ropiquet A, Couloux A, Cruaud C. Evolution of the Mitochondrial Genome in Mammals Living at High Altitude: New Insights from a Study of the Tribe Caprini (Bovidae, Antilopinae). J Mol Evol. 2009;68: 293–310. pmid:19294454
- 10. Tomasco IH, Lessa EP. Two mitochondrial genes under episodic positive selection in subterranean octodontoid rodents. Gene. 2014;534: 371–378. pmid:24113079
- 11. Luo Y, Yang X, Gao Y. Mitochondrial DNA response to high altitude: A new perspective on high-altitude adaptation. Mitochondrial DNA. 2013;24: 313–319. pmid:23350576
- 12. Fan W, Waymire KG, Narula N, Li P, Rocher C, Coskun PE, et al. A Mouse Model of Mitochondrial Disease Reveals Germline Selection Against Severe mtDNA Mutations. Science. 2008;319: 958–962. pmid:18276892
- 13. Pereira L, Soares P, Radivojac P, Li B, Samuels DC. Comparing phylogeny and the predicted pathogenicity of protein variations reveals equal purifying selection across the global human mtDNA diversity. Am J Hum Genet. 2011;88: 433–439. pmid:21457906
- 14. Schon EA. Mitochondrial genetics and disease. Trends Biochem Sci. 2000;25: 555–560. pmid:11084368
- 15. DiMauro S, Schon EA. Mitochondrial Respiratory-Chain Diseases. N Engl J Med. 2003;348: 2656–2668. pmid:12826641
- 16. Wallace DC. Mitochondrial DNA mutations in disease and aging. Environ Mol Mutagen. 2010;51: 440–450. pmid:20544884
- 17. Lott MT, Leipzig JN, Derbeneva O, Xie HM, Chalkia D, Sarmady M, et al. mtDNA Variation and Analysis Using Mitomap and Mitomaster. Current Protocols in Bioinformatics. Hoboken, NJ, USA: John Wiley & Sons, Inc.; 2013. p. 1.23.1–1.23.26.
- 18. Tuppen HAL, Blakely EL, Turnbull DM, Taylor RW. Mitochondrial DNA mutations and human disease. Biochim Biophys Acta—Bioenerg. 2010;1797: 113–128.
- 19. Echave J, Spielman SJ, Wilke CO. Causes of evolutionary rate variation among protein sites. Nat Rev Genet. 2016;17: 109–121. pmid:26781812
- 20. Kumar P, Henikoff S, Ng PC. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat Protoc. 2009;4: 1073–1081. pmid:19561590
- 21. Li B, Krishnan VG, Mort ME, Xin F, Kamati KK, Cooper DN, et al. Automated inference of molecular mechanisms of disease from amino acid substitutions. Bioinformatics. 2009;25: 2744–2750. pmid:19734154
- 22. Magalhães JP. Human Disease-Associated Mitochondrial Mutations Fixed in Nonhuman Primates. J Mol Evol. 2005;61: 491–497. pmid:16132471
- 23. Edgar RC. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics. 2004;5: 113. pmid:15318951
- 24. Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Mol Biol Evol. 2013;30: 2725–2759. pmid:24132122
- 25. Gouy M, Guindon S, Gascuel O. SeaView Version 4: A Multiplatform Graphical User Interface for Sequence Alignment and Phylogenetic Tree Building. Mol Biol Evol. 2010;27: 221–224. pmid:19854763
- 26. Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O. New Algorithms and Methods to Estimate Maximum-Likelihood Phylogenies: Assessing the Performance of PhyML 3.0. Syst Biol. 2010;59: 307–321. pmid:20525638
- 27. Bazinet AL, Zwickl DJ, Cummings MP. A gateway for phylogenetic analysis powered by grid computing featuring GARLI 2.0. Syst Biol. 2014;63: 812–818. pmid:24789072
- 28. Keane TM, Creevey CJ, Pentony MM, Naughton TJ, Mclnerney JO. Assessment of methods for amino acid matrix selection and their use on empirical data shows that ad hoc assumptions for choice of matrix are not justified. BMC Evol Biol. 2006;6: 29. pmid:16563161
- 29. Rodríguez F, Oliver JL, Marín A, Medina JR. The general stochastic model of nucleotide substitution. J Theor Biol. 1990;142: 485–501. pmid:2338834
- 30. Delport W, Poon AFY, Frost SDW, Kosakovsky Pond SL. Datamonkey 2010: a suite of phylogenetic analysis tools for evolutionary biology. Bioinformatics. 2010;26: 2455–2457. pmid:20671151
- 31. Kosakovsky Pond SL, Frost SDW. Not So Different After All: A Comparison of Methods for Detecting Amino Acid Sites Under Selection. Mol Biol Evol. 2005;22: 1208–1222. pmid:15703242
- 32. Celniker G, Nimrod G, Ashkenazy H, Glaser F, Martz E, Mayrose I, et al. ConSurf: Using evolutionary data to raise testable hypotheses about protein function. Isr J Chem. 2013;53: 199–206.
- 33. Ashkenazy H, Abadi S, Martz E, Chay O, Mayrose I, Pupko T, et al. ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules. Nucleic Acids Res. 2016;44: 1–7.
- 34. Mayrose I, Graur D, Ben-Tal N, Pupko T. Comparison of site-specific rate-inference methods for protein sequences: Empirical Bayesian methods are superior. Mol Biol Evol. 2004;21: 1781–1791. pmid:15201400
- 35. Pupko T, Bell RE, Mayrose I, Glaser F, Ben-Tal N. Rate4Site: an algorithmic tool for the identification of functional regions in proteins by surface mapping of evolutionary determinants within their homologues. Bioinformatics. 2002;18: S71–S77. pmid:12169533
- 36. McClellan DA, Ellison DD. Assessing and improving the accuracy of detecting protein adaptation with the TreeSAAP analytical software. Int J Bioinform Res Appl. 2010;6: 120–133. pmid:20223735
- 37. Woolley S, Johnson J, Smith MJ, Crandall KA, McClellan DA. TreeSAAP: Selection on Amino Acid Properties using phylogenetic trees. Bioinformatics. 2003;19: 671–672. pmid:12651734
- 38. UniProt Consortium. The universal protein resource (UniProt). Nucleic Acids Res. 2008;36: D190–5. pmid:18045787
- 39. Magrane M, UniProt Consortium. UniProt Knowledgebase: A hub of integrated protein data. Database. 2011;2011: 1–13.
- 40. Perelman P, Johnson WE, Roos C, Seuánez HN, Horvath JE, Moreira MAM, et al. A molecular phylogeny of living primates. PLoS Genet. 2011;7: 1–17.
- 41. Finstermeier K, Zinner D, Brameier M, Meyer M, Kreuz E, Hofreiter M, et al. A Mitogenomic Phylogeny of Living Primates. PLoS One. 2013;8: e69504. pmid:23874967
- 42. Pozzi L, Hodgson JA, Burrell AS, Sterner KN, Raaum RL, Disotell TR. Primate phylogenetic relationships and divergence dates inferred from complete mitochondrial genomes. Mol Phylogenet Evol. 2014;75: 165–183. pmid:24583291
- 43. Cai W, Fu Q, Zhou X, Qu J, Tong Y, Guan MX. Mitochondrial variants may influence the phenotypic manifestation of Leber’s hereditary optic neuropathy-associated ND4 G11778A mutation. J Genet Genomics. 2008;35: 649–655. pmid:19022198
- 44. Paradis E. An Introduction to the Phylogenetic Comparative Method. Modern Phylogenetic Comparative Methods and Their Application in Evolutionary Biology. Berlin, Heidelberg: Springer Berlin Heidelberg; 2014. pp. 3–18.
- 45. Pecon-Slattery J. Recent advances in primate phylogenomics. Annu Rev Anim Biosci. 2014;2: 41–63. pmid:25384134
- 46. Springer MS, Meredith RW, Gatesy J, Emerling CA, Park J, Rabosky DL, et al. Macroevolutionary Dynamics and Historical Biogeography of Primate Diversification Inferred from a Species Supermatrix. PLoS One. 2012;7.
- 47. Fabre PH, Rodrigues A, Douzery EJP. Patterns of macroevolution among Primates inferred from a supermatrix of mitochondrial and nuclear DNA. Mol Phylogenet Evol. 2009;53: 808–825. pmid:19682589
- 48. Pozzi L, Disotell TR, Masters JC. A multilocus phylogeny reveals deep lineages within African galagids (Primates: Galagidae). BMC Evol Biol. 2014;14: 72. pmid:24694188
- 49. Schrago CG, Menezes AN, Furtado C, Bonvicino CR, Seuanez HN. Multispecies Coalescent Analysis of the Early Diversification of Neotropical Primates: Phylogenetic Inference under Strong Gene Trees/Species Tree Conflict. Genome Biol Evol. 2014;6: 3105–3114. pmid:25377940
- 50. Kistler L, Ratan A, Godfrey LR, Crowley BE, Hughes CE, Lei R, et al. Comparative and population mitogenomic analyses of Madagascar’s extinct, giant “subfossil” lemurs. J Hum Evol. 2015;79: 45–54. pmid:25523037
- 51. Wang XP, Yu L, Roos C, Ting N, Chen CP, Wang J, et al. Phylogenetic Relationships among the Colobine Monkeys Revisited: New Insights from Analyses of Complete mt Genomes and 44 Nuclear Non-Coding Markers. PLoS One. 2012;7: e36274. pmid:22558416
- 52. Aristide L, Rosenberger AL, Tejedor MF, Perez SI. Modeling lineage and phenotypic diversification in the New World monkey (Platyrrhini, Primates) radiation. Mol Phylogenet Evol. 2015;82: 375–385. pmid:24287474
- 53. Liedigk R, Roos C, Brameier M, Zinner D. Mitogenomics of the Old World monkey tribe Papionini. BMC Evol Biol. 2014;14: 176. pmid:25209564
- 54. Meiklejohn CD, Montooth KL, Rand DM. Positive and negative selection on the mitochondrial genome. Trends Genet. 2007;23: 259–263. pmid:17418445
- 55. Popadin KY, Nikolaev SI, Junier T, Baranova M, Antonarakis SE. Purifying Selection in Mammalian Mitochondrial Protein-Coding Genes is Highly Effective and Congruent with Evolution of Nuclear Genes. Mol Biol Evol. 2013;30: 347–355. pmid:22983951
- 56. Eo SH, DeWoody JA. Evolutionary rates of mitochondrial genomes correspond to diversification rates and to contemporary species richness in birds and reptiles. Proc R Soc B Biol Sci. 2010;277: 3587–3592.
- 57. Xia Y, Zheng Y, Miura I, Wong PBY, Murphy RW, Zeng X. The evolution of mitochondrial genomes in modern frogs (Neobatrachia): nonadaptive evolution of mitochondrial genome reorganization. BMC Genomics. 2014;15: 691. pmid:25138662
- 58. Liao BY. Impacts of Gene Essentiality, Expression Pattern, and Gene Compactness on the Evolutionary Rate of Mammalian Proteins. Mol Biol Evol. 2006;23: 2072–2080. pmid:16887903
- 59. Lipman DJ, Souvorov A, Koonin E V., Panchenko AR, Tatusova TA. The relationship of protein conservation and sequence length. BMC Evol Biol. 2002;2: 20. pmid:12410938
- 60. Lemos B, Bettencourt BR, Meiklejohn CD, Hartl DL. Evolution of proteins and gene expression levels are coupled in Drosophila and are independently associated with mRNA abundance, protein length, and number of protein-protein interactions. Mol Biol Evol. 2005;22: 1345–1354. pmid:15746013
- 61. Shin S- H, Choi SS. Lengths of coding and noncoding regions of a gene correlate with gene essentiality and rates of evolution. Genes Genomics. 2015;37: 365–374.
- 62. Zuo L, Zhou T, Chuang C. The Consequences of Damaged Mitochondrial DNA. In: Buhlman LM, editor. Mitochondrial Mechanisms of Degeneration and Repair in Parkinson’s Disease. Cham: Springer International Publishing; 2016. pp. 49–61.
- 63. Gorman GS, Schaefer AM, Ng Y, Gomez N, Blakely EL, Alston CL, et al. Prevalence of nuclear and mitochondrial DNA mutations related to adult mitochondrial disease. Ann Neurol. 2015;77: 753–759. pmid:25652200
- 64. Jonckheere AI, Smeitink JAM, Rodenburg RJT. Mitochondrial ATP synthase: architecture, function and pathology. J Inherit Metab Dis. 2012;35: 211–225. pmid:21874297
- 65. Saneto R, Ruhoy I. The genetics of Leigh syndrome and its implications for clinical practice and risk management. Appl Clin Genet. 2014;7: 221–234. pmid:25419155
- 66. Hashimoto T, Morita H, Tada T, Maruyama T, Yamada Y, Ikeda S-I. Neuronal activity in the globus pallidus in Chorea caused by striatal lacunar infarction. Ann Neurol. 2001;50: 528–531. pmid:11601504
- 67. Ghelli A, Tropeano CV, Calvaruso MA, Marchesini A, Iommarini L, Porcelli AM, et al. The cytochrome b p.278Y>C mutation causative of a multisystem disorder enhances superoxide production and alters supramolecular interactions of respiratory chain complexes. Hum Mol Genet. 2013;22: 2141–2151. pmid:23418307
- 68. Mitchell AL, Elson JL, Howell N, Taylor RW, Turnbull DM. Sequence variation in mitochondrial complex I genes: mutation or polymorphism? J Med Genet. 2006;43: 175–179. pmid:15972314
- 69. Crimi M, Papadimitriou A, Galbiati S, Palamidou P, Fortunato F, Bordoni A, et al. A New Mitochondrial DNA Mutation in ND3 Gene Causing Severe Leigh Syndrome with Early Lethality. Pediatr Res. 2004;55: 842–846. pmid:14764913
- 70. Taylor RW, Singh-Kler R, Hayes CM, Smith PEM, Turnbull DM. Progressive mitochondrial disease resulting from a novel missense mutation in the mitochondrial DNA ND3 gene. Ann Neurol. 2001;50: 104–107. pmid:11456298
- 71. Achilli A, Iommarini L, Olivieri A, Pala M, Hooshiar Kashani B, Reynier P, et al. Rare Primary Mitochondrial DNA Mutations and Probable Synergistic Variants in Leber’s Hereditary Optic Neuropathy. PLoS One. 2012;7: e42242. pmid:22879922
- 72. Bi R, Zhang AM, Jia X, Zhang Q, Yao YG. Complete mitochondrial DNA genome sequence variation of Chinese families with mutation m.3635G>A and Leber hereditary optic neuropathy. Mol Vis. 2012;18: 3087–3094. pmid:23304069
- 73. Bridges HR, Birrell JA, Hirst J. The mitochondrial-encoded subunits of respiratory complex I (NADH:ubiquinone oxidoreductase): identifying residues important in mechanism and disease. Biochem Soc Trans. 2011;39: 799–806. pmid:21599651
- 74. Sinha PK, Torres-Bacete J, Nakamura-Ogiso E, Castro-Guerrero N, Matsuno-Yagi A, Yagi T. Critical roles of subunit NuoH (ND1) in the assembly of peripheral subunits with the membrane domain of Escherichia coli NDH-1. J Biol Chem. 2009;284: 9814–9823. pmid:19189973
- 75. Rogers J, Gibbs RA. Comparative primate genomics: emerging patterns of genome content and dynamics. Nat Rev Genet. 2014;15: 347–359. pmid:24709753
- 76. Scally A, Dutheil JY, Hillier LW, Jordan GE, Goodhead I, Herrero J, et al. Insights into hominid evolution from the gorilla genome sequence. Nature. 2012;483: 169–175. pmid:22398555
- 77. Pontzer H, Raichlen DA, Shumaker RW, Ocobock C, Wich SA. Metabolic adaptation for low energy throughput in orangutans. Proc Natl Acad Sci. 2010;107: 14048–14052. pmid:20679208
- 78. Dubot A, Godinot C, Dumur V, Sablonnière B, Stojkovic T, Cuisset J, et al. GUG is an efficient initiation codon to translate the human mitochondrial ATP6 gene. Biochem Biophys Res Commun. 2004;313: 687–693. pmid:14697245
- 79. Matsumoto M, Hayasaka S, Kadoi C, Hotta Y, Fujiki K, Fujimaki T, et al. Secondary mutations of mitochondrial DNA in Japanese patients with Leber’s hereditary optic neuropathy. Ophthalmic Genet. 1999;20: 153–160. pmid:10520236
- 80. Turnbull DM, Andrews RM, Kubacka I, Chinnery PF, Lightowlers RN, Howell N. Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nat Genet. 1999;23: 147–147. pmid:10508508
- 81. Levin L, Zhidkov I, Gurman Y, Hawlena H, Mishmar D. Functional recurrent mutations in the human mitochondrial phylogeny: Dual roles in evolution and disease. Genome Biol Evol. 2013;5: 876–890. pmid:23563965
- 82. Thusberg J, Olatubosun A, Vihinen M. Performance of mutation pathogenicity prediction methods on missense variants. Hum Mutat. 2011;32: 358–368. pmid:21412949
- 83. Niroula A, Vihinen M. Predicting Severity of Disease-Causing Variants. Hum Mutat. 2017; 1–8.
- 84. Ng PC, Henikoff S. Predicting Deleterious Amino Acid Substitutions. Genome Res. 2001;11: 863–874. pmid:11337480
- 85. Zeng S, Yang J, Chung BHY, Lau YL, Yang W. EFIN: predicting the functional impact of nonsynonymous single nucleotide polymorphisms in human genome. BMC Genomics. 2014;15: 455. pmid:24916671
- 86. Fleming MA, Potter JD, Ramirez CJ, Ostrander GK, Ostrander EA. Understanding missense mutations in the BRCA1 gene: An evolutionary approach. Proc Natl Acad Sci. 2003;100: 1151–1156. pmid:12531920