The acquisition of mitochondria was a key event in eukaryote evolution. The aim of this study was to identify homologues of the components of the mitochondrial protein import machinery in the brown alga Ectocarpus and to use this information to investigate the evolutionary history of this fundamental cellular process. Detailed searches were carried out both for components of the protein import system and for related peptidases. Comparative and phylogenetic analyses were used to investigate the evolution of mitochondrial proteins during eukaryote diversification. Key observations include phylogenetic evidence for very ancient origins for many protein import components (Tim21, Tim50, for example) and indications of differences between the outer membrane receptors that recognize the mitochondrial targeting signals, suggesting replacement, rearrangement and/or emergence of new components across the major eukaryotic lineages. Overall, the mitochondrial protein import components analysed in this study confirmed a high level of conservation during evolution, indicating that most are derived from very ancient, ancestral proteins. Several of the protein import components identified in Ectocarpus, such as Tim21, Tim50 and metaxin, have also been found in other stramenopiles and this study suggests an early origin during the evolution of the eukaryotes.
Citation: Delage L, Leblanc C, Nyvall Collén P, Gschloessl B, Oudot M-P, Sterck L, et al. (2011) In Silico Survey of the Mitochondrial Protein Uptake and Maturation Systems in the Brown Alga Ectocarpus siliculosus. PLoS ONE 6(5): e19540. https://doi.org/10.1371/journal.pone.0019540
Editor: Edward Newbigin, University of Melbourne, Australia
Received: December 17, 2010; Accepted: March 31, 2011; Published: May 18, 2011
Copyright: © 2011 Delage et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by the French GIS "Institut de la Génomique Marine", the Centre National de Recherche Scientifique, the European Union network of excellence Marine Genomics Europe, the GIS Europôle Mer, and the University Pierre and Marie Curie. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
There is now strong evidence that mitochondria are derived from an α-proteobacterium that was enslaved by an ancestral eukaryotic cell during a single event that occurred about 1.5 billion years ago. The endosymbiotic process involved both the transfer of many of the symbiont's genes to the host nuclear genome and substantial gene loss. As a result, present day mitochondrial genomes encode only a small number of proteins. The nucleus-encoded mitochondrial proteins needed to be imported into the organelle. Consequently, one of the key steps for the establishment of functional mitochondria was the acquisition of an efficient protein import system. Once established, this system not only served to import proteins encoded by endosymbiont-derived nuclear genes, but also permitted the targeting of novel proteins to the organelle. The original bacterial endosymbiont is thought to have had a proteome that consisted of about 630 distinct proteins, whereas present day mitochondria have been estimated to contain about 1,000 to 1,500 different polypeptides.
The mitochondrial protein import machinery is complex because a heterogeneous set of proteins needs to be correctly targeted to several different compartments within the mitochondrion: the matrix, the outer membrane (OM), the inner membrane (IM) and the intermembrane space (IMS). These compartments can be considered to be equivalent to the two cellular membranes and the periplasmic space of bacteria. Over the past ten years, studies using yeast have yielded a great deal of information about the molecular mechanisms of mitochondrial protein import. This process involves the cooperation of five membrane and three IMS complexes . The major components of this machinery are conserved in both opisthokonts (animals and fungi) ,  and plantae , , , indicating a very ancient origin . Five translocase complexes are involved in the import of proteins through the mitochondrial membranes and insertion of proteins into the phospholipid bilayer. The Transporter of the Outer Membrane (TOM) and two Transporter of the Inner Membrane complexes (TIM22 and TIM23) are specialized in importing proteins across the membranes. The outer membrane Sorting and Assembly Machinery (SAM) and the inner membrane insertase (OXA) are involved in the insertion of proteins into the membrane.
The TOM complex is the first channel encountered by all proteins that enter mitochondria. The SAM is also located in the OM, where it mediates correct folding of β-barrel proteins. This complex is related to the bacterial β-barrel assembly machinery (BAM), and hence presumably of bacterial origin . SAM is also known as the complex for Topogenesis of mitochondrial Outer membrane β-Barrel proteins (TOB). Unlike the bacterial OM, the mitochondrial OM contains a large number of proteins with an α-helical transmembrane segment suggesting that the ability to insert this type of protein (in addition to β-barrel proteins) evolved in the mitochondrial system . Indeed, it has been demonstrated that the substrate specificity of SAM is not restricted to β-barrel proteins but also includes α-helical proteins .
The two major IM translocases, TIM22 and TIM23, perform similar functions. TIM23 imports soluble matrix proteins and weakly hydrophobic proteins (e.g. proteins containing a single α-helical transmembrane domain), whereas TIM22 mediates the transfer of multi-spanning proteins. Most of the components of TIM22 and TIM23 are probably endosymbiont-derived. The third translocase, OXA, is responsible for the insertion of some polytopic proteins into the IM from the matrix side. Oxa1 is the only component of this machinery identified so far.
Two small TIM protein complexes consisting of Tim9 and Tim10 and of Tim8 and Tim13, respectively, are located in the IMS. They serve as guide complexes (chaperones), stabilising hydrophobic protein precursors in the hydrophilic medium of the IMS during their transit to their target membrane , . Skp performs a similar function in bacteria but is phylogenetically unrelated to the small Tim proteins . Despite its small size, the IMS is an important compartment that contains proteins involved in key functions such as respiration and programmed cell death. The IMS is equivalent to the periplasmic space of prokaryotes and, as in bacterial cells, cysteine residues occur mainly as cystines. In bacteria, thiol oxidation is carried out by the DsbA-DsbB machinery whereas eukaryotes have independently evolved the Mia40-Erv1 system to carry out the same function in mitochondria .
Bacteria possess two distinct systems to export proteins across the IM, the SECretion (SEC) and Twin-Arginine Transport (TAT) systems. The predominant route is the SEC pathway, although a subset of periplasmic proteins (including many that bind redox-active cofactors) is transported by the TAT system. This machinery has been conserved in plastids but it is not yet clear whether mitochondria possess a functional TAT system.
Mitochondrial protein precursors contain specific targeting sequences that allow them to be directed to the correct subcompartment , . Most targeting sequences are short, cleavable, N-terminal, amphipathic helices , , although many membrane proteins possess alternative targeting information within the mature protein . N-terminal targeting peptides are rich in positive, hydrophobic, and hydroxylated amino acids whilst acidic residues are absent. The removal of these cleavable targeting signals is mediated by a variety of proteases. Mitochondria possess five main proteolytic activities that are involved in the maturation of mitochondrial proteins. The five proteases are located in either the IMS or the matrix and include both soluble and membrane-bound enzymes .
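The compositional bias described above can be illustrated with a minimal sketch that tallies residue classes in the N-terminal region of a protein. The class memberships, the 20-residue window and the example sequence are assumptions made for illustration only; this is not one of the predictors used in the study.

```python
# Residue classes named in the text; the exact membership of each class
# is an assumption made for this illustration.
RESIDUE_CLASSES = {
    "positive": set("KR"),
    "hydrophobic": set("AVLIMFW"),
    "hydroxylated": set("ST"),
    "acidic": set("DE"),
}


def nterminal_profile(protein, window=20):
    """Return the fraction of each residue class in the first `window` residues."""
    region = protein.upper()[:window]
    if not region:
        raise ValueError("empty sequence")
    return {
        name: sum(aa in members for aa in region) / len(region)
        for name, members in RESIDUE_CLASSES.items()
    }


# Hypothetical sequence invented for this example: a presequence-like
# N-terminus followed by an acidic stretch outside the window.
example = "MLRRSLARKASTLLRSAAKT" + "DEGWVDEPLE"
profile = nterminal_profile(example)
```

A region that scores high for positive, hydrophobic and hydroxylated residues and zero for acidic residues is consistent with the description above, but composition alone does not establish that a sequence is a functional targeting peptide.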
In this study we searched the genome of the brown alga Ectocarpus siliculosus for genes that are predicted to encode components of mitochondrial protein uptake and maturation systems and the complement of import/maturation proteins in Ectocarpus was compared with those of other stramenopiles and species from other eukaryotic supergroups. Previous studies of the evolutionary history of the mitochondrial import machinery have concentrated on opisthokonts, green lineage members and a small number of protists including Giardia, Trichomonas, trypanosomatids and amoebozoa. The inclusion of a comprehensive analysis of Ectocarpus and other stramenopiles (diatoms and oomycetes) in the current picture of mitochondrial protein import has provided several insights into the evolutionary history of this essential machinery.
Identification and annotation of mitochondrial proteins
Mitochondrial proteins encoded by the Ectocarpus nuclear genome were identified using several different approaches. First, proteins with N-terminal mitochondrial targeting sequences were identified by applying the HECTAR program to the complete set of proteins encoded by the genome. Additional analyses (Tables 1 and S1) were carried out with the mitochondrial localisation predictors listed in Table 2. Note that although most of these programs (including HECTAR) only recognise N-terminal targeting sequences, some, such as MitoPred, Cello and SubLoc, also take into account other features. Second, additional mitochondrial proteins were identified by searching for genes that had been assigned either definition descriptions containing the word mitochondrial or gene ontology annotations with mitochondria as the subcellular localisation during the automatic annotation of the genome sequence (which was based on matches to existing sequence databases). Finally, detailed manual searches were carried out for specific mitochondrial proteins involved in protein import and maturation. During the manual functional annotation process, particular attention was paid to the identification of protein motifs, transmembrane domains, shared structural characteristics and key amino acids as a means to validate protein function (Tables 2, S2 and S3, Figs. S1 and S2).
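When several localisation predictors are run on the same protein, one simple way to summarise their output is to count how many of them call a mitochondrial location. The sketch below assumes that each predictor's result has already been reduced to a plain compartment label; the labels and the example calls are hypothetical, and the majority rule is an illustrative choice rather than the procedure used in this study.

```python
from collections import Counter


def summarise_calls(calls):
    """Return (number of mitochondrial calls, total number of predictors).

    `calls` maps a predictor name to the compartment label it returned.
    """
    counts = Counter(label.lower() for label in calls.values())
    return counts.get("mitochondrion", 0), len(calls)


def majority_mitochondrial(calls):
    """True when more than half of the predictors call a mitochondrial location."""
    n_mito, total = summarise_calls(calls)
    return total > 0 and 2 * n_mito > total


# Hypothetical calls for a single protein (not taken from the paper's tables).
example_calls = {
    "HECTAR": "mitochondrion",
    "MitoPred": "mitochondrion",
    "Cello": "cytoplasm",
    "SubLoc": "mitochondrion",
}
summary = summarise_calls(example_calls)
```

Because the predictors rely on different features, disagreement between them is informative in its own right, so reporting the raw tally alongside any consensus call is usually preferable to reporting the consensus alone.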
Phylogenetic analyses were performed using Phylogeny.fr. Full-length protein alignments were constructed with MUSCLE and were curated by removal of poorly aligned positions using GBlocks (using the lowest stringency parameters), except for the Oxa protein phylogeny, where only gap positions were removed in order to retain a sufficiently large data set. The resulting set of aligned amino acids was used for reconstruction of phylogenetic trees using the maximum likelihood (ML) method implemented in PhyML (v3.0 aLRT). The default substitution model was selected, assuming an estimated proportion of invariant sites and four gamma-distributed rate categories to account for rate heterogeneity across sites. The neighbour-joining (NJ) method was performed with the JTT amino acid substitution matrix using BioNJ. For both the ML and NJ methods, bootstrap analyses of 100 replicates were used to provide confidence estimates for the phylogenetic tree topologies. Finally, Bayesian inference analyses were performed on the Metaxin, Tim21 and Tim50 curated amino acid alignments using the MrBayes program with default parameters.
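The gap-only curation applied to the Oxa alignment can be sketched as follows. This is a minimal illustration that assumes an alignment held as a dictionary of equal-length strings with "-" as the gap character; the function name and the toy alignment are hypothetical, and the sketch does not reproduce GBlocks.

```python
def drop_gap_columns(alignment, gap="-"):
    """Remove every alignment column in which at least one sequence has a gap."""
    lengths = {len(seq) for seq in alignment.values()}
    if len(lengths) != 1:
        raise ValueError("all aligned sequences must have the same length")
    n_columns = lengths.pop()
    # Keep a column only if no sequence carries a gap at that position.
    keep = [
        i for i in range(n_columns)
        if all(seq[i] != gap for seq in alignment.values())
    ]
    return {
        name: "".join(seq[i] for i in keep)
        for name, seq in alignment.items()
    }


# Toy alignment invented for this example.
toy = {"seq1": "MK-LV", "seq2": "MKALV", "seq3": "M-ALV"}
curated = drop_gap_columns(toy)
```

Removing only gapped columns retains more sites than a block-based filter, at the cost of keeping positions that may be poorly aligned.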
Modelling of three dimensional structures and validation
To estimate whether Ectocarpus proteins were structurally similar to known Tom70 orthologues, their sequences were carefully checked using a similar procedure to that employed for the validated model of Blastocystis Tom70 . First, Ectocarpus proteins were analysed with SOPMA to determine their secondary structure composition. Second, opisthokont and Blastocystis Tom70 sequences were aligned with other putative stramenopile sequences that exhibited conservation of the TPR domains (Fig. S3). The alignment of Esi0007_0019 (residues 353–945) and Esi0232_0002 (residues 164–739) with the crystal structure of yeast Tom70 (accession number 2GW1A) was then used to generate a homology model at the SIB modelling server.
The quality of the predicted structural models of the putative Ectocarpus Tom70 proteins was assessed using the ProSa Web Server , . Finally, the predicted structures of Esi0007_0019 and Esi0232_0002 were aligned with the yeast Tom70 template using FATCAT , .
Results and Discussion
The TOM complex
The TOM complex consists of a central, pore-forming protein, Tom40, which is associated with the small Tom7 and possibly other accompanying proteins such as Tom5 and Tom6 (Fig. 1 and Tables 1 and S1). In yeast, several receptors (Tom70, Tom20 and Tom22) mediate the recognition of targeting sequences or internal information in the proteins being imported. Tom70 is responsible for the translocation of presequence-less proteins with multiple membrane-spanning domains, which are subsequently directed to the OM or IM. Tom20 and Tom22 carry out similar functions, preferentially importing polypeptides with N-terminal targeting sequences, but do not share sequence similarity.
Proteins of bacterial origin that have conserved their primary function are shown in brown. Proteins that are derived from bacterial ancestors but have evolved a new function in eukaryotes are shown in green. Proteins that have evolved in the eukaryotes are shown in blue. Other proteins that are present in most opisthokonts or plantae but could not be found in Ectocarpus are coloured in red.
All opisthokont and plant genomes examined so far encode Tom40, Tom22 and Tom7. These three proteins represent the minimum machinery necessary to import proteins from the cytosol into mitochondria and they were all acquired early in mitochondrial evolution . Homologues of Tom40 and Tom7 were identified in Ectocarpus (Fig. S1A,B). Esi0246_0018 resembles the short Tom22 found in plants, lacking the acidic N-terminal cytosolic domain typical of the opisthokont proteins (Fig. S1C) .
No clear homologue of either the opisthokont-type or the plant-type Tom20 receptor was found in the Ectocarpus genome or in other stramenopile genomes. In stramenopiles, it is possible that mitochondrial presequences are recognised by Tom22 rather than Tom20. Tom22 has been reported to share the common signal recognition pathway with Tom20  and both proteins exhibit a chaperone-like activity, stabilising unfolded preprotein substrates . Also, it is important to note that, although the acidic domain of opisthokont Tom22 has been shown to be involved in the recognition of presequences , an abundance of negative charges in this cytosolic domain is not essential for binding or import of mitochondrial precursor proteins, suggesting that other features of the domain are involved . Hence, stramenopile Tom22 proteins could recognise mitochondrial presequences, despite the absence of an acidic domain. Finally, there is evidence, for Tom20, that presequence binding is mediated mainly by hydrophobic rather than ionic interactions . Nevertheless, it cannot be excluded that a highly divergent Tom20 receptor or a novel protein with the same role is present in stramenopiles.
Previous studies have indicated that Tom70 is present in opisthokonts but absent from the plantae and many protozoan lineages. Recently, however, a functional Tom70 gene has been characterised in the stramenopile Blastocystis sp. and homologues are present in other stramenopile genomes, suggesting that Tom70 did not arise within the opisthokont lineage as previously proposed by Chan et al. Interestingly, Ectocarpus possesses a gene (Esi0232_0002) that is homologous with the Blastocystis sp. sequence, plus a second Tom70-like gene, Esi0007_0019, both of which share several characteristics with Tom70 including strong sequence similarities with opisthokont Tom70 proteins. Like Tom70, both proteins are predicted to possess eleven distinct TPR domains and an N-terminal transmembrane segment (Figs. 2A and S3A). Moreover, Esi0007_0019 and Esi0232_0002 could be superimposed on the crystal structure model of yeast Tom70 (Figs. 2B and S3B). A high proportion of α-helical structure was predicted for both the Ectocarpus proteins (54.8% for Esi0007_0019 and 59.8% for Esi0232_0002). This was comparable with the composition of the yeast (68.2%) and Blastocystis (49.9%) proteins. Neither Esi0007_0019 nor Esi0232_0002 was predicted to contain significant amounts of β-sheet. Their Z-scores calculated using the ProSa Web Server (−4.31 for Esi0007_0019 and −4.96 for Esi0232_0002) were in the range of X-ray crystal structures of a comparable size. Using FATCAT, the models of Esi0007_0019 (root mean square deviation of 1.04 for 478 equivalent positions) and Esi0232_0002 (1.15 for 475 equivalent positions) were both reported to be significantly similar to the yeast Tom70 template, with a P value of 0.00e+00 (respective raw scores of 1218.11 and 1245.72); FATCAT considers structure pairs with a P value of <0.05 to be significantly similar.
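The α-helical percentages quoted above are simple proportions of residues assigned to the helix state. The sketch below shows the arithmetic on a per-residue state string, assuming a four-letter code (h for helix, e for extended strand, t for turn, c for coil) of the kind produced by secondary structure predictors; the state string is invented for the example and is not SOPMA output for any of the proteins discussed.

```python
from collections import Counter


def state_percentages(states):
    """Return the percentage of residues assigned to each secondary-structure state."""
    cleaned = states.strip().lower()
    if not cleaned:
        raise ValueError("empty state string")
    counts = Counter(cleaned)
    return {state: 100.0 * n / len(cleaned) for state, n in counts.items()}


# Hypothetical 20-residue prediction (assumed codes: h, e, t, c).
toy_states = "hhhhhhcctteehhhhcccc"
percentages = state_percentages(toy_states)
```

For the toy string, 10 of the 20 positions are helical, so the helix percentage is 50.0; the figures reported for Esi0007_0019 and Esi0232_0002 are the same ratio computed over the full-length predictions.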
The resulting modelled structures exhibited good conservation of the overall α-helical content and several of the potentially key residues for substrate binding and dimerisation identified by Wu and Sha appeared to be conserved. These residues are indicated in the alignment of opisthokont and stramenopile Tom70 sequences shown in Fig. S3A. Note that in many cases it is not the exact amino acid but rather the class of the amino acid (hydrophobic, acidic, basic, etc.) that is conserved. This was to be expected, given the level of conservation of these key residues within the opisthokont and stramenopile groups. Several of the variants at key residues in the E. siliculosus proteins (in the TPR9 and TPR10 domains, for example) were also observed in the Blastocystis and oomycete Tom70 proteins, suggesting a possible modification of preprotein binding interactions in stramenopiles. Only oomycetes seem to have homologues of the Tom70-like Esi0007_0019, suggesting that this protein was lost from diatoms and Blastocystis. No Tom70-like proteins were identified in Aureococcus anophagefferens (Pelagophyceae). The function of these proteins in mitochondrial protein import in Ectocarpus will need to be confirmed experimentally.
(A) Comparison of the different domain structures found in Tom70 homologues in Ectocarpus, oomycetes and diatoms. Transmembrane and TPR domain regions are shaded in grey and red, respectively. (B) Model of the Ectocarpus Esi0007_0019 protein obtained using the yeast Tom70 crystal structure (pdb accession: 2GW1) as a template. Residues implicated in substrate binding are coloured in green and blue for yeast and Ectocarpus, respectively. Residues involved in dimerisation are coloured in orange and yellow for yeast and Ectocarpus, respectively. The Esi0007_0019 model was generated in alignment mode on the SIB website and the pdb file was compared to yeast Tom70 using pdbViewer software. The superimposition of the two models (middle panel) shows the degree of conservation of key amino acids between the two proteins.
In plantae, a mitochondrial OM protein similar to chloroplast Toc64 called mtOM64 is thought to carry out a similar role to Tom70 in protein import, although it is not part of the TOM complex , , . No homologue of mtOM64 was identified in stramenopiles.
The SAM complex
Sam50, the major SAM subunit, forming the central channel of the complex , is derived from the bacterial protein Omp85/BamA . Compared with Omp85/BamA, Sam50 has lost four of the five repeats of the polypeptide transport (POTRA) domains located on the IMS side of the membrane and an N-terminal sequence, but nonetheless carries out the same function as the bacterial protein. Sam50 is accompanied by the peripheral proteins Sam35 and Sam37 , . Together they constitute the minimal functional SAM complex in yeast. Animal Metaxin1 and Metaxin2 are distantly related orthologues of Sam37 and Sam35 respectively . Although they are involved in mitochondrial protein import, Metaxins are not part of mammalian SAM complexes .
The Ectocarpus genome was found to encode homologues of only two SAM proteins, Sam50 (Esi0503_0006) and Metaxin (Esi0338_0018). Comparison of the secondary structures of the yeast Sam50 protein and Esi0503_0006 indicated that they share a conserved α-helix in the N-terminus and a major β-strand structure (Table S2). Moreover, phylogenetic analyses of Sam50 proteins from a broad range of eukaryotes, using the α-proteobacterial Omp85 as outgroup, indicated that Ectocarpus Sam50 is most closely related to the diatom and oomycete homologues and that stramenopile Sam50 proteins seem to cluster independently of well-supported clades formed by the green lineage Sam50 proteins and those of the opisthokont lineage (Fig. S4A). Surprisingly, the phylogenetic analysis of Metaxin proteins showed that the Ectocarpus homologue is more closely related to animal Metaxin1 than to animal Metaxin2 and green plant Metaxins, which form two well-supported, independent clusters (Fig. 3). Oomycetes and diatoms also possess a single Metaxin, which is again most similar to animal Metaxin1. In contrast with opisthokonts, which possess two Metaxins, only one of these proteins seems to be required in green plants and stramenopiles. It will be important to identify additional putative Metaxin proteins in other stramenopile species and in other eukaryotic lineages in order to determine whether the distribution of these proteins is the result of vertical gene inheritance and successive losses, or of ancient lateral gene transfer.
A PHYML tree was constructed as described in the materials and methods section, with 108 amino acid sites after curation of the alignment. Bootstrap values above 50 are indicated (100 replicates) for PHYML (first value) and BioNJ (second value) analyses and the thick branches correspond to posterior probabilities of >0.9 with the Bayesian method.
Small TIM Machinery
In many organisms, the IMS contains two small TIM complexes, both of which help direct precursor proteins to their correct destination. In yeast, Tim9 and Tim10 form the major functional complex, which chaperones hydrophobic proteins directed to the SAM or the TIM22 machineries, whereas the non-essential complex formed by Tim8 and Tim13 transports preproteins to the TIM23 complex. The Tim8-Tim13 complex is also involved in the earliest step of Tom40 assembly .
The Ectocarpus genome encodes homologues of four small Tim proteins (Table 1 and S1) corresponding putatively to Tim8, Tim9, Tim10 and Tim13 on the basis of a motif search carried out using the HMMPanther database (Table S2). All have a double α-helix structure and a twin CX3C motif, except Tim8, which has only one cysteine in each motif (Fig. S1H). Tim9 and Tim10 have a short, internal targeting signal that directs them to the IMS . We identified a similar internal targeting signal in the Ectocarpus Tim9 homologue (Fig. S1H).
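A twin CX3C motif consists of two Cys-X-X-X-Cys elements separated by a spacer. The sketch below scans a protein sequence for this pattern with a regular expression. The permitted spacer length (11 to 16 residues by default) is an assumption chosen for illustration, as are the function name and the test sequence; the search described above used the HMMPanther database, not this pattern.

```python
import re


def find_twin_cx3c(protein, min_spacer=11, max_spacer=16):
    """Return (start, end) spans of twin CX3C motifs, end exclusive.

    The spacer bounds between the two CX3C elements are assumed values.
    """
    pattern = re.compile(
        r"(?=(C.{3}C.{%d,%d}C.{3}C))" % (min_spacer, max_spacer)
    )
    # The lookahead lets overlapping candidate motifs be reported separately.
    return [(m.start(1), m.end(1)) for m in pattern.finditer(protein.upper())]


# Hypothetical sequence invented for this example.
toy_protein = "MSDA" + "CAAAC" + "G" * 12 + "CKKKC" + "EE"
hits = find_twin_cx3c(toy_protein)
```

A strict pattern of this kind would miss proteins such as the Ectocarpus Tim8 homologue, which retains only one cysteine in each element, so a relaxed pattern or a profile-based search is needed for divergent family members.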
In fungi, another small Tim protein, Tim12, is associated with the TIM22 complex . This protein, which is anchored to the IM at its C-terminal end, plays an important role in the insertion of polytopic proteins into the phospholipid bilayer. Like animals and plants, Ectocarpus does not possess Tim12. In animals, variant forms of Tim9, Tim8, and Tim13 were found instead of Tim12  but no similar sequences could be identified in Ectocarpus.
The MIA-ERV Machinery
Mitochondrial Intermembrane space import and Assembly protein 40 (Mia40) is a receptor protein involved in the import of small cysteine-containing proteins. Mia40 and its partners the FAD-dependent sulfhydryl oxidase Essential for Respiration and Vegetative growth 1 (Erv1) and the zinc finger protein Hot13 regulate the redox states of these proteins. Electrons are delivered to Mia40 by the TOM machinery and then transferred to Erv1. In yeast, Hot13 interacts with Mia40 to remove a zinc atom and thereby facilitates efficient reoxidation by Erv1 .
We found Mia40 candidates in diatoms and oomycetes using a low stringency search for the CXXC signature and the twin CX9C motifs of the D. discoideum sequence (Fig. S1E) but we failed to identify an Ectocarpus homologue.
Erv1 proteins play crucial roles in the biogenesis of cytosolic or mitochondrial Fe/S proteins. The mammalian orthologue is Alr (Augmenter of Liver Regeneration) and Erv1/Alr seem to be ubiquitously distributed across the eukaryote tree with the exception of some protozoans . This enzyme possesses a YPCXXC consensus sequence, which is implicated in the primary redox-reaction  and two additional cysteine pairs that are essential for correct function . It has recently been suggested that Erv1 could have originated from marine bacteria via horizontal gene transfer  but the Psychroflexus torquis Erv1/Alr sequence (ZP_01257237) on which this suggestion was based has since been removed from the NCBI database because it was a contamination. Consequently, the current data suggest that Erv1 was a novel, eukaryotic invention.
The Ectocarpus genome encodes two proteins that are highly similar to Erv1/Alr: the first is presumably the mitochondrial sulfhydryl oxidase (Esi0202_0015, Fig. S1F) whereas a second gene (Esi0052_0149) is encoded by an inserted viral genome similar to EsV-1  and its role, if any, remains to be specified.
Curiously, Ectocarpus possesses homologues of Hot13 (Fig. S1G) and Erv1, despite the absence of Mia40. Also, like most organisms that apparently lack Mia40 (Encephalitozoon cuniculi, Cryptosporidium parvum, Plasmodium falciparum, Giardia lamblia, Trichomonas vaginalis), Ectocarpus possesses many of the proteins that have been shown to be Mia40 substrates in yeast. As sequence similarity is very low (even between stramenopiles), it is possible that an Ectocarpus Mia40 homologue exists but that we were not able to identify it. Alternatively, the putative Ectocarpus Erv1 and Hot13 proteins could interact with an as yet unidentified protein that carries out the same function as Mia40. Note also that some Erv1-like proteins are known to function in the absence of a partner protein.
The TIM23 complex
TIM23 is the largest of the mitochondrial protein import complexes, composed of nine different subunits in yeast . The most important components are the embedded proteins Tim23 and Tim17 and the major receptor for N-terminal targeting peptide recognition, Tim50. Additional proteins include Tim44, Tim16/Pam16, Tim14/Pam18, mtHsp70 and Mge1, all of which are located on the matrix side of the complex. Complete translocation of precursors into the matrix requires the heat shock protein mtHsp70, which is recruited to the translocase by Tim44 and binds to incoming precursors in an ATP-dependent manner. Binding is regulated by the nucleotide exchange factor Mge1 and the J-complex, which consists of Tim14 and Tim16 . Tim21 and Pam17 do not seem to be essential for TIM23 to function; they modulate the activity of the translocase by acting as antagonists .
There are single homologues of Tim23, Tim17, Tim44, Tim16/Pam16, Tim14/Pam18, mtHsp70 and Mge1 in Ectocarpus. All the corresponding proteins appear to be ubiquitous in opisthokonts and green plants (Table 1 and S1). Tim21 and Pam17 were found in opisthokonts but were reported to be absent from Arabidopsis, Physcomitrella and Chlamydomonas . A recent study of plant import systems described the presence of Tim21 homologues  and we have identified Tim21 homologues in Ectocarpus, diatoms and oomycetes. Phylogenetic analysis indicated that these proteins group together in a well-supported monophyletic clade, as a sister group to animal Tim21-like proteins (Fig. 4A). On average, Tim21-like proteins shared between 26 and 35% amino-acid identity within each well-defined cluster, but there was a low level of overall sequence conservation. The stramenopile Tim21 proteins were most similar to their opisthokont homologues (15% amino-acid identity). These results suggest that, like their stramenopile counterparts, the newly identified plant Tim21-like proteins may actually be derived Tim21 proteins. It will be necessary, however, to demonstrate biochemically that the latter are components of the TIM23 complex to confirm this hypothesis.
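Percentage identity values such as those quoted above depend on how the comparison is normalised. The sketch below computes identity over the aligned columns in which neither sequence has a gap; this denominator is an assumption made for illustration (the paper does not state which convention was used), and the two aligned sequences are invented.

```python
def percent_identity(seq_a, seq_b, gap="-"):
    """Percentage of identical residues over columns where neither sequence is gapped."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must come from the same alignment")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == gap or b == gap:
            continue  # columns with a gap in either sequence are ignored
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy aligned pair invented for this example.
identity = percent_identity("MKT-LV", "MRTALV")
```

Here four of the five ungapped columns match, giving 80.0%. Dividing by the full alignment length or by the length of the shorter sequence would give lower values, which is why the convention matters when identities as low as 15% are being compared.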
PHYML trees were constructed as described in the materials and methods section, with 66 and 149 amino acid sites for Tim21 and Tim50, respectively, after curation of the alignment. Bootstrap values above 50 are indicated (100 replicates) for PHYML (first value) and BioNJ (second value) analyses and the thick branches correspond to posterior probabilities of >0.9 with the Bayesian method.
The TIM23 complex imports proteins to both the matrix and the IM and both types of substrate are recognized by Tim50. Interaction between the IMS domains of Tim50 and Tim23 is important for the transfer of preproteins between the TOM and TIM23 complexes. Tim50 is composed of a single transmembrane region located N-terminal to a large soluble domain exposed to the IMS. The origin of Tim50 is unclear but it shares sequence homology with a family of nuclear LIM interactor-interacting factor-like (NLI-IF) phosphatases, most of which are targeted to the nucleus. Esi0000_0471, the Ectocarpus Tim50 homologue, is predicted to be imported into mitochondria and it shares strong similarity with all Tim50 proteins in its C-terminal part. Esi0000_0471 is predicted to possess an N-terminal α-helical transmembrane domain. As with the Tim21 proteins, the Tim50 proteins grouped into four relatively well-supported clusters in phylogenetic analyses (corresponding to green lineage, stramenopile, animal and fungal proteins) but with low resolution as far as deep branching was concerned (Fig. 4B). This phylogenetic analysis indicated that these proteins were acquired early and have evolved independently in each of the major eukaryotic lineages.
The TIM22 complex
TIM22 is responsible for the insertion of carrier proteins in a membrane potential-dependent manner. Tim22 is a member of the Tim17/Tim22/Tim23 family and is the main component of the TIM22 complex. Tim17, Tim23 and Tim22 are all composed of four α-helical segments and it has been suggested that they have evolved from the bacterial LivH protein . In yeast, Tim22 is associated with at least three partners: Tim18, Tim54 and the small Tim12. No counterparts of these proteins have been found in other eukaryotic lineages suggesting that they arose recently in fungi. Ectocarpus possesses a single Tim22 gene (Esi0063_0018). Two additional genes (Esi0019_0065 and Esi0046_0104) were assigned to the Tim17/Tim22/Tim23 family but they could not be precisely attributed to one sub-family. Their role remains to be elucidated.
As in plants and animals, it is possible that accompanying proteins analogous to fungal Tim54, Tim18 or Tim12 are present in the stramenopile TIM22 complex, but at present there is no biochemical evidence to support this.
The OXA complex
Oxa1 mediates the insertion of mitochondrial-encoded subunits of respiratory complexes and several nuclear-encoded proteins into the IM from the matrix side. Oxa1 is part of the YidC/Oxa/Alb3 family, which includes proteins that are specialised in the integration of polytopic proteins into membranes in bacteria, mitochondria and chloroplasts, respectively. The members of this large family are derived from an ancestral, bacterial YidC protein . The primary sequences of the members of this family are not well conserved but they share common structural features including five transmembrane segments flanked by N- and C-terminal extensions of variable length . In addition to Oxa1, mitochondria often contain a second member of this family called Oxa2 or Cox18. In yeast, Cox18 is implicated in the insertion of Cox2 into the membrane . Unlike Cox18, Oxa1 has a long C-terminal extension containing a ribosome binding domain capable of interacting with the mitochondrial protein translation machinery. This domain mediates cotranslational insertion of nascent polypeptides . Establishment of the Oxa1 and Oxa2 subfamilies appears to have occurred following an ancient duplication, because most eukaryotes, including plants and opisthokonts possess two Oxa genes . The family has further expanded in other lineages, such as protists, resulting in a rich diversity of these proteins . Three members of the Oxa sub-families were identified in the Ectocarpus genome (Table 1 and S1). Our phylogenetic analysis of these proteins (based on the study by Zhang et al. ) indicated that Esi0028_0040 is probably an Oxa1 homologue, whereas Esi0170_0057 groups with the Oxa2 proteins from oomycetes (Fig. S4B). Alignment of Esi0170_0057 with Oxa1 and Oxa2 proteins showed that it lacks a C-terminal extension supporting its classification as an Oxa2 orthologue (Fig. S5). 
Esi0025_0161 did not cluster within the Oxa1 and Oxa2 clades and it was therefore not possible to classify it into one of these sub-families. Of the three genes, only Esi0025_0161 lacked EST support.
Do mitochondria have a functional TAT system?
The TAT system is a protein-targeting pathway that is found in many bacteria. TatA and TatC are the minimal requirements for a functional pathway. TatB is phylogenetically related to TatA but performs a distinct role in the TAT translocation system. TatC is a polytopic membrane protein with several transmembrane helices. This protein is involved in both the recognition and translocation of proteins containing the “twin-arginine” signal peptide consensus sequence (S/TRRxFLK).
TatC homologues have been found in both the mitochondrial and chloroplast genomes of many green plants, algae and protists, including stramenopiles (Organelle Genome Database, http://gobase.bcm.umontreal.ca). One animal mitochondrial genome has also been shown to contain a homologue. TatA and TatB have been found in the nuclear genomes of several eukaryotes and phylogenetic analysis has identified a third, intermediate class (called TatA/B here) in land plants, green algae and cyanobacteria. The proteins encoded by these nuclear genes are predicted to be targeted to either plastids or mitochondria. Taken together, these data suggest that TAT systems may be present in both the plastids and mitochondria of a wide range of photosynthetic eukaryotes. However, although there is now convincing evidence for a functional TAT system in plastids, this remains to be demonstrated experimentally for mitochondria. Recently, TatC gene expression has been reported in tobacco, suggesting that this gene plays a role in mitochondria. Notably, it has been proposed that a TatC pathway might be responsible either for the correct functioning of FeS-containing protein complexes in the IM or for the import and/or export of folded proteins from the matrix into the IMS.
TatC homologues were found in both the plastid and mitochondrial genomes of Ectocarpus . However, the nuclear genome only contains one TatA/B homologue (Esi0067_0034). The presence of a single, nuclear TatA/B gene does not rule out the existence of a mitochondrial TAT system provided that the protein is targeted to both the plastids and the mitochondria. MitoprotII and SubLoc predict that Esi0067_0034 has a mitochondrial destination. There is now clear evidence that many proteins can be targeted to both mitochondria and chloroplasts in plants  and to both mitochondria and the endoplasmic reticulum in yeast . In silico and biochemical studies suggest that the plastids of land plants have two distinct but closely related TAT complexes. In some instances, TatB may be absent, in which case TatA alone may be associated with TatC . It is therefore possible for a functional TAT system to consist of TatC plus a single protein of the TatA/TatB family. Taken together, therefore, analysis of the genome indicated that Ectocarpus might possess a mitochondrial TAT system consisting of a TatA/B and a TatC. However, we cannot rule out the possibility that TatC only plays a role in the recognition of twin-arginine proteins in the mitochondrion, in which case it would not necessarily function as a component of a TAT system.
Inner membrane peptidases
Mitochondrial peptidase complexes play important roles both in the removal of N-terminal targeting peptides and in the turnover of mitochondrial proteins. Of the five major classes of mitochondrial peptidase complex, three are located on the IM (Fig. 5).
Three of the five mitochondrial peptidases implicated in protein maturation after import are located on the inner membrane. The IMP and Rhomboid proteases release mature proteins into the IMS after cleavage of a transmembrane segment (represented in pink and grey, respectively) whereas the m-AAA protease cuts precursors on the matrix side (green). The m-AAA protease and its counterpart on the IMS side of the inner membrane are also involved in the general degradation pathway of mitochondrial proteins. The two main enzymes that produce mature mitochondrial proteins are the matrix-located MPP and MIP. They can act sequentially or independently to generate functional proteins. The peptides recognized by MPP and MIP are colored in blue and orange, respectively. In Ectocarpus, non-functional paralogues of β-MPP and α-MPP (core1 and core2 subunits) were identified in the genome, suggesting their presence in complex III of the respiratory chain (OXPHOS). Inner membrane (IM); intermembrane space (IMS).
The Inner Membrane Peptidase complex (IMP) is derived from the α-proteobacterial leader peptidase LepB. Bacterial IMP is monomeric but mitochondrial IMP is a heterodimer of Imp1 and Imp2 proteins. Imp1 and Imp2 are relatively similar, possessing the same catalytic S and K residues. Imp1 is more similar to LepB than Imp2. Each subunit is anchored to the IM by its N-terminal extremity and the C-terminal catalytic domains are located in the IMS. Recently, it has been proposed that the motifs RX5P and NX5S, found in the catalytic domains of Imp1 and Imp2 respectively, are discriminatory for these two proteins in most species.
Two Imp1 homologues and one Imp2 homologue were identified in Ectocarpus (Table 3). All have the conserved catalytic residues and the discriminatory motif RX5P is present in Esi0007_01117 and Esi0009_0118 while an imperfect NX5p motif was found in Esi0324_0009. Moreover, analysis of an alignment that included plant and stramenopile sequences indicated that new and more restrictive consensuses (GDNX7RX5P and EGDX8NX5(S/P)) could be defined for Imp1 and Imp2, respectively. IMP substrates have been well studied in opisthokonts (Table 4). Homologues of four of the six proteins that have been shown to be Imp1 substrates in other species were identified in the Ectocarpus nuclear (Mcr1, GPDM and AIF) and mitochondrial (cox2) genomes, as well as a homologue of the Imp2 substrate cyt c1. Further analysis will be required to determine whether these proteins are substrates of IMP in Ectocarpus, but their presence suggests that there is a need for functional IMP complexes in this brown alga.
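The Imp1 and Imp2 consensuses given above can be expressed as regular expressions to illustrate how a catalytic domain might be screened. This is a minimal sketch under stated assumptions, not the method used here: X is taken to mean any single residue, the function name and dictionary labels are hypothetical, and the two test sequences are invented strings built only to contain (or lack) the motifs.

```python
import re

# Consensus motifs as stated in the text; "X" is assumed to be any residue.
IMP_MOTIFS = {
    "Imp1 (RX5P)": re.compile(r"R.{5}P"),
    "Imp2 (NX5S)": re.compile(r"N.{5}S"),
    "Imp1 restrictive (GDNX7RX5P)": re.compile(r"GDN.{7}R.{5}P"),
    "Imp2 restrictive (EGDX8NX5(S/P))": re.compile(r"EGD.{8}N.{5}[SP]"),
}


def matching_motifs(catalytic_domain):
    """List the names of all consensus motifs found in the given sequence."""
    seq = catalytic_domain.upper()
    return [name for name, pattern in IMP_MOTIFS.items() if pattern.search(seq)]


# Invented sequences (poly-Ala spacers), not real proteins.
toy_imp1 = "AAGDN" + "A" * 7 + "R" + "A" * 5 + "P" + "AA"
# Ends the Imp2-type motif with P instead of S, i.e. an imperfect NX5S motif.
toy_imp2 = "AAEGD" + "A" * 8 + "N" + "A" * 5 + "P" + "AA"

print(matching_motifs(toy_imp1))
print(matching_motifs(toy_imp2))
```

Short patterns such as RX5P are expected to occur by chance in long sequences, so a search of this kind is only meaningful when restricted to the aligned catalytic domain; the longer consensuses are correspondingly more specific.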
m-AAA protease, a member of the AAA superfamily (ATPases Associated with diverse cellular Activities), is principally involved in the proteolysis of membrane proteins and in the maintenance of mitochondrial activities. It is anchored in the IM with the catalytic domain in the matrix. A similar enzyme, i-AAA protease, is required for degrading polypeptides between the mitochondrial membranes. Both enzymes are involved in the turnover of unincorporated or mistranslated mitochondrial translation products, notably in the quality control of several OXPHOS proteins . Phylogenetic analysis has revealed three main types of enzyme, represented by S. cerevisiae Yme1, human Spg7/paraplegin and S. cerevisiae Yta10/Afg3 and Yta12/Rca1, respectively . Yme1 is an i-AAA protease, whereas the other three are m-AAA proteases. Bacteria generally contain only one AAA protease, the FtsH metalloprotease. The m-AAA and i-AAA proteases probably evolved from an ancestral α-proteobacterial FtsH gene via a duplication mechanism that created a novel and specific pathway for processing of a subset of matrix proteins including Ccp1 and MrpL32 , . The Ectocarpus genome encodes both i-AAA and m-AAA proteases (Table 3), the latter being more similar to paraplegin than to yeast m-AAA proteases. Both Ectocarpus proteins have the ATP binding and the HEXXH metal binding domains required for activity .
Rhomboid proteases are a small, highly conserved protein family within the superfamily of serine proteases. Generally rhomboid proteases function in intercellular communication but recently some of them have been shown to act within the lipid bilayer of the mitochondrial IM . These integral membrane proteins are intramembrane cleaving proteases, cutting substrates at membrane-spanning segments. Two rhomboid proteases, Pcp1/Rbd1 and Rbd2, have been isolated from the IM of yeast mitochondria . Mitochondrial rhomboid proteases have also been found in other eukaryotes including human (PARL), Drosophila (Rho-7) and A. thaliana (AtRBL12). Based on sequence similarity, there appear to be ten rhomboid-like proteases in Ectocarpus. Esi0140_0003 is similar to Pcp1/Rbd1. No clear homologue of Rbd2 was detected. Esi0044_0125 shares similarity with the human, Drosophila and Arabidopsis sequences. A third sequence Esi0140_0044 may also be related to these proteins. The catalytic residues of rhomboid proteases (an Asn-Ser-His triad ) are conserved in Pcp1/Rbd1, PARL, Rho-7 and AtRBL12. Esi0140_0003 and Esi0140_0044 possess all three residues but Esi0044_0125 contains only the serine. Three substrates of Pcp1 have been described and two of these, Mgm1 and Ccp1, are predicted to be present in Ectocarpus. Both are expected to reside in the IMS sub-compartment. Note, however, that in humans Opa1 (the homologue of Mgm1) is not processed by PARL (the homologue of Pcp1) and therefore experimental evidence will be required to confirm that these proteins are actually Pcp1 substrates in Ectocarpus.
The mitochondrial processing peptidase (MPP) is the principal protease that cleaves N-terminal targeting sequences of nuclear encoded proteins during their transport into the organelle. MPP is a heterodimeric enzyme made up of α-MPP and β-MPP subunits. In mammals and yeast, the enzyme is localized in the matrix whilst in land plants MPP is a component of complex III of the respiratory chain, replacing the Core1 and Core2 subunits. An intermediate situation is found in some fungi, like Neurospora crassa, which have a fully active, soluble α/β-MPP complex in the matrix and a partially active Core2/β-MPP complex associated with complex III. MPP subunits and Core subunits have probably evolved from the bacterial metallopeptidase RPP. All known β-MPP subunits possess an inverted zinc binding motif HXXEH(X)76E, which is required for zinc binding and catalytic activity of MPP. This sequence is less conserved in Core and α-MPP proteins. Both the α and β components of MPP are normally inactive in the monomeric state, but it has been recently demonstrated that β-MPP orthologues in some protozoans are functional as monomers. This observation suggests that α-MPP and β-MPP are derived from an ancestral protein that was active as a monomer.
Four MPP-homologous genes were identified in Ectocarpus, corresponding to two putative α-MPP/Core2-type (Esi0268_0010 and Esi0111_0016) and two putative β-MPP/Core1-type (Esi0098_0070 and Esi0011_0084) proteins. Alignment of their sequences showed that only Esi0098_0070 possessed the inverted zinc-binding consensus HXXEH(X)76E, suggesting that Esi0011_0084 is a Core1 subunit of complex III without enzymatic activity (Fig. S6A). Similarly, Esi0268_0010 possesses the essential glycine-rich region and probably corresponds to the α-MPP subunit whilst Esi0111_0016, which does not share this feature, would probably correspond to Core2 of the respiratory chain complex III.
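The distinction drawn above between β-MPP and Core1-type proteins rests on the presence of the inverted zinc-binding consensus HXXEH(X)76E. The following minimal sketch shows how such a check could be written; it assumes that (X)76 denotes a spacer of exactly 76 residues, the function name is hypothetical, and the two test sequences are invented strings rather than the Ectocarpus proteins.

```python
import re

# Inverted zinc-binding motif HXXEH(X)76E: His, two residues of any type,
# Glu, His, then a spacer assumed to be exactly 76 residues, then Glu.
ZINC_MOTIF = re.compile(r"H..EH.{76}E")


def has_inverted_zinc_motif(sequence):
    """Return True if the sequence contains HXXEH(X)76E (alignment gaps removed)."""
    ungapped = sequence.replace("-", "").upper()
    return ZINC_MOTIF.search(ungapped) is not None


# Invented sequences (poly-Gly spacers), not real proteins.
toy_with_motif = "MA" + "HFLEH" + "G" * 76 + "E" + "KL"
# Same sequence with the first Glu replaced by Gln, which breaks the motif.
toy_without_motif = "MA" + "HFLQH" + "G" * 76 + "E" + "KL"

print(has_inverted_zinc_motif(toy_with_motif))
print(has_inverted_zinc_motif(toy_without_motif))
```

A strict spacer length is a simplification: if homologues carried small insertions or deletions in this region, a range such as `.{70,80}` might be more appropriate, but that choice would need to be justified from an alignment.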
In contrast to the situation in Ectocarpus, we only found two or three MPP genes in other stramenopile genomes. For example, as has been previously described for land plants, P. tricornutum possesses putative α-MPP and β-MPP subunits but no Core proteins (Fig. S6B). T. pseudonana and Phytophthora species appear to have α-MPP and β-MPP plus an additional α-MPP/Core2-type subunit, which is probably inactive. A recent proteomic analysis of Chlamydomonas reinhardtii reported the existence of a Core2 protein in addition to the soluble α-MPP and β-MPP . This resembles more closely the situation described for N. crassa than that described for land plants. The situation in Ectocarpus is similar to that described for opisthokonts, except that Ectocarpus may also have a Core2 subunit. Note that β-MPP is probably able to associate with Core2 as well as with α-MPP.
Mitochondrial Intermediate Peptidase (MIP) is a thiol-dependent metalloprotease, which has the ability to cleave intermediate-size proteins following cleavage by MPP. MIP proteins are related to bacterial zinc metalloproteases. Many proteins that are targeted to the mitochondrial matrix or to the IM are processed in two sequential steps, first by MPP and then by MIP. After MPP cleavage at R+1, a typical processing octapeptide-carrying intermediate is generated and subsequently recognized by MIP, leading to production of the mature protein . The Ectocarpus gene Esi0033_0093 is predicted to encode an MIP protein.
Despite the large evolutionary distances that separate the stramenopiles from well-studied organisms such as yeast or Arabidopsis, most of the core components of mitochondrial protein import systems appear to be encoded by the Ectocarpus genome. On the other hand, in many instances we failed to find genes encoding accessory proteins (small Toms, Tim12 or Pam17 for example) and this is consistent with previous observations suggesting that these proteins often arose later in evolution within individual eukaryotic lineages to increase the efficiency of the import systems . Components that are known to be different between opisthokonts and plantae, such as TOM complex receptors, were also variable in Ectocarpus. However, a candidate counterpart for Tom70, plus a closely related Tom70-like protein, were identified in this brown alga, concurring with the recent finding of Tsaousis et al.  that the Tom70 receptor is not found only in opisthokonts. Additional potential Tom70 homologues were identified in recent genomic data for other stramenopiles, suggesting that the presequence-less import pathway mediated by Tom70 could be more ancient than previously supposed, being acquired before the separation of opisthokonts and stramenopiles. In contrast, no Tom20 counterpart was identified in Ectocarpus. Even if Tom22 is also able to bind N-terminal presequences and has been proposed to be the ancestral receptor for the presequence-mediated mitochondrial import, Tom20 is the major protein involved in this pathway in opisthokonts and plantae. This component, if it exists in brown algae, remains to be discovered.
Other interesting observations included the absence of Mia40, despite the apparent presence of other components of the MIA/ERV system, and the identification of stramenopile Tim21-like proteins related to opisthokont Tim21, suggesting that this protein might be more broadly distributed across the eukaryotic tree than previously thought.
All the known mitochondrial peptidases appear to be present in Ectocarpus, including the IMP, m-AAA and rhomboid proteases, MPP and MIP. Interestingly, different combinations of MPP and Core subunits were predicted to be present in diverse stramenopiles, as has been reported for opisthokonts and plantae. This indicates that incorporation of MPP into the respiratory chain could have occurred several times during the evolution of eukaryotes.
Previous studies focusing on the plantae and opisthokont lineages, and on other protist groups, have indicated that mitochondrial protein import complexes consist of very ancient and conserved components together with a variety of variable accompanying proteins depending on the lineage. The analysis presented here indicates that a similar situation exists in the stramenopiles, with individual protein uptake complexes exhibiting more or less complexity depending on the subgroup analysed.
Alignments of conserved regions of predicted Ectocarpus mitochondrial proteins with the corresponding regions from their eukaryotic homologues. (A) Alignment of Tom7 proteins from diverse eukaryotes. EctsiTom7 (Esi0179_0016, Ectocarpus siliculosus), ChlreTom7 (XP_001690575, Chlamydomonas reinhardtii), ArathTom7-1 (NP_568593, Arabidopsis thaliana), ApimeTom7 (NP_001155870, Apis mellifera), VitviTom7 (XP_002269243, Vitis vinifera), RiccoTom7 (XP_002523617, Ricinus communis), SacceTom7 (CAA95944, Saccharomyces cerevisiae), NeucrTom7 (AAK18812, Neurospora crassa), HomsaTom7 (NP_061932, Homo sapiens), SoltuTom7 (O82067, Solanum tuberosum), PhatrTom7 (sequence obtained from , Phaeodactylum tricornutum). Transmembrane sequence and Tom7 motif are shaded in grey and dark grey respectively (this alignment representation is based on Fig. 2, Maćasev et al. ). (B) Alignment of Tom40 proteins from diverse eukaryotes. EctsiTom40 (Esi0055_0058, Ectocarpus siliculosus), PhatrTom40 (XP_002182279, Phaeodactylum tricornutum), ThapsTom40 (XP_002293745, Thalassiosira pseudonana), PhysoTom40 (Physo1_1_108992, Phytophthora sojae), PhyraTom40 (Phyra1_1_72372, Phytophthora ramorum), PhyinTom40 (EEY60531, Phytophthora infestans), MicpuTom40 (EEH60685, Micromonas pusilla), ChlreTom40 (XP_001702575, Chlamydomonas reinhardtii), ArathTom40-1 (NP_188634, Arabidopsis thaliana), ArathTom40-2 (NP_175457, Arabidopsis thaliana), PhypaTom40 (XP_001783489, Physcomitrella patens), OstluTom40 (XP_001417025, Ostreococcus lucimarinus), SacceTom40 (EDN64139, Saccharomyces cerevisiae), ToxgoTom40 (EEE23864, Toxoplasma gondii), PermaTom40 (EER18361, Perkinsus marinus), PlafaTom40 (CAG24986, Plasmodium falciparum), HydmaTom40 (XP_002159190, Hydra magnipapillata), NeucrTom40 (XP_961545, Neurospora crassa), HomsaTom40 (O96008, Homo sapiens), DicdiTom40 (EAL68763, Dictyostelium discoideum), CyameTom40 (AP006488 DNA, Cyanidioschyzon merolae), CioinTom40 (XP_002131405, Ciona intestinalis). 
Two predicted β-strands are indicated in yellow as shown in Fig. 1 of Maćasev et al. The PoxGxxφxφ consensus has been described in Kutik et al. as an important C-terminal motif for recognition by the β-barrel protein assembly machinery (where Po is a polar amino acid, φ a hydrophobic residue and x any amino acid). (C) Alignment of Tom22 proteins from diverse eukaryotes. EctsiTom22 (Esi0246_0018, Ectocarpus siliculosus), PhatrTom22 (XP_002179609, Phaeodactylum tricornutum), PhyinTom22 (EEY62448, Phytophthora infestans), OstluTom22 (XP_001416904, Ostreococcus lucimarinus), MicpuTom22 (EEH59409, Micromonas pusilla), SacceTom22 (CAA96013, Saccharomyces cerevisiae), NeucrTom22 (XP_956695, Neurospora crassa), HomsaTom22 (BAB16408, Homo sapiens), DromeTom22 (NP_523910, Drosophila melanogaster), ArathTom22-1 (NP_563699, Arabidopsis thaliana), ArathTom22-2 (NP_199210, Arabidopsis thaliana). The transmembrane region (grey and dark grey) exhibits a conserved WxxxT(T/S)xxxxxxP motif, as previously observed. Acidic residues (D/E) are shown in red and basic residues (K/R) are in light blue. The cytosolic region is larger in opisthokonts with a patch of acidic amino acids. This additional domain is currently restricted to opisthokonts (not found in plantae, stramenopiles and other protists). (D) Alignment of Mia40 proteins from diverse eukaryotes. Stramenopiles are represented by Thalassiosira pseudonana (Thaps, XP_002292670), Phytophthora ramorum (Phyra, Phyra1_1_82670), Phytophthora sojae (Physo, Physo1_1_133729), Phaeodactylum tricornutum (Phatr, XP_002182915). Plantae sequences include Ostreococcus lucimarinus (Ostlu, XP_001419132) and Micromonas pusilla (Micpu, EEH56979), the red alga Cyanidioschyzon merolae (Cyame, AP006492), and the land plants Arabidopsis thaliana (Arath, NP_680211) and Physcomitrella patens (Phypa, XP_001765902). The opisthokont sequences are from Homo sapiens (Homsa, 2K3J) and Saccharomyces cerevisiae (Sacce). 
Amoebozoa are represented by Dictyostelium discoideum (Dicdi, XP_643501). Mia40 was not found in the Ectocarpus genome. A new CXXC consensus (colored black), rather than the previously described CPC signature, is present in Mia40 homologues and this sequence is followed by a twin CX9C motif (colored black). (E) Alignment of Erv1 proteins from diverse eukaryotes. SacceErv1 (NP_011543, Saccharomyces cerevisiae), HomsaALR (CAB87993, Homo sapiens), EctsiErv1 (Esi0202_0015, Ectocarpus siliculosus), ThapsErv1 (XP_002291504, Thalassiosira pseudonana), PhatrErv1 (XP_002185840, Phaeodactylum tricornutum), PhyinErv1 (EEY69279, Phytophthora infestans), PhysoErv1 (Physo1_1_129835 estExt, Phytophthora sojae), PhyraErv1 (Phyra1_1_93536 C, Phytophthora ramorum), ArathErv1 (NP_564557, Arabidopsis thaliana), ChlreErv1 (EDP03768, Chlamydomonas reinhardtii), CyameErv1 (AP006502 DNA, Cyanidioschyzon merolae). Cys residues are shaded in brown for Stramenopiles, in green for plantae and in blue for opisthokonts. YPCXXC and CX16X motifs are conserved in all Erv1 proteins and are important for FAD binding. Specific twin Cys motifs are present in SacceErv1 (CRSC), HomsaALR (CRAC and CX10C), in ArathErv1 (CEQKSC) and in Stramenopiles (CX4C). (F) Alignment of Hot13 proteins from diverse eukaryotes. In panel 1, EctsiHot13 (Esi0046_012, Ectocarpus siliculosus), PhysoHot13 (Physo1_1_123261 estExt, Phytophthora sojae), PhyraHot13 (Phyra1_1_49641, Phytophthora ramorum), PhyinHot13 (EEY65160, Phytophthora infestans), CyameHot13 (AP006495 DNA, Cyanidioschyzon merolae), SacceHot13 (AAS56591, Saccharomyces cerevisiae), NeucrHot13 (Neurospora crassa, EAA28094). The conserved Cys and His residues involved in zinc-ion coordination are shaded in black, as indicated in panel 2. (G) Conserved motifs in the Ectocarpus small Tim proteins. Part a. Alignment of E. siliculosus small Tim sequences showing the twin CX3C motif (conserved cysteines are shaded in black and residues matching the previously determined consensus are in blue). Part b. 
Example of the presence of an internal sequence signal for import into mitochondria in E. siliculosus Tim9 as observed in different eukaryotic species (Ectsi: Ectocarpus siliculosus, Sacce: Saccharomyces cerevisiae, Neucr: Neurospora crassa, Orysa: Oryza sativa, Homsa: Homo sapiens, Drome: Drosophila melanogaster). (H) Partial alignment of Tim44 proteins. (I) Partial alignment of Tim23 proteins. (J) Partial alignment of Tim17 proteins. (K) Partial alignment of Tim16 proteins. (L) Partial alignment of Tim14 proteins. (M) Partial alignment of mtHsp70 proteins. (N) Partial alignment of Mge1 proteins. (O) Partial alignment of Tim22 proteins. For (H) to (O) accession numbers and corresponding names of species are listed in Table S3. Black and grey coloured residues correspond to the amino acid identities (>80% and >60% respectively).
Alignments of complete, predicted Ectocarpus mitochondrial proteins with their eukaryotic homologues. (A) Metaxin, (B) Tim21, (C) Tim50, (D) Sam50, (E) YidC, Oxa1, Cox18/Oxa2, Alb3 proteins. These alignments were used for the phylogenetic analyses. Accession numbers and corresponding names of species are presented in Table S3. Black and grey coloured residues correspond to the amino acid identities (>80% and >60% respectively).
Alignment of Tom70 proteins showing the presence of a single transmembrane region and multi-TPR domains. TPR regions have been superimposed on the Saccharomyces cerevisiae model and these domains were validated by bioinformatic analysis using the HHrep ID and TPRpred tools. Transmembrane regions were predicted by DAS and TMpred. Residues shaded in green (opisthokonts) or blue (stramenopiles) represent amino acids implicated in substrate binding and red-coloured (opisthokonts) or yellow-coloured (stramenopiles) positions correspond to residues involved in dimerisation according to , . Esi0007_0019 and Esi0232_0002 are the Ectocarpus siliculosus proteins and the sequences from other organisms correspond to Blastocystis sp. (Blasp), Phaeodactylum tricornutum (Phatr), Phytophthora infestans (Phyin), Pichia pastoris (Picpa), Saccharomyces cerevisiae (Sacce), Naumovia castellii (Nauca), Neurospora crassa (Neucr), Aspergillus nidulans (Aspni), Trichoplax adhaerens (Triad), Nematostella vectensis (Nemve), Danio rerio (Danre), Homo sapiens (Homsa) and Xenopus laevis (Xenla).
Maximum likelihood trees based on alignments of (A) Sam50 and (B) Oxa protein families. PHYML trees were constructed as described in the materials and methods section, with 113 and 88 amino acid sites for Sam50 and Oxa trees, respectively. Bootstrap values (100 replicates) above 50 are indicated for Maximum likelihood (first value) and Neighbour-Joining (second value) analyses.
Alignment of the C-terminal regions of Oxa1 family proteins. C-terminal domains characteristic of Oxa1 and Oxa2 are boxed.
MPP proteins. (A) Alignment of MPP and core subunit motifs from diverse eukaryotes. Esi0111_0016, Esi0268_0010, Esi0011_0084 and Esi0098_0070 are the Ectocarpus sequences. The sequences from other organisms are from Homo sapiens (Homsa), Saccharomyces cerevisiae (Sacce), Neurospora crassa (Neucr), Chlamydomonas reinhardtii (Chlre), Solanum tuberosum (Soltu). alphaMPP and betaMPP indicate functional MPP subunits (shown in negative type). Core2 and core1 represent the corresponding non-functional MPP homologues inserted into the respiratory chain. The highlighted residues correspond to the key amino acids of alphaMPP and betaMPP activities. (B) Schematic representations of mitochondrial MPP complexes from diverse eukaryotes. Panels 1, 2 and 3 reflect the evolution of MPP proteins as suggested by . See text for details.
Comparative analysis of mitochondrial protein import systems across the eukaryotic tree. The table is based on searches of public databases of protein sequences. ALL, predictions obtained using all the subcellular localisation predictors listed in Materials and Methods. NI, not identified.
Summary of the results of manual searches for components of mitochondrial protein import systems. Relevant hits are in bold characters.
Mitochondrial protein import components across the eukaryotic tree. These accessions have been used in amino acid alignments and phylogenetic analyses. Note that the table includes both proteins identified in previous published studies and proteins found in this study by manual searches.
Wrote the paper: LD CL JMC. Generated the annotated genome data: LS JP J-MA JMC. Analyzed the genome data and carried out comparative analyses: LD CL PNC BG M-PO.
- 1. Gray MW (1999) Evolution of organellar genomes. Curr Opin Genet Dev 9: 678–687.
- 2. Gray MW, Burger G, Lang BF (2001) The origin and early evolution of mitochondria. Genome Biol 2: REVIEWS1018.
- 3. Gabaldón T, Huynen MA (2003) Reconstruction of the proto-mitochondrial metabolism. Science 301: 609.
- 4. Sickmann A, Reinders J, Wagner Y, Joppich C, Zahedi R, et al. (2003) The proteome of Saccharomyces cerevisiae mitochondria. Proc Natl Acad Sci U S A 100: 13207–13212.
- 5. Calvo S, Jain M, Xie X, Sheth SA, Chang B, et al. (2006) Systematic identification of human mitochondrial disease genes through integrative genomics. Nat Genet 38: 576–582.
- 6. Kutik S, Guiard B, Meyer HE, Wiedemann N, Pfanner N (2007) Cooperation of translocase complexes in mitochondrial protein import. J Cell Biol 179: 585–591.
- 7. MacKenzie JA, Payne RM (2007) Mitochondrial protein import and human health and disease. Biochim Biophys Acta 1772: 509–523.
- 8. Neupert W, Herrmann JM (2007) Translocation of proteins into mitochondria. Annu Rev Biochem 76: 723–749.
- 9. Glaser E, Sjöling S, Tanudji M, Whelan J (1998) Mitochondrial protein import in plants. Signals, sorting, targeting, processing and regulation. Plant Mol Biol 38: 311–338.
- 10. Rassow J, Dekker PJ, van Wilpe S, Meijer M, Soll J (1999) The preprotein translocase of the mitochondrial inner membrane: function and evolution. J Mol Biol 286: 105–120.
- 11. Figueroa-Martínez F, Funes S, Franzén LG, González-Halphen D (2008) Reconstructing the mitochondrial protein import machinery of Chlamydomonas reinhardtii. Genetics 179: 149–155.
- 12. Dolezal P, Likic V, Tachezy J, Lithgow T (2006) Evolution of the molecular machines for protein import into mitochondria. Science 313: 314–318.
- 13. Pfanner N, Wiedemann N, Meisinger C, Lithgow T (2004) Assembling the mitochondrial outer membrane. Nat Struct Mol Biol 11: 1044–1048.
- 14. Burri L, Vascotto K, Gentle IE, Chan NC, Beilharz T, et al. (2006) Integral membrane proteins in the mitochondrial outer membrane of Saccharomyces cerevisiae. FEBS J 273: 1507–1515.
- 15. Stojanovski D, Guiard B, Kozjak-Pavlovic V, Pfanner N, Meisinger C (2007) Alternative function for the mitochondrial SAM complex in biogenesis of alpha-helical TOM proteins. J Cell Biol 179: 881–893.
- 16. Peixoto PM, Graña F, Roy TJ, Dunn CD, Flores M, et al. (2007) Awaking TIM22, a dynamic ligand-gated channel for protein insertion in the mitochondrial inner membrane. J Biol Chem 282: 18694–18701.
- 17. Kutik S, Stroud DA, Wiedemann N, Pfanner N (2009) Evolution of mitochondrial protein biogenesis. Biochim Biophys Acta 1790: 409–415.
- 18. Stuart R (2002) Insertion of proteins into the inner membrane of mitochondria: the role of the Oxa1 complex. Biochim Biophys Acta 1592: 79–87.
- 19. Webb CT, Gorman MA, Lazarou M, Ryan MT, Gulbis JM (2006) Crystal structure of the mitochondrial chaperone TIM9.10 reveals a six-bladed alpha-propeller. Mol Cell 21: 123–133.
- 20. Hoppins SC, Nargang FE (2004) The Tim8-Tim13 complex of Neurospora crassa functions in the assembly of proteins into both mitochondrial membranes. J Biol Chem 279: 12396–12405.
- 21. Korndörfer IP, Dommel MK, Skerra A (2004) Structure of the periplasmic chaperone Skp suggests functional similarity with cytosolic chaperones despite differing architecture. Nat Struct Mol Biol 11: 1015–1020.
- 22. Herrmann JM, Kauff F, Neuhaus HE (2009) Thiol oxidation in bacteria, mitochondria and chloroplasts: common principles but three unrelated machineries? Biochim Biophys Acta 1793: 71–77.
- 23. Natale P, Brüser T, Driessen AJ (2008) Sec- and Tat-mediated protein secretion across the bacterial cytoplasmic membrane--distinct translocases and mechanisms. Biochim Biophys Acta 1778: 1735–1756.
- 24. Bogsch EG, Sargent F, Stanley NR, Berks BC, Robinson C, et al. (1998) An essential component of a novel bacterial protein export system with homologues in plastids and mitochondria. J Biol Chem 273: 18003–18006.