Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Heteroplasmy in the Mitochondrial Genomes of Human Lice and Ticks Revealed by High Throughput Sequencing

  • Haoyu Xiong,

    Affiliation Parasitology Section, School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia

  • Stephen C. Barker,

    Affiliation Parasitology Section, School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia

  • Thomas D. Burger,

    Affiliation Parasitology Section, School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia

  • Didier Raoult,

    Affiliation Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes (URMITE), UMR CNRS 6236 IRD 198, Faculté de Mé decine, Mé diterranée Infection, Aix-Marseille Université, Marseille, France

  • Renfu Shao

    rshao@usc.edu.au

    Affiliation GeneCology Research Centre, Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Maroochydore, Queensland, Australia

Correction

2 Apr 2015: The PLOS ONE Staff (2015) Correction: Heteroplasmy in the Mitochondrial Genomes of Human Lice and Ticks Revealed by High Throughput Sequencing. PLOS ONE 10(4): e0123492. https://doi.org/10.1371/journal.pone.0123492 View correction

Abstract

The typical mitochondrial (mt) genomes of bilateral animals consist of 37 genes on a single circular chromosome. The mt genomes of the human body louse, Pediculus humanus, and the human head louse, Pediculus capitis, however, are extensively fragmented and contain 20 minichromosomes, with one to three genes on each minichromosome. Heteroplasmy, i.e. nucleotide polymorphisms in the mt genome within individuals, has been shown to be significantly higher in the mt cox1 gene of human lice than in humans and other animals that have the typical mt genomes. To understand whether the extent of heteroplasmy in human lice is associated with mt genome fragmentation, we sequenced the entire coding regions of all of the mt minichromosomes of six human body lice and six human head lice from Ethiopia, China and France with an Illumina HiSeq platform. For comparison, we also sequenced the entire coding regions of the mt genomes of seven species of ticks, which have the typical mitochondrial genome organization of bilateral animals. We found that the level of heteroplasmy varies significantly both among the human lice and among the ticks. The human lice from Ethiopia have significantly higher level of heteroplasmy than those from China and France (Pt<0.05). The tick, Amblyomma cajennense, has significantly higher level of heteroplasmy than other ticks (Pt<0.05). Our results indicate that heteroplasmy level can be substantially variable within a species and among closely related species, and does not appear to be determined by single factors such as genome fragmentation.

Introduction

Mitochondrial (mt) genomes exist usually in multiple copies per cell and vary in copy number among different cells and organisms. For instance, there are 30 to 40 copies of mt DNA molecules in the sperm of Caenorhabditis elegans whereas ∼250 copies of mt DNA molecules in the oocyte [1]. A mouse sperm has ∼100 copies of mt DNA molecules whereas a mouse oocyte can have up to 100,000 copies of mt DNA molecules [2]. In humans, a sperm has 100 to 1,500 copies of mt DNA molecules whereas an oocyte can have up to 400,000 copies [3], [4], [5], [6]. The multiple copies of mt DNA molecules in a cell or an organism often are not identical in sequence. Heteroplasmy, i.e. variation in mt genome sequence within a cell, an organism or an individual, is common in bilateral animals but usually occurs at very low levels [7], [8], [9], [10], [11].

Heteroplasmy has been most studied in humans in association with mt diseases, aging and cancers; nevertheless, heteroplasmy is present in health individuals too [10], [12], [13], [14]. In arthropods, heteroplasmy in the mitochondrial A+T-rich regions (also called control regions) has been most studied [15], [16], [17]; heteroplasmy in mt tRNA genes and protein-coding genes has also been reported [18], [19], [20]. Andrea et al. (1992) showed deletion heteroplasmies in six protein-coding genes and four tRNA genes in Drosophila subobscura [19]. Leeuwen et al. (2008) showed that heteroplasmy in cytb gene was linked to resistance to insecticide bifenazate in mites; such resistance was inherited in a non-Mendelian manner [20].

For bilateral animals, the mt genomes typically consist of a single circular chromosome that is ∼16 kb in size and has 37 genes [4], [21]. The mt genomes of the human body louse, Pediculus humanus, and the human head louse, Pediculus capitis, however, contain 20 minichromosomes; each minichromosome is 3–4 kb in size and has one to three genes [22], [23]. Herd et al. (2012) showed that the heteroplasmy level in the mt cox1 gene of the human body lice and the human head lice is much higher than that in humans and other animals [24]. At a lower coverage (∼5x) with Sanger sequencing, Herd et al. (2012) found 13 heteroplasmic sites in cox1 gene in human body lice and four heteroplasmic sites in this gene in human head lice, in comparison to only one heteroplasmic site in this gene in humans revealed by high throughput sequencing [9]. Herd et al. (2012) suggested that the high-level heteroplasmy might be linked to the fragmented mt genome organization in the human lice, possibly through increased recombination activities between minichromosomes. In the present study, we investigated further the link between heteroplasmy level and mt genome organization by sequencing in high coverage the entire coding regions (15.4 kb) of all of the mt minichromosomes of six human body lice and six human head lice collected in Ethiopia, China and France with an Illumina HiSeq platform. For comparison, we also sequenced the entire coding regions of the mt genomes of seven species of ticks. Ticks are arthropods, so are human lice. Ticks, however, have the typical mt genome organization of bilateral animals [25], [26], [27], [28], [29]. Comparison of human lice with ticks, thus, will allow us to test whether or not heteroplasmy level is indeed linked to changes in mt genome organization.

Materials and Methods

Ethical statement

The research ethics committees of The University of Queensland and the University of the Sunshine Coast were consulted for the experimental work undertaken. No ethical approval or permits were required for conducting the research reported in this manuscript according the laws and regulations in Australia and Queensland.

Sample collection, DNA extraction, PCR amplification and Illumina sequencing

Total DNA was extracted individually from six human body lice, Pediculus humanus, six human head lice, Pediculus capitis, from Ethiopia, China and France, and seven species of ticks with DNeasy Tissue Kit (QIAGEN) (Table 1). For the human lice, a forward primer (USF1PH1: 5′-GAAATTAAAATTTCAACAAATCTCAACTCG-3′) and a reverse primer (PHR10: 5′-CCCCCCAAGCTATTTATAGCTTGGAGTATTAACGG-3′) were designed from two conserved sequence blocks adjacent to the coding region of each mt minichromosome of the human body louse and the human head louse [19], [23]. These primers were used to amplify the coding regions of the 20 mt minichromosomes of the human body lice and the human head lice. The PCR conditions were: 96°C for 2 minutes; 39 cycles of 98°C for 10 seconds, 62°C for 30 seconds, 72°C for 2.5 minutes; followed by 72°C for 5 minutes and then 25°C for 2 minutes. Each 25-µl PCR reaction contains 0.5 µl La Taq, 2 µl forward primer (10 µM), 2 µl reverse primer (10 µM), 4 µl dNTP (2.5 mM), 2.5 µl La Taq Buffer (10x), 1 µl DNA template, and 13 µl deionized water. Negative controls were always included for contamination check. PCR amplicons were purified and concentrated with the Wizard® SV Gel and PCR Clean-Up System (Promega) and then submitted to the Beijing Genome Institute (BGI) for deep sequencing. Sequencing libraries were constructed for each louse and tick sample at the BGI. For library construction, PCR amplicons were sheared into 180-bp fragments, purified and then end-repaired. Adapters were ligated to both ends of sheared fragments; DNA cluster preparation were then performed and the products were sequenced with an Illumina Hiseq 2000 platform at the BGI (details available at: http://bgitechsolutions.com/service-solutions/services/genomics/de-novo-sequencing; http://support.illumina.com/sequencing/documentation.ilmn).

thumbnail
Table 1. Samples of human lice and ticks used in this study.

https://doi.org/10.1371/journal.pone.0073329.t001

For the tick species, short regions of cox1, cytb and rrnS genes were amplified first and sequenced using Sanger method with arthropod-conserved primers [30], [31]. Specific primers were then designed for each species from these regions, and used in most species with conserved tick primers in cytb and rrnS, designed from known sequences (Table S1). Entire mitochondrial genomes were then amplified in two overlapping fragments for the soft ticks: one fragment from cytb to cox1, and the other from cox1 to cytb. The mt genomes of hard ticks were amplified in three overlapping fragments that span from cytb to cox1, from cox1 to rrnS, and from rrnS to cytb. The long amplicons from the soft and hard ticks were subject to Illumina sequencing. TaKaRa Ex Taq DNA polymerase kit was used to amplify the short cox1, cytb and rrnS fragments (<1 kb). Expand Long Range dNTPack kit (Roche) was used to amplify the long fragments (>1 kb). PCR conditions were optimized for each reaction, with the annealing temperature adjusted to suit the primers used, and the extension time set to one minute per kb of expected amplicon. General PCR conditions for Ex Taq were: 94°C for 60 seconds, followed by 40 cycles of 98°C for 10 seconds, 60°C for 30 seconds, 72°C for 1 minute and a final extension of 72°C for 2 minutes. General PCR conditions for Expand dNTPack were: 92°C for 2 minutes, 10 cycles of 92°C for 10 seconds, 55°C for 15 seconds, 60°C for 8 minutes, followed by 25 cycles of 92°C for 10 seconds 55°C for 15 seconds, 60°C for 8 minutes (increasing by 20 seconds per cycle), and a final extension of 68°C for 7 minutes. PCR products were examined on 1% agarose gel stained with ethidium bromide. DNA Molecular Weight Marker VII (Roche Diagnostics) and Low DNA Mass Ladder (Invitrogen) were used, respectively, to estimate the length and concentration of PCR products. Wizard SV Gel and PCR Clean-up System (Promega) was used to purify PCR products for sequencing. Sanger sequencing was with the ABI Prism BigDye v3.1 Terminator kit (Applied Biosystems) and the Applied Biosystems 3730xl DNA Analyzer at the Australian Genome Research Facility (AGRF). Illumina sequencing was with the HiSeq 2000 platform at the BGI, as stated above for the human lice. The sequence reads from the mitochondrial genomes of the human lice and ticks have been deposited in NCBI Sequence Read Archive (SRA) under the accession number SRR616254 and SRR616964.

Sequence and statistical analyses

Raw Illumina sequence-reads were processed before assembly. The adapter sequences were removed from each sequence read; the reads that contain more than 50% low quality bases (quality value < = 5) were excluded from further analysis. The Q20 bases rate of each pool is above 99.26% (Q20 refers to less than 1% error rate in the sequencing base reads). Processed sequence reads were assembled into contigs with Geneious (Biomatters), using the mitochondrial genome sequences of the human lice and ticks available as references [22], [23], [29]. The assembly parameters were: minimum overlap length 80 bp, and minimum identity 95%. Consensus sequences were made for each contig with the “Generate consensus sequence” function in Geneious with the settings: 1) Threshold: Highest Quality – use chromatogram quality to call the best base; and 2) Assign Quality: Total – Sum quality of contributing bases minus non-contributing bases. Protein-coding genes and rRNA genes were identified with BLAST searches of GenBank [32], [33]. tRNA genes were identified with tRNA-Scan [34] and ARWEN [35]. Heteroplasmy was measured with the “Find SNPs/variations” function in Geneious under the threshold “minimum variant frequency” >1.5%, and “find SNPs only”.

Statistic analyses were conducted to estimate the false positive rate and the false discovery rate for the heteroplasmy called above, according to Storey and Tibshirani (2003) [36]. The false positive rate, indicated as p-value in Storey and Tibshirani (2003), was measured as the probability of a heteroplasmy to be false positive (Pf) under a binomial distribution given the sequence-read coverage of that heteroplasmic site and the sequencing error of the Illumina platform. Pf was calculated with the formula Pf  = C (n,k) * p∧k * (1-p)∧(n-k), where n is the sequence-read coverage of a heteroplasmy called at a given site of a contig; k is the times of that heteroplasmy seen at that site; C(n, k) is n!/(k! * (n-k)!); and p is the sequencing error, 0.5%, of the Illumina platform. The false discovery rate, indicated as q-value in Storey and Tibshirani (2003), was measured as the probability of a statistically significant heteroplasmy to be false positive (Qf). Qf was calculated from the Pf values obtained above with the QValue program [36], [37]. The statistical significance threshold for the heteroplasmy called was set at Pf<1% and Qf<0.1%. Measurement of heteroplasmy was further corrected by excluding those at the sequencing-error hotspots revealed in Nakamura et al. (2011) and Minoche et al. (2011): 1) after four or more identical base calls; and 2) two adjacent SNP variants [38], [39]. One sample T test and Independent samples T test (Pt) were used to compare the mean heteroplasmy levels between any two samples or genes.

Results

Heteroplasmy in the human lice

Twenty contigs were obtained for each of the six human body lice and six human head lice, corresponding to the 20 mt minichromosomes of these lice. The gene content and gene arrangement in each minichromosome of these lice are the same as that reported in Shao et al. (2009, 2012) [22], [23]. At the “minimum variant frequency” >1.5%, Pf<1% and Qf<0.1%, a total of 494 heteroplasmic sites (excluding those at sequencing-error hotspots here and throughout this manuscript) were found in the mt coding regions (15.4 kb) of six human body lice and six human head lice, i.e. an average of 41.1±20.3 (mean ± standard deviation, n = 12) heteroplasmic sites per louse (Table 2). These heteroplasmic sites have an average coverage of ∼6,000x. Of the 494 heteroplasmic sites, 374 sites are in the protein-coding genes, 21 heteroplasmic sites in the rRNA genes, and 99 heteroplasmic sites in the tRNA genes (Table 2, Table S2). Of the 374 heteroplasmic sites in the protein-coding genes, 252 sites are synonymous, i.e. resulting in no amino acid changes, whereas the other 122 sites are nonsynonymous, i.e. resulting in amino acid changes (Table 2, Table S6). There is no significant variation in the level of heteroplasmy among different mt genes in the human lice with the exception of cox1 gene, which has significantly higher level of heteroplasmy (13.3±15.7 sites per kb, n = 12) than other genes (Pt<0.05) (Figure 1; Table 3). Each of the 12 human lice share 15 to 63 heteroplasmic sites with one or more other lice (Figure 2; Table 4; Table S3), indicating part of the heteroplasmic sites are inheritable and have limited conservation within species.

thumbnail
Figure 1. Level of heteroplasmy in the mitochondrial genes of human lice and ticks.

Error bars stand for the standard error of the mean value. Amblyomma cajennense was excluded due to its extremely high level of heteroplasmy relative to other ticks.

https://doi.org/10.1371/journal.pone.0073329.g001

thumbnail
Figure 2. Numbers of shared and unshared heteroplasmic sites in human lice.

Error bars stand for the standard error of the mean value.

https://doi.org/10.1371/journal.pone.0073329.g002

thumbnail
Table 2. Heteroplasmy in the mitochondrial coding regions of human lice and ticks.

https://doi.org/10.1371/journal.pone.0073329.t002

thumbnail
Table 3. Number of heteroplasmic sites in the mitochondrial genes of the human lice.

https://doi.org/10.1371/journal.pone.0073329.t003

thumbnail
Table 4. Number of heteroplasmic sites in the mitochondrial coding regions shared between human body lice and head lice.

https://doi.org/10.1371/journal.pone.0073329.t004

The three body lice and three head lice from Ethiopia have an average of 62.2±10.6 heteroplasmic sites per louse (n = 6), which is significantly higher than that of the lice from China (an average of 33.5±2.1 sites per louse, n = 2) (Pt<0.05) and France (an average of 24.8±5.9 sites per louse, n = 4) (Pt<0.05) (Table 5). Four heteroplasmic sites in four tRNA genes, trnN, trnD, trnT and trnM, are present only in the lice from Ethiopia (n = 6) but not in those from China (n = 2) and France (n = 4) (Table S2). There is no significant difference in the heteroplasmy level among the lice from Ethiopia, nor among those from China and France (Pt>0.05). There is no significant difference either between the lice from China and those from France in the level of heteroplasmy (Pt>0.05).

thumbnail
Table 5. Heteroplasmy in the mitochondrial coding regions of human lice from China, France and Ethiopia.

https://doi.org/10.1371/journal.pone.0073329.t005

Heteroplasmy in the ticks

A single contig was obtained for each of the seven species of ticks, corresponding to the mt chromosomes of these ticks. The mt genomic organization of these ticks is the same as that reported for hard ticks and soft ticks in a number of previous studies [27], [28], [29]. At the “minimum variant frequency” >1.5%, Pf<1% and Qf<0.1%, a total of 261 heteroplasmic sites were found in the seven tick samples, an average of 37.3±22.0 sites per tick (n = 7) (Table 2). These heteroplasmic sites have an average coverage of ∼1,700x. Of the 261 heteroplasmic sites, 221 sites are in the protein-coding genes, 24 sites are in the rRNA genes, and the other 16 sites are in the tRNA genes (Table 2; Table S4). There is no significant variation in the level of heteroplasmy among different mt genes in the ticks (Pt>0.05) (Figure 1; Table 6). Of the 221 heteroplasmic sites in the protein-coding genes, 63 sites are synonymous whereas the rest are nonsynonymous. No heteroplasmic sites were shared between any two species of ticks except one in nad3 shared between Amblyomma cajennense and Otobius megnini, indicating nearly no conservation of heteroplasmy between species. The level of heteroplasmy varies among different species of ticks (Table 7). Amblyomma cajennense has 166 heteroplasmic sites (11.5 sites per kb, n = 1) and is significantly higher in heteroplasmy level than the other six ticks (1.1±0.9 sites per tick per kb, n = 6) (Pt<0.01), and even higher than the human lice (2.9±1.4 sites per louse per kb, n = 12; Pt<0.01).

thumbnail
Table 6. Number of heteropalsmic sites in the mitochondrial genes of ticks.

https://doi.org/10.1371/journal.pone.0073329.t006

thumbnail
Table 7. Heteroplasmy in the mitochondrial coding regions of ticks.

https://doi.org/10.1371/journal.pone.0073329.t007

Discussion

Heteroplasmy, i.e. the presence of more than one type of mitochondrial DNA sequence within an individual, is common in humans, other bilateral animals and plants [7]. Heteroplasmy occurs at a relatively low level in humans, with 32 heteroplasmic sites revealed in a human individual by high throughput sequencing at 16,700x coverage [9]. Studies of other animals and plants with low-coverage sequencing also showed low levels of heteroplasmy [12], [18], [20], [40], [41]. Thus, low level of heteroplasmy has been assumed to be a general feature for eukaryotes.

Herd et al. (2012) identified, however, substantially more heteroplasmic sites in mt cox1 gene in the human lice than in humans, with 13 heteroplasmic sites in this gene in human body lice and four heteroplasmic sites in human head lice, in comparison to one heteroplasmic site in this gene in humans revealed by high throughput sequencing [9]. Herd et al. (2012) suggested that the high level of heteroplasmy in the human lice might be linked to the fragmentation of mt genomes in these lice, possibly through increased recombination activities between minichromosomes [22], [23]. In the present study, we investigated further whether or not changes in mt genome organization is associated with the level of heteroplasmy by sequencing the entire mt coding region (13 protein-coding genes, two rRNA genes and 22 tRNA genes) of six human body lice, six head lice and seven species of ticks with an Illumina HiSeq platform.

To ensure the accuracy of the measurement of the heteroplasmy level from high throughput sequencing data, we set the minimum heteroplasmy-detection threshold to 1.5%, i.e. only the heteroplasmic sites that have a frequency above 1.5% are counted; sites with a frequency below 1.5% are discarded to exclude sequencing errors. As a general guideline, the sequencing-error rate of the Illumina HiSeq platform is ∼0.5% or less [9], [42], [43]. Our threshold-setting to 1.5% is consistent with those in He et al. (2010) and Doan et al. (2012), which set the thresholds to 1.6% and 1.5% respectively (i.e. triple the sequencing-error rate) [9], [44]. Furthermore, we set the statistical significance threshold at Pf<1.0% and Qf<0.1%, i.e. the probability of a heteroplasmy we called to be false positive is smaller than 1.0%, and the probability of a statistically significant heteroplasmy we called to be false positive is smaller than 0.1% (Table S5). We also excluded the heteroplasmy that fall into sequencing-error hotspots for Ilumina Hiseq platform revealed by Nakamura et al. (2011) and Minoche et al. (2011) [38], [39]. By all these measures, we were able to catch the full spectrum of the heteroplasmy while minimize the influence of high throughput sequencing errors.

We should point out that the parameters we used above to call heteroplasmy from high throughput sequence data, i.e. minimum detection threshold >1.5%, Pf<1.0% and Qf<0.1%, are arbitrary. Indeed, both more relaxed and more stringent parameters can be used, which may increase or decrease the total number of heteroplasmic sites called from the data. The choice of these parameters, however, will not affect the conclusion for the comparison of the heteroplasmy levels between any two samples as long as the same set of parameters are applied. For instance, we also tested minimum detection threshold >1.0%, >3.0% and >5.0%. These three parameters changed substantially the cut-off sequence-read coverage at which heteroplasmy is called (Figure 3) but did not change the conclusion made when minimum detection threshold >1.5% was used.

thumbnail
Figure 3. Plots of false positive rate (Pf) against heteroplasmic-site coverage at minimum variant frequency 1.0%, 1.5%, 3.0% and 5.0% respectively.

https://doi.org/10.1371/journal.pone.0073329.g003

Despite all having fragmented mt genomes, the human lice from Ethiopia have significantly higher level of heteroplasmy (62.2±10.6 per louse, n = 6) than those from China (33.5±2.1 sites per louse, n = 2; Pt<0.05) and France (24.8±5.9 sites per louse, n = 4; Pt<0.05), indicating substantial variation in heteroplasmy level among different populations of human lice. The same is true for the seven species of ticks. Although all of these ticks have the typical mt genomes of animals with all genes on a single chromosome, Amblyomma cajennense has significantly more heteroplasmic sites (166, n = 1) than the other six species of ticks (15.8±13.6 sites per tick, n = 1; Pt<0.05) (Table 7). Fragmentation of mt genome in the human lice does not appear to have any bearing on the level of heteroplasmy as the two human lice from China have similar level of heteroplasmy (33.5±2.1 per louse) as humans, 32 heteroplasmic sites reported in He et al. (2010) [9] and 34 sites in Li et al. (2012) [45], whereas the four human lice from France have slightly lower level of heteroplasmy (24.8±5.9 sites per louse) than humans (Pt>0.05). On the other hand, the tick, Amblyomma cajennense, has significantly higher level of heteroplasmy (166, n = 1) even than the human lice from Ethiopia (62.2±10.6 per louse, n = 6; Pt<0.05). With the exception of Amblyomma cajennense, the other six species of ticks have significant lower level of heteroplasmy (15.8±13.6 sites per tick, n = 1; Pt<0.05) than both the human lice and humans, reconfirming the view that mt genome organization has no direct influence on the level of heteroplasmy.

In conclusion, our results showed substantial variation in heteroplasmy level in mt genomes among human lice from different countries and among different species of ticks. Thus, we can reject the possible link between the elevated level of heteroplasmy and the fragmented mt genome organization in the human lice. Many questions, however, remain to be answered. Why does cox1 have more heteroplasmy than other genes in human lice (but not in ticks)? Why do human lice from Ethiopia have higher levels of heteroplasmy than those from China and France? Why does Amblyomma cajennense have more heteroplasmy than other species of ticks? Are there any genomic, biochemical or life history factors that may contribute to the level of heteroplasmy? And more importantly, what does the varying level of heteroplasy mean to the fitness and evolution of individuals, populations and species? With the data generated in the current study, we cannot answer these questions yet. Both lice and ticks are ectoparasites of vertebrates. Parasitic lifestyle, however, does not appear to have any role in the variation of heteroplasmy level as there is clear difference in heteroplasmy level between lice and ticks and within each of them although all of them have the parasitic lifestyle. Recombination between mt minichromosomes observed in human lice does not appear to contribute much to heteroplasmy variation either as heteroplasmic sites are not limited to recombinant regions but spread across minichromosomes. More studies are needed to look into the issue of heteroplasmy in eukaryotes with deep sequencing technologies and to address the questions raised above in the current study.

Supporting Information

Table S1.

Conserved and specific primers used in this study.

https://doi.org/10.1371/journal.pone.0073329.s001

(DOC)

Table S2.

Heteroplasmic sites in mitochondrial tRNA genes of human body lice and human head lice.

https://doi.org/10.1371/journal.pone.0073329.s002

(DOC)

Table S3.

Shared heteroplasmic sites in human lice.

https://doi.org/10.1371/journal.pone.0073329.s003

(DOC)

Table S4.

Heteroplasmic sites in mitochondrial tRNA genes of ticks.

https://doi.org/10.1371/journal.pone.0073329.s004

(DOC)

Table S5.

Pf and Qf values for heteroplasmic sites at different sequence-read coverage (minimum variant 1.5%, sequencing-error rate 0.5%, π0 = 0.05).

https://doi.org/10.1371/journal.pone.0073329.s005

(DOC)

Table S6.

Putative amino acid changes caused by heteroplasmy in the 12 human lice and 7 ticks. AA stands for amino acid. “+” stands for positive charge. “−” stands for negative charge.

https://doi.org/10.1371/journal.pone.0073329.s006

(DOC)

Acknowledgments

We thank Kate Herd for assistance with PCR experiments and Tianfang Wang for assistance with QValue analysis. We thank the anonymous reviewers for comments that have greatly improved this paper. We thank Lidia Chitimia for the H. parva; Matsuo Kobayashi and Kyoko Sawabe for the H. formosensis; Dennis Poppi and Wayne Jovgenson for the R. microplus from Cambadia; Paulo Batista and Renato Andreaffi for the R. geigyi and O. megnini. We thank Tusunjiang for helping R.S. collect human lice in China and Kassahun Desalegn Bilcha, Jemal Ali and Richard Pollack for helping S.C.B collect human lice in Ethiopia.

Author Contributions

Conceived and designed the experiments: HX RS. Performed the experiments: HX TB RS. Contributed reagents/materials/analysis tools: DR SB RS. Wrote the paper: HX RS.

References

  1. 1. Lemire B (2005) Mitochondrial genetics. WormBook, ed. The C. elegans Research Community, WormBook. doi/10.1895/wormbook.1.25.1, http://www.wormbook.org.
  2. 2. Rantanen A, Larsson N (2000) Regulation of mitochondrial DNA copy number during spermatogenesis. Hum Reprod 15: 86–91.
  3. 3. Lightowlers RN, Chinnery PF, Turnbull DM, Howell N (1997) Mammalian mitochondrial genetics: heredity, heteroplasmy and disease. Trends Genet 13: 450–455.
  4. 4. Boore JL (1999) Animal mitochondrial genomes. Nucleic Acids Res 27: 1767–1780.
  5. 5. Song GJ, Lewis V (2008) Mitochondrial DNA integrity and copy number in sperm from infertile men. Fertil Steril 90: 2238–2244.
  6. 6. Chiaratti MR, Meirelles FvV (2010) Mitochondrial DNA copy number, a marker of viability for oocytes. Biol Reprod 83: 1–2.
  7. 7. Kmiec B, Woloszynska M, Janska H (2006) Heteroplasmy as a common state of mitochondrial genetic information in plants and animals. Curr Genet 50: 149–159.
  8. 8. Light JE, Allen JM, Long LM, Carter TE, Barrow L, et al. (2008) Geographic distributions and origins of human head lice (Pediculus humanus capitis) based on mitochondrial data. J Parasitol 94: 1275–1281.
  9. 9. He Y, Wu J, Dressman DC, Iacobuzio-Donahue C, Markowitz SD, et al. (2010) Heteroplasmic mitochondrial DNA mutations in normal and tumor cells. Nature 25: 610–614.
  10. 10. Goto H, Dickins B, Afgan E, Paul IM, Taylor J, et al. (2011) Dynamics of mitochondrial heteroplasmy in three families investigated via a repeatable re-sequencing study. Genome Biol 12: R59.
  11. 11. Payne BA, Wilson IJ, Yu-Wai-Man P, Coxhead J, Deehan D, et al. (2012) Universal heteroplasmy of human mitochondrial DNA. Hum Mol Genet 22: 384–390.
  12. 12. Chinnery PF, Samuels DC, Elso J, Turnbull DM (2002) Accumulation of mitochondrial DNA mutations in ageing, cancer, and mitochondrial disease: is there a common mechanism? Lancet 360: 1323–1325.
  13. 13. Santos C, Sierra B, Alvarez L, Ramos A, Fernández E, et al. (2008) Frequency and pattern of heteroplasmy in the control region of human mitochondrial DNA. J Mol Evol 67: 191–200.
  14. 14. Legros F, Malka F, Frachon P, Manuel Rojo AL (2012) Organization and dynamics of human mitochondrial DNA. J Cell Sci 125: 2653–2663.
  15. 15. Boyce TM, Zwick ME, Aquadro CF (1989) Mitochondrial DNA in the bark weevils: Size, structure and heteroplasmy. Genetics 123: 825–836.
  16. 16. Nardi F, Caraoelli A, Fanciulli PP, Dallai R, Frati F (2001) The Complete mitochondrial DNA sequence of the basal hexapod Tetrodontophora bielanensis: Evidence for heteroplasmy and tRNA translocations. Mol Biol Evol 18: 1293–1304.
  17. 17. Marcadé I, Cordaux R, Doublet V, Debenest C, Bouchon D, et al. (2007) Structure and evolution of the atypical mitochondrial genome of Armadillidium vulgare (Isopoda, Crustacea). J Mol Evol 65: 651–659.
  18. 18. Kann LM, Rosenblum EB, Rand DM (1997) Aging, mating, and the evolution of mt DNA heteroplasmy in Drosophila melanogaster. Proc Nati Acad Sci U S A 95: 2372–2377.
  19. 19. Volz-Lingenhöhl A, Solignac M, Sperlich D (1992) Stable heteroplasmy for a large-scale deletion in the coding region of Drosophila subobscura mitochondrial DNA. Proc Nati Acad Sci U S A 89: 11528–32.
  20. 20. Leeuwen TV, Bsartel V, Pottelberge SV, Nieuwenhjuyse PV, Nauen R, et al. (2008) Mitochondrial heteroplasmy and the evolution of insecticide resistance: Non-Mendelian inheritance in action. Proc Nati Acad Sci U S A 105: 5980–5985.
  21. 21. Lavrov DV (2007) Key transitions in animal evolution: a mitochondrial DNA perspective. Integr Comp Biol 47: 734–743.
  22. 22. Shao R, Kirkness EF, Barker SC (2009) The single mitochondrial chromosome typical of animals has evolved into 18 minichromosomes in the human body louse, Pediculus humanus. Genome Res 19: 904–912.
  23. 23. Shao R, Zhu XQ, Barker SC, Herd K (2012) Evolution of extensively fragmented mitochondrial genomes in the lice of humans. Genome Biol Evol 4: 1088–1101.
  24. 24. Herd K, Barker SC, Shao R (2012) High-level of heteroplasmy in the mitochondrial cox1-minichromosome of the human body louse, Pediculus humanus, and the human head louse, Pediculus capitis. Open Genomics J 5: 14–17.
  25. 25. Black WC, Roehrdanz RL (1998) Mitochondrial gene order is not conserved in arthropods: prostriate and metastriate tick mitochondrial genomes. Mol Biol Evol 15: 1772–1785.
  26. 26. Shao R, Aoki Y, Mitani H, Tabuchi N, Barker SC, et al. (2004) The mitochondrial genomes of soft ticks have an arrangement of genes that has remained unchanged for over 400 million years. Insect Mol Biol 13: 219–224.
  27. 27. Shao R, Barker SC, Mitani H, Aoki Y, Fukunaga M (2005) Evolution of duplicate control regions in the mitochondrial genomes of metazoa: a case study with Australasian Ixodes ticks. Mol Biol Evol 22: 620–629.
  28. 28. Shao R, Barker SC (2007) Mitochondrial genomes of parasitic arthropods: implications for studies of population genetics and evolution. Parasitology 134: 153–167.
  29. 29. Burger TD, Shao R, Beati L, Miller H, Barker SC (2012) Phylogenetic analysis of ticks (Acari: Ixodida) using mitochondrial genomes and nuclear rRNA genes indicates that the genus Amblyomma is polyphyletic. Mol Phylogenet Evol 64: 45–55.
  30. 30. Kambhampati S, Smith PT (1995) PCR primers for the amplification of four insect mitochondrial gene fragments. Insect Mol Biol 4: 233–236.
  31. 31. Simon C, Frati F, Beckenbach A, Crespi B, Liu H, et al. (1994) Evolution, weighting, and phylogenetic utility of mitochondrial gene-sequences and a compilation of conserved polymerases chain reaction primers. Ann Entomol Soc Am 87: 651–701.
  32. 32. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215: 403–410.
  33. 33. McGinnis S & Madden TL (2004) BLAST: at the core of a powerful and diverse set of sequence analysis tools. Nucleic Acids Res 32: 20–25.
  34. 34. Lowe TM & EddySR (1997) tRNAscan-SE: A Program for Improved Detection of Transfer RNA Genes in Genomic Sequence. Nucleic Acids Res 25: 955–964.
  35. 35. Laslett D & Canback B (2008) ARWEN: a program to detect tRNA genes in metazoan mitochondrial nucleotide sequences. Bioinformatics 24(2): 172–175.
  36. 36. Storey JD (2002) A direct approach to false discovery rates. J R Stat Soc Series B 64: 479–498.
  37. 37. Storey JD, Tibshirani R (2003) Statistical significance for genome-wide studies. Proc Nati Acad Sci U S A 100: 9440–9445.
  38. 38. Nakamura K, Oshima T, Morimoto T, Ikeda S, Yoshikawa H, et al. (2011) Sequence-specific error profile of Illumina sequencers. Nucleic Acids Res 39: e90.
  39. 39. Minoche A, Dohm JC, Himmelbauer H (2011) Evaluation of genomic high-throughput sequencing data generated on Illumina HiSeq and Genome Analyzer systems. Genome Biol 12: R112.
  40. 40. Sachadyn P, Zhang XM, Clark LD, Naviaux RK, Heber-Katz E (2008) Naturally-Occurring Mitochondrial DNA heteroplasmy in the MRL mouse. Mitochondrion 8: 358–366.
  41. 41. Messenger LA, Llewellyn MS, Bhattacharyya T, Franze O, Lewis MD, et al. (2012) Multiple mitochondrial introgression events and heteroplasmy in Trypanosoma cruzi revealed by maxicircle MLST and next generation sequencing. PLoS Negl Trop Dis 6: 1584–1596.
  42. 42. Quinlan AR, Stewart DA, Stromberg MP, Marth GT (2008) Pyrobayes: Animproved base caller for SNP discovery in pyrosequences. Nat Methods 5: 179–181.
  43. 43. Sosa MX, Sivakumar KA, Maragh S, Veeramachaneni V, Hariharan R, et al. (2012) Next-generation sequencing of human mitochondrial reference genomes uncovers high heteroplasmy frequency. PLoS Comput Biol 8: e1002737.
  44. 44. Doan R, Cohen ND, Sawyer J, Ghaffari N, Johnson CD, et al.. (2012) Whole-genome sequencing and genetic variant analysis of a quarter Horse mare. BMC Genomics 13: doi:10.1186/1471-2164-13-78.
  45. 45. Li MK, Schönberg A, Schaefer M, Schroeder R, Nasidze I, et al. (2010) Detecting heteroplasmy from high-throughput sequencing of complete human mitochondrial DNA genomes. Am J Hum Genet 87: 237–249.