Coniothyrium glycines, the causal agent of soybean red leaf blotch, is a USDA APHIS-listed Plant Pathogen Select Agent and potential threat to US agriculture. Sequencing of the C. glycines mt genome revealed a circular 98,533-bp molecule with a mean GC content of 29.01%. It contains twelve of the mitochondrial genes typically involved in oxidative phosphorylation (atp6, cob, cox1-3, nad1-6, and nad4L), one for a ribosomal protein (rps3), four for hypothetical proteins, one for each of the small and large subunit ribosomal RNAs (rns and rnl) and a set of 30 tRNAs. Genes were encoded on both DNA strands with cox1 and cox2 occurring as adjacent genes having no intergenic spacers. Likewise, nad2 and nad3 are adjacent with no intergenic spacers and nad5 is immediately followed by nad4L with an overlap of one base. Thirty-two introns, comprising 54.1% of the total mt genome, were identified within eight protein-coding genes and the rnl. Eighteen of the introns contained putative intronic ORFs with either LAGLIDADG or GIY-YIG homing endonuclease motifs, and an additional eleven introns showed evidence of truncated or degenerate endonuclease motifs. One intron possessed a degenerate N-acetyl-transferase domain. C. glycines shares some conservation of gene order with other members of the Pleosporales, most notably nad6-rnl-atp6 and associated conserved tRNA clusters. Phylogenetic analysis of the twelve shared protein coding genes agrees with commonly accepted fungal taxonomy. C. glycines represents the second largest mt genome from a member of the Pleosporales sequenced to date. This research provides the first genomic information on C. glycines, which may provide targets for rapid diagnostic assays and population studies.
Citation: Stone CL, Frederick RD, Tooley PW, Luster DG, Campos B, Winegar RA, et al. (2018) Annotation and analysis of the mitochondrial genome of Coniothyrium glycines, causal agent of red leaf blotch of soybean, reveals an abundance of homing endonucleases. PLoS ONE 13(11): e0207062. https://doi.org/10.1371/journal.pone.0207062
Editor: Cecile Fairhead, Institut de Genetique et Microbiologie, FRANCE
Received: September 27, 2017; Accepted: October 24, 2018; Published: November 7, 2018
This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Data Availability: The nucleotide sequence is available as GenBank accession number MH337273. Sequence reads are available from the GenBank sequence read archive SRR7245480.
Funding: The genome referenced in this publication was sequenced at MRIGlobal through funding provided by U.S. Department of Homeland Security, Science & Technology Directorate through Contract No. HSHQDC-13-C-B0009 "Capturing Global Biodiversity of Pathogens by Whole Genome Sequencing". The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. MRIGlobal provided support in the form of salaries for authors (BC and RAW), but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the "author contributions" section.
Competing interests: BC and RAW are employees of MRIGlobal (www.mriglobal.org). The commercial affiliation of BC and RAW does not alter our adherence to all PLOS ONE policies on sharing data and materials, and there are no patents, products in development or marketed products to declare. All other authors declare no competing interests.
Coniothyrium glycines (R.B. Stewart) Verkely & Gruyter is a soilborne pathogen that infects soybeans and the perennial soybean, Neonotonia wightii, causing lesions on foliage, petioles, pods and stems and eventual defoliation and premature senescence . C. glycines produces melanized sclerotia that can germinate to either form infectious mycelia or produce pycnidia that in turn produce infectious conidia. The pathogen is spread locally via rain/water splash and human or animal movement, which scatter sclerotia and conidia onto neighboring plants. Leaf drop of infected leaves delivers sclerotia and pycnidia to the soil where they serve as sources of secondary inoculum. Sclerotia may also remain in the soil and restart the cycle of infection in the next growing season. There is no evidence that the fungus is seed-borne, but spread might occur from infected plant debris mixed in with untreated seed or through movement of contaminated soil.
The disease red leaf blotch (RLB) occurs predominantly in central and southern Africa  and the incidence of the disease has increased concomitantly with increased soybean production in regions where the pathogen is found. Yield losses of up to 50% have been reported in Zambia and Zimbabwe . While it does not currently occur within the United States, the ability of sclerotia to survive high temperatures and dry conditions suggest it could survive in soybean growing regions of the southern United States . As a result, the Secretary of Agriculture has determined that C. glycines poses a significant risk to U.S. agriculture, and the pathogen is listed by USDA-APHIS as a Plant Pathogen Select Agent under 7 CFR, part 331 . Additionally, while C. glycines has been found to naturally infect only soybean and N. wightii, there is no evidence as to the pathogen’s potential ability to infect other leguminous species, such as cultivated peanut and native, wild legumes that occur in the USA.
In the early stages of disease development, RLB may not be readily distinguished from other foliar soybean diseases such as Alternaria leaf spot, brown spot, or target spot. Current methods to identify C. glycines require time-consuming examination of morphological characteristics and temperature requirements. No molecular diagnostic assay currently exists to identify C. glycines. The examination of genomic sequences such as the mtDNA may provide targets for the development of diagnostic tools and also may provide insight into the mechanisms of disease resistance.
Phylogenetic analysis of the mtDNA will also be useful to clarify the taxonomy of this fungus. RLB was first observed on soybean in Ethiopia in 1955 and, based on the morphology of the pycnidial state, the causal fungus was identified as Pyrenochaeta glycines . In 1964, Dactuliophora glycines was described as the cause of a leaf spot disease, and was subsequently identified as the sclerotial state of P. glycines . Hartman and Sinclair  established the genus Pyrenochaeta to accommodate these synanamorphs. The fungus was re-classified as Phoma glycinicola in 2002 based on morphological characteristics, and most recently was again re-classified as Coniothyrium glycines (R.B. Stewart) Verkely & Gruyter based on sequence analysis of regions of the ITS, SSU, LSU . The mt genomes of only eight other members of the class Dothidiomycete, which includes several economically important plant pathogens such as the wheat pathogen, Stagonospora nodorum, and wheat leaf blotch, Zymoseptoria tritici (M. graminicola), can currently be found in GenBank. Six of these also share membership in the order Pleosporales with C. glycines. Comparison of the mt genome of C. glycines with the mt genome of these other eight fungi may help support or clarify the recent re-classification of C. glycines, as mitochondrial genomes are considered to be effective tools for evolutionary studies because they evolve independently of and at an accelerated rate from nuclear genomes .
This study provides the complete mitochondrial genome of a pathogenic fungus identified as a USDA-APHIS Plant Pathogen Select Agent due to its potential impact on soybean production. Previously, the only genomic data available were specific sequences used in phylogenetic analysis of Phoma and Septoria spp . This sequence data may provide targets for the development of a rapid diagnostic assay and will help further clarify the evolving fungal taxonomy of the genus.
Materials and methods
Fungal isolate, library construction, and sequence assembly
C. glycines-infected leaves were collected from soybean at the Rattray Arnold Research Station, Harare, Zimbabwe in March 2005 and shipped to the USDA-ARS Foreign Disease-Weed Science Research Unit at Fort Detrick, MD under Animal and Plant Health Inspection Service permit. Isolate Pg-21 was recovered from the leaves and maintained on 20% V8-juice agar at 20°C in the dark. A 10% V8-juice broth was seeded with agar plugs containing mycelium of Pg-21 and grown for several weeks in the dark at 20°C without shaking. Tissue was collected through vacuum filtration onto Whatman No. 1 filter paper in a Buchner funnel. Total DNA was extracted using the DNeasy Plant Mini kit (Qiagen, Germantown, MD). Culture identification was confirmed through sequencing of ITS fragments.
The mt genome was sequenced as part of a whole genome sequencing project with Illumina sequence libraries prepared using Nextera XT. Whole genome 2×300 paired-end sequencing was performed using Illumina MiSeq instrument. Reads were filtered and trimmed using Trimmomatic v.0.32 . The iMetAMOS pipeline v. 1.5 was used to optimize de novo assembly and perform quality checks. Elements of the pipeline include FastQC v. 0.10.0; Spades v. 3.1.1; IDBA v. 1.1.1; KmerGenie v. 1.6741; and QUAST v. 2.2 . Resulting assemblies were polished using Pilon v. 1.8 . Samtools v. 1.1 and BLAST were used to remove low coverage and contaminating contigs. Initial shotgun assembly produced 1431 contigs greater than 1kb in size, with a median size of 11kb and median depth of coverage of 274X. Contig 76 was identified as an outlier with a size of 98,482 bp and average depth of coverage of 1542X. Discontinuous MegaBLAST searches revealed homology with fungal mt genome sequences. Finishing of the mt sequence was performed using CLC Genomics Workbench Genome Finishing Module (Qiagen, Germantown, MD), mapping raw Illumina reads back to contig 76, correcting assembly errors, and extending the contig ends.
The MFannot tool (http://megasun.bch.umontreal.ca/cgi-bin/mfannot/mfannotInterface.pl) was used to annotate the mt genome using genetic code 4 . Annotation of open reading frames (ORFs) was reviewed and revised by BLAST homology searches against the NCBI protein database . tRNAs were further evaluated against output from tRNAscan-SE, Dogma (Dual Organellar GenoMe Annotator), and ARAGORN . RNAweasel was used to classify identified introns as group I or group II introns . Repeats were identified and analyzed with the Tandem Repeats Finder  and Palindrome and Einverted EMBOSS programs . Codon usage for concatenated ORFs of twelve protein-coding genes was determined using the codon usage tool at http://www.bioinformatics.org/sms2/codon_usage.html with genetic code 4 . The physical map of the Coniothyrium mtDNA was constructing using SnapGene Viewer (GSL Biotech; available at snapgene.com). The complete mt sequence of C. glycines isolate Pg-21 has been deposited in GenBank under the accession number MH337273.
The complete mt genomes of the eight fungi belonging to the Dothidiomycetes were retrieved from GenBank (Bipolaris cookei, MF784482; Didymella pinodes, NC_029396; Parastagonospora nodorum, NC_009746; Pithomyces chartarum, KY792993; Shiraia bambusicola, NC_026869; Stemphylium lycopersici, KX453765; Zasmidium cellare, NC_030334; and Zymoseptoria tritici, NC_010222.) Mitochondrial gene content and gene order of C. glycines was compared visually to these eight fungi. Nineteen additional complete mt genomes were retrieved from GenBank for a comparison of general features, including size, GC content, core protein coding genes, rRNAs, and tRNAs, and the presence of introns.
Amino acid sequences of the twelve protein-coding genes shared in common among 25 fungal mt genomes were each aligned with MUSCLE from EMBL-EBI , and amino acids sharing low similarity were removed by Gblocks . Sequences were concatenated using Seaview . A maximum likelihood tree of aligned sequences was constructed with PhyML 3.0 using LG as the evolutionary model . Branch support was assessed using the PhyML default of aLRT test (SH-Like).
Gene content and genome organization
The mt genome of C. glycines is a circular molecule with a length of 98,533 bp (Fig 1). The sequence is AT-rich with an overall G + C content of 29.01%, and 28.9% in the coding regions of the protein-coding genes. The RNA genes had a higher GC content of 35.1% while the intergenic spacers had a lower GC content of 24.8%.
Black blocks, grey blocks, hatched blocks, stipled blocks, and bars show, respectively, protein-coding, orfs, rRNA, introns, and tRNA genes. Arrows indicate the direction of transcription.
Protein-coding genes of the mt genome included one gene encoding for ATP-synthase complex F0 subunit (atp6), three cytochrome oxidase subunits (cox1, cox2, cox3), seven nicotinamide adenine dinucleotide ubiquinone oxireductase subunits (nad1-6, nad4L), cytochrome b (cob), one ribosomal protein (rps3), and four hypothetical proteins (orf208, orf284, orf929, and orf1407) (Fig 1 and Table 1). The mt genome also encodes for small and large subunit ribosomal RNAs (rns and rnl) and 30 tRNAs (Fig 1 and Table 1). Genes were transcribed from both DNA strands. The cox1 and cox2 genes were adjacent to each other with no intergenic spacers. Similarly, nad2 and nad3 were adjacent with no intergenic spacers and nad5 is immediately followed by nad4L with an overlap of one base (Fig 1 and Table 1).
Within the intergenic spacers, four open reading frames (orf208, orf284, orf929, and orf1407) were found (Fig 1 and Table 1). Putative functions could be assigned to three of the ORFs: orf1407 encodes a putative DNA polymerase type B, orf929 encodes a putative DNA-dependent RNA polymerase, and orf208 encodes a putative GIY-YIG endonuclease protein. All three showed similarity to relevant sequences in other fungi and possessed conserved domain motifs. Only orf284 contained no conserved motifs and could not be assigned a putative function, but showed similarity to hypothetical proteins from whole genome shotgun sequencing of Bipolaris maydis and B. zeicola. An additional GIY-YIG endonuclease motif was identified in the intergenic spacer between the rnl and atp6. This region showed similarity to endonucleases from other fungi, however no clear ORF could be identified suggesting that this may represent a degenerate endonuclease. Only 14.4% of the mt sequence is comprised of intergenic spacers.
Within the intergenic spacers, 10 perfect or near identical tandem repeats were identified ranging in size from 12–62 bp and with 2–5 copies (S1 Table). In addition, fifteen palindromes were identified ranging in size from 10–15 bp. A single inverted repeat of 30 bp was found.
Introns made up 54.1% of the mt genome with a total of 32 introns identified within 8 of the protein-coding genes and the rnl (Fig 1 and Table 2). Thirty of the introns were classified as group I introns. One intron was classified as a group II intron (intron3 of the rnl) and one intron could not be definitively classified (intron2 of cox2). Eighteen of the identified introns were determined to contain putative intronic ORFs with either GIY-YIG or LAGLIDADG homing endonuclease (HE) motifs. An additional eleven introns showed evidence of truncated or degenerate HE motifs and one possessed degenerate N-acetyl-transferase domains. Only two introns had no identifiable ORFs and BLAST analysis revealed no homology in the NCBI protein database. All putative HEs showed significant similarity to those found in the mt genomes of other fungi and most were identified in other members of the Pezizomycotina subphylum. However, each was unique within C. glycines, showing no similarity to other intronic ORFs within the mt genome.
The cox1 gene was the most common site for intron insertion, possessing ten of the 32 identified introns. Each of the ten introns also possessed either complete or degenerative putative HEs. Of these ten, only five were found to have high sequence identity to annotated introns found in the same location in the cox1 gene of the other Pleosporales. However, no other member of the Pleosporales possessed all five introns in common. The GIY-YIG HE of intron1 of cox1 showed 87% and 88% nucleotide identity to the corresponding introns of D. pinodes and P. chartarum, respectively. However, there was not a corresponding HE in the mt genomes of the other four Pleosporales species. Likewise, cox1 intron4, containing a LAGLIDADG HE, showed 88% nucleotide identity to the corresponding intron in B. cookei, but was found in no other Pleosporales species. The remaining five introns showed varying degrees of identity with introns from the mt genomes of more distantly related fungi, such as intron8 which showed 85% nucleotide identity with an intron from the corresponding location in Sclerotinia sclerotiorum (S2 Table).
The 2041-bp intron2 of cox1 has two regions with partial LAGLIDADG HE domains that showed 95–97% nucleotide identity with the 1208bp intron that occurs in the same position in the cox1 gene of D. pinodes. However, the central 1200 bp region of cox1 intron2 possessed a truncated GIY-YIG HE domain with no significant nucleotide similarity to any other fungus (S2 Table). This central region does show amino acid identity with a GIY-YIG HE located within an intron from the cob gene of the more distantly-related Chrysoporthe deutercubensis (Table 2).
While most introns showed nucleotide identity with introns inserted into the same gene in other fungi, nad4L intron1 shared identity with free standing orfs in S. sclerotiorum and P. nodorum. One intron, nad1 intron2, showed no nucleotide identity with other species from the Ascomycota, but rather showed identity with introns from two members of the Basidiomycota. This intron showed identity with an intron from the nad1 gene of Moniliophthora roreri and an intron from the cox1 gene of Fomitopsis palustris.
Codon usage and tRNA genes
Codon usage, summarized in S3 Table, shows a bias towards AT-rich codons, which reflects the high AT content of the C. glycines mt genome. Most protein coding genes start with the canonical translation initiation codon ATG with the exception of cox2 and orf1407, which appear to utilize UUA and AUA start codons, respectively. The preferred stop codon in the mt genome was TAA, occurring in 12 genes. The alternative stop codon TAG occurs in 3 genes. A traditional stop codon could not be identified for cox1. This absence, combined with the location of cox1 adjacent to cox2 with no intergenic spacers, suggested the possibility of a fused cox1-cox2 polyprotein rather than two separate proteins. Thirty tRNAs were identified and twenty of them occurred in two large clusters around the rnl, while five occurred singly between mt genes (Fig 1). The tRNAs occurred on both DNA strands.
Comparative genomics and phylogenetic analysis
Comparison of the mt genome of C. glycines with those from eight other members of the Dothidiomycetes revealed that in all nine species genes are encoded on both mtDNA strands. Comparison also found some conservation of gene order, most notably within the Order Pleosporales (Fig 2). In all nine species, nad4L and nad5 were adjacent, and in all but P. nodorum there are no intergenic spacers but rather a one base pair overlap between the two genes. Within C. glycines and the six members of the Pleosporales, cox1 and cox2 were also adjacent with no intergenic spacers. Three members of the Pleosporales possess a conserved gene block of nad5, nad4L, nad3, and nad2. C. glycines shows the same gene order, however the block is disrupted by insertion of cob, nad1, and nad4 between nad4L and nad3. C. glycines and the other Pleosporales species also lack the atp8 and atp9 genes which are typically found in fungal mt genomes, while both Capnodiales species possess both genes.
Asterisk (*) indicates reverse direction of transcription. Each gene is assigned a separate color. Gene order was obtained from GenBank: Bipolaris cookei (MF784482), Didymella pinodes (NC_029396), Parastagonospora nodorum (NC_009746), Pithomyces chartarum (KY792993), Shiraia bambusicola (NC_026869), Stemphylium lycopersici (KX453765), Zasmidium cellare (NC_030334), and Zymoseptoria tritici (NC_010222).
All nine species also exhibit large clusters of tRNA genes around the rnl, and within the Pleosporales tRNA order is maintained as well. The conservation of gene and tRNA order is expanded among the Pleosporales, with six of the seven possessing a nad6-rnl-atp6 gene block with associated conserved tRNA cluster patterns (Table 3). P. chartarum possesses a similar gene block and tRNA cluster pattern, but the atp6 is displaced relative to the other Pleosporales. This conservation of tRNA gene order is carried to a lesser extent to the Capnodiales.
Comparison of the mt genome of C. glycines and the other Dothidiomycetes with those of an additional 19 ascomycetous fungal species revealed several potentially distinguishing characteristics of this class. Of the 25 mt genomes compared, fifteen carry all genes on the same strand of DNA and an additional four mt genomes show the core coding genes encoded on the same strand with only tRNAs or hypothetical proteins encoded in the opposite direction (S4 Table). However, all nine members of the Dothidiomycetes contain genes distributed on both mtDNA strands. Also, while ribosomal protein S3 or S5 occurs within an intron of the rnl in 17 of the 25 species examined, among the Pleosporales rps3/rps5 occurs as a free standing ORF and the gene appears to be absent from the two Capnodiales species (Table 4). Additionally, while atp8 and atp9 are absent from the Pleosporales species, both are found in the other species with the exception of Pseudogymnoascus pannorum which lacks only atp9 (Table 4). The proximity of cox1 and cox2, also characteristic of the Pleosporales examined to date, is not apparent among the other ascomycetous species.
Several similarities across the species were revealed as well. The G+C content is consistent among all species, ranging from 23–32%, with the exception of Pyronema omphalodes with 43%, and all show some tRNA clustering around the rnl. In all but four species, nad4L and nad5 are adjacent with either no intergenic spacer or a single base pair overlap (S4 Table).
The size of the mt genome and the presence of introns varies across all species, ranging from 23743 bp in Z. cellare with no introns to 203051 bp in Sclerotinia borealis with 61 introns. In general, a larger number of introns is reflected in a larger genome size (Table 4). Among the Pleosporales, S. bambusicola has the smallest mt genome at 39030 bp, of which only 3.2% is comprised of the one intron identified. P. nodorum (49761 bp) contains five introns, which make up 13% of the mtDNA , while D. pinodes (55973 bp) contains 14 introns, making up 26% of its mt genome size (NC_029396). Within C. glycines, the 32 identified introns comprised 54% of total mt genome size.
A phylogenetic tree was built with twelve protein-coding genes in common from 25 fungal species (Fig 3). This tree agrees with commonly accepted fungal taxonomy and supports the placement of C. glycines among the Pleosporales and recent reclassification to its own family, the Coniothyriaceae.
Topology shown was inferred with PhyML 3.0 using LG as the evolutionary model. Sequences were obtained from GenBank: Arthroderma otae (NC_012832); Aspergillus niger (NC_007445); Beauveria bassiana (NC_010652); Botryotinia fuckeliana (KC832409); Cladophialophora bantiana (NC_030600); Didymella pinodes (NC_029396): Epichloe typhina (NC_032063); Glarea lozoyensis (KF169905); Hypocrea jecorina (NC_003388); Lecanicillium saksenae (NC_028330); Metarhizium anisopliae (NC_008068); Parastagonospora nodorum (NC_009746); Peltigera dolichorrhiza (NC_031804); Penicillium polonicum (NC_030172); Pseudogymnoascus pannorum (NC_027422); Pyronema omphalodes (NC_029745); Sclerotinia borealis (NC_025200); Shiraia bambusicola (NC_026869); Talaromyces marneffei (NC_005256); Trichophyton rubrum (NC_012824); Verticillium dahliae (NC_008248); Zasmidium cellare (NC_030334); Zymoseptoria tritici (NC_010222); Phialocephala subalpina (NC_015789).
This research provides the first genomic information on the USDA APHIS-listed Plant Pathogen Select Agent C. glycines; data which may provide targets for rapid diagnostic assays and population studies. Additionally, C. glycines represents the second largest mt genome from a member of the Pleosporales sequenced to date. Mitochondrial genome size among fungi varies greatly from the smallest, Rozella allomyces, at 12055 bp  to the largest, Rhizoctonia solani, at 235849 bp . At 98,533 bp, C. glycines is of larger than average size and only 23 other currently available fungal mt genomes are larger. Among the fungi there is no correlation between mtDNA size and gene content.
The gene content of fungal mt genomes is largely conserved. However, it is notable that C. glycines lacked two of the core set of genes typical of fungal mt genomes: atp8 and atp9. These two genes were also absent from the mt genomes of other Pleosporales species . While gene content may be conserved, gene order is not equally conserved and relative gene order varies both between and within major fungal phyla . Alignment of the C. glycines mt genome with other members of the Dothidiomycetes identified a lack of synteny in gene order and gene orientation. However, limited conserved gene blocks were observed. The uninterrupted gene pairs of nad2-nad3 and nad4L-nad5 occurred in all nine Dothidiomycetes species, while the pairing of cox1-cox2 occurred only within all seven Pleosporales species and not the two Capnodiales species. Additionally, nad1-nad4 remain coupled in only three species from the Pleosporales. A conserved gene block nad2-nad3 and nad4L-nad5 was identified among three of the Pleosporales, but within the C. glycines mt genome this block is interrupted by three other genes. However, six of the seven Pleosporales species showed an atp6-rnl-nad6 conserved gene block, which included two large clusters of tRNAs on either side of the rnl in a relatively conserved pattern. Additionally, protein-coding and tRNA genes of C. glycines and the eight other Dothidiomycetes are encoded on both mtDNA strands, while the majority of ascomycetes species examined here have genes encoded on a single DNA strand. The pattern of gene order in mt genomes may provide a road map to trace the evolutionary route of fungal taxonomy. As additional species from the Dothidiomycetes, and the Pleosporales specifically, are analyzed, the additional mt signals will indicate if conserved gene blocks identified to date are characteristic of the Order Pleosporales and further help elucidate fungal taxonomy. Comparative genomics and phylogenetic analysis presented here supports the placement of C. glycines within the Pleosporales and its recent reclassification to its own family, the Coniothyriaceae .
With gene content being largely conserved, the size variation evident among fungal mt genomes is instead attributable to variations in the structure and size of intergenic spacers and the number and size of introns . The larger than average mt genome size of C. glycines was attributed to the relatively high number of introns identified, with 32 introns comprising over half of the total mt genome size. This abundance of introns, most of which possess complete or degenerate HEs, may also provide valuable tools for the evaluation of evolutionary history and intron mobility . While the cox1 gene is considered the most common insertion site for group I introns in fungal mt genomes, the number of introns inserted varies widely from zero in some fungi to the fourteen identified in Podospora anserina . The present study of C. glycines found five of ten cox1 introns, which all possess either complete or truncated HE domains, shared high sequence identity with corresponding introns from the six other Pleosporales species annotated, suggesting common ancestral origin. However, it is notable that none of these five putative HEs occurred in all seven Pleosporales species. The remaining five cox1 introns showed varying degrees of identity with introns from the mt genomes of more distantly related fungi. For example, cox1 intron8 contained a GIY-YIG HE that showed 85% nucleotide identity with an intron from the corresponding location in S. sclerotiorum of the Helotiales and intron5, with its LAGLIDADG HE, shared 71% identity with an intron from Lachancea mirantina, a member of the Saccharomycotina subphylum (S2 Table). The similarity to HEs from more distantly related fungi suggest possible acquisition through horizontal transfer rather than retention from a common ancestor. Additional evidence of horizontal transfer comes from nad1 intron2 and its LAGLIDADG HE which showed no nucleotide identity with introns from other species of the Ascomycota, but rather showed identity with introns from two distantly related members of the Basidiomycota.
The examination of cox1 HEs also revealed evidence of multiple insertion events during the course of evolution. While cox1 intron2 possessed end regions with truncated LAGLIDADG domains and high nucleotide identity to a single orthologous intron from D. pinodes, the central region of this intron, with a truncated GIY-YIG domain, showed only amino acid similarity to an intron from the cob gene of the more distantly-related C. deutercubensis, suggesting the insertion of a new sequence into an already present HE.
It is difficult to determine the precise roles that intron retention, intron acquisition through horizontal transfer, and intron loss have played in constructing the C. glycines mt genome as it has been annotated here. The question remains if some fungal lineages possess a mechanism by which they accumulate and retain HEs while other fungal lineages appear to have lost all introns, and what that mechanism might be. However, this analysis of HEs does suggest that a complex pattern of insertions and horizontal transfers of introns are responsible for the relatively large mt genome size of C. glycines.
S1 Table. Repeat sequences in the Coniothyrium glycines mitochondrial genome.
S2 Table. Sequence similarity betwen mt introns of Coniothyrium glycines and introns of other fungal mitochondrial genomes.
S3 Table. Codon usage in twelve protein-coding mitochondrial genes of Coniothyrium glycines.
The authors wish to thank Melissa Carter for extraction and quality assessment of the DNA used in sequencing.
Mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture.
The U.S. Department of Agriculture (USDA) prohibits discrimination in all its programs and activities on the basis of race, color, national origin, age, disability, and where applicable, sex, marital status, familial status, parental status, religion, sexual orientation, genetic information, political beliefs, reprisal, or because all or a part of an individual’s income is derived from any public assistance program. (Not all prohibited bases apply to all programs.) Persons with disabilities who require alternative means for communication of program information (Braille, large print, audiotape, etc.) should contact USDA’s TARGET Center at (202) 720–2600 (voice and TDD). To file a complaint of discrimination, write to USDA, Director, Office of Civil Rights, 1400 Independence Avenue, SW, Washington, DC 20250–9410 or call (800) 795–3272 (voice) or (202) 720–6382 (TDD). USDA is an equal opportunity provider and employer.
- 1. Hartman GL, Sinclair JB. Dactuliochaeta, a new genus for the fungus causing red leaf blotch of soybeans. Mycologia. 1988;80(5):696–706.
- 2. Punithalingam E. CMI descriptions of fungi and bacteria: Dactuliochaeta glycines (No. 1012). 1990.
- 3. Datnoff LE, Naik DM, Sinclair JB. Effect of red leaf blotch on soybean yields in Zambia. Plant Dis. 1987;71:132–135.
- 4. Hartman GL, Sinclair JB. Red leaf blotch (Dactuliochaeta glycines) of soybeans (Glycine max) and its relationship to yield. Plant Pathol. 1996;45(2):332–343.
- 5. Hartman GL, Sinclair JB. Cultural studies on Dactuliochaeta glycines, the causal agent of red leaf blotch of soybeans. Plant Dis. 1992;76(8):847–852.
- 6. Federal Select Agent Program Animal and Plant Health Inspection Service, Agriculture Select Agent Services, Riverdale, MD 20737. 2014; http://www.selectagents.gov/SelectAgentsandToxinsList.html
- 7. Hartman GL, Haudenshield J, Smith K, Tooley P, Shelton J, Bulluck R, et al. Recovery Plan for Red Leaf Blotch of Soybean caused by Phoma glycinicola. Government Publication/Report. 2009;4–21. Online at: http://www.ars.usda.gov/SP2UserFiles/Place/00000000/opmp/Soybean%20RLB%20FINAL%20July%202009.pdf.
- 8. Stewart RB. An undescribed species of Pyrenochaeta on soybean. Mycologia. 1957;49(1):115–117.
- 9. Leakey CLA. Dactuliophora, a new genus of mycelia sterilia from tropical Africa. Trans Br Mycol Soc. 1964;47(3):341–350, IN349-IN310.
- 10. Datnoff LE, Levy C, Naik DM, Sinclair JB. Dactuliophora glycines, a sclerotial state of Pyrenochaeta glycines. Trans Br Mycol Soc. 1986;87(2):297–301.
- 11. Gruyter J. de and Boerema GH. Contributions towards a monograph of Phoma (Coelomycetes)—VIII. Section Paraphoma: Taxa with setose pycnidia. Persoonia. 2002;17(4):541–561.
- 12. Boerema GH, de Gruyter J, Noordeloos ME, Hamers MEC. Phoma identification manual. Differentiation of specific and infra-specific taxa in culture. CABI Publishing, Wallingford; UK. 2004.
- 13. Gruyter J de, Woudenberg JHC, Aveskamp MM, Verkely GJM, Groenewald JZ, Crous PW. Redisposition of phoma-like anamorphs in Pleosporales. Studies in Mycology. 2013;75(1):1–36. pmid:24014897
- 14. Ballard JWO, Whitlock MC. 2004. The incomplete natural history of mitochondria. Mol Ecol. 13:729–744. pmid:15012752
- 15. Burger G, Gray MW, Lang BF. 2003. Mitochondrial genomes: Anything goes. Trends Genet 19:709–716. pmid:14642752
- 16. Hane JK, Lowe RG, Solomon PS, Tan KC, Schoch CL, Spatafora JW, et al. Dothideomycete plant interactions illuminated by genome sequencing and EST analysis of the wheat pathogen Stagonospora nodorum. Plant Cell. 2007;19(11):3347–68. pmid:18024570
- 17. Quaedvlieg W, Verkley GJ, Shin HD, Barreto RW, Alfenas AC, Swart WJ, et al. Sizing up Septoria. Stud Mycol. 2013;75(1):307–90. pmid:24014902
- 18. Bolger AM, Lohse M, Usadel B. Trimmomatic: A flexible trimmer for Illumina Sequence Data. Bioinformatics. 2014;30(15):2114–20. pmid:24695404
- 19. Koren S, Treangen TJ, Hill CM, Pop M, Phillippy AM. Automated ensemble assembly and validation of microbial genomes. BMC Bioinformatics. 2014;15:126. pmid:24884846
- 20. Andrews S. FastQC: A quality control tool for high throughput sequence data. 2010. http://www.bioinformatics.babraham.ac.uk/projects/fastqc/
- 21. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing. J Comput Biol. 2012;19(5):455–477. pmid:22506599
- 22. Peng Y, Leung HCM, Yiu SM, Chin FYL. IDBA-A Practical Iterative de Bruijn Graph De Novo Assembler. In: Berger B editor. Research in Computational Molecular Biology. Springer-Verlag Berlin Heidelberg; 2010. pp 426–440.
- 23. Chikhi R, Medvedev P. Informed and automated k-mer size selection for genome assembly. Bioinformatics. 2014;30(1):31–37. pmid:23732276
- 24. Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013;29(8):1072–1075. pmid:23422339
- 25. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, et al. Pilon: An Integrated Tool for Comprehensive Microbial Variant Detection and Genome Assembly Improvement. PLoS ONE. 2014;9(11):e112963. pmid:25409509
- 26. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25(16):2078–2079. pmid:19505943
- 27. Beck N, Lang B. 2010. MFannot, organelle genome annotation websever. http://megasun.bch.umontreal.ca/cgi-bin/mfannot/mfannotInterface.pl.
- 28. Gish W, States DJ. 1993. Identification of protein coding regions by database similarity search. Nat Genet. 3:266–272. pmid:8485583
- 29. Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25:955–964. pmid:9023104
- 30. Wyman SK, Jansen RK, Boore JL. Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 2004;20:3252–3255. pmid:15180927
- 31. Laslett D, Canback B. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res. 2004;32:11–16. pmid:14704338
- 32. Lang BF, Laforest MJ, Burger G. Mitochondrial introns: a critical view. Trends in Genetics 2007;23:119–125. pmid:17280737
- 33. Benson G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999;27(2):573–80. pmid:9862982
- 34. Rice P, Longden I, Bleasby A. EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000;16(6):276–7. pmid:10827456
- 36. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32(5):1792–1797 pmid:15034147
- 37. Talavera G, Castresana J. Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst Biol. 2007;56, 564–577. pmid:17654362
- 38. Gouy M, Guindon S, Gascuel O. SeaView version 4: a multiplatform graphical user interface for sequence alignment and phylogenetic tree building. Mol Biol Evol. 2010;27(2):221–224. pmid:19854763
- 39. Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O. New algorithms and methods to estimate maximum-likelihood phylogenies: Assessing the performance of PhyML 3.0. Syst Biol. 2010;59(3):307–21. pmid:20525638
- 40. Shen XY, Li T, Chen S, Fan L, Gao J, Hou CL. Characterization and phylogenetic analysis of the mitochondrial genome of Shiraia bambusicola reveals special features in the order of Pleosporales. PLoS One. 2015;10(3):e0116466. pmid:25790308
- 41. James TY, Pelin A, Bonen L, Ahrendt S, Sain D, Corradi N, et al. Shared signatures of parasitism and phylogenomics unite Cryptomycota and microsporidia. Curr Biol. 2013;23(16):1548–53. pmid:23932404
- 42. Losada L, Pakala SB, Fedorova ND, Joardar V, Shabalina SA, Hostetler J, et al. Mobile elements and mitochondrial genome expansion in the soil fungus and potato pathogen Rhizoctonia solani AG-3. FEMS Microbiol Lett. 2014;352 (2), 165–173. pmid:24461055
- 43. Aguileta G, de Vienne DM, Ross ON, Hood ME, Giraud T, Petit E., et al. High variability of mitochondrial gene order among fungi. Genome Biol Evol. 2014;6(2):451–465. pmid:24504088
- 44. Kouvelis VN, Ghikas DV, Typas MA. The analysis of the complete mitochondrial genome of Lecanicillium muscarium (synonym Verticillium lecanii) suggests a minimum common gene organization in mtDNAs of Sordariomycetes: phylogenetic implications. Fungal Genet Biol. 2004;41:930–940. pmid:15341915
- 45. Stone CL, Posada-Buitrago ML, Boore JL, Frederick RD. Analysis of the complete mitochondrial genome sequences of the soybean rust pathogens Phakopsora pachyrhizi and P. meibomiae. Mycologia. 2010;102(4):887–987. pmid:20648755
- 46. Litter J, Keszthelyi A, Hamari Z, Pfeiffer I, Kucsera J. Differences in mitochondrial genome organization of Cryptococcus neoformans strains. Antonie van Leeuwenhoek. 2005;88:249–255. pmid:16284931
- 47. Wang Y, Zeng F, Hon CC, Zhang Y, Leung FCC. The mitochondrial genome of the Basidiomycete fungus Pleurotus ostreatus (oyster mushroom). FEMS Microbiol Lett. 2008;280:34–41. pmid:18248422
- 48. Formighieri EF, Tiburcio RA, Armas ED, Medrano FJ, Shimo H, Carels N, et al. The mitochondrial genome of the phytopathogenic basidiomycete Moniliophthora perniciosa is 109 kb in size and contains a stable integrated plasmid. Mycol Res. 2008;112:1136–1152. pmid:18786820
- 49. Cusimano N, Zhang LB, Renner SS. Reevaluation of the cox1 group I intron in Araceae and angiosperms indicates a history dominated by loss rather than horizontal transfer. Mol Biol Evol. 2008;25(2): 265–276. pmid:18158323
- 50. Férandon C, Moukha S, Callac P, Benedetto J-P, Castroviejo M, Barroso G. The Agaricus bisporus cox1 gene: The longest mitochondrial gene and the largest reservoir of mitochondrial group I introns. PLoS ONE. 2010;5(11): e14048. pmid:21124976
- 51. Sanchez-Puerta MV, Cho Y, Mower JP, Alverson AJ, Palmer JD. Frequent, phylogenetically local horizontal transfer of the cox1 group I Intron in flowering plant mitochondria. Mol Biol Evol. 2008;25(8):1762–1777. pmid:18524785
- 52. Saguez C, Lecellier G, Koll F. Intronic GIY-YIG endonuclease gene in the mitochondrial genome of Podospora curvicolla: evidence for mobility. Nucleic Acids Res. 2000;28(6):1299–1306. pmid:10684923
- 53. Sethuraman J, Majer A, Friedrich NC, Edgell DR, Hausner G. Genes within Genes: Multiple LAGLIDADG Homing Endonucleases Target the Ribosomal Protein S3 Gene Encoded within an rnl Group I Intron of Ophiostoma and Related Taxa. Mol Biol Evol. 2009;26(10):2299–2315. pmid:19597163
- 54. Mardanov AV, Beletsky AV, Kadnikov VV, Ignatov AN, Ravin NV. The 203 kbp Mitochondrial Genome of the Phytopathogenic Fungus Sclerotinia borealis Reveals Multiple Invasions of Introns and Genomic Duplications. PLoS One. 2014;9(9):e107536. pmid:25216190
- 55. Cummings DJ, McNally KL, Domenico JM, Matsuura ET. The complete DNA sequence of the mitochondrial genome of Podospora anserina. Curr Genet. 1990;17:375. pmid:2357736