Miniature inverted-repeat transposable elements (MITEs) are short, non-autonomous DNA transposons, which are widespread in most eukaryotic genomes. However, genome-wide identification, origin and evolution of MITEs remain largely obscure in microsporidia. In this study, we investigated structural features for de novo identification of MITEs in genomes of silkworm microsporidia Nosema bombycis and Nosema antheraeae, as well as a honeybee microsporidia Nosema ceranae. A total of 1490, 149 and 83 MITE-related sequences from 89, 17 and five families, respectively, were found in the genomes of the above-mentioned species. Species-specific MITEs are predominant in each genome of microsporidian Nosema, with the exception of three MITE families that were shared by N. bombycis and N. antheraeae. One or multiple rounds of amplification occurred for MITEs in N. bombycis after divergence between N. bombycis and the other two species, suggesting that the more abundant families in N. bombycis could be attributed to the recent amplification of new MITEs. Significantly, some MITEs that inserted into the homologous protein-coding region of N. bombycis were recruited as introns, indicating that gene expansion occurred during the evolution of microsporidia. NbS31 and NbS24 had polymorphisms in different geographical strains of N. bombycis, indicating that they could still be active. In addition, several small RNAs in the MITEs in N. bombycis are mainly produced from both ends of the MITEs sequence.
Citation: He Q, Ma Z, Dang X, Xu J, Zhou Z (2015) Identification, Diversity and Evolution of MITEs in the Genomes of Microsporidian Nosema Parasites. PLoS ONE 10(4): e0123170. https://doi.org/10.1371/journal.pone.0123170
Academic Editor: Nicolas Corradi, University of Ottawa, CANADA
Received: September 25, 2014; Accepted: January 27, 2015; Published: April 21, 2015
Copyright: © 2015 He et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Data Availability: Relevant data are within the paper and its Supporting Information files.
Funding: This study is supported by National High-tech R&D Program (863 Program, No.2013AA102507), National Natural Science Foundation of China (No. 31001037), Natural Science Foundation Project of Chongqing Science&technology Commission (No.cstc2012jjA80006 and No.cstc2014jcyjA80039), Key Project of Ministry of Education of China (No.210180), Science and Technology Project of Chongqing Municipal Education Commission (No.KJ110611, KJ10627 and KJ090813). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Miniature inverted-repeat transposable elements (MITEs) were originally discovered in maize genome, and subsequently found in most eukaryotic genomes, including rice, Arabidopsis, mosquitoes, silkworm and humans [1–5]. MITEs are non-autonomous DNA transposons, and have many common characteristics: terminal inverted repeat (TIR) and target site duplication (TSD); high AT content; high copies. Despite their inability to encode transposase, MITEs can transpose using transposase from other autonomous DNA transposons [6–10]. MITEs play an important role in gene expression and genome evolution [2, 3, 11–14]. Previous study showed that rice mPing elements provide new binding sites for transcription factors or other regulatory proteins to significantly increase the expression level of the gene near mPing insertion . Abundant copies of MITEs usually affect genomic size, and diversify genotype polymorphism loci in the genome [15, 16]. MITE-derived small RNAs can regulate specific target genes at the transcriptional and post-transcriptional levels [17, 18]. Transposition of MITEs has been demonstrated in many species, including mPing and mGing in rice, MADE1 in human cell culture, mimp1 in fungus and dTstu1 in potato [9, 19–22]. Therefore, MITEs were considered as sources of genetic material to transfer heterogeneous gene. For instance, rice Stowaway element has been used as a genetic engineering tool to deliver cargo genes into the yeast genome .
Microsporidia are obligate intracellular parasitic fungi that can infect a wide range of organisms, including vertebrates and invertebrates (particularly insects). Some species can cause severe diarrhea, encephalitis and hepatitis in Acquired Immunodeficiency Syndrome (AIDS) patients infected with microsporidia [24, 25]. Nosema bombycis, the earliest identified species of microsporidia, is a fungal pathogen causing pebrine disease, which can lead to devastating economic losses in the silkworm industry. N. bombycis contains large amounts of repetitive elements, which comprise about 25% of the whole genome . Nosema antheraeae is an obligatory parasite to free-range silkworms Antheraea pernyi, and is closely related to N. bombycis [26, 27]. Nosema ceranae is a common pathogen of western honeybee (Apis mellifera), and was thought to be relevant to colony collapse disorder . Since the full-length transposons had been firstly reported in microsporidia N. bombycis , a large amount and various types of transposons have been identified continuously in microsporidia genomes based on high-throughput sequencing technique. Up to now, there were at least seventeen microsporidia species genomes have been known to contain transposons, including the most insect-pathogenic species, Nematoda-pathogenic species, mariner-pathogenic species and so on [26, 30–37]. Transposons have become important components facilitated by host-parasite interaction in microsporidia. Several studies have indicated that HGT from metazoan host should be the main origin of several transposons in microsporidian genomics, such as piggyBac family NbPB1, Helitron family NbLep1 and Mariner/Tc1 superfamilies [26, 33, 38]. Although many types of transposons are reported in various microsporidia species mentioned above, studies on MITEs were seldom performed at the whole genome level. The distribution, origin and evolutionary history of MITEs among these related Nosema species remain unknown. A recent report showed that in N. bombycis genome, MITEs have destroyed the copy of small ribosomal subunits but are still transcribed together with the latter , indicative of penitential association with gene structure variation.
In this study, genome-wide identification of MITEs was performed on the whole genomes of microsporidian N. bombycis and N. antheraeae, as well as their distantly related species, N. ceranae. Classification and polymorphism of MITEs and their roles in genomic evolution are discussed, which could contribute to further understanding of MITE function in microsporidia.
Materials and Methods
Data and material resources
Genomic sequences and gene annotation information of N. bombycis CQ1 and N. antheraeae YY were freely downloaded from public Silkworm Pathogen Database (SilkPathDB, http://silkpathdb.swu.edu.cn/silkpathdb/ftpserver), or can be downloaded from Genbank (Accession no. ACJZ01000001-ACJZ01003558) and S1–S2 File. Information on N. ceranae genome was obtained from Genbank accession no.ACOL01000001-ACOL01005465. Considering the genes annotation of N. bombycis and N. antheraeae might exist a potential over-prediction of small genes, re-annotating the genes in N. antheraeae and N. bombycis were fulfilled referring to methods described in previous study . Base on this method that taking advantage of transcriptional signals close to translation initiation sites (TIS) in 5’ upstream of AUG initiation codons, the re-annotating procedure for N. bombycis and N. antheraeae was executed as following: CCC-like or GGG-like motifs in the 30 nts upstream and downstream of AUG initiation codons from previous annotation of CDSs were searched to revise the start codons, and then AT content >80% was supplemented as an additional criterion to ensure this revision . Finally, there were 359 over-prediction genes in N. bombycis and 202 over-prediction genes in N. antheraeae have been rectified and then used in this study. The information of re-annotation genes for N. bombycis and N. antheraeae were both listed in S1 Table.
Four strains of N. bombycis involved in this study were kindly provided by Dr Pan Guoqing (Southwest University, Chongqing Province, China), Boling Yue, Sericulture Research Institute, Guangxi Province, China), Dr. Qiong yang (Sericulture & Agri-food Research institute GAAS, Guangdong Province, China), Shiliang Chen (Sericultural and Apicultural Institute, Yunnan Province, China), which were labeled as strains CQ1, GX, GD and YN, respectively. All four strains were isolated from the local silkworm and reserved in their own labs mentioned above.
Identification of MITEs from microsporidian Nosema
Based on the structure of MITEs, MITE-Hunter was used to identify MITE candidates from N. bombycis, N. antheraeae and N. ceranae genomes . MITE-Hunter parameters were set as follows: TIR: 5-60bp; TSD: 2-20bp; length <1500bp; TIR mismatch <2. After filtering sequences of candidate MITEs that contain <3 N and TSD mismatches, those elements that shared ≧90% sequence similarity by blast-all were considered to belong to the same MITE family. The families containing less than three members were discarded from this study.
To mine all the remnant copies of each MITE family, the conserved sequences from each family were used as the query sequence to scan each genome sequence by homology BLAST search. The false results were filtered out based on the criteria of nucleotide identity rate <90% and query coverage <80%. All MITE families were then searched against the RepBase (version 18.05)  and Genbank database, respectively, to be classified as known and unknown families. The MITE families were also assigned into superfamilies based on their similarities of TIR and TSD sequences. Each MITE family identified here was designated as NxY#, where x is a letter representing the microsporidian species, Y is a letter representing a superfamily and # is a number representing the family. Letter S stands for Stowaway-like, T for Tourist-like, h for hAT-like, Me for Merlin-like, Mu for Mutator-like, and N for others.
Sequence analysis and phylogenetic construction
The copy elements of each MITE family were aligned using MUSCLE , and neighbor-joining trees were constructed using MEGA5 . Likewise, the nucleotide divergence for each MITE family was calculated using MEGA with the Kimura2-parameter model. The expected hairpin structures for representative intact MITEs were predicted using the mfold web server (http://mfold.rna.albany.edu/). To find small RNAs derived from the MITE transposons, the elements from each MITE family were used as the query sequence to scan the N. bombycis small RNA database by Bowtie2.0, with mismatch <2 . These MITE-derived small RNAs were mapped to the canonical MITE sequences to evaluate their derived position in MITEs.
Analysis of presence/absence of MITE polymorphisms
The presence/absence of a MITE in a particular region can produce polymorphism between the relatives, so the different isolates from the same species can be distinguished by MITE insertion polymorphism (MIP) analysis . Accordingly, the presence/absence of polymorphism due to specific MITEs in four different geographical strains of N. bombycis was explored by MIP analysis. Firstly, orthologous genes between N. bombycis and N. antheraeae were acquired by reciprocal best-hits BLAST. Then, these orthologous genes in a collinear gene order were considered as syntenic regions in respective species. Subsequently, these shared syntenic regions were used to investigate whether there were the shared syntenic regions harboring MITE elements in the N. bombycis, but harboring no MITE elements in the N. antheraeae. Ultimately, eighteen shared syntenic regions where MITEs had inserted in the 5’-flanking regions of orthologous genes from N. bombycis but not inserted in the same region from N. antheraeae were selected as candidates to execute the subsequent PCR analysis (S1 Fig). The primer pairs used for PCR amplification were designed according to these candidate regions harboring MITEs and homologous genes. Primer information and expected product size, generated by Primer 5.0, are listed in S2 Table. Consequently, the molecular size of PCR products for several different geographical isolates of N. bombycis can be judged for the presence/absence of polymorphism due to specific MITEs in those syntenic regions mentioned above. The genomic DNA of the spores was extracted using the CTAB (cetyltrimethylammonium bromide) method. PCR was performed as follows: 5 min at 94 oC, 35 cycles of 95 oC for 40 sec, 53–59 oC (dependent on the primers) for 40 sec, and 72 oC for 1 min, with a final 5-min extension at 72 oC. The PCR products were separated on 1% agarose gels, stained with ethidium bromide and visualized on a UV trans-illuminator.
Small RNA library preparation and sequencing
Silk glands and whole bodies of day-3 fifth instars larvae infected with N. bombycis were collected for RNA isolation. Following purification, total RNA samples were instantly preserved in ethanol and stored at -80 oC until further use. For deep sequencing, the small RNA samples were prepared as follows: total RNA of each sample was size fractionated on a 15% PAGE gel, and small RNAs around 16–30 nt in length was collected. The 5’RNA adapter (5’-GUUCAGAGUUCUACAGUCCGACGAUC-3’) was ligated to the RNA pool with T4 RNA ligase. Ligated RNA was size fractionated on a 15% agarose gel, and a 40–60 nt fraction excised. The 3’RNA adapter (5’-pUCGUAUGCCGUCUUCUGCUUGidT-3’; p, phosphate; idT, inverted deoxythymidine) was subsequently ligated to precipitated RNA using T4 RNA ligase. Ligated RNA was size-fractionated on a 10% agarose gel, and the 70–90 nt fraction (small RNA + adaptors) excised. Small RNAs ligated with adaptors were subjected to RT-PCR (Superscript II reverse transcriptase, 15 cycles of amplification) to produce sequencing libraries. PCR products were purified and small RNA libraries were sequenced using Solexa, a massively parallel sequencing technology.
Identification of MITEs in three Nosema genomes
Based on the common structure of MITEs, genome-wide search using the MITE-Hunter program yielded 409, 69 and 31 candidate MITE families from N. bombycis, N. antheraeae and N. ceranae, respectively. After filtering out false-positive results (see Materials and Methods), 89, 17 and five families of MITEs were finally included. The length of canonical elements (defined as full-length elements with a TSD and ITR) varied among different families, with TSD length range from 2 to 10 bp and TIR length range from 5 to 34 bp (Table 1 and S3 Table). Based on TIR and TSD sequences, some families identified in this study were classified as known superfamilies including Stowaway-like, Tourist-like, hAT-like, Merlin-like and Mutator-like, while others were classified as new unknown superfamilies. The characteristics of each superfamily are shown in Table 1. Notably, among the new families in N. bombycis, N. antheraeae and N. ceranae, there was a common family whose TSD was 5-6bp in length and TIR sequence started with TGT (S2 Fig), which is similar to MiS5 and MiS22 of Solanaceae .
After scanning the three Nosema genome sequences, copy numbers of each MITE family were estimated. The genomes of N. bombycis, N. antheraeae and N. ceranae harbor 1490, 149 and 83 MITE-related sequences, constituting 3.5%, 0.51% and 0.15% of the total genome, respectively (Table 1).The information of all families’ sequences of MITEs and its distribution in each of three Nosema genomes can be seen in S4 Table and S3–S5 File. AT content was 47–79% in the sequence of each MITE family, which is common for MITEs. Nbh4 was the most abundant in N. bombycis, with 122 copies in the genome including 112 intact copies (S3 Table). NbT2, NbT8 and NbS2 in N. bombycis share high sequence homology with NaT8, NaT6 and NaS2 in N. antheraeae, respectively (S3 Fig), indicating that NbT2 and NaT8, or NbT8 and NaT6, or NbS2 and NaS2 must have originated from a common ancestor before the divergence between N. bombycis and N. antheraeae. However, no MITE family in N. ceranae shared homology with any MITE family from N. bombycis or N. antheraeae.
Some MITE families in N. bombycis have undergone multiple rounds of rapid amplifications
To further investigate the mechanisms of MITEs amplifications in N. bombycis, sequence diversity was analyzed for those elements containing more than 50 copies, including Nbh1, Nbh2, Nbh4, NbN5 and NbS24.The histograms for these five MITE families have peaks at different levels of genetic distance, ranging from 0 to 0.26 (Fig 1A), suggesting that the amplification bursts occurred at distinct time points. Notably, for Nbh2, dominant amount of the copies distributed in two areas of genetic distance (two peaks, Fig 1A) and phylogenetic trees constructed for Nbh2 copies were also grouped into two well-supported clades (Fig 1B), indicating that Nbh2 must have experienced two rounds of rapid amplifications in evolutionary history. Nbh4 has 122 copies in the genome, including 112 intact copies and 10 fragment copies. The distributions of genetic distance for Nbh4 members show that they have at least three peaks (Fig 1A). Notably, the 112 intact copies of Nbh4 can be divided into 24 groups based on 100% nucleic acid sequence identity (S4 Fig). For instance, the largest group consisted of 38 identical elements, while the second group was made up of 29 identical elements. These results suggested that Nbh4 must have recently experienced multiple rounds amplification.
(A) The distribution profile of genetic distance for copies of MITE families. X-axis represents the interval of genetic distance; Y-axis represents the numbers of any pairwise copies. (B) The tree diagram for the copies of Nbh2 families with neighbor-joining (NJ) method.
Distribution of MITEs in genomes of three Nosema species
The distribution of MITEs in genomes was surveyed to examine whether there is any bias for MITEs to associate with genes. If a MITE inserts into within the 300bp upstream and downstream flanking regions of coding sequences, this MITE was regarded as associate with gene regions. We found that 189 (12.7%), 61 (40.9%), 24 (28.9%) of predicted MITEs inserted into gene regions in N. bombycis, N. antheraeae and N. ceranae, respectively. To determine whether insertions of MITEs associated with gene regions were incidental, a computer simulation was performed . However, after a computer simulation was performed as a negative control, chi-square tests between samples and control suggested that the variations were not statistically significant (S5 Table), implying that MITEs of Nosema species do not preferentially insert into gene regions. When a MITE insertion into within respective 200, 100bp, 0bp upstream and downstream flanking regions of a CDS was assumed to be associate with genes, the patterns of MITE insertion in the genome were also similar to that a MITE within 300bp flanking regions of a CDS (S5 Table). Therefore, all these results suggested that there was no insertion bias of MITE toward gene regions.
To explore the effects of MITEs insertions on structures of protein-coding sequences in N. bombycis, 21 protein-coding sequences associated with MITEs insertions were targeted to compare the impact of MITEs on gene structure in N. bombycis with their corresponding homologous genes in N. antheraeae (Fig 2 and S6 Table). Among these 21 genes, 13 were single-copy genes and eight were one copy of duplicated genes. By BLAT searching of EST database for N. bombycis, seven single copy genes inserted with MITEs were transcribed except for MITE fragments (Fig 2), indicating that these MITEs were recruited as introns.
(A) A dot-plot sequencing comparison of MITE-inserted gene NBO_283gi001 with its expressed sequences tag. (B) A dot-plot sequencing comparison of MITE-inserted gene NBO_6gi004 with its expressed sequences tag. The structural comparison of targeted gene and its homologous genes from N. antheraeae, N. ceranae and Encephalitozoon cuniculi are presented as dot plots, respectively.
Presence/absence of MITEs polymorphisms in different N. bombycis geographical strains
Eighteen genomic fragments inserted with certain MITEs in the genome of CQ1 strain were PCR-amplified to detect the presence/absence of MITE polymorphisms by using three other N. bombycis geographical strains as DNA templates. Since segmental duplication had occurred in the genome of N. bombycis CQ1 strain , two distinct PCR-amplified bands whose size difference is similar to the length of NbS31 were visualized (Fig 3A), which is in accordance with previous genomic sequence analysis. As a result, the PCR-amplified DNA fragment containing no NbS31 elements was common in all four strains, while the other DNA fragment harboring the NbS31 element was only present in CQ1 and GX strains (Fig 3A). This result showed the existence of an interstrain presence/absence of polymorphism of NbS31 in N. bombycis. Likewise, the targeted genomic region containing NbS24 in CQ1 isolate was PCR-amplified for CQ1, GX and GD strains. PCR-amplified DNA fragment containing NbS24 elements was only present in CQ1 and GX, while DNA fragment containing no NbS24 elements was present in GD (Fig 3B). Moreover, the size difference in these two PCR-amplified fragments was similar to the length of NbS24. Therefore, NbS24 of N. bombycis also has interstrain presence/absence of polymorphism.
(A) The product of NbS3-inserted PCR amplification in four strains, SD1 and SD2 represent one pair of segmental duplication in N. bombycis. (B) The product of NbS24-inserted PCR amplification in three strains and the illustration. M, DNA marker, CQ1: Chongqing isolate, YN: Yunnan isolate, GD: Guangdong isolate, GX: Guangxi isolate. Black arrowhead: MITE-inserted PCR-amplified products. Gray arrowheads: PCR-amplified product without MITEs insertion. Rectangular arrowheads: genes in the syntenic region among several N. bombycis isolates and N. antheraeae. Same color rectangles correspond to homologous genes.
MITE-derived small RNAs
We obtained raw data by sequencing small RNA pools of Silk glands and whole bodies of day-3 fifth instar larve infected with N. bombycis and filtered the low quality reads, using the sequencing Solexa technology. Finally, a total number of 3,844,786 raw sequence reads were obtained. After filtration, 1,418,249 (36.89%) high-quality reads were remained and 6661,609 reads were matched to N. bombycis genomic sequence by Bowtie 2.0 software . MITE-derived small RNAs were identified in N. bombycis by matching all the small RNA tags to the canonical MITEs with Bowtie alignment. About 2,589 non-redundant reads were matched by each MITE family conservative sequence. These MITE-derived small RNAs range from 18–29 nt in length, among which 24–25 nt small RNAs were dominant (Fig 4A). The regions of MITE sequences that can generate small RNAs were also investigated. The result showed that small RNAs with high sequence coverage extended throughout the MITEs, and were mainly derived from two terminals of MITEs (Fig 4B).
(A) Length distribution of small RNAs generated by MITE sequences. (B) Density (sense, black; antisense, red) of small RNA tags assigned to MITE sequences. Frequency is shown along the Y-axis. Relative nucleotide position within the consensus sequence is indicated along the X-axis.
In this study, we performed a systematic and genome-wide analysis to search for MITEs in N. bombycis, N. antheraeae and N. ceranae, using an efficient program, MITE-hunter . MITE-hunter, which can identify repetitive sequences with TIR and TSD structures, is an excellent, successful tool for de novo identification of MITEs with low false-positive rates as compared to other tools. After removing pseudo-MITEs by a strict filtering process, we identified 89, 17 and five MITE families, respectively. A previous report has shown that N. bombycis experienced genomic expansion, which was mainly attributed to segmental duplication and a large number of transposons . There were many more MITE families in N. bombycis as compared to N. antheraeae or N. ceranae, which indicated that proliferation of MITEs contributed to genomic expansion in N. bombycis.
While only three canonical families in N. antheraeae genome, NaT8, NaT6 and NaS2, were found to share homology with those in N. bombycis genome, remnants of eight MITE families of N. bombycis also exist in N. antheraeae. These residual elements indicated that many old MITE families of N. antheraeae might have lost their activity and cannot amplify, along with accumulation of mutations and deficiency. In contrast, some MITEs in the N. bombycis genome recently experienced multiple rounds of amplification, and others were newly inserted after divergence of N. bombycis. Hence, recent insertion and amplification of MITEs must be responsible for the much larger number of MITE families in N. bombycis. Several MITEs that had recently inserted into the protein-coding genes of N. bombycis were recruited as introns, suggesting that introns were more generally present in different genes of microsporidia parasite, rather than in ribosomal genes, as previously reported . In addition, although there was no conclusive evidence to show if certain MITEs were still active in N. bombycis, two MITEs had presence/absence of polymorphisms in several N. bombycis geographical strains, and their activity could be further explored.
The link between MITEs distribution and spore traits of the three Nosema species still remained unclear. Recent study reported that in domestic silkworm Bombyx mori, a partial transposon named as Taguchi had inserted in the cis-regulatory region of ecdysone oxidase (EO) gene and therein improve its transcribed level, which lead to the more stable developmental phenotype than that silkworm without the transposon insertion when suffering from food shortage [46, 47]. So, the potential function of the Nosema genes that have been targeted by MITE insertion are worth in studying further.
Although it was unclear which factors had driven the amplification of MITEs, "genome shock" theory was proposed to explain the amplification of transposons: vast majority of transposons are not active in the evolutionary history, only external stimuli or catalysis through transposase encoded by homologous autonomous transposon will lead to amplification of MITEs [9, 48]. Studies using yeast expression system have shown that non-MITE ancestral DNA transposons can also catalyze transposition of MITEs . These studies prompted us to analyze the amplification of MITEs in N. bombycis, since there are many families in this species. Surprisingly, an autonomous hAT-like element encoding an intact transposase was found in N. bombycis, and its TIR sequence is highly homologous to Nbh4 element, which is the most abundant element in N. bombycis. So, it can be hypothesized that the recent amplification of Nbh4 in N. bombycis could have been catalyzed by a hAT-like transposase.
Many studies have shown that organisms have a complete mechanism to reduce transposon activity, such as DNA methylation . Recent studies on plant transposons have shown that a large number of small RNAs originated from MITEs, and could potentially regulate the activity of MITEs [17, 50]. In this study, the MITE-derived small RNAs in N. bombycis genome were predominantly 24 and 25nt long. Moreover, these RNAs were mainly generated in the position of two-terminal sequence, which was similar to the formation of miRNAs. It would be interesting to examine if these MITE-derived small RNAs in N. bombycis were generated by a pathway similar to miRNA or siRNA biogenesis. Likewise, the effect of MITE-derived small RNAs on transposons or genes of N. bombycis deserves further study.
S1 Fig. The selected genomic fragments used for evaluate presence/absence of MITEs polymorphisms in N. bombycis.
The positions of designed primers were marked as pair of arrows.
S2 Fig. Multiple sequence alignment of MITE families whose TSD length is 5-6bp and TIR begin with TGT in N. bombycis, N. antheraeae and N. ceranae.
MITE families of MiS5 and MiS22 have been identified in Solanaceae.
S3 Fig. (A)Structure and homologous regions of three MITEs identified in N. bombycis and N. antheraeae.
Grey rectangle is TSD, black triangles are TIRs, and white rectangles are homologous regions of each transposon in both species. The corresponding names and percentages of identity are shown on the left.
S4 Fig. Sequence alignment of representative Nbh4 MITE sequences.
The copy numbers of identical elements are shown on the left of each sequence. Red arrowheads are TIR.
S1 File. The genome sequences of N. antheraeae.
S2 File. The gene annotation information of N. antheraeae.
S3 File. All the MITE copies sequences identified in N. bombycis genome.
S4 File. All the MITE copies sequences identified in N. antheraeae genome.
S5 File. All the MITE copies sequences identified in N. ceranae genome.
S1 Table. List of the re-annotation genes information in N. bombycis and N. antheraeae.
S2 Table. Primers designed for analyzing presence/absence polymorphisms of MITEs based on N. bombycis CQ1 genomic sequences.
S3 Table. Characteristics of all MITE families in three genomes of N. bombycis, N. antheraeae and N. ceranae.
S4 Table. Distribution information of MITEs in each of three genomes of microsporidian Nosema.
S5 Table. Chi-square test of biased insertion of MITE associated with gene regions in N. bombycis, N. antheraeae and N. ceranae genomes.
Conceived and designed the experiments: JSX ZYZ. Performed the experiments: QH XQD ZGM. Analyzed the data: QH JSX. Contributed reagents/materials/analysis tools: QH. Wrote the paper: JSX. Collected literature and software tools: QH JSX.
- 1. Han MJ, Shen YH, Gao YH, Chen LY, Xiang ZH, Zhang Z. Burst expansion, distribution and diversification of MITEs in the silkworm genome. BMC genomics. 2010;11:520. pmid:20875122
- 2. Oki N, Yano K, Okumoto Y, Tsukiyama T, Teraishi M, Tanisaka T. A genome-wide view of miniature inverted-repeat transposable elements (MITEs) in rice, Oryza sativa ssp. japonica. Genes & genetic systems. 2008;83(4).
- 3. Santiago N, Herráiz C, Goñi JR, Messeguer X, Casacuberta JM. Genome-wide analysis of the Emigrant family of MITEs of Arabidopsis thaliana. Molecular biology and evolution. 2002;19(12):2285–93. pmid:12446819
- 4. Tu Z. Eight novel families of miniature inverted repeat transposable elements in the African malaria mosquito, Anopheles gambiae. Proceedings of the National Academy of Sciences of the United States of America. 2001;98(4):1699–704. pmid:11172014
- 5. Bureau TE, Wessler SR. Tourist: a large family of small inverted repeat elements frequently associated with maize genes. The Plant Cell Online. 1992;4(10):1283–94. pmid:1332797
- 6. Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, et al. Initial sequencing and analysis of the human genome. Nature. 2001;409(6822):860–921. pmid:11237011
- 7. Yaakov B, Ceylan E, Domb K, Kashkush K. Marker utility of miniature inverted-repeat transposable elements for wheat biodiversity and evolution. TAG Theoretical and applied genetics Theoretische und angewandte Genetik. 2012;124(7):1365–73. pmid:22286503
- 8. Yang G, Nagel DH, Feschotte C, Hancock CN, Wessler SR. Tuned for transposition: molecular determinants underlying the hyperactivity of a Stowaway MITE. Science. 2009;325(5946):1391–4. pmid:19745152
- 9. Jiang N, Bao Z, Zhang X, Hirochika H, Eddy SR, McCouch SR, et al. An active DNA transposon family in rice. Nature. 2003;421(6919):163–7. pmid:12520302
- 10. Feschotte C, Jiang N, Wessler SR. Plant transposable elements: where genetics meets genomics. Nature Reviews Genetics. 2002;3(5):329–41. pmid:11988759
- 11. Sampath P, Lee SC, Lee J, Izzah NK, Choi BS, Jin M, et al. Characterization of a new high copy Stowaway family MITE, BRAMI-1 in Brassica genome. BMC plant biology. 2013;13:56. pmid:23547712
- 12. Naito K, Zhang F, Tsukiyama T, Saito H, Hancock CN, Richardson AO, et al. Unexpected consequences of a sudden and massive transposon amplification on rice gene expression. Nature. 2009;461(7267):1130–4. pmid:19847266
- 13. El Amrani A, Marie L, Ainouche A, Nicolas J, Couee I. Genome-wide distribution and potential regulatory functions of AtATE, a novel family of miniature inverted-repeat transposable elements in Arabidopsis thaliana. Molecular genetics and genomics: MGG. 2002;267(4):459–71. pmid:12111553
- 14. Yang G, Lee YH, Jiang Y, Shi X, Kertbundit S, Hall TC. A two-edged role for the transposable element Kiddo in the rice ubiquitin2 promoter. The Plant cell. 2005;17(5):1559–68. pmid:15805485
- 15. Sampath P, Murukarthick J, Izzah NK, Lee J, Choi HI, Shirasawa K, et al. Genome-wide comparative analysis of 20 miniature inverted-repeat transposable element families in Brassica rapa and B. oleracea. PloS one. 2014;9(4):e94499. pmid:24747717
- 16. Lyons M, Cardle L, Rostoks N, Waugh R, Flavell AJ. Isolation, analysis and marker utility of novel miniature inverted repeat transposable elements from the barley genome. Molecular Genetics and Genomics. 2008;280(4):275–85. pmid:18612649
- 17. Kuang H, Padmanabhan C, Li F, Kamei A, Bhaskar PB, Ouyang S, et al. Identification of miniature inverted-repeat transposable elements (MITEs) and biogenesis of their siRNAs in the Solanaceae: new functional implications for MITEs. Genome research. 2009;19(1):42–56. pmid:19037014
- 18. Cai Y, Zhou Q, Yu C, Wang X, Hu S, Yu J, et al. Transposable-element associated small RNAs in Bombyx mori genome. PloS one. 2012;7(5):e36599. pmid:22662121
- 19. Dong HT, Zhang L, Zheng KL, Yao HG, Chen J, Yu FC, et al. A Gaijin-like miniature inverted repeat transposable element is mobilized in rice during cell differentiation. BMC genomics. 2012;13:135. pmid:22500940
- 20. Momose M, Abe Y, Ozeki Y. Miniature inverted-repeat transposable elements of Stowaway are active in potato. Genetics. 2010;186(1):59–66. pmid:20610409
- 21. Miskey C, Papp B, Mates L, Sinzelle L, Keller H, Izsvak Z, et al. The ancient mariner sails again: transposition of the human Hsmar1 element by a reconstructed transposase and activities of the SETMAR protein on transposon ends. Molecular and cellular biology. 2007;27(12):4589–600. pmid:17403897
- 22. Dufresne M, Hua-Van A, El Wahab HA, Ben M'Barek S, Vasnier C, Teysset L, et al. Transposition of a fungal miniature inverted-repeat transposable element through the action of a Tc1-like transposase. Genetics. 2007;175(1):441–52. pmid:17179071
- 23. Fattash I, Bhardwaj P, Hui C, Yang G. A rice stowaway MITE for gene transfer in yeast. PloS one. 2013;8(5):e64135. pmid:23704977
- 24. Desportes I, Charpentier YL, Galian A, Bernard F, Cochand-Priollet B, Lavergne A, et al. Occurrence of a New Microsporidan: Enterocytozoon bieneusi ng, n. sp., in the Enterocytes of a Human Patient with AIDS1. Journal of Eukaryotic Microbiology. 1985;32(2):250–4. pmid:4009510
- 25. Snowden K. Zoonotic microsporidia from animals and arthropods with a discussion of human infections. Opportunistic infections: toxoplasma, sarcocystis, and microsporidia: Springer; 2004. p. 123–34.
- 26. Pan G, Xu J, Li T, Xia Q, Liu S- L, Zhang G, et al. Comparative genomics of parasitic silkworm microsporidia reveal an association between genome expansion and host adaptation. BMC genomics. 2013;14(1):186.
- 27. Xu J, Zhou Z. Improving phylogenetic inference of microsporidian Nosema antheraeae among Nosema species with RPB1, α-and β-tubulin sequences. Afr J Biotechnol. 2010;9:7900–4.
- 28. Higes M, Martín R, Meana A. Nosema ceranae, a new microsporidian parasite in honeybees in Europe. Journal of invertebrate pathology. 2006;92(2):93–5. pmid:16574143
- 29. Xu J, Pan G, Fang L, Li J, Tian X, Li T, et al. The varying microsporidian genome: existence of long-terminal repeat retrotransposon in domesticated silkworm parasite Nosema bombycis. International journal for parasitology. 2006;36(9):1049–56. pmid:16797019
- 30. Chen Yp, Pettis J, Zhao Y, Liu X, Tallon L, Sadzewicz L, et al. Genome sequencing and comparative genomics of honey bee microsporidia, Nosema apis reveal novel insights into host-parasite interactions. BMC genomics. 2013;14(1):451.
- 31. Campbell SE, Williams TA, Yousuf A, Soanes DM, Paszkiewicz KH, Williams BA. The genome of Spraguea lophii and the basis of host-microsporidian interactions. PLoS genetics. 2013;9(8):e1003676. pmid:23990793
- 32. Peyretaillade E, Parisot N, Polonais V, Terrat S, Denonfoux J, Dugat-Bony E, et al. Annotation of microsporidian genomes using transcriptional signals. Nature communications. 2012;3:1137. pmid:23072807
- 33. Heinz E, Williams TA, Nakjang S, Noel CJ, Swan DC, Goldberg AV, et al. The genome of the obligate intracellular parasite Trachipleistophora hominis: new insights into microsporidian genome dynamics and reductive evolution. PLoS pathogens. 2012;8(10):e1002979. pmid:23133373
- 34. Cuomo CA, Desjardins CA, Bakowski MA, Goldberg J, Ma AT, Becnel JJ, et al. Microsporidian genome analysis reveals evolutionary strategies for obligate intracellular growth. Genome research. 2012;22(12):2478–88. pmid:22813931
- 35. Corradi N, Haag KL, Pombert JF, Ebert D, Keeling PJ. Draft genome sequence of the Daphnia pathogen Octosporea bayeri: insights into the gene content of a large microsporidian genome and a model for host-parasite interactions. Genome biology. 2009;10(10):R106. pmid:19807911
- 36. Cornman RS, Chen YP, Schatz MC, Street C, Zhao Y, Desany B, et al. Genomic analyses of the microsporidian Nosema ceranae, an emergent pathogen of honey bees. PLoS pathogens. 2009;5(6):e1000466. pmid:19503607
- 37. Williams BA, Lee RC, Becnel JJ, Weiss LM, Fast NM, Keeling PJ. Genome sequence surveys of Brachiola algerae and Edhazardia aedis reveal microsporidia with low gene densities. BMC genomics. 2008;9:200. pmid:18445287
- 38. Guo X, Gao J, Li F, Wang J. Evidence of horizontal transfer of non-autonomous Lep1 Helitrons facilitated by host-parasite interactions. Scientific reports. 2014;4:5119. pmid:24874102
- 39. Liu H, Pan G, Dang X, Li T, Zhou Z. Characterization of active ribosomal RNA harboring MITEs insertion in microsporidian Nosema bombycis genome. Parasitology research. 2013;112(3):1011–20. pmid:23254587
- 40. Han Y, Wessler SR. MITE-Hunter: a program for discovering miniature inverted-repeat transposable elements from genomic sequences. Nucleic acids research. 2010;38(22):e199. pmid:20880995
- 41. Jurka J, Kapitonov VV, Pavlicek A, Klonowski P, Kohany O, Walichiewicz J. Repbase Update, a database of eukaryotic repetitive elements. Cytogenetic and genome research. 2005;110(1–4):462–7. pmid:16093711
- 42. Edgar RC. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC bioinformatics. 2004;5(1):113.
- 43. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Molecular biology and evolution. 2011;28(10):2731–9. pmid:21546353
- 44. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Meth. 2012;9(4):357–9.
- 45. Naito K, Cho E, Yang G, Campbell MA, Yano K, Okumoto Y, et al. Dramatic amplification of a rice transposable element during recent domestication. Proceedings of the National Academy of Sciences. 2006;103(47):17620–5. pmid:17101970
- 46. Sun W, Shen YH, Han MJ, Cao YF, Zhang Z. An Adaptive Transposable Element Insertion in the Regulatory Region of the EO Gene in the Domesticated Silkworm, Bombyx mori. Molecular biology and evolution. 2014;31(12):3302–13. pmid:25213334
- 47. Katinka MD, Duprat S, Cornillot E, Méténier G, Thomarat F, Prensier G, et al. Genome sequence and gene compaction of the eukaryote parasite Encephalitozoon cuniculi. Nature. 2001;414(6862):450–3. pmid:11719806
- 48. McClintock B. The significance of responses of the genome to challenge. Science 1984;16;226(4676)(0036–8075 (Print)):792–801.
- 49. Ngezahayo F, Xu C, Wang H, Jiang L, Pang J, Liu B. Tissue culture-induced transpositional activity of mPing is correlated with cytosine methylation in rice. BMC plant biology. 2009;9:91. pmid:19604382
- 50. Lu C, Chen J, Zhang Y, Hu Q, Su W, Kuang H. Miniature inverted-repeat transposable elements (MITEs) have been accumulated through amplification bursts and play important roles in gene expression and species diversity in Oryza sativa. Molecular biology and evolution. 2012;29(3):1005–17. pmid:22096216