The carmine spider mite (CSM), Tetranychus cinnabarinus, is an important pest mite in agriculture, because it can develop insecticide resistance easily. To gain valuable gene information and molecular basis for the future insecticide resistance study of CSM, the first transcriptome analysis of CSM was conducted. A total of 45,016 contigs and 25,519 unigenes were generated from the de novo transcriptome assembly, and 15,167 unigenes were annotated via BLAST querying against current databases, including nr, SwissProt, the Clusters of Orthologous Groups (COGs), Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO). Aligning the transcript to Tetranychus urticae genome, the 19255 (75.45%) of the transcripts had significant (e-value <10−5) matches to T. urticae DNA genome, 19111 sequences matched to T. urticae proteome with an average protein length coverage of 42.55%. Core Eukaryotic Genes Mapping Approach (CEGMA) analysis identified 435 core eukaryotic genes (CEGs) in the CSM dataset corresponding to 95% coverage. Ten gene categories that relate to insecticide resistance in arthropod were generated from CSM transcriptome, including 53 P450-, 22 GSTs-, 23 CarEs-, 1 AChE-, 7 GluCls-, 9 nAChRs-, 8 GABA receptor-, 1 sodium channel-, 6 ATPase- and 12 Cyt b genes. We developed significant molecular resources for T. cinnabarinus putatively involved in insecticide resistance. The transcriptome assembly analysis will significantly facilitate our study on the mechanism of adapting environmental stress (including insecticide) in CSM at the molecular level, and will be very important for developing new control strategies against this pest mite.
Citation: Xu Z, Zhu W, Liu Y, Liu X, Chen Q, Peng M, et al. (2014) Analysis of Insecticide Resistance-Related Genes of the Carmine Spider Mite Tetranychus cinnabarinus Based on a De Novo Assembled Transcriptome. PLoS ONE 9(5): e94779. https://doi.org/10.1371/journal.pone.0094779
Editor: Mario Soberón, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Mexico
Received: September 19, 2013; Accepted: February 19, 2014; Published: May 15, 2014
Copyright: © 2014 Xu et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This research was funded in part by the Ministry of Education Program for New Century Excellent Talents (NCET-10-0700), the Fundamental Research Funds for the Central Universities (XDJK2013A007), the National Nature Sciences Foundation (31071984, 31272059) and the Special Fund for Agro-scientific Research in the Public Interest (201103020). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Carmine spider mite (CSM), Tetranychus cinnabarinus, also known as cotton red spider, belongs to Class Arachnida, Subclass Acari, Order True Acarina, Family Tetranychidae , . It is one of the most damaging pest mites in agriculture and forestry. The CSM mainly distributes in warm regions of the world and utilizes stylet to suck plant sap, causing mechanical damage to the host tissue , . Serious infestation of CSM might cause leaves to dry off loss water or even die, causing severe economic losses. It parasitize more than one hundred plant species, such as cotton, various vegetables, melons, beans, roses, jujube, Chinese herbal medicine and many other economic crops and ornamental plants, leading to significantly reduced quality and yield , .
CSM and two-spotted spider mite (TSM), Tetranychus urticae Koch, are both widely distributed polyphagous pest mites. Both of them are polymorphic in morphology, and are very similar in external morphologies. In different hosts and different regions, these two species present obvious similarities in external morphology. Therefore, many researchers considered them as two forms (red and green) of a single species (Tetranychus urticae) –. In 1956, Boudreaux  first separated CSM from TSM as an independent species based on experimental results of breeding and morphological characteristics. In 1990, Kuang  further confirmed that CSM and TSM were two entirely different species with complete reproductive isolation by performing a comprehensive comparative study of the two species, focusing on the aspects of hybridization, changes in body color, body size, external morphological features, ultrastructure, physiology and biochemistry, and ecology. So far, many taxonomists still questioning the two species just were red and green forms of T. urticae –. Therefore, the published genomic information of TSM  could not be fully utilized when investigating CSM.
Currently, the control and prevention of CSM mainly depends on spraying chemical insecticides and acaricides. However, due to its characteristics such as small individual size, strong reproductivity, short developmental duration, high inbreeding rate, frequent opportunities of receiving insecticide, strong adaptability and high mutation rate, this species of mite can easily develop insecticide resistance , . Insecticide resistance is a micro-evolutionary phenomenon, and the enhanced resistant capability selected by insecticides is hereditary. At the molecular level, there are two mechanisms underlying the insecticide resistance in arthropods, namely enhanced insecticide metabolism and reduced sensitivity of targets to insecticides . However, the lack of genetic information of CSM limits our ability to understand the mechanisms of insecticide resistance development, preventing us from developing effective resistance management strategies.
The three main targets for commonly used insecticides are ligand-gated ion channels, voltage-gated ion channels and acetylcholinesterase . Currently, the most frequently studied ligand-gated ion channels include the nicotinic acetylcholine receptor (nAChR), gamma-aminobutyric acid (GABA) receptor and glutamate-gated chloride channels (GluCls). The most frequently studied voltage-gated ion channels is sodium channels. In addition, cytochrome b (Cyt b) is a new target, which act as an alternative target of acaricide, bifennazate , . Metabolic resistance is generally associated with enzymes encoded by multiple gene families including cytochrome P450, carboxylesterases (CarEs) and glutathione S-transferase (GSTs).
Prior to this study, the NCBI database only contains 122 nucleotide sequences and 97 amino acids sequences of CSM. These genetic data are not enough to elucidate the mechanisms of insecticide resistance development and gene regulation of CSM from the molecular level. Investigating the gene sequences by traditional biological methods is often time-consuming and inefficient. However, the emergence of high-throughput sequencing technology provides researchers with a fast and low-cost way to obtain genetic data.
Transcriptome is the complete repertoire of mRNAs transcribed by a living cell, i.e. the sum of genetic information transcribed from the genomic DNA. Investigation of transcriptome is an important approach to study phenotypes and functions of cells. In order to obtain the information of transcribed genes of CSM, especially the genes that involved in the development of insecticide resistance, we employed the high-throughput sequencing platform-Illumina HiSeq™ 2000 to complete the whole transcriptome sequencing of the CSM. Based on the transcriptome analysis, several categories of genes that might be involved in metabolic and target resistance were analyzed.
Results and Discussion
De novo Transcriptome Assembly of CSM
The CSM transcriptomes were generated from four life stages (egg, larva, nymph and adult) of CSM via the Illumina sequencing. We then constructed a mass gene database that contains a total of 4,687,231,140 nucleotides (nt) and 54,350,230 reads, each of which was approximately 90 base pair (NCBI: SRA052165). After eliminating low quality reads from the raw reads, there were 52,080,346 clean reads remained, which accounted for 95.82% of the raw reads. All clean reads were assembled, compared and ligated using the short reads assembly software Trinity, so as to get the contigs and unigenes from the CSM transcriptome. These reads were assembled into 25,519 unigenes, which had been submitted to BioProject with accession number PRJNA210716 in the NCBI-database and the basic statistics for the transcriptome dataset were shown in Table 1. The 47.21% of contigs ranged from 100 to 200 nt and 26.86% contigs were more than 500 nt in length (Figure 1-A). The 44.58% of unigenes ranged from 100 to 200 nt, 24.31% ranged from 500 to 1000 nt and 31.10% were more than 1000 nt in length (Figure 1-B).
Sequencing Accuracy of the CSM Transcriptome
Ten complete mRNA sequences of CSM were chosen randomly from the NCBI nucleotide database to evaluate the sequence accuracy of transcriptome assembly. Sequence alignment was performed for 10 chosen full-length mRNA sequences and 23 corresponding annotated unigenes from the transcriptome assembly, in which 95% identity and 80% alignment length were set as thresholds. The average identity between the 10 previously identified CSM cDNA sequences in the NCBI nucleotide database and the 23 assembled sequences identified in the CSM transcriptome was 99.21% (Table 2), which is similar to the level of sequencing accuracy reported by other studies for Illumina technology , . Sequencing error or nucleotide polymorphisms may be responsible for the nucleotide diversity between assembled sequences and the submitted sequences in the NCBI nucleotide database.
Genome Mapping Results to T. Urticae
Since there is no genome sequence available for CSM, unambiguous sequence alignment of the transcripts to a reference Tetranychidae genome could provide additional measures of the transcriptome assembly accuracy and completeness. The assembled sequences were mapped to the T. urticae genomes. Aligning the transcript to T. urticae genome, the 19255 (75.45%) transcripts had significant (e-value <10−5) matches to T. urticae genome database (Table S1). Transcripts that did not map to the T. urticae genome (24.55%), as well as partial alignments may represent mis-assemblies in the transcriptome or the genome, rapidly evolving genes or the rapid evolution of untranslated regions (UTRs) .
As a starting point for transcript analysis, the proportion of the CSM transcriptome that was homologous to a predicted protein sequence in T. urticae genomes was analyzed. Protein similarity to T. urticae proteomes was assessed using BLASTX (e-value threshold of 1.0E-5). A total of 19,111 sequences (74.89%) in our dataset had a significant match with T. urticae (Table S2). To further validate the accuracy of our ortholog assignment, an analysis of the protein length coverage for BLASTX alignments showed a considerable proportion of transcriptome with mostly coverage (average protein length coverage was 42.55%) of their corresponding T. urticae match, with 41% of the orthologs covering more than half of their corresponding T. urticae matches (Figure 2).
In summary, transcript mapping to reference genomes revealed a degree of incompleteness when using T. urticae as our reference. Incomplete CSM transcriptome could underestimate the diversity of protein configurations and thus, may limit protein identification by proteomic approaches in the absence of the genome. Despite the limitations in our dataset, the transcriptome genes information will be useful for experimentalists when designing primers and probes for one gene-targeted analysis, especially when combined use with the T. urticae genome, the researchers will be very convenient and fast obtaining a large number of T. cinnabarinus target genes with partial or full sequences.
To assess the completeness of the transcriptome assembly, the CEGMA (Core Eukaryotic Genes Mapping Approach) software was applied to identify the presence of a core protein set consisting of 458 highly conserved proteins that are found in a wide range of eukaryotes . This software is usually used to assess the completeness of a genome assembly, but should also enable the assessment of a transcriptome under different interpretations . We identified 435 core eukaryotic genes (CEGs) in the CSM dataset corresponding to 95% coverage (Table S3), which is slightly lower than T. urticae genome (448 of 458 CEGs, 98%). Considering the transcriptome for CSM showed a higher coverage than that for Anopheles albimanus (showed 90% coverage) and the coverage ranged from 95–98% in the other genome sequenced species , , we could say that the quality of transcriptome assembly for CSM is considerably good.
Unigene Functional Annotation by Nr, GO, COG, and KEGG
A total of 14,589 unigenes (57.17%) from the CSM transcriptome returned an above cut-off blast hit to the NCBI non-redundant protein database. The species distribution of the top blastx hits for each unique sequence was shown in Figure 3. The unambiguous assembled sequences revealed that the greatest number of matches (36.15%) was from T. urticae, followed by Panonychus citri (13.97%), Blomia tropicalis (11.75%), and Dermatophagoides pteronyssinus (10.42%).
The species distribution is shown as a percentage of the total homologous sequences in the NCBI Nr protein database with an E-value <10−5.
Based on the CSM transcriptome assembly, 2,447 (16.13%) sequences were annotated in the GO database, which were divided into a total of 47 groups in three ontology categories (molecular function, cellular component, biological process). The “molecular function” ontology category contains 26 groups. The largest group is “cellular process” with 1,054 unigenes, and the smallest group is “cell killing” with only one unigene (Figure 4-A). The “cellular component” ontology category contains 12 groups. The largest group is “cell” with 1,435 unigenes, and the smallest groups are “virion” and “synapse part”, with only two unigenes in each group (Figure 4-B). The “biological process” ontology category contains 9 groups. The largest group is “catalytic activity” with 876 unigenes and the smallest group is “antioxidant activity” with only one unigene (Figure 4-C).
A: Biological process B: Cellular component C: Molecular function.
In order to annotate the detail function of genes, COG database was used. In total, 6,558 unigenes (43.24%) were annotated and these genes were divided into 25 categories. A total of 2,834 unigenes, which was approximately half of the 6,558 unigenes, were placed into the “General function prediction only” category. Followed by “Carbohydrate transport and metabolism” (1,703, 25.97%), “Transcription” (1,504, 22.93%), and “Translation, ribosomal structure and biogenesis” (1,129, 17.22%). The smallest one is “Nuclear structure” with only four unigenes, accounting for only 0.06% of the functionally annotated unigenes (Figure 5).
To identify the genes that involved in metabolic pathways, a total of 11,545 unigenes (45.24%) were mapped to the KEGG pathway database . These unigenes were divided into 241 pathways. The largest pathway is “Metabolic pathways” with 1,529 unigenes, accounting for 13.24% of the unigenes with functional annotation, followed by “Pathways in cancer” (439, 3.80%) and “Lysosome” (400, 3.46%) (Table S4). The smallest pathways are “Allograft rejection”, “D-Arginine and D-ornithine metabolism” and “Graft-versus-host disease” with only one unigene in each group, only accounting for 0.01% of the unigenes with functional annotation.
Identification of Genes Related to Insecticide Resistance
Based on the results of nr blastx, the unigenes that possibly involved in insecticide resistance development in the CSM transcriptome assembly were selected manually. After eliminating redundant and shorter sequences, we identified 3 categories of genes that associated with metabolic resistance (Cytochrome P450, CarEs and GSTs) and 7 categories of genes that associated with target resistance (GABA receptor, AChE, sodium channel, GluCls, ATPase, nAChRs and Cytb) (Table 3). The genes in these categories have been proved involved in insecticide resistance development in arthropod.
Three Categories of Genes Mediating Metabolic Resistance
The cytochrome P450 family is a major family of enzymes functioning in detoxification and metabolism . Because of the genetic diversity, broad substrate specificity, and catalytic versatility, P450s can mediate resistance to all classes of insecticides . In present study, a total of 81 CSM P450 transcripts were identified in the dataset with an average length of 803 bp, and 53 unigenes were found from the 81 transcripts, which were further examined to confirm that each was respectively aligned to a certain T. urticae P450 protein sequence (Table S5). The reasons why the numbers of P450 genes are approximations in so far genome-sequenced species were provided by Feyereisen with details . The approximate numbers for CSM P450 genes might be 80 between 90 with estimation, according to that in its sibling species T. urticae whose genome were sequenced is 86 (Table 4), and from this sense the probability that the present 81 transcripts obtained from CSM transcriptome represented 81 P450 genes could not be excluded. The number of P450 genes in arthropod varies widely (e.g., 36 in the human body louse Pediculus humanus to 160 in the Egyptian mosquito Aedes aegypti, Table 4), but so far all the CYP genes can be assigned to one of four clans: CYP2, CYP3, CYP4 and the mitochondrial CYP clan (CYP M) . The mitochondrial clan in vertebrates is involved in essential physiological functions, such as metabolize sterols, steroids or secosteroids (vitamin D), but that in insect is involved in xenobiotic metabolism . CYP2 clade in insect includes P450s with essential physiological functions, e.g. ecdysone metabolism and juvenile hormone biosynthesis . Considerable evidence links members of CYP3 clade in insect to xenobiotic metabolism and also insecticide resistance, whereas some CYP4s appear to be inducible metabolizers of xenobiotics and others have been linked to odorant or pheromone metabolism . Phylogenetic analysis showed that the majority of CSM P450s belongs to clan 2 (22) and clan 4 (17) compared to clan 3 (8) and clan M (6) (Figure 6). Interestingly, we found a “bloom” or family expansion in clade 2 and a contraction in clade 3 in the Tetranychus lineage compared to Insecta (Table 4). Considering members of the CYP3 and CYP4 clades in most insect species are commonly involved in environmental response/detoxifying functions against xenobiotics and phytotoxins , so the CYP2 and CYP4 clades exert these functions in Tetranychus mites quite possibly.
The tree was constructed from the multiple alignments using PhyML3.1 software and generated with 500 bootstrap trials using maximum likelihood approach method.
Glutathione S-transferases (GSTs) are a class of multifunctional detoxification enzymes and play an important role in the metabolism of a variety of insecticides, especially organophosphorus insecticides . The increased expression and activity of GSTs has been documented as a mechanism of insect resistance , . In our study, a total of 27 putative GSTs transcripts were identified in the CSM transcriptome (Table S6). Based on the results of the closest BLAST hits against the NCBI nr database, T. urticae genome database and phylogenetic analysis, 22 GSTs genes were remained and belong to five classes, mu (8), delta (7), omega (2), zeta (1) and kappa (1), unknown (3), respectively (Figure 7). The GSTs family and number of GSTs genes between Subclass Acari such as CSM and Insecta are different (Table 5). For example, the delta and epsilon GST classes in Insecta seem to be implicated in xenobiotic detoxification , but no epsilon GST class gene was found in the Acari (except only one was found in Ixodes scapulari) replaced by mu GST class which also was responsible for insecticides resistance . The sigma GST class is widespread in Insecta but not identified in Acari, which was further evidenced playing roles in the flight muscle operating under oxidative stress . Finally, 1 gene of the kappa class was found in 3 species of Acari but not in Insecta, which had high catalytic activity for aryl halides, such as CDNB, and can reduce CuOOH and (S)-15-hydroperoxy-5, 8, 11, 13-eicosatetraenoic acid , .
The tree was constructed from the multiple alignments using PhyML3.1 software and generated with 500 bootstrap trials using maximum likelihood approach method.
The main physiological functions of carboxylesterases (CarEs) include the degradation of the neurotransmitters, metabolism of specific hormones and pheromones, detoxification, defense and behavior. It is a hydrolase and can hydrolyze carboxyl ester bond and phosphodiester bond, thus metabolizing various ester bond-containing insecticides. Studies have shown that the amplification of CarEs genes is one of the important mechanisms that are involved in insecticide resistance –. Our study showed that 29 CarEs transcripts have been identified in the CSM transcriptome. After mapped these transcripts to the genome of T. urticae, a total of 23 CarEs sequences were confirmed to be unique genes (Table S7). Based on phylogenetic analysis with other known insects CCEs or the identification of closest blastn hits in the T. urticae genome database, CSM CarEs were divided into three clades, with 13 unique transcripts in Clade J', 4 in Clade J'', 1 in Clade F' and 5 in undetermined (1 in Clade J was AChE) (Figure 8). Comparative analysis of CCEs showed that the number of CCEs in Acari and Insecta is about the same, with the exception that T. urticae had a significant expansion, however, the majority of CCEs are assigned to the Neuro/developmental class in Acari. It is noteworthy that the dietary class of CCEs is widespread in insect but not found in the Acari (Table 6).
The tree was constructed from the multiple alignments using PhyML3.1 software and generated with 500 bootstrap trials using maximum likelihood approach method.
Positive Selection Analysis of Genes Encoding Metabolic Enzyme
To identify metabolic enzyme encoding genes of T. cinnabarinus undergoing positive selection, a ω = dN/dS analysis in T. cinnabarinus/T. urticae ortholog pairs was performed. Generally speaking, synonymous and nonsynonymous substitution rates are defined under the comparison of two DNA sequences, namely dS and dN represent the numbers of synonymous and nonsynonymous substitutions per site, respectively. Thus, the ratio ω measures the difference between the two rates and is most easily understood from a mathematical description of a codon substitution model. In other words, an amino acid in neutral change will be fixed at the same rate as a synonymous mutation with ω = 1, in deleterious change will reduce its fixation rate, thus ω<1, in selective advantage change with ω>1. Therefore, significant advantage change offers convincing evidence for diversifying selection . Among the 34 pairs of T. cinnabarinus/T. urticae orthologs ω values ranged from 0.910 to 2.595, with an average of 1.513, in which 32 pairs had a ω value greater than 1 (Table 7), suggesting these 32 unigenes were under positive selection.
Seven Categories of Genes Mediating Target Resistance
Glutamate-gated chloride channels (GluCls), also known as inhibitory glutamate receptors (IGluRs), belong to the superfamily of cysteine loop ligand-gated ion channel, and the function of Glucls is mainly in mediating inhibitory neurotransmission in nerve and muscle cells . Because of GluCls are found only in invertebrates and have not been found in vertebrates, it is the ideal target for highly selective insecticides. Based on electrophysiological and pharmacological studies, glutamate receptors are divided into two categories termed ionotropic and metabotropic receptors. Insecticides that act on GluCls include abamectin, ivermectin, fipronil and the indole diterpenoid compound nodulisporic acid. A particular mutation in the α-subunit of GluCls causes the substitution of one amino acid, resulting in reduced sensitivity of the mutant channel to insecticide and thereby causing insecticide resistance . A total of 7 GluCls sequences were identified from the CSM transcriptome (Table S8), but most of insects, such as D. melanogaster, T. castaneum and A. mellifera, only have one glutamate-gated chloride channel subunit.
The nAChRs represent a diverse family of cys-loop ligand-gated ion channels. It plays an important role in the transmission of nerve signals at the postsynaptic membrane in both vertebrates and invertebrates . The current insecticides that are acting on insect nAChR mainly include nereistoxin, neonicotinoid and the biological insecticide, spinosad. These insecticides specifically bind to the insect nAChR and block the normal neural function of the receptors, thus leading to the paralysis and death of insects . In contrast to the case of many animals, insects are thought to have relatively few (on the order of 10 to 12) nAChR type receptor gene families. In T. cinnabarinus, 9 nAChRs unigenes have been identified (Table S9).
Acetylcholinesterase (AChE) is a very important neurotransmitter hydrolase that maintains the in vivo cholinergic nerve impulses and is an important target of organophosphate and carbamate insecticides. Inhibition of AChE by insecticides could lead to the accumulation of acetylcholine in the synapses and excessive levels of acetylcholine block depolarization, thus inhibiting normal nerve conduction and ultimately leading to the death of insects. It has been found that changes in AChE are one of the important mechanisms for insect resistant to organophosphate and carbamate insecticides. Many single amino acid substitutions can be detected in the AChE gene, which either act alone or as combination to decrease the sensitivity of AChE to insecticide , . One of the CCEs was identified (clade J in Figure 8) in T. cinnabarinus and it belonged to the AChE class (Table S7).
The GABA receptors also belong to the super family of cys-loop neurotransmitter receptors. GABARs are the main target for the phenylpyrazole insecticides (such as fipronil), abamectin and cyclopentyl diene insecticides . It has been reported that the mechanism underlying GABAR target resistance is the replacement of one alanine by serine or glycine in the open reading frame. This alanine plays a very important role in the binding between GABARs and insecticides, and the substitution of this alanine causes insects to become resistant to insecticides . Insect GABA receptors are divided into three classes. These are known to be encoded by the Rdl, Grd, or Lcch3 gene. Interestingly, most insect genomes sequences contain only one Rdl orthologues, however we found 3 Rdl orthologues, 2 GABA-A receptors and 3 GABA-B receptors in T. cinnabarinus (Table S10).
Sodium channel is the main target of DDT and pyrethroid insecticides. Pyrethroids can interfere with gating kinetics of sodium channels, slowing inactivation during membrane depolarization and extending the sodium ion current, and thus can cause repetitional discharges and blocked nerve conduction , . Many Studies have shown that nonsynonymous mutations in the sodium channel involve in insecticide resistance. Our previous study of CSM (GenBank accession number: GU196305) has showed that mutations on sodium channel were associated with fenpropathrin resistance . In this study, we found 7 transcripts hit against sodium channel with 100% accuracy (Table S11).
ATP synthase (ATPase) is one of the targets of beta-cypermethrin. It has been found that target resistance caused by reduced Na+-K+-ATPase and Ca2+-ATPase sensitivities is one of the important mechanisms that involve in beta-cypermethrin resistance of insects . Studies have found that the toxicological mechanisms of pyrethroid insecticides are closely related to the Na+-K+-ATPase in insect nervous system . Two Na+-K+- ATPase and 4 Ca2+- ATPase were identified from our CSM transcriptome assembly analysis (Table S12).
Cytochrome b (Cytb) is an important class of redox proteins in organisms. It locates in the electron transfer chain, and participates in a series of oxidation-reduction reactions of the living body, including the NADP dependent fatty acid desaturation, oxidation-reduction reactions catalyzed by cytochrome P450 and redox reactions of methemoglobin. Cytochrome b (Cyt b) was newly reported to be the alternative target for acaricide bifenazate , . We identified 12 Cyt b sequences from the CSM transcriptome assembly analysis (Table S13).
We obtained 45,016 contigs and 25,519 unigenes by sequencing the CSM transcriptome. BLAST was used to search the nr, SwissProt, the Clusters of Orthologous Groups (COGs), Kyoto Encyclopedia of Genes and Genomes (KEGG), Gene Ontology (GO) databases and T. urticae genome, from which 15,167 unigenes were annotated. These assembled unigenes were used for the identification of the CSM genes associated with insecticide resistance. Totally, 53 P450-related genes, 23 CarEs-, 22 GSTs-, 8 GABA receptor-, 1 AchE- 1 sodium channel-, 12 Cyt b-, 7 GluCls-, 6 ATPase- and 9 nAChRs-related genes from the CSM transcriptome were identified. These gene categories have been reported to be related to insecticide resistance.
We, for the first time, employed RNA-seq to provide a comprehensive identification of the critical elements that may involve in the development of insecticide resistance in CSM. This study utilized for the first time high-throughput sequencing technology to investigate the CSM transcriptome. Although most of the unigenes are not full length, they will greatly improve our understanding of CSM at the molecular level, and the large amount of gene sequences laid a very important foundation for the future investigation of the CSM genes with either known or unknown function.
Materials and Methods
The laboratory population of Carmine spider mite (CSM), T. cinnabarinus was first collected from the field of Beibei District, Chongqing mulicipality directly under the central government, China and no specific permissions were required for these collection activities because this mite is a kind of agriculture-harmful pest and distributes worldwide. We confirm that the field collection did not involve endangered or protected species.
The laboratory CSM population was derived by transferring the CSMs growing on cowpea (Vigna sesquipedalis Koern) leaves from the field of Beibei District, Chongqing mulicipality, China to fresh potted cowpea leaves in the lab, and had been raised in artificial climate chamber for 14 years without any insecticide. The rearing conditions were: 26±1°C temperature, 35% to 55% humidity, and 14∶10 (L: D) photoperiod.
Water was added to 60 Petri dishes with a diameter of 12 cm, and two pieces of sponge 4 cm×3 cm×2 cm in size were placed in each dish. Each sponge block was covered with a thin layer of absorbent cotton to increase the water absorption of the sponge, and a piece of filter paper with a size matched exactly with the sponge was placed on top of the sponge wrapped with absorbent cotton. Fresh leaves of cowpea seedling, which were slightly smaller than the filter paper, were collected, cleaned with water, wiped dry, and then placed on the filter paper with the top of leaves facing down. Each leaf was then transferred with 20 female adult mites. The female mites were allowed to lay eggs for 48 h, and then removed. The number of eggs was recorded, and each leaf contained around 200 to 400 eggs. Subsequently, 2-day-old eggs, 0.5-day-old larva, 1-day-old nymphs and 3-day-old female adult and male adult mites were collected.
RNA-seq and Sequence Information
We collected 4000 eggs, 2000 larva, 1000 nymphs, 800 female adult mites, and 1000 male adult mites, which were placed in a mortar, mixed with liquid nitrogen and fully grounded. The RNeasy plus MicroKit kit (Qiagen GmbH, Hilden, Germany) was used to extract the total RNA of mites at each life stage in accordance with the product manual. The total RNA concentration was above 400 ng/ul, and the total amount RNA of each stage was greater than 20 ug. The quality of the RNA sample was verified by ensuring that the OD260/280 was within the range of 1.8 to 2.2 as measured by the NanoVue UV-Vis spectrophotometer (GE Healthcare Bio-Science, Uppsala, Sweden), and the RNA integrity number (RIN) was greater than or equal to 7 (RIN value was measured by the BGI-Shenzhen). In addition, qualified samples also had a 28S to 18S rRNA ratio above 1.0 as measured by 1% agarose gel electrophoresis.
The qualified RNA sample was send to BGI (Beijing Genomics Institute, China) for Illumina sequencing and standardized analyzing (include estimation of data output, composition and quality assessment of sequencing data, results of the assembly, functional annotation of Unigene, GO classification, and Pathway enrichment analysis). Briefly, after extracting the total RNA from samples, magnetic beads with Oligo (dT) were used to enrich eukaryotic mRNA. The fragmentation buffer was added to break mRNAs into short fragments. The mRNA were used as the template to synthesize the first strand cDNA using random hexamers, followed by synthesis of the second strand cDNA by adding buffer, dNTPs, RNase H and DNA polymerase I. The cDNA was purified by the Qiaquick PCR kit and eluted with the EB buffer, followed by end repair, addition of poly (A) and ligation of the sequencing adaptor. The size of the fragments was selected by agarose gel electrophoresis, followed with PCR amplification. The constructed sequencing library was sequenced using the Illumina HiSeq 2000.
De novo Assembly
Figure 9 provides a flow diagram of a computational procedures used for this study. Before being assembled, the clean reads generated by Illumina sequencing were filtered to remove low quality reads from raw reads. First, de novo transcriptome analysis of the clean reads was carried out using the short-read assembly program Trinity  to generate longer fragments, which were called contigs. Next, the reads were mapped back to contigs. Finally, Trinity connects the contigs, and gets sequences that cannot be extended on either end. Such sequences are defined as Unigenes.
All de novo assembled unique transcripts were compared to protein databases including nr, KEGG, COG and T. urticae genome sequence information (http://bioinformatics.psb.ugent.be/webtools/bogas/overview/Tetur) using the Blastx algorithm with a significant cut-off of E-value <10−5. The best matches were used to identify coding regions and to determine the sequence direction. The functional annotations of the sequences were predicted using nr database, then Blast2GO was used to get GO annotation of Unigenes . WEGO software was used to do GO functional classification for all unigenes and to understand the distribution of gene functions of the species from the macro level. KEGG is a database that is able to analyze gene product during metabolism process and related gene function in the cellular processes. With the help of KEGG database, we can further study genes’ biological complex behaviors, and by KEGG annotation we can get pathway annotation for unigenes. Unigenes are firstly aligned by blastx (E-value <10−5) to protein databases in the priority order of nr, Swiss-Prot, KEGG and COG. When all alignments are finished, proteins with highest ranks in blast results are taken to decide the coding region sequences of Unigenes, the coding region sequences are translated into amino sequences with the standard codon table. So both the nucleotide sequences (5′ - 3′) and amino sequences of the unigene encoding region are acquired. Unigenes that cannot be aligned to any database are scanned by ESTScan, producing nucleotide sequence (5′ - 3′) direction and amino sequence of the predicted coding region.
The entire assembled transcript dataset was used to search for the best hit homologous proteins (BLASTX cut-off e-value 1.0E-5) in the T. urticae proteomes. Ortholog prediction was done by performing BLASTX and TBLASTN bidirectional comparisons between T. cinnabarinus and T. urticae (e value 1.0E-5) to identify the hits within the two species.
To identify the proportion of the core eukaryotic genome covered by the T. cinnabarinus transcriptome, we used HMM profiles corresponding to the 458 core eukaryotic proteins as provided by the CEGMA algorithm . Local HMMER3 searches  were calibrated using the T. urticae core eukaryotic protein validated dataset consisting of 448 sequences .
Identification of Genes Related to Insecticide Resistance
To identify the sequences encoding genes related to insecticide resistance, such as detoxification enzymes (GSTs, CarEs and P450) and insecticide targets (IGluRs, AChE, GABA receptor, sodium channel, Cytochrome b, ATP synthase and nAChRs), sequences encoding potential pesticide-related genes (>180 bp) were identified using the blastx program in the NCBI database with a cut-off E-value <10−5. Among the unigenes shown to contain sequences related to insecticide resistance, some corresponded to the same genes. In these cases the transcripts were advances to a filter to identify different isoforms and transcripts based on being mapped in the T. urticae genome database. The unique transcripts mapped in the same blast results or with high homology to one another were eliminated as allelic variants or as different parts of the same gene. In such cases, the longer ones were adopted. PhyML3.1 software  was used to analyze the phylogenetic relationships between interested genes with the related genes of other species, especially with T. urticae, to make a prediction of their classification and homology. A maximum likelihood analysis was used to create phylogenetic trees of resistance-related genes. Positions containing alignment gaps and missing data were eliminated with pairwise deletion. Bootstrap analysis of 500 replication trees was performed to evaluate the branch strength of each tree.
Positive Selection Analysis of Metabolic Enzyme
T. cinnabarinus orthologs of detoxification genes (P450s, GSTs, CCEs) in T. urticae were identified based on our phylogenetic analyses. Pairs having a bootstrap value greater than 90% and alignment length greater than 500nt were selected for further analysis. Pair-wise amino acid alignments of the region in common between two orthologs were conducted using Muscle 3.8.31. The amino acid sequences were back-translated to nucleotide sequences and used for the estimation of the pairwise non-synonymous (dN) and synonymous (dS) substitution rates using MEGA 5.05 . The Jukes-Cantor distance model with the modified Nei-Gojobori method was used. The pair-wise ratios of dN/dS (ω) were calculated and used to investigate if T. cinnabarinus sequences were evolved under positive (ω>1) or negative, purifying (ω <1) selection or neutrally (ω = 1) compared to the corresponding sequences of T. urticae.
The result of aligning T. cinnabarinus transcriptome with T. urticae genome CDS database.
The result of aligning T. cinnabarinus transcriptome with T. urticae genome proteins database.
Alignment of assembled unigenes to proteins in the CEGMA database, as determined by a BLASTx search.
Distribution of KEGG functional annotation of the CSM transcriptome.
Predicted insecticide resistance-related P450 transcripts. Length: sequences length of the unigene. Identity: seqeunce identity of the alignment to the T. urticae protein. Tu ID: ID of T. urticae (Tu) genome.
Predicted insecticide resistance-related GST transcripts. Length: sequences length of the unigene. Identity: seqeunce identity of the alignment to the T. urticae protein. Tu ID: ID of T. urticae (Tu) genome.
Predicted insecticide resistance-related CarEs and AChE transcripts. Length: sequences length of the unigene. Identity: seqeunce identity of the alignment to the T. urticae protein. Tu ID: ID of T. urticae (Tu) genome.
Predicted insecticide resistance-related GluCls transcripts. Length: sequences length of the unigene. Identity: seqeunce identity of the alignment to the T. urticae protein. Tu ID: ID of T. urticae (Tu) genome.
Predicted insecticide resistance-related nAChRs transcripts. Length: sequences length of the unigene. Identity: seqeunce identity of the alignment to the T. urticae protein. Tu ID: ID of T. urticae (Tu) genome.
Predicted insecticide resistance-related GABA recceptor transcripts. Length: sequences length of the unigene. Identity: seqeunce identity of the alignment to the T. urticae protein. Tu ID: ID of T. urticae (Tu) genome.
Predicted insecticide resistance-related VGSC transcripts. Length: sequences length of the unigene. Identity: seqeunce identity of the alignment to the T. urticae protein. Tu ID: ID of T. urticae (Tu) genome.
Predicted insecticide resistance-related calcium and sodium- potassium ATPase transcripts. Length: sequences length of the unigene. Identity: seqeunce identity of the alignment to the T. urticae protein. Tu ID: ID of T. urticae (Tu) genome.
The authors thank anonymous reviewers and academic editor for their constructive suggestions about polishing the manuscript.
Conceived and designed the experiments: LH. Performed the experiments: ZFX WYZ. Analyzed the data: ZFX WYZ QSC MP XZW YCL. Contributed reagents/materials/analysis tools: LH GMS. Wrote the paper: LH ZFX WYZ GMS XL.
- 1. He L, Xue CC, Wang JJ, Li M, Lu WC, et al. (2009) Resistance selection and biochemical mechanism of resistance to two acaricides in Tetranychus cinnabarinus (Boisduval). Pestic Biochem Physiol 93: 47–52.
- 2. Jia FL, Chen YJ, Chen J, Wang DD, Dai GH (2011) Biological activity of extracts from 8 species of plants against Tetranychus cinnabarinus. Chinese Agricultural Science Bulletin 27: 286–291.
- 3. Zhang JP, Wang JJ, Zhao ZM, He L, Dou W (2003) In vitro inhibiting of esterases in Tetranychus cinnabarinus (Boisduval) by three insecticides. Acta Arachnologica Sinica 12: 95–99.
- 4. Sun QT, Meng ZJ (2001) Biological characteristics of Tetranychus cinnabarinus, a vegetable pest. Journal of Jilin Agricultural University 23: 24–25.
- 5. Liang W, Bai XN, Ma LQ, Shi GL, Wang YN (2011) Preliminary study on scopoletin toxicity to Tetranychus cinnabarinus and its acaricidal mechanism. Guang Dong Agricultural Sciences 8: 68–71.
- 6. Robinson DM (1961) A species of Tetranychus Dufour (Acarina) from Uganda. Nature 189: 857–858.
- 7. Dupont LM (1979) On gene flow between Tetranychus urticae Koch, 1836 and Tetranychus cinnabarinus (Boisduval) Boudreaux, 1956 (Acari: Tetranychidae): synonymy between the two species. Entomol Exp Appl 25: 297–303.
- 8. Ehara S (1989) Recent advances in taxonomy of Japanese tetranychid mites. Shokubutu-boeki 43: 358–361 (in Japanese)..
- 9. Gotoh T, Tokioka T (1996) Genetic compatibility among diapausing red, non-diapausing red and diapausing green forms of the two-spotted spider mite, Tetranychus urticae Koch (Acari: Tetranychidae). Japanese Journal of Entomology 64: 215–225.
- 10. Ros VID, JAJ Breeuwer (2007) Spider mite (Acari: Tetranychidae) mitochondrial COI phylogeny reviewed: host plant relationships, phylogeography, reproductive parasites and barcoding. Exp Appl Acarol 42: 239–262.
- 11. Bolland HR, Gutierrez J, Flechtmann CHW (1998) World catalogue of the spider mite family (Acari: Tetranychidae), with references to taxonomy, synonymy, host plants and distribution. Brill Academic Publishers, Leiden, Netherlands.
- 12. Renata S, de Mendonca, Navia D, Diniz IR, Auger P, et al. (2011) A critical review on some closely related species of Tetranychus sensu stricto (Acari: Tetranychidae) in the public DNA sequences databases. Exp Appl Acarol 55: 1–23.
- 13. Boudreaux HB (1956) Revision of the two spotted spider mite (Acarina, Tetranychidae) complex, Tetranychus telarius (Linnaeus). Ann Entomol Soc Am 49: 43–49.
- 14. Kuang HY, Chen LS (1990) Studies on the differentiation of two sibling species Tetranychus cinnabarinus (Boisduval) and T. urticae Koch. Acta Entomologica Sinica 33(1): 109–116.
- 15. Smith FF, AL Boswell, RE Webb (1969) Segregation between strains of carmine and green two-spotted spider mites. Proc Int Congr Acarol. 2rid, Sutton Bonington. 1967: 155–159.
- 16. Jordaan LC (1977) Hybridization studies on the Tetranychus cinnabarinus complex in South Africa (Acari: Tetranychidae). J Entomol Soc S Aft 40: 147–156.
- 17. Brandenburg RL, Kennedy GG (1981) Differences in dorsal integumentary lobe densities between Tetranychus urticae Koch and Tetranychus cinnabarinus (Boisduval) (Acarina: Tetranychidae) from Northeastern NorthCarolina. Int J Acaro7: 231–234.
- 18. Goka K, Takafuji A, Toda S, Hamamura T, Osakabe M, et al. (1996) Genetic distinctness between two forms of Tetranychus urticae Koch (Acari: Tetranychidae) detected by electrophoresis. Exp Appl Acarol 20: 683–693.
- 19. Zhang ZQ, Jacobson RJ (2000) Using adult female morphological characters for differentiating Tetranychus urticae complex (Acari: Tetranychidae) from greenhouse tomato crops in UK. Syst Appl Acarol 5: 69–76.
- 20. Sugasawa J, Kitashima Y, T Gotoh (2002) Hybrid affinities between the green and the red forms of the two-spotted spider mite Tetranychus urticae (Acari: Tet-ranychidae) under laboratory and semi-natural conditions. Appl Entomol Zool 37: 127–139.
- 21. Li T, Chen XL, Hong XY (2009) Population genetic structure of Tetranychus urticae and its sibling species Tetranychus cinnabaribus (Acari: Tetranychidae) in China as inferred from microsatellite data. Ann Entomol Soc Am 102: 674–683.
- 22. Grbic M, Leeuwen TV, Clark RM, Rombauts S, Rouzé P, et al. (2011) The genome of Tetranychus urticae reveals herbivorous pest adaptations. Nat Biotechnol 479: 487–492.
- 23. Jia FL, Chen YJ, Chen J, Wang DD, Dai GH (2011) Biological activity of extracts from 8 species of plants against Tetranychus Cinnabarinus. Chinese Agricultural Science Bulletin. 27: 286–291.
- 24. Zhai QH (1995) Some aspects of progress in insect molecular biology: molecular mechanisms of insecticide resistance. Acta Entomologica Sinica 38: 493–501.
- 25. Qiu XH (2005) Insecticide resistance: genetics, genomics and implications for pest control. Acta Entomologica Sinica 48: 960–967.
- 26. Nieuwenhuyse PV, Leeuwen TV, Khajehali J, Vanholme B, Tirry L (2009) Mutations in the mitochondrial cytochrome b of Tetranychus urticae Koch (Acari: Tetranychidae) confer cross-resistance between bifenazate and acequinocyl. Pest Manag Sci 65: 404–412.
- 27. Leeuwen TV, Nieuwenhuyse PV, Vanholmet B, Dermauw W, Tirry L (2011) Parallel evolution of cytochrome b mediated bifenazate resistance in the citrus red mite Panonychus citri. Insect Mol Biol 20: 135–140.
- 28. Hsu JC, Chien TY, Hu CC, Chen MJM, Wu W J, et al. (2012) Discovery of genes related to insecticide resistance in Bactrocera dorsalis by functional genomic analysis of a de novo assembled transcriptome. PloS one 7: e40950.
- 29. Zhu JY, Zhao N, Yang B (2012) Global Transcriptome Profiling of the Pine Shoot Beetle, Tomicus yunnanensis (Coleoptera: Scolytinae). PloS one 7: e32291.
- 30. Parra G, Bradnam K, Ning Z, Keane T, Korf I (2009) Assessing the gene space in draft genomes. Nucleic Acids Res 37: 289–297.
- 31. Parra G, Bradnam K, Korf I (2007) CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics 23: 1061–1067.
- 32. Calvo E, Andersen J, Francischetti IM, Debianchi AG, James AA, et al. (2004) The transcriptome of adult female Anopheles darlingi salivary glands. Insect Mol Biol 13: 73–88.
- 33. Kanehisa M (2002) The KEGG database//Novartis Found Symp. 247: 91–101.
- 34. Daborn PJ, Lumb C, Boey A, Wong W, Batterham P (2007) Evaluating the insecticide resistance potential of eight Drosophila melanogaster cytochrome P450 genes by transgenic over-expression. Insect Biochem Mol Biol 37: 512–519.
- 35. Feyereisen R (2011) Arthropod CYPomes illustrate the tempo and mode in P450 evolution. Biochimica et Biophysica Acta (BBA)-Proteins and Proteomics 1814: 19–28.
- 36. Tijet N, Helvig C, Feyereisen R (2001) The cytochrome P450 gene superfamily in Drosophila melanogaster: Annotation, intron-exon organization and phylogeny. Gene 262: 189–198.
- 37. Feyereisen R (2006) Evolution of insect P450. Biochem Soc T 34: 1252–1255.
- 38. Claudianos C, Ranson H, Johnson RM, Biswas S, Schuler MA, et al. (2006) A deficit of detoxification enzymes: pesticide sensitivity and environmental response in the honeybee. Insect Mol Biol 15: 615–636.
- 39. Werck-Reichhart D, Feyereisen R (2000) Cytochromes P450: a success story. Genome Biol. 1: 3003.1–3003.9.
- 40. Bass C, Field LM (2011) Gene amplification and insecticides resistance. Pest manag Sci 67: 886–890.
- 41. Yamamoto K, Shigeoka Y, Aso Y, Banno Y, Kimura M, et al. (2009) Molecular and biochemical characterization of a Zeta-class glutathione S-transferase of the silkmoth. Pestic Biochem Physiol 94: 30–35.
- 42. Ranson H, Rossiter L, Ortelli F, Jensen B, Wang X, et al. (2001) Identification of a novel class of insect glutathione S-transferases involved in resistance to DDT in the malaria vector Anopheles gambiae. Biochem J 359: 295–304.
- 43. Ranson H, Claudianos C, Ortelli F, Abgrall C, Hemingway J, et al. (2002) Evolution of supergene families associated with insecticide resistance. Science 298: 179–181.
- 44. Low WY, Ng HL, Morton CJ, Parker MW, Batterham P, et al. (2007) Molecular evolution of glutathione S-transferases in the genus Drosophila. Genetics 177: 1363–1375.
- 45. Konanz S, Nauen R (2004) Purification and partial characterization of a glutathione-S-transferase from the two-spotted spider mite, Tetranychus urticae. Pestic Biochem Physiol 79: 49–57.
- 46. Hayes JD, Flanagan JU, Jowsey IR (2005) Glutathione transferases. Annu. Rev Pharmacol Toxicol 45: 51–88.
- 47. Hotelier T, Negre V, Marchot P, Chatonnet A (2010) Insecticide resistance through mutations in cholinesterases or carboxylesterases: data mining in the ESTHER database. J Pestic Sci 35: 315–320.
- 48. Sun W, Xue CH, He L, Lu WC, Li M, et al. (2010) Molecular characterization of two novel esterase genes from carmine spider mite, Tetranychus cinnabarinus (Acarina: Tetranychidae). Insect Sci 17: 91–100.
- 49. Li X, Schuler MA, Berenbaum MR (2007) Molecular mechanisms of metabolic resistance to synthetic and natural xenobiotics. Annu Rev Entomol 50: 231–253.
- 50. Yang Z, Bielawski JP (2000) Statistical methods for detecting molecular adaptation. Trends Ecol Evol 15: 496–503.
- 51. Dermauw W, Ilias A, Riga M, Tsagkarakou A, Grbic M, et al. (2012) The cys-loop ligand-gated ion channel gene family of Tetranychus urticae: implications for acaricide toxicology and a novel mutation associated with abamectin resistance. Insect Biochem Mol Biol 42: 455–465.
- 52. Wolstenholme AJ, Rogers AT (2005) Glutamate-gated chloride channels and the mode of action of the avermectin/milbemycin anthelmintics. Parasitology 131: S85–S95.
- 53. Tomizawa M, Casida JE (2005) Neonicotinoid insecticide toxicology: mechanisms of selective action. Annu Rev Pharmacol Toxicol 45: 247–268.
- 54. Millar NS, Denholm I (2007) Nicotinic acetylcholine receptors: targets for commercially important insecticides. Invert Neurosci 7: 53–66.
- 55. Lu WC (2009) Effects of abamectin and heat stresses on the expression of GABA and GABA receptor in Tetranychus cinnabarinus. Chongqing, China: Southwest University.
- 56. Dong K. 2007. Insect sodium channels and insecticide resistance. Invert Neurosci 7: 17–30.
- 57. Morin S, Williamson MS, Goodson SJ, Brown JK, Tabashnik BE, et al. (2002) Mutations in the Bemisia tabaci para sodium channel gene associated with resistance to a pyrethroid plus organophosphate mixture. Insect Biochem Mol Biol 32: 1781–1791.
- 58. Feng YN, Zhao S, Sun W, Li M, Lu WC, et al. (2011) The sodium channel gene in Tetranychus cinnabarinus (Boisduval): identification and expression analysis of a mutation associated with pyrethroid resistance. Pest Manag SCI 67: 904–912.
- 59. Ma Y, Ma M, Zhou TH, Huo XB (2006) Inhibition of high effect cypermethrin on Ca-ATPase of Blattella germanica. Chin J Public Health 22: 1080–1081.
- 60. Ma Y, Fu RS, Huo XB (2004) Inhibition of high effect cypermethrin on Na-K-ATPase of Blattella germanica. Chin J Public Health 20: 1440–1441.
- 61. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, et al. (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29: 644–652.
- 62. Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, et al. (2005) Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21: 3674–3676.
- 63. Eddy S (2010) HMMER3: a new generation of sequence homology search software. URL: http://hmmer. janelia. Org.
- 64. Guindon S, Delsuc F, Dufayard JF, Gascuel O (2009) Estimating maximum likelihood phylogenies with PhyML[M]//Bioinformatics for DNA Sequence Analysis. Humana Press 113–137.
- 65. Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32: 1792–1797.
- 66. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, et al. (2011) MEGA5: Molecular Evolutionary Genetics Analysis Using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony Methods. Mol Biol Evol 28: 2731–2739.