Although the importance of insect saliva in insect-host plant interactions has been acknowledged, there is very limited information on the nature and complexity of the salivary proteome in lepidopteran herbivores. We inspected the labial salivary transcriptome and proteome of Helicoverpa armigera, an important polyphagous pest species. To identify the majority of the salivary proteins we randomly sequenced 19,389 expressed sequence tags (ESTs) from a normalized cDNA library of salivary glands. In parallel, a non-cytosolic enriched protein fraction was obtained from labial salivary glands and subjected to two-dimensional gel electrophoresis (2-DE) and de novo peptide sequencing. This procedure allowed comparison of peptides and EST sequences and enabled us to identify 65 protein spots from the secreted labial saliva 2-DE proteome. The mass spectrometry analysis revealed ecdysone oxidase, glucose oxidase, fructosidase, carboxyl/cholinesterase and an uncharacterized protein previously detected in the H. armigera midgut proteome. Consistently, their corresponding transcripts are among the most abundant in our cDNA library. We found redundancy in the sequence identification of saliva-secreted proteins, suggesting multiple isoforms. As expected, we found several enzymes responsible for digestion and plant offense. In addition, we identified non-digestive proteins such as an arginine kinase and abundant proteins of unknown function. This identification of secreted salivary gland proteins allows a more comprehensive understanding of insect feeding and poses new challenges for the elucidation of protein function.
Citation: Celorio-Mancera MdlP, Courtiade J, Muck A, Heckel DG, Musser RO, Vogel H (2011) Sialome of a Generalist Lepidopteran Herbivore: Identification of Transcripts and Proteins from Helicoverpa armigera Labial Salivary Glands. PLoS ONE 6(10): e26676. https://doi.org/10.1371/journal.pone.0026676
Editor: Vladimir N. Uversky, University of South Florida College of Medicine, United States of America
Received: August 5, 2011; Accepted: September 30, 2011; Published: October 27, 2011
Copyright: © 2011 Celorio-Mancera et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by the Max Planck Society. Partial support for collaborative travel was provided by the National Science Foundation Plant Genome Research Initiative (No. 0820367 to ROM). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Many triploblastic metazoans benefit from a functional gland apparatus dedicated to producing saliva, a substance that in most cases lubricates their mouthparts and aids in predigestion. In addition, saliva may contain components crucial for a particular adaptation, from building a nest to disarming a host's antibleeding defense. In humans, salivary constituents and their function have been extensively studied to the point of using saliva as a diagnostic medium for various biochemical tests. The human salivary proteome is composed of more than 1300 proteins and ongoing proteomic studies are performed to understand its quantitative and qualitative plasticity and find disease-related biomarkers. The saliva produced by blood-feeding arthropods has also been well characterized. High-throughput approaches, including proteomics, have been utilized to identify the secreted salivary constituents of vectors such as ticks, triatomines, fleas, flies and mosquitoes, aiming to find good targets to control the diseases they transmit. It has been observed that blood-feeding animals share salivary constituents whose function is antihemostatic, such as vasodilators and inhibitors of blood coagulation and platelet aggregation.
More recently, salivary proteins or secreted proteomes of three different insect herbivore species have been elucidated. The protein profiles corresponding to these three aphid species reflect more differences than similarities among each other. However, this discrepancy may represent the different interaction between each aphid species and its host(s). The salivary constituents may also be very different depending on the particular feeding strategy used by an insect herbivore. Aphids, piercing the plant tissue intercellularly until reaching phloem cells, trigger a totally different plant defense response than the mostly jasmonic acid-regulated one triggered by a chewing caterpillar. The complexity and identity of caterpillar saliva constituents have not been studied in detail. However, there is evidence that a glucose oxidase produced by Helicoverpa zea is the primary salivary factor to suppress the induction of nicotine in tobacco plants and that saliva of this same lepidopteran species has antibacterial properties. In turn, elicitors of plant defense responses have been found in caterpillar regurgitate, which may include salivary components. The Old World cotton bollworm, H. armigera (Har), belongs to a “major-pest lineage” of the cosmopolitan subfamily Heliothinae (Lepidoptera:Noctuidae). Efforts to understand the digestive system of this generalist herbivore include the identification of its larval midgut lumen proteome. In turn, the insect gut has an intricate relationship with the salivary glands. It has been stated that during larval feeding, the plant tissue is sheared with the mandibles and passes through the foregut where it is mixed with digestive secretions from the salivary glands. The salivary apparatus is represented by the long and tubular labial glands and the relatively smaller mandibular glands. One of the characteristics of most Endopterygota is the ability of their larvae to produce protein threads (silk) from their labial glands. 
Therefore, silk production may be an ancestral function of the labial salivary glands in Lepidoptera. In the domesticated mulberry silkworm, Bombyx mori, the labial glands are referred to as “silk glands” since they produce massive amounts of silk proteins during the final stages of larval development. Due to their economic relevance, these silk proteins are the best characterized components of lepidopteran labial saliva. Here, we use an unbiased high-throughput approach to expand the current knowledge on labial saliva produced by a generalist phytophagous insect and, in particular, to aid in understanding the role of saliva in Har digestion and elicitation of host plant responses. For this purpose, we generated a salivary-gland transcriptome dataset and examined the non-cytosolic enriched protein fraction from labial salivary glands using two-dimensional gel electrophoresis, identifying the proteins using de novo peptide sequencing and public database searches including sequence information from diverse Har cDNA libraries.
Results and Discussion
Har salivary gland cDNA library
Normalization of the Har salivary gland cDNA resulted in reduction of any over-abundant transcripts and production of a more even distribution of transcripts ranging from 0.2 to >4.0 kb in size. The average size of the cDNAs of the Har salivary gland cDNA library that were cloned and sequenced was 1,040 bp. The total number of high quality reads subsequently used for the assembly was 19,389, with an average read length of 548 bases after vector clipping and quality trimming. Expressed sequence tag (EST) clustering resulted in a total of 2,826 contiguous sequences (contigs; with 2 to 485 ESTs) and 5,463 singletons represented by a single EST, yielding a total of 8,289 putative gene objects. We found that 50% of the contigs are above 980 bases, with the largest contig having 3,172 bases. The deduced sequences from 5,056 clusters (61% of total clusters) shared significant similarities with protein sequences deposited in non-redundant databases (EMBL/Genbank), a proportion comparable to that found in other studies of insect sialotranscriptomes. It should be noted that the transcripts of the unknown class could represent novel proteins or derive from the less conserved 3′ or 5′ untranslated regions of genes, as was indicated for the transcriptomes of other insects.
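The bookkeeping behind these assembly figures can be checked with a few lines of Python. This is a minimal sketch that uses only the counts reported above; the constant and function names are illustrative assumptions and are not part of the original analysis pipeline.

```python
# Counts reported in the text for the Har salivary gland EST assembly.
N_CONTIGS = 2826      # clusters assembled from 2 to 485 ESTs
N_SINGLETONS = 5463   # clusters represented by a single EST
N_ANNOTATED = 5056    # clusters with significant database similarity


def assembly_summary(n_contigs, n_singletons, n_annotated):
    """Return total gene objects and the annotated percentage.

    Hypothetical helper for illustration; not the authors' code.
    """
    total = n_contigs + n_singletons
    annotated_pct = 100.0 * n_annotated / total
    return total, annotated_pct


total, annotated_pct = assembly_summary(N_CONTIGS, N_SINGLETONS, N_ANNOTATED)
print(f"putative gene objects: {total}")
print(f"annotated clusters: {annotated_pct:.0f}%")
```

Contigs and singletons are summed because both count as putative gene objects in the text, and the annotated share is taken relative to that sum.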
Functional analyses using Gene Ontologies
For functional comparisons, all sequences were subjected to Gene Ontology (GO) analysis in Blast2GO, where we classified all gene objects in Biological Process, Molecular Function and Cellular Component. Of the 5,056 clusters in the Har salivary gland cDNA library with high-score matches in the Genbank non-redundant (nr) protein database, 4,173 (82.5%) shared significant similarity with proteins with assigned molecular functions in the GO database and thus could be classified into a GO category, with each class containing at least 21 sequences (0.5% of 4,173). Blast-positive clusters were classified into 10 molecular functional categories at level 2 of the gene ontology system (Figure S1), among which the “binding” (GO:0005488) and “catalytic activity” (GO:0003824) categories were over-represented (43% and 35%, respectively), followed by “structural molecule activity” (GO:0005198) and “transporter activity” (GO:0005215). Most of these dominant GO categories are also the most common functional categories identified in the venom gland transcriptome of a parasitic wasp. Transcript abundance is another indication of how important the encoded proteins can be to the specific organ or tissue, as in the case of digestive proteases in gut tissue. The most highly expressed genes in the moderately normalized Har salivary gland library encoded proteins involved in the mitochondrial respiratory chain and ATP synthesis (cytochrome C, vacuolar ATP synthase), general cellular homeostasis and ribosomal proteins, but also glucose oxidase and glucose dehydrogenase (belonging to the GMC oxidoreductase superfamily), coagulin, fibroin, lipases and protease inhibitors (e.g. brasiliensin) (Table S1).
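The percentage and the minimum class size quoted above follow from simple arithmetic, sketched here in Python. The helper names and the toy category counts are assumptions made for illustration; they are not Blast2GO functions and are not data from this study.

```python
import math

N_BLAST_POSITIVE = 5056  # clusters with nr database matches (from the text)
N_GO_ANNOTATED = 4173    # clusters assigned to a GO category (from the text)
MIN_FRACTION = 0.005     # 0.5% cut-off quoted in the text


def min_sequences_per_class(n_annotated, fraction):
    """Smallest whole number of sequences that meets the fractional cut-off."""
    return math.ceil(n_annotated * fraction)


def filter_categories(counts, n_annotated, fraction):
    """Keep only categories whose sequence count reaches the cut-off."""
    threshold = min_sequences_per_class(n_annotated, fraction)
    return {name: n for name, n in counts.items() if n >= threshold}


go_pct = 100.0 * N_GO_ANNOTATED / N_BLAST_POSITIVE
threshold = min_sequences_per_class(N_GO_ANNOTATED, MIN_FRACTION)
print(f"GO-annotated share: {go_pct:.1f}%")
print(f"minimum sequences per class: {threshold}")

# Toy counts, invented purely to show the filter; not results from the paper.
toy_counts = {"category_a": 500, "category_b": 21, "category_c": 20}
print(filter_categories(toy_counts, N_GO_ANNOTATED, MIN_FRACTION))
```

Rounding the cut-off up with `math.ceil` is one reasonable reading of "at least 21 sequences (0.5% of 4,173)"; a different rounding rule would need to be stated explicitly.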
When comparing the GO terms obtained from the salivary gland tissue library sequences with those obtained from other Har tissue-specific cDNA libraries (gut and hemocytes), there are clear differences in the relative representations of certain functional categories. Examples of such over-represented GO categories are hydrolase and oxidoreductase activity, which are more prominent in the gut tissue than in both hemocytes and salivary gland tissue, while the category structural molecule activity is more prominent in the salivary gland tissue than in both other tissues (Figure S2). Overall, the assembly into 8,289 putative gene objects from the salivary gland of Har and subsequent sequence annotation and functional categorization has revealed that this tissue is more complex than we envisioned beforehand. The encountered complexity can be at least partially addressed by identification of candidate gene groups.
Pre-digestion gene candidates
A very important aspect of any dietary constraints in Lepidoptera is the availability of proteins and nitrogen in the respective diets and the abundance of functional larval digestive enzymes to access these resources. Plant tissues are not only characterized by high levels of non-digestible materials such as cellulose and lignin, but leaves usually also contain low levels of both protein and lipids (i.e. triglycerides, phospholipids and galactolipids). The insect midgut has classically been viewed as a tissue primarily involved in digestion and detoxification, and endopeptidases such as serine proteases (trypsin and chymotrypsin-like) are thought to play the dominant role in protein hydrolysis, together with exopeptidases of varying terminal amino acid specificity (aminopeptidases and carboxypeptidases). So far the lepidopteran salivary gland has not explicitly been seen as an additional, and potentially important, source of enzymes involved in pre-digestion of plant materials. However, the pre-digestion of food may already occur on the damaged plant tissue, provided that enough saliva is secreted, or take place outside the midgut, for example in the crop or foregut or even to some extent inside the oral cavity. Digestive enzymes in this case either come from the salivary gland alone or are passed forward from the midgut and then mixed with salivary gland enzymes. It is noteworthy that, given the feeding mode of a chewing herbivore, salivary gland-derived enzymes would likely also end up in the midgut, thus adding to the midgut tissue-derived enzymatic composition of the gut lumen. In support of a role of the lepidopteran salivary gland in plant predigestion, our Har salivary gland cDNA library contains a range of contigs coding for proteases, lipases and amylases. 
Among the contigs with similarity to proteases, seven code for trypsin-like serine proteases, most of which display highest similarity to silk-gland derived serine proteases and trypsins from Ostrinia nubilalis with unknown functions. Among the proteases known to be present in the digestive enzyme repertoire of gut tissues, we could also identify 3 different carboxypeptidases but were unable to identify any sequence with homology to aminopeptidases. These proteases could act in concert, thus contributing to the efficient use of the low nitrogen content of the ingested plant material, a process that is completed by the gut enzymes.
In addition to nitrogen acquisition through the concerted action of proteases, insect herbivores need to have a range of lipases in order to overcome their host plant limitations in lipid content. For example, deficiency of cholesterol, normally synthesized from phytosterols, leads to increased larval mortality and reduced egg hatch. Several studies have characterized lipid metabolic activities from insects. These include lipase and phospholipase A2. Digestive lipases and phospholipases are key enzymes in processing dietary lipids and enzyme activities have been identified in lepidopteran larval midgut, fat body and salivary gland. Our Har salivary gland cDNA library codes for 8 lipases and 2 phospholipases, and two contigs coding for lipases are among the most highly expressed genes in the salivary gland (Table S1), pointing to an important role of these in plant tissue pre-digestion.
Though low in proteins and lipids, plant tissue is often a rich source of starch and sugars. In humans, starch degradation starts in the oral cavity, where an amylase enzyme in saliva begins to break down starch into disaccharides such as maltose but also into dextrin. We have identified several putative alpha-amylases and a maltase with high similarity to Dipteran salivary maltases. Alpha-amylase genes often form multigene families in living organisms and this multigene family has been extensively studied in Diptera. It is therefore interesting to note that there is a single predicted alpha-amylase sequence of a lepidopteran insect in the NCBI nr database, as all other hits of the Har salivary amylases are against Diptera or Hymenoptera. However, Blast searches against the NCBI dbEST database lead to multiple hits against insect ESTs mostly derived from midgut cDNA libraries, pointing to a lack of annotated amylase genes in public databases containing Lepidoptera sequences. Among the three different alpha-amylases from Har salivary glands, one predicted protein sequence shows a much higher similarity to Dipteran alpha-amylases than to the existing lepidopteran enzymes present in the NCBI nr database. One of the identified alpha-amylases, Har_Contig 3039, coding for only a partial protein sequence, is identical to an alpha-amylase previously identified in the gut lumen of Har (ABU98614). To examine the relationships among maltase proteins identified in Har salivary glands and those found in other insects, sequences from six insect species were aligned and used to construct a gene phylogeny (Figure 1A). The phylogenetic analysis revealed that these sequences clustered in distinct clades according to species phylogeny, with both lepidopteran maltase sequences clearly separated with high bootstrap support. Overall, the sequence alignment of Har and all other insect maltases displays multiple highly conserved amino acids (Figure 1B). 
As the salivary glands in Har express both alpha-amylases and a maltase, pre-digestion of complex carbohydrates could proceed from cleavage into disaccharides by the action of the alpha-amylases to the release of glucose through the action of maltase. We have also identified a salivary gland beta-glucanase (Contig_1557; Contig_4824) and a fructosidase (EF600050) previously identified as digestive enzymes in the gut of Har.
(A) An unrooted Bayesian inference tree constructed from the alignment of amino acid sequences presented in (B). The Helicoverpa (Har) sequence clusters with the predicted maltase from Bombyx mori (Bmo) with good bootstrap support. (B) The complete predicted polypeptide sequences of 5 insect maltases and the identified Helicoverpa maltase are aligned. Amino acid sequence alignments were performed using the MAFFT multiple alignment program. Identical residues are color-coded and residues highly conserved in all arthropod CLPs are marked with asterisks. Species abbreviations: Drosophila virilis (Dvi), Harpegnathos saltator (Hsa), Aedes aegypti (Aae), Culex quinquefasciatus (Cqu). GenBank accession numbers are given at the end of sequence names.
In addition to merely aiding in the digestion of plant nutrients, salivary gland enzymes could aid in host plant penetration and detoxify plant defensive phytochemicals, but could also both induce and degrade plant wound messengers. Several ribonucleases (RNases) are prominent in the Har salivary gland transcriptome, among which we have identified a contig with high sequence similarity to salivary secreted ribonucleases found in e.g. Glossina morsitans. Besides being active in nutrient acquisition through the degradation of ribonucleic acid, RNase has been shown, when applied to wounded plant tissue, to induce a pathogen defense response in the attacked plant.
For the closely related species H. zea it was shown that a glucose oxidase enzyme can manipulate inducible plant defenses to benefit the herbivore when this enzyme comes into contact with wounded plant tissues. In our salivary gland transcriptome dataset we have identified several gene objects with homology to glucose oxidase/glucose dehydrogenases, all of which belong to the superfamily of glucose-methanol-choline oxidoreductases (GMCs). The GMC oxidoreductase gene family is known for a variety of substrates and catalytic activities and has been characterized at the molecular and functional level in a beetle-host plant interaction system, where a specific GMC oxidoreductase is involved in beetle chemical defense. However, with the exception of a few GMC proteins, very little is known about the specific roles of members of this gene family.
Lepidoptera, like other insects, protect themselves against microbial infections through several defensive molecules, including the diverse group of antimicrobial peptides (AMPs). Many AMPs can lyse microbes, although this has only been directly shown with individual AMPs in a few cases, while others can also act as eukaryotic cytolysins. An emerging pattern and seemingly common feature of the blood-sucking or plant sap-feeding insect sialotranscriptomes analyzed (which mostly excludes Lepidoptera) is the presence of AMPs such as defensins, cecropins, and lysozyme, as well as pattern recognition molecules (e.g. Gram-negative binding proteins (GNBPs), beta-1,3 glucan recognition protein (BGRP) and C-type lectins) and serine proteases that may act as proximal activators of the prophenoloxidase or proteolytic cascades. Our transcriptomic analysis resulted in the identification of a large number of AMPs, among which are gloverin, attacin, cecropin, defensin, heliomicin and several lysozymes, as well as pattern recognition proteins such as BGRP and ESTs (Contig_1415+Contig_719) with homology to an inducible metalloproteinase inhibitor identified and described in G. mellonella. The extent of antimicrobial defense molecule complexity expressed in Har salivary glands was somewhat surprising, but is in line with what was found in another tissue of Lepidoptera exposed to the outside, i.e. pheromone glands of Heliothis virescens female moths.
We identified four different lysozymes expressed in the salivary glands of Har. Lysozymes are a very interesting group of immune-related proteins, as they have frequently been shown to have a dual function, being involved in both immune defense and digestion. One of the first lysozymes was identified in Galleria mellonella more than 40 years ago, representing the first antimicrobial protein reported from insects. In addition to antibacterial activity, G. mellonella lysozyme was also shown to exhibit antifungal activity in vitro, similar to that of human lysozyme against the pathogenic yeast Candida albicans. A phylogenetic analysis of the Har salivary gland predicted lysozyme protein sequences revealed that they cluster in two distinct clades. One of these clades contains the C (chicken) type lysozymes, which includes three of the four Har lysozymes identified here (Figure 2). These findings are consistent with the typical number of C-type lysozymes found in other Lepidoptera (e.g. three lysozymes identified in the genome of Bombyx). To further examine the relationships among lysozyme proteins identified in Har salivary glands and those found in other insects, C-type lysozyme sequences from several insect species were aligned and used to construct a gene phylogeny (Figure 2A). The phylogenetic analysis revealed that the three Har and other lepidopteran C-type lysozyme sequences clustered in two distinct clades. One of these clades, clearly separated with high support, contains two of the three Har sequences and lepidopteran lysozymes generally associated with immune system functions, while the other Har gland lysozyme clusters together with lysozymes identified in the gut of several Lepidoptera. 
This specific Har C-type lysozyme is 76% and 75% identical to the Antheraea mylitta and Manduca sexta homologues, 42% identical to the salivary lysozyme homolog from a Dipteran (Simulium nigrimanum), but displays only 35–39% identity to the immune-related C-type lysozymes from other Lepidoptera. The lysozyme sequence clustering displayed in the phylogeny can also be seen in the protein alignment, clearly separating two distinct groups of proteins (Figure 2B). In addition to the C-type, we identified one i-type-like lysozyme whose function remains to be elucidated. I-type lysozymes are invertebrate-specific and, although somewhat diverged in their activities, differ from other lysozymes in having 10–12 cysteine residues in the primary sequence. These cysteine residues are predicted to form five disulfide bonds, to which stability against heat denaturation and proteolytic degradation has been attributed, as i-type lysozymes can remain intact even after prolonged heating. The i-type lysozymes are typically coded for by single copy genes in Lepidoptera.
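Pairwise identity values such as those quoted above are computed from aligned sequences. The following is a minimal sketch of one common convention, in which identical residues are counted over the alignment columns where neither sequence has a gap. The source does not state which denominator was used, so this convention, the function name, and the toy sequences are all assumptions for illustration.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over columns where neither sequence has a gap.

    Assumes both strings come from the same alignment and therefore
    have equal length. Hypothetical helper, not the authors' method.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip columns containing a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy aligned fragments, invented for illustration; not lysozyme sequences.
toy_a = "MKT-LLV"
toy_b = "MKSALLV"
print(f"{percent_identity(toy_a, toy_b):.1f}% identity")
```

Other conventions divide by the full alignment length or by the length of the shorter sequence, which gives lower values when gaps are present, so the denominator should always be reported alongside the percentage.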
(A) An unrooted Bayesian inference tree constructed from the alignment of lysozyme amino acid sequences presented in (B). Bayesian posterior probabilities are shown for all major nodes supported with probability higher than 60%. (B) Amino acid alignment of the three predicted proteins from Helicoverpa (Har) together with predicted protein sequences deduced from publicly available insect sequence datasets. Amino acid sequence alignments were performed using MAFFT multiple alignment program. Identical residues are color-coded and marked with asterisks above the alignment. Species abbreviations: Galleria mellonella (Gme), Antheraea mylitta (Amy), Manduca sexta (Mse), Simulium nigrimanum (Sni), Simulium vittatum (Svi), Trichoplusia ni (Tni), Bombyx mori (Bmo), Spodoptera exigua (Sex), Heliothis virescens (Hvi), Helicoverpa zea (Hze), Drosophila melanogaster (Dme). GenBank accession numbers are given at the end of sequence names.
The Har salivary gland transcriptome is also very rich in genes coding for proteinase inhibitors, such as immune-related proteases involved in immune defense regulation, and several Kazal-type proteinase inhibitors (KPIs) such as dipetalogastin/brasiliensin-like inhibitors, several of which are amongst the most highly expressed genes in our library (Table S1). Proteinases and proteinase inhibitors are involved in several biological and physiological processes in all multicellular organisms and can act as modulators for controlling the extent of deleterious proteinase activity. The invertebrate KPIs, which function as anticoagulants in blood-sucking animals such as leeches, mosquitoes and ticks, are likely involved in protecting the host from microbial proteinases and have been shown to protect silk moth cocoons from predators and microbial destruction. The salivary gland transcriptome of Har comprises a number of serine proteinase inhibitors, and in addition we identified three genes encoding potential metalloprotease inhibitors (Har_GLN-C719; C7076; C1415). All three insect metalloproteinase inhibitors (IMPIs) share sequence similarity only with an IMPI isolated from immune-induced G. mellonella larvae. This IMPI represents the first and to date only peptide known from animals which is capable of inhibiting thermolysin-like microbial metalloproteinases, including a number of prominent members such as bacillolysin and vibriolysin which are produced by pathogenic bacteria to invade the tissues of their hosts. IMPI proteins have recently been found to encode two distinct inhibitors, where the N-terminal part contributes to innate immune responses by inhibiting microbial metalloproteases, whereas the C-terminal part has been implicated in the regulation of endogenous immunity and development-related matrix metalloproteinases. 
Two of the three Har IMPI cDNAs are truncated at the C-terminus but all three code for the complete N-terminal microbial metalloprotease inhibitor peptide, differing in several amino acid positions and thus pointing to the existence of a small IMPI gene family (Figure 3).
Multiple sequence alignment of the conserved N-terminal parts coding for the complete mature metalloprotease inhibitor peptides (IMPIs) of Helicoverpa armigera (Har), Galleria mellonella (Gme), Heliothis virescens (Hvi), Antheraea mylitta (Amy), and Samia cynthia (Scy). Identical residues are boxed with dark shading, and conserved residues are boxed with light shading. Conserved residues are marked with dots and identical residues in all IMPIs are marked with asterisks below the alignment.
Unknown transcripts with overlap to insect sialomes
A wide range of sequences identified in the Har salivary gland transcriptome display homology to predicted proteins that have been identified in salivary glands of aphids and mosquitoes but also in the venom glands of wasps. Among the overlapping cDNAs are sequences with homology to a 17 kDa salivary protein described in Phlebotomus, a putative 6.3 kDa salivary peptide from Anopheles funestus, a putative secreted salivary protein from a flea, Xenopsylla cheopis, several cDNAs with homology to an unknown salivary protein from a biting midge (Culicoides sonorensis), salivary cysteine-rich peptides of B. mori, a salivary/fat body serine carboxypeptidase identified in the wheat midge Sitodiplosis mosellana, several cDNAs with similarity to venom acid phosphatases and a gamma-glutamyl cyclotransferase-like venom protein isoform 2 of Nasonia vitripennis, and several cDNAs with homology to secreted salivary ribonucleases. In general we find a range of salivary-gland expressed genes which, based on their GO associations and predicted function, overlap with the venom gland transcriptome of wasps. These findings support the hypothesis that most insect sialomes share a core fraction of expressed genes related to potentially important functional categories such as oxidative stress response, immune defense, pre-digestion and/or tissue penetration, and proteins determining the viscosity of the saliva. A complete list of the contigs and singletons with their GO annotations and BLAST results can be found in Table S2.
Characterization of secreted labial salivary proteins
Obtaining enough labial saliva for a proteomic analysis is a challenging task. Since collection of labial saliva through the spinneret (the tube-like structure on the larval labium from which the silk is drawn) is time-consuming and impractical, we decided to extract the labial gland pairs and subject them to a centrifugal force such that, by compressing the organs towards the bottom of the tube, the supernatant obtained would be enriched in proteins from the gland lumen. Previously, secreted salivary proteins have been recovered in the supernatant using this approach. The protein complexity of the non-cytosolic enriched soluble fraction from Har labial salivary glands is mostly represented by at least 20 proteins in the acidic pI range with an apparent molecular mass ranging from 25 to 150 kDa (Figure 4). A total of 65 gel spots were subjected to peptide de novo sequencing since they were considered of sufficient abundance (intensity) for subsequent MS analysis. The sequenced peptides from these spots yielded best protein hits from NCBI Insecta using MS-BLAST (Table S3) and only 24 hits when searched against ButterflyBase (http://butterflybase.ice.mpg.de/) (Table 1). Signal peptide probability was obtained for Har ESTs (positive hits) obtained after performing an MS-BLAST search of the peptides against the Har salivary gland cDNA library translated into amino acid sequences. Indeed, the majority of the more intensely stained and larger protein spots detected by 2-DE were predicted to have a high probability of being secreted enzymes (Table S4). Less abundant proteins between 15 and 25 kDa across the pI range were also detected. The majority of the remaining inconspicuous spots correspond to non-secreted proteins, such as spots 23, 32 and 50, predicted to be involved in ubiquitin-mediated proteolysis, or glycolysis-related proteins (e.g. spots 47, 60), which indicates a degree of cytosolic protein contamination in the sample preparation. 
Similarly, the presence of the infection-inducible, hemolymph-clotting scolexin (spot 39) and arylphorin storage protein (spots 56, 57) may indicate a certain degree of contamination of our sample with hemolymph. However, salivary agglutinins may play a vital biological role by protecting the insect oral cavity from pathogens, as observed in the case of humans. The complete list of peptide sequences detected per protein spot and interpreted de novo from MS/MS spectra is available in Table S5.
Secretion-enriched proteins from H. armigera larval salivary (labial) glands were loaded on 24-cm pH 3 to 11 NL isoelectric focusing strips, separated in the second dimension by Tris–Tricine–SDS-polyacrylamide gel electrophoresis on a 15% gel and stained with colloidal Coomassie. Numbers designate the protein spots which were analyzed by mass spectrometry. Molecular mass standards are indicated in kDa (left) and the pI range at the top of the gel.
Identified secreted proteins
Pre-digestion. The extent of food digestion in the oral cavity of caterpillars prior to further processing and absorption in the gut is unknown. However, the feeding strategy of a phytophagous insect may indicate the importance of digestive enzymes as salivary components. Phytophagous piercing-sucking insects digesting the plant tissue in an extra-oral fashion may depend on a more complex battery of digestive enzymes, including those required for the digestion of the plant cell wall. Indeed, peptides corresponding to predicted pectinases, cellulases or amylases in the labial salivary proteome of Har caterpillars were not detected, despite the identification of amylase sequences in the salivary gland transcriptome. However, the proteomic analysis did predict the presence of digestive enzymes such as β-fructofuranosidase (spots 9, 11), fructose-bisphosphate aldolase (spot 58), glucose dehydrogenase (spot 4) and proteases (spots 37 and 41).
These results must be considered in light of the insect diet used. The commercial artificial diet (Bio-Serv) fed to the experimental group of insects is a sucrose, soy-wheat germ based diet without antibiotics. Therefore, whether the quality and quantity of the Har labial gland proteome vary when a given host plant is offered as food is still an open question. For now, the prediction of β-fructofuranosidase as a salivary secreted protein, specifically as the predicted product of GH32FruA-1 (EF600050), is consistent with the finding of this enzymatic activity in the labial glands of the related heliothine species H. zea and with β-fructofuranosidase BmSuc1 expression and product localization in the labial glands of B. mori. This protein has also been detected at relatively lower levels in the Har larval gut lumen. Since the sucrose-digesting activity of recombinant BmSUC1 is not inhibited by the alkaloidal sugar mimic glycosidase inhibitors found in mulberry leaves, it has been suggested that this enzyme is an adaptation which allows the silkworm to bypass the mulberry's defense system. In addition, transcript GH32FruA-1 is up-regulated in a tissue other than the gut upon exposure to gossypol concentrations detrimental to Har larval growth. All this information raises intriguing questions about the role of β-fructofuranosidases in insect host-plant adaptation and the importance of defining whether there is a main organ of production of this type of enzyme. Fructose-bisphosphate aldolase is an important enzyme in fructose metabolism found, although not exclusively, in human salivary glands, and glucose dehydrogenase, also a relevant enzyme in carbohydrate metabolism, has been reported to be a component of the green peach aphid secreted saliva.
The identification of glucose oxidase (GOX) as a component of the Har salivary proteome (spot 3) was consistent with previous reports revealing the ubiquity of this enzyme not only within the family Noctuidae but across Lepidoptera. GOX occurrence in Helicoverpa spp. labial glands has been correlated with the inhibition of plant defences (e.g. nicotine production in tobacco) and with bacterial protection. Moreover, the production of this enzymatic activity seems to be correlated with herbivore diet breadth. Thus, it has been suggested that GOX activity represents a potential mechanism contributing to host-range expansion in insect species. The apparent multifunctional nature of GOX calls for additional research, especially considering other components in caterpillar saliva, such as secreted antioxidant enzymes. We detected oxidase/peroxidase (spots 5, 6), superoxide dismutase (spot 53) and a putative secreted peroxiredoxin (spot 48), which potentially play a role in the removal of reactive oxygen species (ROS). Indeed, peroxidase activity has been found in the labial gland homogenate of a heliothine caterpillar species. It has been claimed that GOX, along with enzymes able to eliminate hydrogen peroxide (a product of GOX), constitutes an antioxidant system in insect physiology. Another relatively abundant oxidase type, ecdysone oxidase (spots 1, 2), was also detected in the Har salivary gland secreted sialome. Ecdysteroids are involved in controlling different aspects of insect physiology such as moulting, development and reproduction, and in turn, one important player in their metabolism is ecdysone oxidase. Further analysis of the function of this enzyme in labial saliva may indicate whether this protein is indeed involved in steroidal hormone metabolism.
Preventing plant defences from being triggered upon feeding may not be the only offense strategy of an insect herbivore; another is the detoxification of constitutive chemical defences in the host. A highly abundant protein corresponding to a carboxyl/cholinesterase (CCEO16d) was represented by spot 7 on the 2D gel. CCEO16d has been classified as an extracellular non-catalytic esterase and, through comparative genomics, it has been grouped among dipteran CCEs involved in insecticide resistance. Since there is evidence suggesting that esterases may not always hydrolyse their substrates, we speculate that labial salivary Har CCEO16d is an esterase involved in the modification and transport of host plant metabolites as a mechanism of insect defense.
Chymotrypsin inhibitor protein (spot 13) and brasiliensin (spot 27) were detected as elements of the secreted sialome in H. armigera. Brasiliensin is a multi-domain serine protease inhibitor similar to other anticoagulants of blood-sucking insects. Named after the hematophagous invertebrate Triatoma brasiliensis, its role in blood intake has recently been addressed, confirming its anticoagulant activity. Although anticoagulants have been mostly identified from blood-sucking invertebrates, other protease inhibitor-like proteins have been found in a seed-feeding hemipteran. The H. armigera brasiliensin-like protein raises the possibility of an additional insect response to a plant defense mechanism based on an increase in the viscosity of the diet, which interferes with insect digestion.
Arginine kinase and HaPUF-1.
These two proteins are particularly interesting since both have previously been detected in the H. armigera larval midgut lumen proteome. The role of each of these proteins in the insect midgut lumen is still unknown. Consistent with the midgut lumen 2D-gel protein separation results, arginine kinase, a human-allergenic enzyme, was also detected as two neighboring spots (35, 36) in our analysis. Recently, arginine kinase has been identified as a cytoplasmic protein whose transcript is relatively abundant in different tissues of the silkworm, including the labial glands. In addition, both arginine kinase transcript and protein are elevated in a silkworm strain resistant to nucleopolyhedrovirus in comparison to the susceptible one. A comparison of the intensity and magnitude of the arginine kinase and HaPUF-1 (H. armigera protein of unknown function 1 or B1NLD7) spots in the midgut and labial gland proteomes shows that HaPUF-1 (spot 12) is a very abundant protein in the secreted labial salivary preparation, while arginine kinase appears to occur at the same intensity in both the gut and the labial gland. Further studies are required to determine whether there is a major organ of production of each of these proteins.
The main objective of this study was to evaluate the labial sialome (transcriptome and proteome) complexity of a lepidopteran herbivore and to identify a list of candidate genes and proteins likely to be involved in Har digestion and plant defense response manipulation. Indeed, the results herein represent additional evidence that Har labial glands are not simply a silk-producing organ but an overlooked, important organ involved in insect immunity and digestion. In fact, a substantial number of the proteins found previously in the Har gut have been identified as soluble luminal salivary proteins in this study, posing interesting questions regarding the mechanisms of insect digestive physiology. Therefore, the recurrence of such proteins calls for a better understanding of their function. The insect mouth parts and oral cavity, as the first point of contact with the host, not only need to be protected from pathogens on the plant tissue but also require offense molecules to counteract the plant chemical defense and enzymes that allow the acquisition of energy. The products of some Har labial salivary transcripts, such as amylases, lipases and some immune-related proteins, were not found in our proteomic analysis. One possible explanation for this incongruence is that such proteins were not detectable in the soluble luminal protein preparation subjected to the proteomic analysis. AMPs, for example, are notoriously hard to detect in standard MS/MS analyses, mainly due to their small size, which makes their fragmentation and separation difficult, and other proteins such as lipases may need even more stringent conditions to become denatured and solubilized. Other possible explanations are that these proteins are of low abundance, or that feeding induction studies may be necessary to make such proteins detectable.
We have generated a comprehensive tissue-specific database as a resource for more in-depth analyses of the salivary gland reprogramming of Har upon stress, most notably by toxic plant secondary metabolites. Furthermore, these data can be used for comparative genomics studies to identify overlap and differences among phytophagous and hematophagous insects and, more specifically, among generalist and specialist lepidopteran herbivores.
Materials and Methods
Insects and diet
Har eggs were acquired in 2008 from Bayer CropScience AG (Monheim, Germany) and reared under laboratory conditions (26°C, 55% RH, 16:8 h L:D) in Jena, Germany since 2009, for about 10 generations prior to the start of this study. The artificial diet for larval rearing was purchased from BioServ (Cat. No. F9772, Frenchtown, NJ, USA).
Batches of second-day fifth-instar larvae were dissected longitudinally under ice-cold phosphate buffered saline (PBS) in order to retrieve with fine forceps the labial salivary gland apparatus (LG), which was collected in a 1.5 ml tube containing 100 µl PBS. After centrifugation (16,000 g, 20 min, 4°C), the supernatant enriched with LG lumen soluble proteins was collected in a new tube and stored at −20°C until sample preparation for 2-D electrophoresis. Samples were pooled into two independent biological replicates, each representing approximately 90 LG pairs, and protein concentration was determined using the Protein Dye reagent (BioRad) with bovine serum albumin (BSA) as standard.
Normalization and cDNA library construction
Har salivary (labial and mandibular) glands were isolated from 3rd to 5th instar larvae by microsurgery. Isolated glands were placed in pre-cooled 1.5 ml tubes with 1 ml TriZol, homogenized with a TissueLyser (Qiagen) and shock frozen in liquid nitrogen before RNA isolation. After RNA purification with TriZol, an additional DNAse (Turbo DNAse, Ambion) treatment was included prior to the second purification step to eliminate any contaminating DNA. The DNAse enzyme was removed and the RNA was further purified using the RNeasy MinElute Clean up Kit (Qiagen) following the manufacturer's protocol and eluted in 20 µl of RNA Storage Solution (Ambion). RNA integrity was verified on an Agilent 2100 Bioanalyzer using the RNA Nano chips (Agilent Technologies, Palo Alto, CA). RNA quantity was determined on a Nanodrop ND-1000 spectrophotometer. RNA extractions were generated from different pooled glands and four RNA extracts were subsequently pooled for cDNA generation.
For Har salivary gland tissue material, a full-length enriched, normalized cDNA library was generated using a combination of the SMART cDNA library construction kit (Clontech) and the Trimmer Direct cDNA normalization kit (Evrogen), generally following the manufacturer's protocol but with several important modifications, essentially as previously described. Each step of the normalization procedure was carefully monitored to avoid the generation of artefacts and overcycling. The resulting ds-cDNA pool was purified and concentrated using the DNA Clean and Concentrator kit (Zymogen) and size fractionated with SizeSep 400 spun columns (GE Healthcare) that resulted in a cut-off at ∼200 bp. The full-length-enriched cDNAs were cut with SfiI and ligated to pDNR-Lib plasmid (Clontech). Ligations were transformed into E. coli ELECTROMAX DH5α-E electro-competent cells (Invitrogen). Hemocyte and midgut Har cDNA libraries were used along with the Har salivary cDNA library to inspect GO enrichment among tissue-specific cDNA libraries.
Sequencing, Generation of EST Databases and Sequence Analysis
Plasmid minipreparation from bacterial colonies grown in 96 deep-well plates was performed using the 96-well robot plasmid isolation kit (NextTec) on a Tecan Evo Freedom 150 robotic platform (Tecan). Single-pass sequencing of the 5′ termini of cDNA libraries was carried out on an ABI 3730 xl automatic DNA sequencer (PE Applied Biosystems). Vector clipping, quality trimming and sequence assembly using stringent conditions (e.g. high quality sequence trimming parameters, 95% sequence identity cutoff, 25 bp overlap) were done with the Lasergene software package (DNAStar Inc.). To identify similarities with known proteins, the sequences of contigs and singletons were searched using the BLASTX algorithm against a local non-redundant protein database (NR, NCBI) with an E-value cut-off of 10^-4. To define the function of the contigs and singletons, we used the Gene Ontology (GO) controlled vocabulary, which provides annotations and allows a more global view of the dataset, using the Blast2GO software with a stringency cut-off of 10^-3. To minimize the number of classes with only a few gene objects, we set the minimum number of gene objects (cut-off level) in a class to 0.5% of the total number of sequences that could be classified. The SignalP algorithm was accessed online to predict the presence of signal peptides (SignalP 3.0 Server. [http://www.cbs.dtu.dk/services/SignalP]). The EST sequences were deposited into the NCBI dbEST database under accessions JK126269-JK145657.
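The 0.5% class cut-off can be illustrated with a minimal sketch. It assumes GO assignments are available as a mapping from sequence ID to a set of GO class names; this input format, the function name and the toy labels are hypothetical and do not reproduce the Blast2GO implementation.

```python
from collections import Counter

def filter_go_classes(assignments, min_fraction=0.005):
    """Keep GO classes that hold at least min_fraction of the classified sequences.

    assignments: dict mapping sequence ID -> set of GO class names
    (a hypothetical input format; real Blast2GO exports differ).
    """
    # Only sequences with at least one GO class count as "classified".
    classified = {seq: terms for seq, terms in assignments.items() if terms}
    total = len(classified)
    # One gene object may fall into several classes, so count per class.
    counts = Counter(term for terms in classified.values() for term in terms)
    threshold = min_fraction * total
    return {term: n for term, n in counts.items() if n >= threshold}
```

With 1,000 classified sequences the threshold is 5 gene objects, so a class holding 10 sequences is kept while a class holding 2 is dropped.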
Nucleotide sequences were analyzed in more detail using the commercial Lasergene Software package and the freeware BioEdit program. Genes were aligned by their amino acid sequences using the ClustalX2 function or the MAFFT (http://mafft.cbrc.jp/alignment/server/index.html) program. If necessary, alignments were then corrected by eye and reverted back to the nucleotide sequences for the phylogenetic analyses and in order to remove redundant contigs. Conserved residues in the alignments were highlighted with BOXSHADE 3.21 (http://www.ch.embnet.org/software/BOX_form.html) or in ClustalX2. The phylogenetic reconstruction implemented for the analysis of several proteins was performed using two different methods, Maximum-Likelihood analysis using PhyML and Bayesian inference using MrBayes, both implemented in the Phylogeny.fr webserver (http://www.phylogeny.fr/version2_cgi/alacarte.cgi). The Maximum-Likelihood and the Bayesian tree topologies, including their general subfamily relationships and node supports, were in agreement. The gene trees were visualized and optimized with the TreeDyn tool, also implemented on the Phylogeny.fr webserver.
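Reverting a protein alignment back to nucleotide sequences amounts to threading the gaps of each aligned protein onto its coding sequence. The sketch below is only an illustration of that idea under the assumption that the coding sequence starts in frame with the first residue; the function name is hypothetical and this is not the procedure implemented in Lasergene or BioEdit.

```python
def thread_codons(aligned_protein, cds, gap="-"):
    """Map a gapped protein sequence back onto its coding sequence.

    Each residue consumes one codon (3 nt) of cds; each alignment gap
    becomes three gap characters, so columns stay in register.
    """
    n_residues = sum(1 for aa in aligned_protein if aa != gap)
    if len(cds) < 3 * n_residues:
        raise ValueError("coding sequence is shorter than the protein requires")
    codons = []
    pos = 0
    for aa in aligned_protein:
        if aa == gap:
            codons.append(gap * 3)
        else:
            codons.append(cds[pos:pos + 3])
            pos += 3
    return "".join(codons)
```

For example, the aligned protein `M-K` with coding sequence `ATGAAA` yields `ATG---AAA`.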
Separation of Proteins by Two-Dimensional Gel Electrophoresis
The protocol used to separate the enriched LG lumen protein samples by 2-D PAGE has been described previously, with the only modification being that the gels were stained with colloidal Coomassie working solution prepared following the protocol described elsewhere (http://www1.em.mpg.de/proteomics/).
Protein Spot Picking and Processing
The protein spots were manually picked and processed as described earlier  with the following modifications: trypsin digestion was carried out overnight with 70 ng of porcine trypsin (Promega) in 10 µL of 50 mM ammonium bicarbonate at 37°C. The digest was centrifuged down in MTPs and 50 µL of extraction solution (50% acetonitrile, 0.1% TFA) were added twice for 20 min extraction, and the solution was transferred to the plate. The extracted peptide mixtures were then vacuum-dried for approx. 45 min at 45°C.
Mass spectrometry (MS).
The tryptic peptides were reconstituted in 6 µL aqueous 0.1% formic acid (FA). The selected volume of samples (ca 4.5 µL) was injected on a nanoAcquity nanoUPLC system (Waters, Milford, MA, USA). Mobile phase A (0.1% aqueous formic acid, 15 µL/min for 1 min) was used to concentrate and desalt the samples on a 20×0.180 mm Symmetry C18, 5 µm particle precolumn. The samples were then eluted on a 100 mm×75 µm ID, 1.7 µm BEH nanoAcquity C18 column (Waters). Phases A and B (100% MeCN in 0.1% FA) were linearly mixed in a gradient to 5% phase B in 0.33 min, increased to 40% B in 10 min, and finally increased to 85% B in 10.5 min, holding at 85% B to 11 min and decreasing to 1% B in 11.1 min of the run. The eluted peptides were transferred to the nano electrospray source of a Synapt HDMS tandem mass spectrometer (Waters) equipped with metal coated nanoelectrospray tips (Picotip, 50×0.36 mm, 10 µm I.D., New Objective, Woburn, MA, USA). The source temperature was set to 80°C, cone gas flow 20 L/h, and the nanoelectrospray voltage was 3.2 kV. The TOF analyzer was used in reflectron mode. The MS/MS spectra were collected at 1 s intervals (50–1700 m/z). A 650 fmol/µL solution of human Glu-Fibrinopeptide B in 0.1% formic acid/acetonitrile (1:1 v/v) was infused at a flow rate of 0.5 µL/min through the reference NanoLockSpray source every 30th scan, compensating for mass shifts in the MS and MS/MS fragmentation mode.
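The gradient program can be written as a piecewise-linear function of run time. The sketch below takes the breakpoints from the description above; the composition at time zero is not stated in the text, so it is passed in explicitly as an assumption by the caller, and the function name is hypothetical.

```python
from bisect import bisect_right

# (time in min, % phase B) breakpoints taken from the gradient description.
GRADIENT = [(0.33, 5.0), (10.0, 40.0), (10.5, 85.0), (11.0, 85.0), (11.1, 1.0)]

def percent_b(t, initial_b, gradient=GRADIENT):
    """Linearly interpolate % phase B at run time t (min).

    initial_b is the composition at t = 0. It is NOT given in the text,
    so any value passed here is an assumption made by the caller.
    """
    points = [(0.0, initial_b)] + list(gradient)
    if t <= 0.0:
        return initial_b
    if t >= points[-1][0]:
        return points[-1][1]
    times = [p[0] for p in points]
    i = bisect_right(times, t)  # index of the first breakpoint after t
    (t0, b0), (t1, b1) = points[i - 1], points[i]
    return b0 + (b1 - b0) * (t - t0) / (t1 - t0)
```

Calling `percent_b(10.75, initial_b=1.0)` falls on the hold segment and returns 85.0; the starting value of 1% B used here is only a guess based on the final composition of the run.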
The data were collected by MassLynx v4.1 software. ProteinLynx Global Server Browser v.2.3 software (both Waters) was used for baseline subtraction and smoothing, deisotoping, and de novo peptide sequence identification. The de novo sequence characterization from collision-induced dissociation (CID) MS/MS fragment spectra used a precursor peptide mass tolerance of 0.03 Da, 1 possible missed cleavage, carbamidomethylation of cysteines, possible oxidation of methionines, and possible deamidation of asparagines and glutamines. Signal peptide prediction probabilities were obtained using SignalP 3.0.
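To make the search parameters concrete, the following sketch computes a neutral monoisotopic peptide mass with the listed modifications and checks it against the 0.03 Da precursor tolerance. The residue and modification masses are standard monoisotopic values; the function names are hypothetical and the code is not the ProteinLynx Global Server algorithm.

```python
# Standard monoisotopic residue masses (Da).
RESIDUE = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056
# Mass shifts (Da) of the modifications named in the text.
MODS = {"carbamidomethyl": 57.02146, "oxidation": 15.99491, "deamidation": 0.98402}

def peptide_mass(seq, oxidized_met=0, deamidated=0):
    """Neutral monoisotopic mass; every Cys is treated as carbamidomethylated."""
    mass = sum(RESIDUE[aa] for aa in seq) + WATER
    mass += seq.count("C") * MODS["carbamidomethyl"]
    mass += oxidized_met * MODS["oxidation"]
    mass += deamidated * MODS["deamidation"]
    return mass

def within_tolerance(observed, theoretical, tol=0.03):
    """True if the observed mass deviates by at most tol Da."""
    return abs(observed - theoretical) <= tol
```

Note that leucine and isoleucine share the same mass in this table, which is why de novo sequencing cannot tell them apart.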
The procedure and its merits have been described by others. In brief, sequences with ladder scores (percentage of expected y- and b-ions) exceeding 40% were used in a homology-based search strategy using the MS BLAST program. MS BLAST utilizes possibly redundant short peptide sequences for similarity searches in protein databases from organisms phylogenetically distant from the study species. All candidate sequences from a given spot exceeding the threshold, even different sequences from the same peptide, are concatenated into a single query, separated by dashes, in an arbitrary order. The WU-BLAST2 BLASTP search engine (http://blast.wustl.edu) scores only the most significant match in the case of several peptide candidates covering the same region in the target sequence. In addition, the PAM30MS matrix, which accounts for the inability to distinguish I and L residues and allows for unknown residues X, is used in the blastp similarity search. This enables identification of homologous proteins in other species with many amino acid substitutions, under conditions where spectral searches are not possible due to lack of sequences for the given organism. The significance of such matches is scored against precomputed threshold scores conditional on the number of query peptides and the E-values of the individual HSP (high-scoring segment pair) hits. Computational studies have estimated a false positive rate of <3%. The searches were performed on an MS BLAST server installed in-house for searching the EBI_100-nr database and on a locally generated EST database from the Har salivary gland cDNA library, or on the ButterflyBase web page (http://butterflybase.org/) for searching the ButterflyBase EST database from Lepidoptera, exclusive of B. mori (34 882 protein sequences).
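The query assembly step can be sketched in a few lines. The input format (sequence paired with a ladder score in percent), the function names and the example peptides are hypothetical; the sketch only mirrors the filtering and dash-joining described above and does not reproduce the MS BLAST server itself.

```python
def build_msblast_query(candidates, min_ladder=40.0):
    """Join the de novo peptide candidates of one spot into a single query.

    candidates: iterable of (sequence, ladder_score_percent) tuples
    (a hypothetical input format, not the ProteinLynx export).
    Redundant candidates are deliberately kept, as described in the text.
    """
    kept = [seq for seq, score in candidates if score > min_ladder]
    return "-".join(kept)

def il_equivalent(a, b):
    """True if two peptides differ only by I/L, which MS/MS cannot resolve."""
    return a.replace("I", "L") == b.replace("I", "L")
```

For instance, `build_msblast_query([("LSDEVNK", 62.0), ("ISDEVNK", 55.0), ("AAK", 30.0)])` returns `"LSDEVNK-ISDEVNK"`; the two retained candidates are I/L variants of the same made-up peptide, which a matrix such as PAM30MS treats as equivalent.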
Gene ontology (GO) assignments for the Helicoverpa sialotranscriptome. GO assignments as predicted for their involvement in (A) biological processes and (B) molecular functions. Data for biological processes are presented at level 2 GO categorization while data for molecular functions are presented at level 3 GO categorization. Classified gene objects are depicted as percentages of the total number of gene objects with GO assignments.
Comparison of GO category representations between Helicoverpa armigera salivary gland, gut and hemocyte transcriptome data. Each transcript was assigned applicable high-level generic GO terms. Data are presented for Molecular Function GO-level 3. Obtained GO data for gut (GN) and hemocyte (HCN) tissues were multiplied by the factor depicted next to the abbreviations in order to correct for different numbers of total contigs obtained. Note that one gene object can be classified into more than 1 class, therefore the total number of gene objects classified for both species is not identical to the number of contigs with GO associations.
Most highly expressed ESTs in the salivary gland library.
Complete annotation file of the assembled Helicoverpa salivary gland ESTs. Contig IDs, sequence length, Helicoverpa contig sequences, top BLAST hits (if any) in the NCBI nr database for each unique contig, including accession number, E-value and percentage similarity, EC numbers, GO annotations and InterPro scans are listed.
Results of MS BLAST searches using de novo peptide sequences against the NCBI_insecta Database. (a) GenBank accession number and description of best hit protein in NCBI_insecta by MS BLAST. (b) Species of best hit in NCBI_insecta. (c) Predicted molecular weight of best hit (kDa). (d) Number of peptides matching best hit in the MS BLAST search. (e) MS BLAST scoring (see Materials and Methods).
Results of MS BLAST searches using de novo peptide sequences against H. armigera ESTs and salivary gland cDNA library sequences and BLASTP searches using H. armigera protein predicted from cDNA against UniRef100. (a) Number of peptides matching the target in the MS BLAST search. (b) Number of amino acids of predicted H. armigera protein. (c) Predicted molecular weight of H. armigera protein (kDa). (d) Predicted pI of H. armigera protein. (e) Result of blastp search using H. armigera predicted protein against UniRef100. (f) UniRef100 accession number. (g) Species of best hit in UniRef100. (h) E-value of best hit in blastp search against UniRef100. (i) Signal peptide probability of H. armigera predicted protein.
Identification obtained with the de novo sequenced peptides using MS BLAST. Peptide sequences interpreted de novo from MS/MS spectra were used to query NCBI_insecta, ButterflyBase, or the H. armigera EST database using the MS BLAST search engine.
We thank Sebastian Schöne and Markus Garlipp for the nicely coordinated effort to maintain the H. armigera Bayer colony in the laboratory.
Conceived and designed the experiments: HV ROM JC MdlPC-M. Performed the experiments: HV ROM AM JC MdlPC-M. Analyzed the data: HV AM JC MdlPC-M. Contributed reagents/materials/analysis tools: HV DGH ROM AM JC MdlPC-M. Wrote the paper: HV MdlPC-M. Proof-reading of manuscript: ROM JC AM.
- 1. Nakagawa H, Hama Y, Sumi T, Li SC, Maskos K, et al. (2007) Occurrence of a nonsulfated chondroitin proteoglycan in the dried saliva of Collocalia swiftlets (edible bird's-nest). Glycobiology 17: 157–164.
- 2. Alves-Silva J, Ribeiro JMC, Van Den Abbeele J, Attardo G, Hao ZR, et al. (2010) An insight into the sialome of Glossina morsitans morsitans. BMC Genomics 11:
- 3. Huq NL, Cross KJ, Ung M, Myroforidis H, Veith PD, et al. (2007) A review of the salivary proteome and peptidome and saliva-derived peptide therapeutics. International Journal of Peptide Research and Therapeutics 13: 547–564.
- 4. Valenzuela JG (2002) High-throughput approaches to study salivary proteins and genes from vectors of disease. Insect Biochemistry and Molecular Biology 32: 1199–1209.
- 5. Assumpcao TCF, Charneau S, Santiago PBM, Francischetti IMB, Meng ZJ, et al. (2011) Insight into the Salivary Transcriptome and Proteome of Dipetalogaster maxima. Journal of Proteome Research 10: 669–679.
- 6. Cooper WR, Dillwith JW, Puterka GJ (2010) Salivary Proteins of Russian Wheat Aphid (Hemiptera: Aphididae). Environmental Entomology 39: 223–231.
- 7. Carolan JC, Fitzroy CIJ, Ashton PD, Douglas AE, Wilkinson TL (2009) The secreted salivary proteome of the pea aphid Acyrthosiphon pisum characterised by mass spectrometry. Proteomics 9: 2457–2467.
- 8. Harmel N, Letocart E, Cherqui A, Giordanengo P, Mazzucchelli G, et al. (2008) Identification of aphid salivary proteins: a proteomic investigation of Myzus persicae. Insect Molecular Biology 17: 165–174.
- 9. De Vos M, Van Oosten VR, Van Poecke RMP, Van Pelt JA, Pozo MJ, et al. (2005) Signal signature and transcriptome changes of Arabidopsis during pathogen and insect attack. Molecular Plant-Microbe Interactions 18: 923–937.
- 10. Rodriguez-Saona CR, Musser RO, Vogel H, Hum-Musser SM, Thaler JS (2010) Molecular, Biochemical, and Organismal Analyses of Tomato Plants Simultaneously Attacked by Herbivores from Two Feeding Guilds. Journal of Chemical Ecology 36: 1043–1057.
- 11. Musser RO, Cipollini DF, Hum-Musser SM, Williams SA, Brown JK, et al. (2005) Evidence that the caterpillar salivary enzyme glucose oxidase provides herbivore offense in Solanaceous plants. Archives of Insect Biochemistry and Physiology 58: 128–137.
- 12. Musser RO, Kwon HS, Williams SA, White CJ, Romano MA, et al. (2005) Evidence that caterpillar labial saliva suppresses infectivity of potential bacterial pathogens. Archives of Insect Biochemistry and Physiology 58: 138–144.
- 13. Liu F, Cui LW, Cox-Foster D, Felton GW (2004) Characterization of a salivary lysozyme in larval Helicoverpa zea. Journal of Chemical Ecology 30: 2439–2457.
- 14. Felton GW, Tumlinson JH (2008) Plant-insect dialogs: complex interactions at the plant-insect interface. Current Opinion in Plant Biology 11: 457–463.
- 15. Cho S, Mitchell A, Mitter C, Regier J, Matthews M, et al. (2008) Molecular phylogenetics of heliothine moths (Lepidoptera: Noctuidae: Heliothinae), with comments on the evolution of host range and pest status. Systematic Entomology 33: 581–594.
- 16. Pauchet Y, Muck A, Svatos A, Heckel DG, Preiss S (2008) Mapping the larval midgut lumen proteome of Helicoverpa armigera, a generalist herbivorous insect. Journal of Proteome Research 7: 1629–1639.
- 17. Sutherland TD, Young JH, Weisman S, Hayashi CY, Merritt DJ (2010) Insect Silk: One Name, Many Materials. Annual Review of Entomology 55: 171–188.
- 18. Akai H, Hakim RS, Kristensen NP (2003) Labial glands, silk and saliva. Handbuch der Zoologie (Berlin) 4: 377–388.
- 19. Mondal M, Trivedy K, Kumar SN (2007) The silk proteins, sericin and fibroin in silkworm, Bombyx mori Linn., - a review. Caspian J Env Sci 5: 63–76.
- 20. Francischetti IMB, Lopes AH, Dias FA, Pham VM, Ribeiro JMC (2007) An insight into the sialotranscriptome of the seed-feeding bug, Oncopeltus fasciatus. Insect Biochemistry and Molecular Biology 37: 903–910.
- 21. Calvo E, Pham VM, Ribeiro JMC (2008) An insight into the sialotranscriptome of the non-blood feeding Toxorhynchites amboinensis mosquito. Insect Biochemistry and Molecular Biology 38: 499–507.
- 22. Vincent B, Kaeslin M, Roth T, Heller M, Poulain J, et al. (2010) The venom composition of the parasitic wasp Chelonus inanitus resolved by combined expressed sequence tags analysis and proteomic approach. BMC Genomics 11: 15.
- 23. Canavoso LE, Jouni ZE, Karnas KJ, Pennington JE, Wells MA (2001) Fat metabolism in insects. Annual Review of Nutrition 21: 23–46.
- 24. Arrese EL, Wells MA (1994) Purification and properties of a phosphorylatable triacylglycerol lipase from the fat-body of an insect, Manduca sexta. Journal of Lipid Research 35: 1652–1660.
- 25. Ponnuvel KM, Nakazawa H, Furukawa S, Asaoka A, Ishibashi J, et al. (2003) A lipase isolated from the silkworm Bombyx mori shows antiviral activity against nucleopolyhedrovirus. Journal of Virology 77: 10725–10729.
- 26. Tunaz H, Stanley DW (2004) Phospholipase A(2) in salivary glands isolated from tobacco hornworms, Manduca sexta. Comparative Biochemistry and Physiology B-Biochemistry & Molecular Biology 139: 27–33.
- 27. Dennis EA (1994) Diversity of group types, regulation and function of phospholipase A(2). Journal of Biological Chemistry 269: 13057–13060.
- 28. Maczkowiak F, Da Lage JL (2006) Origin and evolution of the Amyrel gene in the alpha-amylase multigene family of Diptera. Genetica 128: 145–158.
- 29. Pauchet Y, Freitak D, Heidel-Fischer HM, Heckel DG, Vogel H (2009) Glucanase activity in a glucan-binding protein family from Lepidoptera. Journal of Biological Chemistry 284: 2214–2224.
- 30. Musser RO, Hum-Musser SM, Slaten-Bickford SE, Felton GW, Gergerich RC (2002) Evidence that ribonuclease activity present in beetle regurgitant is found to stimulate virus resistance in plants. Journal of Chemical Ecology 28: 1691–1696.
- 31. Cavener DR (1992) GMC oxidoreductases - A newly defined family of homologous proteins with diverse catalytic activities. Journal of Molecular Biology 223: 811–814.
- 32. Zamocky M, Hallberg M, Ludwig R, Divne C, Haltrich D (2004) Ancestral gene fusion in cellobiose dehydrogenases reflects a specific evolution of GMC oxidoreductases in fungi. Gene 338: 1–14.
- 33. Kirsch R, Vogel H, Muck A, Reichwald K, Pasteels JM, et al. (2011) Host plant shifts affect a major defense enzyme in Chrysomela lapponica. Proceedings of the National Academy of Sciences of the United States of America 108: 4897–4901.
- 34. Martins RM, Sforca ML, Amino R, Juliano MA, Oyama S, et al. (2006) Lytic activity and structural differences of amphipathic peptides derived from trialysin. Biochemistry 45: 1765–1774.
- 35. Amino R, Martins RM, Procopio J, Hirata IY, Juliano MA, et al. (2002) Trialysin, a novel pore-forming protein from saliva of hematophagous insects activated by limited proteolysis. Journal of Biological Chemistry 277: 6207–6213.
- 36. Ribeiro JMC, Mans BJ, Arca B (2010) An insight into the sialome of blood-feeding Nematocera. Insect Biochemistry and Molecular Biology 40: 767–784.
- 37. Wedde M, Weise C, Kopacek P, Franke P, Vilcinskas A (1998) Purification and characterization of an inducible metalloprotease inhibitor from the hemolymph of greater wax moth larvae, Galleria mellonella. European Journal of Biochemistry 255: 535–543.
- 38. Erban T, Hubert J (2008) Digestive function of lysozyme in synanthropic acaridid mites enables utilization of bacteria as a food source. Experimental and Applied Acarology 44: 199–212.
- 39. Lemos FJA, Terra WR (1991) Digestion of bacteria and the role of midgut lysozyme in some insect larvae. Comparative Biochemistry and Physiology B-Biochemistry & Molecular Biology 100: 265–268.
- 40. Powning RF, Davidson WJ (1973) Studies on insect bacteriolytic enzymes. 1. Lysozyme in hemolymph of Galleria mellonella and Bombyx mori. Comparative Biochemistry and Physiology 45: 669.
- 41. Bergin D, Murphy L, Keenan J, Clynes M, Kavanagh K (2006) Pre-exposure to yeast protects larvae of Galleria mellonella from a subsequent lethal infection by Candida albicans and is mediated by the increased expression of antimicrobial peptides. Microbes and Infection 8: 2105–2112.
- 42. Samaranayake YH, Samaranayake LP, Wu PC, So M (1997) The antifungal effect of lactoferrin and lysozyme on Candida krusei and Candida albicans. APMIS 105: 875–883.
- 43. Callewaert L, Michiels CW (2010) Lysozymes in the animal kingdom. Journal of Biosciences 35: 127–160.
- 44. Cong LN, Yang XJ, Wang XX, Tada M, Lu ML, et al. (2009) Characterization of an i-type lysozyme gene from the sea cucumber Stichopus japonicus, and enzymatic and nonenzymatic antimicrobial activities of its recombinant protein. Journal of Bioscience and Bioengineering 107: 583–588.
- 45. Vogel H, Altincicek B, Glockner G, Vilcinskas A (2011) A comprehensive transcriptome and immune-gene repertoire of the lepidopteran model host Galleria mellonella. BMC Genomics 12:
- 46. Rimphanitchayakit V, Tassanakajon A (2010) Structure and function of invertebrate Kazal-type serine proteinase inhibitors. Developmental and Comparative Immunology 34: 377–386.
- 47. Altincicek B, Linder M, Linder D, Preissner KT, Vilcinskas A (2007) Microbial metalloproteinases mediate sensing of invading pathogens and activate innate immune responses in the lepidopteran model host Galleria mellonella. Infection and Immunity 75: 175–183.
- 48. Wedde M, Weise C, Nuck R, Altincicek B, Vilcinskas A (2007) The insect metalloproteinase inhibitor gene of the lepidopteran Galleria mellonella encodes two distinct inhibitors. Biological Chemistry 388: 119–127.
- 49. Celorio-Mancera MD, Greve LC, Teuber LR, Labavitch JM (2009) Identification of endo- and exo-polygalacturonase activity in Lygus hesperus (Knight) salivary glands. Archives of Insect Biochemistry and Physiology 70: 122–135.
- 50. Prakobphol A, Xu F, Hoang VM, Larsson T, Bergstrom J, et al. (2000) Salivary agglutinin, which binds Streptococcus mutans and Helicobacter pylori, is the lung scavenger receptor cysteine-rich protein gp-340. Journal of Biological Chemistry 275: 39860–39866.
- 51. Burton RL, Starks KJ, Sauer JR (1976) Beta-fructosidase activity in silk glands of Heliothis zea. Journal of Insect Physiology 22: 1045–1048.
- 52. Daimon T, Taguchi T, Meng Y, Katsuma S, Mita K, et al. (2008) beta-fructofuranosidase genes of the silkworm, Bombyx mori - Insights into enzymatic adaptation of B. mori to toxic alkaloids in mulberry latex. Journal of Biological Chemistry 283: 15271–15279.
- 53. Celorio-Mancera MP, Ahn SJ, Vogel H, Heckel DG (2011) Transcript profiling of the cotton bollworm, Helicoverpa armigera, response to gossypol: an insight on hormesis and metabolism. BMC Genomics. In press.
- 54. Loo JA, Yan W, Ramachandran P, Wong DT (2010) Comparative Human Salivary and Plasma Proteomes. Journal of Dental Research 89: 1016–1023.
- 55. Eichenseer H, Mathews MC, Bi JL, Murphy JB, Felton GW (1999) Salivary glucose oxidase: Multifunctional roles for Helicoverpa zea? Archives of Insect Biochemistry and Physiology 42: 99–109.
- 56. Zong N, Wang CZ (2004) Induction of nicotine in tobacco by herbivory and its relation to glucose oxidase activity in the labial gland of three noctuid caterpillars. Chinese Science Bulletin 49: 1596–1601.
- 57. Eichenseer H, Mathews MC, Powell JS, Felton GW (2010) Survey of a Salivary Effector in Caterpillars: Glucose Oxidase Variation and Correlation with Host Range. Journal of Chemical Ecology 36: 885–897.
- 58. Mathews MC, Summers CB, Felton GW (1997) Ascorbate peroxidase: A novel antioxidant enzyme in insects. Archives of Insect Biochemistry and Physiology 34: 57–68.
- 59. Takeuchi H, Rigden DJ, Ebrahimi B, Turner PC, Rees HH (2005) Regulation of ecdysteroid signalling during Drosophila development: identification, characterization and modelling of ecdysone oxidase, an enzyme involved in control of ligand concentration. Biochemical Journal 389: 637–645.
- 60. Teese MG, Campbell PM, Scott C, Gordon KHJ, Southon A, et al. (2010) Gene identification and proteomic analysis of the esterases of the cotton bollworm, Helicoverpa armigera. Insect Biochemistry and Molecular Biology 40: 1–16.
- 61. Araujo RN, Campos ITN, Tanaka AS, Santos A, Gontijo NF, et al. (2007) Brasiliensin: A novel intestinal thrombin inhibitor from Triatoma brasiliensis (Hemiptera: Reduviidae) with an important role in blood intake. International Journal for Parasitology 37: 1351–1358.
- 62. Kang LQ, Shi HF, Liu XY, Zhang CY, Yao Q, et al. (2011) Arginine kinase is highly expressed in a resistant strain of silkworm (Bombyx mori, Lepidoptera): Implication of its role in resistance to Bombyx mori nucleopolyhedrovirus. Comparative Biochemistry and Physiology B-Biochemistry & Molecular Biology 158: 230–234.
- 63. Vogel H, Heidel AJ, Heckel DG, Groot AT (2010) Transcriptome analysis of the sex pheromone gland of the noctuid moth Heliothis virescens. BMC Genomics 11:
- 64. Altschul SF, Madden TL, Schaffer AA, Zhang JH, Zhang Z, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research 25: 3389–3402.
- 65. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, et al. (2000) Gene Ontology: tool for the unification of biology. Nature Genetics 25: 25–29.
- 66. Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, et al. (2007) Clustal W and clustal X version 2.0. Bioinformatics 23: 2947–2948.
- 67. Neuhoff V, Arold N, Taube D, Ehrhardt W (1988) Improved staining of proteins in polyacrylamide gels including isoelectric-focusing gels with clear background at nanogram sensitivity using Coomassie brilliant blue G-250 and R-250. Electrophoresis 9: 255–262.
- 68. Bendtsen JD, Nielsen H, von Heijne G, Brunak S (2004) Improved prediction of signal peptides: SignalP 3.0. Journal of Molecular Biology 340: 783–795.
- 69. Shevchenko A, Sunyaev S, Loboda A, Shevchenko A, Bork P, et al. (2001) Charting the proteomes of organisms with unsequenced genomes by MALDI-quadrupole time of flight mass spectrometry and BLAST homology searching. Analytical Chemistry 73: 1917–1926.
- 70. Habermann B, Oegema J, Sunyaev S, Shevchenko A (2004) The power and the limitations of cross-species protein identification by mass spectrometry-driven sequence similarity searches. Molecular & Cellular Proteomics 3: 238–249.