The Asian longhorned beetle (Anoplophora glabripennis) is an invasive, wood-boring pest that thrives in the heartwood of deciduous tree species. A large impediment faced by A. glabripennis as it feeds on woody tissue is lignin, a highly recalcitrant biopolymer that reduces access to sugars and other nutrients locked in cellulose and hemicellulose. We previously demonstrated that lignin, cellulose, and hemicellulose are actively deconstructed in the beetle gut and that the gut harbors an assemblage of microbes hypothesized to make significant contributions to these processes. While lignin degrading mechanisms have been well characterized in pure cultures of white rot basidiomycetes, little is known about such processes in microbial communities associated with wood-feeding insects. The goals of this study were to develop a taxonomic and functional profile of a gut community derived from an invasive population of larval A. glabripennis collected from infested host trees and to identify genes that could be relevant for the digestion of woody tissue and nutrient acquisition. To accomplish this goal, we taxonomically and functionally characterized the A. glabripennis midgut microbiota through amplicon and shotgun metagenome sequencing and conducted a large-scale comparison with the metagenomes from a variety of other herbivore-associated communities. This analysis distinguished the A. glabripennis larval gut metagenome from the gut communities of other herbivores, including previously sequenced termite hindgut metagenomes. Genes encoding enzymes were identified in the A. glabripennis gut metagenome that could have key roles in woody tissue digestion including candidate lignin degrading genes (laccases, dye-decolorizing peroxidases, novel peroxidases and β-etherases), 36 families of glycoside hydrolases (such as cellulases and xylanases), and genes that could facilitate nutrient recovery, essential nutrient synthesis, and detoxification. This community could serve as a reservoir of novel enzymes to enhance industrial cellulosic biofuels production or targets for novel control methods for this invasive and highly destructive insect.
Citation: Scully ED, Geib SM, Hoover K, Tien M, Tringe SG, Barry KW, et al. (2013) Metagenomic Profiling Reveals Lignocellulose Degrading System in a Microbial Community Associated with a Wood-Feeding Beetle. PLoS ONE 8(9): e73827. https://doi.org/10.1371/journal.pone.0073827
Editor: Pedro Lagerblad Oliveira, Universidade Federal do Rio de Janeiro, Brazil
Received: March 29, 2013; Accepted: July 25, 2013; Published: September 4, 2013
This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Funding: Funding for this project was provided by USDA-NRI-CRSEES grant 2008-35504-04464, USDA-NRI-CREES grant 2009-35302-05286, the Alphawood Foundation, a Seed Grant to KH from the Pennsylvania State University College of Agricultural Sciences, and a Microbial Genomics Fellowship from USDA-AFRI to EDS and JRH. JEC was partially supported by World Class University Project R31-2009-000-20025-0 grant from the Ministry of Education, Science and Technology of South Korea. Opinions, findings, conclusions or recommendations expressed in this publication are those of the authors and do not necessarily reflect the views of the USDA. USDA is an equal opportunity provider and employer. The work conducted by the U.S. Department of Energy Joint Genome Institute is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231. The funders had no role in study design, data collection, and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Cellulose and hemicellulose represent some of the most abundant, renewable carbohydrate resources on the planet, comprising the largest natural source of fermentable sugars, which could be utilized for ethanolic biofuel production . Despite the abundance of these polysaccharides, a major impediment to accessing fermentable sugars from these carbohydrates for large-scale industrial ethanol production is the presence of lignin , a stereotypically irregular, aromatic biopolymer comprised of phenylpropanoid aryl alcohol subunits and articulated by over 12 types of chemical bonds . Highly resilient β-aryl ether and carbon-carbon bonds constitute the majority of the linkages in hardwood lignin, which are resistant to hydrolysis and difficult to disrupt. However, wood-feeding insects, in collaboration with their gut microbial communities, have the capacity to produce enzymes that facilitate the degradation of lignocellulosic material [4,5]. Accordingly, these microbial communities constitute unique ecosystems that may serve as reservoirs of novel proteins and enzymes that could be exploited to enhance the efficiency of industrial biomass pre-treatment processes, decoupling lignin from wood polysaccharides and facilitating access to fermentable sugars in cellulose and hemicellulose. Of recent interest is the gut community of Anoplophora glabripennis [Order Coleoptera; Family Cerambycidae], an invasive, xylophagous beetle that colonizes and feeds in a broad range of apparently healthy tree species, including several genera commonly planted as short rotation biofeedstocks (e.g., Populus and Salix) [6,7]. A large community of microbes capable of producing cellulolytic and hemicellulolytic enzymes in the A. glabripennis midgut was previously described [8,9]. Analysis of A. glabripennis frass also revealed the presence of lignin degradation products , suggesting that its gut microbial community or the insect itself also harbors lignin degrading genes. The most dominant modification to lignin detected in A. glabripennis was propyl side chain oxidation, a reaction associated with white rot fungal lignin degradation that is not known to be catalyzed by bacterial- or animal-derived enzymes . White rot fungal isolates have not been previously detected in association with A. glabripennis using either culture-dependent or culture-independent approaches [9,11–13], suggesting that the lignin-degrading capacity of this system is unique from well-characterized, pure-culture canonical fungal systems. Therefore, the assemblage of microbes associated with the A. glabripennis midgut represents an excellent candidate for mining novel lignocellulose degrading enzymes for biofuel applications.
Many members of the family Cerambycidae, including A. glabripennis, produce their own endogenous cellulases (endoglucanases and β-glucosidases) and other plant cell wall degrading enzymes [9,14–16]. However, interaction with microbes has been observed to enhance cellulase activities and is hypothesized to enhance glucose release from cellulose in the guts of several beetle species, including A. glabripennis . For example, disruption of the gut microbiota induced by feeding on a cellulose-based artificial diet containing bacteriostatic and fungistatic agents results in a tangible reduction in cellulase complex activity (endoglucanases, exoglucanases, and β-glucosidases) in the A. glabripennis midgut . In addition, insects and other herbivores are generally not capable of producing a full arsenal of O-acetylglucuronxylan-degrading enzymes and they are also generally unable to utilize pentose sugars present in xylan (e.g., D-xylose) without the aid of xylose-degrading microbes . Although animal-derived enzymes have been hypothesized to be involved in lignin degradation  and an endogenous termite laccase can chemically modify lignin alkali and degrade lignin phenolics in vitro , microbes living in the guts of wood-feeding insects also have the capacity to produce enzymes that contribute to or enhance endogenous ligninase activities supplied by host enzymes [21,22]. Therefore, herbivorous animals, and specifically wood-feeding insects, likely benefit from enzymes produced by microbes to facilitate the digestion of woody tissue.
Wood-feeding insects exploit a variety of strategies to liberate carbohydrates from recalcitrant plant tissues and most wood-feeding insects maintain obligate associations with microbes. Associations of microbes with wood-feeding insects occur through cultivation of wood-degrading fungi , direct ingestion of fungal or bacterial enzymes , preferential feeding on compromised (stressed/decaying) trees whose structural polysaccharides have been previously disrupted by environmental wood-degrading microbes , or endosymbiosis with wood-degrading microbes . These microbial affiliates are thought to make important contributions to lignocellulose digestion in a phylogenetically diverse array of insects, including several beetle species where microbial fermentation products have been detected in the gut . Despite the associations between wood-feeding insects and microbes, the fate of lignin and the lignin degrading abilities of the microbial communities associated with many wood-feeding insects (with the exception of termites)  are largely uncharacterized; furthermore, no lignin degrading genes or proteins outside the white rot basidiomycetes have been annotated in metagenomes sampled from any wood-feeding insect microbial communities to date.
Wood-boring cerambycids harbor large communities of microbes, but little is known about their metabolic potential, other than the role of yeast-like gut symbionts in the digestion of hemicellulose and fermentation of xylose, which has been extensively studied . Community profiling of wood-feeding cerambycid guts has revealed a striking degree of diversity in terms of community richness. In general, stenophagous insects with restricted host ranges tend to have less complex and more static gut communities than polyphagous wood-feeding insects that can colonize a broad range of host tree species and tend to have more diverse and plastic communities. This diversity and plasticity is hypothesized to allow these insects to colonize and thrive in a broader range of host trees . Microbial community profiling of A. glabripennis larvae feeding in a variety of host tree species demonstrated that the composition of the community was plastic and varied by host tree species . However, the composition of the A. glabripennis midgut bacterial community was distinct from the wood bacterial community sampled from unforaged sections of the tree . Also, members of the Fusarium solani species complex 6 (group FSSC-6) have been consistently detected in the midguts of A. glabripennis larvae collected from multiple geographic locations and multiple host tree species, as well as larvae feeding on sterilized artificial diet . These findings suggest that not all of the microbes detected in the gut are acquired directly from the host tree.
The primary goals of this study were to provide a functional and taxonomic profile of the larval midgut microbial community of an invasive A. glabripennis population feeding on a preferred host (silver maple; Acer saccharinum) through next generation sequencing of small ribosomal subunit (SSU) amplicons and total DNA collected from the A. glabripennis midgut microbiota. Through this analysis, we compiled a suite of candidate genes found in the A. glabripennis microbial community whose annotations are consistent with lignin-, cellulose-, and hemicellulose-degrading capabilities and other genes that may have roles in nutrient synthesis and detoxification. These microbial genes are hypothesized to make key contributions to the ability of this insect to attack and develop in a broad range of healthy host trees [29,30]. We used a large-scale comparative metagenomic approach that included metagenomes derived from herbivore communities, ranging from grass-feeding ruminants to insects that thrive on highly complex woody substrates, to demonstrate that the A. glabripennis midgut metagenome was distinct from other host-associated metagenomes and could thus provide valuable insights into the interactions between wood-feeding beetles and their microbial affiliates that contribute to the digestion of woody tissue.
Preparation of Insect Cell Free DNA for Community Profiling and Shotgun Sequencing
Five fourth instar A. glabripennis larvae actively feeding in the heartwood of a preferred host tree (Acer saccharinum; silver maple) were collected from a field site located in Worcester, MA and were transported under permit conditions to a USDA-approved quarantine facility at The Pennsylvania State University for dissection and processing. The sample collection was conducted at a field site that was part of a United States Department of Agriculture’s eradication effort. Permission by the United States Department of Agriculture and by local authorities was obtained under the general permit (P526P-12-02646) Insects were sterilized twice in 70% ethanol to remove surface-contaminating microbes and residual ethanol was removed with a single rinse in sterile milliQ water. Insects were dissected and guts were removed under sterile conditions. For this experiment, we chose to focus exclusively on microbes associated with the midgut contents since this is the most prominent region in the guts of cerambycids. To enrich the sample for microbial cells and exclude insect tissue, the insect-derived peritrophic matrix (PM) that surrounds and protects the food bolus was separated from the midgut contents and DNA was extracted from microbes adhering to the food. DNA was extracted using the Fast DNA Spin Kit for Soil (MP Biomedicals, Santa Ana, CA), which was chosen due to its abilities to lyse cell walls from a variety of microbes and remove plant polysaccharides and other plant secondary metabolites that can co-extract with DNA and interfere with downstream processes. DNA was quantified using a Nano Drop 1000 spectrophotometer (Thermo-Scientific, Walthan, MA) and approximately 1 µg of DNA was used for 16S/18S amplicon and shotgun (total DNA) 454 library construction (Roche, Branford, CT).
454 Amplicon Pyrosequencing to Taxonomically Identify Microbes Associated with the A glabripennis Midgut
To identify the bacterial and fungal taxa found in association with the A. glabripennis midgut and to confirm that this sample was successfully enriched for microbial DNA prior to shotgun sequencing, a 16S/18S amplicon library encompassing the V6-V8 hypervariable regions was constructed using a set of primers designed to co-amplify both 16S bacterial rDNA and 18S fungal, insect, and plant rDNA from positions 926F to 1392R . The amplicon library was constructed following the Department of Energy-Joint Genome Institute’s Standard Operating Procedure. In brief, 20 ng of genomic DNA were added to a PCR cocktail containing 6 µL 5X PCR buffer, 2 µL GC melt solution (Clonetech, Mountain View, CA), 0.4 µL Taq Polymerase (Advantage 2 Polymerase, Clonetech, Mountain View, CA), 0.4 µL 10 mM dNTPs (Fermentas, Pittsburgh, PA), 1 µL 25 nM forward primer (926F: 5’-CCTATCGGGTGTGTGCCTTGGCAGTCTCAGAAACTYAAAKGAATTGACGG-3’) and 1 µL 25 nM reverse primer (1392R: 5’-CCATCTCATCCCTGCGTGTCTCCGACTCAGCTACTACGGGCGGTGTGTGC-3’). GC melt solution (Clonetech, Mountain View, CA) and Advantage 2 Polymerase (Clonetech, Mountain View, CA) were used to improve amplification efficiency of templates with high GC content. Primers were constructed using the standard 454 Titanium adaptor sequence (italics) and a five base-pair bar code incorporated into the reverse primer (bold). PCR thermal cycling conditions included an initial denaturation for three minutes at 95°C followed by 25 cycles of 95°C for 30s, 50°C for 45s, and 72°C for 90s and a final extension at 68 °C for 10 minutes. Product quality was assessed by agarose gel electrophoresis and the final product was purified using SPRI beads and quantified using the Quant-IT dsDNA Assay on a Qubit fluorimeter (Life Technologies, Carlsbad, CA). Approximately 7,000 reads were sequenced using 454 Titanium chemistry (Roche, Branford, CT). High quality reads greater than 250 bp in length were clustered into operational taxonomic units (OTUs) at 97% similarity and rarefaction curves and richness estimates were computed using the program mothur (version 1.2.22) . Putative chimeras were identified using UCHIME  and were omitted from the analyses. Sequences for representative OTUs were compared to the non-redundant nucleotide database using BLASTN (BLAST-2.2.23)  with an e-value threshold of 0.00001 to determine whether the OTU was of bacterial, fungal, insect, or plant origin. Bacterial reads were classified using Ribosomal Database Project (RDP) Classifier , with an 80% confidence threshold for taxonomic classifications; sequences classified as mitochondrial or chloroplast in origin were omitted from the analysis. Fungal reads were classified by comparison to the non-redundant nucleotide database using BLASTN (BLAST-2.2.23) with an e-value threshold of 0.00001 followed by MEGAN classification  of the top ten blast alignments using the least common ancestor algorithm. Alignments to unidentified or uncultured fungi were removed from BLAST results prior to MEGAN classification. Plant- and insect-derived OTUs were excluded from the analysis. Representative sequences of each bacterial OTU were aligned with ClustalW and were trimmed to 250 bp in length for phylogenetic reconstruction using Garli (version 2.0) . TIM1 + I + G was chosen as the optimal evolutionary model by jModelTest  and 500 bootstrap replicates were compiled to generate a consensus tree. High quality 454 amplicon reads are deposited in the NCBI Sequence Read Archive (SRA) under the accession number SRR767751.
Phylogenetic Binning and Functional Analysis of A glabripennis Midgut Microbiota Using Shotgun 454 Pyrosequencing
454 shotgun libraries were constructed using a modified version of the 454 standard library protocol. In brief, 500 ng of DNA were sheared using a sonicator (Covaris, Woburn, MA) and fragments ranging from 500 to 800 bp were size selected using ampure beads. DNA fragments were end-polished, purified, and ligated to 454 Titanium adapters. A fill-in reaction was performed and the ssDNA template was isolated, purified, and prepared for emulsion PCR (emPCR). Additional cycles were added to the emPCR protocol to linearly amplify 454 adapter-ligated DNA from low yield DNA extractions. A previous study comparing metagenome libraries prepared with additional emPCR cycles to libraries prepared with standard numbers of emPCR cycles revealed no substantial amplification biases in libraries prepared with extra emPCR cycles (unpublished data). Based on this study, we suspect that no major biases were introduced using this approach. A total of 1.25 million shotgun reads (382 Mb) were sequenced at the DOE-Joint Genome Institute using 454 Titanium chemistry (Roche, Branford, CT). Raw reads are deposited in the NCBI Sequence Read Archive under the accession number SRR767751.
Initially, reads were assembled using Newbler (Roche, Branford, CT), but the midgut community was diverse, containing 166 bacterial OTUs and 7 fungal OTUs and the sequencing depth per OTU was too low to generate a high quality assembly. Consequently, the N50 contig length was low (< 1000 bp), and coverage across contigs was not uniform. There was also significant possibility of generating chimeric contigs consisting of reads from more than one bacterial taxon (Table 1) . We felt the slight improvement in contig sequence length versus raw read length was outweighed by these assembly issues; therefore, rather than using assembled contigs, high quality shotgun reads were treated as individual gene tags, which were used for annotations (with the exception of comparisons to other metagenome communities and candidate lignin degrading gene comparisons, in which assembled contigs were used to maintain consistency with the other datasets). For annotation and analysis of the unassembled reads, low quality reads with mean quality scores below 20, reads containing repetitive regions, and reads less than 150 bp in length were excluded from the dataset. Tags originating from non-coding RNAs, including tRNAs and rRNAs, were detected with tRNA-Scan  and HMMer using HMM profiles for prokaryotic, eukaryotic, and archaeal small subunit and large subunit rRNAs [41,42]. While tRNAs were filtered out of the dataset and were not utilized in downstream functional analyses, small subunit (16S and 18S) rRNAs detected were taxonomically classified by alignment to the SILVA SSU database  to detect additional bacterial and fungal taxa that may not have been detected with 454 amplicon analysis due to primer inefficiencies or biases. After filtering and removing non-coding RNAs, 1.06 million reads, ranging in length from 150 to 1050 bp, remained (mean read length: 350 bp).
|Number of 454 Shotgun Reads Produced||1,258,810|
|Number of Contigs||25,838|
|Number of Singleton Reads||585,749|
|Minimum Contig Length (bp)||200|
|Maximum Contig Length (bp)||30,393|
|Total Number of Assembled (bp)||22,220,287|
|Total Number of Unassembled (bp)||179,346,064|
454 library adapters and low quality ends were trimmed from the remaining reads. Individual reads were annotated by BLASTX comparisons to the non-redundant (NR) protein database  using an e-value cutoff of 0.00001 and were taxonomically classified using MEGAN (MEtaGenome ANalyzer)  least common ancestor classification based on the top 10 BLAST alignments for each read. Reads predicted to originate from bacterial or fungal taxa were also uploaded to the MG-RAST server  for gene prediction and assignment to SEED subsystems. Reads were also functionally categorized via an RPS-BLAST comparison  to the Clusters of Orthologous Gene (COG) database . Reads were also assigned to Gene Ontology (GO) terms  and classified to KEGG enzyme classes  using BLAST2GO ; furthermore, reconstruction of metabolic pathways was conducted using MinPath (Minimal set of Pathways) parsimony analysis  of KEGG Orthology (KO) assignments. BLAST results were corroborated by 6-frame translation followed by functional domain analysis using HmmSearch  to scan for Pfam A domains . CAZyme (Carbohydrate active enzyme)  carbohydrase family classifications are based on Pfam domain assignments.
Comparisons to Other Herbivore-Related Metagenomes
Pfam domains from the A. glabripennis metagenome assembly (contigs and un-assembled singleton reads) were compared to domains from assembled (contigs and unassembled singletons) metagenome data sampled from communities associated with herbivores feeding on a diversity of plants that varied in carbohydrate and lignin composition. Pfam functional domains were chosen for comparative analysis because they are relatively short in length, which increases the likelihood that they will be correctly identified in single sequence reads. Therefore, detection and subsequent annotation of these domains are less likely to be influenced by assembly contiguity, which varied between the metagenome libraries. Annotated Pfam domains were obtained from the JGI IGM/M database for microbial communities associated with 1) herbivores that feed on a variety of plant tissues: panda, reindeer, honey bee, attine ant fungal garden, and wallaby; 2) insects that feed only on phloem and/or xylem tissue: Dendroctonous frontalis galleries and guts, Dendroctonous ponderosae galleries and guts, Xyleborus affinis galleries and guts (larval and adult); and 3) insects that feed only in woody tissue: Amitermes wheeleri hindgut, Nasutitermes sp. hindgut, Sirex noctilio fungal gallery, and a community affiliated with Trichonympha protist symbionts of termites collected from Los Padres National Forest, CA. The Pfam compositions of these communities were compared to the Pfam composition of the Anoplophora glabripennis midgut community. For each community, data were normalized by total number of Pfam domains detected, weighted by contig depth when assembly information was available, and a compositional dissimilarity matrix was constructed based on Euclidean distance. For unassembled singleton reads, a contig depth of one was assumed. Samples were subjected to cluster analysis using Ward’s method. Further, the standardized data were also analyzed using unconstrained Principal Components Analysis to plot samples in multidimensional space. PCA ordination was selected because the data were determined to be linear by detrended correspondence analysis (DCA) (Beta diversity <4). Partially constrained redundancy analysis (RDA), removing effects of library size, did not significantly change the ordination, indicating that differences in library sizes do not significantly influence the ordination. All multivariate comparisons and ordinations were performed using the R statistical package with ‘vegan’ and ‘cluster’ libraries.
Results and Discussion
Taxonomic Classification of OTUs and Shotgun Reads
Approximately 6.7% of the total shotgun reads were classified to class Hexapoda while approximately 0.2% of the total shotgun reads were classified as plant, indicating that the metagenome library was comprised predominantly of microbial DNA. Amplicon sequencing identified seven distinct fungal OTUs and 166 bacterial OTUs using a 97% similarity threshold in mothur, while only a single insect OTU (2% of the total amplicons) and a single plant OTU (0.53% of the total amplicons) were detected. Overall, fungal reads outnumbered bacterial reads, which could be attributed to a higher relative abundance of fungal taxa in the midgut or to preferential amplification of fungal amplicons with the 926F/1392R primers used in this study, as this dominance is not reflected in the shotgun sequencing data.
OTU taxonomic classification with RDP classifier detected the presence of 166 OTUs in seven bacterial phyla in the midgut community including Actinobacteria (30 OTUs), Bacteroidetes (29 OTUs), Chlamydiae (1 OTU), Firmicutes (14 OTUs), Proteobacteria (80 OTUs), candidate phylum TM7 (3 OTUs), and Verrucomicrobia (5 OTUs), while four OTUs could not be conclusively assigned to any previously-characterized bacterial phyla. Rarefaction analysis and Chao richness estimates predict the presence of over 350 bacterial OTUs (95% confidence interval range: 266-517 OTUs), demonstrating that deeper sampling of amplicon data may result in the detection of additional less abundant bacterial taxa (Figure 1 and Table 2). The most taxonomically-diverse phylum in terms of OTU richness was Proteobacteria, containing 80 distinct OTUs assigned to 22 different families. At the class level, 15 different bacterial classes were identified and the midgut community was dominated by six taxonomic classes (Figure 2 and Table 3). Overall, the single most-prevalent OTU, which comprised over 21% of the bacterial amplicons, was a member of the family Leuconostocaceae that could not be classified to genus level by RDP. Comparison of this OTU to 16S sequences curated in the RDP database revealed that it had highest nucleotide sequence similarity to bacteria in the genus Leuconostoc. Other predominant OTUs were assigned to the family Enterobacteriaceae (8.4% bacterial amplicons), the family Microbacteriaceae (8.3% bacterial amplicons), and to the higher phylum Actinobacteria (9.3% bacterial amplicons). Many OTUs could not be definitely assigned to low taxonomic levels, suggesting that the A. glabripennis midgut microbiota may serve as a reservoir for novel microbes. With the exception of the higher overall abundance of fungal 18s OTUs relative to bacterial 16s OTUs, the results of OTU abundance and classification were corroborated by phylogenetic binning of shotgun reads, which is less impacted by amplification biases relative to PCR-based approaches (Figure S1).
Approximately 166 bacterial OTUs were detected through amplicon sequencing. Various community richness estimators consistently predicted the presence of over 300 OTUs in association with the A. glabripennis gut and, in agreement with this observation, the rarefaction curve failed to reach saturation. This indicates that additional OTUs would likely be detected with additional amplicon sequencing.
|# OTUs Observed||Chao Richness||95% CI Chao||Ace Richness||95% CI Ace||Jackknife Richness||95% CI Jackknife||Simpson Diversity (1-D)||95% CI Simpson Diversity (1-D)|
Representative sequences from each bacterial OTU were aligned with MEGA 4.0 and phylogenetic analysis using was performed using GARLI 2.0 (500 bootstrap pseudoreplicates and TIM1+I+G evolutionary model). Nodes were collapsed and labeled by taxonomic class. Number of OTUs and percentage of amplicons assigned to each class are labeled. OTUs that could not be assigned to class level by RDP were omitted from the analysis.
Identification of Cellulose-, Hemicellulose- and Aromatic Compound- Degrading Bacterial Taxa
Several genera of bacteria were detected in the A. glabripennis midgut community that have been previously implicated in the degradation of lignocellulose, hemicellulose, and other aromatic hydrocarbons, including the following lignocellulose degrading bacteria previously isolated from the A. glabripennis midgut on carboxymethylcellulose-containing media or detected previously through 16S analyses: Brachybacterium, Bradyrhizobium, Corynebacterium, Rhizobium, Pseudomonas, Sphingomonas, and Xanthamonas [9,12]. Furthermore, the midgut community sampled for this study strongly resembles the taxonomic compositions of larval gut communities previously sampled from insects feeding in Acer saccharinum in a separate population (Brooklyn, NY)  and from beetles collected in China , suggesting a consistent relationship between these microbial taxa and A. glabripennis. Of significance is that, unlike the termite and other herbivore-associated gut communities, the microbiota associated with the A. glabripennis midgut is dominated by aerobes and facultative anaerobes with very few obligate anaerobic taxa. To date, all characterized large-scale lignin degrading reactions require oxygen and have only been demonstrated in aerobic environments , such as the A. glabripennis midgut .
Identification of Fungal Community
Fungi are frequently encountered in guts of wood feeding insects , including A. glabripennis ; however, in contrast to the bacterial community, the fungal community is considerably less diverse, containing approximately 7 distinct OTUs. Rarefaction analysis and richness estimates predict 18 fungal OTUs (95% confidence interval: 8-31 OTUs) (Figure 3). Compared to the 16S region in bacteria, 18S regions in fungi display considerably less sequence heterogeneity , even among distant relatives and an accurate assessment of fungal diversity in the A. glabripennis midgut may be underestimated. All fungal taxa detected belonged to the phylum Ascomycota, confirming a low abundance or complete absence of white-rot basidiomycetes in the midgut microbiota. All of the fungal taxa detected were yeasts assigned to the family Saccharomycetaceae. However, most could not be conclusively classified to genus level with MEGAN, but had highest-scoring BLAST alignments to the genera Issatchenika (3 OTUs; 58% total fungal amplicons) and Saccharomyces (1 OTU; 36% total fungal amplicons). The three other fungal OTUs were present as singletons and had highest-scoring BLAST alignments to the fungal genera Geotrichum, Pichia, and an unclassified member of the family Archaeosporaceae. Many of these genera are phylogenetically close relatives to yeasts isolated from the guts of other wood-feeding cerambycid beetles , which are often capable of processing hemicellulose and fermenting xylose into ethanol, but are not known to degrade lignin or cellulose. Many wood- and plant-feeding insects, such as leaf-cutter ants , wood wasps , bark beetles  and some termite species  maintain obligate external associations with non-yeast filamentous basidiomycete and ascomycete fungi and directly inoculate fungal isolates into their food sources, where they facilitate pre-digestion of lignocellulose and serve other nutrient-provisioning roles. These strategies substantially reduce the carbohydrate complexity and lignin content of the food substrate prior to ingestion by the insect. In contrast, A. glabripennis constitutively harbors a filamentous ascomycete belonging to the Fusarium solani species complex within its midgut . Multilocus phylogenetic analysis of this isolate collected from several geographic populations revealed that the isolates harbored in the beetle gut are distinct from other previously characterized members of the F. solani species complex. Moreover, this fungus could be detected in colony-reared insects feeding on sterile diet , suggesting that this fungus is intricately associated with the gut. Though F. solani was not detected in the 18S fungal amplicon data, F. solani has been cultivated previously from A. glabripennis beetle guts collected at this field site  and reads derived from F. solani were detected in the shotgun library. This low abundance of F. solani reads in the shotgun libraries is likely due to excluding the peritrophic matrix from the sample as F. solani is likely associated with the gut wall tissue. Members of the Fusarium species complex are metabolically versatile and often harbor lignin peroxidase and other ligninase homologs , which suggests contributions to these processes in the A. glabripennis midgut .
Seven fungal OTUs were detected through amplicon sequencing. While rarefaction begins to approach saturation, richness estimates predict the presence of at least 11 fungal OTUs indicating that additional sampling may be necessary. This scenario is likely since additional 18S rRNAs from fungal taxa not detected in the 18S amplicons were detected in the shotgun reads (e.g., Fusarium spp.).
Functional Profiling of Reads Generated through 454 Shotgun Sequencing
Approximately 65% of the high quality 454 reads generated had BLASTX matches to proteins from the non-redundant protein database at an e-value of 0.00001 or lower. Of these reads, approximately 79% had best alignment scores to annotated proteins, while the remaining 21% had highest scoring BLAST alignments to hypothetical or uncharacterized proteins. Overall, the most abundant BLAST and Pfam domain assignments associated with the midgut microbial community belonged to ABC transporters, major facilitator transporters, alcohol dehydrogenases, and aldehyde dehydrogenases. Functional categorization of shotgun reads by both COG and SEED assignments predicted that the majority of the reads originated from pathways involved in the metabolism of carbohydrates and amino acids (Figure 4). Annotation statistics are summarized in Table 4 and annotations are publically available through MG-RAST at http://metagenomics.anl.gov/ under the identification number 4453653.3 and JGI IMG/M at http://img.jgi.doe.gov/m/ under project ID Gm00068.
Reads assigned to 28 SEED subsystems were detected in the A. glabripennis larval midgut metagenome. The most dominant subsystems found in association with this microbial community included clustering based subsystems, carbohydrate metabolism, and amino acid and derivatives metabolism.
|Number of High Quality Shotgun 454 Reads||1,067,718|
|Number of rRNAs||397|
|Number of tRNAs||2,596|
|Number of reads with BLASTX alignments to annotated proteins in non-redundant protein database (e-value = 0.00001)||541,761|
|Number of reads with BLASTX alignments to hypothetical proteins in non-redundant protein database (e-value = 0.00001)||144,965|
|Number of reads with COG (Clusters of Orthologous Genes) assignments||357,999|
|Number of reads with Seed assignments||255,091|
|Number of reads with GO (Gene Ontology) assignments||361,412|
|Number of reads with KEGG assignments||173,359|
|Number of reads with Pfam domains||420,285|
|Number of reads with BLASTX alignments and Pfam domains||409,594|
|Number of reads with Pfam domains only (no BLASTX alignments)||10,691|
Comparison of Functional Domains from Other Herbivore Associated Microbial Communities
Hierarchical agglomerative cluster analysis based on Pfam abundances from herbivore-associated metagenomes did not appear to group the microbial communities based on the taxonomic relatedness of their herbivore hosts (Figure 5). Although many of the beetle gut communities and fungal gallery communities are derived from closely related beetles and cluster together, several notable exceptions suggest that factors other than taxonomic relatedness contribute to the hierarchical clustering pattern observed. For example, although A. glabripennis (Order Coleoptera) and S. noctilio (Order Hymenoptera) belong to two different insect orders, their microbial communities can be found in the same group in the cluster analysis, suggesting that they share similarities in microbial metabolic capabilities. Additionally, the two hymenopterans included in this comparison (honey bee and Sirex) fall into two distant clusters. However, a clear division between gut communities and fungal gallery communities is apparent, with the exceptions of the ant fungal garden, which clustered with the herbivore gut communities and was previously hypothesized to function as an external rumen . The A. glabripennis midgut community is also an exception as it clustered with the fungal gallery communities. Interestingly, many of the fungal gallery communities that cluster with the A. glabripennis metagenome are hypothesized to have lignin degrading capabilities, which is in contrast to the ant fungal garden community. While cellulose and hemicellulose were preferentially degraded in the fungal gardens, lignin remained relatively unscathed and was ultimately discarded by the insects . The same pattern of cell wall digestion has also been observed in the rumens of many grass-feeding herbivores .
Agglomerative hierarchical cluster analysis based on a compositional Euclidean distance matrix was conducted using Pfam annotations from various herbivore related metagenomes. Three distinct clusters representing different herbivore biome-types are highlighted and labeled. These include herbivore gut communities, fungal gallery communities associated with phoem/xylem feeding insects and communities associated with insects feeding in heartwood.
Although fungal communities cultivated by bark beetles  are primed to synthesize nutrients and detoxify plant secondary metabolites , penetration of the lignin barrier enhances access to cellulose and hemicellulose present in both phloem and xylem tissues where bark beetles feed. Although the fate of lignin in the majority of these systems is unclear, lignin degradation and aromatic compound metabolism have been demonstrated in a Fusarium solani fungal gallery strain associated with xylem-feeding ambrosia beetles (e.g., Xyleborus) . Thus, the fungal gallery communities associated with these phloem and xylem feeding beetles have the potential to harbor lignin degrading genes capable of degrading woody tissue. The final cluster in our analysis contains microbial communities associated with insects feeding on heartwood and includes the A. glabripennis midgut and Sirex wood wasp fungal gallery communities. Notably, these wood-feeding communities are relatively distant from those associated with the other herbivore guts or the other fungal gallery communities included in this comparison, suggesting that these communities may harbor genes that encode enzymes optimized for breaking down complex and recalcitrant woody tissue. Like A. glabripennis, the Sirex fungal gallery community is also capable of disrupting lignin polymers and the community contains a lignin degrading white rot fungus belonging to the genus Amylostereum, which produces manganese peroxidases and laccases .
The groupings detected through hierarchical cluster analysis are also supported by Principal Components Ordination (Figure 6). The X-axis separates the majority of the gut communities from the gallery communities with the notable exception of the A. glabripennis midgut, which is clearly distinct from the other gut metagenomes and was placed in close proximity to the Sirex fungal gallery microbiome. The Y-axis separates fungal gallery communities associated with phloem-feeding herbivores from wood-feeding herbivores that bore deep into the heartwood. Although both Sirex and A. glabripennis insects feed in similar regions of their host trees, Sirex has a limited host range relative to A. glabripennis and feeds exclusively on the genus Pinus . In contrast, A. glabripennis has a much broader host range and feeds in the heartwood of over 25 deciduous tree species in the United States (http://www.aphis.usda.gov/plant_health/plant_pest_info/asian_lhb/downloads/hostlist.pdf) and 47 tree species in its native range . These differences in lifestyle are also reflected in the PCA ordination. Although the A. glabripennis midgut community is most similar to the Sirex fungal gallery community, the distance between these two metagenomes is still quite significant and could be partially driven by differences in host range breadth and environment (e.g. gut vs. gallery).
Principal components analysis was conducted to plot samples in multidimensional space. Groupings detected in agglomerative cluster analysis are preserved (Mantel test, p< 0.0001) and are color-coded by groups identified in the dendrogram. Monte Carlo Permutation Procedure (n=1000 iterations): p<0.0001 for PCA 1 and PCA2. DP: Dendroctonous ponderosae DF: Dendroctonous frontalis CR: Costa Rica, FL: Florida.
Candidate Genes for Lignin Degrading Enzymes
Genes encoding enzymes that have been previously implicated in lignin degradation were identified in the microbiomes affiliated with both the midgut of A. glabripennis and the fungal gallery communities, and may be partially responsible for driving the grouping of these communities in the hierarchical analysis (Table S1). This is in contrast to the results of a recent comparative metagenomic study that concluded host-associated communities lacked the metabolic potential to degrade lignin , and may indicate that the A. glabripennis midgut community represents an exception. A number of bacterial and fungal reads with copper oxidase (Cu oxidase) Pfam domains were detected in the A. glabripennis midgut, which could have laccase-type activity in vivo . While many of these reads had corresponding BLAST assignments to laccases, multicopper oxidases, and polyphenol oxidases, a large number of the annotations were to hypothetical proteins and could represent novel and previously uncharacterized laccase-type enzymes. While laccases do not endogenously have a high enough redox potential to cleave major linkages in polymeric lignin , their activity can be enhanced in the presence of natural redox mediators  and, they are capable of disrupting β-aryl ether bonds under these conditions. β-aryl ethers represent the most dominant linkage in hardwood lignin and as a consequence, disruption of these linkages represents a critical step in lignin degradation .
A number of other extracellular peroxidases that are often highly expressed by lignin degrading microbes during periods of active lignin degradation were also detected. These include iron-dependent peroxidases, thiol peroxidases, and a number of other uncharacterized peroxidases. The potential participation of these peroxidases in large-scale lignin degradation is also supported by the detection of a number of peroxide-generating enzymes containing predicted leader sequences for extracellular targeting. These included aryl alcohol oxidases, FAD oxidoreductases, glyoxal oxidases, GMC oxidoreductases, and pyranose oxidases.
Bacterial dye-decolorizing peroxidases, also known as dyp-type peroxidases, were detected in association with the A. glabripennis midgut microbiota and microbial communities associated with other wood-feeding insects, and have previously been shown to cleave β-aryl ether linkages in both syringyl and guaiacyl lignin in a hydrogen peroxide dependent manner . While there is some evidence that manganese may act as a diffusible redox mediator in some bacterial dyp-type peroxidases , not all β-aryl ether cleaving peroxidases have identifiable manganese binding sites and thus, manganese may enhance the activity of a subset of these peroxidases . Furthermore, reads for another set of β-aryl ether degrading enzymes were also discovered, which have been shown to catalyze the cleavage of these bonds in a glutathione-dependent manner. These enzymes were classified as β-etherases or glutathione-S-transferases . In order to cleave β-aryl ether linkages, these enzymes first require oxidation of the Cα primary alcohol by aryl alcohol dehydrogenase (or Cα dehydrogenase) to generate a ketone group. The presence of a ketone group immediately adjacent to the ether linkage increases the polarity of the ether bond, allowing the ether bond to be easily cleaved by β-etherase, using glutathione as a hydrogen donor . However, these GST (β-etherase) functional domains were not exclusively present in candidate lignin degrading genes  and are also associated with genes involved in detoxification (i.e., glutathione s-transferases) . Therefore, only a subset of the GST domain proteins reported in this analysis are lignin degrading candidates. The role of dyp-type peroxidases and β-etherases in polymeric lignin degradation has yet to be clarified. While some bacteria harboring these genes can cleave β-aryl ether linkages in dimeric lignin model compounds and Kraft and wheat straw lignin, their ability to catalyze degradation of an intact biopolymer from woody plants is unknown .
Of significance is that the majority of the lignin degrading genes present in the A. glabripennis midgut community are either absent or present in very low abundances in the communities associated with herbivore guts, including, panda, reindeer, honey bee, and wallaby and termites. This finding suggests that these herbivore communities may have alternate genes and mechanisms that could have lignin degrading roles in vivo or that some of these gut-associated communities lack lignin degrading capabilities altogether. In contrast, these lignin degrading candidates were highly abundant in the communities associated with wood-feeding insects, including the Sirex fungal gallery and A. glabripennis midgut. Consistent with their hypothesized role in the pre-digestion of lignocellulose for phloem-feeding insects, many lignin-degrading candidates were also found in high abundances in the fungal galleries of phloem feeding bark beetles. Although small subsets of these lignin degrading genes were also detected in guts of phloem feeding insects, these genes are likely environmentally derived and were acquired by feeding on the fungal gallery inoculum or they may also be encoded by microbes housed in the gut. Notably, peroxidases and extracellular hydrogen-peroxide generating enzymes were overrepresented in the A. glabripennis midgut community relative to other communities included in this analysis, suggesting that this community may have alternative pathways for degrading core lignin.
Despite the high abundances of putative laccases, dyp-type peroxidases, and hydrogen peroxide generating enzymes (FAD oxidases and GMC oxidoreductases) in the fungal gallery communities and the A. glabripennis midgut community, another class of putative lignin degrading enzymes (aldo-keto reductases: AKRs) were well represented in the termite gut communities, the tamar wallaby gut community, a subset of the fungal gallery communities (e.g. Xyleborus, DP Fungal Alberta (hybrid), DP Fungal Alberta, and DF Fungal Mississippi), and the A. glabripennis midgut community. An endogenous termite AKR capable of degrading lignin phenolics and enhancing sugar release from pine sawdust was recently characterized  and subsets of microbial AKRs can act as Cα dehydrogenases, which can work in conjunction with β-etherases to cleave β-aryl ethers . Microbial AKRs are well represented in the termite gut communities and have the potential to collaborate with host-derived AKRs to enhance ligninase activity in the gut. Interestingly, microbial AKRs are overrepresented in the A. glabripennis gut community relative to most other communities included in the comparison and have the potential to make contributions to digestion of lignin in this system. Taken together, we hypothesize that the A. glabripennis midgut metagenome has a lignin degrading capacity distinct from the termites and other herbivore associated communities that could be prospected for biotechnology purposes. This possibility is supported by the fact that biochemical modifications to lignin detected in the gut of a lower termite (Zootermopsis angusticollis) were different than the lignin modifications detected in the A. glabripennis gut .
Candidate Genes for Cellulases and Carbohydrases
Although many of reads with predicted involvement in carbohydrate digestion are involved in core metabolic pathways, such as glycolysis, many also were annotated by BLAST as accessory enzymes that can digest cellulose and other plant cell wall carbohydrates. For example, reads were classified into 36 different glycoside hydrolase (GH) families based on a combination of Pfam domain and KEGG enzyme class assignments (Figure 7). The most abundant CAZyme (Carbohydrate Active Enzyme) families detected were represented by families GH 1 and GH 3 and their associated KEGG EC assignments are presented in Table 5. The majority of these GH 1 and 3 enzymes were predicted to encode β-glucosidases. KEGG E.C. assignments for all GHs detected in the A. glabripennis midgut metagenome can be found in Table S2.
Reads assigned to 36 glycoside hydrolase families were detected in the gut microbiome. The most dominant families were GH 1 and 3, while GH families 11, 45, 46, 61, and 71 were present in very low abundances.
|GH Family||KEGG ECs||Reactions|
|1,3||β-glucosidase (EC 22.214.171.124)||Hydrolyzes β-1,4 linkages in glucose-containing disaccharides|
|1||β-galactosidase (EC 126.96.36.199)||Hydrolyzes β-galactosidic bond between galactose and its organic functional group|
|1||β-mannosidase (EC 188.8.131.52)||Hydrolyzes terminal, non-reducing mannose residues from β-D linked mannosides|
|1||β-glucuronidase (EC184.108.40.206)||Hydrolyzes β-D glucuronic acid residues from non-reducing end of glycosoaminoglycans|
|1||Exo-β-1,4-glucanase (EC 220.127.116.11)||Releases cello-oligomers from exposed polysaccharide termini in cellulose|
|1||6-phospho-β-galactosidase (EC 18.104.22.168)||Hydrolyzes β-galactosidic bond between a 6-phospho-β-D-galactose and its organic functional group|
|1||6-phospho-β-glucosidase (EC 22.214.171.124)||Hydrolyzes β-1,4 linkages in glucose substituted disaccharides containing phosphorylated glucoside residues|
|1||Strictosidine amygdalin β-glucosidase (EC 126.96.36.199)||Liberates D-glucose from strictosidine|
|1||Thioglucosidase (EC 188.8.131.52)||Hydrolyzes linkage between thiol and glycosinolate|
|1||β-primeverosidase (EC 184.108.40.206)||Hydrolyzes linkage between 6-O-(beta-D-xylopyranosyl)-beta-D-glucopyranoside its organic functional group|
|3||Xylan 1,4-β -xylosidase (EC 220.127.116.11)||Hydrolyzes linkage between β-linked xylose residues in β-1,4 xylan|
|3||β -N-acetylhexosaminidase (EC 18.104.22.168)||Liberates hexose from gangliosides|
|3||Glucan 1,3-β -glucosidase||Cleaves β-1,3 linkages in β glucans|
|3||Endo-β -1,4-glucanase||Cleaves internal bonds in crystalline cellulose to liberate polysaccharide termini|
|3||Exo-1,3-1,4-glucanase||Releases cello-oligomers from β-1,3 or β-1,4 linked glucose oligosaccharides and polysaccharides|
|3||α-L-arabinofuranosidase||Hydrolyszes α-1,3 in arabinose-containing oligosaccharides and polysaccharides|
Many of these GH families could have key roles in processing cellulose, hemicellulose, and other plant polysaccharides in the A. glabripennis midgut. Of particular interest are cellulases (endoglucanases, exoglucanases, and β-glucosidases) that could augment the activities of cellulases inherently produced by A. glabripennis, enhancing the release of glucose from this highly insoluble and indigestible polysaccharide. Microbial cellulases detected in the A. glabripennis midgut metagenome were classified to seven different GH families, including GH 1, GH 3, GH 5, GH 6, GH 9, GH 45, and GH 61 and their corresponding KEGG E.C. assignments suggest the presence of all enzymes necessary to liberate glucose from cellulose. We hypothesize that these microbial derived cellulases can collaborate with host enzymes to enhance cellulase activity in the midgut of A. glabripennis. Alternatively, the overabundance of microbial-derived β-glucosidases may also allow microbes associated with the gut to exploit cellulose degradation products released by endogenous beetle cellulases secreted into the gut; however, the interactions among the beetle and its gut microbes are likely diverse, intricate, and dynamic and an explanation of why these β-glucosidases are overrepresented in this community cannot be fully determined without further investigation. Additionally, reads with highest BLAST scores to components of cellulosomes and other proteins with carbohydrate binding motifs that facilitate binding to the cellulose substrate, allowing hydrolytic enzymes to act processively and efficiently to release cellobiose and other cello-oligomers.
Candidate Genes for Xylose Utilization and Fermentation
GH families involved in processing hemicellulose were also detected; in general, the structure of hemicellulose is significantly more heterogeneous in comparison to cellulose and is comprised of a matrix of polysaccharides including xylan, glucuronoxylan, arabinoxylan, glucomannan, and xyloglucan. The heterogeneity both in terms of subunit and linkage composition signifies that degrading this prominent group of cell wall polysaccharides requires a greater diversity of enzymes, although xylan and xyloglucans are the dominant hemicellulose polysaccharides in woody plants . Not surprisingly, a number of GH families involved in breaking α- and β-linkages in xylan and xyloglucans were detected in the metagenome, including GH families 5, 8, 10, 11, 26, 39, and 43.
Sugar monomers liberated from xylan can be efficiently metabolized by the midgut microbiota (Figure 8). Of particular importance is the ability to process xylose and arabinose as mechanisms for insect utilization of plant-derived pentose sugars have not been reported  and these sugars are inherently difficult to ferment on an industrial scale. Enzymes from both bacterial and fungal xylose isomerase pathways are well represented in the shotgun data to convert D-xylose into D-xylulose-5-phosphate . D-xylulose-5-phosphate can be processed via the pentose phosphate pathway to produce glyceraldehyde-3-phosphate and fructose-6-phosphate, which can enter the glycolysis pathway . Ultimately, pyruvate produced through glycolysis can be converted to acetaldehyde by pyruvate decarboxylase  and then to ethanol by alcohol dehydrogenase. Alternatively, acetaldehyde can be oxidized to acetate by acetaldehyde dehydrogenase , which can be used as the building blocks for fatty acid production. Although arabinose is a minor constituent of hemicellose in woody plants, it can be converted to D-xylulose-5-phosphate by L-arabinose isomerase and L-ribulokinase where it can be further processed by the pentose phosphate and glycolysis pathways to generate fermentable products . All enzymes required to convert xylose and arabinose to ethanol (or acetate) are present in the A. glabripennis midgut community. Thus, this community could serve as a reservoir for novel enzymes that could be exploited to enhance industrial xylose fermentation.
Xylose released from hemicellulose can be converted into D-xylulose-5-phosphate and eventually into acetaldehyde. Acetaldehyde can be either converted into ethanol by alcohol dehydrogenase or into acetate by acetaldehyde dehydrogenase. These reactions are likely catalyzed by lactic acid bacteria or yeasts associated with the A. glabripennis gut.
Candidate Genes for Pectin Degrading Enzymes
Liberation of sugar monomers from both cellulose and hemicellulose is greatly enhanced when bonds crosslinking these compounds to pectin and lignin are disrupted, releasing polysaccharide termini and promoting easy access by processive hydrolytic enzymes. Pectin is a polysaccharide comprised primarily of α-galacturonic acid residues and it is often esterified to hemicellulosic and cellulosic polysaccharides in heartwood . Degradation of pectin catalyzed by GH 28 polygalacturonases, pectin lyases, pectin esterases, and pectin acetylases and the disruption of ester linkages between pectin and other structural polysaccharides by carboxylesterases, esterases, and acetyl xylan esterases produced by members of the A. glabripennis midgut community could indirectly facilitate cellulose and hemicellulose digestion by exposing polysaccharide termini to hydrolytic enzymes. Galacturonic acid residues released from this polysaccharide can be used as an energy source by the gut microbial community or A. glabripennis as microbial pathways involved in processing galactose and galacturonic acid were detected and pathways involved in galactose utilization have been previously described in beetles .
Candidate Genes for Nutrient Acquisition and Synthesis
Nutrients are extremely scarce in the heartwood where the later instars of A. glabripennis feed. For example, nitrogen is limiting in woody biomass  and nitrogen sources originating from plant cell wall proteins are intricately cross-linked with recalcitrant plant cell wall polysaccharides and biopolymers , while other dietary components, including fatty acids, sterols, and vitamins are present in extremely low abundances or are absent altogether . Besides the abilities of cerambycid beetles to produce endogenous cellulases and detoxification enzymes [14,16,92], little is known about their endogenous digestive and metabolic capabilities. Despite this, transcriptome profiling of other Coleopterans revealed that beetles have impressive endogenous digestive and metabolic capabilities and produce diverse arrays of cell-wall degrading enzymes  and detoxification enzymes [94,95], however, several pathways leading to the synthesis of sterols , aromatic amino acids, and branched chain amino acids are blocked at multiple steps  and these nutrients must either be acquired from the food source or through interactions with gut microbes. Because these nutrients are scarce in woody tissue, it is hypothesized that microbes associated with wood-feeding beetles can synthesize essential nutrients, facilitate nutrient recovery from woody tissue, and augment endogenous detoxification enzyme activities [25,98–100].
Candidate Genes for Nitrogen Acquisition
The C:N ratio in the heartwood of hardwood trees can be as high as 1000:1, although plant cell wall proteins cross-linked in the cell wall matrix may serve as a reservoir of protein sources for organisms that live in this habitat. However, there is much debate about whether or not the protein concentrations in woody tissues are high enough to obtain a sufficient amount of nitrogen for de novo synthesis of nucleotides and amino acids. Therefore, it is generally hypothesized that insects and microbes colonizing the heartwood have mechanisms in place to acquire and utilize atmospheric nitrogen or have efficient pathways to recycle nitrogenous waste products . Several bacterial nitrogen fixing genes were identified to convert atmospheric nitrogen to ammonia, which could then be assimilated and used by the beetle and other members of the midgut community. As a consequence, ammonium transporters and glutamine synthases, which actively transport ammonia into the cell and subsequently convert ammonia and glutamate into glutamine, are also highly represented in the A. glabripennis midgut community. In addition, ammonia (a major byproduct of amino acid deamination reactions) , urea (a major waste product of amino acid degradation produced by bacteria) and uric acid (a major nitrogenous waste product produced by insects)  represent suitable sources of nitrogen that can be recapitulated and recycled through urease, uricase, and allatonin degradation pathways encoded by the midgut community. Overall, reads assigned to recycling pathways were far more abundant than reads assigned to nitrogen fixing pathways; therefore, we hypothesize that that nitrogen recycling might make important contributions to the nitrogen economy in the larval A. glabripennis midgut community. Alternatively, nitrogen fixation pathways may also be prominent in the A. glabripennis community, but these bacteria may be more associated with other regions of the gut where oxygen levels are lower (e.g., hindgut), which were not sampled for this study. Furthermore, a wide array of proteinases with broad substrate abilities is associated with the gut community. This array of enzymes has the capacity to degrade plant proteins released from the plant cell wall matrix during active lignocellulose degradation and scavenge nitrogen from xenobiotic substrates, including cyanide, alkaloids , and non-protein amino acids (i.e., cyanoamino acids) . Finally, the gut community possesses full or partial pathways for the synthesis of 23 amino acids, including full pathways for the biosynthesis of aromatic amino acids.
Candidate Genes for Sterol, Vitamin, and Fatty Acid Synthesis
Other nutrients notably missing or present in low abundances in woody tissue include sterols, vitamins, fatty acids, and inorganic ions . Unlike other animals, insects cannot synthesize cholesterol as this pathway is blocked at several steps; thus, they must acquire sterols that can be converted to cholesterol from their feeding substrate . Many wood-feeding insects (e.g., ambrosia beetles) convert ergosterols produced by cultivated fungal symbionts into cholesterol , while others actively convert a variety of phytosterols produced by plants into cholesterol . The F. solani isolate as well as yeasts harbored in the A. glabripennis gut have the capacity to contribute to the synthesis of cholesterol and, accordingly, a number of ergosterol synthesis genes (e.g., C-22 sterol desaturase, cytochrome P450s, and lanosterol 14 alpha demethylase) assigned to phylum Ascomycota, were detected. Vitamins and other nutrients missing from woody tissue can be produced or efficiently assimilated by the A. glabripennis gut community. A combination of acetate, produced via conversion of sugar monomers liberated from woody polysaccharides, and coenzyme A, synthesized by microbial constituents, could be used to synthesize acetyl CoA which is the essential building block for fatty acid synthesis . Furthermore, pathways for synthesizing biotin (vitamin B7), coenzyme A folate (vitamin B9), lipoic acid, pyridoxine (vitamin B6), riboflavin (vitamin B2) thiamine (vitamin B1), and ubiquinone (coenzyme Q10) are well represented in the gut community.
Candidate Genes for Detoxification
Woody plants produce an array of secondary metabolites and digestive enzyme inhibitors in an attempt to restrict insect herbivory and colonization by pathogenic microbes. These compounds often accumulate in the heartwood of the plant . While many insects endogenously produce impressive arrays of detoxification enzymes or have mechanisms to sequester plant toxins, many beetle species directly benefit from detoxification enzymes produced by microbes [110,111]. For example, microbial communities associated with bark beetles feeding in phloem tissue, which serves as a conduit for toxic defensive chemicals, are highly enriched for detoxification genes . The A. glabripennis midgut microbial community also encodes genes that can mitigate host plant defenses. A number of bacterial and fungal reads with highest scoring BLAST alignments to host plant inducible cytochrome P450s were detected that are known to promiscuously degrade xenobiotic substrates in an oxidoreductive manner . Reads corresponding to enzymes involved in glutathione-mediated detoxification, including glutathione peroxidases, glutathione-S-transferases, and glutathione reductases, were detected in the gut metagenome. The broad substrate specificities of these quintessential detoxification enzymes allow them to act on a wide range of toxic metabolites produced by many species of host trees. Additionally, most plants produce salicylic acid as a defense mediator against pathogens, which induces the production of defensive compounds. Furthermore, salicylic acid and its regulated pathways have indirect roles in anti-herbivory defenses since they can negatively impact symbiotic microbes associated with herbivores. However, the gut community is capable of producing a number of isochorismatase family proteins hypothesized to disrupt the salicylic acid pathway, which uses isochorismate as a key intermediate . A number of salicylate hydratases were found in the A. glabripennis gut metagenome that could directly destroy salicylic acid to prevent induction of plant defensive pathways.
Metabolism of lignin also releases highly toxic metabolites, which can cause irreversible damage to the peritrophic matrix, digestive enzymes, and gut-associated microbes. While the cytochrome P450 enzymes mentioned previously could aid in the detoxification of these metabolites, other xenobiotic degrading enzymes were detected that could be involved in these processes, including glutathione S-transferases, glutathione S-peroxidases, epoxide hydrolases, aldo-keto reductases, and alcohol dehydrogenases. Further, several enzymes that hypothesized to directly break down small metabolites released from large-scale lignin degradation were detected in the A glabripennis metagenome and included lignostilbene-α-β-dioxygenases, 1,2 and 3,4 aromatic ring dioxygenases, biphenyl 2,3 dioxygenases, and ligX, ligZ, ligY, ligW, and ligW1, which have been observed to coordinate the degradation of ferulic acid and other small molecules released from lignin degradation . A number of enzymes that could function as antioxidants were also detected, which may prevent oxidative damage to the midgut or the microbiota from the ingestion of toxic dietary compounds (e.g. tannins) or from oxidative degradation of lignin.
Finally, one of the most common defense mechanisms employed by plants to reduce herbivory is to produce digestive proteinase enzyme inhibitors to restrict an organism’s ability to break down and assimilate nitrogen . These proteinase enzyme inhibitors typically show high specificity and target a single family of proteinases; however, many insects have evolved a mechanism to overcome these plant defenses by producing a different type of peptidase whose activity and integrity is not impacted by these plant inhibitors . The A. glabripennis microbial gut community has the genetic capacity to produce an assortment of digestive proteinase classes hypothesized to serve as alternative sources of proteinase family activities in the event that host plant proteinase inhibitors disrupt the endogenous proteinase families produced by A. glabripennis. Reduction of cysteine proteinase activity in western corn rootworm (Coleoptera: Diabrotica virgifera virgifera) in antibiotic treated insects has been previously reported , demonstrating a role for microbial derived proteinases in insect digestive physiology.
Candidate Genes from Fusarium
Filamentous fungi belonging to the Fusarium species complex have been observed in association with beetles collected from all US populations and from several species of host trees. Mass spectroscopy based protein identification techniques and in vitro enzyme assays of an F. solani strain associated with the A. glabripennis gut cultivated on wood chips demonstrated that this isolate is capable of producing several extracellular laccase enzymes, indicating that this isolate associated with A. glabripennis has lignin degrading potential. Furthermore, this isolate expressed 28 families of glycoside hydrolases, many of which had predicted cellulase and xylanase activities . In addition to these previously reported findings, genes classified to the genera Fusarium/Nectria were detected in this analysis included flavin-containing amine oxidoreductases (ammonium generation), glutathione-dependent formaldehyde-activating enzyme (methane metabolism), several sugar transporters, and several short chain dehydrogenases, which can participate in many biochemical processes including sterol synthesis, metabolism of sugar alcohols, and metabolism of fermentation products. Whole genome sequencing is currently underway to compile a complete genetic inventory of this unique fungal strain and will provide a more comprehensive insight into its role in the A. glabripennis midgut.
Candidate Genes from Leuconostoc
Although sequencing coverage was not deep enough to generate draft genomes of any individual OTU in the A. glabripennis gut community, roughly 22,000 high quality reads (7.8 Mb) classified to genus Leuconostoc were detected in the A. glabripennis gut metagenome. Bacteria from the genus Leuconostoc and other lactic acid bacteria have been previously identified in the guts of A. glabripennis larvae collected from other populations  and several other species of coleopterans (e.g., Agrilus planipennis and beetles in the family Carabidae) . Many genes taxonomically classified to this genus had highest scoring BLAST alignments to xylose fermentation pathways, pathways for utilization of pentose wood sugars, nitrogen recycling enzymes, nutrient synthesizing enzymes, and enzymes with detoxification abilities. A large number of cellobiose phosphorylases and glycoside hydrolase family 1 β-glucosidases were identified, which could be involved in degrading cellobiose disaccharides released from cellulose chains. In addition, a number of genes predicted to encode xylose transporters and xylose fermentation pathways were detected. Further, genes for the uptake and fermentation of other pentose sugars present in hemicellulose, including ribose and arabinose, were detected. Genes annotated as aromatic acid dioxygenases and aryl alcohol dehydrogenases, which could catalyze the degradation of aromatic subunits released from the lignin biopolymer or serve as helper enzymes for β-aryl ether cleavage catalyzed by dyp-type peroxidases, were also identified. Additionally, pathways involved in nutrient synthesis were also detected, which included pathways for the synthesis of branched chain amino acids, aromatic amino acids, sterols, and vitamins as well as enzymes that could function as antioxidants or in detoxification (e.g. cyanide hydratases). Due to the metabolic capacities for pentose sugar fermentation, nutrient synthesis, and detoxification, complete genome assembly for the Leuconostoc strains found in association with the A. glabripennis midgut and more in-depth studies to characterize the interactions between Leuconstoc and A. glabripennis species would be of value to pursue in future research.
This study represents the first large scale functional metagenomic analysis of the midgut microbial community of a cerambycid beetle with documented lignin degrading capabilities . A taxonomically diverse assemblage of bacteria and fungi are associated with the midgut of A. glabripennis and this study has shown that this community harbors the enzymatic capacity for extensive contributions to the digestion of woody tissue in this system. Of relevance is i) a microbial community dominated by bacterial and fungal aerobes and facultative anaerobes, indicating an appropriate aerobic environment in the midgut for microbial enzymes involved in oxygen-dependent lignin degradative processes, ii) the similarity of the A. glabripennis midgut microbiota to the Sirex fungal gallery community and its distinction from other herbivore gut communities, including the termite hindgut communities, iii) detection of genes encoding secreted oxidative enzymes proposed to disrupt β-aryl ether linkages and hypothesized to have roles in cleaving β-aryl ether linkages in lignin, iv) detection of extracellular H2O2-generating enzymes, and v) detection of a number of genera with predicted lignocellulolytic and hemicellulolytic capabilities. The midgut community of A. glabripennis has the metabolic potential to produce enzymes to help this wood-boring insect overcome major nutritional challenges associated with feeding in woody tissue and we hypothesize that interactions between the beetle and its gut microbes drive this insect’s ability to colonize and thrive in a broad range of healthy host trees. This wood-degrading system should also have great potential for the development of novel lignocellulose degrading enzymes for applications by the biofuels industry. This study provides the first glimpse into the metabolic potential of the gut community associated with a cerambycid beetle and lays the foundations for future hypothesis-based research, including more in-depth biochemical studies, comparative metagenomics, metatranscriptomics, and pathway modeling to assess potential metabolic cross-talk between this beetle and its gut microbes.
MEGAN classification of shotgun reads. Taxonomic assignments for highly abundant classes (>0.04% relative abundance) detected in the shotgun data. Percentages indicate relative abundance of reads assigned to each class.
Estimated copy number of candidate lignin degrading Pfam domains in herbivore-associated microbial communities. Estimated copies of Pfam domains detected in each herbivore-related metagenome assembly obtained from IMG/M. To obtain abundances, assembled contigs were multipled by read depth when assembly information was available and singleton reads were treated as single copies.
Abundance and Class Level Taxonomic Classification of GH Families in the A. glabripennis gut metagenome. Corresponding KEGG Enzyme Classifications and class level assignments are also presented.
Amplicon and metagenomic shotgun sequencing were performed at the Department of Energy-Joint Genome Institute. Annotation of shotgun reads was performed using computing resources available at the USDA-ARS Pacific Basin Agriculture Research Center (Moana cluster; Hilo, HI), Hawaii Open Supercomputing Center at University of Hawaii (Jaws cluster; Maui, HI) and the Research Computing and Cyberinfrastructure Group at The Pennsylvania State University (LionX clusters; University Park, PA). We thank Al sawyer’s group at USDA-APHIS in Otis, MA, the Massachusetts Department of Conversation and Recreation and Maya Nehme for assistance collecting insects.
Conceived and designed the experiments: EDS KH MT SGT JEC. Performed the experiments: EDS SMG SGT TGdR MC KWB. Analyzed the data: EDS SMG JRH. Contributed reagents/materials/analysis tools: EDS SMG KH JEC. Wrote the manuscript: EDS SMG KH MT JEC.
- 1. Jørgensen H, Vibe-Pedersen J, Larsen J, Felby C (2007) Liquefaction of lignocellulose at high-solids concentrations. Biotechnol Bioeng 96: 862-870. doi:https://doi.org/10.1002/bit.21115. PubMed: 16865734.
- 2. Alvira P, Tomás-Pejó E, Ballesteros M, Negro MJ (2010) Pretreatment technologies for an efficient bioethanol production process based on enzymatic hydrolysis: A review. Bioresour Technol 101: 4851-4861. doi:https://doi.org/10.1016/j.biortech.2009.11.093. PubMed: 20042329.
- 3. Campbell MM, Sederoff RR (1996) Variation in lignin content and composition (mechanisms of control and implications for the genetic improvement of plants). Plant Physiol 110: 3-13. PubMed: 12226169.
- 4. Watanabe H, Tokuda G (2010) Cellulolytic Systems in Insects. Annu Rev Entomol 55: 609-632. PubMed: 19754245.
- 5. Huang SW, Zhang HY, Marshall S, Jackson TA (2010) The scarab gut: A potential bioreactor for bio-fuel production. J Insect Sci 17: 175-183. doi:https://doi.org/10.1111/j.1744-7917.2010.01320.x.
- 6. Haack RA, Law KR, Mastro VC, Ossenbruggen HS, Raimo BJ (1997) New York’s battle with the Asian long-horned beetle. J Forestry 95: 11-15.
- 7. Haack RA, Hérard F, Sun JH, Turgeon JJ (2010) Managing invasive populations of Asian longhorned beetle and citrus longhorned beetle: A worldwide perspective. Annu Rev Entomol Palo Alto Annual Reviews, 55: 521-546. PubMed: 19743916.
- 8. Geib SM, Filley TR, Hatcher PG, Hoover K, Carlson JE et al. (2008) Lignin degradation in wood-feeding insects. Proc Natl Acad Sci U S A 105: 12932-12937. doi:https://doi.org/10.1073/pnas.0805257105. PubMed: 18725643.
Geib SM, Mdel M[!(surname)!] , Carlson JE, Tien M, Hoover K (2009) Effect of host tree species on cellulase activity and bacterial community composition in the gut of larval Asian longhorned beetle. Environ Entomol 38: 686-699 doi:10.1603/022.038.0320. PubMed: 19508777.
- 10. Kirk TK, Farrell RL (1987) Enzymatic combustion - The microbial-degradation of lignin. Annu Rev Microbiol 41: 465-505. doi:https://doi.org/10.1146/annurev.mi.41.100187.002341. PubMed: 3318677.
- 11. Schloss PD, Delalibera I, Handelsman J, Raffa KF (2006) Bacteria associated with the guts of two wood-boring beetles: Anoplophora glabripennis and Saperda vestita (Cerambycidae). Environ Entomol 35: 625-629. doi:https://doi.org/10.1603/0046-225X-35.3.625.
- 12. Geib SM, Mdel Jimenez-Gasco M, Carlson JE, Tien M, Jabbour R, et al (2009) Microbial community profiling to investigate transmission of bacteria between life stages of the wood-boring beetle, Anoplophora glabripennis. Microb Ecol 58: 199-211. doi:https://doi.org/10.1007/s00248-009-9501-4. PubMed: 19277770.
- 13. Geib SM, Scully ED Jimenez-Gasco MdM, Carlson JE, Tien M, et al. (2012) Phylogenetic analysis of Fusarium solani associated with the Asian longhorned beetle, Anoplophora glabripennis. Insects 3: 141-160.
- 14. Lee SJ, Kim SR, Yoon HJ, Kim I, Lee KS et al. (2004) cDNA cloning, expression, and enzymatic activity of a cellulase from the mulberry longicorn beetle, Apriona germari. Comp Biochem Physiol B Biochem Mol Biol 139: 107-116. doi:https://doi.org/10.1016/j.cbpc.2004.06.015. PubMed: 15364293.
- 15. Sugimura M, Watanabe H, Lo N, Saito H (2003) Purification, characterization, cDNA cloning and nucleotide sequencing of a cellulase from the yellow-spotted longicorn beetle, Psacothea hilaris. Eur J Biochem 270: 3455-3460. doi:https://doi.org/10.1046/j.1432-1033.2003.03735.x. PubMed: 12899703.
- 16. Geib SM, Tien M, Hoover K (2010) Identification of proteins involved in lignocellulose degradation using in gel zymogram analysis combined with mass spectroscopy-based peptide analysis of gut proteins from larval Asian longhorned beetles, Anoplophora glabripennis. J Insect Sci 17: 253-264. doi:https://doi.org/10.1111/j.1744-7917.2010.01323.x.
- 17. Kukor JJ, Cowan DP, Martin MM (1988) The role of ingested fungal enzymes in cellulose digestion in the larvae of cerambycid beetles. Physiol Zool 61: 364-371.
- 18. Brennan Y, Callen WN, Christoffersen L, Dupree P, Goubet F et al. (2004) Unusual microbial xylanases from insect guts. Appl Environ Microbiol 70: 3609-3617. doi:https://doi.org/10.1128/AEM.70.6.3609-3617.2004. PubMed: 15184164.
- 19. King AJ, Cragg SM, Li Y, Dymond J, Guille MJ et al. (2010) Molecular insight into lignocellulose digestion by a marine isopod in the absence of gut microbes. Proc Natl Acad Sci U S A 107: 5345-5350. doi:https://doi.org/10.1073/pnas.0914228107. PubMed: 20212162.
- 20. Coy MR, Salem TZ, Denton JS, Kovaleva ES, Liu Z et al. (2010) Phenol-oxidizing laccases from the termite gut. Insect Biochem Mol Biol 40: 723-732. doi:https://doi.org/10.1016/j.ibmb.2010.07.004. PubMed: 20691784.
- 21. Breznak JA, Brune A (1994) Role of microorganisms in the digestion of lignocellulose by termites. Annu Rev Entomol 39: 453-487. doi:https://doi.org/10.1146/annurev.en.39.010194.002321.
- 22. Mathew GM, Mathew DC, Lo S-C, Alexios GM, Yang J-C et al. (2012) Synergistic collaboration of gut symbionts in Odontotermes formosanus for lignocellulosic degradation and bio-hydrogen production. Bioresour Technol. PubMed: 23298769.
- 23. Francke-Grosmann H (1967) Ectosymbiosis in wood-inhabiting insects. Symbiosis 2: 141-205.
- 24. Jonsell M, Nordlander G, Jonsson M (1999) Colonization patterns of insects breeding in wood-decaying fungi. J Insect Conserv 3: 145-161. doi:https://doi.org/10.1023/A:1009665513184.
- 25. Dillon RJ, Dillon VM (2004) The gut bacteria of insects: Nonpathogenic interactions. Annu Rev Entomol 49: 71-92. doi:https://doi.org/10.1146/annurev.ento.49.061802.123416. PubMed: 14651457.
- 26. Lemke T, Stingl U, Egert M, Friedrich MW, Brune A (2003) Physicochemical conditions and microbial activities in the highly alkaline gut of the humus-feeding larva of Pachnoda ephippiata (Coleoptera: Scarabaeidae). Appl Environ Microbiol 69: 6650-6658. doi:https://doi.org/10.1128/AEM.69.11.6650-6658.2003. PubMed: 14602625.
- 27. Warnecke F, Luginbühl P, Ivanova N, Ghassemian M, Richardson TH et al. (2007) Metagenomic and functional analysis of hindgut microbiota of a wood-feeding higher termite. Nature 450: 560-565. doi:https://doi.org/10.1038/nature06269. PubMed: 18033299.
- 28. Suh SO, Marshall CJ, McHugh JV, Blackwell M (2003) Wood ingestion by passalid beetles in the presence of xylose-fermenting gut yeasts. Mol Ecol 12: 3137-3145. doi:https://doi.org/10.1046/j.1365-294X.2003.01973.x. PubMed: 14629392.
- 29. Xueyan Y, Jiaxi Z, Fugui W, Min C (1995) A study on the feeding habits of the larvae of two species of longicorn (Anoplophora) to different tree species. Journal of Northwest Forestry College 10: 1-6.
- 30. MacLeod A, Evans H, Baker R (2002) An analysis of pest risk from an Asian longhorn beetle Anoplophora glabripennis to hardwood trees in the European community. Crop Protect 21: 635-645. doi:https://doi.org/10.1016/S0261-2194(02)00016-9.
- 31. Engelbrektson A, Kunin V, Wrighton KC, Zvenigorodsky N, Chen F et al. (2010) Experimental factors affecting PCR-based estimates of microbial species richness and evenness. ISME J 4: 642-647. doi:https://doi.org/10.1038/ismej.2009.153. PubMed: 20090784.
- 32. Schloss PD, Westcott SL, Ryabin T, Hall JR, Hartmann M et al. (2009) Introducing mothur: Open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl Environ Microbiol 75: 7537-7541. doi:https://doi.org/10.1128/AEM.01541-09. PubMed: 19801464.
- 33. Edgar RC, Haas BJ, Clemente JC, Quince C, Knight R (2011) UCHIME improves sensitivity and speed of chimera detection. Bioinformatics 27: 2194-2200. doi:https://doi.org/10.1093/bioinformatics/btr381. PubMed: 21700674.
- 34. Altschul SF, Madden TL, Schäffer AA, Zhang JH, Zhang Z et al. (1997) Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res 25: 3389-3402. doi:https://doi.org/10.1093/nar/25.17.3389. PubMed: 9254694.
- 35. Wang Q, Garrity GM, Tiedje JM, Cole JR (2007) Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl Environ Microbiol 73: 5261-5267. doi:https://doi.org/10.1128/AEM.00062-07. PubMed: 17586664.
- 36. Huson DH, Auch AF, Qi J, Schuster SC (2007) MEGAN analysis of metagenomic data. Genome Res 17: 377-386. doi:https://doi.org/10.1101/gr.5969107. PubMed: 17255551.
Zwickl D (2006) Genetic algorithm approaches for the phylogenetic analysis of large biological sequence datasets under the maxmum likelihood criterion. PhD dissertation available at http://wwwnescentorg/wg_garli. Austin: University of Texas.
- 38. Darriba D, Taboada GL, Doallo R, Posada D (2012) jModelTest 2: More models, new heuristics and parallel computing. Nat Methods 9: 772. doi:https://doi.org/10.1038/nmeth.2111. PubMed: 22847109.
- 39. Kunin V, Copeland A, Lapidus A, Mavromatis K, Hugenholtz P (2008) A bioinformatician’s guide to metagenomics. Microbiol Mol Biol Rev 72: 557-578, Table of Contents doi:https://doi.org/10.1128/MMBR.00009-08. PubMed: 19052320.
- 40. Lowe TM, Eddy SR (1997) tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25: 0955-0964. doi:https://doi.org/10.1093/nar/25.5.955. PubMed: 9023104.
- 41. Eddy SR (1998) HMMER: Profile hidden Markov models for biological sequence analysis. Bioinformatics, 14: 755-763. PubMed: 9918945.
- 42. Huang Y, Gilna P, Li W (2009) Identification of ribosomal RNA genes in metagenomic fragments. Bioinformatics 25: 1338-1340. doi:https://doi.org/10.1093/bioinformatics/btp161. PubMed: 19346323.
- 43. Pruesse E, Quast C, Knittel K, Fuchs BM, Ludwig W et al. (2007) SILVA: A comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Res 35: 7188-7196. doi:https://doi.org/10.1093/nar/gkm864. PubMed: 17947321.
- 44. Meyer F, Paarmann D, D’Souza M, Olson R, Glass EM et al. (2008) The metagenomics RAST server - A public resource for the automatic phylogenetic and functional analysis of metagenomes. BMC Bioinformatics 9: 386. doi:https://doi.org/10.1186/1471-2105-9-386. PubMed: 18803844.
- 45. Marchler-Bauer A, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C et al. (2009) CDD: Specific functional annotation with the Conserved Domain Database. Nucleic Acids Res 37: D205-D210. doi:https://doi.org/10.1093/nar/gkn845. PubMed: 18984618.
- 46. Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B et al. (2003) The COG database: An updated version includes eukaryotes. BMC Bioinformatics 4: 41. doi:https://doi.org/10.1186/1471-2105-4-41. PubMed: 12969510.
- 47. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H et al. (2000) Gene Ontology: Tool for the unification of biology. Nat Genet 25: 25-29. doi:https://doi.org/10.1038/75556. PubMed: 10802651.
Kanehisa M (2008) The KEGG database. ‘In silico’ simulation of biological processes. John Wiley & Sons, Ltd.. pp. 91-103.
- 49. Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M (2005) Blast2GO: A universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21: 3674-3676. doi:https://doi.org/10.1093/bioinformatics/bti610. PubMed: 16081474.
- 50. Ye Y, Doak TG (2009) A parsimony approach to biological pathway reconstruction/inference for genomes and metagenomes. PLOS Comput Biol 5: 1-8 (e1000465).
- 51. Bateman A, Coin L, Durbin R, Finn RD, Hollich V et al. (2004) The Pfam protein families database. Nucleic Acids Res 32: D138-D141. doi:https://doi.org/10.1093/nar/gkh121. PubMed: 14681378.
- 52. Cantarel BL, Coutinho PM, Rancurel C, Bernard T, Lombard V et al. (2009) The Carbohydrate-Active EnZymes database (CAZy): An expert resource for glycogenomics. Nucleic Acids Res 37: D233-D238. doi:https://doi.org/10.1093/nar/gkn663. PubMed: 18838391.
- 53. Robinson T, McMullan G, Marchant R, Nigam P (2001) Remediation of dyes in textile effluent: A critical review on current treatment technologies with a proposed alternative. Bioresour Technol 77: 247-255. doi:https://doi.org/10.1016/S0960-8524(00)00080-8. PubMed: 11272011.
- 54. Engel P, Moran NA (2013) The gut microbiota of insects–diversity in structure and function. FEMS Microbiol Rev (published online ahead of print).
- 55. Schoch CL, Seifert KA, Huhndorf S, Robert V, Spouge JL et al. (2012) Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA bar code marker for fungi. Proc Natl Acad Sci U S A 109: 6241-6246. doi:https://doi.org/10.1073/pnas.1117018109. PubMed: 22454494.
- 56. Suh SO, McHugh JV, Pollock DD, Blackwell M (2005) The beetle gut: A hyperdiverse source of novel yeasts. Mycol Res 109: 261-265. doi:https://doi.org/10.1017/S0953756205002388. PubMed: 15912941.
- 57. Bass M, Cherrett J (1995) Fungal hyphae as a source of nutrients for the leaf-cutting ant Atta sexdens. Physiol Entomol 20: 1-6. doi:https://doi.org/10.1111/j.1365-3032.1995.tb00793.x.
- 58. Talbot P (1977) The Sirex-Amylostereum-Pinus association. Annu Rev Phytopathol 15: 41-54. doi:https://doi.org/10.1146/annurev.py.15.090177.000353.
- 59. Paine TD, Raffa KF, Harrington TC (1997) Interactions among scolytid bark beetles, their associated fungi, and live host conifers. Annu Rev Entomol 42: 179-206. doi:https://doi.org/10.1146/annurev.ento.42.1.179. PubMed: 15012312.
- 60. Logan JWM, Cowie RH, Wood T (1990) Termite (Isoptera) control in agriculture and forestry by non-chemical methods: A review. Bull Entomol Res 80: 309-330. doi:https://doi.org/10.1017/S0007485300050513.
- 61. Sutherland JB, Pometto AL III, Crawford DL (1983) Lignocellulose degradation by Fusarium species. Can J Bot 61: 1194-1198. doi:https://doi.org/10.1139/b83-126.
- 62. Scully ED, Hoover K, Carlson J, Tien M, Geib SM (2012) Proteomic analysis of Fusarium solani isolated from the Asian longhorned beetle, Anoplophora glabripennis. PLOS ONE 7: e32990. doi:https://doi.org/10.1371/journal.pone.0032990. PubMed: 22496740.
- 63. Suen G, Scott JJ, Aylward FO, Adams SM, Tringe SG et al. (2010) An insect herbivore microbiome with high plant biomass-degrading capacity. PLOS Genet 6: e1001129. PubMed: 20885794.
- 64. Waldo DR, Smith LW, Cox EL (1972) Model of cellulose disappearance from the rumen. J Dairy Sci 55: 125-129. doi:https://doi.org/10.3168/jds.S0022-0302(72)85442-0. PubMed: 5009526.
- 65. Mueller UG, Gerardo NM, Aanen DK, Six DL, Schultz TR (2005) The evolution of agriculture in insects. Annu Rev Ecol Evol Syst: 563-595.
- 66. Bridges JR (1983) Mycangial fungi of Dendroetonns frontalis (Coleoptera: Scolytidae) and their relationship to beetle population trends. Environ Entomol 12: 858-861.
- 67. Norris DM (1980) Degradation of 14C-labeled lignins and 14C-labeled aromatic acids by Fusarium solani. Appl Environ Microbiol 40: 376-380. PubMed: 16345616.
Bordeaux JM (2008) Characterization of growth conditions for production of a laccase-like phenoloxidase by Amylostereum areolatum, a fungal pathogen of pines and other conifers.
- 69. Carnegie AJ, Matsuki M, Haugen DA, Hurley BP, Ahumada R et al. (2006) Predicting the potential distribution of Sirex noctilio (Hymenoptera: Siricidae), a significant exotic pest of Pinus plantations. Ann Forest Sci 63: 119-128. doi:https://doi.org/10.1051/forest:2005104.
- 70. Ouzounis C, Sander C (1991) A structure-derived sequence pattern for the detection of type I copper binding domains in distantly related proteins. FEBS Lett 279: 73-78. doi:https://doi.org/10.1016/0014-5793(91)80254-Z. PubMed: 1995346.
- 71. Xu F, Shin W, Brown SH, Wahleithner JA, Sundaram UM et al. (1996) A study of a series of recombinant fungal laccases and bilirubin oxidase that exhibit significant differences in redox potential, substrate specificity, and stability. Biochim Biophys Acta Protein Struct Mol Enzymol 1292: 303-311. doi:https://doi.org/10.1016/0167-4838(95)00210-3.
- 72. Eggert C, Temp U, Dean JF, Eriksson KE (1996) A fungal metabolite mediates degradation of non-phenolic lignin structures and synthetic lignin by laccase. FEBS Lett 391: 144-148. doi:https://doi.org/10.1016/0014-5793(96)00719-3. PubMed: 8706903.
- 73. Hatfield R, Vermerris W (2001) Lignin formation in plants. The dilemma of linkage specificity. Plant Physiol 126: 1351-1357. doi:https://doi.org/10.1104/pp.126.4.1351. PubMed: 11500535.
- 74. Ahmad M, Roberts JN, Hardiman EM, Singh R, Eltis LD et al. (2011) Identification of DypB from Rhodococcus jostii RHA1 as a lignin peroxidase. Biochemistry 50: 5096-5107. doi:https://doi.org/10.1021/bi101892z. PubMed: 21534568.
- 75. Sugano Y, Sasaki K, Shoda M (1999) cDNA cloning and genetic analysis of a novel decolorizing enzyme, peroxidase gene dyp from Geotrichum candidum. J Biosci Bioeng 87: 411-417. doi:https://doi.org/10.1016/S1389-1723(99)80087-5. PubMed: 16232492.
- 76. Vuilleumier S (1997) Bacterial glutathione S-transferases: What are they good for? J Bacteriol 179: 1431–1441. PubMed: 9045797.
- 77. Masai E, Kubota S, Katayama Y, Kawai S, Yamasaki M et al. (1993) Characterization of the C alpha-dehydrogenase gene involved in the cleavage of beta-aryl ether by Pseudomonas paucimobilis. Biosci Biotechnol Biochem 57: 1655–1659. doi:https://doi.org/10.1271/bbb.57.1655. PubMed: 7764263.
- 78. Masai E, Katayama Y, Kubota S, Kawai S, Yamasaki M et al. (1993) A bacterial enzyme degrading the model lignin compound beta-etherase is a member of the glutathione-S-transferase superfamily. FEBS Lett 323: 135-140. doi:https://doi.org/10.1016/0014-5793(93)81465-C. PubMed: 8495726.
- 79. Armstrong RN (1997) Structure, catalytic mechanism, and evolution of the glutathione transferases. Chem Res Toxicol 10: 2-18. doi:https://doi.org/10.1021/tx960072x. PubMed: 9074797.
- 80. Taylor CR, Hardiman EM, Ahmad M, Sainsbury PD, Norris PR et al. (2012) Isolation of bacterial strains able to metabolize lignin from screening of environmental samples. J Appl Microbiol 113: 521-530. doi:https://doi.org/10.1111/j.1365-2672.2012.05352.x. PubMed: 22642383.
- 81. Sethi A, Slack JM, Kovaleva ES, Buchman GW, Scharf ME (2013) Lignin-associated metagene expression in a lignocellulose-digesting termite. Insect Biochem Mol Biol 43: 91-101. doi:https://doi.org/10.1016/j.ibmb.2012.10.001. PubMed: 23108206.
- 82. Timell TE (1964) Wood Hemicelluloses. I. Adv Carbohydr Chem 19: 247-302. doi:https://doi.org/10.1016/S0096-5332(08)60284-2. PubMed: 14272330.
- 83. Kuyper M, Harhangi HR, Stave AK, Winkler AA, Jetten MS et al. (2003) High-level functional expression of a fungal xylose isomerase: the key to efficient ethanolic fermentation of xylose by Saccharomyces cerevisiae? FEMS Yeast Res 4: 69-78. doi:https://doi.org/10.1016/S1567-1356(03)00141-7. PubMed: 14554198.
- 84. Walfridsson M, Bao X, Anderlund M, Lilius G, Bülow L et al. (1996) Ethanolic fermentation of xylose with Saccharomyces cerevisiae harboring the Thermus thermophilus xylA gene, which expresses an active xylose (glucose) isomerase. Appl Environ Microbiol 62: 4648-4651. PubMed: 8953736.
Bolotin A, Wincker P, Mauger S, Jaillon O, Malarme K et al. (2001) The complete genome sequence of the lactic acid bacterium Lactococcus lactis ssp. lactis IL1403. Genome Res 11: 731-753. doi:10.1101/gr.GR-1697R PubMed: 11337471.
- 86. Ohta K, Beall DS, Mejia JP, Shanmugam KT, Ingram LO (1991) Genetic improvement of Escherichia coli for ethanol production: chromosomal integration of Zymomonas mobilis genes encoding pyruvate decarboxylase and alcohol dehydrogenase II. Appl Environ Microbiol 57: 893-900. PubMed: 2059047.
- 87. Becker G (1971) Physiological influences on wood-destroying insects of wood compounds and substances produced by microorganisms. Wood Sci Technol 5: 236-246. doi:https://doi.org/10.1007/BF00353686.
- 88. Kohn Ral O (1977) Intermolecular calcium ion binding on polyuronate-polygalacturonate and polyguluronate. Collect Czech Chem Comun 42: 731-744. doi:https://doi.org/10.1135/cccc19770731.
- 89. Consortium TS (2008) The genome of the model beetle and pest Tribolium castaneum. Nature 452: 949-955. doi:https://doi.org/10.1038/nature06784. PubMed: 18362917.
- 90. Mattson WJ (1980) Herbivory in relation to plant nitrogen-content. Annu Rev Ecol Syst 11: 119-161. doi:https://doi.org/10.1146/annurev.es.11.110180.001003.
- 91. Keller B, Templeton MD, Lamb CJ (1989) Specific localization of a plant-cell wall glycine-rich protein in protoxylem cells of the vascular system. Proc Natl Acad Sci U S A 86: 1529-1533. doi:https://doi.org/10.1073/pnas.86.5.1529. PubMed: 16578841.
- 92. Calderón-Cortés N, Watanabe H, Cano-Camacho H, Zavala-Páramo G, Quesada M (2010) cDNA cloning, homology modelling and evolutionary insights into novel endogenous cellulases of the borer beetle Oncideres albomarginata chamela (Cerambycidae). Insect Mol Biol 19: 323-336. doi:https://doi.org/10.1111/j.1365-2583.2010.00991.x. PubMed: 20201981.
- 93. Pauchet Y, Wilkinson P, Chauhan R, Ffrench-Constant RH (2010) Diversity of beetle genes encoding novel plant cell wall degrading enzymes. PLOS ONE 5: e15635. doi:https://doi.org/10.1371/journal.pone.0015635. PubMed: 21179425.
- 94. Aw T, Schlauch K, Keeling CI, Young S, Bearfield JC et al. (2010) Functional genomics of mountain pine beetle (Dendroctonus ponderosae) midguts and fat bodies. BMC Genomics 11: 215. doi:https://doi.org/10.1186/1471-2164-11-215. PubMed: 20353591.
- 95. Keeling CI, Yuen MM, Liao NY, Docking TR, Chan SK et al. (2013) Draft genome of the mountain pine beetle, Dendroctonus ponderosae Hopkins, a major forest pest. Genome Biol 14: R27. doi:https://doi.org/10.1186/gb-2013-14-3-r27. PubMed: 23537049.
- 96. Douglas AE (2009) The microbial dimension in insect nutritional ecology. Funct Ecol 23: 38-47. doi:https://doi.org/10.1111/j.1365-2435.2008.01442.x.
- 97. Fraenkel G, Printy GE (1954) The amino acid requirements of the confused flour beetle, Tribolium confusum, Duval. Biol Bull 106: 149-157. doi:https://doi.org/10.2307/1538708.
Beaver R, Wilding N, Collins N, Hammond P, Webber J (1989) Insect-fungus relationships in the bark and ambrosia beetles. Academic Press. pp. 121-143.
- 99. Morales-Jiménez J, Zúñiga G, Villa-Tanaca L, Hernández-Rodríguez C (2009) Bacterial community and nitrogen fixation in the red turpentine beetle, Dendroctonus valens LeConte (Coleoptera: Curculionidae: Scolytinae). Microb Ecol 58: 879-891. doi:https://doi.org/10.1007/s00248-009-9548-2. PubMed: 19543937.
- 100. Grünwald S, Pilhofer M, Höll W (2010) Microbial associations in gut systems of wood- and bark-inhabiting longhorned beetles [Coleoptera: Cerambycidae]. Syst Appl Microbiol 33: 25-34. doi:https://doi.org/10.1016/j.syapm.2009.10.002. PubMed: 19962263.
- 101. Brodbeck BV, Andersen PC, Mizell RF (1995) Differential utilization of nutrients during development by the xylophagous leafhopper, Homalodisca coagulata. Entomol Exp Applicata 75: 279-289. doi:https://doi.org/10.1111/j.1570-7458.1995.tb01938.x.
- 102. Breznak JA (1982) Intestinal microbiota of termites and other xylophagous insects. Annual Reviews Microbiol 36: 323-323. doi:https://doi.org/10.1146/annurev.mi.36.100182.001543. PubMed: 6756291.
- 103. Baitsch D, Sandu C, Brandsch R, Igloi GL (2001) Gene cluster on pAO1 of Arthrobacter nicotinovorans involved in degradation of the plant alkaloid nicotine: Cloning, purification, and characterization of 2, 6-dihydroxypyridine 3-hydroxylase. J Bacteriol 183: 5262-5267. doi:https://doi.org/10.1128/JB.183.18.5262-5267.2001. PubMed: 11514508.
- 104. Raybuck SA (1992) Microbes and microbial enzymes for cyanide degradation. Biodegradation 3: 3-18. PubMed: 1369135.
- 105. Clark AJ, Bloch K (1959) The absence of sterol synthesis in insects. J Biol Chem 234: 2578-2582. PubMed: 13810427.
- 106. Six DL, Stone WD, de Beer ZW, Woolfolk SW (2009) Ambrosiella beaveri, sp nov., associated with an exotic ambrosia beetle, Xylosandrus mutilatus (Coleoptera: Curculionidae, Scolytinae), in Mississippi, USA. Antonie Van Leeuwenhoek Int J Gen Mol Microbiol 96: 17-29. doi:https://doi.org/10.1007/s10482-009-9331-x.
- 107. Robbins W, Kaplanis J, Svoboda J, Thompson M (1971) Steroid metabolism in insects. Annu Rev Entomol 16: 53-72. doi:https://doi.org/10.1146/annurev.en.16.010171.000413.
- 108. Louloudes SJ, Kaplanis J, Robbins W, Monroe R (1961) Lipogenesis from C14-acetate by the American cockroach. Ann Entomol Soc Am 54: 99-103.
- 109. Taylor AM, Gartner BL, Morrell JJ (2002) Heartwood formation and natural durability—A review. Wood Fiber Sci 34: 587-611.
- 110. Genta FA, Dillon RJ, Terra WR, Ferreira C (2006) Potential role for gut microbiota in cell wall digestion and glucoside detoxification in Tenebrio molitor larvae. J Insect Physiol 52: 593-601. doi:https://doi.org/10.1016/j.jinsphys.2006.02.007. PubMed: 16600286.
- 111. Dowd PF (1992) Insect Fungal Symbionts - A Promising Source of Detoxifying Enzymes. J Ind Microbiol 9: 149-161. doi:https://doi.org/10.1007/BF01569619.
- 112. Adams AS, Aylward FO, Adams SM, Erbilgin N, Aukema BH et al. (2013) Mountain pine beetles colonizing historical and naïve host trees are associated with a bacterial community highly enriched in genes contributing to terpene metabolism. Appl Environ Microbiol 79: 3468-3475. doi:https://doi.org/10.1128/AEM.00068-13. PubMed: 23542624.
- 113. Schuler MA (1996) The role of cytochrome P450 monooxygenases in plant-insect interactions. Plant Physiol 112: 1411–1419. doi:https://doi.org/10.1104/pp.112.4.1411. PubMed: 8972591.
- 114. Daayf F, El Hadrami A, El-Bebany AF, Henriquez MA, Yao Z et al. (2012) Phenolic compounds in plant defense and pathogen counter-defense mechanisms. Recent Advances in Polyphenol Research 3: 191.
- 115. Bugg TD, Ahmad M, Hardiman EM, Singh R (2011) The emerging role for bacteria in lignin degradation and bio-product formation. Curr Opin Biotechnol 22: 394-400. doi:https://doi.org/10.1016/j.copbio.2010.10.009. PubMed: 21071202.
- 116. Ryan CA (1990) Protease inhibitors in plants: Genes for improving defenses against insects and pathogens. Annu Rev Phytopathol 28: 425-449. doi:https://doi.org/10.1146/annurev.phyto.28.1.425.
- 117. Jongsma MA, Bakker PL, Peters J, Bosch D, Stiekema WJ (1995) Adaptation of Spodoptera exigua larvae to plant proteinase inhibitors by induction of gut proteinase activity insensitive to inhibition. Proc Natl Acad Sci USA 92: 8041-8045. doi:https://doi.org/10.1073/pnas.92.17.8041. PubMed: 7644535.
- 118. Chu C-C, Spencer JL, Curzi MJ, Zavala JA, Seufferheld MJ (2013) Gut bacteria facilitate adaptation to crop rotation in the western corn rootworm. Proc Natl Acad Sci U S A 110: 11917-11922. doi:https://doi.org/10.1073/pnas.1301886110. PubMed: 23798396.
- 119. Lehman RM, Lundgren JG, Petzke LM (2009) Bacterial communities associated with the digestive tract of the predatory ground beetle, Poecilus chalcites, and their modification by laboratory rearing and antibiotic treatment. Microb Ecol 57: 349-358. doi:https://doi.org/10.1007/s00248-008-9415-6. PubMed: 18587608.