Figures
Abstract
The European house dust mite Dermatophagoides pteronyssinus is of significant medical importance as it is a major elicitor of allergic illnesses. In this analysis we have undertaken comprehensive bioinformatic and proteomic examination of Dermatophagoides pteronyssinus airmid, identified 12,530 predicted proteins and validated the expression of 4,002 proteins. Examination of homology between predicted proteins and allergens from other species revealed as much as 2.6% of the D. pteronyssinus airmid proteins may cause an allergenic response. Many of the potential allergens have evidence for expression (n = 259) and excretion (n = 161) making them interesting targets for future allergen studies. Comparative proteomic analysis of mite body and spent growth medium facilitated qualitative assessment of mite group allergen localisation. Protein extracts from house dust contain a substantial number of uncharacterised D. pteronyssinus proteins in addition to known and putative allergens. Novel D. pteronyssinus proteins were identified to be highly abundant both in house dust and laboratory cultures and included numerous carbohydrate active enzymes that may be involved in cuticle remodelling, bacteriophagy or mycophagy. These data may have clinical applications in the development of allergen-specific immunotherapy that mimic natural exposure. Using a phylogenomic approach utilising a supermatrix and supertree methodologies we also show that D. pteronyssinus is more closely related to Euroglyphus maynei than Dermatophagoides farinae.
Citation: Waldron R, McGowan J, Gordon N, McCarthy C, Mitchell EB, Fitzpatrick DA (2019) Proteome and allergenome of the European house dust mite Dermatophagoides pteronyssinus. PLoS ONE 14(5): e0216171. https://doi.org/10.1371/journal.pone.0216171
Editor: Ben J. Mans, Onderstepoort Veterinary Institute, SOUTH AFRICA
Received: January 18, 2019; Accepted: April 15, 2019; Published: May 1, 2019
Copyright: © 2019 Waldron et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: RW is a recipient of an Irish Research Council Enterprise Partnership Scheme Scholar (EPSPG/2014/64). CGPM is funded by an Irish Research Council Government of Ireland Postgraduate Scholarship (Grant No. GOIPG/2015/2242). JM is funded by a postgraduate scholarship from the Irish Research Council, Government of Ireland (grant number GOIPG/2016/1112). Mass spectrometry facilities were funded by Science Foundation Ireland (SFI 12/RI/2346(3)). We acknowledge the DJEI/DES/SFI/HEA Irish Centre for High-End Computing (ICHEC) for the provision of computational facilities and support. EBM and NG are employees of Airmid Healthgroup Ltd. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: EBM and NG are both employees of Airmid Healthgroud Ltd. This does not alter our adherence to PLOS ONE policies on sharing data and materials.
Introduction
House dust mites (HDM) are the most prevalent source of indoor allergens worldwide, with 1–2% of the total population experiencing an allergic response in their presence [1]. HDM allergens are major causative agents in the pathogenesis of asthma, allergic rhinitis and atopic dermatitis [2]. Protease allergens disrupt the epithelial barrier and activate immune cells resulting in the production of large amounts of proinflammatory cytokines [3, 4]. Sero-dominant allergens; Der p 1and Der p 2 account for 50–60% of IgE reactivity in individuals tested [5]. Allergenic protein families represent only 2% of all protein families. Allergenicity and cross-reactivity is linked to the allergen family rather than allergen source [6]. Use of publicly available allergen databases to query newly sequenced genomes for the presence of potentially allergenic or cross-reactive proteins has enormous potential in identifying new allergens.
HDM allergens are either located within the mite body or in faecal particles. Current knowledge of mite allergen localisation is limited [7] and may be improved by employing comparative proteomics to study the mite body and spent culture media proteins of D. pteronyssinus. Proteins present in HDM faeces are of particular importance as faecal particles are inhaled deep into the lungs, due to their small size [4, 8, 9]. Very little is known about which HDM components are present in house dust or inhalable air, and assessment is limited to allergens for which there are ELISAs [10]. Therefore, it is our belief that characterising D. pteronyssinus proteins present in house dust could potentially yield much needed insights into allergens present in house dust.
Previous studies have shown that approximately 50% of all European homes contain HDM [11]. Therefore, evidenced based biocontrol strategies are needed to curtail HDM populations in homes. A Cochrane review of commonly used physical, chemical and combined physical chemical HDM control strategies has shown no clinical benefit or evidence that these control measures can reduce exposure to HDMs, their allergens or the severity/frequency of asthma symptoms [12]. Reducing humidity within the home has been proposed as a means of constraining HDM populations and limiting allergen production [13]. However, transient exposure to moist air allows for long term survival and reproduction indicating HDM may employ mechanisms to resist desiccation [14]. Therefore, genomic and proteomic characterisation of D. pteronyssinus has the potential to reveal biochemical pathways that could be exploited in future biocontrol strategies. These “Omic” approaches have accelerated digestive enzyme discovery [15], enabling in silico prediction of biochemical activities coupled with measurement of gene or protein expression. Potent enzymes are excreted by HDM into their surroundings as a by-product of their digestive processes, therefore the presence of putative enzymes in faeces is a strong indicator of a digestive function [16]. Surveying the predicted proteome of D. pteronyssinus airmid and subsequent proteomic examination of enzyme expression and localisation could identify new enzymes utilised in nutrient acquisition.
Here we describe the proteome of the European HDM D. pteronyssinus using a strain of mite housed at airmid healthgroup ltd. We have analysed the proteome in an attempt to elucidate the phylogenetic relationships between different species of HDMs and determined the localisation of allergenic components and enzymes involved in nutrient acquisition. The predicted proteome provides the basis to further understand the reported cross-reactivity between HDM and phylogenetically distinct species. We have examined and report the D. pteronyssinus airmid predicted proteome, mite body proteome and excretome with reference to a wild-type proteome as a means of (i) identifying potentially allergenic proteins, (ii) inferring localisation of allergenic and potentially allergenic molecules, and (iii) identifying proteins involved in key physiological processes.
Materials and methods
Genomic data
To construct and compare the phylogenetic relationships between the Acari, 12 genomes were downloaded from the NCBI database for use in the analyses (S1 Table). Two Arachnid outgroups namely the Arizona bark scorpion (Centruroides sculpturatus) and the American house spider (Parasteatoda tepidariorum) were also downloaded. Assembly completeness of each genome was assessed using BUSCO v3 (Benchmarking Universal Single-Copy Ortholog) [17] with the Arthropoda dataset. Comparative analysis of D. pteronyssinus genome assemblies were conducted against previously published D. pteronyssinus strains [18, 19].
Phylogenetic analysis
Phylogenomic analysis of D. pteronyssinus was undertaken with reference to 11 other species (subclass Acari), consisting of six Parasitiformes and five Acariformes (S1 Table). Supermatrix and supertree phylogenomic methods were employed to infer the evolutionary relationships. Suitable phylogenetic markers were selected by locating single copy orthologs (from BUSCO analysis above) in the Acari and outgroup genomes. Single copy orthologs that were ubiquitously present (n = 111) were aligned using MUSCLE [20]. Individual gene families alignments were subsequently concatenated to yield a supermatrix 77,878 amino acids in length. This supermatrix was used to reconstruct a maximum likelihood phylogenomic species tree using RAxML [21] utilising the LG+G+I+F model as selected by ProtTest [22], branch supports were determined using 100 bootstrap replicates.
Single copy ortholog families, present in at least four species (n = 2,796), were identified and individually aligned using MUSCLE. Subsequent phylogenies were generated using FastTree [23]. A supertree was constructed using the matrix representation with parsimony (MRP) method implemented in Clann [24] using the 2,796 gene trees as input with 100 bootstrap replicates. The resultant phylogeny was visualised and annotated using the Interactive Tree of Life (iTOL) [25].
Proteo-genomic analysis of assembly completeness
Proteo-genomic software Peppy [26] was used to generate peptide databases from three D. pteronyssinus assemblies [18, 19, 27]. LC-MS/MS spectra derived from proteomic experimentation on D. pteronyssinus airmid were searched against the six-frame translated genomes (maximum FDR 0.01; precursor tolerance 2000; fragment tolerance 300; digestion rules–cleavage acid R & K, missed cleavages 1; static mods -mod C 57.021464). For completeness and comparative purposes, the spectra were searched against the corresponding translated predicted protein-coding genes for each assembly. Output files were filtered to locate unique peptides and corresponding genomic locations.
Annotation of predicted proteome
Annotation of predicted proteins was achieved using BLAST2GO Version 5.0 [28] to sequentially search SwissProt (Downloaded; 14/01/2018) then NCBInr (Downloaded; 30/08/2017) database. Gene Ontology (GO) terms were assigned to predicted proteins (GO cut off 55, GO weight 5, E value hit filter E-06 and default computational evidence codes). InterPro Scan was used to identify; families, domains, sites and repeats in predicted proteins (CCD, HAMAP, HMMPanther, HMMpfam, FPrintscan, BLASTPromDom) and performed secretion peptide prediction using SignalP ver4.0 in parallel. Mapping feature facilitated mapping of GO terms to enzyme codes.
D. pteronyssinus specific proteins
D. pteronyssinus predicted proteins without BLASTp homology (n = 3,906) to proteins in NCBInr/SwissProt were searched (tBLASTn; E value ≤ E-05) against Acari genomes for presence of homologs in closely related species (S1 Table). Predicted proteins without significant homology were considered D. pteronyssinus-specific proteins (n = 1,848) and were then searched against other the other two available D. pteronyssinus genome assemblies [18, 19]. Proteins without homology to predicted proteins in these assemblies were considered D. pteronyssinus airmid-specific proteins.
Identification of LEA-like proteins
LEA homologs were identified by performing BLASTp (E-value ≤ 1E-05) searches [29] against the Late Embryogenesis Abundant Proteins Database (LEAPdb) [30].
Mite group allergen orthologs
The majority of mite allergens identified to date have been assigned to groups (Group 1–33) in accordance with their order of discovery [7]. Chan et al (2015) reported a further seven non-chronological allergens for D. farinae [31]. Mite group allergen (MGA) orthologs were identified by performing BLASTp searches of query FASTA files containing MGA from D. pteronyssinus & D. farinae (when no other sequences were available for D. pteronyssinus) against the Acari genomes utilised in this study (S1 Table). Allergen orthologs had to satisfy the following criteria; have reciprocal best BALST hits (RBH) with an allergen (E-value ≤ 1E-05) and a minimum alignment length of 80 amino acids with an identity ≥ 35% in accordance with FAO/WHO guidelines [32]. Allergens that had a RBH but did not meet the alignment and identity criteria were considered RBH homologs, these were included in the sequence similarity network and visualised using Gephi [33]. Each protein was represented by a node, two proteins were connected by an undirected edge if they were homologous (BLASTp ≥ 1E-05).
Allergens and predicted allergens
D. pteronyssinus airmid MGA and non-chronological allergens were located by performing a local BLASTp (E-value ≤ 1E-03) search of the D. pteronyssinus predicted proteome against the FASTA sequence file containing the MGAs located above. The match with the smallest E-value was chosen as designated MGA. These sequences were annotated as “Der p1 Allergen” or Der f 22 like-allergen etc. for D. farinae based BLAST hits. Subsequent BLASTp hits were considered MGA homologs and were assessed for potential cross-reactivity with MGA in accordance with FAO/WHO guidelines [32]. Sequence similarity between known allergens (Uniprot “allergenome”) and D. pteronyssinus airmid predicted proteins was assessed in accordance with FAO/WHO guidelines [32].
D. pteronyssinus airmid culture
D. pteronyssinus airmid were obtained from cultures housed at airmid healthgroup ltd (Dublin, Ireland) and maintained on diet composed of dried porcine liver and yeast, house dust mite maximal media (HDMMM, airmid healthgroup ltd, Ireland) at 75% relative humidity and 25°C.
Harvest of mites and spent culture medium
Mites from replicate cultures (n = 5) were separated from spent culture medium (SM) by sieving, and saline floatation method [16], washed with distilled water and surface-sterilised by submersion in 70% ethanol (3 min) followed by washing with sterile distilled water. An average of 10.8 mites were present per mg of spent medium. Mites were then snap frozen in liquid nitrogen, and lyophilised. SM (n = 5) was divided into aliquots (200 mg) and stored at -70°C prior to protein extraction.
Protein extraction
Lyophilised mite bodies (MB) were ground to a fine powder. Proteins were extracted from MB (25 mg) by addition of 500 μl lysis buffer and quantified according to methods described by Owens et al (2015) [34]. Proteins from spent culture medium (SM, n = 5) were extracted by addition of glass beads (50 mg; 0.5 mm BioSpec Products) and 1000 μl lysis buffer [34], followed by bead-beating (30 Hz; 5 min; MM300, Retsch®). MB (n = 2) and SM lysates (n = 2) were pooled for gel filtration (one biological replicate). Protein lysates for shotgun proteomic analysis (n = 4) were normalised (MB; 0.4 mg/ml, SM 0.3 mg/ml), then prepared for digestion according to Owens et al (2015) [34].
Culture media contaminants
Proteins deriving from HDMMM (200 mg) were extracted (n = 1) and prepared for shotgun proteomic analysis as per methods for spent growth culture medium.
House dust protein extracts
Der p 1 positive (0.2–16.94 μg Der p 1 per gram house dust) house dust protein extracts (n = 21) were provided by airmid healthgroup ltd (Dublin, Ireland).
Gel filtration chromatography
Gel filtration chromatography was carried out using an ÄKTA purifier coupled with a Superdex 200 10/300 GL column (GE Healthcare, Germany), equilibrated in PBS. Filtered specimens (0.22 μm), were injected (500 μl) and separated (flow rate 0.4 ml/min) with absorbance monitored at 215, 254 and 280 nm. Fractions were collected (between ~ 6 and 32 ml) and stored at -20°C until further analysis.
Proteolytic digestion of protein extracts for proteomic analysis
Specimens for proteomic analysis were prepared for LC-MS/MS as described by Owens et al (2015) [34].
Nano-flow liquid chromatography electro-spray ionization tandem mass spectrometry (LC-MS/MS) analysis
Peptide mixtures were analysed using a Thermo Fisher Q-Exactive mass spectrometer coupled to a Dionex RSLC nano for LC-MS/MS analysis. LC gradients operated from 3–40% acetonitrile over 40 min, with data collection using a Top15 method for MS/MS scans [35].
Representative proteome
LC-MS/MS spectra obtained from proteomic analysis of D. pteronyssinus airmid were randomised into 5 groups (14–20 files each). Spectra were searched using Sequest HT engine within Proteome Discoverer (Version 1.4) against D. pteronyssinus airmid predicted proteome (peptide filters; set to medium peptide confidence and protein filters; set to two peptides per protein). Protein molecular weight and pI for predicted proteome and representative proteome were calculated using JVirGel [36].
MaxQuant and perseus data analysis
Protein identification and label free quantitative (LFQ) analysis was conducted using MaxQuant (Version 1.6.1.0; http://maxquant.org/), statistical analysis of MaxQuant output data was performed by Perseus (Version 1.6.2.2) as described in O’Keeffe et al (2014) [37].
Culture media contaminants database
As it was not possible to fully remove culture media from specimens prior to proteomic analysis, a custom contaminates database was generated. This allowed for exclusion of protein identifications deriving from culture media (HDMMM) which contained porcine liver and baker’s yeast. Spectra obtained from LC-MS/MS of HDMMM were interrogated (MaxQuant and Perseus) against combined proteomic database of Sus scrofa and Saccharomyces cerevisiae, resulting in the identification of 2,135 proteins (min. 1 peptide). These proteins were added to MaxQuant contaminants database to generate a custom contaminants database (n = 2,380 contaminants).
Data analysis of D. pteronyssinus proteomes
Spectra obtained from LC-MS/MS were interrogated (MaxQuant and Perseus) using either standard contaminant (HD samples) or custom contaminant (MB & SM samples) databases. Proteins were considered present when a minimum of two peptides (1 unique) for each parent protein was observed. Proteins meeting the following criteria were included in the analysis; (i) identified in two of the four non-fractionated whole protein extracts or (ii) identified in one chromatographic fraction.
Results and discussion
Phylogenomic assessment of D. pteronyssinus reveals closest relative to be Euroglyphus maynei
Supermatrix and supertree phylogenomic methods were employed to infer the evolutionary relationships between the Acari species that have genome data available (S1 Table). Both supertree and supermatrix methodologies generated phylogenies with identical topologies and similarly high levels of bootstrap support (BP) for the monophyly of the Parasitiformes and Acariformes superorders (Fig 1A, 100% BP). Within the Parasitiformes superorder, Ixodida and Mesotigmata are monophyletic (Fig 1A, 100% BP). Within the Acariformes superorder, Trombidiformes and Sarcoptiformes are also found to be monophyletic (100% BP) although only a single representative of the Trombidiformes (Tetranychus urticae) is represented in our dataset (Fig 1A).
A. Phylogenomic species tree of 11 Acari species inferred using maximum-likelihood supermatrix (111 ortholog families, 77,878 aa aligned, LG+G+I+F model) and supertree (2,796 ortholog families) methods. Bootstrap support values are given for each branch, with values above branches corresponding to the supermatrix phylogeny and values below corresponding to the supertree phylogeny. Both supermatrix and supertree methods infer a strongly supported (87% and 100% bootstrap support respectively) sister group relationship between D. pteronyssinus and E. maynei to the exclusion of D. farinae. Therefore, in evolutionary terms D. pteronyssinus and E. maynei are more closely related to one another even though D. pteronyssinus and D. farinae belong to the same genus. B. Network map depicting abundance of mite group allergen homologs in each species. Homologous allergen group together forming an interconnected network distinct from non-homologous allergens i.e 2 & 22, groups 3, 6 & 9, groups 5 & 21, and groups 15 & 18.
To date, phylogenetic studies of Acari (mites and ticks) have been restricted to multi-locus studies utilising a small number of genes, due to the absence of full genome sequences [38–40]. Our phylogeny infers a strongly supported (87% and 100% BP for supermatrix and supertree methods, respectively) sister group relationship between D. pteronyssinus and Euroglyphus maynei to the exclusion of D. farinae. This confirms previous studies which utilised only two and six genes but observed the same phylogenetic relationship between D. pteronyssinus and E. maynei [38, 41]. Therefore, in molecular evolutionary terms, D. pteronyssinus and E. maynei are more closely related to one another, even though D. pteronyssinus and D. farinae are currently classified within the same genus. Furthermore, the phylogeny confirms the paraphyly of the Dermatophagoides genus as previously reported [41].
Comparison of D. pteronyssinus airmid predicted proteome with other D. pteronyssinus assemblies
D. pteronyssinus airmid genome assembly [27] completeness was assessed by proteo-genomics. Peppy software [26] facilitated the mapping of 615,150 spectra (28,001 non-redundant) to the predicted proteome and 402,998 (21,505 non-redundant) to the assembly. Peptides that spanned intron-exon junctions identified in the predicted proteome were not mapped to the assembly. Of the 21,505 peptides mapped to the genome assembly, 96.2% were also identified in predicted proteins (n = 20,683). Several peptides (n = 65) were located adjacent to predicted genes indicating that 65 gene models may need to be extended. The predicted proteome of D. pteronyssinus airmid [27] incorporates 8.3 million amino acids, this is significantly more than the Liu et al. (2018) and Randall et al. (2018) assemblies, which have 6.7 million and 5.9 million amino acids respectively. Moreover, despite having fewer predicted proteins than the other two available D. pteronyssinus proteomes (i.e. 12,530 versus 15,846 proteins [19] and 19,368 [18]), D. pteronyssinus airmid has on average longer protein coding genes (557aa vs. 425aa & 304aa respectively). These results indicate that differences in gene calling methodologies are most likely responsible for the differences in the number of protein coding genes.
Proteogenomic comparison of the predicted proteomes reveals the highest number of proteins were identified using the Waldron et al (2017) proteome (n = 4,581), followed by Randall et al (2018) with 4,416, then Liu et al (2018) with only 3,408 proteins identified. Therefore, unsurprisingly proteogenomic analysis of the available D. pteronyssinus genome assemblies against our protein samples which are derived from D. pteronyssinus airmid reveals the Waldron et al (2017) assembly and predicted proteome to be the most appropriate for our analyses, as higher numbers of protein identifications are uncovered relative to the other current assemblies [18, 19, 27].
Mite group allergen orthologs in arachnidia
The presence of common MGA in the genomes of Arachnida species (S1 Table) was investigated, identifying multiple putative cross-reactive MGA orthologs in mite species. Most MGA had numerous orthologs distributed across all species (S1 Data) with presence closely linked to phylogeny. To help visualise the abundance of MGAs in the different Arachnida species a homology network was generated. Our results show that D. farinae, D. pteronyssinus, and E. maynei contained at least one MGA ortholog for all groups investigated, with the exception of Groups 23 & 24 in E. maynei. Group 7 and 14 allergens are only located in the Sarcoptiformes subset of Acariformes. Homologous allergens, Der p 5 and Der p 21, were present in the closely related Acariformes D. pteronyssinus, D. farinae, E. maynei and Psoroptes ovis, but absent from the other species (Fig 1B and S1 Data). Group 23 allergens are specific to D. farinae and D. pteronyssinus. More distantly related species from the Parasitiformes superorder were either missing orthologs of particular allergens such as Der p 4, 5 & 21 or contained RBH homologs only, Der p 1, 2, 22, 23 & 27 for example, (S1 Data). Serine proteases (groups 3, 6 & 9) appear to be expanded in some species, with a minimum of 10 homologs in D. pteronyssinus to a maximum of 28 in ixodes scapularis (Fig 1B and S2 Data).
Annotation of D. pteronyssinus airmid predicted proteome
Predicted proteome annotations.
Multi-database Blast2GO workflow enabled annotation of 96% of the predicted proteome (n = 11,996, S3 Data). Gene Ontology (GO) terms were assigned to 68.2% of proteins (n = 8,546, Fig 2A). InterPro Scan (IPS) assigned IPS annotations to 95.5% of predicted proteins (n = 11,971), 6, 874 with IPS GO terms and IPS IDs to a further 3,804 proteins (Fig 2B). SignalP4.0 identified eukaryotic secretion signals in 10.3% of predicted proteins (n = 1,293, Fig 2B). Enzyme codes (EC) were assigned to 21.5% of the predicted proteins (n = 2,689, Fig 2C) with hydrolases (EC:3.0) representing the largest enzyme category (n = 1,244). Putative peptidase activity was identified in 377 predicted proteins comprising almost 3% of the total predicted proteome. Peptidase EC (EC:3.4) were assigned to 275 peptidases, the remaining peptidases (n = 102) were identified by GO annotations. Enzymes have a propensity to cause allergy [42]. The potent peptidase activity of Der p 1 has been shown to disrupt numerous immune system processes [43] and it is thought lesser studied peptidases may have a similar effect [42]. Enzymes, particularly those with predicted secretion peptides, should be considered in the context of patient exposure as they are more likely to be excreted into house dust and therefore may augment the immune response.
Annotation of D. pteronyssinus airmid proteins conducted using Blast2GO to search SwissProt, NCBI and InterPro databases for homology to known proteins and functional domains. A. GO annotations were assigned to 68.2% of the predicted proteome (n = 8,546). A small number of proteins had Blast hits with proteins in NCBInr or SwissProt databases but were not assigned a GO term (n = 372). The remaining proteins (n = 3,612) were not homologous with proteins in these databases. Most of the predicted proteins were annotated with more than one GO category (Biological Processes, Molecular function, Cellular component) with median number of assigned GO terms being 9 (Range: 0–164). B. InterPro scan feature of Blast2GO enabled assignment of InterPro IDs to 95.5% of predicted proteins/proteome? (n = 11,971). InterPro GO IDs were assigned to 55% (n = 6, 874) of the predicted proteome and predicted eukaryotic secretion peptides to 10.3% of the predicted proteome (n = 1,293). A further 3, 804 proteins were assigned InterPro Scan (IPS) IDs. Multi-database Blast2GO workflow enabled 96% of the predicted proteome to be assigned some form of annotation (n = 11,996). A small number of proteins had BLAST hits with proteins in NCBInr/SwissProt databases but were not assigned a GO term (n = 372). Several predicted proteins (n = 294) were assigned GO annotations but BLASTp hit alignments did not meet the required threshold of significance. The remaining proteins (n = 3,612) were not homologous with proteins in these databases C. Enzyme code classes assigned to D. pteronyssinus airmid predicted proteins (n = 2,689), representing 21.5% of D. pteronyssinus airmid predicted proteome. Hydrolyses (EC:3.0) formed the largest enzyme category (n = 1,244) and Isomerases (EC:5.0) the smallest (n = 87). D.Venn diagram depicting overlapping categorisation of allergenic and predicted allergenic proteins. D. pteronyssinus airmid predicted proteins were assigned into unambigous allergen goupings based upon BlastP homology to known allergens. We found full predicted proteins for all reported MGA (Groups 1–11, 13–16, 18, 20–33) and Seven non-chronalogical allergens (81.5–100% identiy). Subsequent blast hits were considered MGAH. Predicted proteins with potential cross-reactivity [32] with allergens from other species (Uniprot “allergenome”) were annotated as Allergen homolog (AH). Many Allergenic/potentially allergenic proteins were present in multiple allergen analyses, in total 332 allergenic/potentially allergenic proteins were identified.
D. pteronyssinus specific proteins.
Predicted proteins without BLASTp alignments (n = 3,906) to proteins in NCBI/Swissprot were searched (E-value ≤1E-05) against closely related species (S1 Table), 2,054 had homologs in one or more species. The remaining 1,848 uncharacterised proteins (S4 Data) represented D. pteronyssinus-specific proteins [44]. Of these, 1,475 (S4 Data), were specific to D. pteronyssinus airmid strain as they were not found in the other D. pteronyssinus assemblies [18, 19]. These data suggest that 88.3% of identified proteins are core protein coding genes as they are found in all 3 D. pteronyssinus assemblies, with the remaining 11.7% being strain specific. Some uncharacterised D. pteronyssinus airmid-specific proteins may represent adaptations while others are a likely consequence of genetic drift occurring in isolated populations [45]. Bacterial pan-genomic studies estimate that strain-specific genes range from 5% to 35% per genome [46]. These strain-specific accessory genes are generally under relaxed mutational pressure, accumulating mutations more frequently than those of the core genome [47]. Further validation of the expression of these strain specific proteins is necessary to determine if they are functional proteins facilitating strain specific adaptations. It is worth highlighting that within the D. pteronyssinus representative proteome (discussed in more detail later) were 172 D. pteronyssinus-specific proteins, 23 of which had homologs in other D. pteronyssinus assemblies [18, 19], while 149 were only found in D. pteronyssinus airmid. The putative functions of these strain specific proteins are unknown, their role in strain specific adaptions may be discovered through further proteomic investigation.
LEA-like D. pteronyssinus airmid predicted proteins.
HDM lose water readily through evaporation when the critical equilibrium humidity falls below optimum levels [48, 49]. Studies of biochemical mechanisms to resist desiccation have revealed late embryogenesis abundant proteins (LEAPs) play a key role in plant, insect and nematode desiccation survival [50–52]. Our analysis revealed 18 D. pteronyssinus airmid predicted proteins (S2 Table) to have significant homology with reported LEA proteins [30]. Gusev et al (2014) used a similar bioinformatic approach to identify 27 LEA-like proteins in the anhydrobiotic sleeping chromatid, Polypedilum vanderplanki [53]. This anhydrobiotic organism can tolerate extreme water loss of 97% by entering a state of suspended animation. The presence of similar proteins in D. pteronyssinus may explain the ability of mites in the protonymph developmental stage being entirely resistant to desiccation [49]. Although D. pteronyssinus can be killed by extended exposure to sub-critical equilibrium humidity, reduction of humidity in the home does not lead to a reduction in mite numbers or levels of allergen [54], as HDM return to a normal metabolic and reproductive state following short periods of optimal humidity. Furthermore, mattresses when occupied provide ample humidity to ensure survival of HDM in low humidity homes [14].
Expression of D. pteronyssinus airmid proteins exhibiting LEA-like proteins (n = 7) was validated by proteomics. All LEA-like proteins were found at low intensity (~LQF intensities of 1E+08) except for DERPT_G12026 and DERPT_G404 (LFQ Intensities > 7.8E+10). These two highly abundant LEA-like proteins are expressed under optimal non-desiccating laboratory growth conditions and were identified in both mite body and spent media. The ability of D. pteronyssinus airmid to utilise LEA-like proteins under normal laboratory conditions or in response to desiccation/cold may yield information that could be exploited for biocontrol strategies. The role of LEA-like proteins in relation to D. pteronyssinus is yet to be determined, however our data highlights them as potential players in desiccation resistance and hence as interesting biocontrol targets.
Allergens and predicted allergens
We searched the D. pteronyssinus airmid proteome for the presence of 37 prevously reported mite allergens [7, 31]. MGA were identified for D. pteronyssinus airmid, we found full protein sequences (n = 37) for all reported MGA; Groups 1–11, 13–16, 18, 20–33 and seven non-chronological allergens (81.5–100% identity, Table 1). Subsequent BLAST hits (E-value ≤1E-03) were considered MGA homologs (n = 233, Table 1 and S5 Data). Der p 1-like cysteine proteases were represented by 31 homologs (20.9–63.7% identity), several were found in clusters of 2–3 adjacent protein coding genes (n = 13). MGA homologs with high sequence similarity to the query MGA (> 67% identity) met criteria for being considered isoallergens [55]. Isoallergens were identified for eight different MGAs (Table 1). Der p 28 has five isoalllergens (69.4–85.7% identity) and two were identified for Der p 29 (75.7–88.1% identity). One-third of MGA homologous proteins met the criteria for potential allergenicty [32] and were annotated as MGA Homologs (MGAH) (n = 93, S5 Data). In addition, many D. pteronyssinus airmid predicted proteins (n = 326, S6 Data) also exhibited significant similarity [32] to allergens from other species, suggesting they may be cross-species allergens. Most of these allergen homologs (AH) had multiple high scoring alignments with non D. pteronyssinus allergens (n = 991, Range:1–204, Median: 4). Significant overlap was seen between allergenic (MGA) and predicted allergenic (MGAH & AH) proteins with many being observed in more than one category, illustrated in Fig 2D. The structure and function of a protein has important implications for allergenicity, most allergenic proteins are limited to just 2% of protein families, with cross-reactivity linked to protein family rather than allergen source [6, 56, 57]. Most potential allergens highlighted in this study had predicted biochemical functions that placed them in well defined allergen families [6]. For example, predicted enolases DERPT_G12026 and DERPT_G4831 have high levels of sequence similarity (67–87%) with enolases from up to 19 phylogenetically distinct species (S6 Data). Enolase has long been recognised as a major cross-reacting allergen in plants, fungi, fish, and arthropods [58]. Moreover, the presence of at least one putative cross-reactive enolase ortholog [32] in all 12 Acari and two Arachnid outgroups (S2 Data) highlights the importance of this pan-allergen protein family. Several cyclophilins (n = 12) were annotated as putative cross-reactive proteins, of note Der f 29 like allergen (DERPT_G9923) exhibited sequence homology (52–83% identity) with cyclophilins from 12 different species (S6 Data). Cross-reactive cyclophilins from HDM, mouse, humans and fungi are well reported in the literature [59–61]. HDM and fungi are frequently co-present in HD [62] with Alternaria and Aspergillus spp. being the most common source of mould allergens [63]. D. pteronyssinus airmid predicted proteins exhibited homology to Aspergillus fumigatus (n = 19) and Alternaria alternata (n = 6) allergens. Homology between HDM and fungal proteins may play a role in reported fungal exacerbation of HDM-induced asthmatic symptoms [64, 65]. Excretion of putative allergenic proteins into HD via faecal particles would implicate a route of exposure and therefore has significant implications for allergy. Eukaryotic secretion peptides were predicted in 20.7% of allergenic/potentially allergenic proteins. Even if these putative allergens were unable to induce immune responces in their own right, co-presence with other immune modulators may be involved in bystander sensitization. Der p 1 accumulates in HD, levels exceeding 2 μg/g of dust are considered hazardous to the occupants [1]. In addition to being a potent activator of the immune system, Der p 1 has an adjuvant effect, enhancing IgE production against bystander molecules that may be present in HD [66]. Therefore, any protein accumulating in HD should be considered in the context of being a bystander allergen candidate.
Proteomic characterisation of D. pteronyssinus airmid
Representative proteome.
Analysis of LC-MS/MS spectra obtained from proteomic analysis of D. pteronyssinus airmid resulted in the high confidence identification of 3,931 proteins (S7 Data). This representative proteome comprised 31.4% of the predicted proteome of D. pteronyssinus airmid with experimental evidence for expression. Protein molecular weight and pI were widely distributed in the representative proteome (Range: 5.25–3086.2 kDa, pI 3.26–12.85) and similar to that of the predicted proteome (Range: 3.94–3086.2 kDa, pI 2.63–13.27), confirming the protein extraction methods were optimal for the characterisation of D. pteronyssinus airmid proteome (Fig 3A). Establishing a representative proteome, that reflects the methodological limitations of protein extraction and identification is essential for subsequent enrichment analyses. This background proteome is defined by Bessarabova et al (2012) as “the complete set of proteins known to be expressed in an organ/tissue/body liquid/cell line of sample origin” [67]. To date, proteomic investigation of D. pteronyssinus has extended to a few discrete studies [68–71]. Laboratory HDM populations have been shown to have different reproduction rates to wild-type strains [72] and isolated populations to give rise to geographical allergen variants [7]. Experimental examination of uncharacterised D. pteronyssinus airmid-specific proteins, particularly those with evidence for expression, may provide useful insights into the genes evolving within isolated populations. Our representative proteome may be expanded by employing alternative protein extraction methods, use of trypsin alternative or multi-protease digestion and depletion of high abundance proteins [73–75].
A. Proteomic Strategy for characterisation of D. pteronyssinus airmid. Flow diagram depicting proteomic strategy for characterisation of D. pteronyssinus airmid excretome and mite body proteome utilising whole sample analysis and sample fractionation analysis. A1. Gel filtration chromatography of D. pteronyssinus airmid whole body homogenate. Whole body homogenate protein extract (4.5 mg; 500 μl injection) separated by size exclusion chromatography (Superdex 200 10/300 GL; 0.4 ml/min flow rate). 0.5 ml fractions collected between 6.21 and 32.26 ml post-injection. A2. SDS-PAGE analysis of D. pteronyssinus airmid whole body homogenate fractions. SDS-PAGE (4–20%) with silver staining of whole body homogenate protein extract (Lane 1; 10 μg) from D. pteronyssinus airmid and fractions from gel filtration (6.2–22.2 ml post-injection) of same (Lanes 2–13, 17 μl loaded). SDS-PAGE visualisation of fractions (0.5ml) depicting fractionation of complex protein mixture into reduced complexity extracts containing proteins of approximately similar sizes. Fractions, 7.2–32.2 ml, post injection (n = 50) were pooled to give 1–1.5 ml aliquots (n = 21),100 μl was processed for LC-MS/MS. A3. Gel filtration chromatography of D. pteronyssinus airmid spent growth medium (SM). Protein extract of spent growth media (0.2 mg; 500 μl injection) separated by size exclusion chromatography (Superdex 200 10/300 GL; 0.4 ml/min flow rate). 0.5 ml fractions collected between 6.1 and 32.1 ml post-injection for SDS-PAGE analysis. Proteins in gel filtration fractions were acidic and therefore unsuitable for analysis by SDS-PAGE. Therefore, fractions were pooled and analysed according the methods used for mite body extract fractions. Fractions, 7.1–32.1 ml, post injection (n = 50) were pooled to give 1–1.5 ml aliquots (n = 21), 100 μl was processed for LC-MS/MS. B. Alignment of theoretical molecular mass and pI from D. pteronyssinus airmid PP (n = 12, 530) and representative proteome (n = 3,931) Software: http://www.juirgel.de Hiller et al., (2006). Proteins > 900 kDa excluded for graphing purposes (DERPT_G11449, DERPT_G9606, DERPT_G4775, DERPT_G8007,3088 kDa, 2021 kDa, 1122 kDa & 1001 kDa respectively). Predicted proteins with strong proteomic evidence for expression (2+ medium confidence peptides) accounted for 31.4% of the predicted proteome (n = 3, 931). Calculated molecular mass and pI (Range: 5.25–3086.2 kDa, pI 3.26–12.85) of representative proteome was approximate to that of the predicted proteome (Range: 3.94–3086.2 kDa, pI 2.63–13.27). Median protein size in the D. pteronyssinus airmid PP was 54.25 kDa. The largest predicted protein coding gene was DERPT_G11449 at 79,548 bp and encoded a paladin (Titin) protein homolog (26,516 a.a, 3086.2 kDa, pI 4.93). The smallest predicted protein was DERPT_G12515 (36 a.a, 3.94 kDa, pI 6.04) of unknown function, encoded by a 106 bp gene. The median protein pI was calculated to be 7.42, the protein with lowest predicted pI of 2.63 was DERPT_G12367 (50 a.a, 5.45 kDa), DERPT_G11425 had the highest predicted pI at 13.27 (67 a.a, 6.94 kDa). C. Venn Diagram depicting overlap between Mite Body (MB), Spent Culture Medium (SM) and House Dust (HD) Proteomes, totalling 1,952 proteins identified.
Wildtype proteome of D. pteronyssinus.
Extending the relevance of D. pteronyssinus proteomics to the environment is essential, as very little is known about which HDM components are present in inhalable air [10]. Proteomic analysis of 21 Der p 1 positive HD samples revealed the presence of 150 D. pteronyssinus proteins (S8 Data), the ten most abundant are given in Table 2, with sequence coverage ranging from 9.6–73.3%. Here, it can be seen that allergens Der p 1, 2, 5, 14 and Der f 6 like allergen are amongst the most abundant D. pteronyssinus proteins in HD. Der p 1 and Der p 2 are considered major allergens while Der p 5, 6 and 14 are mid-tier IgE binders, all were amongst the ten most abundant proteins in HD [7]. Five non-allergenic proteins were also highly abundant, suggesting that sensitisation and IgE binding is a result of the unique properties of the protein rather than its abundance in house dust. Allergenic/potentially allergenic molecules accounted for almost 51% of all proteins identified in HD (n = 76). The predicted functions of many of these proteins place them into well-established allergen families, therefore their presence in HD and sequence similarity to known allergens make them interesting targets for future studies seeking to identify new allergens. Comparative analysis between allergenic and non-allergenic components of HD may reveal epitopes or structural characteristics common to inhalant allergens [10]. Our data illustrates the utility of high sensitivity protein MS as a novel way to identify HDM products in the wild-type environment and has significant implications for the development of immunotherapies that mimic natural exposure. Previously, researchers have examined numerous commercially available diagnostic and therapeutic HDM extracts, finding many were lacking important allergens and some had several fold variations in Der p 1 to Der p 2 ratios [76]. For example, the 2017 international consensus (ICON) report on the clinical consequences of mite hypersensitivity states that the “development of objective methods to assess allergen exposure and environmental control outcomes” are unmet and in need in mite allergy research [1]. Furthermore, the European medicines agencies guideline on the clinical development of products for specific immunotherapy for the treatment of allergic diseases, states that for seasonal allergies “it is mandatory to document the exposure to the relevant allergens” and “it is recommended to document the exposure level for the individual patient especially for the evaluation periods to evaluate the variation of indoor allergens” [77]. Generating diagnostic and therapeutic HDM extracts with allergen content and ratios that mimic natural exposure is of great importance. Characterisation of the factors affecting allergen repertoire and accumulation in different home microhabitats may give rise to much needed strategies for reducing allergen exposure for sensitised individuals [78].
Proteome of laboratory-reared D. pteronyssinus airmid.
D. pteronyssinus airmid whole protein lysates (WPL) were analysed directly (shotgun) and subjected to fractionation by size exclusion chromatography resulting in identification of 1,948 MB and SM proteins by high sensitivity protein mass spectrometry. Protein identification was confirmed by detection of at least two peptides per protein, and percentage sequence coverage ranged from 0.2 to 99.4% across the entire protein dataset. Gel filtration fractionation of MB extracts and SM (Fig 3A1–3A3) led to the unique identification of 248 and 105 proteins (18% of total identified proteins) from chromatographic fractions. Of the proteins identified, 1,076 proteins (58.2%) were uniquely found in MB extracts (S9 Data), while 99 proteins (0.8%) were solely identified as secreted proteins (S10 Data). Faecal rich SM was obtained by sieving to remove large mites, therefore smaller mites remained. In a previous proteomic study of D. pteronyssinus faeces, it was suggested that the method of faeces sample collection is beneficial over the sieving technique, which unavoidably contains mite bodies and growth media [69]. While culture media-derived proteins (contaminants) will also be detected in proteomic analysis, it is possible to differentiate true dust mite proteins from contaminants by use of a culture media contaminants database as demonstrated in our study. Several functional terms assigned to proteins were differentially represented in MB (S9A–S9C Data) and SM (S10A–S10C Data) proteomes compared to the RP.
Several GO terms were differentially represented (Over-represented: n = 161, Under-represented: n = 6, S9A Data), of note, GO Cellular Component terms cytosol (Fisher’s P = 5.3E-49, GO:0005829) and mitochondrion (Fisher’s P = 8.5E-33, GO:0005739) were the most highly over-represented terms. This finding supports our proteomic strategy, showing significant enrichment of GO terms associated with intracellular activities. EC were highly represented in the MB proteome, assigned to 44.6% of proteins identified (n = 824). Only two enzyme names were found to be differentially represented, acting on peptide bonds (Fisher’s P = 1.28E-05) was over-represented and transferring phosphorous-containing groups (Fisher’s P = 1.03E-04) underrepresented (S9B Data). NAD(P)-binding domain superfamily (IPR036291) was the most highly over-represented IPS ID (Fishers P = 1.63E-12) of 25, with mobidb-lite IPS ID (Fishers P = 5.58E-17) for Intrinsic disorder protein sequences, representing the most significantly underrepresented of 30 IPS IDs (S9C Data).
All MGA apart from Der p 28 were found in the MB, six were among the most abundant proteins identified (S3 Table). In the absence of Der p 28, nine Der p 28 homologs were identified including four isoallergens (Table 1 and S9 Data). MGA were also found to be amongst the most abundant proteins in the SM, including Der p 2 and Der p 14 (S4 Table). Allergenic/potentially allergenic molecules were highly represented in both MB and SM proteomes accounting for 11.4% and 18.5% of all proteins identified. Homologs of Der p 1 (n = 17), Der p 2 (n = 2) and Der p 23 (n = 1) were identified in the SM. Given that allergens Der p 1and Der p 2) account for up to 60% of IgE reactivity in HDM sensitised individuals [5], and Der p 23 sensitivity is seen in 79% of HDM allergic patients [79] examination of cross-reactivity between these MGA and excreted homologs is warranted.
MB and SM proteomes were abundant in enzymes, which accounted for 44.6% and 52.4% of all proteins identified respectively (S9 and S10 Datas), some of these enzymes may be involved in digestion. HDM have long been associated with feeding on shed skin present in HD. While they have been observed to eat skin, the poor nutritional value of keratin makes it unlikely to be a primary food source. Rather, HDM are trophic generalists, they feed on organic debris associated with their proximity to humans [16]. In the laboratory, D. pteronyssinus have been grown on diverse culture media including various combinations of wheat bran, wheat flour, dog food, rodent chow, ground porcine liver, dried egg powder, defatted skin scales, and fish food. Most research groups use dried yeast to supplement diets and improve mite population growth [72, 80–83]. HDM have been observed to feed on bacteria and fungi in laboratory experiments [84, 85]. Whether HDM consume bacteria or fungi in a wildtype setting as a nutrient source needs further experimental evidence. Expression and excretion of bacterial and fungal degrading enzymes may indicate a role in digestion [85].
Numerous glycoside hydrolases (EC:3.2.1) were identified in the predicted proteome (n = 57) and proteomic datasets (RP, MB & SM) summarised in Table 3. Each glycoside hydrolase enzyme sub-family were represented by at least one proteomic identification. Two predicted lysozymes (EC:3.2.1.17) and one 14.5 kDa bacteriolytic enzyme, Der p38 (DERPT_G10989) were identified with evidence for high expression, these enzymes may be responsible for bacteriolytic activity in HDM extracts [86]. Excreted proteins with predicted activities against major components of fungal cell walls were identified and include; chitinases (n = 5), four glycoside hydrolase family 16 members with putative β-1,3 glucanase activity, β-mannosidase (n = 1), α-mannosidases (n = 4), chitosanase (n = 1) and α-N-Acetyl hexosaminidaseine (n = 2). Carbohydrate metabolism GO terms were among the 318 over-represented GO terms (Fisher’s exact test < 0.05 FDR) in the excretome and included carbon utilization, hydrolase activity hydrolyzing O-glycosyl compounds, chitin metabolic process, chitin binding and starch metabolic process (S10A Data). Of the putative enzymes identified in the SM proteome (n = 457), enrichment analysis showed 35 to be over-represented, many of which related to carbohydrate digestion; Alpha-glucosidase, Chitinase and Alpha-mannosidase (S10B Data). The presence of a predicted secretion signal was a strong indicator of excretion as 22.5% of all proteins identified in the SM proteome contained predicted secretion signals. Moreover, secretion signal peptides were the most highly over-represented IPS ID (Fisher’s P = 1.08E-26) of 61 (S10C Data).
The expression of the numerous carbohydrate active enzymes listed above provides compelling evidence to support observations of D. pteronyssinus feeding on fungi and bacteria and demonstrate that they possess the necessary enzymes to utilise bacteria and fungi as a nutrient source [85]. Feeding on bacteria or fungi within wild-type microhabitats may alter allergen repertoire between homes, as diet has been demonstrated to alter allergen production in laboratory HDM cultures [87]. This new insight compounds the necessity for characterising factors affecting HDM allergen production within the home.
The process of chitin synthesis and remodelling is an integral part of the growth and development of all arthropods. Chitin remodelling enzymes include chitinase, β-N-Acetylhexosaminidase and the highly conserved chitin synthase, a key enzyme in the insect biosynthetic pathway [88]. Proteomic profiling of D. pteronyssinus airmid facilitated identification of eight predicted chitinases (EC:3.2.1.14, five β-N-Acetylhexosaminidases (EC:3.21.52) and two chitin synthases (IPR004835) (Table 3) putatively involved in chitin remodelling. These enzymes represent important biocontrol targets, as chitin is absent from vertebrates, dysregulation of these enzymes could provide a much needed method of curtailing HDM populations in the homes of sensitised individuals [12, 88].
Mite group allergen localisation
All proteins are synthesised in the MB prior to excretion, however excreted proteins are likely to accumulate in growth medium and HD. Data regarding localisation of MGA are limited [7]. Localisation is often linked to the degree of protein allergenicity, identifying sites of MGA accumulation may reveal trends of exposure that can be applied to assessing new allergens. In our analysis, the majority of MGA were detected in MB and SM proteomes, the relative amounts in each proteome was used to infer localisation. Proteomic assessment of localisation showed sero-dominant allergens Der p 1, Der p 2 and Der p 23 to accumulate in SM (Table 4), with Der p 1 and Der p 2 being the two most abundant proteins (S4 Table). Der p 23 has previously been reported to be found only in low quantities in SM relative to Der p 2 [79], we observed the same trend, more Der p 2 ms spectra (n = 486) were detected than for Der p23 (n = 90). Allergens Der p 3, Der f 6 like allergen, Der p 9, Der p 15 and Der p 28 were also found to accumulate in the SM (Table 4). Localisation of Der p 3 to SM is consistent with previous observations for Der f 3 [89], the serine peptidases Der p 3, Der f 6 like allergen and Der p 9 were all among the top 10 most abundant SM proteins (S4 Table). Despite a different methodological approach, another study also found Der p 1, Der p 2, Der p 6 and Der p 15 to be major proteins in D. pteronyssinus faeces and SM [69].
Analysis of localisation must not be restricted to laboratory-based studies and should include environmental reference samples where possible as demonstrated for allergens Der p 5 and Der p 21. Initial proteomic assessment found Der p 5 and Der p 21 were not excreted under laboratory conditions as they were absent from SM (S10 Data). However, proteomic analysis of HD revealed Der p 5 to be among the top 10 most abundant proteins (Table 2) and Der p 21 the 63rd most abundant, both were identified in at least 16 of the Der p 1 positive dust samples (S8 Data). The location of Der p 5 and Der p 21 in laboratory cultures, present in MB and absent from SM, would indicate they are not excreted, however, their presence in HD shows that there must be other factors that contribute to accumulation of non-excreted allergens in the home. These allergens may accumulate in HD as the mite bodies begin to degrade. The accumulation of dead mites in laboratory cultures is avoided by regular sub-culturing of mites. Cross-referencing SM with MB associated proteins as demonstrated in Table 4, can infer localisation and can allow for novel insights into modes of accumulation when compared to wild-type dust samples. Our work has demonstrated the utility of high sensitivity mass spectrometry in characterising the complex proteomes of D. pteronyssinus. Laboratory cultures show parallels in protein expression with wild-type samples, as most proteins were identified in two or more of the proteomes (Fig 3C) with only four proteins uniquely identified in HD. Additional research is required to characterise the various microhabitats of HDM and factors affecting allergen presence in the home.
Conclusion
Here we performed a comprehensive bioinformatic and proteomic examination of D. pteronyssinus airmid describing the expression of 4,002 proteins (S11 Data) and identified 332 potential allergens. High sensitivity mass spectrometry allowed for the description of novel D. pteronyssinus components in HD and facilitated qualitative assessment of MGA localisation. This research has expanded the knowledge of proteins utilised by D. pteronyssinus for key physiological processes and will form the basis for further research into biocontrol strategies for the medically important HDM.
Supporting information
S1 Table. Genome assemblies utilised for phylogenetic analysis.
https://doi.org/10.1371/journal.pone.0216171.s001
(DOCX)
S2 Table. LEA homologs in D. pteronyssinus airmid.
https://doi.org/10.1371/journal.pone.0216171.s002
(DOCX)
S3 Table. Top 10 most abundant proteins identified in D. pteronyssinus airmid mite body.
https://doi.org/10.1371/journal.pone.0216171.s003
(DOCX)
S4 Table. Top 10 most abundant proteins identified in D. pteronyssinus airmid spent culture medium.
https://doi.org/10.1371/journal.pone.0216171.s004
(DOCX)
S1 Data. Allergen orthologs potentially cross-reactive orthologs of known mite group allergens (Der p1-11, 13–16, 18, 20–33 or non-chronalogical allergen sequences available in UniProt for D. pteronyssinus and D. farinae, where D. pteronyssinus sequences were unavailable) identified by BLAST searches in 12 acari and 2 arachnid out-grouping.
To be considered an allergen ortholog, proteins had to have reciprocal best hits with an allergen (E-value ≤1E-05), an alignment length of at least 80 amino acids and share at least 35% identity. We also identified allergen homologs by removing the criteria of being a reciprocal best hit.
https://doi.org/10.1371/journal.pone.0216171.s005
(XLSX)
S2 Data. Number of potentially cross-reactive orthologs of known MGA (Der p1-11, 13–16, 18, 20–33 or non-chronalogical allergen sequences available in UniProt for D. pteronyssinus and D. farinae, where D. pteronyssinus sequences were unavailable) identified by BLAST searches in 12 acari and two arachnid outgrouping.
To be considered an allergen ortholog, proteins had to have reciprocal best hits with an allergen (E-value ≤1E-05), an alignment length of at least 80 amino acids and share at least 35% identity.
https://doi.org/10.1371/journal.pone.0216171.s006
(XLSX)
S3 Data. D. pteronyssinus and D. pteronyssinus airmid specific genes. D. pteronyssinus specific genes (n = 1,850) are absent from other acari but present in all one or more D. pteronyssinus genome assemblies (S1 Table).
D. pteronyssinus airmid specific genes (n = 1,475) were found only in D. pteronyssinus airmid genome assembly.
https://doi.org/10.1371/journal.pone.0216171.s007
(XLSX)
S4 Data. D. pteronyssinus airmid predicted proteome (n = 12,530 sequences) annotated by blast homology to sequences in NCBInr, SwissProt and InterPro databases utilising Blast2GO.
https://doi.org/10.1371/journal.pone.0216171.s008
(XLSX)
S5 Data. BLAST alignment results for D. pteronyssinus airmid predicted proteins (E-value ≤1E-03) to MGA sequences available in UniProt for D. pteronyssinus and D. farinae (where D. pteronyssinus sequences were unavailable).
https://doi.org/10.1371/journal.pone.0216171.s009
(XLSX)
S6 Data. List of all D. pteronyssinus airmid protein sequences with known immunomodulatory effects or predicted, “allergenic molecules (n = 332)”, based upon BLASThomology to known allergens (E-value ≤1E-05), with an alignment length of at least 80 amino acids and share at least 35% identity.
This list is a consolidation of protein sequences designated as“Mite Group Allergens and Non-chronological allergens” (MGA, n = 37), “Mite group allergen homologs” (MGAH, n = 93), “Allergen Homologs” (AH, n = 326). Allergen Homolog BLASTp alignment data.
https://doi.org/10.1371/journal.pone.0216171.s010
(XLSX)
S7 Data. List of D. pteronyssinus airmid proteins (n = 3,931) identified by meta-proteomic analysis of 88Gb of spectra (99 LC-MS/MS files) representing the proteome available for analysis utilising extraction and analysis techniques employed during this study (Representative Proteome).
https://doi.org/10.1371/journal.pone.0216171.s011
(XLSX)
S8 Data. Proteomic results of all D. pteronyssinus airmid proteins (n = 150) identified from house dust protein extracts by LC-MS/MS.
https://doi.org/10.1371/journal.pone.0216171.s012
(XLSX)
S9 Data.
A) Proteomic results of all D. pteronyssinus airmid proteins (n = 1,849) identified from the mite body through combined proteomic strategy of whole extract protein analysis and gel filtration protein extract fractionation, followed by LC-MS/MS. B) List of the most specific GO terms (n = 167) differentially represented between the representative proteome and mite body proteome (<0.05 FDR) illustrating sample enrichment for proteins associated with mite body processes. C) List of enzyme names (n = 2) differentially represented between the representative proteome and mite body proteome (<0.05 FDR) illustrating sample enrichment for enzymes associated with mite body processes. D) List of InterPro IDs (n = 57) differentially represented between the representative proteome and mite body proteome (<0.05 FDR) illustrating sample enrichment of proteins with functional domains associated with mite body processes.
https://doi.org/10.1371/journal.pone.0216171.s013
(XLSX)
S10 Data.
A) Proteomic results of all D. pteronyssinus airmid proteins (n = 873) identified from the spent culture media (Excretome) through combined proteomic strategy of whole extract protein analysis and gel filtration protein extract fractionation, followed by LC-MS/MS. B) List of the most specific GO terms (n = 320) differentially represented between the representative proteome and Excretome (<0.05 FDR) illustrating selective extraction of proteins associated with the Excretome. C) List of enzyme names (n = 35) differentially represented between the representative proteome and Excretome (<0.05 FDR) illustrating selective extraction of certain groups of enzymes associated with the Excretome. D) List of InterPro IDs (n = 69) differentially represented between the representative proteome and Excretome (<0.05 FDR) illustrating selective extraction of proteins with functional domains associated with the excretome.
https://doi.org/10.1371/journal.pone.0216171.s014
(XLSX)
S11 Data. List of all D. pteronyssinus airmid proteins identified in this study by proteomics (n = 4, 002).
https://doi.org/10.1371/journal.pone.0216171.s015
(XLSX)
Acknowledgments
Continuous access to laboratory facilities and consumables funding, as well as expert project guidance and proteomic expertise, was provided by Professor Sean Doyle of Maynooth University. The authors would like to thank Maire Fox and Shola Ibitoye for their ELISA work on house dust extracts.
References
- 1. Sanchez-Borges M, Fernandez-Caldas E, Thomas WR, Chapman MD, Lee BW, Caraballo L, et al. International consensus (ICON) on: clinical consequences of mite hypersensitivity, a global problem. World Allergy Organ J. 2017;10(1):14. Epub 2017/04/30. pmid:28451053; PubMed Central PMCID: PMCPMC5394630.
- 2. Oseroff C, Christensen LH, Westernberg L, Pham J, Lane J, Paul S, et al. Immunoproteomic analysis of house dust mite antigens reveals distinct classes of dominant T cell antigens according to function and serological reactivity. Clin Exp Allergy. 2017;47(4):577–92. Epub 2016/09/30. pmid:27684489; PubMed Central PMCID: PMCPMC5374056.
- 3. Wan H, Winton HL, Soeller C, Tovey ER, Gruenert DC, Thompson PJ, et al. Der p 1 facilitates transepithelial allergen delivery by disruption of tight junctions. J Clin Invest. 1999;104(1):123–33. Epub 1999/07/07. pmid:10393706; PubMed Central PMCID: PMCPMC408401.
- 4. Jacquet A. Innate immune responses in house dust mite allergy. ISRN Allergy. 2013;2013:735031. Epub 2013/06/01. pmid:23724247; PubMed Central PMCID: PMCPMC3658386.
- 5. Hales BJ, Martin AC, Pearce LJ, Laing IA, Hayden CM, Goldblatt J, et al. IgE and IgG anti–house dust mite specificities in allergic disease. Journal of Allergy and Clinical Immunology. 2006;118(2):361–7. https://doi.org/10.1016/j.jaci.2006.04.001 pmid:16890759
- 6. Radauer C, Bublin M, Wagner S, Mari A, Breiteneder H. Allergens are distributed into few protein families and possess a restricted number of biochemical functions. Journal of Allergy and Clinical Immunology. 2008;121(4):847–52.e7. http://dx.doi.org/10.1016/j.jaci.2008.01.025 pmid:18395549
- 7. Thomas WR. Hierarchy and molecular properties of house dust mite allergens. Allergology International. 2015;64(4):304–11. http://dx.doi.org/10.1016/j.alit.2015.05.004 pmid:26433526
- 8. Platts-Mills TA, Heymann PW, Longbottom JL, Wilkins SR. Airborne allergens associated with asthma: particle sizes carrying dust mite and rat allergens measured with a cascade impactor. J Allergy Clin Immunol. 1986;77(6):850–7. Epub 1986/06/01. pmid:3711552.
- 9. Tovey ER, Chapman MD, Platts-Mills TA. Mite faeces are a major source of house dust allergens. Nature. 1981;289(5798):592–3. Epub 1981/02/12. pmid:7464922.
- 10. Thomas WR. Blueprint for the House Dust Mite. Int Arch Allergy Immunol. 175. Switzerland2018. p. 121–5. pmid:29301118
- 11. Zock JP, Heinrich J, Jarvis D, Verlato G, Norback D, Plana E, et al. Distribution and determinants of house dust mite allergens in Europe: the European Community Respiratory Health Survey II. J Allergy Clin Immunol. 2006;118(3):682–90. Epub 2006/09/05. pmid:16950288.
- 12. Gøtzsche P, Johansen H. House dust mite control measures for asthma. Cochrane Database of Systematic Reviews [Internet]. 2008; (2). Available from: http://onlinelibrary.wiley.com/doi/10.1002/14651858.CD001187.pub3/epdf.
- 13. Arlian LG, Neal JS, Morgan MS, Vyszenski-Moher DL, Rapp CM, Alexander AK. Reducing relative humidity is a practical way to control dust mites and their allergens in homes in temperate climates. Journal of Allergy and Clinical Immunology. 2001;107(1):99–104. PubMed PMID: WOS:000166533300018. pmid:11149998
- 14. deBoer R, Kuller K. Mattresses as a winter refuge for house-dust mite populations. Allergy. 1997;52(3):299–305. PubMed PMID: WOS:A1997WV34300006. pmid:9140520
- 15. Ravee R, Mohd Salleh FI, Goh H-H. Discovery of digestive enzymes in carnivorous plants with focus on proteases. PeerJ. 2018;6:e4914. pmid:29888132
- 16.
Colloff MJ. Dust Mites. Springer, Van Godewijckstraat 30 3311 GX Dordrecht The Netherlands: CSIRO Publishing; 2009.
- 17. Waterhouse RM, Seppey M, Simao FA, Manni M, Ioannidis P, Klioutchnikov G, et al. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol Biol Evol. 2017. Epub 2017/12/09. pmid:29220515; PubMed Central PMCID: PMCPMC5850278.
- 18. Randall TA, Mullikin JC, Mueller GA. The Draft Genome Assembly of Dermatophagoides pteronyssinus Supports Identification of Novel Allergen Isoforms in Dermatophagoides Species. Int Arch Allergy Immunol. 2018;175(3):136–46. Epub 2018/01/11. pmid:29320781; PubMed Central PMCID: PMCPMC5847439.
- 19. Liu XY, Yang KY, Wang MQ, Kwok JS, Zeng X, Yang Z, et al. High-quality assembly of Dermatophagoides pteronyssinus genome and transcriptome reveals a wide range of novel allergens. J Allergy Clin Immunol. 2018;141(6):2268–71.e8. Epub 2018/01/07. pmid:29305317.
- 20. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32(5):1792–7. Epub 2004/03/23. pmid:15034147; PubMed Central PMCID: PMCPMC390337.
- 21. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–3. Epub 2014/01/24. pmid:24451623; PubMed Central PMCID: PMCPMC3998144.
- 22. Darriba D, Taboada GL, Doallo R, Posada D. ProtTest 3: fast selection of best-fit models of protein evolution. Bioinformatics. 2011;27(8):1164–5. Epub 2011/02/22. pmid:21335321; PubMed Central PMCID: PMCPMC5215816.
- 23. Price MN, Dehal PS, Arkin AP. FastTree 2—approximately maximum-likelihood trees for large alignments. PLoS One. 2010;5(3):e9490. Epub 2010/03/13. pmid:20224823; PubMed Central PMCID: PMCPMC2835736.
- 24. Creevey CJ, McInerney JO. Clann: investigating phylogenetic information through supertree analyses. Bioinformatics. 2005;21(3):390–2. Epub 2004/09/18. pmid:15374874.
- 25. Letunic I, Bork P. Interactive Tree Of Life (iTOL): an online tool for phylogenetic tree display and annotation. Bioinformatics. 2007;23(1):127–8. Epub 2006/10/20. pmid:17050570.
- 26. Risk BA, Spitzer WJ, Giddings MC. Peppy: proteogenomic search software. J Proteome Res. 2013;12(6):3019–25. Epub 2013/04/26. pmid:23614390; PubMed Central PMCID: PMCPMC4117251.
- 27. Waldron R, McGowan J, Gordon N, McCarthy C, Mitchell EB, Doyle S, et al. Draft Genome Sequence of Dermatophagoides pteronyssinus, the European House Dust Mite. Genome Announcements. 2017;5(32):e00789–17. PubMed PMID: PMC5552995. pmid:28798186
- 28. Conesa A, Götz S. Blast2GO: A Comprehensive Suite for Functional Analysis in Plant Genomics. International Journal of Plant Genomics. 2008;2008:619832. PubMed PMID: PMC2375974. pmid:18483572
- 29. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25(17):3389–402. Epub 1997/09/01. pmid:9254694; PubMed Central PMCID: PMCPMC146917.
- 30. Hunault G, Jaspard E. LEAPdb: a database for the late embryogenesis abundant proteins. BMC Genomics. 2010;11:221–. PubMed PMID: PMC2858754. pmid:20359361
- 31. Chan TF, Ji KM, Yim AKY, Liu XY, Zhou JW, Li RQ, et al. The draft genome, transcriptome, and microbiome of Dermatophagoides farinae reveal a broad spectrum of dust mite allergens. Journal of Allergy and Clinical Immunology. 2015;135(2):539–48. PubMed PMID: WOS:000349372300028. pmid:25445830
- 32.
FAO/WHO. Joint FAO/WHO Expert Consultation on Allergenicity of Foods Derived from Biotechnology. In: Food and Agriculture Organization of the United Nations WHO, editor. 2001.
- 33. Bastian M, Heymann S, Jacomy M. Gephi: An Open Source Software for Exploring and Manipulating Networks. 2009.
- 34. Owens RA, O'Keeffe G, Smith EB, Dolan SK, Hammel S, Sheridan KJ, et al. Interplay between Gliotoxin Resistance, Secretion, and the Methyl/Methionine Cycle in Aspergillus fumigatus. Eukaryotic cell. 2015;14(9):941–57. Epub 2015/08/27. pmid:26150413.
- 35. Sipos G, Prasanna AN, Walter MC, O'Connor E, Balint B, Krizsan K, et al. Genome expansion and lineage-specific genetic innovations in the forest pathogenic fungi Armillaria. Nat Ecol Evol. 2017;1(12):1931–41. Epub 2017/11/01. pmid:29085064.
- 36. Hiller K, Schobert M, Hundertmark C, Jahn D, Münch R. JVirGel: calculation of virtual two-dimensional protein gels. Nucleic Acids Research. 2003;31(13):3862–5. PubMed PMID: PMC168943. pmid:12824438
- 37. O’Keeffe G, Hammel S, Owens RA, Keane TM, Fitzpatrick DA, Jones GW, et al. RNA-seq reveals the pan-transcriptomic impact of attenuating the gliotoxin self-protection mechanism in Aspergillus fumigatus. BMC Genomics. 2014;15(1):894. pmid:25311525
- 38. Cui Y, Gao C, Zhou Y, Zhou P, Peng M, Lin Y, et al. Phylogenetic analysis of house dust mites. Central European Journal of Medicine. 2010;5(1):69–74.
- 39. Klimov PB, Oconnor B. Is Permanent Parasitism Reversible?-Critical Evidence from Early Evolution of House Dust Mites. Systematic Biology. 2013;62(3):411–23. PubMed PMID: WOS:000318000600005. pmid:23417682
- 40. Dermauw W, Leeuwen T, Vanholme B, Tirry L. The complete mitochondrial genome of the house dust mite Dermatophagoides pteronyssinus (Trouessart): a novel gene arrangement among arthropods. BMC Genomics. 2009;10(1):1–20. pmid:19284646
- 41. Klimov PB, Bochkov AV, Oconnor BM. Phylogenetic position of the house dust mite subfamily Guatemalichinae (Acariformes: Pyroglyphidae) based on integrated molecular and morphological analyses and different measures of support. Cladistics. 2015;32(3):261–75.
- 42. Huby RDJ, Dearman RJ, Kimber I. Why Are Some Proteins Allergens? Toxicological Sciences. 2000;55(2):235–46. pmid:10828254
- 43. Reithofer M, Jahn-Schmid B. Allergens with Protease Activity from House Dust Mites. International journal of molecular sciences. 2017;18(7):1368. pmid:28653989.
- 44. Muzzi A, Masignani V, Rappuoli R. The pan-genome: towards a knowledge-based discovery of novel targets for vaccines and antibacterials. Drug Discov Today. 2007;12(11–12):429–39. Epub 2007/05/29. pmid:17532526.
- 45. Tutar Y. Pseudogenes. Comparative and functional genomics. 2012;2012:424526–. Epub 2012/05/07. pmid:22611337.
- 46. Jordan IK, Makarova KS, Spouge JL, Wolf YI, Koonin EV. Lineage-specific gene expansions in bacterial and archaeal genomes. Genome Res. 2001;11(4):555–65. Epub 2001/04/03. pmid:11282971; PubMed Central PMCID: PMCPMC311027.
- 47. Guimarães LC, Florczak-Wyspianska J, de Jesus LB, Viana MVC, Silva A, Ramos RTJ, et al. Inside the Pan-genome—Methods and Software Overview. Current genomics. 2015;16(4):245–52. Epub 2015/08/. pmid:27006628.
- 48. Ghazy NA, Suzuki T. Desiccation tolerance in diapausing spider mites Tetranychus urticae and T. kanzawai (Acari: Tetranychidae). Experimental and Applied Acarology. 2014;63(1):49–55. pmid:24306933
- 49. Arlian LG. WATER-BALANCE AND HUMIDITY REQUIREMENTS OF HOUSE DUST MITES. Experimental & Applied Acarology. 1992;16(1–2):15–35. PubMed PMID: WOS:A1992KH47700004.
- 50. Kikawada T, Nakahara Y, Kanamori Y, Iwata KI, Watanabe M, McGee B, et al. Dehydration-induced expression of LEA proteins in an anhydrobiotic chironomid. Biochemical and Biophysical Research Communications. 2006;348(1):56–61. PubMed PMID: WOS:000239796100007. pmid:16875677
- 51. Solomon A, Salomon R, Paperna I, Glazer I. Desiccation stress of entomopathogenic nematodes induces the accumulation of a novel heat-stable protein. Parasitology. 2000;121:409–16. PubMed PMID: WOS:000090071900009. pmid:11072904
- 52. Browne JA, Dolan KM, Tyson T, Goyal K, Tunnacliffe A, Burnell AM. Dehydration-Specific Induction of Hydrophilic Protein Genes in the Anhydrobiotic Nematode Aphelenchus avenae. Eukaryotic Cell. 2004;3(4):966–75. PubMed PMID: PMC500876. pmid:15302829
- 53. Gusev O, Suetsugu Y, Cornette R, Kawashima T, Logacheva MD, Kondrashov AS, et al. Comparative genome sequencing reveals genomic signature of extreme desiccation tolerance in the anhydrobiotic midge. Nature Communications. 2014;5:4784. https://www.nature.com/articles/ncomms5784#supplementary-information. pmid:25216354
- 54. Hyndman SJ, Vickers LM, Htut T, Maunder JW, Peock A, Higenbottam TW. A randomized trial of dehumidification in the control of house dust mite. Clin Exp Allergy. 2000;30(8):1172–80. Epub 2000/08/10. pmid:10931126.
- 55. Radauer C, Nandy A, Ferreira F, Goodman RE, Larsen JN, Lidholm J, et al. Update of the WHO/IUIS Allergen Nomenclature Database based on analysis of allergen sequences. Allergy. 2014;69(4):413–9. pmid:24738154
- 56. Hauser M, Roulias A, Ferreira F, Egger M. Panallergens and their impact on the allergic patient. Allergy, Asthma, and Clinical Immunology: Official Journal of the Canadian Society of Allergy and Clinical Immunology. 2010;6(1):1–. PubMed PMID: PMC2830198. pmid:20298513
- 57. McKenna OE, Asam C, Araujo GR, Roulias A, Goulart LR, Ferreira F. How relevant is panallergen sensitization in the development of allergies? Pediatric Allergy and Immunology. 2016;27(6):560–8. PubMed PMID: PMC5006871. pmid:27129102
- 58. Fæste CK, Jonscher KR, Dooper MMWB, Egge-Jacobsen W, Moen A, Daschner A, et al. Characterisation of potential novel allergens in the fish parasite Anisakis simplex. EuPA open proteomics. 2014;4:140–55. PubMed PMID: PMC4840464. pmid:27110489
- 59. Flückiger S, Fijten H, Whitley P, Blaser K, Crameri R. Cyclophilins, a new family of cross-reactive allergens. European Journal of Immunology. 2001;32(1):10–7.
- 60. Glaser Andreas G, Limacher A, Flückiger S, Scheynius A, Scapozza L, Crameri R. Analysis of the cross-reactivity and of the 1.5 Å crystal structure of the Malassezia sympodialis Mala s 6 allergen, a member of the cyclophilin pan-allergen family. Biochemical Journal. 2006;396(Pt 1):41–9. PubMed PMID: PMC1449988. pmid:16483252
- 61. Joshi SS, Gan HY, Chew FT. Cross Comparison between the IgE Binding Profiles of Cyclophilins from Human, Mouse, Dust Mites and Fungi. Journal of Allergy and Clinical Immunology. 2006;117(2):S120.
- 62. Norbäck D, Markowicz P, Cai G-H, Hashim Z, Ali F, Zheng Y-W, et al. Endotoxin, Ergosterol, Fungal DNA and Allergens in Dust from Schools in Johor Bahru, Malaysia- Associations with Asthma and Respiratory Infections in Pupils. PLOS ONE. 2014;9(2):e88303. pmid:24523884
- 63. Żukiewicz-Sobczak WA. The role of fungi in allergic diseases. Advances in Dermatology and Allergology/Postȩpy Dermatologii I Alergologii. 2013;30(1):42–5. PubMed PMID: PMC3834689. pmid:24278044
- 64. Zhang Z, Biagini Myers JM, Brandt EB, Ryan PH, Lindsey M, Mintz-Cole RA, et al. beta-Glucan exacerbates allergic asthma independent of fungal sensitization and promotes steroid-resistant TH2/TH17 responses. J Allergy Clin Immunol. 2017;139(1):54–65.e8. Epub 2016/05/26. pmid:27221135; PubMed Central PMCID: PMCPMC5073040.
- 65. Snelgrove RJ, Gregory LG, Peiró T, Akthar S, Campbell GA, Walker SA, et al. Alternaria-derived serine protease activity drives IL-33-mediated asthma exacerbations. The Journal of allergy and clinical immunology. 2014;134(3):583–92.e6. pmid:24636086.
- 66. Gough L, Sewell HF, Shakib F. The proteolytic activity of the major dust mite allergen Der p 1 enhances the IgE antibody response to a bystander antigen. Clin Exp Allergy. 2001;31(10):1594–8. Epub 2001/10/27. pmid:11678860.
- 67. Bessarabova M, Ishkin A, JeBailey L, Nikolskaya T, Nikolsky Y. Knowledge-based analysis of proteomics data. BMC bioinformatics. 2012;13 Suppl 16(Suppl 16):S13-S. pmid:23176192.
- 68. Bordas-Le Floch V, Le Mignon M, Bussieres L, Jain K, Martelet A, Baron-Bodo V, et al. A combined transcriptome and proteome analysis extends the allergome of house dust mite Dermatophagoides species. PLoS One. 2017;12(10):e0185830. Epub 2017/10/06. pmid:28982170; PubMed Central PMCID: PMCPMC5628879.
- 69. Erban T, Harant K, Hubert J. Detailed two-dimensional gel proteomic mapping of the feces of the house dust mite Dermatophagoides pteronyssinus and comparison with D. farinae: Reduced trypsin protease content in D. pteronyssinus and different isoforms. J Proteomics. 2017;162:11–9. Epub 2017/04/27. pmid:28442447.
- 70. Choopong J, Reamtone O, Sookrung N, Seesuay W, Indrawattana N, Sakolvaree Y, et al. Proteome, Allergenome, and Novel Allergens of House Dust Mite, Dermatophagoides farinae. Journal of Proteome Research. 2016;15(2):422–30. PubMed PMID: WOS:000369771700007. pmid:26754146
- 71. An S, Chen LL, Long CB, Liu XY, Xu XM, Lu XR, et al. Dermatophagoides farinae Allergens Diversity Identification by Proteomics. Molecular & Cellular Proteomics. 2013;12(7):1818–28. PubMed PMID: WOS:000321559000006. pmid:23481662
- 72. Hart BJ, Crowther D, Wilkinson T, Biddulph P, Ucci M, Pretlove S, et al. Reproduction and development of laboratory and wild house dust mites (Acari: Pyroglyphidae) and their relationship to the natural dust ecosystem. Journal of Medical Entomology. 2007;44(4):568–74. PubMed PMID: WOS:000247748400003. pmid:17695009
- 73. Klont F, Bras L, Wolters JC, Ongay S, Bischoff R, Halmos GB, et al. Assessment of Sample Preparation Bias in Mass Spectrometry-Based Proteomics. Anal Chem. 2018;90(8):5405–13. Epub 2018/04/03. pmid:29608294; PubMed Central PMCID: PMCPMC5906755.
- 74. Tsiatsiani L, Heck AJ. Proteomics beyond trypsin. Febs j. 2015;282(14):2612–26. Epub 2015/04/01. pmid:25823410.
- 75. Fonslow BR, Stein BD, Webb KJ, Xu T, Choi J, Park SK, et al. Digestion and depletion of abundant proteins improves proteomic coverage. Nature methods. 2013;10(1):54–6. PubMed PMID: PMC3531578. pmid:23160281
- 76. Casset A, Mari A, Purohit A, Resch Y, Weghofer M, Ferrara R, et al. Varying allergen composition and content affects the in vivo allergenic activity of commercial Dermatophagoides pteronyssinus extracts. Int Arch Allergy Immunol. 2012;159(3):253–62. Epub 2012/06/23. pmid:22722650; PubMed Central PMCID: PMCPMC4594775.
- 77.
Guideline on the Clinical Development of Products for Specific Immunotherapy for the Treatment of Allergic Diseases, (2008).
- 78. Vidal-Quist JC, Ortego F, Lombardero M, Castanera P, Hernandez-Crespo P. Allergen expression in the European house dust mite Dermatophagoides pteronyssinus throughout development and response to environmental conditions. Med Vet Entomol. 2015;29(2):137–46. pmid:25565338.
- 79. Weghofer M, Grote M, Resch Y, Casset A, Kneidinger M, Kopec J, et al. Identification of Der p 23, a peritrophin-like protein, as a new major Dermatophagoides pteronyssinus allergen associated with the peritrophic matrix of mite fecal pellets. J Immunol. 2013;190(7):3059–67. Epub 2013/03/06. pmid:23460742; PubMed Central PMCID: PMCPMC4582595.
- 80. Andersen A. Nutritional value of yeast for Dermatophagoides pteronyssinus (Acari: Epidermoptidae) and the antigenic and allergenic composition of extracts during extended culturing. J Med Entomol. 1991;28(4):487–91. Epub 1991/07/01. pmid:1941906.
- 81. Hubert J, Kopecky J, Sagova-Mareckova M, Nesvorna M, Zurek L, Erban T. Assessment of Bacterial Communities in Thirteen Species of Laboratory-Cultured Domestic Mites (Acari: Acaridida). J Econ Entomol. 2016;109(4):1887–96. Epub 2016/04/29. pmid:27122496.
- 82. Eraso E, Guisantes JA, Martinez J, Saenz-de-Santamaria M, Martinez A, Palacios R, et al. Kinetics of allergen expression in cultures of house dust mites, Dermatophagoides pteronyssinus and D. farinae (Acari: Pyroglyphidae). J Med Entomol. 1997;34(6):684–9. Epub 1998/01/24. pmid:9439123.
- 83. Arlian LG, Morgan MS. Reproductive biology of Euroglyphus maynei with comparisons to Dermatophagoides farinae and D. pteronyssinus. Exp Appl Acarol. 2015;66(1):1–9. Epub 2015/01/31. pmid:25633266.
- 84. Naegele A, Reboux G, Scherer E, Roussel S, Millon L. Fungal food choices of Dermatophagoides farinae affect indoor fungi selection and dispersal. International Journal of Environmental Health Research. 2013;23(2):91–5. PubMed PMID: WOS:000312440500001. pmid:22774849
- 85. Erban T, Hubert J. Digestive function of lysozyme in synanthropic acaridid mites enables utilization of bacteria as a food source. Exp Appl Acarol. 2008;44(3):199–212. Epub 2008/03/22. pmid:18357505.
- 86. Tang VH, Chang BJ, Srinivasan A, Mathaba LT, Harnett GB, Stewart GA. Skin-associated Bacillus, staphylococcal and micrococcal species from the house dust mite, Dermatophagoides pteronyssinus and bacteriolytic enzymes. Experimental and Applied Acarology. 2013;61(4):431–47. PubMed PMID: WOS:000326398200004. pmid:23783892
- 87. Avula-Poola S, Morgan MS, Arlian LG. Diet Influences Growth Rates and Allergen and Endotoxin Contents of Cultured Dermatophagoides farinae and Dermatophagoides pteronyssinus House Dust Mites. International Archives of Allergy and Immunology. 2012;159(3):226–34. pmid:22722476
- 88. Merzendorfer H, Zimoch L. Chitin metabolism in insects: structure, function and regulation of chitin synthases and chitinases. Journal of Experimental Biology. 2003;206(24):4393.
- 89. Zhan Z-k, Ji K-m, Liu X-y, Liu Z-g, Li M, Chen J-j, et al. Monoclonal antibodies against recombinant Der f 3 reveal localization of Der f 3 in the gut and faecal pellets of Dermatophagoides farinae. Experimental and Applied Acarology. 2010;52(1):63–71. pmid:20229323