Skip to main content
Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Group 1 Allergen Genes in Two Species of House Dust Mites, Dermatophagoides farinae and D. pteronyssinus (Acari: Pyroglyphidae): Direct Sequencing, Characterization and Polymorphism

  • Rubaba Hamid Shafique ,

    Affiliation Department of Zoology, Pir Mehr Ali Shah Arid Agriculture University Rawalpindi, Rawalpindi, Pakistan

  • Pavel B. Klimov,

    Affiliations Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan, United States of America, Faculty of Biology, Tyumen State University, Tyumen, Russia

  • Muhammad Inam,

    Affiliation Physiology Laboratory, Department of Zoology, University of the Punjab, Lahore, Pakistan

  • Farhana Riaz Chaudhary,

    Affiliation Department of Zoology, Pir Mehr Ali Shah Arid Agriculture University Rawalpindi, Rawalpindi, Pakistan

  • Barry M. OConnor

    Affiliation Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan, United States of America


Group 1 allergens of Dermatophagoides farinae (Der f 1) and D. pteronyssinus (Der p 1) dominate overall allergic responses in house dust mite allergy patients. The need for accurate identification and characterization of representative variants of group 1 allergens in any given geographic locality has been emphasized for development of appropriate allergen extracts. Regional amino acid sequence polymorphism has been described but the extent of this polymorphism is not well understood. Such data are completely absent for the USA and many other countries. Most previous studies used cDNA libraries generated by reverse transcriptase (RT-PCR) and/or primers amplifying shorter fragments of this gene. Using novel species-specific primers and direct PCR, we document group 1 allergen gene sequence polymorphism in populations of D. farinae and D. pteronyssinus from the USA and Pakistan. We report two novel introns (nt pos 87 and 291) in both species, and the absence of intron 3 in Der p 1. Thirteen silent and one novel non-synonymous mutation (Tryptophan W197 to Arginine R197) were detected in D. farinae. The potential medical significance of the latter mutation is discussed. Two haplotypes of the Der f 1 gene were identified, haplotype 1 (63%) was more frequent than haplotype 2 (18%). Polymorphism in Der f 1 displayed geographical localization, since both haplotypes were present in mite populations from Pakistan whereas haplotype 1 was observed only in the USA. In Der p 1, a silent mutation at nt (aa) position 1011(149) and four non-synonymous mutations at positions 589(50), 935(124), 971(136), 1268(215) were observed. These mutations were reported from many other geographic regions, suggesting that polymorphism in the Der p 1 gene is panmictic. The extent of polymorphism in both genes is substantially lower than that reported previously (0.10–0.16% vs 0.31–0.49%), indicating the need for careful evaluation of potential polymerase errors in studies utilizing RT-PCR.


Pyroglyphid house dust mites of the genus Dermatophagoides are important sources of allergens in the indoor environment of human dwellings [1], causing allergic diseases such as asthma, rhinitis and atopic dermatitis, in millions of people worldwide [2]. Over 30 different proteins and macromolecules are known to produce IgE-binding reactions in patients allergic to house dust mites [3]. Among these molecules, group 1 allergens of Dermatophagoides farinae and D. pteronyssinus (Der f 1 and Der p 1) dominate overall allergic responses [3][5]. Group 1 allergens are cysteine proteases (proteolytic enzymes) [6][8], having the ability to induce pro-inflammatory response by breaking lung epithelium [9], [10]. Earlier reports described local variants of group 1 allergens from different geographical regions in Thailand, Korea, China, Australia, and the UK [11][20]. Variation data reports come from predicted amino acid sequences based on either amplification from genomic DNA [20], [21] or, more frequently, cDNA libraries obtained through reverse transcriptase polymerase chain reaction (RT-PCR) and subsequent cloning [11], [14], [16], [21]. Because reverse transcriptase is not proof reading [22], it is not surprising that a higher number of mutations were reported in the latter studies.

Amino acid sequence variations can influence IgE binding reactivity of allergens [23]. Single amino acid mutations can alter inflammatory cytokine production of T cells specific for Der p 1 [24], [25]. It is possible that these mutations influence the inherent allergenicity of particular variants and contribute to differential IgE binding frequency by increasing diversity of epitopes [26].

Der f 1 and Der p 1 share 81% amino acid sequence identity [27][29] (83%, our data), due to which cross reactivity exists among these two allergens [27][29]. Despite this relatively high degree of homology and cross reactivity, monoclonal antibodies (mAbs) produced against Der p 1 and Der f 1 are species specific [27], [28]. This contradictory behavior may be attributed to the position of IgE binding epitopes in allergen molecules. Epitope residues that are present in high homology regions of the allergen explain cross reactivity between Der f 1 and Der p 1 allergens, while the part of the IgE binding epitope found in variable regions may result in species specificity. An example of the first case is 4C1 anti Der f 1 mAb [28], which binds to a cross reactive (conserved) epitope on both Der f 1 and Der p 1. This set of amino acids includes Glutamic acid (E)14, Aspartic acid (D)16, Arginine (R)18, Serine (S)19, Arginine (R)21, Glycine (G)156, Arginine (R)157, Isoleucine (I)159, Threonine (T)181, Glutamine (Q)182, Tyrosine (Y)186, Aspartic acid (D)199, Tyrosine (Y)202 and Tyrosine (Y)204. The second part of this epitope is a calcium (Ca+) binding residue on the allergen molecule comprising four amino acids: Aspartic acid (D)57, Leucine (L)58, Glutamic acid (E)60 and Glutamic acid (E)92 [30]. Analysis of these amino acid residues may help to predict cross reactivity in allergens from different mite species. As mutations in some IgE binding epitopes may affect both cross-reactivity and specificity of monoclonal antibodies, allergen diversity both among and within species should be taken into consideration for development of appropriate allergen extracts. This accentuates the need for accurate identification and characterization of representative variants in any given geographic locality. Regional amino acid sequence polymorphism and the extent of this polymorphism are poorly studied, and such data are completely absent for many countries, including the USA and Pakistan. Our paper is a study of within- and among- species polymorphism in the group 1 allergen gene in two medically important species of house dust mites (Dermatophagoides farinae and D. pteronyssinus) collected in the USA and Pakistan. We compare results (direct PCR) with previous data (RT-PCR) and provide a comparative analysis of group 1 allergen peptide sequences of different mite species.

Materials and Methods

Collection of dust samples

Samples were obtained from domiciliary dust of beds and sofas in Pothwar, Pakistan (33.60°N 73.03°E) during July 2012. Sampling was done with a household vacuum cleaner specially modified to collect dust according to a previously described protocol [31]. Mites were isolated using the saturated sodium chloride floatation method [32]. The supernatant was filtered through 45µm filter paper. Mites were removed with a mounting needle under a dissecting microscope and stored in 70% ethanol at −20°C (later transferred into 96% ethanol). Specimens of D. farinae from the USA were obtained from a laboratory culture maintained at the University of Michigan, Museum of Zoology (started from multiple specimens collected locally 42.27°N 83.73°W in 2005). Specimens of D. pteronyssinus originated from cultures in Greer Laboratories, North Carolina, USA. Exact collection localities are available as GenBank metadata deposited along with our sequences.

DNA extraction, amplification, sequencing, and alignment

DNA was extracted with a QIAamp DNA Micro Kit (Qiagen). A single mite specimen was isolated with a mounting needle and placed on a drop of lysis buffer (Buffer ATL) in a cavity glass slide. Under a dissection microscope, the specimen was teased to break the exoskeleton and then transferred into a 1.5 ml microcentrifuge tube containing 180 µl of buffer ATL. The remaining steps followed the manufacturers’ protocol for tissue samples. DNA was eluted in 30µl of AE buffer and stored at −20°C.

Der p 1 and Der f 1 genes were amplified by nested PCR. For each species, two sets of species-specific primers were designed in Primer3 to amplify almost the entire coding region of the gene [33]. For uniform sequencing, T3 and SP6 tails were added to the forward inner and reverse inner primers respectively (Table 1). PCR was performed in 20 µl volume with Platinum Taq DNA Polymerase (Invitrogen). The master mix for initial PCR contained 2.0 µl of PCR buffer, 1.4 µl MgSO4 (50 mM) and dNTPs (10 mM each), 0.8 µl of primers (10 µM of each forward and reverse), 0.12 µl of Platinum Taq polymerase (1.5U) and 0.6–1 µl of DNA template (depending on DNA concentration in the sample), the total volume was made up to 20 µl with distilled water. The thermocycler protocol was as follows: 94°C, 2 min; [94°C, 30 sec; 48°C, 35 sec; 72°C, 2 min]×35 cycles; 72°C for 7 min. PCR products were kept at 4°C until the second PCR was performed. For the second PCR (inner primers) the master mix was modified with a reduced quantity of Taq Polymerase 0.08 µl (1.0 U) and 0.6 µl of PCR products from the first PCR. The thermocycler protocol was set as above, except for the annealing step (50°C for 38 sec), the extension step (1.50 min), and the total number of cycles (38). PCR products were run on 1.5% agarose gel, bands were excised under UV light, and DNA was purified with a QIAquick gel extraction kit (Qiagen).

Table 1. Species-specific oligonucleotide primers used in nested amplification of the group 1 allergen gene (Der f 1 and Der p 1).

Sequencing was done with an Applied Biosystems 3730 XL DNA Sequencer at the University of Michigan sequencing core. Sequences were analyzed in Sequencher ver. 5.0. Because our primers overlap small coding fragments at the 5′ end of the gene, our sequences were partial (22 nt missing for Der f 1 and 4 nt missing for Der p 1 at the 5′ end). These sequences were submitted to GenBank, accession numbers KJ542064 through KJ542097 (Table 2). However, in this paper, for simplicity, we assume that our sequences are complete and use GenBank data for cDNA of preproenzyme (AB034946 for Der f 1 and U11695.1for Der p 1) for the short missing 5′ ends. Thus, position 1 of our alignment coordinates is the first nucleotide of the start codon. Intron identification and translation into polypeptide was done in Mesquite ver. 2.75. After computer-assisted translation, preproenzyme sequences (full length peptides) were obtained. Three regions on the peptides were identified: signal or leader peptide (pre), inactive enzyme (proenzyme) and mature enzyme. In amino acid sequences, the starting position was set to the 1st amino acid of the mature peptide, whereas the signal peptides and proenzyme regions were given negative coordinates [34]. Homologous group 1 allergen DNA and protein sequences were retrieved from the National Centre for Biotechnology Information (NCBI) nucleotide, EST, and protein databases using blastn and blastp. Resulting sequences, Euroglyphus maynei (AAC82351), Psoroptes ovis (CAK32515), Sarcoptes scabiei (AAS93667), Acarus siro (ABU50820), Blomia tropicalis (AAQ24541) and Tyrophagus putrescentiae (ABM53753), were aligned with D. farinae (KJ542065) and D. pteronyssinus (KJ542087). Alignment was done in Clustal Omega. Signal peptide prediction of all selected cysteine proteases was done in SignalP ver4.1 [35]. The 3-dimensional structure and function prediction of selected proteins was carried out by iterative threading assembly refinement (I-TASSER) server [36][38].

Table 2. Exons, introns, and sequence polymorphism in the group 1 allergen-encoding gene of two species of house dust mites from Pakistan and the USA.


Der f 1 gene polymorphism

Twenty-two Der f 1 gene sequences, including six from the USA, were analyzed. The length of the gene (from start to stop codon), including six exons and five introns was 1278 base pairs (bp). Of the five introns we detected, two (intron1 and 2, at mRNA nucleotide positions (nt. pos) 87 and 291, respectively) were not reported previously (Table 2, Fig. 1). Seven mutations were observed in non-translated regions (introns) of the gene (Table 2).

Figure 1. Gene map of Der f 1 (A) and Der p 1 (B) allergens amplified by nested PCR.

UTR: untranslated regions; (numbers): nucleotide position for introns. The blank region with dashed outlines at exon 1 indicates partial 5′end of genes (given our sequences). T3/SP4 tails (see table 1) in primer sequences are not shown.

All D. farinae sequences show 99–100% homology, with two distinct haplotypes (H1, H2, S1 Figure). Haplotype 1 (e.g., RS17 and RS20) was the more frequent (63.5%, 14/22), followed by haplotype 2 (18%, 4/22, e.g., RS27 and RS30) and heterozygous variants (18%, 4/22, e. g., RS26 and RS40). Der f 1 sequences of all USA specimens (e.g., RS03) were identical to the H1 sequence from the Pakistan population (S1 Figure). Mutations were observed at 14 different positions along the whole length of the sequenced gene, where seven mutations were in the introns (non-translated region). Mutations (substitutions) in the exons were at nt pos 600, 794, 978, 9926, 1014, 1207 and 1211. Table 2 shows the corresponding amino acid positions and translated amino acid at each substitution site. Of these, all but one mutation were silent. The single non-silent mutation was observed in variant RS31 (KJ542072) where amino acid Tryptophan (W) was substituted by Arginine (R) at nt(aa) pos 1211(197) (Table 2). This novel mutation occurred in the active site of the mature enzyme (Fig. 2). Secondary and tertiary protein structure prediction indicated slight difference between these two variants whereas no predicted function difference was observed (Fig. 3).

Figure 2. Alignment of group 1 allergens (Cysteine proteases) of selected mite species.

Active site regions are shown in yellow, amino acid residues making 4C1 binding epitope (pink) and calcium binding epitope (green); “*”: identical, “:”: conserved and “.”: semi-conserved sites.

Figure 3. I-TASSER result showing tertiary structure predictions for RS33 (Tryptophan variant) and RS31 (Arginine variant) of Der f 1 protein [36][38].

A: specimen RS33; B: specimen RS31.

Der p 1 gene polymorphism

Twelve Der p 1 gene sequences were obtained, including four from the USA. The length of the gene (from start to stop codon), including five exons and four introns was 1248–1250 bp, owing to a variable poly-T region in intron1 (S2 Figure). The key difference between Der p 1 and Der f 1 genes is the absence of intron 3 (length 58 nt) in the Der p 1 gene (Fig. 1). Although intron 3 was known for Der f 1, no such data existed for Der p 1. Four mutations were observed in non-translated regions (introns) (Table 2). D. pteronyssinus samples obtained from USA and Pakistan displayed polymorphism at five nucleotide positions in the coding regions (exons). Deduced amino acid sequences showed four non-synonymous substitutions at positions [nt (aa)]: 589 (50), 935(124), 971(136), 1268(215) and a synonymous mutation at 1011(149) (Table 2). In contrast to D. farinae, all the sequences of Der p 1 were unique, differing by 1–2 amino acid residues, but no distinct haplotypes were observed (S2 Figure). Comparison of our results and data from previous studies showed that mutations in Der p 1 aa pos (Y->H)50, (V->A)124, (T->S)136 and (Q->E/K/G)215 were most frequently reported (Table 3). Nearly 20 other mutations that were sporadically observed in Der p 1 previously [11][20] were not found in our survey.

Table 3. Geographical polymorphism in the Der p 1 allergen.

Polypeptide analysis

The percent identity tree of group 1 allergens shows close similarity of pyroglyphid dust mites (D. farinae, E. maynei and D. pteronyssinus) with the psoroptid mange mite, P. ovis (Psoroptidae); whereas large phylogenetic distances were found between the pyroglyphid mites and species from the storage mite families Echimyopodidae (B. tropicalis) and Acaridae (A. siro and T. putrescentiae) (Fig. 4, Table 4). A closer evolutionary relationship of E. maynei with D. pteronyssinus was observed in the tree whereas D. farinae and D. pteronyssinus were more distantly placed (Fig. 4).

Figure 4. Maximum likelihood tree of the group 1 allergen protein of acariform mites.

For each node, nonparametric bootstrap support values (100 replicates) are indicated. The tree was inferred in RAxML ver.7.5.5 [39] using the WAGF model of amino acid evolution as found in PartitionFinder ver. 1.1.1 [40].

Table 4. Percent identity matrix of aligned group1 allergens.

The total length of the translated Der f 1 polypeptide was 321 amino acids. This included the signal peptide (with the C-terminus at pos −81), the proenzyme part (80 amino acids, pos −80 to −1), and the mature enzyme (223 amino acids, pos 1 to 223). Coordinates for Der p 1 were similar, except for the mature enzyme, which had a single deletion at aa pos 9, therefore, its length was 222 amino acids (Fig. 5).

Figure 5. Translated Der f 1 polypeptide aligned with cysteine proteases of other mite species:

Dermatophagoides pteronyssinus (Der p 1: present study, and U11695.1for Der p 1 to complete our partial signal peptide sequence), Euroglyphus maynei (Eur m 1:AAC82352.), Psoroptes ovis (Pso o 1:Q1EIQ3.1), Sarcoptes scabiei (Sar s 1: AAS93667.1), Acarus siro (Acr s: ABU50820.1), Blomia tropicalis (Blo t 1: AAQ24541.1) and Tyrophagus putrescentiae (Tyr p 1: ABM53753.1). Numbers inside triangles indicate the number of amino acids (aa). The “No data” box indicates missing 5′ ends.). AB034946 sequence was used to complete our partial Der f 1 signal peptide sequence.

The cleavage sites for signal peptides predicted for all cysteine proteases in this study were between aa pos −80 and −81. The length of signal peptides in Der f 1, Der p 1, Eur m 1, Pso o 1 and Blo t 1 was 18 amino acids, whereas Sar s 1, Aca s 1 and Try p 1 were 24, 15 and 17 amino acids long, respectively. Identification of proenzyme regions was based on the length of the signal peptide and mature enzyme (Fig. 5). No insertions or deletions were found in the Eur m 1 protein, whereas in Pso o 1 there was an insertion between aa pos −40 and −41 in the proenzyme region. In Sar s 1, Aca s 1, Blo t 1 and Tyr p 1, 6 to 7 insertions and 1 to 5 deletions were observed (Fig. 5).

Forty-four identical amino acid residues were observed in the alignment. The active sites13 were well conserved: the first region (pos 29–40, length 11 aa) has an 80% mean similarity; the second (pos 170–180, length 11) 72%, and the third (pos 186–205, length 20) 79% mean identity (Fig. 2).

Amino acid residues making the 4C1 mAb binding epitope for Der f 1 were aligned with cysteine proteases of other mite species (Fig. 2, Table 5). Our analysis revealed its high identity with Eur m 1 (100%), Der p 1 (85.7%) and Pso o 1 (78.87%), varying from 2 to 3 amino acid residues, whereas the Ca+ ion binding region demonstrated a 100% identity (Fig. 2, Table 5).

Table 5. Comparison of group 1 allergen conserved amino acid residues involved in 4C1mAb and Ca+ binding epitopes in eight mite species.


Group 1 allergens of house dust mites are medically important since they show high IgE binding frequencies and are commonly used in diagnostic tests (e.g., the skin prick test) and immunotherapeutic management of house dust mite allergy patients. Polymorphism in group 1 allergens in different geographical regions has been of great concern because it may affect the efficacy of allergy tests and treatment of the allergic disease. GenBank data available to date are mostly based on cDNA libraries produced by amplifying mRNA using RT-PCR with subsequent cloning of PCR products. Unfortunately, these studies made little effort to distinguish between potential polymerase errors (a reverse transcriptase is non-proofreading) and actual sequence polymorphism. In this study, we employed direct gene amplification and sequencing of the two most important house dust mite species with the aim to reduce artifacts that may be introduced by the non-proofreading reverse transcriptase. This technique is also less labor intensive, so results can be obtained faster in future studies. Der f 1 allergen polymorphism observed in the present study shows two haplotypes. Haplotype 1 from the USA and Pakistan exactly matches with partial mRNA variants from Korea12 China13 and the UK18. In contrast, haplotype 2 detected in Pakistan specimens showed 100% similarity with variants reported from Thailand9 and China [15].

Almost all mutations (13) observed in our study were silent substitutions, however, there was one novel non-silent mutation (tryptophan to arginine) at aa pos 197. This mutation lies within the active region of the mature protein (Fig. 4) [39], [40]. Tryptophan (W) is an aromatic amino acid with a large side chain pointing into the core between α helices of the polypeptide. Its side chain makes many hydrophobic interactions. The amino acid arginine (R) is polar positively charged and can only make a few of these interactions, thus, potentially destabilizing the active site domain [41]. Although no significant change in structure and function was predicted, there still is a need to investigate the effect of this mutation on the properties of this peptide. This mutation might alter the enzyme activity of cysteine protease but since it does not lie in the IgE binding epitope residue, therefore, it may not affect the allergenic properties, immune response, and cross-reactivity of the protein. Further investigations may help to confirm this hypothesis.

In Der p 1, sporadic substitutions of amino acids have been reported previously [11][20]. However, at least some of them may actually represent artifacts introduced by polymerase errors. For example, only single occurrence of an amino acid substitution was reported at several aa positions: 19 (L->M), 21(P->T), 44 (D->E), 125(S->N), 129 (K->E) and 138 (M->I) [11], [21]. Immune response to polymorphic peptides with these substitutions was either reduced or absent, whereas polymorphic peptides with more frequent substitutions at aa pos 50 (Y->H), 124 (V->A), 136 (T->S) and 215(Q->E/K/G) were able to induce a T cell response, indicating their role in differential inflammatory cytokine production of T cells [21]. Additional, albeit indirect evidence for the presence of potential RT-PCR artifacts in published sequences is the substantial difference in percentages of mutations per sequenced nucleotide for GenBank cDNA data versus our data: 0.3071 vs 0.1614 for D. farinae and 0.4866 vs 0.0948 for D. pteronyssinus. These two lines of evidence support our argument that some clones reported in the literature may be artifacts of RT-PCR.

Chua et al (1993) reported six variants of Der p 1 from Australia using RT-PCR, including five non-synonymous (aa pos 50, 81, 124, 136 and 215) and one synonymous mutation aa pos 149 [16]. Results of our study coincide with five of these reported substitutions [16]. This probably indicates the panmictic nature of D. pteronyssinus populations. Mutations at aa pos 50(Y->H) and 124 (V->A) are the most frequent substitutions and have been shown to strongly affect the T cell response in humans and mouse [14], [16], [21]. It is now recognized that amino acids Y50, V124, T136 and Q215 are common in Der p 1 and Der f 1 at these sites. The effect of these amino acid substitutions needs to be studied in the future for the development of species-specific monoclonal antibodies.

The predicted Der f 1 allergen sequence in our study shows a high percent homology with Eur m 1 (85.58%) suggesting a closer phylogenetic relationship to Der f 1, although D. farinae and D. pteronyssinus are currently taxonomically classified in the same genus. However, recent molecular phylogenetic studies based on different genes also support the close relationship of D. farinae and E. maynei [42]. The ordered distances of group 1 allergen protein agree with the phylogenetic distances of these taxa inferred using five independent genes [42].

Similarly, there was 100% homology in the second active site residue (aa pos 186–205), IgE-binding epitopes and in the Ca+ binding epitopes of Der f 1 and Eur m 1. This is supported by earlier reports of a greater homology between these two mite allergens and evidence of cross reactivity between them [30], [43]. Der p 1 epitopes were also highly conserved (86%) where only serine (S19) was replaced by glutamine (Q). This may be the cause of cross reactivity reported earlier between Der f 1, Der p 1 and Eur m 1 [44]. Pso o 1 allergen also shows a 100% conservation of the Ca+ binding epitope residue, 79% 4C1 mAb epitope homology and 69% complete protein identity score. This explains the cross-antigenicity between allergens of house dust mites and other parasitic psoroptidians [45]. On the other hand, the complete absence of any cross reactivity between Der p 1 and Blo t 1 [46] is supported by the large phylogenetic distance between group 1 allergens of pyroglyphid and echimyopodid mites (Blomia).

In conclusion, our results indicate that very little polymorphism occurs in the group 1 allergen gene of D. farinae, where all but one mutation were silent and do not affect the primary structure of this protein. The discovery of a novel Trp→Arg mutation in the active site of the enzyme is the most exciting finding of our work. Further experiments are required to estimate the frequency of this novel Der f 1 allele. In this study, a substantial amino acid variation is present in D. pteronyssinus, but the number of variants is far fewer than reported earlier. In order to eliminate RT-PCR artifacts as a probable cause of these variations, we suggest that direct sequencing technique should be utilized in the study of genetic polymorphism. Polymorphism in Der f 1 gene did show some geographic distribution patterns; haplotype 1 is more common and widely distributed as compared to haplotype 2. Der p 1 gene polymorphism is panmictic and does not show any geographically localized variants. Our analysis of group 1 allergen proteins from different mite species confirms a close evolutionary relationship between pyroglyphids and parasitic psoroptid mange mites.

Supporting Information

S1 Figure.

Selected sequences of the Der f 1 gene showing to two distinct haplotypes: Haplotype1-rows 1, 2 and 3 (RS20, RS17, RS03_USA); haplotype2-rows 4 and 5(Df_RS27 and Df_RS30); heterozygous - rows 6 and 7 (RS26 and RS40); and gb|Der-f1 gene (GenBank Accession number X65196).


S2 Figure.

Selected Der p 1 gene sequences aligned in NCBI blastn. gb|Der-p1gene (GenBank accession number X65197.1).



We thank Drs A. S. Flynt (University of Southern Mississippi) and E. Foot Perkowski (University of North Carolina at Chapel Hill) for comments on the earlier version of the manuscript.

Author Contributions

Conceived and designed the experiments: RHS. Performed the experiments: RHS PBK MI. Analyzed the data: RHS PBK MI. Contributed reagents/materials/analysis tools: RHS PBK. Contributed to the writing of the manuscript: RHS PBK MI. Supervisors: BMO FRC.


  1. 1. Thomas WR, Heinrich TK, Smith WA, Hales BJ (2007) Pyroglyphid house dust mite allergens. Protein Pept Lett 14:943–953.
  2. 2. Milián E, Díaz AM (2004) Allergy to house dust mites and asthma. PR Health Sci J 23:47–57.
  3. 3. Thomas WR, Smith WA, Hales BJ, Mills KL, O’Brien RM (2002) Characterization and immunobiology of house dust mite allergens. Int. Arch. Allergy Immunol 129:1–18.
  4. 4. Platts-Mills TA, Thomas WR, Aalberse RC, Vervloet D, Champman MD. (1992) Dust mite allergens and asthma: report of a second international workshop. J Allergy Clin Immunol 89:1046–1060.
  5. 5. Chapman MD, Platts-Mills TA (1978) Measurement of IgG, IgA and IgE antibodies to Dermatophagoides pteronyssinus by antigen-binding assay, using a partially purified fraction of mite extract (F4P1). Clin Exp Immunol 34:126.
  6. 6. Simpson RJ, Nice EC, Moritz RL, Stewart GA (1988) Structural studies on the allergen Der p1 from the house dust mite Dermatophagoides pteronyssinus: similarity with cysteine proteinases. Protein Seq Data Anal 2:17–21.
  7. 7. Hewitt CR, Horton H, Jones RM, Pritchard DI (1997) Heterogeneous proteolytic specificity and activity of the house dust mite proteinase allergen Der p I. Clin Exp Allergy. 27:201–207.
  8. 8. Chua K, Stewart GA, Thomas WR, Simpson RJ, Dilworth RJ, et al. (1988) Sequence analysis of cDNA coding for a major house dust mite allergen, Der p 1. Homology with cysteine proteases. J Exp Med 167:175–182.
  9. 9. Herbert CA, King CM, Ring PC, Holgate ST, Stewart GA, et al. (1995) Augmentation of permeability in the bronchial epithelium by the house dust mite allergen Der p1. Am J Respir Cell Mol Biol 12:369–78.
  10. 10. Wan H, Winton HL, Soeller C, Gruenert DC, Thompson PJ, et al. (2000) Quantitative structural and biochemical analyses of tight junction dynamics following exposure of epithelial cells to house dust mite allergen Der p 1. Clin Exp Allergy 30:685–698.
  11. 11. Piboonpocanun S, Malainual N, Jirapongsananuruk O, Vichyanond P, Thomas WR (2006) Genetic polymorphisms of major house dust mite allergens. Clin Exp Allergy 36:510–516.
  12. 12. Dreborg S, Einarsson R (1992) The major allergen content of allergenic preparations reflects their biological activity. Allergy 47:418–423.
  13. 13. van Ree R (2007) Indoor allergens: Relevance of major allergen measurements and standardization. J Allergy Clin Immunol 119:270–277.
  14. 14. Jeong KY, Lee IY, Yong TS, Lee JH, Kim EJ, et al. (2012) Sequence polymorphisms of Der f 1, Der p 1, Der f 2 and Der p 2 from Korean house dust mite isolates. Exp Appl Acarol 58:35–42.
  15. 15. Cui YB, Peng JL, Zhou P, Peng M (2008) Sequence analysis and structure prediction of the complete prepro-form group 1 allergen of adult Dermatophagoides farinae (Hughes) (Acari: Pyroglyphidae) from China. Int J Acarol 34:3–12.
  16. 16. Chua KY, Kehal PK, Thomas WR (1993) Sequence polymorphisms of cDNA clones encoding the mite allergen Der p I. Int Arch Allergy Immunol. 101:364–368.
  17. 17. Chua KY, Huang CH, Shen HD, Thomas WR (1996) Analysis of sequence polymorphism of a major mite allergen, Der p 2. Clin Exp Allergy 26:829–837.
  18. 18. Nishiyama C, Yuuki T, Takai T, Okumura Y, Okudaira H (1993) Determination of Three Disulfide Bonds in a Major House Dust Mite Allergen, Der f ll. Int Arch Allergy Immunol 101:159–166.
  19. 19. Olsson S, van Hage-Hamsten M, Whitley P, Johansson E, Hoffman DR, et al. (1998) Expression of two isoforms of Lep d 2, the major allergen of Lepidoglyphus destructor, in both prokaryotic and eukaryotic systems. Clin Exp Allergy 28:984–991.
  20. 20. Kent NA, Hill MR, Keen JN, Holland PW, Hart BJ (1992) Molecular characterization of the group I allergen Eur m I from house dust mite Euroglyphus maynei. Int Arch Allergy Immunol 99:150–152.
  21. 21. Smith WA, Hales BJ, Jarnicki AG, Thomas WR (2001) Allergens of wild house dust mites: environmental Der p 1 and Der p 2 sequence polymorphisms. J Allergy Clin Immunol 107:985–992.
  22. 22. Bebenek K, Kunkel TA (1993) 5 The Fidelity of Retroviral Reverse Transcriptases. Cold Spring Harbor Monogr Ser 23:85–102.
  23. 23. Park JW, Kim KS, Jin HS, Kim CW, Kang DB, et al. (2002) Der p 2 isoallergens have different allergenicity, and quantification with 2-site ELISA using monoclonal antibodies is influenced by the isoallergens. Clin Exp Allergy 32:1042–1047.
  24. 24. Hales BJ, Hazell LA, Smith WA, Thomas WR (2002) Genetic variation of Der p 2 allergens: effects on T cell responses and immunoglobulin E binding. Clin Exp Allergy 32:1461–1467.
  25. 25. Thomas WR, Smith WA, Hales BJ, Carter MD (1997) Functional effects of polymorphisms of house dust mite allergens. Int Arch Allergy Immunol 113:96–98.
  26. 26. Thomas WR (2010) Geography of house dust mite allergens. Asian Pac J Allergy Immunol 28:211–224.
  27. 27. Heymann PW, Chapman MD, Platts-Mills TA (1986) Antigen Der f 1 from the house dust mite Dermatophagoides farinae: structural comparison with Der pI from Dermatophagoides pteronyssinus and epitope specificity of murine IgG and human IgE antibodies. J Immunol 137:2841–2847.
  28. 28. Chapman MD, Heymann PW, Platts-Mills TA (1987) Epitope mapping of two major inhalant allergens, Der p I and Der f I, from mites of the genus Dermatophagoides. J Immunol 139:1479–1484.
  29. 29. Lind P, Hansen OC, Horn N (1988) The binding of mouse hybridoma and human IgE antibodies to the major fecal allergen, Der p I, of Dermatophagoides pteronyssinus. Relative binding site location and species specificity studied by solid-phase inhibition assays with radiolabeled antigen. J Immunol 140:4256–4262.
  30. 30. Chruszcz M, Pomés A, Glesner J, Vailes LD, Osinski T, et al. (2012) Molecular determinants for antibody binding on group 1 house dust mite allergens. J Biol Chem 287:7388–7398.
  31. 31. Hill MR (1998) Quantification of house-dust-mite populations. Allergy 53:18–23.
  32. 32. Arlian L, Woodford PJ, Bernstein IL, Gallagher JS (1983) Seasonal population structure of house dust mites, Dermatophagoides spp.(Acari: Pyroglyphidae). J Med Entomol 20:99–102.
  33. 33. Rozen S, Skaletsky HJ (1998) Primer3. Code available at http:\\\genome-software\other\primer3.html.
  34. 34. Dilworth RJ, Chua KY, Thomas WR (1991) Sequence analysis of cDNA coding for a major house dust mite allergen, Der f I. Clin Exp Allergy. 21:25–32.
  35. 35. Petersen TN, Brunak S, von Heijne G, Nielsen H (2011) SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods 8:785–786.
  36. 36. Zhang Y (2008) I-TASSER server for protein 3D structure prediction. BMC Bioinformatics 9:40.
  37. 37. Roy A, Yang J, Zhang Y (2012) COFACTOR: an accurate comparative algorithm for structure-based protein function annotation. Nucleic Acids Res 40:W471–477 doi:
  38. 38. Roy A, Kucukural A, Zhang Y (2010) I-TASSER: a unified platform for automated protein structure and function prediction. Nat Protoc 5:725–738.
  39. 39. Stamatakis A, Ludwig T, Meier H (2005) RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees. Bioinformatics 21:456–463.
  40. 40. Lanfear R, Calcott B, Ho SY, Guindon S (2012) Partitionfinder: combined selection of partitioning schemes and substitution models for phylogenetic analyses. Mol Biol Evol 29:1695–1701.
  41. 41. Bonifácio MJ, Archer M, Rodrigues ML, Matias PM, Learmonth DA, et al. (2002) Kinetics and crystal structure of catechol-O-methyltransferase complex with co-substrate and a novel inhibitor with potential therapeutic application. Mol Pharmacol 62:795–805.
  42. 42. Klimov PB, OConnor B (2013) Is Permanent Parasitism Reversible?–Critical Evidence from Early Evolution of House Dust Mites. Syst Biol 62:411–423.
  43. 43. Betts MJ, Russell RB (2007) Amino-acid properties and consequences of substitutions. In:M. R Barnes, editor Bioinformatics for Geneticists: A Bioinformatics Primer for the Analysis of Genetic Data. England: Johen Wiley & Sons Ltd.311–340.
  44. 44. Cui Y, Zhou P, Peng J, Peng M, Zhou Y, et al. (2008) Cloning, sequence analysis, and expression of cDNA coding for the major house dust mite allergen, Der f 1, in Escherichia coli. Braz J Med Biol Res 41:380–388.
  45. 45. Arlian LG (1991) House-dust-mite allergens: a review. Exp Appl Acarol 10:167–186.
  46. 46. Thomas WR, Smith WA, Hales BJ (2004) The allergenic specificities of the house dust mite. Chang Gung Med J 27:563–569.