Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Assessing the predictive ability of computational epitope prediction methods on Fel d 1 and other allergens

  • Hyeji Kwon ,

    Contributed equally to this work with: Hyeji Kwon, Soobon Ko

    Roles Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Writing – original draft, Writing – review & editing

    Affiliations Combinatorial Tumor Immunotherapy MRC, Chonnam National University Medical School, Hwasun-gun, Jeollanam-do, Republic of Korea, Cancer Genomic Research Institute, Immunology Laboratory, Seoul Song Do Colorectal Hospital, Seoul, Republic of Korea

  • Soobon Ko ,

    Contributed equally to this work with: Hyeji Kwon, Soobon Ko

    Roles Data curation, Investigation, Methodology, Software, Writing – original draft

    Affiliation Combinatorial Tumor Immunotherapy MRC, Chonnam National University Medical School, Hwasun-gun, Jeollanam-do, Republic of Korea

  • Kyungsoo Ha,

    Roles Conceptualization, Project administration, Validation, Writing – original draft

    Affiliation New Drug Development Center, Osong Medical Innovation Foundation, Osong, Republic of Korea

  • Jungjoon K. Lee ,

    Roles Conceptualization, Investigation, Project administration, Supervision, Writing – original draft, Writing – review & editing

    jjk.lee@nus.edu.sg (JKL); kalicuta@gmail.com (YC)

    Affiliations NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore, Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore, Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore

  • Yoonjoo Choi

    Roles Conceptualization, Funding acquisition, Investigation, Project administration, Resources, Visualization, Writing – original draft, Writing – review & editing

    jjk.lee@nus.edu.sg (JKL); kalicuta@gmail.com (YC)

    Affiliation Combinatorial Tumor Immunotherapy MRC, Chonnam National University Medical School, Hwasun-gun, Jeollanam-do, Republic of Korea

Abstract

While computational epitope prediction methods have found broad application, their use, specifically in allergy-related contexts, remains relatively less explored. This study benchmarks several publicly available epitope prediction tools, focusing on the allergenic IgE and T-cell epitopes of Fel d 1, an extensively studied allergen. Using a variety of tools accessible via the Immune Epitope Database (IEDB) and other resources, we evaluate their ability to identify the known linear IgE and T-cell epitopes of Fel d 1. Our results show a limited effectiveness for B-cell epitope prediction methods, with most performing only marginally better than random selection. We also explored the general predictive abilities on other allergens, and the results were largely random. When predicting T-cell epitopes, ProPred successfully identified all known Fel d 1 T-cell epitopes, whereas the IEDB approach missed two known epitopes and demonstrated a tendency to over-predict. However, when applied to a larger test set, both methods performed only slightly better than random selection. Our findings show the limitations of current computational epitope prediction methods in accurately identifying allergenic epitopes, emphasizing the need for methodological advancements in allergen research.

Introduction

The cat allergy is one of the most common pet allergies, affecting approximately 20% of the global population [1]. The severity of cat allergy can vary, ranging from mild symptoms like coughing, pruritus, and skin eruption to potentially life-threatening reactions such as anaphylaxis. The primary allergen responsible for the cat allergy is Fel d 1, present in the fur and dander of domestic cats. It is estimated that approximately 90% of cat allergies are caused by Fel d 1 [2].

Fel d 1 is primarily secreted by epithelial cells and the salivary glands of cats, and it remains on their haircoats during grooming. One notable characteristic of Fel d 1 is its high thermal stability, allowing it to persist on cat hair. It can associate with small airborne particles, spreading throughout the surroundings and leading to allergic reactions in susceptible individuals. In fact, a study reported that Fel d 1 was detected in 99.9% of households in the United States, highlighting its ubiquity and potential for exposure [2, 3].

Although the function of Fel d 1 remains largely unknown, its allergenicity has been extensively studied. When Fel d 1 is presented by antigen-presenting cells, such as dendritic cells, it triggers the production of IgE antibodies during sensitization. These IgE antibodies specifically target and bind to epitopes on Fel d 1, leading to the activation of immune cells and the subsequent release of inflammatory mediators. This immune response causes the clinical manifestation of allergic symptoms [4]. In addition, T cell epitopes recognized by helper T cells can indirectly induce IgE production through T cell-mediated reactions, further amplifying the allergic response [5]. Thus, the recognition of IgE bound to effector cells, such as basophils and mast cells, plays a crucial role in the allergic cascade [6].

Accurate identification and understanding of epitopes related to allergies are essential for comprehending allergic responses and developing diagnostic tools. Experimental identification of B-cell epitopes has traditionally been the "gold standard" for epitope mapping [7]. However, this approach is often costly, time-consuming, and requires specialized laboratory techniques. Alternatively, computational epitope prediction methods have emerged as practical tools for epitope identification [8]. These methods utilize algorithms and machine learning techniques to analyze the physicochemical properties of proteins and predict potential epitopes.

Early computational tools for B-cell epitope prediction relied on amino acid propensity scales, which characterized the physicochemical properties of B-cell epitopes such as hydropathy [9, 10], flexibility [11], and surface accessibility [12]. These scales provided a foundation for initial predictions but had limitations in accuracy and specificity. However, recent advancements in machine learning techniques have leveraged computational B-cell epitope prediction methods, leading to improved outcomes. Machine learning algorithms, including Support Vector Machine (SVM) [1315], Random Forest (RF) [16, 17], K-Nearest Neighbors (KNN) [18], and Manifold Adaptive Experimental Design (MAED) [19], have been developed and trained to differentiate known B-cell epitopes from non-epitopes [1417, 2026].

While there have been many computational methods for epitope prediction, their application in predicting IgE and T-cell epitopes, specifically in the context of allergenic proteins has been largely limited. IgE epitopes are of particular interest due to their direct involvement in allergic reactions. However, there have been only a few computational methods developed specifically for IgE epitope prediction [27, 28], and most of them are currently unavailable. Therefore, in this study, we aimed to evaluate the effectiveness of currently available general T- and B-cell computational epitope prediction tools and their capacity to accurately identify allergenic epitopes.

To accomplish this, we employed an assortment of epitope prediction tools accessible through the Immune Epitope Database (IEDB) [29] and other sources. Our objective was to perform a detailed assessment of these methods, focusing on their ability to identify Fel d 1 allergenic epitopes.

Materials and methods

Epitope prediction tools

To computationally predict Fel d 1 epitopes, we mainly used epitope prediction tools from the Immune Epitope Database (IEDB). The IEDB hosts a comprehensive suite of seven sequence-based B-cell epitope prediction tools (BepiPred-1.0 [17], BepiPred-2.0 [16], Chou and Fashman β-turn prediction [30, 31], Emini surface accessibility scale [12], Karplus and Schulz flexibility prediction scale [11], Kolaskar and Tongaonkar antigenicity scale [32], and Parker hydrophilicity scale [33]) along with two structure-based prediction methods (ElliPro [34] and DiscoTope [23]). For the structure-based methods, we utilized the dimeric structure of Fel d 1 (PDB ID: 1PUO). Among the structure-based tools, we specifically chose ElliPro, but excluded DiscoTope, as DiscoTope predicted all residues of Fel d 1 to be epitopes. In addition to the IEDB’s B-cell epitope prediction tools, we also used BepiPred-3.0 [35], a protein language model epitope predictor. We applied default cut-off values for each prediction method.

For the prediction of T-cell epitopes, we employed two pMHC-II binding prediction tools: ProPred [36] and the IEDB MHC-II binding prediction tool [37]. In ProPred, the input sequence is divided into overlapping 9-mer linear peptides. We set a 5% threshold for ProPred to characterize each nonamer as either a binder or a non-binder for 8 HLA types [38]. As for the IEDB prediction method, the query sequence was fragmented into overlapping 15-mer linear peptides, and binding prediction was performed for each peptide using the 27 HLA reference set [39]. In this study, we used the IEDB recommended 2.22 method, using the percentile rank as a binding indicator. Peptides with a rank below 10 were classified as binders, while those with a higher rank were considered non-binders. The epitope scores for each method were calculated as a linear sum of the number of binding events. To evaluate the known T-cell epitopes shorter than 15 residues using the IEDB prediction tool, extra residues at each terminus were added to make the peptides within a 15mer window (Table 1).

thumbnail
Table 1. T-cell epitopes of Fel d 1 allergen and their predictions using IEDB and ProPred methods.

As the default sequence length for the IEDB T-cell epitope prediction method is 15, we extended the input sequence when its length was shorter than 15 residues. "O" indicates that the method successfully identified the peptide as an epitope for a given allele, while "X" denotes a failure in prediction. Neither of the prediction methods included HLA-DRB1*14:01 in their allele sets and thus no prediction was possible for the specific peptide (ENALSLLDKIYTS).

https://doi.org/10.1371/journal.pone.0306254.t001

Allergen epitope database

To evaluate general predictive abilities of the B-cell epitope prediction methods, we performed benchmarking using the Allerbase database [40]. This database contains a total of 117 allergens and 1,134 linear IgE epitopes. We selected allergens with determined crystal structures from the set, resulting in a subset of 59 allergens with 306 IgE epitopes (S1 File). For the T-cell epitope prediction, we used a food allergen database [41], which includes 81 epitopes from 9 allergens (S1 File).

Metrics for epitope/nonepitope binary classification

To evaluate the binary classification of regions as epitope or nonepitope, we employed several statistical metrics: Matthews Correlation Coefficient (MCC), Precision (Positive/Negative Predictive values), Recall (Sensitivity/Specificity), and F1 Score.

MCC is a balanced measure that accounts for true and false positives and negatives, ideal for uneven class sizes. It ranges from -1 (total disagreement) to 1 (perfect prediction), with values above 0.5 indicating good predictive performance. Positive Predictive Value (PPV) reflects the accuracy of the model in identifying regions as epitopes, with higher values suggesting better precision. Negative Predictive Value (NPV) indicates the proportion of nonepitope regions correctly identified, with values closer to 1 denoting higher accuracy in predicting nonepitopes. Recall (Sensitivity and Specificity) measures the model’s ability to identify all relevant epitope (nonepitope for specificity) instances correctly, with a range from 0 to 1. The F1 Score is the harmonic mean of Precision (PPV or NPV) and Recall (Sensitivity or Specificity), useful for uneven class distributions, ranging from 0 (worst) to 1 (best).

Defining Fel d 1 epitopes

While it is known that some Fel d 1 epitopes are partially conformational, our focus in this study was on the linear epitopes of Fel d 1 [42]. This choice was made due to the general low consistency and accuracy of computational prediction methods for conformational epitopes [43].

Fel d 1 is a heterodimer composed of two polypeptide chains (Fig 1A), known as Chain 1 and Chain 2 (also referred to as α and β chains, respectively). Chain 1 consists of 70 amino acids and has an approximate molecular mass of 8 kDa. Chain 2 exhibits variation in its C-terminal region, resulting in three known isoforms [2, 44, 45]. The major linear IgE-binding epitopes of Fel d 1 are located in three regions: two in Chain 1 and one in Chain 2 (Chain 1: 25 VAQYKALPVVLENA 38, 46 DAKMTEEDKRNALS 59, Chain 2: 12 DVFFAVANGNELLL 25) [46]. T-cell epitopes are present in a total of six regions, including overlapping segments (Table 1) [47].

thumbnail
Fig 1. The structures and epitopes of Fel d 1.

(A) The heterodimeric structure of Fel d 1 (PDB ID: 1PUO). The three disulfide bonds are highlighted in a small sphere representation. Chain 1 is colored in red and Chain 2 is in skyblue. (B) The tetrameric structure of Fel d 1 (PDB ID: 2EJN). Two Fel d 1 heterodimers are non-covalently bonded between two Chain 2s. (C) Three IgE epitopes of Fel d 1. (D) The residues that are included in the six T-cell epitopes (see Table 1). Overall, there are three main T-cell epitope regions. Epitopes are represented in thicker ribbons in (C) and (D).

https://doi.org/10.1371/journal.pone.0306254.g001

Results

Structure analysis of Fel d 1 epitopes

Fel d 1 primarily adopts the alpha-helical heterodimeric structure (Fig 1A). The heterodimer is composed of two distinct chains (Chain 1 and 2) that are linked by three disulfide bonds. These heterodimers naturally assemble into tetramers via non-covalent interactions between Chain 2s, as shown in Fig 1B [4850]

Given the structure and formation of the tetramer, the IgE epitope located on Chain 2 might be less accessible for IgE binding when the protein is in its tetrameric structure (The solvent accessible surface area for the epitope is 1390.99 Å2 in dimer and 381.83 Å2 in tetramer). It is known that approximately 15% of Fel d 1 is tetrameric in house dust [51]. As the binding of IgE antibodies to their specific epitopes is a crucial step in the onset of allergic reactions, the non-covalent interactions between Chain 2s in the tetramer might render these epitopes less exposed, thereby decreasing the likelihood of IgE binding.

Prediction of IgE epitopes

To assess the IgE epitope prediction, we examined nine publicly available epitope prediction tools, which include web-based B-cell epitope prediction methods provided by the IEDB. Generally, most methods are better at predicting non-epitopes than epitopes for Fel d 1 (Fig 2 and Table 2). However, considering the imbalance between the number of epitopes and non-epitopes, with 42 out of 162 total residues being epitopes (28 out of 70 in Chain 1, and 14 out of 92 in Chain 2), the probability of correctly predicting epitopes by chance alone is significantly lower (0.26) compared to non-epitopes (0.74). The Matthews correlation coefficient (MCC) values suggest that most methods perform similarly to random predictions.

thumbnail
Fig 2. IgE epitope prediction using computational tools.

Horizontal red lines represent the threshold values used by each prediction method. Red shaded areas indicate regions predicted as epitopes. Blue shaded areas denote the actual epitope positions of Fel d 1. Most methods are essentially no different from random selection. See also Table 2 for other details.

https://doi.org/10.1371/journal.pone.0306254.g002

thumbnail
Table 2. Prediction results of computational B-cell epitope prediction tools for Fel d 1 IgE epitopes.

Overall, the majority of methods yield results that are indistinguishable from random selection. See also Fig 2 for other details.

https://doi.org/10.1371/journal.pone.0306254.t002

BepiPred-3.0 excels at identifying true non-epitopes (NPV 0.9) but also produces considerable false positives in epitope predictions and misses many true non-epitopes (Specificity 0.15). BepiPred-2.0 and ElliPro show a more balanced performance between precision and recall for Fel d 1, with ElliPro having a slight edge in F1 score.

In an effort to extend our analysis beyond Fel d 1, we assessed the prediction accuracy of the same nine epitope prediction tools on 59 different allergenic proteins. Out of a total of 12,046 residues, 3,521 were allergen epitopes, comprising approximately 29% of the total epitopes. Consistent with the results observed in the case of Fel d 1, the prediction tools generally exhibited better performance in predicting non-epitopes compared to epitopes (S1 Table). However, most tools exhibit MCC values close to 0, indicating no significant difference from random predictions. Our results indicate that the effectiveness of the current B-cell epitope prediction tools is largely limited, highlighting the need for continuous refinement.

Prediction of T-cell epitopes

T-cell immunity plays a pivotal role in managing allergic responses by dictating the magnitude and nature of the immune response to allergens. In particular, T-cell responses against allergens can have profound influence on the production of specific types of antibodies, such as IgE, which plays a direct role in allergic reactions.

In our initial investigations, we subjected the known Fel d 1 T-cell epitopes to ProPred and the IEDB prediction methods. Among these epitopes, the sequence 55 ENALSLLDKIYTS 67 in Chain 1, known for its binding to HLA-DRB1*14:01, was unable to be evaluated since the specific allele is not included in either of the methods. Notably, we found that the IEDB method was not able to correctly predict two of the known T-cell epitopes as binders for their respective alleles. Contrarily, ProPred correctly identified all the epitopes as binders for their associated alleles (Table 1).

To explore the general predictive abilities of ProPred and IEDB for T-cell epitopes across various allergens, we conducted the same analyses on a total of 81 T-cell epitopes from 9 allergens (S2 Table). Our benchmark reveals that the two predictors exhibit different tendencies in prediction at the cut-off values we used. While ProPred is better at predicting non-epitopes compared to the IEDB method (F1 score: 0.84 vs. 0.65), IEDB is slightly better than ProPred for epitope prediction (F1: 0.18 vs. 0.31). However, the MCC values show that both methods are only marginally better than random (MCC: 0.04 vs. 0.11).

These observations are consistent when predicting T-cell epitopes on Fel d 1. Applying both methods to full sequences of Chain 1 and 2, ProPred remains superior at predicting non-epitopes (F1: 0.87 vs. 0.6), although the MCC values for both are only weakly positive (0.27 vs. 0.15, see Fig 3 and Table 3).

thumbnail
Fig 3. Fel d 1 T-cell epitope maps and prediction results.

Black horizontal lines represent known T-cell epitope positions, while vertical red lines denote the predicted epitope scores of each peptide starting at these positions. Higher scores indicate a greater number of binding events. In general, ProPred is better at predicting non-epitopes, while both methods perform only marginally better than random selection. Additional details are provided in Table 3 and S1 File.

https://doi.org/10.1371/journal.pone.0306254.g003

thumbnail
Table 3. Prediction results of computational pMHC-II binding prediction tools for Fel d 1 epitopes.

ProPred demonstrates enhanced accuracy in predicting non-epitopes. However, according to the Matthews Correlation Coefficient (MCC) values, the overall prediction results are only slightly better than random. See also Fig 3 and S1 File for other details.

https://doi.org/10.1371/journal.pone.0306254.t003

It is important to note that these methods solely predict peptide-MHC binding events. High epitope binding scores do not necessarily indicate immunogenicity, as actual immune responses also require T-cell recognition. Consequently, the absence of pMHC binding typically means no immune response, while its presence does not guarantee an immune response. Therefore, although the prediction methods are nearly random or only marginally better than random, the high F1 score for non-epitope prediction by ProPred may be beneficial for reducing T-cell epitope content [52, 53].

Discussion

In our study, we assessed various computational methods for predicting Fel d 1 epitopes, demonstrating their usefulness and limitations in allergenic epitope identification. Though focused on Fel d 1, these tools can be utilized for other allergens, potentially informing allergen-specific therapy and diagnostics development.

While these computational methods show promise in other applications, they fall short in accurately identifying allergenic epitopes, typically predicting non-epitopes more effectively. Most B-cell epitope prediction methods are essentially no different from random selection, possibly due to their lack of specific design for IgE epitopes.

For T-cell epitope prediction, ProPred successfully identified all known T-cell epitopes for the associated alleles in our analysis. In contrast, the IEDB method failed to predict two known T-cell epitopes. While ProPred is generally better at predicting non-epitopes, overall predictive abilities of the two methods are only marginally better than random.

In conclusion, our study highlights the limitations of current computational methods in allergenic epitope prediction. Particularly for B-cell epitope prediction, the modest improvement in MCC values between the two BepiPred versions suggests that integrating advanced artificial intelligence techniques and 3D structure information could enhance the accuracy and specificity in the future. Expansive and comprehensive databases of experimentally validated allergen epitopes could therefore assist these tools in refining their algorithms, thereby improving the prediction of epitopes.

Supporting information

S1 Table. Prediction results of a computational B cell epitope prediction tool for known allergen IgE epitopes.

https://doi.org/10.1371/journal.pone.0306254.s001

(DOCX)

S2 Table. T-cell epitope prediction results for food allergen IgE epitopes using ProPred and IEDB prediction methods.

https://doi.org/10.1371/journal.pone.0306254.s002

(DOCX)

S1 File. Comprehensive dataset of IgE and T-cell epitope predictions for allergen research.

https://doi.org/10.1371/journal.pone.0306254.s003

(XLSX)

References

  1. 1. Sparkes AH. Human allergy to cats: A review for veterinarians on prevalence, causes, symptoms and control. Journal of Feline Medicine and Surgery. 2022;24(1):31–42. pmid:34622710
  2. 2. Bonnet B, Messaoudi K, Jacomet F, Michaud E, Fauquert J-L, Caillaud D, et al. An update on molecular cat allergens: Fel d 1 and what else? Chapter 1: Fel d 1, the major cat allergen. Allergy, Asthma & Clinical Immunology. 2018;14:1–9.
  3. 3. Arbes SJ Jr, Cohn RD, Yin M, Muilenberg ML, Friedman W, Zeldin DC. Dog allergen (Can f 1) and cat allergen (Fel d 1) in US homes: results from the National Survey of Lead and Allergens in Housing. Journal of allergy and clinical immunology. 2004;114(1):111–7. pmid:15241352
  4. 4. Orengo J, Radin A, Kamat V, Badithe A, Ben L, Bennett B, et al. Treating cat allergy with monoclonal IgG antibodies that bind allergen and prevent IgE engagement. Nature communications. 2018;9(1):1421. pmid:29650949
  5. 5. Shamji MH, Valenta R, Jardetzky T, Verhasselt V, Durham SR, Würtzen PA, et al. The role of allergen‐specific IgE, IgG and IgA in allergic disease. Allergy. 2021;76(12):3627–41. pmid:33999439
  6. 6. Stone KD, Prussin C, Metcalfe DD. IgE, mast cells, basophils, and eosinophils. Journal of Allergy and Clinical Immunology. 2010;125(2):S73–S80. pmid:20176269
  7. 7. Mayer M, Meyer B. Group epitope mapping by saturation transfer difference NMR to identify segments of a ligand in direct contact with a protein receptor. journal of the american chemical society. 2001;123(25):6108–17. pmid:11414845
  8. 8. Zheng D, Liang S, Zhang C. B-Cell Epitope Predictions Using Computational Methods. Computer-Aided Antibody Design: Springer; 2022. p. 239–54.
  9. 9. Hopp TP, Woods KR. Prediction of protein antigenic determinants from amino acid sequences. Proceedings of the National Academy of Sciences. 1981;78(6):3824–8. pmid:6167991
  10. 10. Kyte J, Doolittle RF. A simple method for displaying the hydropathic character of a protein. Journal of molecular biology. 1982;157(1):105–32. pmid:7108955
  11. 11. Karplus P, Schulz G. Prediction of chain flexibility in proteins: a tool for the selection of peptide antigens. Naturwissenschaften. 1985;72(4):212–3.
  12. 12. Emini EA, Hughes JV, Perlow D, Boger J. Induction of hepatitis A virus-neutralizing antibody by a virus-specific synthetic peptide. Journal of virology. 1985;55(3):836–9. pmid:2991600
  13. 13. Chen J, Liu H, Yang J, Chou K-C. Prediction of linear B-cell epitopes using amino acid pair antigenicity scale. Amino acids. 2007;33:423–8. pmid:17252308
  14. 14. Yao B, Zhang L, Liang S, Zhang C. SVMTriP: a method to predict antigenic epitopes using support vector machine to integrate tri-peptide similarity and propensity. 2012.
  15. 15. Singh H, Ansari HR, Raghava GP. Improved method for linear B-cell epitope prediction using antigen’s primary sequence. PloS one. 2013;8(5):e62216. pmid:23667458
  16. 16. Jespersen MC, Peters B, Nielsen M, Marcatili P. BepiPred-2.0: improving sequence-based B-cell epitope prediction using conformational epitopes. Nucleic acids research. 2017;45(W1):W24–W9. pmid:28472356
  17. 17. Larsen JEP, Lund O, Nielsen M. Improved method for predicting linear B-cell epitopes. Immunome research. 2006;2(1):1–7. pmid:16635264
  18. 18. Söllner J, Mayer B. Machine learning approaches for prediction of linear B‐cell epitopes on proteins. Journal of Molecular Recognition: An Interdisciplinary Journal. 2006;19(3):200–8. pmid:16598694
  19. 19. Yang H, Zhou Y, Cheng B, editors. Prediction of Linear B-cell Epitopes using Manifold Adaptive Experimental Design and Random Forest Algorithm. 2021 IEEE Conference on Telecommunications, Optics and Computer Science (TOCS); 2021: IEEE.
  20. 20. Ras-Carmona A, Pelaez-Prestel HF, Lafuente EM, Reche PA. BCEPS: A web server to predict linear B cell epitopes with enhanced immunogenicity and cross-reactivity. Cells. 2021;10(10):2744. pmid:34685724
  21. 21. Fukunaga K, Narendra PM. A branch and bound algorithm for computing k-nearest neighbors. IEEE transactions on computers. 1975;100(7):750–3.
  22. 22. Cai D, He X. Manifold adaptive experimental design for text categorization. IEEE Transactions on Knowledge and Data Engineering. 2011;24(4):707–19.
  23. 23. Kringelum JV, Lundegaard C, Lund O, Nielsen M. Reliable B cell epitope predictions: impacts of method development and improved benchmarking. PLoS computational biology. 2012;8(12):e1002829. pmid:23300419
  24. 24. Ansari HR, Raghava GP. Identification of conformational B-cell Epitopes in an antigen from its primary sequence. Immunome research. 2010;6(1):1–9.
  25. 25. Saha S, Raghava GPS. Prediction of continuous B‐cell epitopes in an antigen using recurrent neural network. Proteins: Structure, Function, and Bioinformatics. 2006;65(1):40–8.
  26. 26. Jespersen MC, Mahajan S, Peters B, Nielsen M, Marcatili P. Antibody specific B-cell epitope predictions: leveraging information from antibody-antigen protein complexes. Frontiers in immunology. 2019;10:298. pmid:30863406
  27. 27. Dall’Antonia F, Keller W. SPADE web service for prediction of allergen IgE epitopes. Nucleic Acids Research. 2019;47(W1):W496–W501. pmid:31066444
  28. 28. Sharma N, Patiyal S, Dhall A, Pande A, Arora C, Raghava GP. AlgPred 2.0: an improved method for predicting allergenic proteins and mapping of IgE epitopes. Briefings in Bioinformatics. 2021;22(4):bbaa294. pmid:33201237
  29. 29. Vita R, Mahajan S, Overton JA, Dhanda SK, Martini S, Cantrell JR, et al. The immune epitope database (IEDB): 2018 update. Nucleic acids research. 2019;47(D1):D339–D43. pmid:30357391
  30. 30. Garnier J. Prediction of the secondary structure of proteins from their amino-acid sequence. J Mol Biol. 1978;120:97–120.
  31. 31. Pellequer J-L, Westhof E, Van Regenmortel MH. Correlation between the location of antigenic sites and the prediction of turns in proteins. Immunology letters. 1993;36(1):83–99. pmid:7688347
  32. 32. Kolaskar AS, Tongaonkar PC. A semi-empirical method for prediction of antigenic determinants on protein antigens. FEBS letters. 1990;276(1–2):172–4. pmid:1702393
  33. 33. Parker J, Guo D, Hodges R. New hydrophilicity scale derived from high-performance liquid chromatography peptide retention data: correlation of predicted surface residues with antigenicity and X-ray-derived accessible sites. Biochemistry. 1986;25(19):5425–32. pmid:2430611
  34. 34. Ponomarenko J, Bui H-H, Li W, Fusseder N, Bourne PE, Sette A, et al. ElliPro: a new structure-based tool for the prediction of antibody epitopes. BMC bioinformatics. 2008;9:1–8.
  35. 35. Clifford JN, Høie MH, Deleuran S, Peters B, Nielsen M, Marcatili P. BepiPred‐3.0: Improved B‐cell epitope prediction using protein language models. Protein Science. 2022;31(12):e4497. pmid:36366745
  36. 36. Singh H, Raghava G. ProPred: prediction of HLA-DR binding sites. Bioinformatics. 2001;17(12):1236–7. pmid:11751237
  37. 37. Wang P, Sidney J, Kim Y, Sette A, Lund O, Nielsen M, et al. Peptide binding predictions for HLA DR, DP and DQ molecules. BMC bioinformatics. 2010;11:1–12.
  38. 38. Southwood S, Sidney J, Kondo A, del Guercio M-F, Appella E, Hoffman S, et al. Several common HLA-DR types share largely overlapping peptide binding repertoires. The Journal of Immunology. 1998;160(7):3363–73. pmid:9531296
  39. 39. Greenbaum J, Sidney J, Chung J, Brander C, Peters B, Sette A. Functional classification of class II human leukocyte antigen (HLA) molecules reveals seven different supertypes and a surprising degree of repertoire sharing across supertypes. Immunogenetics. 2011;63:325–35. pmid:21305276
  40. 40. Kadam K, Karbhal R, Jayaraman V, Sawant S, Kulkarni-Kale U. AllerBase: a comprehensive allergen knowledgebase. Database. 2017;2017:bax066. pmid:29220458
  41. 41. Lewis SA, Peters B. T-cell epitope discovery and single-cell technologies to advance food allergy research. Journal of Allergy and Clinical Immunology. 2023;151(1):15–20. pmid:36411114
  42. 42. Van’t Hof W, Van Milligen F, Van den Berg M, Lombardero M, Chapman M, Aalberse R. Epitope mapping of the cat (Felis domesticus) major allergen Fel d I by overlapping synthetic peptides and monoclonal antibodies against native and denatured Fel d I. Allergy. 1993;48(4):255–63. pmid:7687098
  43. 43. Greenbaum JA, Andersen PH, Blythe M, Bui HH, Cachau RE, Crowe J, et al. Towards a consensus on datasets and evaluation metrics for developing B‐cell epitope prediction tools. Journal of Molecular Recognition: An Interdisciplinary Journal. 2007;20(2):75–82.
  44. 44. Kaiser L, Velickovic TC, Badia-Martinez D, Adedoyin J, Thunberg S, Hallen D, et al. Structural characterization of the tetrameric form of the major cat allergen Fel d 1. Journal of molecular biology. 2007;370(4):714–27. pmid:17543334
  45. 45. Seppälä U, Hägglund P, Wurtzen PA, Ipsen H, Thorsted P, Lenhard T, et al. Molecular characterization of major cat allergen Fel d 1: expression of heterodimer by use of a baculovirus expression system. Journal of Biological Chemistry. 2005;280(5):3208–16. pmid:15546862
  46. 46. van Milligen FJ, van’t Hof W, van den Berg M, Aalberse RC. IgE epitopes on the cat (Felis domesticus) major allergen Fel d I: a study with overlapping synthetic peptides. Journal of allergy and clinical immunology. 1994;93(1):34–43. pmid:7508463
  47. 47. Kwok WW, Roti M, DeLong JH, Tan V, Wambre E, James EA, et al. Direct ex vivo analysis of allergen-specific CD4+ T cells. Journal of Allergy and Clinical Immunology. 2010;125(6):1407–9. e1. pmid:20513526
  48. 48. Duffort OA, Carreira J, Nitti G, Polo F, Lombardero M. Studies on the biochemical structure of the major cat allergen Felis domesticus I. Molecular immunology. 1991;28(4–5):301–9. pmid:1712068
  49. 49. Kroll Kristensen A, Schou G, Roepstorff P. Determination of isoforms, N-linked glycan structure and disulfide bond linkages of the major cat allergen Fel d1 by a mass spectrometric approach. Biological Chemistry. 1997;378(8):899–908. pmid:9377487
  50. 50. Kaiser L, Gronlund H, Sandalova T, Ljunggren H-G, van Hage-Hamsten M, Achour A, et al. The crystal structure of the major cat allergen Fel d 1, a member of the secretoglobin family. Journal of Biological Chemistry. 2003;278(39):37730–5. pmid:12851385
  51. 51. Bienboire-Frosini C, Lebrun R, Verloet D, Pageat P, Ronin C. 6 Variable Content of Fel d 1 Variants in House Dust and Cat Extracts May Have an Impact on Allergen Measurement. Journal of Investigational Allergology and Clinical Immunology. 2012;22(4):270.
  52. 52. Choi Y, Griswold KE, Bailey‐Kellogg C. Structure‐based redesign of proteins for minimal T‐cell epitope content. Journal of computational chemistry. 2013;34(10):879–91. pmid:23299435
  53. 53. Choi Y, Verma D, Griswold KE, Bailey-Kellogg C. EpiSweep: Computationally driven reengineering of therapeutic proteins to reduce immunogenicity while maintaining function. Computational Protein Design. 2017:375–98. pmid:27914063