Figures
Abstract
The inability of existing vaccines to cope with the mutation rate has highlighted the need for effective preventative strategies for COVID-19. Through the secretion of immunoglobulin A, mucosal delivery of vaccines can effectively stimulate mucosal immunity for better protection against SARS-CoV-2 infection. In this study, various immunoinformatic tools were used to design a multi-epitope oral vaccine against SARS-CoV-2 based on its receptor-binding domain (RBD) and heptad repeat (HR) domains. T and B lymphocyte epitopes were initially predicted from the RBD and HR domains of SARS-CoV-2, and potential antigenic, immunogenic, non-allergenic, and non-toxic epitopes were identified. Epitopes that are highly conserved and have no significant similarity to human proteome were selected. The epitopes were joined with appropriate linkers, and an adjuvant was added to enhance the vaccine efficacy. The vaccine 3D structure constructs were docked with toll-like receptor 4 (TLR-4) and TLR1-TLR2, and the binding affinity was calculated. The designed multi-epitope vaccine construct (MEVC) consisted of 33 antigenic T and B lymphocyte epitopes. The results of molecular dockings and free binding energies confirmed that the MEVC effectively binds to TLR molecules, and the complexes were stable. The results suggested that the designed MEVC is a potentially safe and effective oral vaccine against SARS-CoV-2. This in silico study presents a novel approach for creating an oral multi-epitope vaccine against the rapidly evolving SARS-CoV-2 variants. These findings offer valuable insights for developing an effective strategy to combat COVID-19. Further preclinical and clinical studies are required to confirm the efficacy of the MEVC vaccine.
Citation: Arshad NF, Nordin FJ, Foong LC, In LLA, Teo MYM (2024) Engineering receptor-binding domain and heptad repeat domains towards the development of multi-epitopes oral vaccines against SARS-CoV-2 variants. PLoS ONE 19(8): e0306111. https://doi.org/10.1371/journal.pone.0306111
Editor: Dhanusha Yesudhas, National Institute of Health, INDIA
Received: August 29, 2023; Accepted: June 11, 2024; Published: August 15, 2024
Copyright: © 2024 Arshad et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: This study was supported by the following funders: 1. MYM Teo FRGS/1/2022/STG01/UCSI/03/1 Malaysian Ministry of Higher Education (MOHE) https://www.mohe.gov.my/en 2. MYM Teo REIG-FAS-2024/001 UCSI University https://www.ucsiuniversity.edu.my/about-us/research These funders had a role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Coronavirus disease 2019 (COVID-19) is an infectious disease caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emerged in Wuhan, China in December 2019. The coronavirus was believed to be transmitted from bats to humans [1–4] which then evolved to adopt a human receptor as its entry mechanism and is spreading until today. The pandemic has resulted in more than 767 million cases and almost 6.9 million total deaths worldwide as of June 2023 [5]. The severity of the disease triggered many research efforts worldwide to quickly develop safe and effective vaccines to fight against the pandemic [6]. It has been shown that the distribution of the first COVID-19 vaccine for emergency use has had a positive impact on bending the curve of COVID-19 as cases and deaths declined remarkably [7, 8].
Although SARS-CoV-2 variants of concerns (VOCs) are no longer circulating and have been de-escalated from the list of the SARS-CoV-2 VOCs as of March 2023, there are still variants of interest (VOIs) emerging from the Omicron variant, XBB.1.5 and XBB.1.16, and it is expected that the SARS-CoV-2 virus will continue mutating and infecting humans [9]. Therefore, efforts in developing a robust and safe COVID-19 vaccines is still ongoing until today. Currently, eleven COVID-19 vaccines have been granted for emergency use by the WHO, with formulations mostly encoding and targeting the surface glycoprotein (S protein) of the SARS-CoV-2, including the use of inactivated whole viruses [10–12]. There were also other 39 COVID-19 vaccines that have been approved by at least one country, including the first intranasal COVID-19 vaccine developed by Bharat Biotech, which is also targeting the S protein of SARS-CoV-2 [11].
Since SARS-CoV-2 is primarily transmitted through human-to-human contact via liquid particles, with infection occurring in the upper respiratory tract [13–15], providing protection on the mucosal surface is one way to prevent viral infection and transmission. Mucosal delivery of vaccines can stimulate mucosal immunity by inducing the secretion of immunoglobulin A (sIgA), which is not effectively induced by vaccines delivered parenterally [15–17]. Currently approved COVID-19 vaccines, including most those under development, are administered via intramuscular injection, and are poorly effective in inducing mucosal immunity [15, 16]. Nevertheless, the newly approved intranasal COVID-19 vaccine, known as BBV154, has shown promising results and significant advantages over existing intramuscular COVID-19 vaccines [18]. It has been shown that the BBV154 vaccine, encoding the wild-type SARS-CoV-2 S protein, successfully stimulated higher production of mucosal antibodies than the intramuscularly injected COVID-19 vaccines [18, 19]. Therefore, many efforts have been devoted to developing more COVID-19 vaccines that can be administered via intranasal or oral routes.
This study focuses on designing a safe and robust multi-epitope oral vaccine against SARS-CoV-2 variants through a reverse vaccinology approach. The computational-based prediction of the potential antigenic epitopes and evaluation of the physicochemical properties of the designed vaccine constructs would help rule out possible unfavourable outcomes that could be caused by the vaccines [20]. Similar methodologies to those used to design previous multi-epitope were adapted in this study to predict highly conserved antigenic and immunogenic epitopes within the SARS-CoV-2 S protein that stimulate T and B-cell activations [21–23]. In silico cloning and molecular dockings were also used to evaluate the possibility of the designed vaccine constructs to be efficiently expressed in vitro and elicit specific immune responses without allergic or toxic reactions.
Methodology
Retrieval of SARS-CoV-2 S protein nucleotide sequences
The nucleotide sequences of the S protein for SARS-CoV-2 Wuhan-isolate (reference sequence) and the previous five variants of concerns (VOCs): Omicron (B.1.1.529), Alpha (B.1.1.7), Beta (B.1.351), Gamma (P.1), and Delta (B.1.617.2) were obtained in FASTA format from the NCBI database (https://ncbi.nlm.nih.gov). Data were collected between January 2021 to 2023 from each region available in the NCBI database–Asia, Africa, North America, South America, Europe and Oceania. Only complete nucleotide sequences with no ambiguous characters (unknown bases) were selected in our study. The nucleotide or protein sequences of the plasmid vector pNZ8149 and the defensin (adjuvant), were also retrieved in FASTA format from the MoBiTec (http://www.mobitec.com) and NCBI databases, respectively. The processed nucleotide acid sequences were aligned using Geneious Prime 2023.0.4 software using MUSCLE alignment, and translated using Expasy online server (https://web.expasy.org/translate/) to obtain the protein sequences for further analysis. The regions of interest from the S protein of SARS-CoV-2 for the design of the multi-epitope vaccine construct, were extracted based on the location of the domains obtained from the Conserved Domain Database (CDD) web server (https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi). Specifically, the RBD domain (RBD: 319–541), and the region from the HR1 to HR2 domain of the S2 subunit (HRs: 918–1203) were extracted.
Prediction of critical binding epitopes
Cytotoxic T-cell epitopes.
The epitopes identification step as described in Fig 1 includes the prediction of the potential cytotoxic T lymphocyte (CTL) epitopes using NetCTLpan v1.1 (https://services.healthtech.dtu.dk/services/NetCTLpan-1.1/) from the DTU server. This method is optimised and trained to predict CTL epitopes with high specificity, outperforming other CTL epitope predictors [24]. The tool predicts the 9-mer peptides that act as the MHC ligands interacting with the 27 most frequent HLA class I supertypes in the global population, including HLA-A*01:01, HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA- A*03:01, HLA-A*11:01, HLA-A*23:01, HLA-A*24:02, HLA-A*26:01, HLA- A*30:01, HLA-A*30:02, HLA-A*31:01, HLA-A*32:01, HLA-A*33:01, HLA- A*68:01, HLA-A*68:02, HLA-B*07:02, HLA-B*08:01, HLA-B*15:01, HLA- B*35:01, HLA-B*40:01, HLA-B*44:02, HLA-B*44:03, HLA-B*51:01, HLA- B*53:01, HLA-B*57:01, HLA-B*58:01 [25]. The threshold parameters were set to the default values. Predicted epitopes with high binding affinity to the selected HLA class supertype will have a score close to 1 in the output results [26].
The formation of vaccine construct in silico includes 1. Protein sequences retrieval and processing, 2. Epitopes identification, 3. Epitopes selection, 4. Structure prediction and molecular docking and 5. Codon optimisation. Tools and web servers used in the analyses are in bold. CTL: Cytotoxic T Lymphocytes; HTL: Helper T Lymphocytes; LBC: Linear B-cells; Ig: immunoglobulin; 2D: secondary structure; 3D: tertiary structure.
Helper T-cell epitopes.
Helper T-cells (Th) are important members involved in the adaptive immunity that recognise foreign antigens and help to stimulate the activation of other immune cells [27]. Therefore, the identification of antigenic epitopes that can be recognised by Th cells are important to stimulate the adaptive immunity. The helper T lymphocyte (HTL) epitopes were predicted using the NetMHCIIpan v4.1 from the DTU server (https://services.healthtech.dtu.dk/services/NetMHCIIpan-4.0/). The length of the predicted epitopes for MHC class II was set to a default of 15-mer and the threshold for identifying strong and weak binders were set to the values of 1 and 5, respectively. A set of 27 HLA class II supertypes (HLA-DRB1*01:01, HLA-DRB1*03:01, HLA- DRB1*04:01, HLA-DRB1*04:05, HLA-DRB1*07:01, HLA-DRB1*08:02, HLA- DRB1*09:01, HLA-DRB1*11:01, HLA-DRB1*12:01, HLA-DRB1*13:02, HLA- DRB1*15:01, HLA-DRB3*01:01, HLA-DRB3*02:02, HLA-DRB4*01:01, HLA- DRB5*01:01, HLA-DQA1*05:01/DQB1*02:01, HLA-DQA1*05:01/DQB1*03:01, HLA-DQA1*03:01/DQB1*03:02, HLA-DQA1*01:01/DQB1*05:01, HLA-DPA1*02:01/DPB1*01:01, HLA-DPA1*01:03/DPB1*04:01, HLA-DPA1*02:01/DPB1*05:01, HLA-DPA1*02:01/DPB1*14:01) that were most frequent in the global population were selected for the prediction of the HTL epitopes. The programme gives an output of the predicted peptides along with the complimentary alleles and predicted binding scores between the peptides and the alleles.
Linear B-cell epitopes.
B-cell epitopes are antigenic fragments that can be recognised by antibodies and are important components in triggering humoral immunity [28]. In this study, we used multi-method approach to predict the linear B-cell epitopes (LBC) from the amino acid sequences of the SARS-CoV-2 S protein. The prediction was performed using BepiPred v3.0 (https://services.healthtech.dtu.dk/services/BepiPred-3.0/), as well as IEDB Antibody Epitope Prediction tool (http://tools.iedb.org/bcell/) for Chou & Fasman Beta-Turn, Emini Surface Accessibility prediction, Karplus & Schulz Flexibility, Kolaskar & Tongaonkar, and Parker Hydrophilicity. The candidate LBC epitopes were shortlisted to those that were between 12-16-mer, antigenic, non-allergenic, non-toxic, and potentially capable to induce either IgG or IgA antibodies.
The Ig-class specific antibodies for the candidate LBC epitopes were predicted using the Variable Length Model of the IgPred tool (https://webs.iiitd.edu.in/raghava/igpred/index.html). The predicted LBC epitopes were evaluated for their ability to induce IgA, IgG and IgE antibodies with approximately 80% accuracy. The threshold for each antibody-specific class was set to 0.0 for a balanced confidence and sensitivity prediction of the B-cell epitopes specific antibodies. The LBC epitopes predicted to induce IgE antibody were removed to avoid triggering any allergic reactions.
The shortlisted LBC epitopes were validated against the IEDB database (https://iedb.org/home_v3.php) and the predicted LBC epitopes that have been reported and identified by positive B-cell assays were shortlisted.
Epitopes selection
Following epitopes identification step, the epitopes were filtered out based on their antigenicity, allergenicity, toxicity, immunogenicity, conservancy and auto-immune responses (Fig 1).
Antigenicity, allergenicity and toxicity analyses.
To test the antigenicity, allergenicity and toxicity of the shortlisted epitopes, VaxiJen v2.0 (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html), AllerCatPro v2.0 server (https://allercatpro.bii.a-star.edu.sg), and CSM-toxin (https://biosig.lab.uq.edu.au/csm_toxin/) web servers, were used. The antigenicity threshold for the epitopes was set to 0.4, with ‘Virus’ set as the target organism. Epitope and whole vaccine constructs with antigenicity value higher than 0.4 were recognised as probable antigens. The entire vaccine constructs formulated after the in silico study were also assessed using similar methods and tools.
Prediction of immunogenicity and IFN-γ inducing epitopes.
The immunogenicity score of each potential CTL epitope was calculated using the IEDB Class I Immunogenicity tool (http://tools.iedb.org/immunogenicity/), to filter only the epitopes that can stimulate cytotoxic lymphocytes responses for our vaccine design. Peptides with positive scores were selected for further analysis. The predicted HTL epitopes were evaluated using IFNepitope web server (http://crdd.osdd.net/raghava/ifnepitope/predict.php), and epitopes that were predicted to bind with IFN-γ, which is significant in inducing anti-viral activity, were selected [29].
Conservancy and population coverage analysis.
The percentage conservancy of the predicted epitopes in SARS-CoV-2 S protein was checked using the IEDB Epitope Conservancy Analysis web server (http://tools.iedb.org/conservancy/). To design a mutational-resistant multi-epitope vaccine against SARS-CoV-2, epitopes that were highly conserved in the SARS-CoV-2 variants where the epitopes were present in more than 90% of the protein sequences retrieved, were selected. Since the HLA genotype frequencies also vary across different human populations and will influence the binding affinities between the vaccine and HLA I and II molecules [30], population coverage analysis was performed on the multi-epitope vaccine construct (MEVC) using the IEDB population coverage calculation tool (http://tools.iedb.org/population/) [30] on the selected epitopes and their corresponding HLA alleles [21].
Sequence homology and comparison to human proteome.
The selected epitopes were compared for human proteome homologs using NCBI Protein BLAST (BLASTp) (https://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE=Proteins) to prevent stimulation of autoimmune response in the host. The analysis was restricted to proteins from H. sapiens (taxid:9606) with E-value threshold between 0.01 to 0.0001 [31]. Peptides that have non-significant similarity to human proteome were chosen and included in the vaccine construct.
In silico vaccine construction
To construct the multi-epitope vaccine construct (MEVC), the finalised epitopes from previous steps (Epitopes Selection) were used, and the correct orientation of the fusion domains were employed for efficient in vitro and in vivo expression. T and B-cell epitopes were arranged together in the MEVC, separated by commonly used linkers (Fig 2). The KK and GPGPG linkers were used to separate the HTL and B-cell epitopes, respectively. While AAY linker was used to separate CTL epitopes, as this linker can act as the cleavage site for the proteasome to process the CTL suitable for the TAP transporter [32].
Three proposed organisation (A, B and C) of the multi-epitope vaccine constructs, MEVC-A, MEVC-B and MEVC-C. The orientation of the epitopes made based on previous literatures [26, 33, 34] were tested for the efficacy of the vaccine in silico. The defensin adjuvant (pink) was joined to the epitopes by the EAAAK linker (light pink). CTL epitopes (green) were joined together with AAY linker (light green). The HTL epitopes (orange) were separated with GPGPG linker (light orange) while LBC epitope (blue) were joined with KK linkers (light blue). The epitopes were then arranged from the most conserved epitopes to the least conserved extracted from the receptor-binding domain (RBD) S1 subunit and S2 (HR) subunits. hBD3, human β-defensin 3 (adjuvant); CTL, Cytotoxic Lymphocytes; HTL, Helper T Lymphocytes; LBC, Linear B-cells [26, 33, 34].
The adjuvant, human β-defensin 3 (hBD3), was also included in the MEVC, separated by EAAAK linker from the epitopes at the N-terminal of the MEVC to enhance the efficacy of the vaccine [35–42]. To assist in the protein purification process in further in vitro study, a 6×His-tag was included at the C-terminal of the MEVCs linked with RVRR linker. A signal peptide comprised of 27 amino acids, Usp45, was also added upstream to the adjuvant to allow the secretion of the heterologous recombinant protein from the L. lactis.
Prediction of physicochemical properties of vaccine construct.
The Expasy ProtParam online server (https://web.expasy.org/protparam/) was used to analyse the physicochemical properties of the MEVCs, including molecular weight, theoretical isoelectric point (pI), amino acid distributions, amino acid composition, estimated half-life, instability index, aliphatic index, and the grand average of hydropathicity (GRAVY) [21, 26]. The theoretical pI determines the acidic or basic nature of the vaccine. The instability index of the protein that is lesser than 40 is considered stable [21] while the estimated aliphatic index, which represents the thermostable nature of the protein, is indicated as thermo-stable when the value is between 61.09 to 83.59 [43]. A negative value of the GRAVY represents the hydrophilic nature of the input subunit vaccine [43]. The MEVCs that were stable and had high estimated half-life were selected for further analysis.
The solubility of the vaccine constructs was predicted using Protein-sol web server (https://protein-sol.manchester.ac.uk/cgi-bin/solubility/sequenceprediction.php) [44], which compares the property of the amino acid composition of the input query amino acid sequences with an experimental dataset of E. coli soluble proteins [44]. A scaled solubility value of more than 0.45 was interpreted to have higher solubility than average soluble E. coli proteins [44, 45].
Secondary and 3D structure analyses.
The secondary structures of the MEVCs were predicted using SOPMA (https://npsa-prabi.ibcp.fr/cgi-bin/npsa_automat.pl?page=/NPSA/npsa_sopma.html) while the 3D structures of the MEVCs were modelled using RoseTTAFold modelling method in the Robetta web server. The RoseTTAfold method is an improved deep learning modelling method which can accurately model proteins with no known homologs and outperformed other modelling methods provided in the server. The modelling server generated five 3D structures of the input protein sequences based on the modelling method chosen by users.
The 3D structures of the individual epitope selected were also predicted using PEPFOLD3 web server (https://mobyle.rpbs.univ-paris-diderot.fr/cgi-bin/portal.py#forms::PEP-FOLD3), prior to subsequent molecular dockings and binding affinity analyses.
Refinement and validation of 3D structure.
The 3D structures generated by Robetta server were submitted to the ProSA-web server (https://prosa.services.came.sbg.ac.at/prosa.php) [46] and further validation was performed using PROCHECK web server (https://www.ebi.ac.uk/thornton-srv/software/PROCHECK/) [47]. The ProSA-web calculated a Z-score for the refined 3D structure to define the overall model quality. The model structure with the best quality was refined using GalaxyRefine web server, which output another five model structures that were refined, along with the information on their structural changes in terms of Rama-favoured region, GDT-HA, RMSD and MolProbity score. The GDT-HA score, a high accuracy version of the global distance test–total score (GDT-TS), indicates the structure similarity between the input and experimental structures [48, 49]. A higher GDT-HA score suggests improved model structures [48]. The RMSD score evaluates the changes in structural stability of the refined model structures. The MolProbity scores assess the overall quality of atoms in the input model structure, where lower score indicates better quality of the refined structure [49]. A good quality of model structure should have more than 90% residues within the Rama-favoured region [50]. The models from the GalaxyRefine were re-validated using ProSA and PROCHECK servers, and the model structure with the best quality score was used as our final MEVC structure for docking and in silico cloning [21].
Molecular docking with TLR4 and TLR1-TLR2 molecules.
Molecular dockings were performed between the MEVC with the TLR molecules using ClusPro 2.0 (https://cluspro.bu.edu/) and HADDOCK version 2.4 web servers (https://bianca.science.uu.nl/haddock2.4/) [51–54], as these docking tools have been highly cited in many publications [55–57] and were also ranked as the best tools for protein docking [58]. TLRs can stimulate the activation of antigen-presenting cells (APCs) and contribute to the effectiveness of the vaccine [59]. The protein-protein complex stability score were considered in this analysis. The TLR structure, including TLR4 homodimer (PDB:3FXI) and TLR1-TLR2 heterodimer (PDB:2Z7X), together with the predicted MEVC 3D structure were used as the input. The ClusPro server ranked the generated complexes based on energy score of the clusters [56]. The cluster with the most number of members and most negative energy scores was selected. The CTL and HTL epitopes were also docked with their corresponding HLA alleles and also with the TLR molecules. Owing to the blind docking feature available on the ClusPro web server, we performed the docking on our protein-protein complex without restricting any active sites or interface residues for the binding interaction between the complex due to lack of knowledge on the binding pockets for the MEVC.
To cross-verify the docking results from the ClusPro, we performed another molecular docking procedure on the vaccine construct with the TLR molecules using HADDOCK, and compare the outcomes. However, since HADDOCK is an information-driven docking web server, we were required to input the active residues of the proteins to perform the docking. Therefore, we utilised the information on the interacting residues obtained from previous literature reviews on the residues within the RBD that were identified to be the critical binding epitopes that are interacting with the ACE2 receptor, and the residues within the HR domains interacting with the neutralising antibodies of SARS-CoV-2 [60–62]. We employed the residues between 438–506 within the RBD of the SARS-CoV-2 which has been identified as the receptor binding domain (RBM) [62] and also residues between 1141–1161 from the S2 subunit identified as the spike-specific CD4+ T cell epitopes by Guo et al. [61] but only the residues that were predicted to have a relative solvent accessibility confidence higher than five by SABLE web server [63–66] were selected as the active residues for the molecular docking. The docked complexes with the lowest HADDOCK score and restraints violation energy were analysed as the low scores indicated that the docked protein-protein complex were the best complex model generated by HADDOCK.
Molecular dynamic (MD) simulations of the docked complexes.
Molecular dynamic simulations were conducted on the docked protein-protein complex TLR molecules and the MEVC 3D output generated from the ClusPro web server to assess the stability, flexibility and binding energy of the complexes [67, 68]. The simulations were performed for at least 100ns by employing the all-atom optimized potentials for liquid simulations (OPLS-AA) force field using the GROMACS 2023.4 software. The simulations of the docked complexes were done in a cubic box with a dimension of 1.0nm with spc216, which is a generic equilibrated 3-point water model [69, 70], The net charge on the protein in the simulation system was neutralised by adding a specific concentration of negative ions to perform the energy minimisation and subsequent dynamic simulations. The potential energy of the protein-protein complexes was accessed and the graph plot was visualised using the XMGRACE tool to determine if the energy minimisation was successful [69, 71]. To ensure that the solvent and ions added around the protein were well-equilibrated for the dynamic simulations, the equilibration step was conducted in two phases, NVT (constant Number of particle, Volume, and Temperature) and NPT (Pressure), where the system were equilibrated to achieve a stable experimental conditions for the simulation of the complexes. The target value for the temperature is 300K and for pressure was set to 1 bar. The Root Mean Square Deviation (RMSD) and Root Mean Square Fluctuation (RMSF) of the carbon atom of each chain were calculated from the simulations to predict the structural stability and flexibility of the complexes. The radius of gyration of the protein-protein complex was also assessed to measure the compactness of the complexes. The graphs of the analyses were also plotted and visualised using the XMGRACE plotting tool [71].
Protein-protein interaction analysis.
The binding interactions between the protein-protein complexes were assessed using PDBsum online web server (http://www.ebi.ac.uk/thornton-srv/databases/pdbsum/Generate.html) [72], with the PDB structure downloaded from ClusPro as the input. The interacting residues between the complexes were identified, and the binding interactions were visualised. The leucine-rich repeats (LRRs) motif of TLR4 molecule were retrieved from the Uniprot server, as this region is important in recognising pathogen-associated molecular patterns (PAMP) [73]. The activation of TLR4 requires a protein molecule known as myeloid differentiation-2 (MD-2), which interacts with the extracellular portion of the TLR4 molecule and promotes TLR4 dimerization [74]. Binding of the MEVC within the LRRs motif of TLR4 and MD-2 can potentially trigger the desired immune responses. In addition, the ligand binding domains of the heterodimer TLR1-TLR2 molecule were retrieved from previous study by Jin et al. [75]. It was reported that ligands binding to the residues of the TLR1 (258–339) and TLR2 (266–355) promote the heterodimerisation of TLR1-TLR2 and intracellular TIR domain for subsequent downstream signalling [75].
The binding affinity of the protein complexes were also assessed using PRODIGY web server (https://wenmr.science.uu.nl/prodigy/). The PDB structure of the protein-protein complex generated by ClusPro and HADDOCK were used as input. The binding affinity was calculated based on the interface binding contacts between the MEVC and TLR molecules combined with the properties of the non-interacting residues in the protein-protein complexes [54, 76, 77]. The protein-ligand method, an extension method provided in the PRODIGY web server to predict the binding affinity between small ligand and protein complex [78], was used to evaluate the binding interactions of the candidate epitopes with the HLA alleles [79, 80], where epitopes with stronger binding were selected for further analysis.
Codon optimisation.
The MEVCs’ protein sequences were reverse translated into DNA sequences using a reverse translation tool from the Sequence Manipulation Suite online server (https://www.bioinformatics.org/sms2/rev_trans.html), based on L. lactis subsp. Cremoris MG1363 codon usage table obtained from the Codon Usage Database (http://www.kazusa.or.jp/codon/). The resulting DNA sequences were further optimised to L. lactis subsp. Cremoris MG1363 genome using the OPTIMIZER web server (http://genomes.urv.es/OPTIMIZER/), to ensure efficient heterologous gene expression in vivo, as L. lactis NZ3900 is the derivative of the L. lactis subsp. cremoris MG1363 [81]. The GC content and Codon Adaptive Index (CAI) values of the DNA sequences were evaluated. GC content between 30% and 70% and a CAI value of more than 0.8 indicates an optimal codon usage with high expression of the gene is expected [26, 43, 69].
Results
Retrieval of protein sequences
A total of 78,182 complete nucleotide sequences of SARS-CoV-2 variants (VOCs)—Alpha (B.1.1.7), Beta (B.1.351), Delta (B.1.617.2), Gamma (P.1), Omicron (B.1.1.529)–including the Wuhan reference sequence (NC_045512.2) were extracted and filtered from the NCBI database. Multiple sequence alignment (MSA) of the unique nucleotide sequences of the SARS-CoV-2 virus was performed and trimmed between positions 21,000 to 26,000 to extract the region coding for the S protein of the virus. The nucleotide sequences were translated into amino acids to extract the receptor-binding domain (RBD) and the heptad repeats (HRs) region of the virus. A total of 669 and 718 amino acid sequences coding for the RBD and HRs, respectively, were used for epitope predictions.
Prediction and selection of the critical binding epitopes
T-lymphocyte epitopes.
The T-lymphocyte epitopes were divided into two types, Cytotoxic T-lymphocyte (CTL) and Helper T-Lymphocyte (HTL) epitopes, which induce two major cellular immune responses against the SARS-CoV-2 virus. A pool of 9-mer peptides was predicted for the CTL epitopes, resulting in 533 and 361 potential CTL epitopes in the targeted region of the S proteins, RBD and HRs, respectively. Similarly, a total of 1458 and 1245 potential 15-mer HTL epitopes were predicted within the S1 RBD and the S2 HRs region of SARS-CoV-2, respectively. After initial filtration, 267 out of 894 predicted CTL epitopes passed the first filtration process, while 1357 out of 1458 and 1114 out of 1245 predicted HTL epitopes in the RBD and HRs, respectively, passed the first filtration process. The initial filtration of the epitopes indicated that the CTL and HTL epitopes were predicted to be antigenic and immunogenic and can be potentially included in the formation of the vaccine constructs. The antigenicity and immunogenicity filtration stage showed that 627 CTL epitopes and 231 HTL epitopes that were predicted from the NetCTLpan v1.1, and NetMHCIIpan v4.1, respectively, may not have any potential to trigger cellular immune response mediated by the T lymphocytes, and were removed from our candidate lists.
The filtered CTL and HTL epitopes were also confirmed that they were non-allergenic and non-toxic after the second filtration stage. Then, the predicted T-cell epitopes were selected based on the complementary HLA alleles binding to each epitope. Epitopes with overlapping HLA binding alleles were removed. For the CTL epitopes, the 1060VVFLHVTYV1068, 1062FLHVTYVPA1070 and 1192NLNESLIDL1200 were removed as they were predicted to bind to similar HLA I alleles (HLA-A*02:03) as 1060VVFLHVTYV1068 (S1 Table). The latter epitope was kept as it binds to more different HLA I alleles (HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-A*68:02) compared to the other two epitopes. In contrast, the 1016AEIRASANL1024 and 1181KEIDRLNEV1189 epitopes were predicted to bind to exactly similar HLA I alleles (S1 Table). The 1181KEIDRLNEV118 epitope was kept as it has higher binding affinity towards the corresponding HLA I alleles compared to 1016AEIRASANL1024 (not shown). From the HTL epitopes, epitopes with a similar binding core but different alleles binding were selected as the candidate HTL epitopes (S2 Table).
Linear B-cell epitopes.
The linear B-cell (LBC) epitopes predicted using the multi-method approach resulted in a large pool of candidates. The LBC epitopes that matched the selection criteria which were 12-16-mer in length, antigenic, non-allergenic, and non-toxic, reduced the number of LBC candidates. Within the RBD region of the S protein, eight LBC epitopes were shortlisted, while there were 28 LBC epitopes predicted within the HR1-HR2 region (S3 Table). Of the 36 shortlisted LBC epitopes, 33 of them have the potential ability to induce either IgG or IgA antibodies predicted by the IgPred server at a threshold set to 0.0 (S3 Table).
The LBC candidates were then validated using the database of experimented B-cell epitopes from the IEDB resource. From the comparative analysis between the 36 shortlisted LBC epitopes and the existing linear B-cell epitopes from the IEDB database, the remaining three epitopes that were not predicted to have the ability to induce either IgA and IgG were consistent with the reported LBC from IEDB except 1067YVPAQEKNFTTA1078, where this epitope was experimentally tested to positively bind to IgM (IEDB_ID: 15908232) and IgG antibodies (IEDB_ID: 15908231). In addition, eight epitopes that were predicted to induce IgG response and four epitopes to induce IgA antibody by the IgPred tool were not experimentally validated (S3 Table). Therefore, another twelve LBC epitopes were removed from our LBC epitope list. The remaining 21 LBC epitope candidate which include eight candidates within the RBD, and thirteen epitopes within HR1-HR2, have the potential to be included in the formation of the vaccine construct.
Non-homology peptides to human proteome.
The BLASTp results showed that all the candidate T and B lymphocyte epitopes have no significant similarity to the human proteome and were expected to not trigger any autoimmune response in the host.
Highly conserved epitopes with broad population coverage.
The shortlisted CTL, HTL and LBC epitopes that matched exactly to more than 90% of the fraction of the SARS-CoV-2 S protein sequences retrieved, were selected, as the highly conserved epitopes. A total of nine CTL (Table 1), five HTL epitopes (Table 2), and nineteen LBC epitopes (S4 Table) met the criteria and were finalised for the formation of the multi-epitope vaccine construct (MEVC) in silico.
In the population coverage analysis, the CTL and HTL epitopes were able to cover around 91.61% and 99.83% of the global population, respectively. Since both CTL and HTL epitopes were included in the MEVC, we focused on the combined population coverage of the MHC class I and MHC class II restricted T-cell epitopes [30]. The average global population coverage by class I and class II combined for the final epitope candidates is 99.99% (Table 3).
The broad population coverage predicted for our MEVC ensured the effectiveness of the vaccine for majority of human populations. The population coverage analysis was also performed individually for the six regions where we initially retrieved the SARS-CoV-2 nucleotide sequences–Asia, Africa, North America, South America, Europe and Oceania. The combined population coverage for the six regions is more than 96%, and the maximum coverage (100%), was expected in the four regions i.e., North America, South America, Europe, and Oceania.
Validation on the finalised epitopes with IEDB database.
To validate the selected epitopes for the formation of the vaccine constructs, each of the CTL, HTL and LBC epitope were cross-checked with the experimental proven epitopes within the IEDB database. Out of the 13 finalised CTL epitopes that we have selected, only three of them, 388NDLCFTNVY396, 342FNATRFASV350 and 1113QIITTDNTF1121, were not experimentally validated with any MHC ligand binding assay or have not been reported to interact with any HLA class I alleles.
From the final candidates of HTL epitopes that were selected, two of eleven epitopes, 510VVVLSFELLHAPATV524 and 1059GVVFLHVTYVPAQEK1073, have not been experimentally validated and not reported in the IEDB database. The remaining nine HTL epitopes that were finalised were reported to either have the ability to induce the release of IFN-γ or were interacting with HLA class II alleles in MHC ligand binding assays.
Despite not being reported in the IEDB database and validated by any experimental assays, we still included the three CTL and two HTL epitopes in our vaccine construct formation as these epitopes have shown high potential ability to induce T lymphocyte responses from the in silico analyses.
Formation of multi-epitope vaccine construct in silico.
Three different arrangements of the MEVCs (Fig 2A–2C) based on previous literatures were evaluated for antigenicity, toxicity, allergenicity and immunogenicity. All three arrangements of the MEVCs were predicted to be probable antigens, and two MEVC arrangements with high antigenic score, MEVC-B (0.7081) and MEVC-C (0.7104), were selected for further analysis. The MEVCs were also predicted to not stimulate any allergic reactions and were non-toxic, indicating that they were safe to be consumed and have high potential to trigger immune responses.
Prediction of physicochemical properties and solubility of vaccine constructs.
Since only the arrangement of the epitopes of the MEVCs is different, the physicochemical properties, including the solubility score were similar for both MEVC-B (Fig 3) and MEVC-C. The MEVCs contained 558 amino acids with a molecular weight of 62.68 kDa, excluding the signal peptide as the signal peptide will be cleaved upon being released. A good antigenic protein should at least have a molecular weight of 40 kDa and up to 110 kDa in order to be immunogenic and targeted [82, 83]. The molecular weight of the MEVCs lies within the desired range for a target protein, making it an effective and highly antigenic target protein. The theoretical pI of the MEVCs is 9.73, indicating that our MEVC is basic. The instability index of the MEVCs is 26.79, which is considered stable (less than 40) [21], whereas the estimated value of the aliphatic index for the MEVCs is 68.80, indicating that the protein is thermostable [43]. The MEVCs also have a negative value of the GRAVY (-0.665) representing the hydrophilic nature of the subunit vaccine [43]. The estimated half-life of the MEVCs is calculated to be 30 hours in mammalian reticulocytes (in vitro), and in vivo, more than 20 hours in yeast, and more than 10 hours in E. coli. The physicochemical properties for the MEVC without the adjuvant was also checked for comparison. The MEVC without the adjuvant have similar properties except for its estimated half-life where the MEVC can only last for 1.1 hours in mammalian reticulocytes, 3 minutes in yeast and 2 min in E. coli. This indicated the present of the adjuvant is significant to ensure that the vaccine is stable and durable.
The signal peptide, Usp45, was added before the hBD3 adjuvant. The EAAAK linker was used to connect the adjuvant and the LBC epitopes which were arranged first in the construct, epitopes from the receptor-binding domain (RBD) region followed by heptad repeat (HRs) region. The LBC epitopes were separated by KK linker. The HTL epitopes were joined with the GPGPG linker while the CTL epitopes were joined with the AAY linker. The epitopes were arranged from RBD region followed by epitopes from HRs region. The 6×His-tag was added to the C-terminal linked with RVRR linker.
Overall, both our proposed arrangements of the MEVCs were considered to be stable in nature and have a long estimated half-life (more than 10 hours) [84]. The solubility of the MEVCs was calculated to be 0.475. A score of more than 0.450 indicates that the protein is highly soluble.
The physicochemical properties of the MEVCs predicted by the ProtParam server were compared with the experimental proven epitopes used for the construction of a novel antigenic multi-epitope vaccine against SARS-CoV-2 which has been immunologically tested from previous literature [85]. The physicochemical properties for the designed novel multi-epitope vaccine by Khairkhah et al. [85] was also predicted using ProtPram server. The antigenic multi-epitope vaccine had an average molecular weight and pI value of 25.26 kDa and 9.67, respectively. The instability and GRAVY indexes of the multi-epitope vaccine predicted with ProtParam server were 35.39 and -0.644, respectively [86], indicating that their vaccine construct is stable.
In addition, they also reported that the immunogenicity of the epitopes selected were also predicted using IgPred and IFNepitope web servers, and the epitopes that were predicted to had the ability to induce antibodies were included for the formation of their multi-epitope vaccine constructs [87]. Their vaccine constructs were predicted to have high solubility and stability to stimulate immunogenic reactions [87], similar to our designed MEVCs.
Further immunological study on the novel antigenic multi-epitope vaccine construct designed showed a significant stimulation of IgG, IgG2a, IFN-γ, TNF-α, IL-15, IL-21 and IL-6, and Granzyme B secretion in their animal studies with mice [85]. The similar characteristics predicted for the vaccine constructs by the ProtParam server supported the potential immunogenic responses that might be elicited by our MEVCs in further in vitro and animal studies.
Secondary and 3D structure of the MEVCs.
Of the 558 amino acids in the MEVC-B arrangement, 111 (19.89%) were alpha helices, 140 (25.09%) formed extended strands, and 307 (55.02% of the MEVC structure) were composed of random coils. In the MEVC-C arrangement, 118 amino acids (21.15%) were alpha helices, 133 of 558 amino acids (23.84%) were extended strands, and the remaining 307 (55.02%) formed random coils. The secondary structure that dominated both MEVC-B and MEVC-C was the formation of random coils, which is unfavourable for the folding of the protein.
Validation and refinement of model structures.
Therefore, each of the five model structure of MEVC-B and MEVC-C was subjected to validation with the ProSA and PROCHECK web servers, and refinement with GalaxyRefine tool. Model 2 of the MEVC-B arrangement has both a negative Z-score (-5.80) and lies within the blue dots plotted area (Fig 4A). The Ramachandran plot showed that 86.8% of residues of model 2 were in the most favourable region (Fig 5A and S5 Table) compared to other four models. The quality factor of more than 95 from ERRAT was determined based on the characteristic of atomic interaction verifying that the structure characteristic of atomic interaction [88]. However, none of our model structures passed the good quality score for the ERRAT and Ramachandran plots (S5 Table).
Before (panel A) and after (panel B) refinement with GalaxyRefine web server showed no significant difference in the Z-scores. The black dot in the blue shaded region represents our MEVC-B. The negative value of the Z-score indicates that the predicted 3D structure for the MEVC-B was reliable and stable as the predicted structure score value lies within the Z-scores of experimentally determined structures. Reprinted from ProSA server (https://prosa.services.came.sbg.ac.at/prosa.php) [46] under a CC BY license, with permission from PLOS ONE, original copyright 2023.
Plot from panel A showed the Ramachandran plot before refinement and panel B for the refined model structures generated with PROCHECK web server. The most favoured regions were labelled in white letters “A”, “B”, “L”. The black squares and triangles represent the residues that were within favourable regions. Red squares represent residues that were within the disallowed region of the Ramachandran plot. The residues were slightly more concentrated in the most favoured region and also had slightly more residues within the disallowed region after refinement compared to before refinement. A good quality model structure is expected to have more than 90% residues [47]. Reprinted from PROCHECK web server (https://www.ebi.ac.uk/thornton-srv/software/PROCHECK/) [47] under a CC BY license, with permission from PLOS ONE, original copyright 2023.
Model 2 of the MEVC-B with the highest residues within the most favourable region and least residues percentage in the disallowed region was further refined. The refinement generated another five refined models with improved Rama-favoured regions compared to the initially submitted model 2 (S5 Table). Refined model 4 have good Rama-favoured region (94.1), GDT-HA (0.9745), RMSD (0.370) and MolProbity (1.903) (S6 Table). Thus, the refined model 4 of MEVC-B was input into ProSA and PROCHECK servers for re-validation [77]
The Z-score of the refined model 4 of MEVC-B was a negative value (-5.79) (Table 4) and was within the range of Z-scores found for native proteins with similar sizes (Fig 4B), suggesting that the refined model structures predicted for MEVC-B was reliable [46, 89]. The Ramachandran plot generated from the PROCHECK server for the refined model structure showed that 91.1% of the residues were within the most favourable region, 6.1% were in the additional allowed regions, 1.2% were in the generously allowed regions, and the remaining 1.2% of the residues were exhibited in the disallowed regions (Fig 5B). The results indicated that the refined model structure of MEVC-B was the best model structure for the vaccine construct and was used for molecular dockings (Table 4).
Molecular docking and binding interactions with TLR molecules.
The ClusPro docking server generated 30 different clusters of members along with their interaction energies. The cluster with the highest number of members was ranked as the top cluster. The top five docked MEVCB-TLR4-MD-2 complex clusters had the lowest energy scores of -1228.4, -1203.8, -1173.1, -1207.1 and -1122.3, ranked from the first to the fifth cluster, respectively. The docking between the MEVC-B and TLR1-TLR2 complex generated the top five clusters with the lowest energy score of -1115.7, -1197.3, -1145.8, -1179.4 and -1032.0. The top cluster with the highest number of members and the most negative lowest energy score was analysed for its binding interactions [22, 23].
The binding affinity of the docked MEVCB-TLR4-MD-2 complex ranked first has the lowest free binding energy, -23.9 kcal mol-1, while for MEVCB-TLR1-TLR2, the free binding energy of the top ranked docked complex was -18.9 kcal mol-1. A more negative free binding energy, -28.5 kcal mol-1, was calculated for the fifth cluster of the MEVCB-TLR1-TLR2 complex, hereafter known as MEVCB-TLR1-TLR2-b, was also selected for further analysis. The negative binding affinity score showed that the binding interaction between the MEVC and the TLR molecules was favourable. This suggested that the MEVC design could effectively bind with the TLR molecules, which are involved in triggering appropriate immune responses against SARS-CoV-2.
The binding contacts between the MEVCB-TLR complexes were analysed using PDBsum and PRODIGY web servers to assess if the binding interactions were occurring within the TLR molecule binding domains for effective immune response to be triggered. The LRRs motif within the TLR4 molecule plays an important role in the recognition of PAMP. In the protein-protein interaction analysis with PDBsum server, 26 residues from chain A of the TLR4 molecule were interacting with 14 residues from the MEVC-B (Fig 6C), and from the chain B of the TLR4 molecule, 16 residues were interacting with 18 residues of the MEVC-B (Fig 6A). Most of the residues from TLR4 molecules that were involved in the binding with the MEVC-B were within the LRRs motif, and the MEVC-B was also interacting with one chain of the MD-2 molecule (not shown). This indicated that the MEVCB-TLR4-MD-2 complex could promote downstream signalling and stimulate immune responses.
Panel B showed the homodimer TLR4, labelled as Chain A (purple) and Chain B (magenta), the MD-2 structures coloured in orange and MEVC-B structure in cyan. Panel A and C showed the binding interactions between the docked complex. The MEVC-B structure interacted with both chains of TLR4 (panel A and C) but only one chain of the MD-2 in the docked complex (not shown). Reprinted from PDBsum Generate web server (https://www.ebi.ac.uk/thornton-srv/databases/pdbsum/Generate.html) [72] under a CC BY license, with permission from PLOS ONE, original copyright 2023.
The binding affinity between the complexes which was calculated by PRODIGY web server was based on the intermolecular contacts at the interface of the complexes within the threshold distance of 5.5 Å [54, 76, 77]. Therefore, we compared the intermolecular contacts from the PRODIGY and the interacting residues generated form PDBsum web server. In the MEVCB-TLR4-MD-2 complex, there were 42 residues from chain A of TLR4 molecule interacting with 28 residues from the MEVC-B where 26 of the 42 residues from chain A of TLR4 and 14 of 28 residues from MEVC-B were consistent with the interacting residues identified from PDBsum (S7 Table). Meanwhile, the PRODIGY server identified 29 residues in chain B of TLR4 molecule interacted with 24 residues of the MEVC-B where all the interacting residues identified from PDBsum were also consistent (S7 Table). The PRODIGY server also identified that only one chain of the MD-2 molecule was interacting with the MEVC-B as in the PDBsum (not shown).
The binding interaction analysis of the MEVCB-TLR1-TLR2 complex showed that the interacting residues of the MEVC-B molecule were only limited to the TLR1 molecule (Fig 7A and 7C). The interacting residues visualisation generated by the PDBsum server showed that 39 residues from TLR1 molecule were involved in the interaction with 23 residues of MEVC-B, but only 6 of them were the reported ligand-binding residues (Asn280, Ser303, Thr332, His305, Gln306 and Ser279) [75] (Fig 7B and S8 Table). For the MEVCB-TLR1-TLR2-b complex with the most negative binding energy score, 26 residues from TLR1 were interacting with 16 MEVC-B residues while 28 residues from TLR2 interacted with 25 residues from MEVC-B molecule (Fig 8). However, none of the identified interacting residues were the reported ligand-binding residues.
The docked complex showed that there is no interaction predicted between the MEVC-B and TLR2 chain in protein-protein complex (panel A). The binding interactions between TLR1 of the TLR1-TLR2 heterodimer (red) and MEVC-B (cyan) were visualised with Pymol and PDBsum server in Panel B. The interaction within the complex is only limited between the MEVC-B with TLR-1 which is shown in the top-view of the docked complex (panel C). Reprinted from PDBsum Generate web server (https://www.ebi.ac.uk/thornton-srv/databases/pdbsum/Generate.html) [72] under a CC BY license, with permission from PLOS ONE, original copyright 2023.
The docked complex showed that the interaction between the MEVC-B with TLR1 (panel C) and TLR2 chain (panel A). The binding interactions between the TLR1-TLR2 heterodimer (red and pink) and MEVC-B (cyan) were visualised with Pymol and PDBsum server in Panel B. The binding residues in the interaction were not included in the reported ligand-binding residues (panel B and C). Reprinted from PDBsum Generate web server (https://www.ebi.ac.uk/thornton-srv/databases/pdbsum/Generate.html) [72] under a CC BY license, with permission from PLOS ONE, original copyright 2023.
In comparison with the intermolecular contacts at the interface of the MEVCB-TLR1-TLR2 complex, 62 residues from TLR1 molecule that were interacting with 33 residues from MEVC-B identified by PRODIGY were consistent with the 39 residues from TLR1 molecule and the 23 residues from MEVC-B identified by PDBsum web server (S8 Table). The MEVCB-TLR1-TLR2-b showed that both TLR1 and TLR2 molecules were interacting with the MEVC-B. A total of 34 interacting residues from TLR1 molecule were interacting with 25 residues from MEVC-B, while 41 residues of TLR2 interacted with 43 residues from MEVC-B. For the MEVCB-TLR1-TLR2-b complex, all 26 and 28 interacting residues of TLR1 and TLR2, respectively, identified from PDBsum were consistent with residues from PRODIGY server. Similarly, for the interacting residues listed for MEVC-B molecule.
The binding interaction analysis from the PRODIGY web server indicated that the intermolecular interacting residues that were in contact within the complexes were coherent to the results output from the PDBsum server. The highly negative free binding energy calculated for the complexes generated from both docking servers indicated that the binding interaction between the MEVC-B with TLR4 homodimer and TLR1-TLR2 heterodimer molecules was highly likely to occur.
To validate the docking results from ClusPro, further docking was performed using HADDOCK web server. We used the surface interacting residues identified using PRODIGY web servers between the protein-protein complex as input to run the docking with HADDOCK. The HADDOCK server generated 10 different clusters of members along with their interaction energies. In this server, the cluster with the most negative HADDOCK was ranked the most top regardless of the number of members in the clusters.
The docked complex of MEVCB-TLR4-MD-2 generated from HADDOCK web server had a HADDOCK score of -96.2 +/- 10.4, -92.7 +/- 31.6, -92.5 +/- 8.6, -86.0 +/- 12.8 and -85.6 +/- 11.4, ranked from the first to the top five clusters, respectively (Table 6). The HADDOCK scores of the top five clusters generated from HADDOCK for MEVCB-TLR1-TLR2 were -131.2 +/- 16.7, -129.5 +/- 15.3, -122.8 +/- 9.2, -119.5 +/- 14.0 and -89.1 +/- 8.8, respectively (Table 6). The docked complexes with of the top five ranked clusters generated from HADDOCK were calculated for their free binding energies and compared with docked complex from ClusPro (Tables 5 and 6). From the free binding energy analyses using PRODIGY web server, the docked complexes generated from the blind docking using ClusPro web server had the lowest free binding energy indicating the highest interaction and strongest affinity between the vaccine constructs and both the homodimer TLR4 and heterodimer TLR1-TLR2 molecules. Thus, the docked complexes generated from the blind docking from ClusPro web server were used for further structural stability and flexibility analyses through molecular dynamic simulations.
For each of the individual epitope docked with the TLR molecules using ClusPro web server, epitope 1062FLHVTYVPAQEKNFT1076, one of the HTL epitopes included in the MEVC-B has the most negative free binding energy (-19.9 kcal mol-1) with TLR4 homodimer molecule (Table 7). From the docking with HADDOCK, the epitope also has a strong binding affinity with the TLR4 molecule where the free binding energy is -13.0 kcal mol-1 (S9 Table). Nevertheless, from the docked protein-protein complex, MEVCB-TLR4-MD-2, the 1062FLHVTYVPAQEKNFT1076 epitope was not seen to be interacting with any of the molecules from the homodimer TLR4-MD2 but the Gln1071 (Gln418 in Fig 6) and Lys1073 (Lys420 in Fig 6) residues were identified as two of the interacting residues with the TLR4 chain (Fig 6).
As for the binding with TLR1-TLR2 molecule, another HTL epitope from the 33 epitopes included in the MEVC-B, 507PYRVVVLSF515, has a low binding free energy (-16.2 kcal mol-1) with the TLR1 molecule from the TLR1-TLR2 heterodimer indicating a strong and favourable binding the two molecules. From the MEVCB-TLR1-TLR2 complex, the Tyr508, Val511, and Phe514 from the 507PYRVVVLSF515 HTL epitope, are the identified interacting residues involved in the binding with TLR1 (Tyr457, Val460 and Phe464 in Fig 7, respectively). The Gln1071 and Tyr508 residues have a potential to be identifies as the critical binding epitopes in the protein-protein complex as they formed hydrogen bonding with the residues from the TLR4 and TLR1 molecule, respectively.
Structural stability and flexibility of the docked protein-protein complexes.
The structural stability and flexibility of the MEVC-B and the TLR molecules were estimated and assessed through the molecular dynamic (MD) simulations using the GROMACS software. Before the dynamic simulations were run on both the protein-protein complexes, MEVCB-TLR4-MD-2 and MEVCB-TLR1-TLR2, an energy minimisation was conducted to ensure that the simulation system has no steric clashes or inappropriate geometry by evaluating the potential energy (kJ mol-1) and the maximum force of the system [70]. A potential energy less than 1000 kJ mol-1 indicates that the system is stable and ready for simulation. From the energy minimisation of 50,000 steps, the potential energy of the system for MEVCB-TLR4-MD-2 complex was calculated to be -2.0 e + 07 kJ mol-1, while for MEVCB-TLR1-TLR2, the minimum potential energy was -2.5 e + 07 kJ mol-1, indicating that the system was stable (Fig 9A). After the equilibration step, both protein-protein complexes showed that the pressure values were fluctuated from the reference value (1 bar) over the course of 100-ps equilibration phase while the temperature of the system reached the target value of 300K in the 100-ps equilibration phase (Fig 9B and 9C).
The potential energy of the system of both docked complexes were minimised prior to the molecular dynamic simulations and the plots showed that the minimum potential energy of both docked complexes were stable over the course of 100-ps simulation (panel A). The system in both protein-protein complex was equilibrated under two phases: pressure (NPT) and temperature (NVT) ensembles. The graph of the NPT and NVT equilibration phases for the protein complexes in panel B and C, respectively, showed that the system in both protein complexes were instantly stabilised and reached the desired pressure and temperature. The graphs were plotted using the XMGRACE plotting tool [71].
To assess the structural stability and flexibility of the complexes, trajectory analyses were performed after the 100ns simulations, and the RMSD and RMSF values of the carbon atoms of each chain in the complex were plotted in Fig 10. The plot for the RMSD values of the MEVCB-TLR4-MD-2 complex showed that the carbon atoms of the proteins in the complex were stabilised after 50ns at 3nm with mild fluctuations (Fig 10A). However, from the MD simulation on the MEVCB-TLR1-TLR2 complex, the RMSD values had high-peak fluctuations indicating that the proteins were not stabilised over the course of 100-ns. A longer simulation time might be needed to allow the protein-protein complex to reach a more stable phase (Fig 10B). Nevertheless, from the five different timepoints within the MD simulation of the complex, the MEVCB and TLR1-TLR2 molecule still remained in contact with each other and similarly to the MEVCB-TLR4-MD-2 complex, the number of residues from the MEVCB interacting with TLR1-TLR2 molecule increases with time.
The RMSD calculation of C-alpha atoms of each chain in the MEVCB-TLR4-MD-2 (Panel A) and MEVCB-TLR1-TLR2 (Panel B) complex. The RMSD value assessed the structural stability of the protein in the complexes over the 100ns MD simulations. The graphs were plotted using the XMGRACE plotting tool [71].
The RMSF graph for each protein in both complexes showed that the atoms in the MEVC-B are highly flexible with RMSF values between 0.3 to 3.0nm as compared to the molecules of the TLR4-MD-2 homodimer and TLR1-TLR2 heterodimer with lower RMSF value ranging from 0 to 0.5nm (Fig 11A and 11B). Since there are many loop regions in the MEVC-B structure (Fig 11D), many high peaks were seen in the RMSF graph of the MEVC-B in both complexes (Fig 11C).
The RMSF of proteins in the MEVCB-TLR4-MD-2 (Panel A) and MEVCB-TLR1-TLR2 (Panel B) to show the flexibility of the C-alpha atom of each residues in all the chains of the protein-protein complexes. The RMSF of the MEVC-B from both complexes were compared (Panel C) to identify the high peak regions in the plot which were caused by the loop regions in the MEVC-B structure (Panel D). The graphs were plotted using the XMGRACE plotting tool [71].
The free binding energies of the MEVCB-TLR4-MD-2 and MEVCB-TLR1-TLR2 at five timepoints - 20ns, 40ns, 60ns, 80ns and 100ns—were also assessed using PRODIGY web server and compared to each other along with the free binding energy of the original docked complex (0ns) in Table 8. The free binding energy calculated between the MEVC-B and the TLR molecules was only slightly different for each timepoint throughout 100ns MD simulations where at 100ns, the last timepoint of the simulation, the binding free energy was the highest for both complexes. Although for MEVCB-TLR4-MD-2 complex the 0ns (original docked complex) has higher free binding energy as compared to at 100ns, the number of interacting residues from the MEVC-B increases. For MEVCB-TLR1-TLR2 complex, free binding energy of the complex increased as the timepoint increased except for timepoint at 40 and 60ns. The number of residues from MEVC-B interacting with the TLR1-TLR2 heterodimer molecule in the complex increased as the timepoint increased and the highest number of interacting residues is at 100ns. The analysis from the trajectory files also did not show any proteins unfolding at any timepoint of the simulation (not shown) and it is expected that the interaction between the vaccine construct and the TLR molecules is favourable and is likely to occur.
To measure the compactness of the proteins in the complex, the radius of gyration (Rg) value for the protein-protein complex from the simulation was plotted in Fig 12. The plot for the MEVCB in MEVCB-TLR4-MD-2 complex showed that the designed protein did undergo large conformational changes at the start of the simulation (0-50ns) as can be seen from the high fluctuations of the graph. After 50ns simulation, the Rg value of the MEVC-B in the complex decreased to similar Rg value as the TLR4 chains at 3.3nm, with mild fluctuations indicating that the MEVC-B is tightly packed after 50ns of simulation and there might be only small conformational changes occurring in the proteins in the complexes (Fig 12A). However, the Rg plot of the MEVC-B in complex MEVCB-TR1-TLR2, showed high fluctuations indicating very large conformational changes compared to TLR1 and TLR2 (Fig 12B). Similarly, longer simulations may be needed for MEVCB-TLR1-TLR2 to allow the complex to reach a more stable and flexible phase as what was observed in MEVCB-TLR4-MD-2. Nevertheless, we can conclude that the MEVC-B may not fold easily and has high potential to be able to interact with the TLR4 molecule.
The compactness of the proteins in MEVCB-TLR4-MD-2 (Panel A) and MEVCB-TLR1-TLR2 (Panel B) complexes were calculated using GROMACS and visualised in Rg value. The graphs were plotted using the XMGRACE plotting tool [71].
Codon optimisation.
To maximise the expression of the MEVC-B in vitro, the MEVC was codon optimised for our chosen host, L. lactis. The MEVC-B sequence, which include the signal peptide, Usp45, and the 6×His-tag at the N- and C-terminal, respectively, was codon optimised for L. lactis subsp. cremoris MG1363. The DNA sequence of the MEVC was comprised of 1755 bp, and the codon adaptation index (CAI) calculated for the MEVC was 1.0 with 30.8% GC content. The results indicated that the MEVC can be efficiently expressed in L. lactis NZ3900, which is a derivative of the MG1363.
Discussion
Although the spread of the SARS-CoV-2 virus within the human population is no longer a global threat, its variants are still circulating and have not been completely eradicated [9]. While numerous COVID-19 vaccines have been developed and approved, this study explores an alternative approach for the development of longer lasting mucosal vaccines against the SARS-CoV-2 virus and its variants.
The S protein of SARS-CoV-2, particularly the RBD region, which recognises the host receptor ACE2 for its entry mechanism into host cells [90–92], and the HR1 and HR2 regions [93–98], which mediate the fusion of viral entry, are highly conserved among other SARS CoV-2 variants [25, 99–103] and have been extensively targeted for vaccines development [1, 21, 82, 84, 104–111]. Multi-epitope COVID-19 vaccines targeting these highly conserved regions could contribute to stronger and longer-lasting immune reactions, as they stimulate concurrent humoral and cellular responses [82]. Therefore, a multi-epitope-based oral vaccine targeting the highly conserved regions of the S protein, particularly the RBD and HR1-HR2 domains, would be more effective to target the existing and emerging variants.
In this study, we explored the S protein of the SARS-CoV-2 virus, in particular the RBD and the HR1-HR2 domains of the S protein, to develop a mutational resistant multi-epitope oral vaccine against SARS-CoV-2 variants in silico using various immunoinformatic tools. There were different potential multi-epitope vaccine constructs that have been designed using immunoinformatic approaches against diseases and pathogens such as Streptococcus agalactiae [112] and dengue [113], which have shown promising results. The identification and prediction of the crucial epitopes for the formulation of peptide-based vaccines have been greatly improved, owing to the availability of current advanced bioinformatics tools such as the NetCTLpan v1.1, NetMHCIIpan v4.1 and IEDB Analysis Resource. Vaccine constructs formulated with highly immunogenic and antigenic peptides with a reduced risk of developing unwanted reactions are important criteria for the successful development of safe and effective vaccines.
Our study discovered that only a small portion of the epitopes within the RBD region that met all of our selection criteria and were highly conserved compared to the epitopes predicted within the HR1-HR2 region of the S protein. A total of 9 CTLs, 5 HTLs and 19 LBC epitopes were finalised and selected for the formation of the MEVCs. The MEVCs consisted of the 33 T and B lymphocyte epitopes were joined by associated linkers, a signal peptide, Usp45, at the N-terminal of the MEVCs, to allow secretion of the heterologous protein by our selected Lactococcus host, the L. lactis, and a 6×His-tag at its C-terminal for ease of protein purification (Fig 3).
In the evaluation study of the three different arrangements of the epitopes in the MEVCs (MEVC-A, MEVC-B, MEVC-C), the physicochemical properties and structure analyses showed that the MEVC-B was a better vaccine construct candidate compared to MEVC-A and MEVC-C. MEVC-B was predicted to be a probable antigen, non-toxic, non-allergenic and highly soluble. MEVC-B also had a high antigenicity value (0.7081) and was categorised as a probable antigen. The solubility index for MEVC-B (0.475) was also above the average threshold for soluble protein (0.450), indicating that the heterologous protein expressed from the vaccine construct was sufficient to promote efficient folding and prevent protein aggregation [114].
A 45-mer adjuvant, the human β-defensin 3 (hBD3), was also added to our MEVCs as it was reported that β-defensins can enhance the efficacy of the vaccine, stimulate various immune responses including the enhancement of macrophage activity and induce production of pro-inflammatory cytokines, and can elicit anti-viral responses [35–37]. The addition of hBD3 as the adjuvant showed that the MEVCs had an estimated half-life of at least 30 times longer in both in vitro and in vivo. The designed MEVCs were extensively evaluated for their probable efficiency to induce immune responses through molecular docking with toll-like receptors (TLRs).
The molecular docking analyses showed that MEVC-B could possibly bind with the cell surface of the TLRs associated with COVID-19 infection, i.e., TLR2 and TLR4 [115, 116]. The binding interactions between the MEVC-B and both TLR molecules were favourable. The interaction between MEVC-B and TLR4 could potentially trigger TLR4 signalling networks, activating immune responses against viral infections [116]. The involvement of residues from the reported binding domains of the TLR4 in the interaction with the MEVC-B showed promising results for the stimulation of the innate immune reaction by the designed MEVC. The high negative value of the binding affinity for the docked complexes generated from both docking tools, ClusPro and HADDOCK indicated that the binding between MEVC-B and the TLR molecules is highly likely to occur. Therefore, the MEVC-B could serve as a potent ligand for both receptors to stimulate the desired immune responses [117]. The results of the in silico predictions of the epitopes and evaluations of the designed MEVC indicated that this vaccine candidate can serve as a potential therapeutic strategy for COVID-19 against SARS-CoV-2 variants.
The MD simulation analyses on MEVCB-TLR4-MD-2 complex has shown a promising potential interaction between the vaccine construct, MEVC-B with the TLR4 molecules. The RMSD, RMSF and Rg values which represent the structural stability, flexibility and compactness of the MEVC-B in the protein-protein complex over the 100ns simulation showed that the MEVC-B is stable to be able to interact with TLR4 molecule in the real system. Although the MEVC-B may have shown lower structural stability when in complex with TLR1-TLR2 molecule, we expect that with longer simulation time, the vaccine construct will be able to reach more stable phase and may also have potential to successfully interact with the TLR1-TLR2 molecule as no unfolding of the MEVC-B was observed at different timepoints of the simulation.
As with any computational-based approach, there are limitations to the in silico analysis used in this study to design the multi-epitope oral vaccine against SARS-CoV-2 virus. While the use of bioinformatics tools greatly improves the efficiency and accuracy of the vaccine design process, there are still challenges to overcome. One of the limitations of the existing bioinformatics software is the potential inaccuracies of the algorithms used [20], as well as the lack of sufficient information in databases and inherent bias of training datasets. These limitations could potentially compromise the accuracy of molecular screenings and early predictions for vaccine design. Therefore, further experiments such as using immune markers as surrogate [118] to validate the predicted results and improve the accuracy and efficiency of vaccine design is crucial.
To efficiently deliver the designed vaccine, a lactic acid bacteria (LAB), Lactococcus lactis (L. lactis), will be considered in further experimental study. The L. lactis which has been generally recognised as safe (SAFE) due its long-term use in food products such as cheese and buttermilk [119–121], have been reported as effective microbial cell factories and carriers for therapeutic drug delivery into the gastrointestinal tract (GIT) [120]. The aforementioned advantages of L. lactis making it a convenient choice as a potential carrier for our vaccine construct [122].
Conclusion
This immunoinformatic predictions approach for the design of multi-epitope vaccines can provide useful information for researchers to develop an effective and durable therapeutic strategy for diseases management. The quick production of DNA-based multi-epitope vaccine is advantageous for therapeutic prevention, especially in time-sensitive situations like the recent COVID-19 pandemic. The findings in our study suggest that the highly conserved regions of the S protein, particularly the RBD and HR1-HR2 domains, could be promising combined targets for the development of multi-epitope-based vaccines. The MEVC designed in this study could also serve as a potential candidate vaccine against multiple SARS-CoV-2 variants. This study paves the way for the development of an effective and durable therapeutic strategy against COVID-19.
Supporting information
S1 Table. Predicted CTL epitope candidate within receptor-binding domain and heptad repeat domains of the SARS-CoV-2 surface glycoprotein with their antigenicity value, conservancy percentage, binding core and respective binding alleles.
https://doi.org/10.1371/journal.pone.0306111.s001
(PDF)
S2 Table. Predicted HTL epitopes within receptor-binding domain and heptad repeat domains of the SARS-CoV-2 surface glycoprotein with their binding core, respective binding alleles, antigenicity value and conservancy percentage.
https://doi.org/10.1371/journal.pone.0306111.s002
(PDF)
S3 Table. Predicted LBC epitopes within receptor-binding domain and heptad repeat domains of the SARS-CoV-2 surface glycoprotein with their length, region, antigenicity value, conservancy percentage, predicted specific immunoglobulin isotypes and qualitative measurement on specific B-cell assays reported in IEDB resource database.
https://doi.org/10.1371/journal.pone.0306111.s003
(PDF)
S4 Table. Final LBC epitope candidates within receptor-binding domain and heptad repeat domains of the SARS-CoV-2 surface glycoprotein that were reported to have positive binding in B-cell assays in IEDB resource with their specific immunoglobulin isotype, binding core, antigenicity value and conservancy percentage.
https://doi.org/10.1371/journal.pone.0306111.s004
(PDF)
S5 Table. Validation of five models structures of MEVC-B and MEVC-C generated by Robetta web server using ProSA and PROCHECK.
https://doi.org/10.1371/journal.pone.0306111.s005
(PDF)
S6 Table. Model 2 of MEVC-B after refinement with GalaxyRefine.
https://doi.org/10.1371/journal.pone.0306111.s006
(PDF)
S7 Table. Interacting residues within MEVCB-TLR4 complex identified from PDBsum and PRODIGY web servers.
https://doi.org/10.1371/journal.pone.0306111.s007
(PDF)
S8 Table. Interacting residues within MEVCB-TLR1-TLR2 complex identified from PDBsum and PRODIGY web servers.
https://doi.org/10.1371/journal.pone.0306111.s008
(PDF)
S9 Table. Docking results of the individual epitopes and TLR molecules for the top 10 highest binding affinity with the lowest Z-score and HADDOCK score generated by HADDOCK web server.
https://doi.org/10.1371/journal.pone.0306111.s009
(PDF)
References
- 1. Behmard E, Soleymani B, Najafi A, Barzegari E. Immunoinformatic design of a COVID-19 subunit vaccine using entire structural immunogenic epitopes of SARS-CoV-2. Sci Rep [Internet]. 2020 Dec 1 [cited 2023 Jan 20];10(1). Available from: /pmc/articles/PMC7704662/ pmid:33257716
- 2. Lee WS, Wheatley AK, Kent SJ, DeKosky BJ. Antibody-dependent enhancement and SARS-CoV-2 vaccines and therapies. Nature Microbiology 2020 5:10 [Internet]. 2020 Sep 9 [cited 2023 Feb 28];5(10):1185–91. Available from: https://www.nature.com/articles/s41564-020-00789-5 pmid:32908214
- 3. Andersen KG, Rambaut A, Lipkin WI, Holmes EC, Garry RF. The proximal origin of SARS-CoV-2. Nature Medicine 2020 26:4 [Internet]. 2020 Mar 17 [cited 2023 Feb 28];26(4):450–2. Available from: https://www.nature.com/articles/s41591-020-0820-9 pmid:32284615
- 4. Hu B, Ge X, Wang LF, Shi Z. Bat origin of human coronaviruses Coronaviruses: Emerging and re-emerging pathogens in humans and animals Susanna Lau Positive-strand RNA viruses. Virol J [Internet]. 2015 Dec 22 [cited 2023 Feb 28];12(1):1–10. Available from: https://virologyj.biomedcentral.com/articles/10.1186/s12985-015-0422-1
- 5.
WHO Coronavirus (COVID-19) Dashboard | WHO Coronavirus (COVID-19) Dashboard With Vaccination Data [Internet]. [cited 2023 Apr 8]. Available from: https://covid19.who.int/
- 6. Hartenian E, Nandakumar D, Lari A, Ly M, Tucker JM, Glaunsinger BA. The molecular virology of coronaviruses. J Biol Chem [Internet]. 2020 Sep 9 [cited 2023 Mar 30];295(37):12910. Available from: /pmc/articles/PMC7489918/ pmid:32661197
- 7. Christie A, Henley SJ, Mattocks L, Fernando R, Lansky A, Ahmad FB, et al. Decreases in COVID-19 Cases, Emergency Department Visits, Hospital Admissions, and Deaths Among Older Adults Following the Introduction of COVID-19 Vaccine—United States, September 6, 2020–May 1, 2021. MMWR Morb Mortal Wkly Rep [Internet]. 2022 [cited 2022 Nov 16];70(23):858–64. Available from: https://www.cdc.gov/mmwr/volumes/70/wr/mm7023e2.htm
- 8. Ng KW, Faulkner N, Finsterbusch K, Wu M, Harvey R, Hussain S, et al. SARS-CoV-2 S2-targeted vaccination elicits broadly neutralizing antibodies. Sci Transl Med [Internet]. 2022 Jul 27 [cited 2022 Nov 16];14(655):eabn3715. Available from: https://www.science.org/doi/10.1126/scitranslmed.abn3715 pmid:35895836
- 9.
Tracking SARS-CoV-2 variants [Internet]. [cited 2023 Apr 8]. Available from: https://www.who.int/activities/tracking-SARS-CoV-2-variants
- 10. Kantarcioglu B, Iqbal O, Lewis J, Carter CA, Singh M, Lievano F, et al. An Update on the Status of Vaccine Development for SARS-CoV-2 Including Variants. Practical Considerations for COVID-19 Special Populations. Clinical and Applied Thrombosis/Hemostasis [Internet]. 2022 Feb 1 [cited 2023 Apr 8];28. Available from: /pmc/articles/PMC8851053/
- 11.
WHO–COVID19 Vaccine Tracker [Internet]. [cited 2023 Apr 8]. Available from: https://covid19.trackvaccines.org/agency/who/
- 12.
COVID-19 Vaccines | FDA [Internet]. [cited 2023 Feb 28]. Available from: https://www.fda.gov/emergency-preparedness-and-response/coronavirus-disease-2019-covid-19/covid-19-vaccines
- 13. Russell MW, Mestecky J. Mucosal immunity: The missing link in comprehending SARS-CoV-2 infection and transmission. Front Immunol [Internet]. 2022 Aug 17 [cited 2023 Apr 8];13. Available from: /pmc/articles/PMC9428579/ pmid:36059541
- 14. WHO. Transmission of SARS-CoV-2: implications for infection prevention precautions [Internet]. 2020 [cited 2023 Apr 8]. Available from: https://www.who.int/news-room/commentaries/detail/transmission-of-sars-cov-2-implications-for-infection-prevention-precautions
- 15. Miteva D, Peshevska-Sekulovska M, Snegarova V, Batselova H, Alexandrova R, Velikova T. Mucosal COVID-19 vaccines: Risks, benefits and control of the pandemic. World J Virol [Internet]. 2022 Sep 9 [cited 2023 Apr 8];11(5):221. Available from: /pmc/articles/PMC9523321/ pmid:36188733
- 16. Furuyama W, Shifflett K, Pinski AN, Griffin AJ, Feldmann F, Okumura A, et al. Rapid Protection from COVID-19 in Nonhuman Primates Vaccinated Intramuscularly but Not Intranasally with a Single Dose of a Vesicular Stomatitis Virus-Based Vaccine. mBio [Internet]. 2022 Feb 1 [cited 2023 Apr 8];13(1). Available from: https://coronavirus.jhu.edu/map.html pmid:35012339
- 17. Russell MW, Moldoveanu Z, Ogra PL, Mestecky J. Mucosal Immunity in COVID-19: A Neglected but Critical Aspect of SARS-CoV-2 Infection. Front Immunol. 2020 Nov 30;11:3221. pmid:33329607
- 18. Bose P. The effectiveness of intranasal COVID-19 vaccination in healthy adults. 2023 Feb 3 [cited 2023 Jul 22]; Available from: https://www.news-medical.net/news/20230203/The-effectiveness-of-intranasal-COVID-19-vaccination-in-healthy-adults.aspx
- 19. Singh C, Verma S, Reddy P, Diamond MS, Curiel DT, Patel C, et al. Immunogenicity and Tolerability of BBV154 (iNCOVACC®), an Intranasal SARS-CoV-2 Vaccine, Compared with Intramuscular Covaxin® in Healthy Adults: A Randomised, Open-Label, Phase 3 Clinical Trial. The Lancet [Internet]. [cited 2023 Jul 22]; Available from: https://papers.ssrn.com/abstract=4342771
- 20. Ng AWR, Tan PJ, Hoo WPY, Liew DS, Teo MYM, Siak PY, et al. In silico-guided sequence modifications of K-ras epitopes improve immunological outcome against G12V and G13D mutant KRAS antigens. PeerJ [Internet]. 2018 Jul 20 [cited 2023 Aug 16];2018(7):e5056. Available from: https://peerj.com/articles/5056 pmid:30042874
- 21. Ayyagari VS, Venkateswarulu TC, Abraham Peele K, Srirama K. Design of a multi-epitope-based vaccine targeting M-protein of SARS-CoV2: an immunoinformatics approach. J Biomol Struct Dyn [Internet]. 2020 [cited 2023 Apr 8];40(7):1. Available from: /pmc/articles/PMC7754933/ pmid:33252008
- 22. Shehzada A, Sumartono C, Nugraha J, Susilowatid H, Yasmin Wijayab A, Yasmin A, et al. Engineering of a Multi-Epitope Subunit Vaccine Against SASRS-CoV-2 Through the Viroinformatic Approach. Karbala International Journal of Modern Science [Internet]. 2022 [cited 2023 Aug 16];8(2). Available from: https://kijoms.uokerbala.edu.iq/homehttps://kijoms.uokerbala.edu.iq/home/vol8/iss2/14
- 23. Shehzad A, Sumartono C, Nugraha J, Susilowati H, Wijaya AY, Ahmad HI, et al. Development of a multi-epitope spike glycoprotein vaccine to combat SARS-CoV-2 using the bioinformatics approach [Desarrollo de una vacuna de glicoproteína spike multiepítopo para combatir el SARS-CoV-2 utilizando el enfoque bioinformático]. J Pharm Pharmacogn Res [Internet]. 2022 [cited 2023 Aug 16];10(3):445–58. Available from: https://jppres.com
- 24. Stranzl T, Larsen MV, Lundegaard C, Nielsen M. NetCTLpan: pan-specific MHC class I pathway epitope predictions. Immunogenetics [Internet]. 2010 Jun [cited 2023 Jan 16];62(6):357. Available from: /pmc/articles/PMC2875469/ pmid:20379710
- 25. Kibria KMK, Faruque MO, Islam MS bin, Ullah H, Mahmud S, Miah M, et al. A conserved subunit vaccine designed against SARS-CoV-2 variants showed evidence in neutralizing the virus. Appl Microbiol Biotechnol [Internet]. 2022 Jun 1 [cited 2022 Nov 16];106(11):4091. Available from: /pmc/articles/PMC9130996/ pmid:35612630
- 26. Martinelli DD. In silico vaccine design: A tutorial in immunoinformatics. Healthcare Analytics. 2022 Nov 1;2:100044.
- 27. Luckheeram RV, Zhou R, Verma AD, Xia B. CD4 +T cells: Differentiation and functions. Clin Dev Immunol. 2012;2012.
- 28. Ras-Carmona A, Lehmann AA, Lehmann P V., Reche PA. Prediction of B cell epitopes in proteins using a novel sequence similarity-based method. Scientific Reports 2022 12:1 [Internet]. 2022 Aug 12 [cited 2023 Apr 5];12(1):1–9. Available from: https://www.nature.com/articles/s41598-022-18021-1
- 29. Griffin DE. Cytokines and Chemokines. Encyclopedia of Virology. 2008 Jan 1;620–4.
- 30. Bui HH, Sidney J, Dinh K, Southwood S, Newman MJ, Sette A. Predicting population coverage of T-cell epitope-based diagnostics and vaccines. BMC Bioinformatics [Internet]. 2006 Mar 17 [cited 2023 Jan 16];7:153. Available from: /pmc/articles/PMC1513259/ pmid:16545123
- 31. Rizwan M, Naz A, Ahmad J, Naz K, Obaid A, Parveen T, et al. VacSol: a high throughput in silico pipeline to predict potential therapeutic targets in prokaryotic pathogens using subtractive reverse vaccinology. BMC Bioinformatics [Internet]. 2017 Feb 13 [cited 2023 Apr 12];18(1). Available from: /pmc/articles/PMC5307925/ pmid:28193166
- 32. De Oliveira LMF, Morale MG, Chaves AAM, Cavalher AM, Lopes AS, De Oliveira Diniz M, et al. Design, Immune Responses and Anti-Tumor Potential of an HPV16 E6E7 Multi-Epitope Vaccine. PLoS One [Internet]. 2015 Sep 21 [cited 2023 Apr 5];10(9). Available from: /pmc/articles/PMC4577214/
- 33. Sami SA, Marma KKS, Mahmud S, Khan MAN, Albogami S, El-Shehawi AM, et al. Designing of a Multi-epitope Vaccine against the Structural Proteins of Marburg Virus Exploiting the Immunoinformatics Approach. ACS Omega [Internet]. 2021 Nov 30 [cited 2023 Jan 26];6(47):32043–71. Available from: https://pubs.acs.org/doi/full/10.1021/acsomega.1c04817 pmid:34870027
- 34. Singh H, Jakhar R, Sehrawat N. Designing spike protein (S-Protein) based multi-epitope peptide vaccine against SARS COVID-19 by immunoinformatics. Heliyon. 2020 Nov 1;6(11):e05528. pmid:33225084
- 35. Kohlgraf KG, Pingel LC, Dietrich DE, Brogden KA. Defensins as anti-inflammatory compounds and mucosal adjuvants. Future Microbiol [Internet]. 2009 Dec 18 [cited 2023 Apr 5];5(1):99–113. Available from: https://www.futuremedicine.com/doi/10.2217/fmb.09.104
- 36. Laneri S, Brancaccio M, Mennitti C, De Biasi MG, Pero ME, Pisanelli G, et al. Antimicrobial Peptides and Physical Activity: A Great Hope against COVID 19. Microorganisms 2021, Vol 9, Page 1415 [Internet]. 2021 Jun 30 [cited 2023 Jan 19];9(7):1415. Available from: https://www.mdpi.com/2076-2607/9/7/1415/htm pmid:34209064
- 37. Tarrahimofrad H, Rahimnahal S, Zamani J, Jahangirian E, Aminzadeh S. Designing a multi-epitope vaccine to provoke the robust immune response against influenza A H7N9. Scientific Reports 2021 11:1 [Internet]. 2021 Dec 29 [cited 2023 Apr 5];11(1):1–22. Available from: https://www.nature.com/articles/s41598-021-03932-2
- 38. Shamriz S, Ofoghi H, Moazami N. Effect of linker length and residues on the structure and stability of a fusion protein with malaria vaccine application. Comput Biol Med. 2016 Sep 1;76:24–9. pmid:27393958
- 39. Yang Y, Sun W, Guo J, Zhao G, Sun S, Yu H, et al. In silico design of a DNA-based HIV-1 multi-epitope vaccine for Chinese populations. Hum Vaccin Immunother [Internet]. 2015 [cited 2023 Apr 5];11(3):795. Available from: /pmc/articles/PMC4514284/
- 40. Reddy Chichili VP, Kumar V, Sivaraman J. Linkers in the structural biology of protein–protein interactions. Protein Sci [Internet]. 2013 Feb [cited 2023 Apr 5];22(2):153. Available from: /pmc/articles/PMC3588912/ pmid:23225024
- 41. Funderburg N, Lederman MM, Feng Z, Drage MG, Jadlowsky J, Harding C V., et al. Human β-defensin-3 activates professional antigen-presenting cells via Toll-like receptors 1 and 2. Proc Natl Acad Sci U S A [Internet]. 2007 Nov 11 [cited 2023 Apr 5];104(47):18631. Available from: /pmc/articles/PMC2141828/ pmid:18006661
- 42. Kim J, Yang YL, Jeong Y, Jang YS. Conjugation of Human β-Defensin 2 to Spike Protein Receptor-Binding Domain Induces Antigen-Specific Protective Immunity against Middle East Respiratory Syndrome Coronavirus Infection in Human Dipeptidyl Peptidase 4 Transgenic Mice. Vaccines (Basel) [Internet]. 2020 Dec 1 [cited 2023 Jan 19];8(4):1–15. Available from: /pmc/articles/PMC7712746/
- 43. Ali M, Pandey RK, Khatoon N, Narula A, Mishra A, Prajapati VK. Exploring dengue genome to construct a multi-epitope based subunit vaccine by utilizing immunoinformatics approach to battle against dengue infection. Scientific Reports 2017 7:1 [Internet]. 2017 Aug 23 [cited 2022 Nov 23];7(1):1–13. Available from: https://www.nature.com/articles/s41598-017-09199-w
- 44. Hebditch M, Carballo-Amador MA, Charonis S, Curtis R, Warwicker J. Protein–Sol: a web tool for predicting protein solubility from sequence. Bioinformatics [Internet]. 2017 Oct 1 [cited 2023 Jul 24];33(19):3098–100. Available from: pmid:28575391
- 45. Niwa T, Ying BW, Saito K, Jin W, Takada S, Ueda T, et al. Bimodal protein solubility distribution revealed by an aggregation analysis of the entire ensemble of Escherichia coli proteins. Proc Natl Acad Sci U S A [Internet]. 2009 Mar 17 [cited 2023 Jul 24];106(11):4201–6. Available from: https://www.pnas.org/doi/abs/10.1073/pnas.0811922106 pmid:19251648
- 46. Wiederstein M, Sippl MJ. ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins. Nucleic Acids Res [Internet]. 2007 Jul 1 [cited 2023 Jul 5];35(suppl_2):W407–10. Available from: pmid:17517781
- 47. Laskowski RA, MacArthur MW, Moss DS, Thornton JM. PROCHECK: a program to check the stereochemical quality of protein structures. J Appl Crystallogr [Internet]. 1993 Apr 1 [cited 2023 Jul 5];26(2):283–91. Available from: https://onlinelibrary.wiley.com/doi/full/10.1107/S0021889892009944
- 48. Wu T, Guo Z, Cheng J. Atomic protein structure refinement using all-atom graph representations and SE(3)-equivariant graph transformer. Cowen L, editor. Bioinformatics [Internet]. 2023 May 4 [cited 2023 Jul 6];39(5). Available from: pmid:37144951
- 49. Zemla A. LGA: a method for finding 3D similarities in protein structures. Nucleic Acids Res [Internet]. 2003 Jul 1 [cited 2023 Jul 6];31(13):3370–4. Available from: pmid:12824330
- 50. Agnihotry S, Pathak RK, Singh DB, Tiwari A, Hussain I. Protein structure prediction. Bioinformatics: Methods and Applications. 2022 Jan 1;177–88.
- 51. Pandey RK, Ojha R, Aathmanathan VS, Krishnan M, Prajapati VK. Immunoinformatics approaches to design a novel multi-epitope subunit vaccine against HIV infection. Vaccine. 2018 Apr 19;36(17):2262–72. pmid:29571972
- 52. Porter KA, Xia B, Beglov D, Bohnuud T, Alam N, Schueler-Furman O, et al. ClusPro PeptiDock: efficient global docking of peptide recognition motifs using FFT. Bioinformatics [Internet]. [cited 2023 Jul 24]; Available from: https://peptidock.cluspro.org/nousername.php. pmid:28430871
- 53. Yan Y, Tao H, He J, Huang SY. The HDOCK server for integrated protein–protein docking. Nat Protoc. 2020 May 1;15(5):1829–52. pmid:32269383
- 54. Honorato R V., Koukos PI, Jiménez-García , Tsaregorodtsev A, Verlato M, Giachetti A, et al. Structural Biology in the Clouds: The WeNMR-EOSC Ecosystem. Front Mol Biosci. 2021 Jul 28;8:729513. pmid:34395534
- 55. Van Zundert GCP, Rodrigues JPGLM, Trellet M, Schmitz C, Kastritis PL, Karaca E, et al. The HADDOCK2.2 Web Server: User-Friendly Integrative Modeling of Biomolecular Complexes. J Mol Biol. 2016 Feb 22;428(4):720–5. pmid:26410586
- 56. Kozakov D, Hall DR, Xia B, Porter KA, Padhorny D, Yueh C, et al. The ClusPro web server for protein-protein docking. Nat Protoc [Internet]. [cited 2023 Jul 7]; Available from: https://cluspro.org
- 57. Rajendaran S, Jothi A, Anbazhagan V. Targeting the glycan of receptor binding domain with jacalin as a novel approach to develop a treatment against COVID-19. R Soc Open Sci [Internet]. 2020 Sep 1 [cited 2023 Nov 17];7(9). Available from: https://pubmed.ncbi.nlm.nih.gov/33047045/ pmid:33047045
- 58. Desta IT, Porter KA, Xia B, Kozakov D, Vajda S. Performance and Its Limits in Rigid Body Protein-Protein Docking. Structure [Internet]. 2020 Sep 1 [cited 2023 Nov 17];28(9):1071–1081.e3. Available from: http://www.cell.com/article/S0969212620302094/fulltext pmid:32649857
- 59. Rottembourg D, Filippi CM, Bresson D, Ehrhardt K, Estes EA, Oldham JE, et al. Essential Role for TLR9 in Prime but Not Prime-Boost Plasmid DNA Vaccination To Activate Dendritic Cells and Protect from Lethal Viral Infection. J Immunol [Internet]. 2010 Jun 6 [cited 2023 Apr 5];184(12):7100. Available from: /pmc/articles/PMC3128449/ pmid:20483769
- 60. Samad A, Ahammad F, Nain Z, Alam R, Imon RR, Hasan M, et al. Designing a multi-epitope vaccine against SARS-CoV-2: an immunoinformatics approach. J Biomol Struct Dyn. 2020;1–17. pmid:32677533
- 61. Guo L, Lin S, Chen Z, Cao Y, He B, Lu G. Targetable elements in SARS-CoV-2 S2 subunit for the design of pan-coronavirus fusion inhibitors and vaccines. Signal Transduction and Targeted Therapy 2023 8:1 [Internet]. 2023 May 10 [cited 2024 Apr 25];8(1):1–22. Available from: https://www.nature.com/articles/s41392-023-01472-x
- 62. Lan J, Ge J, Yu J, Shan S, Zhou H, Fan S, et al. Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor. Nature 2020 581:7807 [Internet]. 2020 Mar 30 [cited 2024 Apr 25];581(7807):215–20. Available from: https://www.nature.com/articles/s41586-020-2180-5 pmid:32225176
- 63. Wagner M, Adamczak R, Porollo A, Meller J. Linear Regression Models for Solvent Accessibility Prediction in Proteins. https://home.liebertpub.com/cmb [Internet]. 2005 Apr 21 [cited 2024 Apr 25];12(3):355–69. Available from: https://www.liebertpub.com/doi/ pmid:15857247
- 64. Adamczak R, Porollo A, Meller J. Accurate prediction of solvent accessibility using neural networks–based regression. Proteins: Structure, Function, and Bioinformatics [Internet]. 2004 Sep 1 [cited 2024 Apr 25];56(4):753–67. Available from: https://onlinelibrary.wiley.com/doi/full/10.1002/prot.20176
- 65. Adamczak R, Porollo A, Meller J. Combining prediction of secondary structure and solvent accessibility in proteins. Proteins: Structure, Function, and Bioinformatics [Internet]. 2005 May 15 [cited 2024 Apr 25];59(3):467–75. Available from: https://onlinelibrary.wiley.com/doi/full/10.1002/prot.20441
- 66. Porollo A, Adamczak R, Wagner M, Meller J. Maximum Feasibility Approach for Consensus Classifiers: Applications to Protein Structure Prediction.
- 67. Oyewusi HA, Wahab RA, Akinyede KA, Albadrani GM, Al-Ghadi MQ, Abdel-Daim MM, et al. Bioinformatics analysis and molecular dynamics simulations of azoreductases (AzrBmH2) from Bacillus megaterium H2 for the decolorization of commercial dyes. Environ Sci Eur [Internet]. 2024 Dec 1 [cited 2024 Apr 22];36(1):1–20. Available from: https://enveurope.springeropen.com/articles/10.1186/s12302-024-00853-5
- 68. ul Ain Q, Batool M, Choi S. TLR4-Targeting Therapeutics: Structural Basis and Computer-Aided Drug Discovery Approaches. Molecules 2020, Vol 25, Page 627 [Internet]. 2020 Jan 31 [cited 2023 Dec 28];25(3):627. Available from: https://www.mdpi.com/1420-3049/25/3/627/htm pmid:32023919
- 69. Kar T, Narsaria U, Basak S, Deb D, Castiglione F, Mueller DM, et al. A candidate multi-epitope vaccine against SARS-CoV-2. Scientific Reports 2020 10:1 [Internet]. 2020 Jul 2 [cited 2023 Jan 17];10(1):1–24. Available from: https://www.nature.com/articles/s41598-020-67749-1
- 70. Lemkul J. GROMACS Tutorial: Lysozyme in Water [Internet]. [cited 2024 Apr 22]. Available from: http://www.mdtutorials.com/gmx/lysozyme/index.html
- 71.
Turner P. XMGRACE [Internet]. Center for Coastal and Land-Margin Research, Oregon Graduate Institute of Science and Technology, Beaverton; 2005 [cited 2024 Apr 22]. Available from: https://plasma-gate.weizmann.ac.il/Grace/COPYRIGHT.html
- 72. Laskowski RA, Jabłońska J, Pravda L, Vařeková RS, Thornton JM. PDBsum: Structural summaries of PDB entries. Protein Sci [Internet]. 2018 Jan 1 [cited 2023 Nov 12];27(1):129. Available from: /pmc/articles/PMC5734310/ pmid:28875543
- 73. Yi C, Sun X, Ye J, Ding L, Liu M, Yang Z, et al. Key residues of the receptor binding motif in the spike protein of SARS-CoV-2 that interact with ACE2 and neutralizing antibodies. Cellular & Molecular Immunology 2020 17:6 [Internet]. 2020 May;17(6):621–30. Available from: https://www.nature.com/articles/s41423-020-0458-z
- 74. Nagai Y, Takatsu K. Role of the Immune System in Obesity-Associated Inflammation and Insulin Resistance. Nutrition in the Prevention and Treatment of Abdominal Obesity. 2014 Jan 1;281–93.
- 75. Jin MS, Kim SE, Heo JY, Lee ME, Kim HM, Paik SG, et al. Crystal Structure of the TLR1-TLR2 Heterodimer Induced by Binding of a Tri-Acylated Lipopeptide. Cell [Internet]. 2007 Sep 21 [cited 2023 Jul 7];130(6):1071–82. Available from: http://www.cell.com/article/S009286740701152X/fulltext pmid:17889651
- 76. Vangone A, Bonvin AMJJ. Contacts-based prediction of binding affinity in protein–protein complexes. Elife. 2015 Jul 20;4(JULY2015). pmid:26193119
- 77. Xue LC, Rodrigues JP, Kastritis PL, Bonvin AM, Vangone A. PRODIGY: a web server for predicting the binding affinity of protein–protein complexes. Bioinformatics [Internet]. 2016 Dec 1 [cited 2023 Jul 24];32(23):3676–8. Available from: pmid:27503228
- 78. Vangone A, Schaarschmidt J, Koukos P, Geng C, Citro N, Trellet ME, et al. Large-scale prediction of binding affinity in protein–small ligand complexes: the PRODIGY-LIG web server. Bioinformatics [Internet]. 2019 May 1 [cited 2023 Jul 24];35(9):1585–7. Available from: pmid:31051038
- 79. Li S, Xi L, Wang C, Li J, Lei B, Liu H, et al. A novel method for protein-ligand binding affinity prediction and the related descriptors exploration. J Comput Chem [Internet]. 2009 Apr 30 [cited 2023 Jul 24];30(6):900–9. Available from: https://onlinelibrary.wiley.com/doi/full/10.1002/jcc.21078 pmid:18785151
- 80. Seo S, Choi J, Park S, Ahn J. Binding affinity prediction for protein–ligand complex using deep attention mechanism based on intermolecular interactions. BMC Bioinformatics [Internet]. 2021 Dec 1 [cited 2023 Jul 24];22(1):1–15. Available from: https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-021-04466-0
- 81. Puigbò P, Guzmán E, Romeu A, Garcia-Vallvé S. OPTIMIZER: a web server for optimizing the codon usage of DNA sequences. Nucleic Acids Res [Internet]. 2007 Jul [cited 2023 Jan 7];35(Web Server issue):W126. Available from: /pmc/articles/PMC1933141/ pmid:17439967
- 82. Rouzbahani AK, Kheirandish F, Hosseini SZ. Design of a multi-epitope-based peptide vaccine against the S and N proteins of SARS-COV-2 using immunoinformatics approach. Egyptian Journal of Medical Human Genetics [Internet]. 2022 Dec 1 [cited 2023 Jan 19];23(1):1–18. Available from: https://jmhg.springeropen.com/articles/10.1186/s43042-022-00224-w
- 83. Butt AM, Nasrullah I, Tahir S, Tong Y. Comparative Genomics Analysis of Mycobacterium ulcerans for the Identification of Putative Essential Genes and Therapeutic Candidates. PLoS One [Internet]. 2012 Aug 13 [cited 2023 Feb 7];7(8):e43080. Available from: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0043080 pmid:22912793
- 84. Can H, Köseoğlu AE, Erkunt Alak S, Güvendi M, Döşkaya M, Karakavuk M, et al. In silico discovery of antigenic proteins and epitopes of SARS-CoV-2 for the development of a vaccine or a diagnostic approach for COVID-19. Sci Rep [Internet]. 2020 Dec 1 [cited 2022 Nov 16];10(1). Available from: /pmc/articles/PMC7769971/
- 85. Khairkhah N, Bolhassani A, Agi E, Namvar A, Nikyar A. Immunological investigation of a multiepitope peptide vaccine candidate based on main proteins of SARS-CoV-2 pathogen. PLoS One [Internet]. 2022 Jun 1 [cited 2023 Aug 8];17(6). Available from: /pmc/articles/PMC9182696/ pmid:35679246
- 86. Sadeghi Z, Fasihi-Ramandi M, Bouzari S. Evaluation of immunogenicity of novel multi-epitope subunit vaccines in combination with poly I:C against Brucella melitensis and Brucella abortus infection. Int Immunopharmacol [Internet]. 2019 Oct 1 [cited 2023 Oct 24];75. Available from: https://pubmed.ncbi.nlm.nih.gov/31437796/ pmid:31437796
- 87. Khairkhah N, Aghasadeghi MR, Namvar A, Bolhassani A. Design of novel multiepitope constructs-based peptide vaccine against the structural S, N and M proteins of human COVID-19 using immunoinformatics analysis. PLoS One [Internet]. 2020 Oct 1 [cited 2023 Nov 7];15(10):e0240577. Available from: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0240577 pmid:33057358
- 88. Colovos C, Yeates TO. Verification of protein structures: patterns of nonbonded atomic interactions. Protein Sci [Internet]. 1993 [cited 2023 Jul 5];2(9):1511–9. Available from: https://pubmed.ncbi.nlm.nih.gov/8401235/ pmid:8401235
- 89. Sippl MJ. Recognition of errors in three-dimensional structures of proteins. Proteins: Structure, Function, and Bioinformatics [Internet]. 1993 Dec 1 [cited 2023 Jul 5];17(4):355–62. Available from: https://onlinelibrary.wiley.com/doi/full/ pmid:8108378
- 90. V’kovski P, Kratzel A, Steiner S, Stalder H, Thiel V. Coronavirus biology and replication: implications for SARS-CoV-2. Nature Reviews Microbiology 2020 19:3 [Internet]. 2020 Oct 28 [cited 2022 Nov 17];19(3):155–70. Available from: https://www.nature.com/articles/s41579-020-00468-6 pmid:33116300
- 91. Neamtu A, Mocci F, Laaksonen A, Barroso da Silva FL. Towards an optimal monoclonal antibody with higher binding affinity to the receptor-binding domain of SARS-CoV-2 spike proteins from different variants. Colloids Surf B Biointerfaces [Internet]. 2022 Jan 1 [cited 2022 Nov 16];221:112986. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0927776522006701 pmid:36375294
- 92. Walls AC, Park YJ, Tortorici MA, Wall A, McGuire AT, Veesler D. Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell [Internet]. 2020 Apr 16 [cited 2022 Nov 17];181(2):281–292.e6. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0092867420302622 pmid:32155444
- 93. Xia X. Domains and Functions of Spike Protein in SARS-Cov-2 in the Context of Vaccine Design. Viruses [Internet]. 2021 Jan 1 [cited 2022 Nov 18];13(1). Available from: /pmc/articles/PMC7829931/ pmid:33466921
- 94. Pang W, Lu Y, Zhao YB, Shen F, Fan CF, Wang Q, et al. A variant-proof SARS-CoV-2 vaccine targeting HR1 domain in S2 subunit of spike protein. Cell Research 2022 [Internet]. 2022 Nov 10 [cited 2022 Nov 18];1–18. Available from: https://www.nature.com/articles/s41422-022-00746-3 pmid:36357786
- 95. Huang Y, Yang C, feng Xu X, Xu W, wen Liu S. Structural and functional properties of SARS-CoV-2 spike protein: potential antivirus drug development for COVID-19. Acta Pharmacologica Sinica 2020 41:9 [Internet]. 2020 Aug 3 [cited 2022 Nov 16];41(9):1141–9. Available from: https://www.nature.com/articles/s41401-020-0485-4
- 96. Xia S, Zhu Y, Liu M, Lan Q, Xu W, Wu Y, et al. Fusion mechanism of 2019-nCoV and fusion inhibitors targeting HR1 domain in spike protein. Cellular & Molecular Immunology 2020 17:7 [Internet]. 2020 Feb 11 [cited 2022 Nov 17];17(7):765–7. Available from: https://www.nature.com/articles/s41423-020-0374-2
- 97. Ng KT, Mohd-Ismail NK, Tan YJ. Spike S2 Subunit: The Dark Horse in the Race for Prophylactic and Therapeutic Interventions against SARS-CoV-2. Vaccines (Basel) [Internet]. 2021 Feb 1 [cited 2022 Nov 17];9(2):1–12. Available from: /pmc/articles/PMC7923282/ pmid:33672450
- 98. Xu X, Li G, Sun B, Zuo YY. S2 Subunit of SARS-CoV-2 Spike Protein Induces Domain Fusion in Natural Pulmonary Surfactant Monolayers. Journal of Physical Chemistry Letters [Internet]. 2022 Sep 8 [cited 2022 Nov 16];2022:8359–64. Available from: https://pubs.acs.org/doi/full/10.1021/acs.jpclett.2c01998 pmid:36043851
- 99. Burnett DL, Jackson KJL, Langley DB, Aggrawal A, Stella AO, Johansen MD, et al. Immunizations with diverse sarbecovirus receptor-binding domains elicit SARS-CoV-2 neutralizing antibodies against a conserved site of vulnerability. Immunity. 2021 Dec 14;54(12):2908–2921.e6. pmid:34788600
- 100. Cubuk J, Alston JJ, Incicco JJ, Singh S, Stuchell-Brereton MD, Ward MD, et al. The SARS-CoV-2 nucleocapsid protein is dynamic, disordered, and phase separates with RNA. Nature Communications 2021 12:1 [Internet]. 2021 Mar 29 [cited 2022 Nov 17];12(1):1–17. Available from: https://www.nature.com/articles/s41467-021-21953-3
- 101. Chen J, Deng Y, Huang B, Han D, Wang W, Huang M, et al. DNA Vaccines Expressing the Envelope and Membrane Proteins Provide Partial Protection Against SARS-CoV-2 in Mice. Front Immunol. 2022 Feb 24;13:727. pmid:35281016
- 102. Liu H, Wu NC, Yuan M, Bangaru S, Torres JL, Caniels TG, et al. Cross-Neutralization of a SARS-CoV-2 Antibody to a Functionally Conserved Site Is Mediated by Avidity. Immunity. 2020 Dec 15;53(6):1272–1280.e5. pmid:33242394
- 103. Li T, Cai H, Zhao Y, Li Y, Lai Y, Yao H, et al. Uncovering a conserved vulnerability site in SARS‐CoV‐2 by a human antibody. EMBO Mol Med [Internet]. 2021 Dec 12 [cited 2023 Feb 23];13(12). Available from: /pmc/articles/PMC8646660/ pmid:34672091
- 104. Khan MT, Islam MJ, Parihar A, Islam R, Jerin TJ, Dhote R, et al. Immunoinformatics and molecular modeling approach to design universal multi-epitope vaccine for SARS-CoV-2. Inform Med Unlocked. 2021 Jan 1;24:100578. pmid:33898733
- 105. Feng Y, Jiang H, Qiu M, Liu L, Zou S, Li Y, et al. Multi-epitope vaccine design using an immunoinformatic approach for sars-cov-2. Pathogens [Internet]. 2021 Jun 1 [cited 2023 Feb 7];10(6):737. Available from: https://www.mdpi.com/2076-0817/10/6/737/htm pmid:34208061
- 106. Adam KM. Immunoinformatics approach for multi-epitope vaccine design against structural proteins and ORF1a polyprotein of severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2). Trop Dis Travel Med Vaccines [Internet]. 2021 Dec 1 [cited 2023 Jan 17];7(1):1–13. Available from: https://tdtmvjournal.biomedcentral.com/articles/10.1186/s40794-021-00147-1
- 107. Tahir Ul Qamar M, Shahid F, Aslam S, Ashfaq UA, Aslam S, Fatima I, et al. Reverse vaccinology assisted designing of multiepitope-based subunit vaccine against SARS-CoV-2. Infect Dis Poverty. 2020 Sep 16;9(1). pmid:32938504
- 108. Susithra Priyadarshni M, Isaac Kirubakaran S, Harish MC. In silico approach to design a multi-epitopic vaccine candidate targeting the non-mutational immunogenic regions in envelope protein and surface glycoprotein of SARS-CoV-2. J Biomol Struct Dyn [Internet]. 2021 [cited 2023 Feb 7];1. Available from: /pmc/articles/PMC8477437/ pmid:34528491
- 109. Dong R, Chu Z, Yu F, Zha Y. Contriving Multi-Epitope Subunit of Vaccine for COVID-19: Immunoinformatics Approaches. Front Immunol. 2020 Jul 28;11:1784. pmid:32849643
- 110. Almofti YA, Abd-elrahman KA, Eltilib EEM. Vaccinomic approach for novel multi epitopes vaccine against severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2). BMC Immunology 2021 22:1 [Internet]. 2021 Mar 25 [cited 2023 Feb 7];22(1):1–20. Available from: https://bmcimmunol.biomedcentral.com/articles/10.1186/s12865-021-00412-0
- 111. Noorimotlagh Z, Karami C, Mirzaee SA, Kaffashian M, Mami S, Azizi M. Immune and bioinformatics identification of T cell and B cell epitopes in the protein structure of SARS-CoV-2: A systematic review. Int Immunopharmacol [Internet]. 2020 Jun 28 [cited 2023 Feb 7];86:106738–106738. Available from: https://europepmc.org/articles/PMC7321027 pmid:32683296
- 112. Zhang Y, Liang S, Zhang S, Zhang S, Yu Y, Yao H, et al. Development and evaluation of a multi-epitope subunit vaccine against group B Streptococcus infection. Emerg Microbes Infect [Internet]. 2022 [cited 2023 Jul 4];11(1):2371–82. Available from: https://www.tandfonline.com/doi/abs/10.1080/22221751.2022.2122585 pmid:36069613
- 113. Kaushik V, Gupta SK, Kalra LR, Shaikh U, Cavallo AR, Chawla L. Immunoinformatics Aided Design and In-Vivo Validation of a Cross-Reactive Peptide Based Multi-Epitope Vaccine Targeting Multiple Serotypes of Dengue Virus. Front Immunol. 2022;13. pmid:35799781
- 114. Vihinen M. Solubility of proteins. ADMET DMPK [Internet]. 2020 Jun 28 [cited 2023 Jul 10];8(4):391. Available from: /pmc/articles/PMC8915590/ pmid:35300195
- 115. Alhabibi AM, Hassan AS, Elbaky NMA, Eid HA, Alzaher Khalifa MAA, Wahab MA, et al. Impact of Toll-Like Receptor 2 and 9 Gene Polymorphisms on COVID-19: Susceptibility, Severity, and Thrombosis. J Inflamm Res [Internet]. 2023 Feb 17 [cited 2023 Jul 24];16:665–75. Available from: https://www.dovepress.com/impact-of-toll-like-receptor-2-and-9-gene-polymorphisms-on-covid-19-su-peer-reviewed-fulltext-article-JIR pmid:36825132
- 116. Alturaiki W, Alkadi H, Alamri S, Awadalla ME, Alfaez A, Mubarak A, et al. Association between the expression of toll-like receptors, cytokines, and homeostatic chemokines in SARS-CoV-2 infection and COVID-19 severity. Heliyon. 2023 Jan 1;9(1):e12653. pmid:36589720
- 117. Wan S, Bhati AP, Zasada SJ, Coveney P V. Rapid, accurate, precise and reproducible ligand–protein binding free energy prediction. Interface Focus [Internet]. 2020 Dec 6 [cited 2023 Jul 24];10(6). Available from: https://royalsocietypublishing.org/doi/ pmid:33178418
- 118. Temtanakitpaisan Y, Saengnipanthkul S, Sarakosol N, Maskasame S, Mongkon S, Buranrat B, et al. Reactogenicity and immunogenicity of the intradermal administration of BNT162b2 mRNA vaccine in healthy adults who were primed with an inactivated SARS-CoV-2 vaccine. Vaccine X [Internet]. 2022 Dec 1 [cited 2023 Aug 16];12. Available from: /pmc/articles/PMC9671541/ pmid:36415450
- 119. Tavares LM, de Jesus LCL, da Silva TF, Barroso FAL, Batista VL, Coelho-Rocha ND, et al. Novel Strategies for Efficient Production and Delivery of Live Biotherapeutics and Biotechnological Uses of Lactococcus lactis: The Lactic Acid Bacterium Model. Front Bioeng Biotechnol. 2020 Nov 4;8:1269. pmid:33251190
- 120. Song AAL, In LLA, Lim SHE, Rahim RA. A review on Lactococcus lactis: from food to factory. Microbial Cell Factories 2017 16:1 [Internet]. 2017 Apr 4 [cited 2022 Nov 16];16(1):1–15. Available from: https://microbialcellfactories.biomedcentral.com/articles/10.1186/s12934-017-0669-x
- 121. Gurien LA, Stallings-Archer K, Smith SD. Probiotic Lactococcus lactis decreases incidence and severity of necrotizing enterocolitis in a preterm animal model. J Neonatal Perinatal Med [Internet]. 2018 [cited 2023 Jul 22];11(1):65–9. Available from: https://pubmed.ncbi.nlm.nih.gov/29689748/ pmid:29689748
- 122. Wang T, Wang Z, Mi J, Wang W, Li K, Qi X, et al. Recombinant Avian β-Defensin Produced by Food-Grade Lactococcus as a Novel and Potent Immunological Enhancer Adjuvant for Avian Vaccine. Probiotics Antimicrob Proteins [Internet]. 2021 Dec 1 [cited 2023 Apr 4];13(6):1833–46. Available from: https://link.springer.com/article/10.1007/s12602-021-09847-8