Figures
Abstract
Background
The surface of Leishmania spp. presents glycoprotein 63 (GP63), a metalloprotease that acts as one of the parasite’s major antigens. A vaccine against leishmaniasis has not yet been developed and stationary phase promastigotes have utmost importance in transmitting Leishmania spp. from phlebotomine sand fly to humans or reservoirs. Therefore, this study aimed to analyze GP63 protein in three different Leishmania spp. to determine new vaccine candidate antigen against leishmaniasis using sequencing data of locally detected Leishmania strains and in silico approaches.
Methodology/Principal findings
The GP63 protein sequences of the stationary phase/amastigote form of L. infantum, L. major, and L. tropica were identified and then the gene encoding GP63 protein in Leishmania positive samples (n:59) was amplified and sequenced for variation analysis. According to the results, 4, 6, 19 GP63 variants were found within L. infantum, L. major, and L. tropica isolates, respectively. The most prevalent variants within each species were selected for further analysis using in silico approaches. Accordingly, all selected GP63 proteins were antigenic and the amount of B and T cell epitopes were 23 for L. infantum, 10 for L. major, and 9 for L. tropica. The analysis of each epitope showed that all of them were non-toxic, non-allergen, and soluble but had different antigenicity values. Among these epitopes, EMEDQGSAGSAGS associated with L. major, STHDSGSTTC and AEDILTDEKRDILRK epitopes associated with L. infantum had the highest antigenicity values for B cell, MHC-I, and MHC-II epitopes, respectively. Moreover, conserved epitopes were detected among two or three Leishmania species.
Author summary
Leishmaniasis is a neglected parasitic disease that causes significant health problems in humans as well as canine reservoirs. A safe and protective vaccine against leishmaniasis has not yet been developed. Leishmania spp. express glycoprotein 63 (GP63) which is a metalloprotease that acts as one of the parasite’s major surface antigens. In this study, we aimed to analyze the multi-copy GP63 gene in three different Leishmania species and determine a novel vaccine candidate antigen against leishmaniasis using sequencing data of locally detected Leishmania strains isolated from humans and in silico approaches. According to the results, the GP63 proteins selected for each Leishmania species were antigenic and contained many B and T cell epitopes. Each epitope selected was non-toxic, non-allergenic and soluble, with different antigenicity values. Among them, the epitopes EMEDQGSAGSAGSAGS, STHDSGSTTC and AEDILTDEKRDILRK were found to be more suitable for the development of vaccines and serologic tests. Furthermore, conserved epitopes identified between two or three Leishmania species can be valuable for vaccine development purposes.
Citation: Güvendi M, Can H, Yavuz İ, Özbilgin A, Değirmenci Döşkaya A, Karakavuk M, et al. (2025) In silico identification of Leishmania GP63 protein epitopes to generate a new vaccine antigen against leishmaniasis. PLoS Negl Trop Dis 19(6): e0013137. https://doi.org/10.1371/journal.pntd.0013137
Editor: Claudia Ida Brodskyn, Centro de Pesquisa Gonçalo Moniz-FIOCRUZ/BA, BRAZIL
Received: January 20, 2025; Accepted: May 11, 2025; Published: June 5, 2025
Copyright: © 2025 Güvendi et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: This study was supported by the grant from The Scientific and Technological Research Council of Türkiye (TÜBİTAK; https://www.tubitak.gov.tr/) (Project number: 22AG081). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Leishmaniasis is a vector-borne infectious disease caused by obligate intracellular protozoan parasites of the genus Leishmania [1]. There are 22 different species within the genus Leishmania [2]. Each Leishmania species is characterized by host factors, specific geographical preferences, and the symptoms occurred during the disease they cause. Leishmaniasis presents in three main clinical forms: cutaneous leishmaniasis (CL), visceral leishmaniasis (VL), and mucocutaneous leishmaniasis (MCL). The other two rare types are disseminated cutaneous leishmaniasis (DCL) and post-kala-azar dermal leishmaniasis (PKDL) [3]. VL, also known as kala-azar, is the most lethal form of leishmaniasis and fatal in over 95% of untreated cases. VL is caused by L. infantum, L. chagasi, and L. donovani. CL is the most common form which causes ulcerative skin lesions on exposed parts of the body. CL is typically caused by L. tropica, L. major, L. aethiopica, L. amazonensis, L. mexicana, L. braziliensis, L. guyanensis, and L. panamensis [4]. All Leishmania species, including those that infect mammals, are transmitted by phlebotomine sand flies.
The relatively simple life cycle of Leishmania is characterized by two basic phases, the promastigote, the motile form that lives in the gut of the sand fly vector, and the amastigote, the immobile form that resides in vertebrate host macrophages. In the gut of the sandfly, initially promastigotes rapidly divide which are called logarithmic phase promastigotes (log phase promastigotes). At this stage, the parasites exhibit low infectivity. Then, in the gut of the sandfly, promastigotes become non-dividing but more virulent which are called stationary phase promastigotes (Fig 1) [5–7]. During the blood meal, the Leishmania-infected sand fly inoculates the stationary phase promastigotes into the skin and the promastigotes are phagocytosed by the macrophages and/or dendritic cells of the vertebrate host. Next, the promastigotes rapidly transform into amastigotes and continue to grow and multiply. When another sand fly sucks blood from an infected host, the amastigote-containing macrophages are transferred and in the sandfly’s midgut the amastigotes are transformed into rapidly dividing log phase promastigotes (Fig 1) [8,9].
Accordingly, the infected sand fly inoculates the stationary phase promastigotes into the skin and the promastigotes are phagocytosed by the macrophages of the vertebrate host. Then, the promastigotes rapidly transform into amastigotes and continue to grow and multiply. When another sand fly sucks blood from an infected host, the amastigote-containing macrophages are transferred and in the sandfly’s midgut the amastigotes are transformed into rapidly dividing log phase promastigotes. In the gut of the sandfly, initially promastigotes rapidly divide which are called logarithmic phase promastigotes (log phase promastigotes) and they have low infectivity. Then, in the gut of the sandfly, promastigotes become non-dividing but more virulent which are called stationary phase promastigotes and ready to infect the host by sandfly bite. Microsoft Office PowerPoint and Paint were used only as a tool during the drawing of the figure and Phlebotomus image was obtained from https://commons.wikimedia.org/wiki/File:Phlebotomus_(01).png.
L. infantum, L. tropica, and L. major have 36 chromosomes [10]. GP63 protein also known as Leishmanolysin is one of the major surface antigen that facilitates the entry of Leishmania parasites into macrophages, contributes to parasite survival and has shown to be immunogenic in limited vaccine studies [11–14]. However, GP63 protein is encoded from different chromosomes and different gene regions (multi-copy gene) in different life stages of Leishmania (log phase, stationary phase, amastigote) and there are also differences in its expression between species. Previous studies have reported that one group of genes encoding the GP63 protein: First group (Group 1) is constitutively expressed, where a second group (Group 2) is expressed in the log phase promastigote, and a third group (Group 3) is expressed in the stationary phase promastigote/amastigote [15]. In addition, the metalloprotease GP63 protein contains pro-peptide sequences in the N-terminal and C-terminal regions that are removed during biosynthesis and maturation. These pro-peptide sequences protect the promastigote from the protease activity of the active protein [16]. In this study, we aimed to sequence the more virulent stationary phase promastigote/amastigote specific GP63 gene (Group 3) in L. infantum, L. major and L. tropica isolates (n:59) detected in clinical cases from Türkiye. The sequence data were then used to detect variations in GP63 within each Leishmania species. Next, the prevalent GP63 protein sequences in each Leishmania species were analyzed by in silico analysis to identify novel epitopes to generate a new antigen that could be used in vaccine studies or development of new serological assays.
Results
Sample identification
The stationary phase promastigote/amastigote specific GP63 genes in L. infantum, L. major, and L. tropica were amplified by PCR and the ~ 1800 bp amplicons were sequenced. According to the sequence data, similarity rates were changed from 99.78 to 100% for L. infantum samples (n: 10), 99.23 to 100% for L. major samples (n: 20), and 90.09 to 91.48% for L. tropica samples (n: 29) (Table 1). The GP63 gene of L. infantum is more conserved than that of L. major and L. tropica. In contrast, the GP63 gene of L. tropica is more polymorphic and the similarity with the reference GP63 gene is reduced to 90.09%. Sequence data were given in S2, S3, and S4 Files.
Variations of GP63 proteins
Amino acids sequences of GP63 protein belonging to each sequenced Leishmania sample were aligned with reference GP63 proteins (XM_001463664.2, XM_001681327.1, and CM024296.1) using MEGA X. Although GP63 protein has propeptide regions at N-terminal and C-terminal ends, mature GP63 protein presented on the surface of Leishmania spp. doesn’t have propeptide regions. Therefore, after removing propeptides of GP63 protein, amino acid region between 101–577 were evaluated and variations were identified.
In L. infantum group, five different GP63 variants were identified and named as Variant 1, 2, 3, 4 and 5. Among them only Variant 1 (sample 63) had a 100% similarity with the reference GP63 protein (LR812943.1) (Table 1). However, prevalent GP63 protein was the Variant 3 which is detected in 5 different L. infantum samples (samples 65, 66, 67, 68, and 69) (Table 2). For L. major, six different GP63 variants were identified and named as Variant 1, 2, 3, 4, 5, and 6. Only Variant 1 was 100% identical to the reference GP63 protein (OU755544.1) (Table 1) and at the same time, prevalent GP63 protein was also Variant 1 which is identical to reference GP63 protein detected in 8 different L. major samples (Table 3). When compared to the L. infantum and L. major, more variations were identified in GP63 protein of L. tropica samples. For L. tropica, 19 different GP63 variants were identified and named as Variant 1, 2, 3…, and 19. Any L. tropica isolate did not show %100 similarity with the reference GP63 protein (AJ495009.1) (Table 1). Among them, Variant 1 was most identical to the reference GP63 protein and at the same time, prevalent GP63 protein was also Variant 1 and detected in 9 different L. tropica samples (Table 4).
Properties of selected GP63 proteins
When selected GP63 proteins were compared, the number of amino acids were 477 for L. infantum and L. major (except of L. tropica which was 476 aa) and molecular weights were ~51.5 kDa. The theoretical pI values were between 5.23 and 5.83. All proteins had a half-life of 30 hours and were stable. Aliphatic indexes were between 71.64 and 75.68. For the selected GP63 proteins, the alpha helix scores were between 23.06% and 27.67%, the extended strand scores were between 19.08% and 23.90%, and the random coil scores were between 50.94% and 53.25% (Table 7). All the selected GP63 proteins were antigenic and non-allergenic. The GP63 proteins selected for L. infantum and L. major have N-glycosylation sites and the GP63 proteins selected for L. infantum, L. major, and L. tropica have O-glycosylation sites. (Table 6).
B-cell epitopes
Several B cell epitopes were predicted in the GP63 protein of L. infantum, L. major, and L. tropica. When an epitope was antigenic, non-toxic, non-allergenic, and soluble, it was selected as a B cell epitope. Among these B cell epitopes, the EMEDQGSAGSAGSAGS epitope predicted in GP63 protein of L. major had the highest antigenicity value (2.2086). In the GP63 protein of L. infantum, EDILTDEKRDILVKH epitope had the highest antigenicity value (1.1605), while the LQLHTERLKARQVQGKWKVT epitope predicted on the GP63 protein of L. tropica had the highest antigenicity value (0.9818). Interestingly, these epitopes with high antigenicity values did not have residue/s presented in non-linear B cell epitope. An epitope LQLHTERLKARQVQGKWKVT was found in all selected GP63 protein with minor variations in some residues (underlined residues). Another epitope, EDILTDEKRDILVKH, was found in selected GP63 proteins of L. infantum and L. tropica with minor variations in some residues (underlined residues) and these two amino acids variation caused a dramatic change in their antigenicity.
A different epitope, SHIKMRNAQDEL, shared by selected GP63 proteins of L. major and L. tropica has residues presented in the non-linear B cell epitope (bold residues). Moreover, epitopes with residue(s) presented in the non-linear B cell epitope had lower antigenicity values (Table 7). According to the docking results, all B cell epitopes were docked to variable regions of human B cell heavy and light chains of Fab fragment.
MHC-I/II epitopes
Several MHC-I/II epitopes were predicted in the GP63 protein of L. infantum, L. major, and L. tropica. When an epitope was antigenic, non-toxic, non-allergenic and soluble, it was selected as an MHC-I or MHC-II epitope. Among the MHC-I epitopes, the STHDSGSTTC epitope predicted in GP63 protein of L. infantum had the highest antigenicity value (1.5063) (Table 8). The RLKVQQVQGK epitope predicted on the GP63 protein of L. major had the highest antigenicity value (1.0311), while no epitope was predicted on the GP63 protein of L. tropica. The predicted epitopes were non-toxic, non-allergenic, and soluble (Table 9). An epitope, RLKVRQVQDK, was predicted in selected GP63 proteins of L. infantum and L. major with minor changes in some residues (underlined residues) and these two amino acid changes caused slight change in antigenicity. Another epitope, KRDILVKHLI, was shared with GP63 proteins of L. infantum and L. major.
Among the MHC-II epitopes, the AEDILTDEKRDILVK epitope predicted on GP63 protein of L. infantum had the highest antigenicity value (1.1978) (Table 10). The EDARIVASVPNVRGK epitope predicted on the GP63 protein of L. major had the highest antigenicity value (1.180), while the DPAYHCARVGQRVHN epitope predicted on the GP63 protein of L. tropica had the highest antigenicity value (1.1654) (Tables 11 and 12). An epitope, CTAEDILTDEKRDIL was shared with selected GP63 proteins of L. infantum and L. tropica. Also, four epitopes (AEDILTDEKRDILRK, TAEDILTDEKRDILR, EDILTDEKRDILRKY, and DPAYHCARVGQRVHN) were detected in selected GP63 proteins of L. infantum and L. tropica with minor changes in some residues (underlined residues) and these two amino acid changes caused dramatic changes in antigenicity values. According to the docking results, MHC-I/II epitopes with highest antigenicity value were docked to their MHC-I/II alleles with a different docking score (Figs 2 and 3).
Discussion
There are three main clinical forms of leishmaniasis: cutaneous leishmaniasis (CL), visceral leishmaniasis (VL), and mucocutaneous leishmaniasis (MCL), each associated with specific clinical presentation. Although leishmaniasis has devastating effects and is the most common parasitic disease after malaria, prevention and control of this disease is challenging due to the difficulty of the treatment course and the lack of a safe vaccine that confers complete protection [17]. Among ongoing vaccine studies, only ChAd63-KH completed phase2b clinical trials in 2023 [18]. Developing a vaccine for leishmaniasis is challenging due to complex life cycle of Leishmania species. During its life cycle, many proteins are involved in entry and egress from the host cell as well as in evading the immune response. Among these proteins, the GP63 surface protein allows the parasite to adhere to host macrophages and escape phagocytosis. In addition, GP63 has been used as antigen in different vaccine studies due to its important role in the pathogenesis of Leishmania species. For example, in a study, mice were vaccinated by CpG ODN co-encapsulated with recombinant GP63 protein in cationic liposomes and it achieved reduction in spleen parasite burden, higher IgG2a/IgG1 ratio, IFN-γ levels, and lower IL-4 levels compared to other groups [12]. In another study, after GP63 encoding gene was introduced into the Salmonella typhimurium which is a live oral vaccine strain, mice were vaccinated with this Salmonella typhimurium. The results showed that a significant resistance developed against challenge with L. major compared to controls [11]. In a different study, three groups of mice were initially vaccinated by DNA vaccines encoding GP63, Kmp-11, and Amastin proteins followed by booster with recombinant GP63, Kmp-11, and Amastin proteins. According to the challenging results of mice which were intraperitoneally administered with L. infantum, splenic parasitic loads decreased by 89%, 86%, and 79% in GP63, Kmp-11, and Amastin vaccine groups, respectively [14].
Although the GP63 protein has been used as an antigen in a limited number of vaccine studies, it is not clear whether cross-species variations in the Leishmania spp. multi-copy GP63 gene have been taken into account or which copy of GP63 was used. Previous studies have reported that the stationary phase promastigote/amastigote form is highly virulent and responsible for human infection [15]. Thus, GP63 expressed in stationary phase promastigote and consequently in amastigote in the host, is an important vaccine target.
To analyze this issue more in depth, we first sequenced the GP63 gene expressed in the stationary phase promastigote/amastigote form in L. infantum, L. major, and L. tropica isolates detected from clinical cases in Türkiye. According to the results, many variations were detected in the GP63 protein expressed in the stationary phase promastigote/amastigote form in L. infantum, L. major, and L. tropica isolates. Among Leishmania species, the stationary phase promastigote/amastigote GP63 protein of L. tropica showed noticeably more variation compared to L. major and L. infantum as shown by BLAST similarity results which was between 90.09-91.48% for L. tropica isolates, 99.78 to 100% for L. infantum isolates, and 99.23 to 100% for L. major isolates (Table 1). These data indicate that the GP63 protein derived from stationary-phase promastigotes/amastigotes of Leishmania infantum, a causative agent of visceral leishmaniasis, exhibits a higher degree of sequence conservation compared to the corresponding GP63 proteins from L. tropica and L. major, which are associated with cutaneous leishmaniasis. This may be due to narrower range of Phlebotomus vector, mammalians hosts, restricted distribution of L. infantum compared to L. tropica and L. major resulting with a more conserved genome. Moreover, while L. infantum is primarily zoonotic, L. tropica is more frequently acquired through anthroponotic transmission, and L. major has numerous wild rodent reservoirs, which may result in increased evolutionary pressures and genomic plasticity. It has been showed that the average amino acid identity between L. major and L. infantum was 92%, and the average nucleotide identity was 94% which indicates that they both have very similar genomes, both at the DNA and protein levels [19] and supports our results in which L. infantum and L. major have more conserved GP63 proteins compared to L. tropica (Tables 4, 5, and 6). Mosaic aneuploidy detected in different levels in various Leishmania spp. can also be linked with this GP63 variability between Leishmania isolates from different strains [20]. The GP63 protein variation data was important as it showed that if the GP63 protein of L. tropica is used as an antigen in any vaccine study, variation analysis has utmost importance for the selection of vaccine antigen.
There isn’t any significant difference between the selected GP63 proteins in terms of antigenicity value, allergenicity, post translational modifications, and physico-chemical parameters. However, the GRAVY value of L. infantum GP63 protein was lower than that of other species, which points to a higher degree of hydrophilicity. This indicates that L. infantum GP63 protein interacts more positively with surrounding water molecules, which may increase its solubility [21]. Such a property is especially advantageous for recombinant protein production, because the increased hydrophilicity facilitates higher expression levels and improved folding efficiency in expression systems. On the other hand, the GP63 protein of L. major showed a higher aliphatic index, which indicates greater thermal stability over a wide temperature range while the GP63 protein of L. infantum showed a higher proportion of random coil structures, which may increase its structural flexibility and potentially increase its accessibility for antibody recognition [21,22]. Interestingly, while the GP63 proteins of L. infantum and L. major contained both N- and O-glycosylation motifs, the GP63 protein of L. tropica had only O-glycosylation sites and lacked any detectable N-glycosylation motifs. The absence of N-linked glycosylation sites in L. tropica could be attributed to the higher degree of sequence variation observed in the GP63 protein, which could disrupt the consensus sequences required for N-glycosylation. This feature also suggests that the GP63 protein of L. tropica could potentially be expressed in prokaryotic expression systems that are more cost-effective and technically simpler than eukaryotic platforms [23,24]. All predicted instability index were below the threshold value of 40, indicating predicted in vitro stability. However, the value for the L. infantum GP63 protein was closer to the threshold, suggesting that it may be slightly less stable than the others and potentially more sensitive to environmental conditions such as temperature or pH. When these parameters are evaluated collectively, the GP63 protein of L. infantum appears to be a more promising candidate for vaccine development due to its higher hydrophilicity, increased random coil structure, and the presence of post-translational modifications. Although its predicted stability is slightly lower than others, its overall biochemical and structural features offer advantages for recombinant production and immunogenic potential.
Secondly, the GP63 protein, which was consistently found to be highly represented across all analyzed Leishmania species, was selected for subsequent epitope prediction analyses. Several B cell epitopes were predicted in the GP63 protein of L. infantum, L. major, and L. tropica. All epitopes selected in this study are antigenic and soluble, as well as non-toxic and non-allergenic. Thus, all of them can be used as antigens in vaccine studies and serological tests such as ELISA or LFA (Lateral Flow Assay). Among these B cell epitopes, the EMEDQGSAGSAGSAGS epitope predicted on GP63 protein of L. major had the highest antigenicity value (2.2086). The EDILTDEKRDILVKH epitope predicted on the GP63 protein of L. infantum had the highest antigenicity value (1.1605), while the LQLHTERLKARQVQGKWKVT epitope predicted on the GP63 protein of L. tropica had the highest antigenicity value (0.9818). An epitope LQLHTERLKARQVQGKWKVT was conserved in all selected GP63 protein with a minor variations in some residues (underlined residues). In addition, EDILTDEKRDILVKH (underlined residues shows minor changes) epitope was conserved among L. infantum and L. tropica whereas SHIKMRNAQDEL epitope was conserved among L. major and L. tropica. These results suggest that these conserved epitopes could be used in development of the serological test targeting all three Leishmania species or two Leishmania species. Similarly, conserved MHC-I (RLKVRQVQDK, KRDILVKHLI for L. infantum and L. major; underlined residues show minor variations) and MHC-II epitopes (CTAEDILTDEKRDIL, AEDILTDEKRDILRK, TAEDILTDEKRDILR, EDILTDEKRDILRKY and DPAYHCARVGQRVHN for L. infantum and L. tropica; underlined residues show minor variations) were also identified. These results likewise suggest that such conserved epitopes could be used in epitope vaccine studies targeting two different Leishmania species. Moreover, it was proposed that the incorporation of conserved B cell epitopes into a multi-epitope vaccine construct could enhance the humoral immune response complementing the cellular immune response elicited by inclusion of conserved MHC-I/II T cell epitopes.
Conclusion
GP63 gene expressed in the stationary phase promastigote/amastigote form in L. infantum, L. major, and L. tropica isolates detected in clinical cases from Türkiye was sequenced for the first time. During variation analysis, the most common GP63 protein variants for each Leishmania species were identified and analyzed for epitope discovery using an in silico approach. Although a large number of epitopes were predicted, some were promising due to their high antigenicity values or conservation among Leishmania species. Overall, these promising epitopes could be used as antigens in vaccine studies and in the development of new serologic diagnostic assays.
Materials and methods
Ethics statement
This study was approved by the Research Ethics Committee of the Ege University Faculty of Medicine (Approval number: 24-6.1T/54) for the use of DNA samples. Written informed consent was provided to all patients, data privacy protection was guaranteed by anonymization of DNA samples.
DNA samples
L. tropica (n = 29), L. major (n = 20), L. infantum (n = 10) isolates used in this study were isolated from different clinical cases that were admitted to Celal Bayar University Hospital in Manisa province of Türkiye and were cryopreserved. DNA was extracted from these samples by a commercial DNA isolation kit (Zymo Research) and used in PCR studies. The Leishmania isolates analyzed were in this study were named as MHOM/TR/Year/Laboratory Number. Clinical characteristics of the patients from whom the parasites were isolated are provided in S1 File.
Selection of stationary phase promastigote/amastigote specific GP63 genes and primer design
The stationary phase promastigote/amastigote specific GP63 gene region was targeted during PCR. The stationary phase promastigote/amastigote-specific GP63 gene sequences of L. tropica and L. infantum were obtained from TriTryDB (https://tritrypdb.org/) database. In a previous study by Castro Neto et al. (2019), accession numbers LinJ.10.0520 and LmjF.10.0480 of L. infantum (GenBank: XM_001463664.2) and L. major (GenBank:XM_001681327.1) were reported to correspond to stationary phase/amastigote-specific GP63, respectively [15]. However, there is not any information about L. tropica. For this reason, the stationary phase promastigote/amastigote-specific GP63 gene of L. major was used as a template to find that of L. tropica, since existing molecular phylogenetic studies show that L. tropica and L. major are more closely related to each other, while L. infantum is in a more distant phylogenetic position than these two species. During analysis, L. major GP63 gene sequence (LmjF.10.0480) was blasted in the NCBI by selecting L. tropica as organism and the GP63 sequence with highest similarity rate (genome accession number in GenBank: CM024296.1) was accepted as the stationary phase promastigote/amastigote-specific GP63 gene. GP63 is a multi-copy gene and GP63 gene regions within a Leishmania species or between Leishmania species are similar to each other. Therefore, primers designed from the exon sequence annealed to different GP63 sites and generated multiple band patterns during PCR that were rendered unacceptable for sequencing. To solve this problem, forward and reverse primers were designed from the GP63 gene 5’UTR and 3’UTR intron regions of each Leishmania species as well as two additional sequencing primers using the NCBI Primer design tool (https://www.ncbi.nlm.nih.gov/tools/primer-blast/) (Table 13).
PCR and sequencing
PCR reaction for L. tropica in final volume of 50 μl included 1.5 μl template DNA, 10 µl 5x SuperFi II Buffer, 1 µl SuperFi II Taq Polimerase, 1 µl dNTPs, 0.05 µM of each primer and 36 μl distilled water. The following PCR cycle was used: 30 sec initial denaturation at 98 ºC, followed by 35 cycles of 10 sec denaturation at 98 ºC, 2 min extension at 72 ºC with a 5 min final extension at 72 ºC.
PCR reaction for L. major and L. infantum in final volume of 50 μl included 0.5 μl template DNA, 10 µl 5x SuperFi II Buffer, 1 µl Superfii Taq Polimerase, 1 µl dNTPs, 0.1 µM of each primer and 36.5 μl distilled water. The following PCR cycle was used: 30 sec initial denaturation at 98 ºC, followed by 35 cycles of 10 sec denaturation at 98 ºC, 10 sec annealing at 60 ºC, 2 min extension at 72 ºC with a 5 min final extension at 72 ºC. After amplification, PCR products were visualized on a 1% agarose gel and sequenced by the PCR primers as well as two additional primers defined for sequencing in Table 1 using the ABI 3740xl DNA Sequencer (Applied Biosystems). The sequence data obtained from each Leishmania species were aligned and translated to amino acid sequence by MEGA X [25] and variations in the stationary phase promastigote/amastigote specific GP63 protein were detected. The prevalent stationary phase promastigote/amastigote specific GP63 proteins within each Leishmania species were used in further in silico predictions.
Bioinformatics analyses of the selected GP63 proteins
During bioinformatic analyses, psychochemical properties, secondary structure, allergenicity, antigenicity, N-glycosylation and O-glycosylation sites, 3D structure, toxicity, and solubility of selected GP63 proteins were analyzed by tools described in Table 14.
Immunoinformatic analyses of the selected GP63 proteins
B-cell epitope prediction.
Linear B cell epitopes of selected GP63 proteins were predicted by Bcepred online server (http://crdd.osdd.net/raghava/bcepred/) [36]. Hydrophilicity, flexibility, accessibility, turns, exposed surface, polarity, and antigenic propensity parameters were selected as default and epitopes that have at least three of these parameters were selected for solubility, antigenicity, toxicity, and allergenicity predictions. Non-allergenic, non-toxic, soluble, and antigenic epitopes were accepted as B cell epitopes. Also, non-linear B cell epitopes of selected GP63 proteins were predicted by Ellipro online server (http://tools.iedb.org/ellipro/) [37] and residues detected as non-linear epitopes were shown on linear B cell epitopes.
MHC-I and MHC-II epitope prediction.
The prediction of MHC-I and MHC-II epitopes of the selected GP63 proteins were analyzed by NetMHCpan 4.1 EL method [38] using IEDB tool (https://www.iedb.org/). For the prediction of MHC-I epitopes, twelve different HLA supertypes (A01.01, A02.01, A03.01, A24.02, A26.01, B07.02, B08.01, B27. 05, B39.01, B40.01, B58. 01 and B15.01) and 9 different HLA alleles (A02.06, A23.01, A30.02, A32.01, B51.01, B57.01, A68.02, B35.01 and B53.01) proposed for L. donovani in a previous study were used [39]. Additionally, for MHC-II epitope prediction, seven different MHC-II alleles (DRB1.03.01, DRB1.07.01, DRB1.15.01, DRB3.01.01, DRB3.02.02, DRB4.01.01, and DRB5.01.01) were used during the analysis. Non-allergenic, non-toxic, soluble, and antigenic epitopes were accepted as MHC-I/II epitopes.
Docking analysis.
A three-dimensional structure of the selected peptides was generated using PEP-FOLD 3 (https://bioserv.rpbs.univ-paris-diderot.fr/services/PEP-FOLD3/) [40]. All selected B cell epitopes were docked with human B cell heavy and light chain of Fab fragment (PDB: 5DRW). Also, for each selected GP63 protein of L. infantum, L. major, and L. tropica, MHC-I or MHC-II epitopes with highest antigenicity value were docked with their MHC-I (PDB: 8RRO for HLA-A*03:01 and PDB: 4O2E for HLA-B*39:01) and MHC-II (PDB: Q30154 for HLA-DRB5*01:01 and 3D structure of HLA-DRB1*03:01 predicted with AlphaFold3 and HDOCK) alleles using the HDOCK (http://hdock.phys.hust.edu.cn/) [41] and visualized in the Pymol (https://www.pymol.org/) [42].
Supporting information
S1 File. Clinical characteristics of the patients from whom the parasites were isolated.
https://doi.org/10.1371/journal.pntd.0013137.s001
(XLSX)
S2 File. Sequence data belonging to L. infantum.
https://doi.org/10.1371/journal.pntd.0013137.s002
(DOCX)
S3 File. Sequence data belonging to L. major.
https://doi.org/10.1371/journal.pntd.0013137.s003
(DOCX)
S4 File. Sequence data belonging to L. tropica.
https://doi.org/10.1371/journal.pntd.0013137.s004
(DOCX)
References
- 1. Mann S, Frasca K, Scherrer S, Henao-Martínez AF, Newman S, Ramanan P, et al. A Review of Leishmaniasis: Current Knowledge and Future Directions. Curr Trop Med Rep. 2021;8(2):121–32. pmid:33747716
- 2. Berry I, Berrang-Ford L. Leishmaniasis, conflict, and political terror: A spatio-temporal analysis. Soc Sci Med. 2016;167:140–9. pmid:27194448
- 3. Sasidharan S, Saudagar P. Leishmaniasis: where are we and where are we heading? Parasitol Res. 2021;120(5):1541–54. pmid:33825036
- 4. Daga MK, Rohatgi I, Mishra R. Leishmaniasis. Indian J Crit Care Med. 2021;25(Suppl 2):S166–70. pmid:34345133
- 5. Wozencraft AO, Blackwell JM. Increased infectivity of stationary-phase promastigotes of Leishmania donovani: correlation with enhanced C3 binding capacity and CR3-mediated attachment to host macrophages. Immunology. 1987;60(4):559–63. pmid:2953670
- 6. da Silva R, Sacks DL. Metacyclogenesis is a major determinant of Leishmania promastigote virulence and attenuation. Infect Immun. 1987;55(11):2802–6. pmid:3666964
- 7. van Zandbergen G, Bollinger A, Wenzel A, Kamhawi S, Voll R, Klinger M, et al. Leishmania disease development depends on the presence of apoptotic promastigotes in the virulent inoculum. Proc Natl Acad Sci U S A. 2006;103(37):13837–42. pmid:16945916
- 8. Van Assche T, Deschacht M, da Luz RAI, Maes L, Cos P. Leishmania-macrophage interactions: insights into the redox biology. Free Radic Biol Med. 2011;51(2):337–51. pmid:21620959
- 9. Singh RK, Pandey HP, Sundar S. Visceral leishmaniasis (kala-azar): challenges ahead. Indian J Med Res. 2006;123(3):331–44. pmid:16778314
- 10. Leprohon P, Fernandez-Prada C, Gazanion É, Monte-Neto R, Ouellette M. Drug resistance analysis by next generation sequencing in Leishmania. Int J Parasitol Drugs Drug Resist. 2014;5(1):26–35. pmid:25941624
- 11. Xu D, Liew FY. Protection against leishmaniasis by injection of DNA encoding a major surface glycoprotein, gp63, of L. major. Immunology. 1995;84(2):173–6. pmid:7750991
- 12. Jaafari MR, Badiee A, Khamesipour A, Samiei A, Soroush D, Kheiri MT, et al. The role of CpG ODN in enhancement of immune response and protection in BALB/c mice immunized with recombinant major surface glycoprotein of Leishmania (rgp63) encapsulated in cationic liposome. Vaccine. 2007;25(32):6107–17. pmid:17629372
- 13. Olivier M, Atayde VD, Isnard A, Hassani K, Shio MT. Leishmania virulence factors: focus on the metalloprotease GP63. Microbes Infect. 2012;14(15):1377–89. pmid:22683718
- 14. Zhang J, He J, Liao X, Xiao Y, Liang C, Zhou Q, et al. Development of dominant epitope-based vaccines encoding Gp63, Kmp-11 and Amastin against visceral leishmaniasis. Immunobiology. 2021;226(3):152085. pmid:33910113
- 15. Castro Neto AL, Brito ANALM, Rezende AM, Magalhães FB, de Melo Neto OP. In silico characterization of multiple genes encoding the GP63 virulence protein from Leishmania braziliensis: identification of sources of variation and putative roles in immune evasion. BMC Genomics. 2019;20(1):118. pmid:30732584
- 16. Yao C, Donelson JE, Wilson ME. The major surface protease (MSP or GP63) of Leishmania sp. Biosynthesis, regulation of expression, and function. Mol Biochem Parasitol. 2003;132(1):1–16. pmid:14563532
- 17. Chakravarty J, Sundar S. Current and emerging medications for the treatment of leishmaniasis. Expert Opin Pharmacother. 2019;20(10):1251–65. pmid:31063412
- 18. Ayala A, Llanes A, Lleonart R, Restrepo CM. Advances in Leishmania Vaccines: Current Development and Future Prospects. Pathogens. 2024;13(9):812.
- 19. Peacock CS, Seeger K, Harris D, Murphy L, Ruiz JC, Quail MA, et al. Comparative genomic analysis of three Leishmania species that cause diverse human disease. Nat Genet. 2007;39(7):839–47. pmid:17572675
- 20. Lachaud L, Bourgeois N, Kuk N, Morelle C, Crobu L, Merlin G, et al. Constitutive mosaic aneuploidy is a unique genetic feature widespread in the Leishmania genus. Microbes Infect. 2014;16(1):61–6. pmid:24120456
- 21. Droppa-Almeida D, Franceschi E, Padilha FF. Immune-informatic analysis and design of peptide vaccine from multi-epitopes against Corynebacterium pseudotuberculosis. Bioinform Biol Insights. 2018;12:1177932218755337.
- 22. Shaddel M, Ebrahimi M, Tabandeh MR. Bioinformatics analysis of single and multi-hybrid epitopes of GRA-1, GRA-4, GRA-6 and GRA-7 proteins to improve DNA vaccine design against Toxoplasma gondii. J Parasit Dis. 2018;42(2):269–76. pmid:29844632
- 23. Can H, Erkunt Alak S, Köseoğlu AE, Döşkaya M, Ün C. Do Toxoplasma gondii apicoplast proteins have antigenic potential? An in silico study. Comput Biol Chem. 2020;84:107158. pmid:31810853
- 24. Can H, Köseoğlu AE, Erkunt Alak S, Güvendi M, Döşkaya M, Karakavuk M, et al. In silico discovery of antigenic proteins and epitopes of SARS-CoV-2 for the development of a vaccine or a diagnostic approach for COVID-19. Sci Rep. 2020;10(1):22387.
- 25. Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol Biol Evol. 2018;35(6):1547–9. pmid:29722887
- 26. Gasteiger E, Gattiker A, Hoogland C, Ivanyi I, Appel RD, Bairoch A. ExPASy: The proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res. 2003;31(13):3784–8. pmid:12824418
- 27. Garnier J, Gibrat JF, Robson B. GOR method for predicting protein secondary structure from amino acid sequence. Methods Enzymol. 1996;266:540–53. pmid:8743705
- 28. Dimitrov I, Bangov I, Flower DR, Doytchinova I. AllerTOP v.2--a server for in silico prediction of allergens. J Mol Model. 2014;20(6):2278. pmid:24878803
- 29. Sharma N, Patiyal S, Dhall A, Pande A, Arora C, Raghava GPS. AlgPred 2.0: an improved method for predicting allergenic proteins and mapping of IgE epitopes. Brief Bioinform. 2021;22(4):bbaa294. pmid:33201237
- 30. Doytchinova IA, Flower DR. VaxiJen: a server for prediction of protective antigens, tumour antigens and subunit vaccines. BMC Bioinformatics. 2007;8:4. pmid:17207271
- 31. Gupta R, Jung E, Brunak S. Prediction of N-glycosylation sites in human proteins. 2004. Available from: http://www.cbs.dtu.dk/services/NetNGlyc/.
- 32. Steentoft C, Vakhrushev SY, Joshi HJ, Kong Y, Vester-Christensen MB, Schjoldager KT-BG, et al. Precision mapping of the human O-GalNAc glycoproteome through SimpleCell technology. EMBO J. 2013;32(10):1478–88. pmid:23584533
- 33. Thompson B, Petrić Howe N. Alphafold 3.0: the AI protein predictor gets an upgrade. Nature. 2024. pmid:38719964
- 34. Rathore AS, Choudhury S, Arora A, Tijare P, Raghava GPS. ToxinPred 3.0: An improved method for predicting the toxicity of peptides. Comput Biol Med. 2024;179:108926. pmid:39038391
- 35. Lear S, Cobb SL. Pep-Calc.com: a set of web utilities for the calculation of peptide and peptoid properties and automatic mass spectral peak assignment. J Comput Aided Mol Des. 2016;30(3):271–7. pmid:26909892
- 36.
Saha S, Raghava GPS. BcePred: prediction of continuous B-cell epitopes in antigenic sequences using physico-chemical properties. Artificial Immune Systems. 2004. p. 197–204.
- 37. Ponomarenko J, Bui H-H, Li W, Fusseder N, Bourne PE, Sette A, et al. ElliPro: a new structure-based tool for the prediction of antibody epitopes. BMC Bioinformatics. 2008;9:514. pmid:19055730
- 38. Reynisson B, Alvarez B, Paul S, Peters B, Nielsen M. NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data. Nucleic Acids Res. 2020;48(W1):W449–54. pmid:32406916
- 39. Kumari S, Vijaykumar S, Kumar V, Ranjan R, Alti D, Singh V, et al. In silico and in vitro evaluation of the immunogenic potential of Leishmania donovani ascorbate peroxidase and its derived peptides. Acta Trop. 2024;260:107381. pmid:39244139
- 40. Lamiable A, Thévenet P, Rey J, Vavrusa M, Derreumaux P, Tufféry P. PEP-FOLD3: faster de novo structure prediction for linear peptides in solution and in complex. Nucleic Acids Res. 2016;44(W1):W449–54. pmid:27131374
- 41. Yan Y, Zhang D, Zhou P, Li B, Huang S-Y. HDOCK: a web server for protein-protein and protein-DNA/RNA docking based on a hybrid strategy. Nucleic Acids Res. 2017;45(W1):W365–73. pmid:28521030
- 42. DeLano WL. Pymol: An open-source molecular graphics tool. CCP4 Newsl Protein Crystallogr. 2002;40(1):82–92.