Figures
Abstract
Leptospira borgpetersenii is a causative agent of human leptospirosis, with the potential to lead to severe disease manifestations. The first published analysis of L. borgpetersenii, performed on two strains of serovar Hardjo (L550 and JB197), suggested that the L. borgpetersenii genome is in the process of genome decay with functional consequences leading to a more obligately host-dependent life cycle. Whole genome analysis has only been carried out on few strains of L. borgpetersenii, with limited closed genomes and comprehensive analyses. Herein we report the complete, circularized genomes of seven non-typeable Leptospira borgpetersenii isolates from human leptospirosis patients in Sri Lanka. These isolates (all identified as strain ST144) were found to be nearly identical by whole genome analysis; serotyping with serogroup-specific reference antisera was unreactive, suggesting that these are members of a novel serogroup/serovar. We show that the L. borgpetersenii isolated from humans in Sri Lanka show less genomic decay than previously reported isolates: fewer pseudogenes (N = 141) and insertion sequence (IS) elements (N = 46) compared to N = 248, N = 270, and N = 400 pseudogenes, and N = 121 and N = 116 IS elements in other published L. borgpetersenii Hardjo genomes (strains L550, JB197 and TC112). Compared to previously published L. borgpetersenii whole genome analyses showing two or three VM proteins in L. borgpetersenii isolates from cattle, rats and humans, we found that all of the human L. borgpetersenii isolates from Sri Lanka, including previously reported serovar Piyasena, have four encoded VM proteins, one ortholog of L. interrogans Copenhageni LIC12339 (LA1402) and three orthologs of LIC12844 (LA0589). Our findings of fewer pseudogenes, IS elements, and expansion of the LIC12844 homologs of the PF07598 family in these human isolates suggests that this newly identified L. borgpetersenii serovar from Sri Lanka has unique pathogenicity. Comparative genome analysis and experimental studies of these L. borgpetersenii isolates offer deeper insights into the molecular and cellular mechanisms of leptospirosis pathogenesis.
Author summary
Leptospirosis is an emerging bacterial zoonosis worldwide. Leptospira borgpetersenii is a causative agent of human leptospirosis in some agricultural contexts. We address here the relatively neglected comparative genome analysis of L. borgpetersenii. We show that L. borgpetersenii isolated from humans in Sri Lanka have less genome reduction compared to available cattle isolates and have novel virulence characteristics compared to isolates from other animals including cattle and rats.
Citation: Senavirathna I, Jayasundara D, Warnasekara J, Agampodi S, Putz EJ, Nally JE, et al. (2026) Genomic analysis of human-infecting Leptospira borgpetersenii isolates in Sri Lanka: Expanded PF07598 gene family repertoire and less genome reduction than bovine isolates. PLoS Negl Trop Dis 20(3): e0012540. https://doi.org/10.1371/journal.pntd.0012540
Editor: Ana LTO Nascimento, Instituto Butantan, BRAZIL
Received: September 16, 2024; Accepted: March 19, 2026; Published: March 27, 2026
Copyright: © 2026 Senavirathna et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All available are available publicly, listed in GenBank under the following accession numbers: CP072630:CP072631(https://www.ncbi.nlm.nih.gov/nuccore/?term=CP072630:CP072631[accn]) CP072628:CP072629(https://www.ncbi.nlm.nih.gov/nuccore/?term=CP072628:CP072629[accn]) CP072626:CP072627(https://www.ncbi.nlm.nih.gov/nuccore/?term=CP072626:CP072627[accn]) CP072624:CP072625(https://www.ncbi.nlm.nih.gov/nuccore/?term=CP072624:CP072625[accn]) CP072622:CP072623(https://www.ncbi.nlm.nih.gov/nuccore/?term=CP072622:CP072623[accn]) CP072620:CP072621(https://www.ncbi.nlm.nih.gov/nuccore/?term=CP072620:CP072621[accn]) CP072618:CP072619(https://www.ncbi.nlm.nih.gov/nuccore/?term=CP072618:CP072619[accn]).
Funding: This work was supported by the US Public Health Service through National Institutes of Health grants R01AI108276 (JMV) and U19AI115658 (JMV, SA), and by the Americas Foundation (JMV).The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Funders provided partial salary support to the authors of this paper.
Competing interests: I have read the journal’s policy and the authors of this manuscript have the following competing interests: Some of the work reported here has been filed in patent applications by Yale University. JMV and spouse have an equity interest in Luna Bioscience, Inc., which may have a future interest in licensing this work. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Introduction
Leptospirosis, a globally important but neglected bacterial zoonosis [1–6], is caused by spirochetes of the genus Leptospira, and is an emerging disease worldwide. Leptospirosis is conservatively estimated to affect approximately 1 million people annually resulting in ~60,000 deaths per year [2,4] with a further estimated 2.9 million Disability Adjusted Life Years (DALYs), which is on par with cholera, typhoid and dengue fever [1,4,7–9]. The estimated number of cases of leptospirosis in humans exceeds an average of 500,000 per year, and the case fatality can be as high as 20% [2,4,6]. Leptospirosis incidence is strongly predicted to increase over coming years due to environmental changes and extreme climate events [7,10–14]. Identification and characterization of novel Leptospira species, which were discovered recently in both pathogen and intermediate lineages [3,15], are critical for developing novel diagnostic tools for early detection of the disease, for making timely therapeutic decisions [10,16–19], and for vaccine development [20,21].
Whole genome sequencing (WGS) has revolutionized in-depth understanding of infection and pathogenesis of leptospirosis at a molecular level [3,22–24]. Whole genome analysis of new Leptospira isolates from different geographic locations has already advanced our understanding of pathogenic mechanisms [25], which will facilitate the development of better diagnostic and treatment options [3,20,26]. WGS has also become a powerful tool for bacterial strain classification and epidemiological typing [5] [27, 28]. Leptospiral genome sequences published to date include at least 966 Leptospira sequences with most sequences (49%) belonging to L interrogans, followed by L. borgpetersenii (7%), L. santarosai (6%), and L. kirschneri (5%). The size of these genomes varies from 3.9 to 4.6 Mb [7] and this list continues to grow [3,22].
The first whole genome sequence analysis of Leptospira—that of L. interrogans serovar Lai—was published in 2003 [29], followed in 2006 with the whole genome sequence analysis of a bovine strain of L. borgpetersenii published by Bulach et al. [30]. A recent study published in 2018 reported the genome of L. borgpetersenii strain 4E, a highly virulent isolate obtained from Mus musculus in southern Brazil [10]. The above-referenced studies identified a total of 3,469 coding DNA sequences (CDSs), 37 transfer-RNAs (tRNAs), 4 ribosomal RNAs (rRNAs), one transfer-messenger RNA (tmRNA) and five riboswitch loci in L. borgpetersenii. A fully closed complete genome of L. borgpetersenii was first reported for the genome of the laboratory-maintained reference strain L. borgpetersenii serogroup Sejroe serovar Ceylonica strain Piyasena isolated in 1964 (from a male patient in Colombo, Sri Lanka). Additionally, the complete genome sequences of four bovine isolates of L. borgpetersenii serovar Hardjo designated strains TC112, TC147, TC129, and TC273 were reported to have 3,345-3,495 coding sequences and 397–416 pseudo genes [12].
Recently, the PF07598 gene family that encodes the Virulence Modifying Proteins (VM) was reported to encode a large family of secreted leptospiral exotoxins that may contribute to the pathogenesis of leptospirosis [25]. While there are 12–15 paralogous VM proteins in L. interrogans, only two VM protein-encoding genes have been reported in non- interrrogans species [31].
In the present study, we performed whole-genome sequencing, de novo assembly, structural, and functional annotation of seven pathogenic L. borgpetersenii isolates recovered from humans in Sri Lanka. Our analysis tested the proposed genome reduction hypothesis and compared these isolates with others isolated from different mammalian hosts for genomic content of the PF07598 gene family-encoded [32]s [30].
Methods
Leptospira strains and genomic DNA extraction
Isolates for this work were obtained from a large study conducted among febrile patients who were clinically classified as ‘probable’ leptospirosis cases, five from the Teaching Hospital Anuradhapura (FMAS_AP2, FMAS_AP3, FMAS_AP4, FMAS_AP8 and FMAS_AP9), and two from the General Hospital Polonnaruwa (FMAS_PN1, FMAS_ PN4) [8,16,33,34]. Details of patient selection and culture isolation are reported in the original papers [8,16,33,34], including lack of reactivity with reference serogrouping antisera.
Serotyping of newly isolated Leptospira strains was done at the Pasteur Institute, France, as published [35]. Specifically, the microscopic agglutination test using a standard battery of rabbit antisera raised against the main 24 reference serogroups was used for serotyping [35]. These strains were newly isolated from symptomatic patients and had few passages before genomic DNA extraction for WGS [35]. The organisms were first grown in semisolid EMJH media before being sub-cultured in liquid EMJH medium. Cells were harvested in log phase growth, followed by DNA extraction carried out using the gram-negative bacteria protocol from Qiagen's DNeasy Blood & Tissue Kit including an RNase clean-up step after proteinase K buffer ATL incubation [3]. Extracted DNA was quantified using a Qubit 4 fluorometer (ThermoFisher).
Sample preparation
Genomic DNA (gDNA) size and integrity was assessed by pulsed field gel electrophoresis (PFGE) method before beginning library preparation. Multiplexed PacBio Single Molecule Real-Time (SMRT) bell libraries were prepared from extracted high quality gDNA using the SMRTbell Express Template Prep Kit 2.0. To prepare 15-kb libraries, 1µg of genomic DNA was sheared using G-TUBE from Covaris Woburn, MA, USA and AMPure PB Beads (Pacific Bioscience) were used for the concentration of DNA. The DNA was finally repaired by overnight ligation to the overhanging barcoded 8A adapter (Pacific Bioscience). Blue Pippin size selection (Sage Science, Beverly, Massachusetts, USA) of 4 kb or more was performed according to the manufacturer's instructions. The SMRTbell template was evaluated using a calculator from RS Remote (Pacific Biosciences).
Whole-genome sequencing and assembly
As published [35], SMRTbell libraries were generated and sequenced on a PacBio RS II system (Maryland Genomics, Institute for Genome Sciences, University of Maryland School of Medicine). A minimum of 800X read coverage was obtained for all seven isolates. Raw read data were preprocessed using an in-house developed quality control pipeline. The raw sequencing data (S3 File) was collected in FASTQ format using the PacBio sequencing technique. Prior to assembly, the data was quality-checked using the FastQC program to determine read quality, including the distribution of read lengths (S3 File).[35] After the initial quality check, error correction was carried out, followed by trimming to remove low-quality elements and adapter sequences. The resulting trimmed reads were then used for de novo assembly with Canu 2.1 and were circularized using Circlator [17] (http://sangerpathogens.github.io/circlator). Two overlapping contigs were recovered in all isolates after completion of the described workflow. The annotation was completed in all seven fully closed genomes using the NCBI Prokaryotic Genome Annotation Pipeline with default settings.
Functional annotation and analysis
Genome-level functional annotation was performed using Prokka v1.13.3 (https://github.com/tseemann/prokka) [36] and the RAST (Rapid Annotation using Subsystem Technology) server for our seven closed genomes. CRISPR and Cas regions were predicted by the CRISPR Cas-finder tool (https://crisprcas.i2bc.paris-saclay.fr/CrisprCasFinder/Index). CRISPR and Cas regions were extracted from annotated data submitted to the RAST server [37]. The Virulence Factor of Bacterial Pathogen Database (VFDB) was used to predict virulence factors in these Leptospira genomes [38].
For the virulence factors Loa22, LigA (not found), LigB, LigC, LipL32, and Lsa21, a manual search was conducted. The protein and nucleotide sequences of these virulence factors were downloaded, and this information was utilized for BLAST searches and RAST analysis. Mobile elements of the seven isolates were identified by screening using tools at http://www.genomicepidemiology.org/services. BLAST search was performed against the IS finder database for the seven genomes at https://isfinder.biotoul.fr [39]. VM proteins were identified by performing a BLAST search against isolates with known VM proteins.
In silico PubMLST, CG View and Multiple genome alignment
Conventional Multi-locus Sequence Typing (MLST) for the seven isolates against the PubMLST database was performed using seven standardized housekeeping genes https://pubmlst.org/leptospira/ [40]. Fully circularized annotated genomes obtained from the RAST server were uploaded to the CGView server [41], an interactive comparative genomics tool for circular genomes. For identification and alignment of conserved genomic DNA in the presence of rearrangements and horizontal gene transfer, the software package Mauve (https://darlinglab.org/mauve/mauve.html) was used [42,43]. For multiple alignments, three of our isolates (FMAS_AP8, FMAS_AP9 and FMAS_PN1), strain Piyasena strain JB 197, and L550 were used.
Methods to identify PF07598 (VM) protein homologs in animal-infecting strains of L. borgpetersenii
Several different approaches were used to identify which (or whether) any of the four Sri Lanka isolate VM homologs (orthologs, paralogs) were present in different strains of L. borgpetersenii obtained from animals including serogroup Sejroe serovar Hardjo strains HB203, TC112, TC129, TC147, TC273, serogroup Ballum serovar Arborea strain LR131 [44], and serovar Tarassovi strain MN900 [31]. In the first approach, the Hidden Markov Model (HMM) for the Conserved Protein Domain Family DUF1561 was obtained from NCBI (https://www.ncbi.nlm.nih.gov/). Currently, there is only one Pfam, PF07598, associated with this Domain of Unknown Function (DUF) (C.f. Pfam: Family: DUF1561 (PF07598- IPR011455) (xfam.org)). The putative protein sequences for each genome were obtained from their respective NCBI annotations. The program hmmscan (http://hmmer.org/) was used to search all the annotated proteins against the DUF1561.hmm model. The hmmscan options “-E 0.001 --domE 0.001” were specified for the searches. The hmmscan reported three proteins meeting these criteria in HB203, TC112, TC129, TC147, and TC273 genomes, four proteins meeting these criteria in the LR131 strain, and two proteins meeting these criteria in the MN900 strain. The second and third search approaches did not use NCBI protein annotations. This was done to eliminate the possibility that a homolog could have been missed due to an incorrect or missing protein annotation. For the second approach, a liberal method of searching the translations from every ORF over 50 bp in all six reading frames was utilized. These translations were searched against the DUF1516 (IPR011455) HMM as described for the NCBI annotations. In the third method, all four of the Sri Lanka protein sequences were compared by tblastn (default parameters) to the nucleotide sequence of the genomes of all six other L. borgpetersenii strains. Any hits with a bitscore > 50 was considered putative positive output. This analysis also confirmed exactly three VM regions in each of the HB203, TC112, TC129, TC147, and TC273 genomes, and four regions in the LR131 strain, and two regions in the MN900 strain.
Results
The GC content of the seven isolates ranged from 39.36%-39.54% (Table 1). Theree were an estimated 3,368–3,521 coding regions (CDS) predicted for all 7 isolates. FMAS_AP8 and FMAS_AP9 had same number of CDS regions (3,521) while FMAS_AP4 had the lowest number of coding sequences. According to the NCBI annotation, proteins with functional assignment ranged from 3,226–3,380 (Table 1) while the number of hypothetical proteins predicted in the strains ranged from 136-146. FMAS_ PN4 had the lowest number of hypothetical proteins. Two different genomic types were clearly observed based on the coding sequences. FMAS_AP2, FMAS_AP3, and FMAS_AP4 (Group 1) contained an average of approximately 3,370 protein coding sequences. FMAS_PN1, FMAS_PN4, FMAS_AP8, and FMAS_AP9 (Group 2) contained approximately 3,520 protein coding sequences, an increase of about 4.5%. The average protein coding sequences for strains L550, JB197, 56604, and TC112 were approximately 3,280, representing a 2.7% reduction compared to Group 1 and a 7.3% reduction compared to Group 2. Thirty-seven tRNAs were identified in all seven isolates with the exception of FMAS_AP4 in which only 36 tRNAs were observed. RAST server-based subsystem analysis identified 226 subsystems in all the strains except in FMAS_AP4 which had only 225. Based on the RAST analysis, CDSs involved in amino acid biosynthesis appeared to be the most abundant subsystem in all strains. The FMAS_AP2 (170) had the highest number of predicted subsystems whereas, strain FMAS_AP4 (168) was predicted to have the least number of subsystems. The subsystem distribution of predicted CDSs in each of the strains is shown in Fig 1.
The ST144 MLST profile and CRISPRs and Cas regions predicted by the CRISPR Cas-finder tool and two Crisper-Cas systems were identified in all seven isolates. The circular representation of the seven genomes (CG view) is given in Fig 2 and the arrangement of the CRISPR system given in Fig 3.
Several putative virulence factors were identified in these Leptospira genomes using the VFDB database. Sixteen virulence factors were identified in each of the seven isolates (Table 2). The virulence factors required for host cell interactions were identified in all seven strains, including Loa22, LigB, LigC, and LipL32; Lsa21 was not found (S2 File). Five main IS elements were identified in seven isolates: SLbp8, ISLbp4, ISLbp6, IS1533 and ISLbp5 (Table 3).
To visualize the general organization of the genome and discover potential genome rearrangements among strains, conserved regions were visualized using the Mauve genome aligner (Fig 4).
Large Collinear Blocks (LCBs) were identified. Colored rectangular and variant-specific regions (genomic islands, GI) or white region spaces within or between LCBs were identified in both chromosomes of all strains. Chromosome II was highly conserved in all strains (S1 File). Dimensions and the location of the central LCB on chromosome I was significantly different in our isolates compared to strains Piyasena, JB197 and L550. Genomic islands and major genome rearrangements, insertion sequence (IS) elements are often located at the intersection of chromosomal rearrangements. The total number of IS elements identified in these strains was 46. The number of pseudogenes identified ranged from 136-146. All seven isolates had four genes encoding VM proteins, 3 copies of the LIC12844 (LA0769) homolog and 1 copy of the LIC12339 (LA1402) homolog (Table 4).
We found that L. borgpeterseni has fewer VM proteins than L. interrogans, as exemplified by comparison to the L. interrogans serovar Copenhageni strain Fiocruz L1-130 reference genome [45], as reported in detailed comparisons of L. borgpeterseni vs. L. interrogans [32]. The PF07598 gene family encodes a newly identified leptospiral virulence factor family, the Virulence Modifying (VM) proteins encoded by the PF07598 gene family. There are four encoded VM proteins, one ortholog of LIC12339, and three orthologs of LIC12844. The sequence similarity ranged from 68.34% to 71.24%. The VM protein coding region predicts protein sizes for LIC12884 homologs of 638, 632, 629, and for the LIC12339 homolog of 536 amino acids, respectively (Table 4).
We investigated whether the new Sri Lankan isolates shared VM homologs with strains of L. borgpetersenii isolated from animal hosts, including bovine isolates of serovar Hardjo strains HB203, TC112, TC129, TC147, TC273, a rodent isolate of serogroup Ballum serovar Arborea strain LR131 [44], and a bovine isolate of serovar Tarassovi strain MN900. Three different methods were utilized to identify VM protein homologs, including an hmmscan search of annotated protein sequences of the PF07598 (IPR011455) protein family, searching translations from all ORFs over 50 bp against the DUF1561 HMM, and using tblastn to search the genomes’ nucleotide sequences for any high scoring pairs returned from querying with the known VM protein sequences, Pfam currently uses 16 species and 83 protein sequences to define DUF1561 (IPR011455). The hmmscan reported three proteins meeting these criteria in HB203, TC112, TC129, TC147, and TC273 genomes, four proteins meeting these criteria in the LR131 strain, and two proteins meeting these criteria in the MN900 strain.
Compared to the four VM proteins present in the Sri Lankan strains, collectively, all methods indicate the presence of three VM homologs in the serovar Hardjo strains (HB203, TC112, TC129, TC147, and TC273), four VM homologs in the rodent serogroup Ballum serovar Arborea strain LR131, and only two VM homologs in the serovar Tarassovi strain MN900. Based on the tblastn analysis (using default parameters), the annotations associated with these regions within each strain revealed the same three annotations identified by the HMM approach, detailed above. Taken together, these findings lead us to conclude that there are three coding regions homologous to the four Sri Lankan proteins in the HB203, TC112, TC129, TC147, and TC273 genomes, four regions in the LR131 strain, and two regions in the MN900 strain.
Discussion
Here we present whole genome analysis of 7 new isolates of novel, non-serotypable, Leptospira borgpetersenii obtained from humans in Sri Lanka [35]. The main findings are that 1) these isolates from humans, are essentially genomically identical; 2) the level of genome reduction appears to be substantially less than originally proposed for reference L. borgpetersenii Hardjo-Bovis strains [30], and 3) the genomic analysis reflects emergence of a predominant leptospiral strain (ST144), with Sri Lankan bovines as the likely source of human infection; 4) the strains analyzed here have a 3-fold duplication of one VM protein (the homolog of LIC12844, which contrasts with other strains of L. borgpetersenii, which have either 1 or 2 copies e of the LIC12844 homolog), and a single copy of the VM protein homology of LIC12339.
An increased incidence of human leptospirosis due to L. borgpetersenii is increasingly reported worldwide. In a study carried out in the Caribbean archipelago of Guadeloupe during an outbreak, the isolates showed the emergence of the Ballum serogroup (L. borgpetersenii), serogroup Icterohaemorrhagiae (L. interrogans) [8]. Another report from Malaysia identified L. borgpetersenii serovar Bataviae transmitted by two dominant rat species, Rattus rattus and R. norvegicus [2]. L. borgpetersenii has been reported to cause severe human disease [46–48].
Previous studies have reported genome reduction in L. borgpetersenii serovar Hardjo strains L550, JB197 and L. borgpetersenii serogroup Ballum serovar Ballum strain 56604 [5,9]. These studies drew a general conclusion that L. borgpetersenii has undergone IS-mediated genome shrinkage potentially due to inter-host transmission (not requiring environmental mediated transmission). IS elements are also thought to be important features of the L. borgpetersenii genome and mechanisms of genomic decay, contributing to multiple chromosomal rearrangements and pseudogene formation. The total number of coding sequences reported in the three strains were serovar Ballum 56604 (N = 2618), serovar Hardjo strain L550 (N = 2832), and serovar Hardjo strain JB197 (N = 2770) [5,9].
All seven isolates reported in this study belong to MLST sequence type 144. In the PubMLST database, seven isolates recovered both locally and globally have already been listed under this ST. The first one was the L. borgpetersenii serovar Ceylonica isolated from a human in 1964 from Sri Lanka. Other local isolates include human samples from Gampaha, Giradurukotte, Bogammana, and a rodent isolate from a black rat in Sri Lanka [32]. The other two are global isolates each from Thailand and Laos [32]. Previous isolates from the North Western Province (human) and the Uva Province (rat) in Sri Lanka were reported to have the same ST 144, which belongs to the serogroup Javanica [49]. Since all seven isolates from the present study, isolated from the dry zone, belong to the same ST 144, it may represent the predominant sequence type in that particular geographical region. cgMLST of these seven isolates revealed their clonal group as 267. However, cgMLST data for the previous seven isolates aren’t available for more comprehensive analysis. According to the Mauve genome alignment, strain Piyasena (a previous Sri Lankan isolate) is significantly different from our isolates. FMAS_AP8 and JB 197 had the significant number of conserved regions. The pathogenesis of L. borgpetereseni strain Hardjo JB197 is an anomaly [50]; as a laboratory isolate obtained from cattle, this strain is fairly unique for causing acute, lethal disease in hamsters; its chromosome has significant rearrangements compared to other Hardjo-Bovis strains. However, we have not found this rearrangement of Chromosome II in our L. borgpetereseni isolates.
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-associated protein systems are found in bacterial genomes, which are important to generate adaptive immunity against invading exogenous genetic elements such as plasmid and phage infection [33,34]. The Cas gene clusters are quite diverse, and they are frequently encoded by a diverse family of proteins with a wide range of functional domains involved in nucleic acid interaction. Two main classes with six types and numerous subtypes were identified in CRISPR-Cas systems based on protein families and features of the architecture of Cas loci [35]. In pathogenic and intermediate Leptospira, three subtypes subtype I-B, subtype I-C and subtype I-E were recognized. CRISPR-Cas systems are not present in non-infectious, saprophytic species [35].
The CRISPR Cas Finder tool found two CRISPR-Cas systems in all seven isolates of this study. The CRISPR-Cas systems identified in these Sri Lankan isolates closely resemble the subtype I-E with CRISPR array that was previously reported in L. borgpetersenii strain 56604. It was also identified in other group 1 species including L. alexanderi, L. alstoni and L. mayottensis, L. noguchii, L. santarosai, L. weilii and L. fainei [35]. L. borgpetersenii serovar Ballum is reported to contain three crisper repeats GGTTCAACCCCACGCATGTGGGGAATAGGCT between nt 2938442–2938534 [34]. In JB197 and L550 these repeats were not detected [34]. In the seven L. borgpetersenii isolates reported here, 6–7 repeats were detected, showing the variability of our strains compared to other reported data. A recent study conducted in Malaysia has shown the presence of 10–16 loci with 1–13 spacers in the CRISPR arrays in six L. interrogans strains [13]. However, the same study suggested further work was needed before making inferences regarding this observation’s relevance to pathogenicity and environmental adaptation of pathogenic Leptospira. [12]. The RAST analysis of the genomes of L. borgpetersenii strain HP364, L. weilii strain SC295, and L. interrogans strain HP358 revealed a similar number of coding sequences (CDSs) and subsystems. For instance, L. borgpetersenii strain HP364 has 4,234 coding sequences and 224 subsystems, while L. weilii strain SC295 has 4,578 coding sequences and 225 subsystems. In our isolates, 4,154 coding sequences and 226 subsystems were identified. These subsystems include categories such as amino acids and derivatives, cofactors, vitamins, prosthetic groups, pigments, protein metabolism, motility and chemotaxis, and carbohydrates. Our findings align with previously reported data, where the majority of subsystems are associated with amino acids and their derivatives. Additionally, the subsystem coverage represents 16% in our isolates, as well as in Leptospira borgpetersenii strains JB197, L550, and 56604 [51].
The protein secretory systems that export proteins from the cytoplasm in L. borpetersenii were found to be Type I and Type II [5]. The Type VI Secretion System (T6SS), a protein nanomachine found in some Gram-negative bacteria, translocates some effector proteins into neighboring cells, playing a critical role in bacterial competition and host interactions. Orthologs of key T6SS components, including ClpV, TssB, TssC, TssD, and TssE, are widely conserved across various bacterial species. ClpV, an essential ATPase, is responsible for recycling the contracted sheath after effector delivery, ensuring the efficiency and functionality of the system. Because we identified only ClpV orthologs in these ex-human L. borgpetersenii isolates, we cannot conclude that there is a functional Type VI Secretion System in L. borgpetersenii [3,51]. Proteins identified as virulence factors are predicted to function in adherence, anti-phagocytosis, chemotaxis, mortality (invasion), enzyme, lipid and fatty acid metabolism and stress adaptation. These proteins have been previously reported as virulence factors in other pathogenic Leptospira species. However, the number of virulence factors identified in these seven L. borgpetersenii isolates was comparatively low compared to serovars and strains of Leptospira interrogans [7,36]. A limitation of our approach is that we have not experimentally demonstrated the presence of a secretion system, similar to the limitation of our previous comparative genomic analysis publication [3].
Mobile elements (IS Elements) insertion can interrupt coding sequences and lead to pseudogene formation in Leptospira [4,8]. The number of IS elements varies not just within species but even within serovars. In L. borgpetersenii, a total of approximately 54 ISs scattered among the chromosomes of strain 56604 have been identified. This includes 31 copies of IS1533, 15 copies of ISLin1, 4 copies of IS1502, 2 copies of IS1500, 1 copy of IS1501, and 1 copy of ISLin2 [9]. Strains L550 and JB197 have been reported to have 121 and 116 IS elements, respectively. In contrast, we found a lower number of IS elements in our isolates, N = 46, a comparatively low number [37]. In parallel to this observation, a relatively low number of pseudogenes were observed in human-obtained Sri Lankan isolates (136–146) compared to published genomes of cattle-obtained Hardjo-bovis L550, JB197, 56604, and TC112: N = 248, 270, 231, and 400 pseudogenes respectively [12,38]. This could be attributed to relatively high number of mobile elements reported in those three strains which may be related to host-pathogen or pathogen-environment interactions. Five types of mobile elements (ISLbp4) belonging to the IS50 family were identified in the seven Sri Lankan isolates using a web-based mobile element finder. Strains L550 and JB197 were found to have 9 mobile element types belonging to different IS families. JB197 was isolated from cattle at slaughterhouses in the United States and the L550 strain was isolated from a human with leptospirosis acquired zoonotically from cattle in Australia. Strain 56604 of serovar Ballum was isolated from a rat in the west region of China. While genome reduction was observed in the above strains, which were probably having exclusive host-to-host transmission, our isolates from human cases, among whom the transmission was probably environment-mediated, had less genome reduction. This observation needs to be confirmed by further studies simultaneously involving animals, humans, and the environment [39].
With regard to a recently discovered and characterized virulence gene family, the Virulence Modifying (VM) proteins encoded by the PF07598 gene family [3,26], we found that only two PF07598 (VM protein-encoding genes) were present in the presently reported L. borgpeterseni isolates obtained from humans. We found that all isolates had three copies of the orthologous PF07598 gene encoding L. interrogans LIC_12844, and one copy of the orthologous PF07598 gene gene encoding L. interrogans LIC_12339, both of which have ~70% amino acid identify with between L. interrogans and L. borgpetersenii. This is consistent with published analyses [25,26], including the specific comparison of PF07598-encoded VM proteins in L. borgpetersenii vs. L. interrogans [32]. The amplification of VM proteins in all seven isolates isolated from humans compared with other animal-obtained L. borgpeterseni isolates which have 2–3 PF07598 genes may be relevant to mechanisms of human infectivity and pathogenesis. PF07598 paralogs are found only in group I pathogens; L. interrogans serovars have 12–15 paralogs, and other group I pathogenns, including L. borgptersenii, have only 2 paralogs, with the LIC12844 homolog found with 2, 3 or 4 gene copies. The genes encoding VM proteins such as the LIC12339 (LA1402) homolog and the LIC12844 (LA_0769) homolog have been shown to be upregulated during infection [26]. The strains JB197, L550, 203, and L49 had identical VM orthologs in a 1:2 ratio, viz. LIC12339 (LA1402) to LIC12844 (LA_0769). The strain L. borgpetersenii sv. Ceylonica strain Piyasena recovered from human subjects in Sri Lanka had similar number of VM proteins as our seven isolates. Further analysis of additional closed leptospiral genomes will continue to clarify the primary sequences (and evolutionary relationships and drivers) of the PF07598 gen family.
While homologs of the VM proteins found in the Sri Lankan isolates were identified in multiple animal isolates of L. borgpetersenii, they were not consistent across serovars with only three ortho/paralogs found in the serovar Hardjo strains (HB203, TC112, TC129, TC147, and TC273), four homologs in the serogroup Ballum serovar Arborea strain LR131 and only two in the MN900 serovar Tarassovi strain. Further, while the number of ortho/paralogs may vary between strains, expression patterns of VM proteins may also vary by strain and between environmental conditions. For instance, recent analysis of the transcriptome of HB203 (causes chronic disease in the hamster model of leptospirosis) and JB197 serovar (causes severe acute) Hardjo strains cultured at 29°C and 37°C, shows that two of the three VM Hardjo homologs were differentially expressed between strains at both 29°C and 37°C. None of the VM proteins were differentially expressed at the transcriptomic level within strains between temperatures under the conditions tested, and ex vivo analysis or changes in the NaCl concentration (mimicking in vivo conditions) may be required to see upregulation [41] [26, 52, 53]. It is notable that between strains, VM expression was higher in the severe disease causing JB197 strain compared to the chronic HB203, which broadly suggests VM gene expression may be associated with acute disease presentation in the hamster. In data sets looking at the highly similar HB203, TC129, and TC273 strains, there is also evidence of strain-to-strain variation for VM proteins [31,50]. These results emphasize the need to further characterize expression of these unique proteins and their role in promoting virulence of pathogenic leptospires. Isolating Leptospira from clinical samples is challenging, as the isolation process can take several months. Cultures are maintained at 28–30°C in darkness and monitored weekly using darkfield microscopy for up to 13 weeks before discarding [54]. All genomes included in this study are fully circularized. Therefore, the number of L. borgpetersenii included in this study is relatively large. Further studies are necessary to validate these findings in a larger and more diverse sample, both geographically and across host species. Additionally, the influence of genetic variability under different environmental conditions and among various animal hosts warrants further exploration. Virulence-modifying protein (VM) is known to play a critical role in pathogenesis [35]. However, the current study does not focus on its impact on clinical phenotypes and disease severity.
Genome decay has been observed among bovine isolates, as reported [30]. Investigating the influence of environmental factors on this decay was beyond the scope of this study and requires further research. These data provide some indication that there may be less genome reduction and a larger PF07598 gene family in human-infecting L. borgpetersenii strains, independent of serovar. While our study uses whole-genome sequencing and bioinformatics to identify virulence genes, we acknowledge that functional validation of these findings was beyond the scope of this work. Experimental studies to confirm the role of these proteins or genetic factors in Leptospira's life cycle and pathogenicity are needed for translating these genomic insights into functional understanding. The lower number of pseudogenes and the increased presence of VM proteins hold potential for advancing diagnostics, vaccines, and treatments. However, these applications need to be validated through targeted studies. Future research focusing on functional studies and translational approaches will be crucial to fully understand how these findings can contribute to improved leptospirosis management and control strategies in endemic areas.
Conclusion
We isolated seven nearly identical L. borgpetersenii strains from humans with acute febrile disease over a three-year period. Based on seven isolates from dry zone analyzed herein, our work suggests that a single Sequence Type, ST144, represents the dominant strain causing the human infections in the dry zone of Sri Lanka during this period. Genome reduction, described for L. borgpetersenii Hardjo strains L550, JB197 and L. borgpetersenii serovar Ballum strain 56604, was observed to a lesser degree in these seven Sri Lankan human isolates. Mauve alignment indicates the presence of conserved regions and genome rearrangement within our isolates. Duplication of a VM protein-encoding gene in human-infecting L. borgpetersenii in Sri Lanka may contribute to adaptive mechanisms for survival in the environment or reservoir hosts leading to human infection and possibly contribute to disease pathogenesis.
Supporting information
S1 File. Genomic alignments using Mauve 2 of seven ex-human, Sri Lankan Leptospira borgpetersenii isolates, and comparison with Leptospira borgpetersenii strains L550, JB197, MN 900 and Piyasena.
https://doi.org/10.1371/journal.pntd.0012540.s001
(DOCX)
S2 File. LipL32 and LigB nucleotide and amino acid sequences.
https://doi.org/10.1371/journal.pntd.0012540.s002
(DOCX)
Acknowledgments
We would like to thank Dr. Michael Matthias, Mr. S.K. Senevirathna, and Mr. Milinda Perera for technical assistance, and Mr. Shalka Srimantha and Ms. Chamila Kappagoda for culture maintenance and laboratory support.
References
- 1. Vinetz JM, Chetrick AN, Steimel KE, Devillers CE, Excler JL, Kim JH. Getting leptospirosis onto the lists of neglected tropical diseases. Open Access Government. https://www.openaccessgovernment.org/article/getting-leptospirosis-onto-the-lists-of-neglected-tropical-diseases/177928. 2024.
- 2. Costa F, Hagan JE, Calcagno J, Kane M, Torgerson P, Martinez-Silveira MS, et al. Global Morbidity and Mortality of Leptospirosis: A Systematic Review. PLoS Negl Trop Dis. 2015;9(9):e0003898. pmid:26379143
- 3. Fouts DE, Matthias MA, Adhikarla H, Adler B, Amorim-Santos L, Berg DE, et al. What Makes a Bacterial Species Pathogenic?:Comparative Genomic Analysis of the Genus Leptospira. PLoS Negl Trop Dis. 2016;10(2):e0004403. pmid:26890609
- 4. Torgerson PR, Hagan JE, Costa F, Calcagno J, Kane M, Martinez-Silveira MS, et al. Global Burden of Leptospirosis: Estimated in Terms of Disability Adjusted Life Years. PLoS Negl Trop Dis. 2015;9(10):e0004122. pmid:26431366
- 5. Warnasekara J, Agampodi S. Neglecting the neglected during the COVID-19 pandemic: the case of leptospirosis in Sri Lanka. Epidemiol Health. 2022;44:e2022015. pmid:35038829
- 6. Bharti AR, Nally JE, Ricaldi JN, Matthias MA, Diaz MM, Lovett MA, et al. Leptospirosis: a zoonotic disease of global importance. Lancet Infect Dis. 2003;3(12):757–71. pmid:14652202
- 7. Kim CL, Agampodi S, Marks F, Kim JH, Excler J-L. Mitigating the effects of climate change on human health with vaccines and vaccinations. Front Public Health. 2023;11:1252910. pmid:37900033
- 8. Warnasekara J, Srimantha S, Kappagoda C, Jayasundara D, Senevirathna I, Matthias M, et al. Diagnostic method-based underestimation of leptospirosis in clinical and research settings; an experience from a large prospective study in a high endemic setting. PLoS Negl Trop Dis. 2022;16(4):e0010331. pmid:35377883
- 9. Warnasekara J, Srimantha SM, Agampodi SB. Estimating the burden of leptospirosis: global lessons from Sri Lanka. BMJ Glob Health. 2021;6(10):e006946. pmid:34706880
- 10. Athukorala N, Wickramasinghe S, Yatawara L, Weerakoon K. A Systematic Review of Zoonotic Disease Prevalence in Sri Lanka (2000-2022). Vector Borne Zoonotic Dis. 2024;24(8):453–72. pmid:38775108
- 11. Baharom M, Ahmad N, Hod R, Ja’afar MH, Arsad FS, Tangang F, et al. Environmental and Occupational Factors Associated with Leptospirosis: A Systematic Review. Heliyon. 2023;10(1):e23473. pmid:38173528
- 12. Stark AM, Nohrenberg M, Draper AD, McMahon KE, Hewitt TA, Lomas K. A cluster of leptospirosis cases associated with crocodile workers in the Northern Territory, Australia, 2022. Commun Dis Intell. 2023;47. pmid:37968070
- 13. Rees EM, Lotto Batista M, Kama M, Kucharski AJ, Lau CL, Lowe R. Quantifying the relationship between climatic indicators and leptospirosis incidence in Fiji: A modelling study. PLOS Glob Public Health. 2023;3(10):e0002400. pmid:37819894
- 14. Jayaramu V, Zulkafli Z, De Stercke S, Buytaert W, Rahmat F, Abdul Rahman RZ, et al. Leptospirosis modelling using hydrometeorological indices and random forest machine learning. Int J Biometeorol. 2023;67(3):423–37. pmid:36719482
- 15. Vincent AT, Schiettekatte O, Goarant C, Neela VK, Bernet E, Thibeaux R, et al. Revisiting the taxonomy and evolution of pathogenicity of the genus Leptospira through the prism of genomics. PLoS Negl Trop Dis. 2019;13(5):e0007270. pmid:31120895
- 16. Jayasundara D, Gamage C, Senavirathna I, Warnasekara J, Matthias MA, Vinetz JM, et al. Optimizing the microscopic agglutination test (MAT) panel for the diagnosis of Leptospirosis in a low resource, hyper-endemic setting with varied microgeographic variation in reactivity. PLoS Negl Trop Dis. 2021;15(7):e0009565. pmid:34197457
- 17. Levett PN, Branch SL. Evaluation of two enzyme-linked immunosorbent assay methods for detection of immunoglobulin M antibodies in acute leptospirosis. Am J Trop Med Hyg. 2002;66(6):745–8. pmid:12224584
- 18. Yersin C, Bovet P, Mérien F, Wong T, Panowsky J, Perolat P. Human leptospirosis in the Seychelles (Indian Ocean): a population-based study. Am J Trop Med Hyg. 1998;59(6):933–40. pmid:9886203
- 19. Levett PN. Leptospirosis. Clin Microbiol Rev. 2001;14(2):296–326.
- 20. Maia MAC, Bettin EB, Barbosa LN, de Oliveira NR, Bunde TT, Pedra ACK, et al. Challenges for the development of a universal vaccine against leptospirosis revealed by the evaluation of 22 vaccine candidates. Front Cell Infect Microbiol. 2022;12:940966. pmid:36275031
- 21. Chaurasia R, Salovey A, Guo X, Desir G, Vinetz JM. Vaccination With Leptospira interrogans PF07598 Gene Family-Encoded Virulence Modifying Proteins Protects Mice From Severe Leptospirosis and Reduces Bacterial Load in the Liver and Kidney. Front Cell Infect Microbiol. 2022;12:926994. pmid:35837473
- 22. Giraud-Gatineau A, Nieves C, Harrison LB, Benaroudj N, Veyrier FJ, Picardeau M. Evolutionary insights into the emergence of virulent Leptospira spirochetes. PLoS Pathog. 2024;20(7):e1012161. pmid:39018329
- 23. Picardeau M. Virulence of the zoonotic agent of leptospirosis: still terra incognita? Nat Rev Microbiol. 2017;15(5):297–307. pmid:28260786
- 24. Ko AI, Goarant C, Picardeau M. Leptospira: the dawn of the molecular genetics era for an emerging zoonotic pathogen. Nat Rev Microbiol. 2009;7(10):736–47. pmid:19756012
- 25. Chaurasia R, Marroquin AS, Vinetz JM, Matthias MA. Pathogenic Leptospira Evolved a Unique Gene Family Comprised of Ricin B-Like Lectin Domain-Containing Cytotoxins. Front Microbiol. 2022;13:859680. pmid:35422779
- 26. Lehmann JS, Fouts DE, Haft DH, Cannella AP, Ricaldi JN, Brinkac L, et al. Pathogenomic inference of virulence-associated genes in Leptospira interrogans. PLoS Negl Trop Dis. 2013;7(10):e2468. pmid:24098822
- 27. Guglielmini J, Bourhy P, Schiettekatte O, Zinini F, Brisse S, Picardeau M. Genus-wide Leptospira core genome multilocus sequence typing for strain taxonomy and global surveillance. PLoS Negl Trop Dis. 2019;13(4):e0007374. pmid:31026256
- 28. Grillová L, Picardeau M. Core Genome Multi-locus Sequence Typing Analyses of Leptospira spp. Using the Bacterial Isolate Genome Sequence Database. Methods Mol Biol. 2020;2134:11–21. pmid:32632855
- 29. Ren S-X, Fu G, Jiang X-G, Zeng R, Miao Y-G, Xu H, et al. Unique physiological and pathogenic features of Leptospira interrogans revealed by whole-genome sequencing. Nature. 2003;422(6934):888–93. pmid:12712204
- 30. Bulach DM, Zuerner RL, Wilson P, Seemann T, McGrath A, Cullen PA, et al. Genome reduction in Leptospira borgpetersenii reflects limited transmission potential. Proc Natl Acad Sci U S A. 2006;103(39):14560–5. pmid:16973745
- 31. Putz EJ, Bayles DO, Alt DP, Nally JE. Complete Genome Sequence of Four Strains of Leptospira borgpetersenii serovar Hardjo isolated from Cattle in the Central United States. J Genomics. 2022;10:45–8. pmid:35300048
- 32. Vieira DS, Chaurasia R, Vinetz JM. Comparison of the PF07598-Encoded Virulence-Modifying Proteins of L. interrogans and L. borgpetersenii. Trop Med Infect Dis. 2022;8(1):14. pmid:36668921
- 33. Agampodi S, Warnasekara J, Siribaddana S, Kularatna S, Gamage C, Jayasundara D, et al. Demographic, exposure, clinical, biochemical and diagnostic data of febrile patients recruited for the largest field study on leptospirosis in Sri Lanka. Data Brief. 2022;43:108378. pmid:35770027
- 34. Jayasundara D, Senavirathna I, Warnasekara J, Gamage C, Siribaddana S, Kularatne SAM, et al. Correction: 12 Novel clonal groups of Leptospira infecting humans in multiple contrasting epidemiological contexts in Sri Lanka. PLoS Negl Trop Dis. 2021;15(5):e0009471. pmid:34038414
- 35. Jayasundara D, Senavirathna I, Warnasekara J, Gamage C, Siribaddana S, Kularatne SAM, et al. 12 Novel clonal groups of Leptospira infecting humans in multiple contrasting epidemiological contexts in Sri Lanka. PLoS Negl Trop Dis. 2021;15(3):e0009272. pmid:33735202
- 36. Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30(14):2068–9. pmid:24642063
- 37. Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, et al. The RAST Server: rapid annotations using subsystems technology. BMC Genomics. 2008;9:75. pmid:18261238
- 38. Chen L, Yang J, Yu J, Yao Z, Sun L, Shen Y, et al. VFDB: a reference database for bacterial virulence factors. Nucleic Acids Res. 2005;33(Database issue):D325–8. pmid:15608208
- 39. Siguier P, Perochon J, Lestrade L, Mahillon J, Chandler M. ISfinder: the reference centre for bacterial insertion sequences. Nucleic Acids Res. 2006;34(Database issue):D32–6. pmid:16381877
- 40. Jolley KA, Bray JE, Maiden MCJ. Open-access bacterial population genomics: BIGSdb software, the PubMLST.org website and their applications. Wellcome Open Res. 2018;3:124. pmid:30345391
- 41. Stothard P, Grant JR, Van Domselaar G. Visualizing and comparing circular genomes using the CGView family of tools. Brief Bioinform. 2019;20(4):1576–82. pmid:28968859
- 42. Darling ACE, Mau B, Blattner FR, Perna NT. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004;14(7):1394–403. pmid:15231754
- 43. Darling AE, Treangen TJ, Messeguer X, Perna NT. Analyzing patterns of microbial evolution using the mauve genome alignment system. Methods Mol Biol. 2007;396:135–52. pmid:18025691
- 44.
USDA. WGS of LR131: Leptospira borgpetersenii isolated from a rat on St. John US Virgin Islands. NCBI. https://www.ncbi.nlm.nih.gov/sra?LinkName=biosample_sra&from_uid=413865912025. 2023.
- 45. Nascimento ALTO, Ko AI, Martins EAL, Monteiro-Vitorello CB, Ho PL, Haake DA, et al. Comparative genomics of two Leptospira interrogans serovars reveals novel insights into physiology and pathogenesis. J Bacteriol. 2004;186(7):2164–72. pmid:15028702
- 46. Rajaonarivelo JA, Desmoulin A, Maillard O, Collet L, Baudino F, Jaffar-Bandjee M-C, et al. Clinical manifestations of human leptospirosis: bacteria matter. Front Cell Infect Microbiol. 2023;13:1259599. pmid:37953799
- 47. Desmoulin A, Rajaonarivelo A, Maillard O, Collet L, Jaffar-Bandjee M-C, Moiton M-P, et al. A Comparative Study of Human Leptospirosis between Mayotte and Reunion Islands Highlights Distinct Clinical and Microbial Features Arising from Distinct Inter-Island Bacterial Ecology. Am J Trop Med Hyg. 2024;111(2):237–45. pmid:38955193
- 48.
Mariet J, Picardeau M, Agampodi S. 12 Novel clonal groups of Leptospira infecting humans in multiple contrasting epidemiological contexts in Sri Lanka: journals.plos.org; 2021.
- 49. Karunanayake L, Gamage CD, Gunasekara CP, De Silva S, Izumiya H, Morita M, et al. Multilocus sequence typing reveals diverse known and novel genotypes of Leptospira spp. circulating in Sri Lanka. PLoS Negl Trop Dis. 2020;14(8):e0008573. pmid:32841248
- 50. Putz EJ, Sivasankaran SK, Fernandes LGV, Brunelle B, Lippolis JD, Alt DP, et al. Distinct transcriptional profiles of Leptospira borgpetersenii serovar Hardjo strains JB197 and HB203 cultured at different temperatures. PLoS Negl Trop Dis. 2021;15(4):e0009320. pmid:33826628
- 51. Coulthurst S. The Type VI secretion system: a versatile bacterial weapon. Microbiology (Reading). 2019;165(5):503–15. pmid:30893029
- 52. Matsunaga J, Lo M, Bulach DM, Zuerner RL, Adler B, Haake DA. Response of Leptospira interrogans to physiologic osmolarity: relevance in signaling the environment-to-host transition. Infect Immun. 2007;75(6):2864–74. pmid:17371863
- 53. Matsunaga J, Sanchez Y, Xu X, Haake DA. Osmolarity, a key environmental signal controlling expression of leptospiral proteins LigA and LigB and the extracellular release of LigA. Infect Immun. 2005;73(1):70–8. pmid:15618142
- 54. Musso D, La Scola B. Laboratory diagnosis of leptospirosis: a challenge. J Microbiol Immunol Infect. 2013;46(4):245–52. pmid:23639380