• Loading metrics

Genetic changes associated with the temporal shift in invasive non-typhoidal Salmonella serovars in Bamako Mali

  • Kristin Bornstein,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Visualization, Writing – original draft

    Affiliations Center for Vaccine Development and Institute for Global Health, University of Maryland School of Medicine, Baltimore, MD, United States of America, Department of Epidemiology & Public Health, University of Maryland School of Medicine, Baltimore, MD, United States of America, ICF, Lee Highway, Fairfax

  • Sharon M. Tennant,

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Resources, Supervision, Writing – review & editing

    Affiliations Center for Vaccine Development and Institute for Global Health, University of Maryland School of Medicine, Baltimore, MD, United States of America, Baltimore VA Medical Center Geriatric Research, Education, and Clinical Center, and Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, United States of America

  • Tracy H. Hazen,

    Roles Methodology, Writing – review & editing

    Affiliation Institute for Genome Sciences, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD, United States of America

  • John D. Sorkin,

    Roles Formal analysis, Methodology, Writing – review & editing

    Affiliation Baltimore VA Medical Center Geriatric Research, Education, and Clinical Center, and Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, United States of America

  • Milagritos D. Tapia,

    Roles Conceptualization, Data curation, Investigation, Resources, Writing – review & editing

    Affiliations Center for Vaccine Development and Institute for Global Health, University of Maryland School of Medicine, Baltimore, MD, United States of America, Department of Pediatrics, University of Maryland School of Medicine, Baltimore, MD, United States of America

  • Samba O. Sow,

    Roles Conceptualization, Investigation, Resources

    Affiliation Centre pour le Développement des Vaccins, Mali (CVD-Mali), Bamako, Mali, Africa

  • Uma Onwuchekwa,

    Roles Data curation, Investigation, Resources

    Affiliation Centre pour le Développement des Vaccins, Mali (CVD-Mali), Bamako, Mali, Africa

  • Myron M. Levine ,

    Roles Conceptualization, Funding acquisition, Investigation, Project administration, Resources, Supervision, Writing – review & editing

    ‡ These authors are joint senior authors on this work.

    Affiliations Center for Vaccine Development and Institute for Global Health, University of Maryland School of Medicine, Baltimore, MD, United States of America, Baltimore VA Medical Center Geriatric Research, Education, and Clinical Center, and Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, United States of America, Department of Pediatrics, University of Maryland School of Medicine, Baltimore, MD, United States of America

  • David A. Rasko

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Software, Supervision, Visualization, Writing – review & editing

    ‡ These authors are joint senior authors on this work.

    Affiliation Institute for Genome Sciences, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD, United States of America

Genetic changes associated with the temporal shift in invasive non-typhoidal Salmonella serovars in Bamako Mali

  • Kristin Bornstein, 
  • Sharon M. Tennant, 
  • Tracy H. Hazen, 
  • John D. Sorkin, 
  • Milagritos D. Tapia, 
  • Samba O. Sow, 
  • Uma Onwuchekwa, 
  • Myron M. Levine, 
  • David A. Rasko

This is an uncorrected proof.



Invasive non-typhoidal Salmonella (iNTS) serovars S. Typhimurium and S. Enteritidis are major etiologic agents of invasive bacterial disease among infants and young children in sub-Saharan Africa, including in Mali. Early studies of iNTS serovars in several countries indicated that S. Typhimurium was more prevalent than S. Enteritidis, including in Mali before 2008. We investigated genomic and associated phenotypic changes associated with an increase in the relative proportion of iNTS caused by S. Enteritidis versus S. Typhimurium in Bamako, Mali, during the period 2002–2012.

Methodology/Principal findings

Comparative genomics studies identified homologs of tetracycline resistance and arsenic utilization genes that were associated with the temporal shift of serovars causing iNTS shift, along with several hypothetical proteins. These findings, validated through PCR screening and phenotypic assays, provide initial steps towards characterizing the genomic changes consequent to unknown evolutionary pressures associated with the shift in serovar prevalence.


This work identified a shift to S. Enteritidis from the more classic S. Typhimurium, associated with iNTS in Bamako, Mali, during the period 2002–2012. This type of shift in underlying iNTS pathogens are of great importance to pediatric public health in endemic regions of sub-Saharan Africa. Additionally, this work demonstrates the utility of combining epidemiologic data, whole genome sequencing, and functional characterization in the laboratory to identify and characterize genomic changes in the isolates that may be involved with the observed shift in circulating iNTS agents.

Author summary

Much remains unknown about the mode of transmission of iNTS or the reservoirs of infection. As such, insight into potential selective pressures on underlying serovars bears importance to public health. Longitudinal studies over the years 2002–2012 identified a shift in the proportion of invasive non-typhoidal Salmonella (iNTS) disease caused by S. Typhimurium or S. Enteritidis in Bamako, Mali. Since S. Enteritidis exhibits a higher rate of fatal cases among the pediatric population than S. Typhimurium, it is important to understand what led to the increased proportion of cases from this serovar. This study examined the genetic changes in S. Enteritidis associated with this serovar shift through comparative genomics and laboratory findings. Through these methods, genes related to tetracycline resistance and arsenic catabolism were associated with the shift in serovars. These findings represent preliminary steps in investigating this underlying shift and determining the long-term repercussions of these changes to the epidemiologic profile of iNTS disease.


Salmonella enterica is a bacterial pathogen that includes over 2500 serological variants (serovars) and is estimated to cause more than 1.3 billion cases of clinical illness annually worldwide [1]. Non-typhoidal Salmonella (NTS) serovars such as S. Enteritidis and S. Typhimurium, while mainly limited to gastrointestinal illness in industrialized nations [2,3], have been found to cause severe invasive bacterial disease in sub-Saharan Africa [46] typically unaccompanied by or preceded by diarrhea. Children from 6–12 months of age suffer the greatest incidence of invasive disease, and case fatality rates for S. Enteritidis disease are high (20–28%) in infants and young children in Africa [7]. In part, the differences in clinical presentation of NTS disease in pediatric populations in sub-Saharan Africa versus in industrialized countries may be due to immunosuppression, malnutrition, intensity of previous antigenic exposure, or co-infections, particularly malaria [5,810]. However, there are also differences in the inherent pathogenicity of strains of non-typhoidal Salmonella, in addition to the host and infection-related factors [1116]. For example, ~95% of the invasive S. Typhimurium strains in Africa are multi-locus sequence type 313 (ST313), a genotype unique to that continent that has undergone extensive genomic degradation associated with host-adaptation [11,13] and which is different from the ST19 strains that predominate outside Africa [11,17].

S. Typhimurium has been identified as the leading cause of invasive NTS (iNTS) disease in many countries in sub-Saharan Africa [14], although in some countries S. Enteritidis was the predominant serovar [18]. Systematic surveillance undertaken by the Center for Vaccine Development of Mali (CVD-Mali) and by the Center for Vaccine Development at the Hôpital Gabriel Touré (HGT) in Mali revealed a sizable burden of iNTS disease [19]. This surveillance was initiated in 2002 to quantify the burden of invasive bacterial disease caused by Haemophilus influenzae type b (Hib) and Streptococcus pneumoniae (pneumococcus) and to assess the need for introduction of vaccines to prevent disease caused by those pathogens, which were responsible for most cases of pediatric bacteremia. Although not originally designed to do so, review of the surveillance data identified NTS serovars Typhimurium and Enteritidis as common causes of severe invasive disease. Moreover, after routine infant immunization with Hib and pneumococcal conjugate vaccines was implemented and those two infections were controlled, NTS emerged as the predominant cause of invasive bacterial disease in infants and young children in Mali [19]. The quality and duration of the systematic surveillance at HGT facilitated identification of a shift in iNTS epidemiology around 2008 [12]. An increased relative proportion of iNTS disease due to S. Enteritidis was detected along with a concurrent decrease in the prevalence of S. Typhimurium after 2008 [19]. From the iNTS isolates included in the epidemiologic investigation by Tapia et al. [19], a subset of 42 S. Enteritidis isolates, selected as a stratified sample by year of isolation, were included in a phylogenetic analysis examining global lineages of S. Enteritidis conducted by Feasey et al. [15]. Examination of the genomic phylogeny suggested that the newly circulating S. Enteritidis isolates collected from Mali (2008–2012) are phylogenomically distinct from the 2002–2008 isolates. While periodic Salmonella serovar shifts have been observed elsewhere in the world [20], the genotypic characteristics and environmental pressures that lead to such serovar shifts are not well understood and have not been investigated in detail, especially using whole genome sequencing.

Although data on the characterization of major genomic elements and evolutionary pressures on non-typhoidal Salmonella including S. Enteritidis are limited, studies by Nuccio and Baumler [21,22] have identified genomic signatures associated with increased growth within the host gastrointestinal tract and increased gut inflammation. These signatures are identified with NTS strains associated with gastroenteritis, while the genome of NTS strains associated with invasive disease is thought to be degraded [12]. Feasey et al. [15] examined global lineages of S. Enteritidis, including the isolates in this study, and identified two clades of S. Enteritidis that have become prominent in Africa. Feasey et al. [15] also established that the S. Enteritidis clades exhibit similar host-adapted genomic degradation as seen with S. Typhi, S. Paratyphi A, and S. Typhimurium. None of the previous studies investigated genomic changes in S. Enteritidis within a discrete population, such as the pediatric population of Bamako. Our current study was undertaken to identify and characterize genomic markers in S. Enteritidis isolates that are representative of the clade that emerged as the predominant serovar of iNTS disease in the pediatric population in Bamako, Mali during the time period since systematic surveillance was established. This work demonstrates the utility of combining epidemiological principles, whole genome sequencing, and functional characterization in the laboratory to identify and characterize genomic changes in the isolates that may be responsible for the observed shift in the iNTS serovars.


Sample acquisition

Isolates included in this study were collected by local hospital staff from the blood or spinal fluid of patients during hospital admission at the HGT, as reported by Tapia et al. [19]. The serovar of these isolates was confirmed through multiplex polymerase chain reaction (PCR) assay and agglutination with Salmonella antiserum. The 42 isolates submitted for whole-genome sequencing (S1 Table), as described by Feasey et al. [15] and utilized for the in silico analysis in our study, had their taxonomy verified by culture on selective Salmonella-Shigella agar, positive O (D1) agglutination from cultures grown on Trypticase Soy Agar, and positive H agglutination (with antisera against g and m Phase 1 H antigens) from cultures grown on 0.6% agar swarm medium.

Establishing a temporal breakpoint in the relative abundance of serovars

We examined a phylogenetic tree based on the core genome of the same 42 S. Enteritidis isolates examined by Feasey et al. [15]. The proportion of misclassified isolates was determined for each year and at each branch of the phylogenetic tree, with year of isolation determined by the seasonal trends in weather (June-July) [19]. Isolates were considered misclassified when they were collected from one side of the temporal breakpoint, but appeared in a branch that consisted mainly of isolates from the other side of the breakpoint. The branches of the tree were manually rotated at each node to determine the potential breakpoint year to minimize the number of misclassifications across the entire tree. Each year was tested as a potential breakpoint and the year which resulted in the lowest proportion of misclassifications across the major branches of the tree was ultimately selected. This temporal breakpoint was then compared to the epidemiologic data reported from the HGT [19] and the global phylogenetic analysis performed by Feasey et al. [15].

Bioinformatic genetic analysis

The whole genome sequences of the 42 S. Enteritidis isolates (sequenced by the Wellcome Trust Sanger Institute) were assembled and annotated using the Velvet assembly through the CloVR pipeline, release version 1.0-RC5 [23] at the Institute for Genome Sciences (IGS) at the School of Medicine, University of Maryland, Baltimore. The sequence of each isolate was analyzed for protein encoding genes using a protein-coding gene prediction program, Prodigal (Prokaryotic Dynamic Programming Genefinding Algorithm) [24].

Once all the putative genes had been identified, BLAST Score Ratios (BSRs) [25] were calculated for each putative gene in each examined isolate using Large Scale BLAST Ratio Analysis (LS-BSR) [26]. The protein-coding genes in all of the genomes were predicted using Prodigal and were clustered by >90% similarity using uclust. BSR is calculated as the bit score of a gene detected in a genome divided by the bit score of the gene compared to itself, resulting in a scale normalized from 0 to 1. A BSR of ≥ 0.8 indicated a gene was present in a genome with significant similarity, a BSR ≤ 0.4 indicated a gene was absent, and genes identified with BSR values <0.8 but ≥ 0.4 were considered present with divergent similarity. The predicted protein-coding genes that were present (BSR≥ 0.8) in all isolates were removed from further analysis, as they constituted the conserved core genome, resulting in a list of 1,220 putative genes with variable presence among the included isolates (S1 Table). To identify genes associated with the genetic shift in S. Enteritidis, methods similar to those described by Sahl et al. [26] were used. Briefly, significant differences in the average BSRs before and after the 2008 breakpoint for each gene were identified using Student’s t-tests resulting in a p<0.01 (Welch’s method was used to account for unequal variances). Genes significantly associated with either the pre- or post-2008 time periods were then verified by manual BLAST searches in each genome. This verification was performed to ensure that any differences in BSR were due to a true association with the 2008 cut point, and were not missing from the draft genome assemblies as an artifact of the sequencing and assembly.

Phylogenomic analysis

The genomes of the isolates in this study were compared by whole-genome phylogenomic analysis as previously described [15]. The genomes were aligned using Mugsy [27] and homologous blocks were concatenated using the bx-python toolkit [28]. The columns that contained one or more gaps were removed using Mothur [29]. The concatenated regions from each genome were used to construct a maximum-likelihood phylogeny with 100 bootstrap replicates using RAxML v7.2.8 [30] that was visualized using FigTree v1.4.2 [31]. Isolates that were isolated before or during 2008 are colored black and those post-2008 are colored blue.

Sample size and selection

From surveillance through June 30th, 2012, the isolates from 299 individuals identified to have laboratory-confirmed invasive infections caused by S. Enteritidis were collected. Of the 103 available isolates obtained before or during 2008, 23 were sequenced using the Illumina platform. Nine of these 23 (39%) exhibited one of the putative genes identified as significantly associated with the shift in S. Enteritidis. This gene (centroid_210035_1) possessed the weakest level of statistical significance (p = 0.0009) among the putative genes associated with the serovar shift. Of the 196 isolates obtained from after 2008, 19 were sequenced, with 16 exhibiting the present allele of the same gene (84%). By focusing on this least significant associated gene, we used these prevalence rates to estimate conservative sample size calculations for examining the distribution of this, and more significantly associated genes across other available isolates based on a binomial distribution across two samples. A two-sided sample size analysis indicated that screening 72 additional samples [36 from each group, pre- and post-2008) would provide 80% power by PCR to identify genetic changes across all available isolates. Simple random sampling of all isolates collected before and after the breakpoint was used to select the 36 isolates from each group. This resulted in 12 isolates selected from those collected in 2002–2003, none from 2003–2004, 1 from 2004–2005, 1 from 2005–2006, 4 from 2006–2007, 8 from 2007–2008, 10 from 2008–2009, 9 from 2009–2010, 11 from 2010–2011, and 16 from 2011–2012.

Selection of PCR targets

The manual BLAST searches were performed on each significant putative gene against the pre- and post-2008 isolates to determine where the genes exist on the chromosome in reference to each other. Geneious version 7 [32] was used to visualize the location of significant putative genes in relationship to each other within each isolate and comparatively across other sequenced isolates. Genes that were identified to have homologs with biologically relevant function via BLAST searches, or were co-located with other genes of interest, suggesting these genes may be in a functional unit or operon, were selected as PCR targets. Geneious software was used to confirm that primer target regions were identical across the previously sequenced S. Enteritidis isolates containing the genes of interest.

PCR amplification

NORGEN Biotek Corp. bacterial genomic DNA isolation kits were used to extract the genomic DNA of the 72 non-sequenced isolates selected by simple random sampling and 2 μl of template from each isolate was included in the PCR to screen for each of the six putative genes under their optimized conditions. PCR conditions for each gene were optimized in regard to magnesium concentration and annealing temperature on isolates that had previously been sequenced [15] and verified in silico to contain the genes of interest. Primer mix included 2 μl 10x PCR buffer, 0.4 μl of 10 mM dNTPs, and 0.2 μl of Invitrogen Taq DNA polymerase without Mg, and 1 μl of 10 mM forward and reverse primer relevant to each targeted centroid (final volume, 18 μl). PCR was performed in an Eppendorf Mastercycler®. The cycling parameters for screening each centroid involved denaturation at 94°C for 2 min, followed by 25 cycles of heating to 94°C for 30 sec, the relevant optimized annealing temperature for 30 sec, 72°C for extension time of relevant to the product length (1 min/kbp), and a final step of 72°C for 5 min. Primer sequences and PCR conditions for each centroid are shown in Table 1.

PCR products were separated on a 1% agarose gel stained with SYBR Safe and visualized using a UV transilluminator. The proportion of isolates in the pre- and post-2008 groups that contained the selected PCR target were compared by chi-square analysis (with a p-value adjusted for multiple comparisons by Bonferroni correction) to test for significant differences in their distribution.

Phenotypic analysis

Initial annotation of the putative genes novel to the S. Enteritidis isolates collected after 2008 were found to be homologs of tetracycline-family resistance genes and arsenic utilization genes. Kirby-Bauer tests with 30 μg tetracycline were performed on the isolates which had been screened by whole genome sequencing or PCR assay to verify the predicted phenotypic differences between these strains. Colonies grown from overnight incubations were resuspended in PBS at a turbidity of 0.5 MacFarland and plated on Mueller-Hinton agar with discs impregnated with 30 μg tetracycline and concentrations of 0.04, 0.16, and 0.64 mg/ml sodium arsenate [33]. Zones of inhibition were measured after 12–18 hours of incubation at 37°C. For tetracycline antibiograms, zones with a diameter of ≥19 mm were considered susceptible, and ≤14 mm were resistant per CLSI guidelines [34]. Significant differences in the proportion of tetracycline resistant isolates from pre- and post-2008 samples were identified by chi-square analysis and differences in zone diameters of arsenic resistance were compared using Student’s t-test (p<0.05).


Selection of temporal break point

While the epidemiologic analysis of iNTS by Tapia et al. [19] suggested that the increase in the relative proportion of iNTS caused by S. Enteritidis compared to S. Typhimurium occurred in 2008, the change appeared to occur gradually over time from July 2006 to June 2010 (Fig 1). Phylogenomic analysis (Fig 2) was used to determine where in this time period the breakpoint occurred, at which the novel clade was established as the predominant S. Enteritidis genotype in the region. The branches of the tree were manually rotated at each node to determine the potential breakpoint year to minimize the number of misclassifications (isolates collected from one side of the temporal breakpoint appear on a branch with isolates collected from the other side of the breakpoint) across the entire tree. A breakpoint of 2008 minimized the difference in misclassifications between the pre and post groups across several major branches of the phylogenetic tree (S2 Table), and therefore was accepted as the year at which the novel S. Enteritidis genotype was established as the dominant serovar.

Fig 1. Relative proportion of iNTS cases caused by S. Typhimurium and S. Enteritidis from 2002–2012.

The relative proportion of the iNTS isolates observed among pediatric admissions to the HGT as identified by Tapia et al. [19]. The observed shift in relative proportions of S. Enteritidis and S. Enteritidis occurs at approximately 2008.

Fig 2. Phylogenomic analysis of pre-, during, and post-2008 S. Enteritidis isolates.

A whole-genome phylogeny was generated for the 42 S. Enteritidis genomes analyzed in this study, which were collected from the HGT from July 2002 through June 2012. Isolates collected before or during 2008 are shown in black and isolates collected after 2008 are shown in blue. The tree is constructed using a maximum-likelihood phylogeny with 100 bootstrap replicates using RAxML v7.2.8. Shaded grey circles over the nodes designate bootstrap values ≥90.

In silico identification of genes associated with the serovar shift

Among the 42 whole genome sequences acquired from isolates obtained from the HGT from July 2002 through July 2012, as examined by Feasey et al. [15], LS-BSR analysis identified a total of 5,883 putative genes, including 1,220 putative genes that were not completely conserved in all samples (S1 Table). After BLAST score ratios (BSRs) were calculated for each of these putative genes, 12 genes were found to be significantly more prevalent in the pre-2008 sequences and 15 significantly more prevalent in the post-2008 sequences (p<0.01 by chi-square test) (Table 2). After further manual BLAST verification, two putative genes associated with pre-2008 were identified to also be conserved in the post-2008 isolates, and were therefore removed from our analysis. These genes bridged across two contigs of the genetic sequences of post-2008 sequences and therefore were identified as false-negatives during the preliminary in silico analysis. A heatmap visualizing the LS-BSRs of each associated putative gene for each isolate is presented in Fig 3.

Fig 3. Heat map of putative genes associated with the pre- and post-2008 breakpoint.

The 42 S. Enteritidis genomes are listed across the x-axis with putative genes significantly associated with the temporal breakpoint listed down the y-axis. BSR scores range from 1 (“present”), indicated in yellow, to 0 (“absent”), indicated in blue, for each putative gene in each sequenced isolate.

Table 2. Characteristics of putative genes associated with the 2008 breakpoint.

Selection of PCR screening targets

Targets for PCR screening were selected based on genes with functional annotation which could be phenotypically validated and based on the co-location of the genes of interest. Among the isolates collected before or during 2008, S. Enteritidis genes encoding oxidoreductases, a multidrug transcription regulator (marR) and efflux proteins, and putative transport and membrane export proteins were identified (Table 2). Most of the genes associated with the post-2008 time period were not identified among previously sequenced S. Enteritidis available in the NCBI database. However, homologs of insertional transposases, arsR family arsenic-dependent transcriptional regulators, sodium-dependent glutamate permeases, and tetracycline resistance/efflux proteins (tetA) were identified (Table 1). These homologs shared greater than 95% coverage and nucleotide identity with the putative genes of interest in S. Enteritidis isolates (with the exception of centroid_21646_1, for which no homolog was identified through BLAST). Of the 25 investigated genes, seven (7/25, 30%) have no functional homolog and are considered hypothetical proteins.

Mapping of the pre- and post-2008 genes to isolates sequenced for this study suggested that many of the putative genes of interest appear to be located in close proximity, or on the same genomic contig with consistent arrangement and orientation (Fig 4). This suggests that these isolates may have acquired a genomic island or other mobile element that allowed expansion into this niche. Definitive identification of these mobile elements will require further study.

Fig 4. Gene mapping.

(a) Map of the genes associated with the pre-2008 time period (b) Map of the genes associated with the post-2008 time period; genes selected for PCR screening are indicated in green (pre1, pre2, pre3 and post1, post2, post3), other genes significantly associated with the time threshold are shown in blue.

PCR screening

Primers were designed to target each centroid of interest (Table 2 and Fig 4). The results of PCR screening the six selected genes across the 72 non-sequenced isolates exhibited a similar distribution of genes as was identified through the bioinformatic analysis (Table 3). Each of the three genes associated with the pre-2008 time period were identified in significantly more of the isolates collected during or before 2008, compared to those collected after (p = 0.0008, 0.0078, and 0.0047, respectively), suggesting that these genes were lost in the post-2008 isolates. Similarly, each of the three genes associated with the post-2008 time period were identified in significantly more of the isolates collected during that time, compared to isolates collected in the earlier years (p<0.0001, 0.0008, and <0.0001, respectively). This represents an acquisition of these genes in the post-2008 isolates.

Phenotypic results

To validate the in silico results, phenotypes for tetracycline and arsenic resistance were assessed by resistance assays (Table 4). No statistically significant difference was observed in the arsenic resistance among the isolates at any of the three concentrations tested. Tetracycline resistance was identified to be highly associated with the 2008 threshold, with 53% more isolates resistant to tetracycline after 2008 than during or prior to 2008 (p = 0.0008). In comparing these results, the presence of the tetracycline resistance gene screened by PCR (post1) is strongly associated with phenotypic tetracycline resistance as observed by antibiogram assay (p = 0.0016).


S. Typhimurium was initially identified as the predominant serovar of iNTS in most, albeit not all, regions in sub-Saharan Africa [1214]. We identified that 2008 marked a breakpoint in the relative proportions of iNTS caused by S. Enteritidis versus S. Typhimurium in Bamako, Mali, collected from 2002 to 2012. Additionally, we identified that the newly circulating clade of S. Enteritidis exhibits greater levels of tetracycline resistance than previous strains of S. Enteritidis. Our temporal breakpoint agrees with the epidemiologic data from Tapia et al. [19] and the phylogenomic analysis by Feasey et al. [15]. Our finding that S. Enteritidis represents an emerging threat of invasive bacterial disease in Bamako, Mali is mirrored by similar findings from other parts of sub-Saharan Africa [14]. Additionally, surveillance data from 2014 through 2017 with complete serovar determination support this continued change in prevalence as there were 28, 15, 16 and 14 cases of invasive Salmonella disease that occurred, respectively during 2014, 2015, 2016 and 2017. During this four-year period S. Enteritidis was the most commonly identified organsim (S. Enteritidis (N = 28), S. Typhimurium (N = 12), S. Typhi [11] and S. Dublin (N = 9) accounted for 63 of the isolates from the 73 cases of invasive disease). Importantly S. Enteritidis is associated with a significantly greater case fatality rate than S. Typhimurium in all pediatric age groups [12].

The current study markedly expands the number of genome sequences of S. Enteritidis from iNTS cases available in public databases. Furthermore, our study describes an approach to examining these genomes, combining whole genome sequencing and large-scale BLAST score ratios (LS-BSRs) [26]. The use of the LS-BSR method allows investigation of major genetic changes across the genome by identifying and comparing putative gene content among a collection of genome sequences. In an under researched clade of a Salmonella serovar, such a broad sweeping approach was integral to identifying phylogenomic changes over time.

The underlying cause of the shift in serovar predominance resulting in the emergence of S. Enteritidis as the leading cause of iNTS disease, has yet to be fully explained, and we anticipate that the bacterial component of this observation is only part of the larger infection picture. We identified that the recently emerged circulating clade of S. Enteritidis exhibits greater levels of tetracycline resistance than previous strains of S. Enteritidis. This phenotype may provide the serovar with an evolutionary advantage over S. Typhimurium, contributing to the shift wherein S. Enteritidis became the predominant serovar causing iNTS disease in Bamako [19]. Host-mediated hemolysis caused by malaria impairs resistance against iNTS infections [35,36]. Resistance to doxycycline, a long-acting member of the tetracycline family of drugs that can be used for prophylaxis and treatment of malaria [37,38], could theoretically convey an evolutionary advantage to S. Enteritidis by allowing it to survive during malaria prophylaxis or treatment. However, in the Mali context this is implausible because doxycycline and other tetracyclines are rarely used for chemoprophylaxis or treatment of malaria, particularly in infants and young children. However, tetracyclines and other antibiotics are sometimes inserted into potions used to treat fever by traditional healers, but the relative frequency of this practice is not easily quantified. Thus, another explanation must be sought for acquisition of tetracycline genes conveying an increased fitness. Expression of the tetA resistance gene has also been associated with increased influx and accumulation of toxic heavy metal salts in E. coli [33], which may help explain the simultaneous acquisition of this gene with the arsR regulator in the post-2008 isolates.

The arsenic resistance assay performed in this study was selected as a screening method to verify the in silico findings, but the inconclusive results do not negate the bioinformatic analysis which clearly suggested that the acquisition of an arsR regulator gene is associated with the 2008 shift in prevalence of S. Enteritidis over S. Typhimurium. It is possible that our measure of arsenic resistance does not adequately assess the potential phenotypic and global transcriptomic changes generated by the arsR gene product. It is also possible that the arsR homolog gene product regulates other phenotypes that have not been examined in this study. Future studies should investigate alternative metabolism assays, arsenic utilization assays, and genetic knockouts to more thoroughly investigate the possible evolutionary advantages granted by the acquisition of this gene by S. Enteritidis.

By identifying putative genes at a 90% nucleotide identity level, major genetic changes were identified that appear to be novel to the S. Enteritidis serovar. Future studies should further investigate the difference identified to determine the precise nature of the genetic acquisitions and search for additional significant changes at the single nucleotide level. Additional research involving genetic knock outs and complementation will be necessary to fully assess the impact of each potential gene on the phenotype of the organism and to formally identify which phenotypic and genotypic features should be examined through GWAS. Additionally, the shift in NTS serovars was identified to be associated with a specific year, but it is possible that the evolution of infectious disease may occur as a more gradual phenomenon that cannot truly be dichotomized to a single year. As such, our 2008 cut point to identify the novel phylogenomic clade could be subject to misclassification bias. We have attempted to minimize this misclassification by minimizing the amount of differential misclassification between pre- and post-2008 groups, thus biasing our results towards the null.

The concurrent acquisition of the putative novel genes, and the increase in relative importance of the serovar in causing invasive bacterial disease suggests that the genetic changes may provide an evolutionary advantage to S. Enteritidis over S. Typhimurium. This appears to have resulted in S. Enteritidis losing minor enzymes and a putative multidrug transporter in order to gain resistance to tetracycline family antibiotics and an arsenic responsive transcriptional regulator. These data provide a potentially important clue that can be pursued through future research to characterize the competitive advantages and to confirm that no other change in S. Typhimurium is responsible for the decline of that serovar relative to S. Enteritidis. These findings will help illuminate the public health effects of these evolutionary changes to the serovar and shed more light on the evolutionary pressures on S. Enteritidis. Further analyses of the functional effects of this documented change in antibiotic resistance or other phenotypes in the natural environment can help generate novel hypotheses of natural reservoirs of NTS and help elucidate how the S. Enteritidis serovar competes with S. Typhimurium within the human host or in an environmental reservoir or vehicle of transmission.

Supporting information

S1 Table. Large Scale BLAST Score Ratios (LS-BSR) data.

LS-BSR scores of each of the all the putative genes identified among the 42 sequenced S. Enteritidis isolates, with ENA Run reference numbers and year of clinical isolation (based on the June-July seasonal year).


S2 Table. Distribution of misclassified isolates by year.

The proportion of misclassified isolates distributed across the branches of the tree using each year of the study as a potential breakpoint for the serovar shift between S. Typhimurium and S. Enteritidis. Isolates were considered misclassified when they were collected from one half of the chronology but appeared in a branch that consisted mainly of isolates from the other half of the chronology. Branches enumerated from the root outward and were manually rearranged to minimize the number of misclassifications across the tree for each breakpoint. A breakpoint of 2008 provides the lowest minimal misclassifications across both the pre- and post-breakpoint groups, indicated by (*).



  1. 1. Jackson B, Griffin P, Cole D, Walsh K, Chai S. Outbreak-associated Salmonella enterica Serotypes and Food Commodities, United States, 1998–2008—Volume 19, Number 8—August 2013—Emerging Infectious Disease journal—CDC. [cited 2017 Feb 9]; Available from:
  2. 2. Acheson D, Hohmann EL. Nontyphoidal salmonellosis. Clin Infect Dis. 2001;32(2):263–9. pmid:11170916
  3. 3. Varma JK, Mølbak K, Barrett TJ, Beebe JL, Jones TF, Rabatsky-Ehr T, et al. Antimicrobial-resistant nontyphoidal Salmonella is associated with excess bloodstream infections and hospitalizations. J Infect Dis. 2005;191(4):554–61. pmid:15655779
  4. 4. Gordon MA, Banda HT, Gondwe M, Gordon SB, Boeree MJ, Walsh AL, et al. Non-typhoidal salmonella bacteraemia among HIV-infected Malawian adults: high mortality and frequent recrudescence. Aids. 2002;16(12):1633–41. pmid:12172085
  5. 5. Graham SM, Walsh AL, Molyneux EM, Phiri AJ, Molyneux ME. Clinical presentation of non-typhoidal Salmonella bacteraemia in Malawian children. Trans R Soc Trop Med Hyg. 2000;94(3):310–4. pmid:10975008
  6. 6. Laupland KB, Schønheyder HC, Kennedy KJ, Lyytikäinen O, Valiquette L, Galbraith J, et al. Salmonella enterica bacteraemia: a multi-national population-based cohort study. BMC Infect Dis. 2010;10:95. pmid:20398281
  7. 7. MacLennan CA, Gondwe EN, Msefula CL, Kingsley RA, Thomson NR, White SA, et al. The neglected role of antibody in protection against bacteremia caused by nontyphoidal strains of Salmonella in African children. J Clin Invest. 2008;118(4):1553–62. pmid:18357343
  8. 8. Bronzan RN, Taylor TE, Mwenechanya J, Tembo M, Kayira K, Bwanaisa L, et al. Bacteremia in Malawian children with severe malaria: prevalence, etiology, HIV coinfection, and outcome. J Infect Dis. 2007;195(6):895–904. pmid:17299721
  9. 9. Brent AJ, Oundo JO, Mwangi I, Ochola L, Lowe B, Berkley JA. Salmonella bacteremia in Kenyan children. Pediatr Infect Dis J. 2006;25(3):230–6. pmid:16511385
  10. 10. Feasey NA, Everett D, Faragher EB, Roca-Feltrer A, Kang’ombe A, Denis B, et al. Modelling the contributions of malaria, HIV, malnutrition and rainfall to the decline in paediatric invasive non-typhoidal Salmonella disease in Malawi. PLoS Negl Trop Dis. 2015;9(7):e0003979. pmid:26230258
  11. 11. Okoro CK, Barquist L, Connor TR, Harris SR, Clare S, Stevens MP, et al. Signatures of Adaptation in Human Invasive Salmonella Typhimurium ST313 Populations from Sub-Saharan Africa. PLoS Negl Trop Dis. 2015 Mar 24;9(3):e0003611. pmid:25803844
  12. 12. Okoro CK, Kingsley RA, Connor TR, Harris SR, Parry CM, Al-Mashhadani MN, et al. Intracontinental spread of human invasive Salmonella Typhimurium pathovariants in sub-Saharan Africa. Nat Genet. 2012 Nov;44(11):1215–21. pmid:23023330
  13. 13. Kingsley RA, Msefula CL, Thomson NR, Kariuki S, Holt KE, Gordon MA, et al. Epidemic multiple drug resistant Salmonella Typhimurium causing invasive disease in sub-Saharan Africa have a distinct genotype. Genome Res. 2009 Dec 1;19(12):2279–87. pmid:19901036
  14. 14. Feasey NA, Dougan G, Kingsley RA, Heyderman RS, Gordon MA. Invasive non-typhoidal salmonella disease: an emerging and neglected tropical disease in Africa. The Lancet. 2012;379(9835):2489–99.
  15. 15. Feasey NA, Hadfield J, Keddy KH, Dallman TJ, Jacobs J, Deng X, et al. Distinct Salmonella Enteritidis lineages associated with enterocolitis in high-income settings and invasive disease in low-income settings. Nat Genet. 2016;
  16. 16. Tennant SM, Diallo S, Levy H, Livio S, Sow SO, Tapia M, et al. Identification by PCR of non-typhoidal Salmonella enterica serovars associated with invasive infections among febrile patients in Mali. PLoS Negl Trop Dis. 2010;4(3):e621. pmid:20231882
  17. 17. Ramachandran G, Boyd MA, MacSwords J, Higginson EE, Simon R, Galen JE, et al. Opsonophagocytic Assay To Evaluate Immunogenicity of Nontyphoidal Salmonella Vaccines. Clin Vaccine Immunol. 2016 Jun 1;23(6):520–3. pmid:27030587
  18. 18. Ikumapayi UN, Antonio M, Sonne-Hansen J, Biney E, Enwere G, Okoko B, et al. Molecular epidemiology of community-acquired invasive non-typhoidal Salmonella among children aged 2–29 months in rural Gambia and discovery of a new serovar, Salmonella enterica Dingiri. J Med Microbiol. 2007 Nov 1;56(11):1479–84.
  19. 19. Tapia MD, Tennant SM, Bornstein K, Onwuchekwa U, Tamboura B, Maiga A, et al. Invasive nontyphoidal Salmonella infections among children in Mali, 2002–2014: microbiological and epidemiologic features guide vaccine development. Clin Infect Dis. 2015;61(suppl 4):S332–8.
  20. 20. Le Bacq F, Louwagie B, Verhaegen J. Salmonella typhimurium andSalmonella enteritidis: Changing epidemiology from 1973 until 1992. Eur J Epidemiol. 1994;10(4):367–71. pmid:7843338
  21. 21. Nuccio S-P, Bäumler AJ. Comparative analysis of Salmonella genomes identifies a metabolic network for escalating growth in the inflamed gut. MBio. 2014;5(2):e00929–14. pmid:24643865
  22. 22. Nuccio S-P, Bäumler AJ. Reconstructing pathogen evolution from the ruins. Proc Natl Acad Sci. 2015;112(3):647–8. pmid:25568086
  23. 23. Angiuoli SV, Matalka M, Gussman A, Galens K, Vangala M, Riley DR, et al. CloVR: A virtual machine for automated and portable sequence analysis from the desktop using cloud computing. BMC Bioinformatics. 2011;12:356. pmid:21878105
  24. 24. Hyatt D, Chen G, Locascio P, Land M, Larimer F, Hauser L. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11(1):119.
  25. 25. Rasko DA, Myers GS, Ravel J. Visualization of comparative genomic analyses by BLAST score ratio. BMC Bioinformatics. 2005;6(1):1.
  26. 26. Sahl JW, Caporaso JG, Rasko DA, Keim P. The large-scale blast score ratio (LS-BSR) pipeline: a method to rapidly compare genetic content between bacterial genomes. PeerJ. 2014;2:e332. pmid:24749011
  27. 27. Angiuoli SV, Salzberg SL. Mugsy: fast multiple alignment of closely related whole genomes. Bioinformatics. 2011 Feb 1;27(3):334–42. pmid:21148543
  28. 28. Taylor J. bx-python [Internet]. 2016. Available from:
  29. 29. Schloss PD, Westcott SL, Ryabin T, Hall JR, Hartmann M, Hollister EB, et al. Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl Environ Microbiol. 2009 Dec;75(23):7537–41. pmid:19801464
  30. 30. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014 May 1;30(9):1312–3. pmid:24451623
  31. 31. FigTree [Internet]. Molecular Evolution, Phylogenetics and Epidemiology; 2007. Available from:
  32. 32. Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, et al. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012;28(12):1647–9. pmid:22543367
  33. 33. Kazemi M, Kasra Kermanshahi R, Heshmat Dehkordi E, Payami F, Behjati M. Resistance Index of Penicillin-Resistant Bacteria to Various Physicochemical Agents. ISRN Microbiol. 2012;2012.
  34. 34. Clinical and Laboratory Standards Institute. Disk Diffusion Supplemental Tables. Wayne, PA: CLSI; 2011. (M100-S21).
  35. 35. Mabey DCW, Brown A, Greenwood BM. Plasmodium falciparum Malaria and Salmonella Infections in Gambian Children. J Infect Dis. 1987;155(6):1319–21. pmid:3553352
  36. 36. Roux CM, Butler BP, Chau JY, Paixao TA, Cheung KW, Santos RL, et al. Both Hemolytic Anemia and Malaria Parasite-Specific Factors Increase Susceptibility to Nontyphoidal Salmonella enterica Serovar Typhimurium Infection in Mice. Infect Immun. 2010 Apr 1;78(4):1520–7. pmid:20100860
  37. 37. Cunnington AJ, de Souza JB, Walther M, Riley EM. Malaria impairs resistance to Salmonella through heme- and heme oxygenase-dependent dysfunctional granulocyte mobilization. Nat Med. 2012 Jan;18(1):120–7.
  38. 38. Watt G, Loesuttivibool L, Shanks G, Boudreau E, Brown A, Pavanand K, et al. Quinine with tetracycline for the treatment of drug-resistant falciparum malaria in Thailand. Am J Trop Med Hyg. 1992 Jul;47(1):108–11. pmid:1636876