Chickens, pigs, and cattle are key reservoirs of Salmonella enterica, a foodborne pathogen of worldwide importance. Though a decade has elapsed since publication of the first Salmonella genome, thousands of genes remain of hypothetical or unknown function, and the basis of colonization of reservoir hosts is ill-defined. Moreover, previous surveys of the role of Salmonella genes in vivo have focused on systemic virulence in murine typhoid models, and the genetic basis of intestinal persistence and thus zoonotic transmission have received little study. We therefore screened pools of random insertion mutants of S. enterica serovar Typhimurium in chickens, pigs, and cattle by transposon-directed insertion-site sequencing (TraDIS). The identity and relative fitness in each host of 7,702 mutants was simultaneously assigned by massively parallel sequencing of transposon-flanking regions. Phenotypes were assigned to 2,715 different genes, providing a phenotype–genotype map of unprecedented resolution. The data are self-consistent in that multiple independent mutations in a given gene or pathway were observed to exert a similar fitness cost. Phenotypes were further validated by screening defined null mutants in chickens. Our data indicate that a core set of genes is required for infection of all three host species, and smaller sets of genes may mediate persistence in specific hosts. By assigning roles to thousands of Salmonella genes in key reservoir hosts, our data facilitate systems approaches to understand pathogenesis and the rational design of novel cross-protective vaccines and inhibitors. Moreover, by simultaneously assigning the genotype and phenotype of over 90% of mutants screened in complex pools, our data establish TraDIS as a powerful tool to apply rich functional annotation to microbial genomes with minimal animal use.
Salmonella Typhimurium is a major cause of human diarrhoeal infections, usually acquired from chickens, pigs, cattle, or their products. To understand the basis of persistence and pathogenesis in these reservoir hosts, and to inform the design of novel vaccines and treatments, we generated a library of 7,702 S. Typhimurium mutants, each bearing an insertion at a random position in the genome. Using DNA sequencing, we identified the disrupted gene in each mutant and determined its relative abundance in a laboratory culture and after experimental infection of mice, chickens, pigs, and cattle. The method allowed large numbers of mutants to be investigated simultaneously, drastically reducing the number of animals required to perform a comprehensive screen. We identified mutants that grow in culture but do not survive in one or more of the animals. The genes disrupted in these mutants are inferred to be important for the infection process. Most of these genes were required in all three food-producing animals, but smaller subsets of genes may mediate persistence in a specific host species. The data provide the most comprehensive map of virulence-associated genes for any bacterial pathogen in natural hosts and are highly relevant for the design of control strategies.
Citation: Chaudhuri RR, Morgan E, Peters SE, Pleasance SJ, Hudson DL, Davies HM, et al. (2013) Comprehensive Assignment of Roles for Salmonella Typhimurium Genes in Intestinal Colonization of Food-Producing Animals. PLoS Genet 9(4): e1003456. doi:10.1371/journal.pgen.1003456
Editor: Diarmaid Hughes, Uppsala University, Sweden
Received: January 26, 2013; Accepted: March 2, 2013; Published: April 18, 2013
Copyright: © 2013 Chaudhuri et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by the Biotechnology and Biological Sciences Research Council (http://www.bbsrc.ac.uk; grant numbers BB/D017947/1, BB/D017556/1, and BB/D018080/1) and by the Wellcome Trust (http://www.wellcome.ac.uk; grant number 076964). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Salmonella enterica is a facultative intracellular pathogen of worldwide importance, associated with c. 21.7 million cases of systemic typhoid fever and 93.8 million cases of non-typhoidal gastroenteritis in humans each year , . Around 86% of human cases of non-typhoidal salmonellosis are the result of food-borne infections , and chickens, pigs and cattle are key reservoirs of infection . The major S. enterica subspecies enterica encompasses a wide variety of serovars. Some of these, such as S. Typhimurium and S. Enteritidis, exhibit a wide host range, whereas others such as S. Typhi are largely restricted to a single host species. The molecular basis of the host- and tissue-tropism of S. enterica has long eluded researchers and there has been a disproportionate emphasis on the basis of Salmonella persistence and pathogenesis in murine models of colitis and typhoid fever. Comparative analyses of whole genome sequences have associated host-restriction of S. enterica serovars with gene decay. Interpretation of the impact of variation in the repertoire, sequence or expression of S. enterica genes requires an understanding of the roles of those genes in relevant hosts. Random transposon insertion mutants have been screened individually in chickens , but high-throughput simultaneous analysis of mutant phenotypes was made possible by the advent of signature-tagged mutagenesis (STM) . STM allows the survival of individual mutants within a pool to be assessed qualitatively through hybridization of a probe to a unique tag sequence within the transposon. Comparison of hybridization signals obtained from “input pools” of mutants grown in vitro with the those obtained from the same set of mutants screened for survival in a model of infection (“output pool”) allows attenuated mutants to be identified. The insertion sites can then be identified by subcloning and sequencing. Analysis of pools of signature-tagged S. Typhimurium mutants in mice  led to the discovery of Salmonella Pathogenicity Island (SPI)-2 , a gene cluster that encodes a type III secretion system (T3SS-2) that acts as a molecular syringe for the secretion of effector molecules and influences systemic virulence and intracellular survival . A distinct T3SS, encoded by SPI-1, was already known to be essential for infection of mice via the oral route . Subsequently, STM libraries constructed in a range of Salmonella serovars were examined for their ability to colonize multiple hosts –. Comparative analysis of a library of 1045 mutants in chickens, pigs and calves suggested that S. Typhimurium deploys both conserved- and host-specific virulence factors , . Notably, SPI-1 and -2 were vital in intestinal colonization of calves and pigs and to a lesser extent chickens , , yet a Type I protein secretion system encoded by SPI-4 appeared to influence infection of calves, but not chickens  or pigs .
Although STM has provided valuable insights into Salmonella pathogenesis, the technique is limited by the number of unique tags available, and the time and effort required to construct the library and identify attenuating mutations. Moreover, only negatively-selected mutants tend to be investigated and subjective judgments are used to compare signal intensities relative to the input and to other screened mutants. Transposon-directed insertion-site sequencing (TraDIS) , one of a new generation of STM-like techniques, addresses some of these limitations. TraDIS exploits Illumina sequencing  to obtain the sequence of the genomic region flanking each transposon. The massively parallel nature of the sequencing permits comparison of the number of specific reads derived from the input pools and the output pools after animal infection, providing a numerical measure of the extent to which mutants were negatively- or positively-selected during colonization (see Figure 1). TraDIS-like sequencing methods have been used to identify the essential gene complement of S. Typhi  and Streptococcus pneumoniae , genes involved in virulence of Haemophilus influenzae  and S. pneumoniae  in mice, and genes required for survival of the symbiont Bacteroides thetaiotaomicron in the murine gut . Here we apply TraDIS to simultaneously assign the genotype and relative fitness of 7702 distinct S. Typhimurium mutants during intestinal colonization of chickens, pigs and cattle, providing highly relevant data for the control of zoonotic and animal salmonellosis.
An input pool of random transposon insertion mutants is generated, and used to inoculate experimental animals. Output pools of bacteria that are capable of survival and growth in each host are isolated from an appropriate tissue. Massively parallel sequencing of the regions flanking each transposon allow the disrupted genes to be identified, and comparison of the sequence counts derived from the input and output pools allows the relative fitness of each mutant to be assessed.
Quantitative assessment of mutant fitness
To validate the quantitative nature of TraDIS, we applied it to investigate pools of Mu and mini-Tn5 mutants of S. Typhimurium strain SL1344 before and after intravenous infection of BALB/c mice (see Text S1). These mutant pools had been characterized previously using transposon-mediated differential hybridization (TMDH) , a microarray-based method that relies on hybridization of run-off transcripts arising from transposon-encoded T7 and SP6 promoters to high-density oligonucleotide arrays. In total, using TraDIS, 9792 distinct transposon insertions (4992 Mu and 4800 Tn5) were unambiguously mapped at the level of the single nucleotide to the SL1344 genome, providing relative fitness scores for 94.4% of the 10368 mutants screened, This is likely to be an underestimate of the performance of TraDIS, since it is likely that there were siblings of mutants with mapped insertions within the pool of 10368 mutants. The fitness scores assigned by TraDIS are defined as the log2-fold change in the number of sequence reads obtained across the boundaries of each transposon insertion between the input and output pools (see Materials and Methods), and were significantly correlated with the existing TMDH data (Figure S1; P<2.2×10−16), verifying the quantitative nature of TraDIS. TraDIS allowed identification of a number of mutants missed by TMDH (see Text S1), and provided finer mapping of insertion sites than can be achieved by hybridization of transposon-flanking sequences to tiling arrays, extending the conclusions of the earlier study and demonstrating the superiority of the TraDIS approach.
Assessment of mutant fitness in food animals
Though useful, previous attempts to assign comprehensively the role of S. Typhimurium genes relied on parenteral infection of atypically susceptible mice –, and do not reflect the roles of Salmonella genes during intestinal colonization of food-producing animals infected via the natural oral route. To identify genes relevant to colonization of animal reservoirs and therefore zoonosis, we generated a library of 8550 mini-Tn5 mutants of S. Typhimurium strain ST4/74 and applied TraDIS to assess the survival of these mutants during oral infection of chickens, pigs and calves. Pools of 475 mutants were screened in individual pigs and calves and pools of 95 mutants were screened in duplicate chickens. Pilot studies in which selected pools had been repeatedly screened in each species indicated the reliable negative selection of the same mutants at these pool complexities in the absence of stochastic loss that may be due to population bottlenecks (data not shown). TraDIS mapped 7702 distinct insertions to the nucleotide level, representing at least 90.1% of the mutants screened, and demonstrated the random distribution of the transposon insertions around the genome (see Figure 2), which disrupt 2715 different genes. TraDIS analysis revealed evidence of random drop-out of mutants from the pools in one chicken, two pig and two calf experiments, likely owing to recovery of output pools of an inadequate size, and data from these animals were omitted from subsequent analysis (see Text S1).
The inner two rings indicate the positions of annotated genes, coloured according to their GC content (blue = low, yellow = intermediate, red = high). The outer ring indicates the number of transposon-flanking sequence reads obtained at each position, with peaks corresponding to the presence of a transposon insertion.
TraDIS assignments of the insertion sites and fitness scores of mutants are listed in Table S1 (BALB/c mice dosed intravenously) and Table S2 (chickens, pigs and calves dosed orally). The raw TraDIS sequence data are available from the NCBI Short Read Archive (accession numbers ERA000172 and ERP000286). To facilitate exploration of the TraDIS data, a user-friendly online genome browser was constructed with which the insertion site and fitness score can be viewed in the context of the linear genome, GC content, transcription start sites and existing annotation (http://www-tradis.vet.cam.ac.uk). Figure 3 shows the fitness scores obtained by TraDIS analysis of S. Typhimurium mutants screened in mice, chickens, pigs and calves, plotted against read coverage (equivalent to the “M/A” plots commonly used to display microarray data). The proportion of significantly attenuated mutants identified during intestinal colonization of food-producing animals was greater than in the murine typhoid model. In each host, a large proportion of mutations did not exert a strong negative or positive effect, indicating that a high number of accessory or redundant functions exist. P values were estimated using the available biological replicates (all pools were screened in duplicate in chickens, 2 pools were screened in duplicate calves and 3 pools were screened in triplicate pigs), and attenuated mutants were defined as those with a negative fitness score and P≤0.05.
Mutants which showed a significant change in abundance in the output relative to the input are highlighted in red. Mutants which had no reads in the output pool are assigned an arbitrary fitness score of −15.
To summarize the dataset further, genes were scored as potentially important in colonization if they were disrupted in at least one significantly attenuated mutant in any of the four host species. This enables comparisons between datasets derived from different transposon libraries (such as the mouse and chicken/pig/cattle datasets), and visualization of the data in comparison with other genome-wide datasets. For some genes, insertions at different subgenic locations can have divergent effects on the encoded protein resulting in contrasting fitness scores, so it is important to consider the genetic context of each individual transposon when interpreting the TraDIS data in detail. This is true for all transposon-based mutant screens, although earlier technologies such as STM and TMDH lacked sufficient resolution to permit such considerations. We recommend the use of the TraDIS browser (http://www-tradis.vet.cam.ac.uk) to assist interpretation of our data in the context of the genome annotation.
Fitness scores were obtained for 3194 distinct genes disrupted by transposon insertions in the mouse screen, and 2715 genes in the chicken, pig and calf screens, of which fitness score existed for 2435 genes in all three food-producing animals. Fitness scores were available from all four hosts for 1935 genes, of which 1069 had a significantly attenuated mutant in at least one host. Venn diagrams showing the numbers of significantly attenuated mutants, and the numbers of genes potentially important for colonization in chickens, pigs and cattle are shown in Figure 4. A further Venn diagram, combining the chicken/pig/cattle and mouse datasets, is available in Figure S2, and Figure S3 shows a comparison between the chicken, pig and cattle TraDIS data, and the data obtained from equivalent mutants in the earlier STM screens , . A table of all genes disrupted by a transposon insertion, indicating if any of the mutants in each gene was significantly attenuated, is available in Table S3. To facilitate exploration of the TraDIS data, custom files have been prepared that allow proteins in KEGG metabolic pathway diagrams  to be coloured blue if an attenuated mutant was found in the encoding gene, or red if the gene was mutated but no significant attenuation was observed. The KEGG colour files are available from http://www-tradis.vet.cam.ac.uk. As an example, Figure S4 shows the effect of mutations affecting multiple steps in chorismate biosynthesis, which is known to influence persistence of S. Typhimurium in vivo. It is evident that mutations affecting sequential steps in the pathway are attenuating, with just two exceptions: the initial condensation of D-erythrose-4-phosphate and phosphoenolpyruvate into 3-deoxy-D-arabino-heptulosonate-7-phosphate, and the conversion of shikimate to shikimate-3-phosphate, both of which can be catalyzed by the products of multiple genes (aroFGH and aroKL, respectively). Thus, TraDIS identifies pathways in which defects exert a common effect, but also reveals steps at which functional redundancy exists. During the analysis of the TraDIS data it became clear that many regions of the genome with low GC content are important for intestinal colonization of chickens, pigs and cattle (see Text S1 and Figure S5).
A) the numbers of transposon mutants which were significantly attenuated in each host B) the numbers of genes which were disrupted in the TraDIS mutant library, and which are potentially important for colonization (i.e. they had at least one significantly attenuated mutant) in each of the three reservoir hosts.
Analysis of defined null mutants in chickens
Twelve genes were selected for further investigation based on the TraDIS data: carB, clpB, ilvC, mig-14, pagN, SL1344_0084 (STM0084), SL1344_4248 (STM4312), SL1344_3128 (STM3154), trxA, virK, ytfL and zirT (SL1344_1599). These targets were chosen based on their fitness scores (at least one mutant in each gene shows significant attenuation), and to include some genes with established roles in colonization in chickens (clpB ) or mice (trxA , mig-14 , virK ), genes with a postulated role in colonization (pagN , SL1344_0084 ), the mouse anti-virulence factor zirT , genes demonstrating variable fitness scores (SL1344_0084, SL1344_3128 and ytfL) and genes that demonstrate putative host-specific effects on colonization in the TraDIS data (clpB, ilvC and ytfL). Each gene was inactivated separately by λRed recombinase-mediated integration of linear PCR amplicons by homologous recombination . Mutant phenotypes were evaluated in groups of 3 chickens per mutant per time interval. For each mutant, competitive indices (CIs) were derived 4, 6 and 10 days post-inoculation of age-matched chickens with the kanR-tagged mutant and ST4/74 nalR wild-type strain in a 1∶1 ratio (see Table 1).
With a single exception (SL1344_3128) all mutants were negatively-selected relative to the parent strain at day 4 post-inoculation, which corresponds to the time at which mutants were recovered for the TraDIS analysis. The difference in the mutant∶wild-type ratio was significantly different from the ratio in the inocula for 8 of the mutants at day 4. For a further 2 mutants, significant differences were detected at later time points. Taken together with comparisons to existing datasets for signature-tagged mutants of the same strain in the same animal models ,  (see Text S1), the data strongly support evidence of attenuation detected by TraDIS. Variance from TraDIS fitness scores is likely to reflect differences in competition dynamics for a given mutant relative to co-screened wild-type or mutant bacteria. For one gene (SL1344_3128 ), no evidence was found of any attenuation of the defined mutant, which performed comparably to the wild-type at all time intervals. This gene was chosen for further investigation because its mutants exhibited a wide range of TraDIS fitness scores (−1.02 to −9.20). Interestingly, mutants in the gene cluster SL1344_3128-30 are predicted to be deficient in swarming motility , suggesting the possibility that such motility may be an occasional but not universal requirement for colonization.
The genetic basis of intestinal colonization
The TraDIS dataset is a powerful resource for understanding intestinal colonization of a range of highly relevant hosts by Salmonella, and thus zoonotic transmission and animal disease. The data suggest that the definition of what constitutes a colonization gene is not straightforward, encompassing genes involved in metabolism, stress responses and transcriptional regulation, together with genes with well-established roles in virulence. The T3SSs encoded by SPI-1 and SPI-2 are both essential for infection in chickens, pigs and cattle, although there are some mutants within both regions that are not attenuated or exhibit a less pronounced phenotype in chickens. T3SSs allow the secretion of effector molecules into the host cytoplasm, these effectors being encoded both within SPI-1 and 2 and distally. Most of the known effector genes, including sopA, sopB, sopE2, sipA, avrA, sipC, sseG, sseI, sifA, sseK1, pipB2 and sopD2, were identified by TraDIS as being important for infection of all three food-producing animals, although as with the T3SS structural genes, the phenotype was often less pronounced in chickens. Other T3SS effectors, including sptP, slrP, gogB and sspH2, could be disrupted without affecting colonization. Of these, slrP has been implicated as a host-specificity factor, essential for oral infection of mice but not required for calf infection . Null mutants of sptP are not impaired in their interactions with cultured macrophages or epithelial cells –, and sspH2 mutants do not show any defect in vacuole-associated actin polymerization . For both sptP and sspH2, the lack of phenotype was suggested to be due to functional redundancy amongst T3SS effectors.
There are also attenuated mutants that harbour insertions within the other recognized Salmonella pathogenicity islands . In SPI-3, mutants in some genes (mgtC, marT, SL1344_3717 and SL1344_3721) were attenuated, with others (misL, sugR, slsA and mgtB) showing no attenuation. Interestingly, marT, which encodes a transcriptional regulator, is a pseudogene in S. Typhi, and restoring it reduces survival during infection of a human cell culture . A role for marT in infection of chickens, pigs and cattle suggests a selection pressure for its retention in the S. Typhimurium genome. SPI-4 was previously thought to play a role in infection of cattle but not chickens or pigs, based on STM screens , . TraDIS suggests a role for SPI-4 in all three species, although the phenotypes in chickens and pigs are more subtle than in calves, highlighting the increased sensitivity of TraDIS relative to STM. Some transposon insertions within the central highly repetitive region of siiE are tolerated in chickens and pigs, but are attenuated in cattle. Interestingly, siiE is split into two ORFs in both S. Typhi genome sequences, and part of the repetitive central region is absent from the genome of S. Paratyphi A. SiiE is secreted , indicating that in trans complementation by co-screened mutants does not obscure the identification of secreted colonization factors by TraDIS. All of the genes of the enteritis-associated SPI-5 that were disrupted by a transposon (pipACD, sopB and orfX) were required in all three species, although often with a milder phenotype in chickens. Several clusters of attenuated mutants were also found in the Salmonella chromosomal island (SCI, also known as SPI-6 in S. Typhi), including mutants in the hypothetical genes sciJ, sciQ, SL1344_0286A and sciX, the fimbrial subunit safA and its chaperone safB, the regulator sinR, SL1344_0301 (STM0305) which encodes a putative cytoplasmic protein, the pagN adhesin (STM0306) and sciZ (STM0307), a homologue of Shigella virG. Deletion of SCI affects invasion and virulence in a mouse intraperitoneal infection model , and the phenotype of a defined safA mutant has been confirmed in pigs .
Fimbriae play a well-established role in Salmonella attachment and intestinal colonization . All twelve fimbrial operons were disrupted by multiple transposons in the TraDIS screen. No obvious host-specific phenotypes were seen, with a common pattern that mutants of fimbrial subunit genes were attenuated, whereas assembly genes were often dispensable, suggesting cross-talk in the assembly pathways. Stress responses are also important in the infection process, as Salmonella is subjected to a range of stresses including low pH, oxidative stress and heat shock . The genetic components of these stress responses overlap , and many of these genes harboured transposons that resulted in attenuation. These included the sigma factor gene rpoE and its anti-sigma factor resA, the heat shock chaperone genes dnaK and dnaJ and the heat shock protease gene degP (htrA). Interestingly, several stress response genes are variably attenuated in the different hosts, suggesting species-specific stresses. These include the two-component regulatory system genes envZ and ompR, and the oxidative stress response genes dps, katE and proV which are all attenuated in pigs and cattle but show little or no attenuation in chickens. Conversely, transposon mutants in clpB, clpP and clpX, which encode proteases and are involved in the regulation of rpoS, are attenuated in chickens but not pigs or cattle.
Many S. Typhimurium genes beyond the classical virulence factors and stress response genes were revealed to be important for oral infection of livestock species. These include genes involved in nucleotide metabolism (pyrCD, purADGH, dgt, dcd, guaA, pyrCD and carAB), aromatic amino acid biosynthesis (aroABCDE), inorganic ion transport (trkAH, znuABC, fepCDG), protein synthesis (tufAB, fusA, efp, rplI, rpsK), protein export (tatABC, yajC) and many genes involved in carbohydrate metabolism. Additionally, numerous low GC clusters of genes with putative metabolic functions and multiple attenuating mutations were identified. Several global regulators, including crp, smpB and dam, result in attenuation in all three hosts, whereas another, fnr, appeared only to be important for infection in chickens. On occasion, TraDIS revealed functional data at a sub-genic level. For example, most of the insertions that disrupt the SPI-1 gene sptP result in attenuation, but insertions close to the 3′ end of the gene are tolerated. The gene rpoC, which encodes the β′ subunit of RNA polymerase, is essential in S. Typhimurium . However, one transposon insertion in the chicken, pig and cattle dataset, and two in the mouse dataset, were identified close to the 3′ end of rpoC. These insertions would disrupt the extreme C-terminal end of the encoded protein, and were found to reduce the fitness of the mutants in the animal screens. Similarly, an insertion was found at the 3′ end of the essential polA gene which encodes DNA polymerase I, and this mutant was significantly attenuated in chickens, pigs and cattle.
The recent RNAseq-based analysis of the S. Typhimurium SL1344 transcriptome  identified a number of small-regulatory RNAs. As indicated in Table S4 several of these were implicated in colonization by TraDIS. For many it is difficult to demonstrate conclusively a colonization-associated phenotype from the TraDIS data alone, since we cannot preclude the potential for polar effects on adjacent genes. This is the case for InvR, which is encoded within SPI-1. Table S4 details only the sRNA genes annotated in the SL1344 genome (which differs from ST4/74 by just 8 SNPs ), but in the TraDIS data there are a number of attenuating transposons within large intergenic regions that could reveal the presence of novel sRNA genes.
Putative host-specificity determinants of S. Typhimurium
The chicken, pig and cattle TraDIS data presented in Figure 4 indicate that a shared core set of 611 genes is required for efficient colonization of all three species, with a smaller set of species-specific colonization factors. The core set comprises approximately two thirds of the genetic requirements for infection of each individual species, and 48% of the total set of colonization-associated genes. There are 259 genes which are required for systemic infection of mice for which comparable data are available from the food-producing animals (Figure 4); of these, 140 also contribute to oral infection of chickens, pigs and cattle, and only 43 are putative mouse-specific factors. Many of the differences between the mouse and chicken/pig/cattle datasets may arise from the additional genetic requirements for infection via the oral route.
Although most colonization factors were necessary for infection of chickens, pigs and cattle, there were some patterns amongst the colonization factors that appeared to function in a host-specific manner that may reflect underlying differences in host biology. There are many genes associated with flagellar motility that are essential for infection of pigs but not required for chicken or calf infection, including fliY, flgK, fliN, flgN, fliB and fliZ. Several other flagellum-associated genes (flgB, flgL, fliL) are required for infection of cattle but not chickens or pigs. In chickens, many genes that are involved in anaerobic growth are required; these include genes involved in the production of group I hydrogenase (hypOBF, hybABCDF), fumarate reductase (frdAD), pflB, pfkA, rNTP reductase (nrdDG) and the global regulator Fnr. Differences in oxygen tension proximal to villus tips have been detected that modulate the regulation of Shigella virulence genes , therefore the requirement for distinct respiratory pathways by S. Typhimurium in food animals may reflect differences in the niches occupied. Also required in chickens, but apparently not calves or pigs, are the virK homologue ybjX, ilvGE, clpB and the his operon. The observation that many of the host-specific phenotypes were observed independently in multiple genes affecting the same pathways strongly suggests that these effects are due to differences in the within-host environment. For example, in relation to fumarate reductase there were nine independent frdA mutants that were all significantly attenuated in chickens; of these only one showed significant attenuation in pigs, and none in calves. Similarly, in relation to group I hydrogenase, from a total of 15 mutations affecting the hybABCDF genes, 12 showed significant attenuation in chickens, but none in pigs or calves (Table S2).
The serovar Typhimurium strain ST4/74 investigated here is a natural bovine isolate that elicits pathology typical of clinical salmonellosis in all the host species used in this study. However, some atypical S. Typhimurium strains exist that have lost the capability to colonize a broad range of hosts. For example, the laboratory-adapted strain LT2 and its derivatives tend to be avirulent or less virulent in mice relative to natural isolates of serovar Typhimurium  and ST4/74-based strains. It is noteworthy that the rpoS gene encoding the sigma factor σS, which is defective in LT2 and associated with the relative avirulence of this strain , was found to be required in all four species tested by screening of the S. Typhimurium libraries we describe (Table S3). TraDIS identified additional regions of the ST4/74 genome that are required for infection, but which are absent from the LT2 genome. These include several genes that are encoded within the same phage element: SL1344_1965, which is required for infection of mice, and SL1344_1929/30 and SL1344_1976, which are required for infection of chickens, pigs and cattle. Moreover, some Typhimurium strains have become adapted to a particular host, and the genome sequence of a human-adapted variant  reveals the decay of a number of genes important in colonization of food animals (e.g. allP, sseI, pipD, ydeE), but also other pseudogenes for which no role in food animals for the intact gene could be detected (e.g. ratB, ygbE, yhjU). Integration of genome sequences with high-resolution functional data of the kind we describe will provide further clues to explain the differential virulence of host-adapted or laboratory strains relative to natural isolates.
Our study represents the first comprehensive genome-wide survey of the role of thousands of Salmonella genes during colonization of the primary reservoirs of human non-typhoidal salmonellosis. TraDIS simultaneously assigned the genotype and relative fitness of 7702 distinct S. Typhimurium insertion mutants in chickens, pigs and cattle, representing over 90% of the mutants screened in pools of up to 475 mutants per animal. TraDIS therefore represents a significant advance in the reduction, refinement and replacement of animal models relative to STM, where only negatively-selected mutants tend to be interrogated and the vast majority of insertion sites and phenotypes are unreported . Multiple lines of evidence suggest that the TraDIS data are robust and reliably reflect the fitness of the screened mutants in each host animal. Many of the attenuated mutants were found to harbour transposons in genes known to be involved in colonization. The TraDIS fitness scores correlated well with established datasets obtained using STM ,  and TMDH . Multiple mutations within the same gene or pathway usually gave comparable phenotypes, and most of the attenuated mutants demonstrated the same phenotype independently in the three different food-producing animal hosts. The examples of putatively host-specific attenuation tended to be restricted to particular pathways with multiple independent mutations. Finally, analysis of defined knockout mutants of targets chosen based on the TraDIS data reproduced attenuated phenotypes in all but one case.
Many novel colonization-associated genes were identified within the S. Typhimurium genome and the data provide an invaluable resource for the community to mine and extend. Moreover, TraDIS indicated that thousands of mutations exerted little or no effect in vivo, implying functional redundancy that may limit and refine the selection of targets for novel inhibitors, as previously suggested . Unlike library screens conducted in murine typhoid models to date, we provide highly relevant data for control of intestinal S. enterica infections in food-producing animals, and thus zoonosis. Attenuating mutations may be suitable for selection and refinement of live vaccines for food-animals, and these in turn may express heterologous antigens. Further, the data will guide the interpretation of existing and fast emerging datasets on the repertoire, sequence and expression of Salmonella genes and aid the modelling of virulence in a wider evolutionary and ecological context. Our data reflect mutant phenotypes at a specific time and site, and further studies on the temporal and spatial role of Salmonella genes are likely to be informative. This study also establishes TraDIS as a quantitative technology in functional genomics, which has potential for widespread application beyond the realm of microbial pathogenicity.
Animal experiments were conducted according to the requirements of the Animals (Scientific Procedures) Act 1986 (project license number 30/2485) with the approval of the local Ethical Review Committee.
For full details of experimental animals, bacterial strains, materials, molecular biological techniques and statistical methods see Text S1. Briefly, a library of 8550 mini-Tn5 mutants was generated in a spontaneous nalidixic acid resistant variant of S. Typhimurium ST4/74. The mutants were combined into pools of 95 for chickens, and 475 for pigs and calves. Animals were inoculated orally and killed humanely 4 days (chickens and calves) or 3 days (pigs) after infection, or earlier if the clinical endpoint was reached. A section of an appropriate tissue (whole caeca for chickens, spiral colonic mucosa for pigs and distal ileal mucosa for calves) was homogenized and grown overnight on MacConkey agar plates to isolate the output bacteria.
Genomic DNA was prepared from the inocula and output samples, and fragmented to ∼300 bp. An Illumina adapter was ligated to the fragments, and PCRs were performed using an adapter-specific primer in conjunction with primers homologous to each end of the transposon. The sequences of all oligonucleotide primers used in this study are detailed in Table S5. The resultant products were sequenced on single end Illumina flowcells using a sequencing primer designed to read a 10 bp tag of transposon-derived sequence, plus 27 bp of flanking genomic DNA. Sequences containing the tag were mapped to the S. Typhimurium SL1344 genome sequence. A transposon was inferred to be present if there were corresponding reads derived from each end of the transposon in the input pool.
The number of reads corresponding to each transposon in the input pool, and the number of reads mapping to the equivalent position in the output pool data, were compared using DESeq . The ratio of input∶output read counts was determined, after normalisation to account for variations in the total number of reads obtained for each sample, and expressed as log2(fold change), referred to as the fitness score. A negative fitness score indicates an attenuated mutant, a positive score indicates a mutant which was more abundant in the output pool than in the input. For strongly attenuated mutants, no reads were obtained in the output pool, so it was not possible to calculate a finite log2(fold change); such mutants were assigned an arbitrary fitness score of −15. For each individual mutant, the hypothesis that the fitness score was equal to zero (i.e. that the mutant was present at equivalent levels in the input and output pools) was tested using the negative binomial distribution as implemented in DESeq. DESeq models variance under the assumption that mutants with comparable levels of sequence coverage exhibit similar levels of dispersion. We exploited this model to estimate P values for all mutants whilst minimising the number of biological replicates by fitting using only those mutants for which replicate data points were available, and applying the resultant model to the data derived from all mutants.
Defined null mutants were obtained for twelve genes identified as attenuated in the TraDIS screen, and assessed in competition with wild-type ST4/74 during oral infection of chickens. The ratios of mutant∶wild-type bacteria from caecal isolates at day 4, 6 and 10 were compared with those in the inoculum, and the significance of any differences was tested using Student's t-test.
Comparison of fitness scores obtained using TraDIS with the equivalent attenuation scores obtained using TMDH. Values were obtained by investigation of pools of S. Typhimurium SL1344 mutants screened during systemic infection of BALB/c mice using the two technologies.
Venn diagram showing the numbers of genes in which at least one significantly attenuated mutant was identified for each of the four host species.
Venn diagrams illustrating the overlap between attenuated and non-attenuated mutants from the earlier STM studies and the attenuation of mutants with transposon insertions at equivalent loci in the TraDIS datasets a) chickens, b) pigs and c) cattle , .
Illustration of the chorismate biosynthesis pathway, adapted from KEGG . For each step, boxes indicate the EC numbers of the enzyme(s) mediating the specified reactions, and are coloured blue, if a mutant in the associated genes was attenuated during intestinal colonization of chickens and red if the gene was disrupted but not attenuated. White boxes indicate enzymes absent from S. Typhimurium SL1344. Mutants that are defective in multiple stages of the pathway are attenuated, however two of the steps can be catalysed by the products of multiple genes, so inactivation of the individual genes associated with these steps does not result in attenuation.
Box plot of GC content of genes for which attenuated mutants were observed in the chicken TraDIS dataset, and genes for which no attenuated mutants were obtained.
Complete dataset derived from TraDIS investigation of pools of S. Typhimurium SL1344 mutants during systemic infection of BALB/c mice. The table includes the position in the genome and orientation of each transposon identified, identity of disrupted genes, the raw sequence counts, fitness scores, adjusted P values annotation of the predicted function of selected genes. ‘Absent’ indicates a gene that is present in SL1344 but has no identifiable orthologue in the LT2 genome. The equivalent TMDH attenuation scores are included for mutants that could be unambiguously identified in the TMDH dataset.
Complete dataset derived from TraDIS investigation of pools of S. Typhimurium ST4/74 mutants during intestinal colonization of chickens, pigs and cattle. The table includes the position in the genome and orientation of each transposon identified, and details of any disrupted gene. ‘Gene direction’ indicates the orientation of each gene (pink: sense, blue: antisense). Gaps in the fitness score columns indicate data points that were omitted due to stochastic loss of mutants in some animals, for example owing to recovery of output pools of inadequate size to be confident that mutants were absent owing to attenuation rather than chance.
List of all genes disrupted in the TraDIS mutant libraries from the food-producing animal and mouse experiments. For each gene, “yes” indicates that at least one significantly attenuated mutant was identified in that experiment, “no” indicates that the gene was disrupted but no evidence of attenuation was obtained.
Fitness scores of mutants within sRNA genes annotated in the S. Typhimurium SL1344 genome.
Oligonucleotides used in this study.
Supporting data and methods.
Conceived and designed the experiments: RRC EM SEP SJP DJT JP IGC DJM MPS. Performed the experiments: EM SEP SJP DLH HMD JW PMvD AMB AJB GDP DJT GCL AKT. Analyzed the data: RRC IGC DJM MPS. Contributed reagents/materials/analysis tools: RRC SEP AKT. Wrote the paper: RRC EM SEP JP IGC DJM MPS.
- 1. Crump JA, Luby SP, Mintz ED (2004) The global burden of typhoid fever. Bull World Health Organ 82: 346–353.
- 2. Majowicz SE, Musto J, Scallan E, Angulo FJ, Kirk M, et al. (2010) The global burden of nontyphoidal Salmonella gastroenteritis. Clin Infect Dis 50: 882–889.
- 3. Stevens MP, Humphrey TJ, Maskell DJ (2009) Molecular insights into farm animal and zoonotic Salmonella infections. Philos Trans R Soc Lond B Biol Sci 364: 2709–2723.
- 4. Turner AK, Lovell MA, Hulme SD, Zhang-Barber L, Barrow PA (1998) Identification of Salmonella typhimurium genes required for colonization of the chicken alimentary tract and for virulence in newly hatched chicks. Infect Immun 66: 2099–2106.
- 5. Hensel M, Shea JE, Gleeson C, Jones MD, Dalton E, et al. (1995) Simultaneous identification of bacterial virulence genes by negative selection. Science 269: 400–403.
- 6. Shea JE, Hensel M, Gleeson C, Holden DW (1996) Identification of a virulence locus encoding a second type III secretion system in Salmonella typhimurium. Proc Natl Acad Sci U S A 93: 2593–2597.
- 7. Hensel M (2000) Salmonella pathogenicity island 2. Mol Microbiol 36: 1015–1023.
- 8. Galan JE, Curtiss R 3rd (1989) Cloning and molecular characterization of genes whose products allow Salmonella typhimurium to penetrate tissue culture cells. Proc Natl Acad Sci U S A 86: 6383–6387.
- 9. Bispham J, Tripathi BN, Watson PR, Wallis TS (2001) Salmonella pathogenicity island 2 influences both systemic salmonellosis and Salmonella-induced enteritis in calves. Infect Immun 69: 367–377.
- 10. Carnell SC, Bowen A, Morgan E, Maskell DJ, Wallis TS, et al. (2007) Role in virulence and protective efficacy in pigs of Salmonella enterica serovar Typhimurium secreted components identified by signature-tagged mutagenesis. Microbiology 153: 1940–1952.
- 11. Lichtensteiger CA, Vimr ER (2003) Systemic and enteric colonization of pigs by a hilA signature-tagged mutant of Salmonella choleraesuis. Microb Pathog 34: 149–154.
- 12. Morgan E, Campbell JD, Rowe SC, Bispham J, Stevens MP, et al. (2004) Identification of host-specific colonization factors of Salmonella enterica serovar Typhimurium. Mol Microbiol 54: 994–1010.
- 13. Shah DH, Lee MJ, Park JH, Lee JH, Eo SK, et al. (2005) Identification of Salmonella gallinarum virulence genes in a chicken infection model using PCR-based signature-tagged mutagenesis. Microbiology 151: 3957–3968.
- 14. Tsolis RM, Townsend SM, Miao EA, Miller SI, Ficht TA, et al. (1999) Identification of a putative Salmonella enterica serotype Typhimurium host range factor with homology to IpaH and YopM by signature-tagged mutagenesis. Infect Immun 67: 6385–6393.
- 15. Langridge GC, Phan M-D, Turner DJ, Perkins TT, Parts L, et al. (2009) Simultaneous assay of every Salmonella Typhi gene using one million transposon mutants. Genome Research 19: 2308–2316.
- 16. Bentley DR, Balasubramanian S, Swerdlow HP, Smith GP, Milton J, et al. (2008) Accurate whole human genome sequencing using reversible terminator chemistry. Nature 456: 53–59.
- 17. van Opijnen T, Bodi KL, Camilli A (2009) Tn-seq: high-throughput parallel sequencing for fitness and genetic interaction studies in microorganisms. Nat Methods 6: 767–772.
- 18. Gawronski JD, Wong SMS, Giannoukos G, Ward DV, Akerley BJ (2009) Tracking insertion mutants within libraries by deep sequencing and a genome-wide screen for Haemophilus genes required in the lung. Proceedings of the National Academy of Sciences 106: 16422–16427.
- 19. van Opijnen T, Camilli A (2012) A fine scale phenotype-genotype virulence map of a bacterial pathogen. Genome Research 22: 2541–2551.
- 20. Goodman AL, McNulty NP, Zhao Y, Leip D, Mitra RD, et al. (2009) Identifying genetic determinants needed to establish a human gut symbiont in its habitat. Cell Host Microbe 6: 279–289.
- 21. Chaudhuri RR, Peters SE, Pleasance SJ, Northen H, Willers C, et al. (2009) Comprehensive identification of Salmonella enterica serovar Typhimurium genes required for infection of BALB/c mice. PLoS Pathog 5: e1000529 doi:10.1371/journal.ppat.1000529.
- 22. Lawley TD, Chan K, Thompson LJ, Kim CC, Govoni GR, et al. (2006) Genome-wide screen for Salmonella genes required for long-term systemic infection of the mouse. PLoS Pathog 2: e11 doi:10.1371/journal.ppat.0020011.
- 23. Santiviago CA, Reynolds MM, Porwollik S, Choi SH, Long F, et al. (2009) Analysis of pools of targeted Salmonella deletion mutants identifies novel genes affecting fitness during competitive infection in mice. PLoS Pathog 5: e1000477 doi:10.1371/journal.ppat.1000477.
- 24. Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, et al. (2006) From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res 34: D354–357.
- 25. Bjur E, Eriksson-Ygberg S, Aslund F, Rhen M (2006) Thioredoxin 1 promotes intracellular replication and virulence of Salmonella enterica serovar Typhimurium. Infect Immun 74: 5140–5151.
- 26. Valdivia RH, Cirillo DM, Lee AK, Bouley DM, Falkow S (2000) mig-14 is a horizontally acquired, host-induced gene required for salmonella enterica lethal infection in the murine model of typhoid fever. Infect Immun 68: 7126–7131.
- 27. Detweiler CS, Monack DM, Brodsky IE, Mathew H, Falkow S (2003) virK, somA and rcsC are important for systemic Salmonella enterica serovar Typhimurium infection and cationic peptide resistance. Mol Microbiol 48: 385–400.
- 28. Lambert MA, Smith SG (2009) The PagN protein mediates invasion via interaction with proteoglycan. FEMS Microbiol Lett 297: 209–216.
- 29. Gal-Mor O, Gibson DL, Baluta D, Vallance BA, Finlay BB (2008) A novel secretion pathway of Salmonella enterica acts as an antivirulence modulator during salmonellosis. PLoS Pathog 4: e1000036 doi:10.1371/journal.ppat.1000036.
- 30. Datsenko KA, Wanner BL (2000) One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products. Proc Natl Acad Sci U S A 97: 6640–6645.
- 31. Wang Q, Mariconda S, Suzuki A, McClelland M, Harshey RM (2006) Uncovering a large set of genes that affect surface motility in Salmonella enterica serovar Typhimurium. J Bacteriol 188: 7981–7984.
- 32. Chen LM, Kaniga K, Galan JE (1996) Salmonella spp. are cytotoxic for cultured macrophages. Mol Microbiol 21: 1101–1115.
- 33. Fu Y, Galan JE (1998) The Salmonella typhimurium tyrosine phosphatase SptP is translocated into host cells and disrupts the actin cytoskeleton. Mol Microbiol 27: 359–368.
- 34. Kaniga K, Uralil J, Bliska JB, Galan JE (1996) A secreted protein tyrosine phosphatase with modular effector domains in the bacterial pathogen Salmonella typhimurium. Mol Microbiol 21: 633–641.
- 35. Miao EA, Brittnacher M, Haraga A, Jeng RL, Welch MD, et al. (2003) Salmonella effectors translocated across the vacuolar membrane interact with the actin cytoskeleton. Mol Microbiol 48: 401–415.
- 36. Hensel M (2004) Evolution of pathogenicity islands of Salmonella enterica. Int J Med Microbiol 294: 95–102.
- 37. Retamal P, Castillo-Ruiz M, Villagra NA, Morgado J, Mora GC (2010) Modified intracellular-associated phenotypes in a recombinant Salmonella Typhi expressing S. Typhimurium SPI-3 sequences. PLoS ONE 5: e9394 doi:10.1371/journal.pone.0009394.
- 38. Morgan E, Bowen AJ, Carnell SC, Wallis TS, Stevens MP (2007) SiiE is secreted by the Salmonella enterica serovar Typhimurium pathogenicity island 4-encoded secretion system and contributes to intestinal colonization in cattle. Infect Immun 75: 1524–1533.
- 39. Folkesson A, Lofdahl S, Normark S (2002) The Salmonella enterica subspecies I specific centisome 7 genomic island encodes novel protein families present in bacteria living in close contact with eukaryotic cells. Res Microbiol 153: 537–545.
- 40. Edwards RA, Puente JL (1998) Fimbrial expression in enteric bacteria: a critical step in intestinal pathogenesis. Trends Microbiol 6: 282–287.
- 41. Rychlik I, Barrow PA (2005) Salmonella stress management and its relevance to behaviour during intestinal colonisation and infection. FEMS Microbiol Rev 29: 1021–1040.
- 42. Morgan RW, Christman MF, Jacobson FS, Storz G, Ames BN (1986) Hydrogen peroxide-inducible proteins in Salmonella typhimurium overlap with heat shock and other stress proteins. Proc Natl Acad Sci U S A 83: 8059–8063.
- 43. Knuth K, Niesalla H, Hueck CJ, Fuchs TM (2004) Large-scale identification of essential Salmonella genes by trapping lethal insertions. Mol Microbiol 51: 1729–1744.
- 44. Kroger C, Dillon SC, Cameron AD, Papenfort K, Sivasankaran SK, et al. (2012) The transcriptional landscape and small RNAs of Salmonella enterica serovar Typhimurium. Proceedings of the National Academy of Sciences of the United States of America 109: E1277–1286.
- 45. Richardson EJ, Limaye B, Inamdar H, Datta A, Manjari KS, et al. (2011) Genome sequences of Salmonella enterica serovar typhimurium, Choleraesuis, Dublin, and Gallinarum strains of well- defined virulence in food-producing animals. J Bacteriol 193: 3162–3163.
- 46. Marteyn B, West NP, Browning DF, Cole JA, Shaw JG, et al. (2010) Modulation of Shigella virulence in response to available oxygen in vivo. Nature 465: 355–358.
- 47. Sanderson KE, Stocker BAD (1987) Salmonella typhimurium strains used in genetic analysis. In: Neidhardt FC, Ingraham JL, Low KB, Magasanik B, Schaechter M et al.., editors. Escherichia coli and Salmonella typhimurium: cellular and molecular biology. Washington D.C.: ASM Press. pp. 1220–1224.
- 48. Swords WE, Cannon BM, Benjamin WH Jr (1997) Avirulence of LT2 strains of Salmonella typhimurium results from a defective rpoS gene. Infection and Immunity 65: 2451–2453.
- 49. Kingsley RA, Msefula CL, Thomson NR, Kariuki S, Holt KE, et al. (2009) Epidemic multiple drug resistant Salmonella Typhimurium causing invasive disease in sub-Saharan Africa have a distinct genotype. Genome Res 19: 2279–2287.
- 50. Eckert S, Dziva F, Chaudhuri RR, Langridge GC, Turner DJ, et al. (2011) Retrospective application of transposon-directed insertion-site sequencing to a library of signature-tagged mini-Tn5Km2 mutants of Escherichia coli O157:H7 screened in cattle. J Bacteriol 193: 1771–1776.
- 51. Becker D, Selbach M, Rollenhagen C, Ballmaier M, Meyer TF, et al. (2006) Robust Salmonella metabolism limits possibilities for new antimicrobials. Nature 440: 303–307.
- 52. Anders S, Huber W (2010) Differential expression analysis for sequence count data. Genome Biol 11: R106.