Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Sequencing of the invasive E. coli strain BEN2908 isolated from poultry: A comparative investigation of genomic regions shared with intestinal and extraintestinal model E. coli strains

Abstract

Extraintestinal pathogenic Escherichia coli (ExPEC) cause disease outside the gut and include avian pathogenic E. coli (APEC), a leading cause of bacterial infections in poultry. Among their highly diverse types, strain BEN2908 stands out for its significant invasive ability across various human and avian cell types. Aiming to investigate further aspects of this strain and its plasmid, we sequenced and assembled the complete genome of BEN2908 and compared it to 22 E. coli strains, including other invasive strains such as adherent and invasive E. coli (AIEC) LF82 and NRG857c, by constructing a phylogenetic tree and using web-based characterization software. With these results, we selected eight strains closely related to BEN2908 to perform a ring comparison, including two APEC (APEC O1 and IMT5155), two neonatal meningitis E. coli (NMEC; RS218 and IHE3034), two uropathogenic E. coli (UPEC; 78-Pyelo and CFT073), one commensal E. coli (MG1655) and one adherent-invasive E. coli (AIEC; LF82). This revealed 20 genomic regions (GRs) of interest which were then analysed by CD-Search, BLASTp and KEGG Pathway databases. Many of the genes in these GRs had no previous description but showed similarity to known genes involved in sugar uptake, nitrogen metabolism, and dicarboxylate transport and processing, among other functions. These results were tabulated and used to infer possible pathways that could be involved in ExPEC pathogenesis, highlighting candidate genes that have been overlooked in ExPEC research.

Introduction

Avian pathogenic Escherichia coli (APEC) causes avian colibacillosis, a prevalent bacterial infection that can affect birds of all ages and at all stages of poultry production. It manifests in a range of clinical forms, including omphalitis in embryos, salpingitis in laying hens, cellulitis, airsacculitis, perihepatitis, peritonitis, and septicaemia [1]. Avian colibacillosis results in high morbidity and mortality leading to economic losses in the industry throughout the world (e.g., in Netherlands, losses due to salpingitis were estimated at € 0.4 million, € 3.3 million and € 3.7 million for the layer-sector, the meat-sector and poultry farming, respectively [2]). APEC strains are typical commensal inhabitants of the chicken intestine, exhibiting a high genetic diversity, reflected by the various serogroups and sequence types (ST) isolated from clinical cases. To cause an extraintestinal infection, though, an APEC would require siderophores, which permit them to survive in the host fluids (poor in free iron), and protectins (such as K1 capsule), with which they can evade host defenses. Yet, even if some virulence genes/strategies have been identified, the pathophysiology of avian colibacillosis remains incompletely understood, and additional determinants likely remain to be discovered. One such example is the dicarboxylate uptake regulator, DctR, which appears to contribute to biofilm formation, serum resistance, adherence and colonization in ducks [3]. In that perspective, many APEC genomes have been sequenced; to date (December, 2024), of the 349,451 E. coli genomes available on Enterobase, 21,666 were isolated from poultry, of which 468 reportedly belong to ST95. ST95 strains exhibit a broad host range, and phylogenetic studies show that strains isolated from different species share numerous genes and SNPs, indicating a substantial degree of genomic overlap [4]. Among E. coli strains of this ST, BEN2908 (O2:H5:K1) is one of the APEC strains that has been most studied. Moreover, the ST95 complex (STc95, a ST complex defined as having at least three STs that differ among each other by no more than two of the seven loci) is also closely related to different types of ExPEC infection, being one of the five most common STc in ExPEC, at least for the past few decades [58].

Strain BEN2908 is a nalidixic acid resistant derivative of strain MT78, isolated in 1977 from the trachea of a one-day old chick in France [9]. BEN2908 is an efficient colonizer of the chicken intestine [10,11], and its inoculation either by the air sacs or intratracheally results in severe systemic infection [12,13]. In mice, transurethral inoculation of BEN2908 results in urinary infections comparable to those caused by human UPEC strains [14]. Remarkably, BEN2908 is able to metabolize short chain fructooligosaccharides, conferring a competitive advantage to colonize the intestine of chickens [10,11,15]. Carbohydrate metabolism also seems to play a role in colonization of the lungs and/or air sacs by BEN2908 [16]. Genes linked to sugar metabolism also contribute to bacterial fitness under stressful conditions such as oxygen restriction, the late stationary phase of growth, or growth in serum or in the intestinal tract [17,18]. Type 1 fimbriae expression, in particular, seems to be dependent on regular cytosolic levels of carbohydrates [18,19]. Further, this strain can also invade and survive within avian and human cells including avian fibroblasts [20] and hepatocytes (LHM), human pneumocytes (A549) [21], human brain microvascular endothelial cells (HBMEC) [22], and human intestinal cells (Intestine-407), at levels comparable to AIEC strains [23]. Moreover, BEN2908 has also been used as a model to study important ExPEC-specific genes such as ibeA, which contributes to adherence to host cells, resistance to oxidative stress and virulence in both avian and mammalian hosts [22,24,25]. Due to its ability to induce colibacillosis in poultry, cause urinary tract infections in murine models, and invade various types of human cells, BEN2908 represents a highly versatile and cross-host pathogenic E. coli.

Another subset of pathogenic E. coli includes those belonging to the AIEC pathotype, which are commonly associated with high adhesion and invasion rates causing inflammatory bowel diseases (IBD) in humans, such as ulcerative colitis or Crohn’s disease. Strain LF82 is an extensively studied AIEC reference strain and one of the first identified as capable of inducing IBD, followed by AIEC NRG857c, isolated from the ileum of a patient with Crohn’s disease [26,27]. In addition to its capacity to invade different intestinal cell lines, these AIEC strains display notable genomic similarity to some ExPEC strains, such as strain APEC O1 and UPEC strain UTI89 [28]. AIEC strains also have some well-known virulence factors that are essential for adherence, invasion and survival inside host cells, such as the serine autotransporter protease Vat, which induces vacuolization and cytoskeleton rearrangements in avian, murine and human cells [2931]. Moreover, transcriptomics and Tn-seq analyses identified LF82-specific genes that might be implicated in the growth or survival of intracellular bacterial communities (IBCs). This investigation uncovered three noteworthy gene clusters: the High Pathogenicity Island (HPI), a putative type 6 secretion system (T6SS), and a region associated with carbohydrate metabolism [31]. Curiously, the functionality of those three clusters has already been correlated to ExPEC, especially APEC, pathogenesis by many different authors [3236].

In this study, we perform a comparative genomic analysis of BEN2908 to ten ExPEC strains, ten intestinal pathogenic E. coli (InPEC) strains, and two commensal strains, to identify genomic features common to these different strains, that are potentially underlying their ability to cause pathogenesis. To achieve that, we: (i) demonstrated the close evolutionary relationship between BEN2908, AIEC LF82 and NRG857c and other ExPEC model strains through a phylogenetic analysis, complemented by different characterization programs, (ii) identified and analysed the content of 20 genomic regions (GRs) in common to these strains that were absent in the commensal E. coli K-12, and (iii) inferred the possible origin of some of these features by comparison to analogous genomic modules from other bacteria. Through this approach, our aim was to identify novel genes or gene modules that may have a functional role in E. coli pathogenesis due to their homology to other known genes. Moreover, the complete sequence of the chromosome and plasmid, identified, described, and analysed in this study will be useful to study more comprehensively the pathogenic mechanism of strain BEN2908 and other invasive E. coli.

Methods

BEN2908 DNA extraction, sequencing, assembly and annotation

Nanopore sequencing and assembly were performed by the Genome and Transcriptome Facility at Bordeaux, France. For the extraction, 5 µg of genomic DNA were sheared to 20 Kb using Megaruptor 2 (Diagenode). Sheared DNA was End-Repaired using Oxford Nanopore recommendations for 1D Ligation sequencing (LSK-SQK 108), with minor modifications, as follows. 48 µL of sheared DNA were incubated with 7 µL of Ultra II End-prep reaction buffer and 3.5 µL of Ultra II End-prep enzyme mix (New England Biolabs) at 20 °C for 15 minutes and at 65 °C also for 15 minutes. The sample was then cleaned up using 1.0X of AMPure XP beads and barcoded using NBD-103 kit (Oxford Nanopore Technologies). After that, 22.5 µL of clean repaired DNA and 25 µL of Blunt/TA Ligase Master Mix (New England Biolabs) were added to 2.5 µL of barcode and incubated at room temperature for 15 minutes. Barcoded samples were cleaned up again and quantified using Qubit fluorometer (Invitrogen) for equimolar pooling. Then, 2.2 µg of pooled DNA were ligated to AMX adapter and purified according to Oxford Nanopore recommendations. Afterwards, 13 µL of the library were loaded into a MinION Flow cell (FLO-MIN106 R9.4) and sequenced during 48 hours on a GridION x5. The obtained raw data (.fast5 files) were base called in high accuracy mode and filtered using Guppy (v. 4.0.11) by applying a minimum quality score of Q7. This resulted in 752,742 reads above the Q7 cutoff. Read quality and length distributions were then assessed using NanoPlot [37] and compared with the dataset prior to filtering.

Illumina paired-end sequencing was performed in 2015 at the Genome Québec facility, at McGill University (Montreal, Quebec, Canada) in an Illumina MiSeq machine; sequencing generated 3,274,430 reads with 250 bp of length and 50% GC content. Illumina reads were trimmed for adapters and low-quality bases using Trimmomatic (v. 0.32) [38], with the following parameters: ILLUMINACLIP:TruSeq3-PE.fa:2:30:10; LEADING:30; TRAILING:30; HEADCROP:20; MINLEN:150. This resulted in 2,476,272 paired reads with minimum length of 150 bp, which were further filtered with a Q20 cutoff using fastq_quality_filter (v. 1.0.0), available on Galaxy platform (v. 25.0) [39]. After filtering, a total of 1,713,025 reads were retained, yielding a coverage depth of 282x and breadth of 99.86%, as assessed by BWA (v. 0.7.19) [40]. The files and logs of the programs mentioned above, including the FastQC score (v. 0.12.1)(https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) are available on the GitHub repository referred in the Data availability section. Then, the Nanopore files were assembled with Canu (v 1.6) [41] and polished with the Illumina reads obtained using Pilon (v. 1.22) [42]; three runs were necessary until no corrections were made anymore. Finally, the annotation was made with the classic RAST workflow available online [43].

Genomic comparison and characterization of E. coli strains

In addition to BEN2908, described above, 22 other E. coli genomes used in this study were downloaded from NCBI’s database under the following accession numbers and were also submitted to RAST for annotation: 55989 (GCF_000026245.1), 11368 (GCF_000091005.1), 042 (GCF_008042015.2), H10407 (GCF_000210475.1), 2009EL-2050 (GCF_000299255.1), 11128 (GCF_000010765.1), E2348/69 (GCF_000026545.1), E24377A (GCF_000017745.1), O157:H7 EDL933 (GCF_000732965.1), O157:H7 Sakai (GCF_000008865.1), IMT5155 (GCA_000813165.1), APEC O1 (GCA_000014845.1), IHE3034 (GCA_000025745.1), RS218 (GCA_000800845.2), NRG857c (GCA_000183345.1), LF82 (GCA_000284495.1), CFT073 (GCA_014262945.1), χ7122 (GCA_000307205.1), 78-Pyelo (GCA_014131615.1), UTI89 (GCA_000013265.1), SCU-397 (GCA_013358385.1), and K-12 MG1655 (GCA_000005845.2). Regarding extraintestinal strains, these comprised three APEC (APEC O1, IMT5155 and χ7122) [4446], two NMEC (RS218 and IHE3034) [47,48] and three UPEC strains (UTI89, CFT073 and 78-Pyelo) [4951]. Regarding intestinal strains, these comprised four enterohemorrhagic E. coli (EHEC; O157:H7 Sakai, O157:H7 EDL933, 11368 and 11128) [5255], three enteroaggregative E. coli (EAEC; 042, 2009EL-2050 and 55989) [5658], two enterotoxigenic E. coli (ETEC; H10407 and E24377A) [59,60], two AIEC (LF82 and NRG857c) [27,28], one enteropathogenic (EPEC; E2348/69) [61], and two commensal strains (SCU-397 and MG1655) [62,63]. Multilocus sequence type (MLST) and serotype information of these strains were described in the above cited publications and information was verified using data from the Enterobase database [64] and the following software: ClermonTyping (v. 24.02) [65], SerotypeFinder (v. 2.0) [66], MLST (v. 2.0) [5,67,68], and FimTyper (v. 1.0) [69]. Other programs, such as VFanalyzer (v. 6.0) [70], CRISPRCasTyper (v. 1.8.0) [71], MinCED [72], SecReT6 (v. 3.0) [73], PHASTEST (v. 3.0) [74], Roary (v. 3.13.0) [75] and KEGG (release 116) [76] were also used to complement this characterization. The KEGG Pathway (KP) and KEGG Orthology (KO) databases were used by mapping the KO assignment numbers of the uncharacterized ORF homologs identified in this work onto KP maps. This allowed us to predict metabolic pathways potentially related to the molecular functions of the novel ORFs. However, these predictions are based on homology-derived KO assignments and require experimental validation of protein activity, regulation and specificity. All programs, excepting Roary, were run using the .fasta files of the 23 strains on their respective web-based platforms using default parameters. Roary was executed in Linux Ubuntu 24.04.3 LTS terminal with default alignment configurations using the .gff files of each strain. The -r and -e parameters were used to generate R plots and align core genes using PRANK [77], respectively.

With the annotated genomes, Orthofinder (v. 2.5.4) [78] was used for the identification of orthogroups (i.e., groups of genes with an evolutionary relation). Afterwards, the files generated from this analysis were used for constructing an unrooted phylogenetic tree, described below.

RAxML unrooted tree generation and average amino acid identity (AAI) to AIEC LF82

Phylogenetic tree construction was based on two files generated with Orthofinder: (i) a .txt file identifying all the single copy orthogroups (i.e., the identification of all groups of homologous genes that possess only one allele corresponding to each strain) and (ii) a .csv file containing all the Orthogroups found. Based on the information contained in these two files, Python scripts were made for the following operations: (i) generation of one .fasta file for each single copy orthogroup found; (ii) alignment of the homologous genes contained in each single copy orthogroup .fasta using MUSCLE (v.5) [79]; and (iii) concatenation of all the aligned genes in a single .multifasta file with one entry for each strain. After, trimAl (v. 1.5) was used to trim ambiguous regions in the alignment, resulting in 23 sequences of 987,155 amino acids, which were converted in a .phy file, using a custom Python script. Then, ModelFinder (from IQ-TREE v. 3.0.1) [80] was used to identify the best substitution matrix. After that, RAxML [81] was executed using a JTT substitution matrix incorporating a proportion of invariable sites and a gamma rate of heterogeneity. Node support was assessed with 1,000 nonparametric bootstrap replicates. To validate the tree topology, three additional phylogenies were generated under alternative substitution matrices and evaluated with bootstrap replicates. The trees and supporting files are available in the GitHub repository. Finally, the web tool iTOL (v.7)(interactive Tree of Life) [82] was used for tree visualization and image editing. To complement the result of the tree, the program EzAAI (v. 1.2.4) [83] was used to obtain AAI values and proteome coverage percentages of all strains to AIEC LF82.

Ring image generation and CDS functional characterization

The ring comparison was generated using the software BRIG (BLAST Ring Image Generator; v. 0.95) [84]. The chromosomal ring comparison was made by setting BEN2908 as the reference genome against the genomes of the ExPEC/AIEC strains that clustered in the phylogenetic tree (grey arrows), excepting UTI89 and NRG857c, which presented the same alignment pattern of RS218 and LF82, respectively. These comprised two APEC (red tones; IMT5155 and APEC O1), one AIEC (orange; LF82), two NMEC (purple tones; RS218 and IHE3034), two UPEC (green tones; 78-Pyelo and CFT073), and one commensal E. coli (grey; K-12 MG1655). The plasmidial ring comparison was made by setting pBEN2908 as the reference and the sequences of four other ColV-like plasmids screened using Liu et al. criteria (2018) [85], i. e., if at least one gene from four or more of the following gene clusters were present: (i) cvaABC and cvi (the ColV operon), (ii) iroBCDEN (the salmochelin operon), (iii) iucABCD and iutA (the aerobactin operon), (iv) etsABC, (v) ompT and hlyF, and (vi) sitABCD. The four plasmids comprised three APEC plasmids (red tones; p1ColV5155, pAPECO1Col-BM, and pAPEC-1) and one AIEC plasmid (orange; pO83-CORR). BLAST was run with BLAST+ (v. 2.12.0) using default BLASTn parameters (word_size = 11; reward = 2; penalty=−3; gapopen = 5; gapextend = 2; e-value = 10). Regions containing at least 4 kb that were absent (coverage values below 50%) in the commensal K-12 strain were tagged as genomic regions (GR) of interest. The 4 kb cutoff was chosen because it approximately corresponds to the size of small functional gene clusters (around 3−4 genes, given E. coli general gene density of 1 gene per kb) and because the same threshold has precedent in genomic descriptions of related APEC strains (APEC O1 and IMT5155), which aids comparative interpretation [44,45]. S1 Table shows a relation of the GC content, nucleotide coverage, and nucleotide identity of each GR from BEN2908 against the genomes of the other strains using BLASTn optimized for highly similar sequences (megablast). Each GR range drawn in the ring was determined by the end of the last CDS common to all strains until the beginning of the first CDS common to all strains. These limiting CDS are written in the “GR Limits” column in Table 3 and S2 Table. Further, the names of twenty-one previously identified regions, including four from BEN2908 genome (GimA, GimB, AGI-1, and AGI-3) [16,86,87] were also added to the ring comparison.

The predicted putative functions of some of the genes and operons within the GRs were verified using BLASTp and CD-Search (v 3.21) [88,89], and were classified as follows: Sugar metabolism (SM), Prophages (Phg), Metabolism of iron and other metals (Met), Secretion Systems (SS), Adhesion and Invasion (A/I), and Defense mechanisms (Def). The other genes selected that did not correlate with these functions were grouped as General Metabolism (GM). The domains and genes selected on “CD-Search prediction” and “BLASTp identity” columns in Table 5 and S2 Table were, respectively, the hit result with an expect value closer to 0 and, preferentially, a reviewed entry from the UniProtKB database. Unreviewed entries were selected when an unreviewed homolog had a relevant mention in a previous paper (ORFs 36-56 [90]; ORF 234 [91]; ORF 534 [92]; and ORFs 934-1334 [93] or when no reviewed entries were found (ORF 15 and ORF 16). Each novel ORF identified in this work or previously reported CDS or region were referred to as a “feature”. For prophage-containing GRs, the boundaries were defined by the region between the attachment sites identified in PHASTEST, except for GRs 11 and 14, where no attachment sites were found. S2 Table contains a summary of the features and their characterization in all 36 GRs.

Results and discussion

Assembly, characterization and phylogenetic analysis of BEN2908

Sequencing analysis indicated that the genome of BEN2908 comprised the chromosome and a single plasmid. The chromosome contig has 5,061,728 bp and 50.6% GC content, and the plasmid has 133,673 bp and 49.8% GC content. The characterization of BEN2908 and the strains used in this study was conducted using the web-based programs from CGE, as described in the Methods section. Strain BEN2908 (phylogroup B2; serotype O2:H5) exhibits a characteristic STc95 APEC profile, similar to APEC strain IMT5155 (B2; O2:H5:K1) and APEC O1 (B2; O1:H7) (Table 1).

thumbnail
Table 1. Characterization of the strains used in this study1.

https://doi.org/10.1371/journal.pone.0342894.t001

To compare the genome of BEN2908 to other E. coli strains, we generated an unrooted phylogenetic tree using the 3,101 common orthologues identified by Orthofinder (Fig 1). The strains used in this tree were selected to evaluate which typing scheme best clusters BEN2908 and to elucidate which pathogenic pathotypes show greater affinity to the AIEC strains LF82 and NRG857c. For those reasons, in addition to the three strains mentioned above, 20 model E. coli strains from different phylogroups, sequence types, serotypes, and pathotypes were selected. Also, considering the importance of type 1 fimbriae (T1F) to invasion, strains of different fimH types were also selected. Gene fimH is commonly used as T1F typing gene because it directly mediates attachment to host cells, and it has been shown that its allelic variation alters affinity for mannosylated receptors, affecting adhesion and invasion [94,95]. In the phylogenetic tree (Fig 1), BEN2908 clustered with other STc95 strains, especially close to APEC strain IMT5155 which has the same serotype (O2:H5:K1), but a different ST due to one allelic differentiation in the adenylate kinase housekeeping gene (adk55 rather than adk13). Another noticeable aspect of the tree is that the AIEC strains LF82 and NRG857c are more closely related to the eight ExPEC strains (grey arrows), including BEN2908, than to any other InPEC strains analysed in this work. This is further supported by the AAI values and proteome coverage analysis, which shows that LF82 proteins have more identity and are more covered if compared to these ExPEC strains than to any InPEC, excepting AIEC NRG857c. These results suggest that STc135 AIEC orthologs closely resemble those of the ExPEC strains that clustered with BEN2908 (also known for its invasive capacity), implying that some genomic features may underlie their pathogenic traits.

thumbnail
Fig 1. Unrooted phylogenetic tree generated and AAI values of each strain to AIEC LF82.

Maximum likelihood phylogeny using the 3,101 common concatenated orthologues detected by OrthoFinder. The strains used are described in Table 1. Branches colored blue represent ExPEC, red represent InPEC, and green represent Commensal strains. The red dotted circle indicates that all nodes within have 100% bootstrap support. The grey arrows indicate the strains that clustered together and showed the highest AAI values and proteome coverage to AIEC LF82.

https://doi.org/10.1371/journal.pone.0342894.g001

Supplementary characterization using VFanalyzer, CRISPRCasTyper, Roary and PHASTEST and comparison to AIEC strains

The genetic proximity shown above is further supported by the results of additional characterization tools, such as CRISPRCasTyper and MinCED (for Cas typing and CRISPR spacers identification), VFanalyzer (for virulence related genes screening), and Roary (for core genome comparisons). CRISPR/Cas analysis revealed that, with the exception of CFT073, 78-Pyelo (which doesn’t possess a CRISPR/Cas system) and χ7122, all ExPEC and both AIEC harbour the same Cas subtype (I-F), contrasting to the I-E subtype found in every InPEC and commensal strain (S3 Table). Additionally, excepting χ7122, at least half of the spacers from every ExPEC strain were identified in LF82 and at least one-third in NRG857c, contrasting with none found in common to InPEC or commensal strains (S4 Table). This suggests a genetic relationship between these AIEC and ExPEC, since strains that share a recent evolutionary history often have identical spacers [96]. The presence of virulence and core genes also follow this trend, as a higher proportion of genes common to AIEC and ExPEC were found on both VFanalyzer and Roary. Roary results showed that AIEC strains have more genes in common to ExPEC (3,352) rather than InPEC (3,119) strains, yielding a difference of 233 genes (full output available on GitHub). VFanalyzer identified 85 virulence-related genes in both AIEC strains. Of these, six genes (toxins vat and usp, invasin ibeA, metal transport periplasmic binding protein sitA, and two T6SS components) were found exclusively in at least one ExPEC and absent in all other InPEC strain, in contrast to only three genes (long polar fimbriae genes lpfBCE) exclusively shared with InPEC strains (full output in S5 Table). Despite the small numeric difference, the genes shared exclusively with ExPEC are not restricted to a single locus as the lpfBCE genes, and their presence and functionality are usually related to ExPEC virulence traits, such as pyelonephritis, HBMEC invasion, and cytoskeletal modifications on different hosts cells [22,29,9799]. In contrast, PHASTEST results diverged from the pattern observed, as AIEC most common phages were almost evenly distributed among InPEC and ExPEC strains (S6 Table).

pBEN2908 is a ColV-like plasmid

Colicin V (ColV) plasmids have been shown to play a pivotal role in the pathogenesis of extraintestinal pathogenic E. coli. Its name was attributed to the production of colicin V, a bacteriocin originally described by Gratia in 1925 as “factor V” [100]. These plasmids usually range from 80 to 180 kbp and were identified in a variety of E. coli hosts, with particular significance in poultry infections [101104]. Besides being widespread among hosts, the pathogenicity acquired from those plasmids is also noteworthy: acquisition of ColV-like plasmids by commensal/environmental E. coli from phylogroup B1 was associated with a more than threefold increase in infection rates among patients at a hospital in Paris [8,105]. This pathogenic potential relies on their gene content which encode various virulence factors, especially metal transport and uptake systems, such as the siderophores encoded by the aerobactin (iuc/iut) and salmochelin (iro) genes, but also the sitABCD and hypothetical etsABC metal transport systems [106]. The iss (increased serum survival) gene is also present on ColV plasmids and may contribute collectively with other systems to resist the bactericidal activity of serum complement [107]. Other genes such as outer membrane vesicle regulator hlyF and outer membrane protease ompT, type II toxin-antitoxin system vapBC, SOS inhibition system psiAB, and the conjugative transfer system tra/trb are also commonly found on these plasmids. To screen for ColV-like plasmids we applied the criteria defined by Liu et al. [85] to the plasmids from 18 strains in our selection, as five strains (78-Pyelo, CFT073, 55989, MG1655, and SCU-397) did not harbour any plasmids. Of the 41 plasmids tested (description in S7 Table), only five fitted the criteria: four from the APEC strains (among which BEN2908) and one from the AIEC strain NRG857c (Table 2). To facilitate comparison, we generated a ring comparison containing only these five ColV-like plasmids and the ColV-associated genes described above with pBEN2908 as reference (Fig 2). The pLF82 is distinct from ColV plasmids, closely resembling the cryptic pHCM2 of Salmonella enterica serovar Typhi CT18 (isolated from a typhoid fever patient in Vietnam) [28]. This cryptic plasmid harbours phage‐derived regions, a parAB‐like partitioning module, a suite of DNA replication genes (including helicases, ligases, and exonucleases), and coding sequences similar to genes rnhA (ribonuclease H), dhfR (dihydrofolate reductase), thyA (thymidylate synthase), and nrdAB (ribonucleotide diphosphate reductase), involved in nucleotide synthesis [108]. Despite this difference, finding a ColV-like plasmid uniquely in an AIEC strain supports the idea that these plasmids may act as a zoonotic pathogenic trait derived from APEC [85].

thumbnail
Table 2. Table of identities of ColV-like plasmids and the identity of their genes to the clusters from p1ColV51551.

https://doi.org/10.1371/journal.pone.0342894.t002

thumbnail
Fig 2. Plasmid comparison using the software BRIG, setting pBEN2908 as the reference strain.

From the innermost to the outermost ring, the following plasmids are shown: p1ColV5155 (NZ_CP005931.1; IMT5155), pAPEC-O1-ColBM (DQ381420.1; APEC O1), pAPEC-1 (CP000836.1; χ7122), and pO83-CORR (CP001856.1; NRG857c). Genes usually present in ColV-like plasmids and the four gene clusters used in Liu et al. for ColV plasmid screening are shown in the figure [85].

https://doi.org/10.1371/journal.pone.0342894.g002

Overview and metabolic functions of the GRs in common between ExPEC and AIEC strains

While numerous genes have already been reported to contribute to virulence of extraintestinal E. coli [109,110], our research focused on identifying specific genomic regions possibly harbouring novel genes that could have a role in pathogenesis. Therefore, in an attempt to further analyse the genomic content of the strains that clustered with BEN2908 and AIEC LF82/NRG857c (Fig 1; grey arrows), we generated a ring comparison using the software BRIG. The BEN2908 genome was set as the reference and the following strains as subjects: two APEC (IMT5155 and APEC O1), two NMEC (RS218 and IHE3034), two UPEC (78-Pyelo and CFT073), one AIEC (LF82), and one commensal E. coli strain (MG1655).

The ring comparison identified 36 regions larger than 4 Kbp in the BEN2908 chromosome that were absent (below 50% of coverage) in the commensal strain MG1655 (Fig 3, S1 Table). To characterize the features within these regions, we used BLASTp and CD-Search. This analysis enabled us to classify the features into one or more of the following categories: “Sugar Metabolism” (SM), “Prophages,” “Metabolism of Iron and other metals” (Met), “Secretion Systems” (SS), “Adhesion and Invasion” (A/I), and “Defense Mechanisms” (Def). Among these, 20 regions were also identified in all strains, excepting the commensal MG1655 with 15 exhibiting more than 70% coverage (Table 3, S1 Table). Features that did not fit into any of these categories were grouped as “General Metabolism” (GM). To better infer the functions of the uncharacterized features in this section, we used the KP and KO databases to investigate whether any pathway maps could be related to some of the hypothetical molecular functions of the ORFs identified. The following subsections describe the content of these 20 GRs - which are summarised in Table 4 (Reported features) and Table 5 (Uncharacterized features) – according to their respective categories, while also linking these findings to important ExPEC virulence traits.

thumbnail
Table 3. Genomic regions (GRs) of BEN2908 with more than 50% coverage to the ExPEC strains from this study and AIEC LF82.

https://doi.org/10.1371/journal.pone.0342894.t003

thumbnail
Table 4. Summary of Genomic Regions (GRs) present in both ExPEC and AIEC strains from this study containing Reported features.

https://doi.org/10.1371/journal.pone.0342894.t004

thumbnail
Table 5. Genomic Regions (GRs) present in both ExPEC and AIEC strains from this study containing Uncharacterized features and their predicted identification.

https://doi.org/10.1371/journal.pone.0342894.t005

thumbnail
Fig 3. Genome comparison using the software BRIG, setting BEN2908 as the reference strain.

From the innermost to the outermost ring, the following strains are shown: APEC IMT5155 and APEC O1 (red tones), NMEC RS218 and IHE3034 (purple tones), UPEC 78-Pyelo and CFT073 (green tones), AIEC LF82 (yellow) and commensal K-12 MG1655 (grey). Genomic regions (GRs) identified by the criteria described in the Methods section are indicated by numbers 1 to 36. The 20 regions marked in orange are common to all strains, except K-12 MG1655.

https://doi.org/10.1371/journal.pone.0342894.g003

Sugar metabolism (SM).

Reported. The genomic regions 6, 19, 21, 23, 27 and 29 contain genes predicted to contribute to sugar metabolism (S2 Table), one of which (GR 27) has already been described. GR 27 contains a carbohydrate metabolic operon, named frz [18]. In this work, the authors showed that the presence of the frz operon promoted fitness, adhesion and internalization to different eukaryotic cell lines. Moreover, hybridization analysis of 151 ExPEC and 35 non-pathogenic avian E. coli strains showed that this operon is rarely present in non-pathogenic strains (5%) and its association increases with virulence, reaching 75% in the most virulent group [18].

Uncharacterized. The number of distinct GRs containing genes related to SM supports the importance of varied carbohydrate usage and uptake, enabling the bacteria to acquire various sugars that could be linked to other important metabolic functions. Four genomic regions identified in the ring comparison (GRs 19, 21, 23 and 29) encode genes predicted to be related to the transport of carbohydrates (Table 5) such as Phosphotransferase systems (PTS: ORFs 419 and 129), ATP-binding cassette transporters (ABC-T: ORFs 323, 423, and 523) and Tripartite ATP-independent periplasmic transporters (TRAP-T: ORFs 521, 621, and 721) as well as probable transcriptional regulators (ORFs 519, 121, 723, 429, and 529). Additionally, with the exception of GR 21, hypothetical epimerases or isomerases were also present in those GRs (ORF 219, ORF 123, and ORF 129), likely playing a role in converting isomeric sugars into a form that can be metabolically processed by other enzymes following uptake.

Besides transport systems, additional enzymes potentially involved in carbohydrate processing were also identified. GR 6 harbours a pair of CDSs (ORFs 66 and 76) structurally analogous to the PdxA2 and DUF1537 protein families. This gene pair is involved in pyridoxal-5’-phosphate synthesis, an important cofactor for many enzymes such as alanine aminotransferases, which catalyze the reversible transfer of an amino group from alanine to 2-oxoglutarate to generate glutamate and pyruvate [90,125]. Some of the homologs of the identified ORFs in this region also cluster together in some bacteria. In Sinorhizobium meliloti 1021, for instance, enzymes structurally analogous to a sialidase (ORF 36), a 4-hydroxy-tetrahydrodipicolinate synthase (dapA; ORF 46), and an alcohol dehydrogenase (adh; ORF 56), are encoded in a similar cluster. Additionally, S. meliloti carries an ABC transporter absent from the pathogenic E. coli strains. However, CD-Search of ORF 16 returned a modest match to a citrate permease (Table 5, S2 Table), suggesting a possible transporter role that would require more robust confirmation [90].

Other features could also be noticed in these regions. Region 19, for instance, encodes a probable phosphoglycerate dehydrogenase (ORF 119) and a beta-cystathionase (ORF 319) with structural similarity to malY. MalY is capable of cleaving C-S linkages, producing central metabolic compounds such as pyruvate, and also represses the activity of the maltose regulon, being involved in catabolic regulation [126]. Additionally, in GR 23, a gene encoding a hypothetical tagatose aldolase (ORF 623), showing high similarity to the catalytic GatY aldolase subunit from E. coli K-12 (63.95% identity) was identified. This enzyme plays an important role in glycolysis and gluconeogenesis, as it reversibly converts tagatose-1,6-bisphosphate to D-Glyceraldehyde-3P (G3P) and dihydroxyacetone phosphate (DHAP) [127]. Interestingly, by analysing three KEGG pathway maps it was possible to trace a connection among ORFs 221 and 321 (GR 21), and 223/823 (GR 23) that may also lead to the production of these compounds (Fig 4).

thumbnail
Fig 4. Predicted sugar metabolism pathway based on KEGG maps and KEGG Orthology.

ORFs highlighted in purple represent putative novel coding sequences, while those in red correspond to genes located in other regions of the genome. ORFs from GRs 19, 21, 23 appear to be involved in the conversion of different sugars to KDPG, and, after, to Pyruvate and G3P, two important intermediates of glycolytic pathways.

https://doi.org/10.1371/journal.pone.0342894.g004

The predicted protein product of ORF 221 presented homology to YjjN, an enzyme that converts L-galactonate to D-tagaturonate [128], which is a molecule present in both “Ascorbate and Aldarate metabolism” and “Pentose and Glucuronate interconversions” KEGG pathway. On this second map, the homolog of ORF 321, named UxuB, catalyses the reversible interconversion of D-fructuronate to D-mannonate, and vice-versa [129]. By employing other enzymes present elsewhere on the genome (Fig 4, depicted in red: UxaA, UxaB, and UxuA), both reaction products (D-tagaturonate and D-mannonate) may be transformed to 2-dehydro-3-deoxy-D-gluconate (KDG). KDG is the substrate of the enzyme KdgK, whose sequences encoded by ORFs 223 and 823 showed significant structural homology containing all the specific ATP binding sites and active sites (S2 Table). This enzyme phosphorylates KDG, producing 2-dehydro-3-deoxy-D-gluconate-6P (KDPG), which is degraded to pyruvate and G3P in the final step of the Entner-Doudoroff pathway of glucose oxidation (Fig 4).

Metabolism of Iron and other metals (Met).

Reported. For a bacterium to obtain iron within host extraintestinal tissues and fluids it requires specialized uptake systems called siderophores or other high-affinity systems to uptake iron-containing molecules such as haem or transferrin. Although the conserved siderophore enterobactin is the most widespread in E. coli, this siderophore is effective only in the intestinal lumen, as it is neutralized in extra-intestinal fluids by albumin and siderocalins [130]. Consequently, ExPEC lineages rely on a variety of patho-specific siderophores that evade host innate defense proteins such as siderocalin.

For that matter, different kinds of metal uptake systems contribute to the success of APEC strains in different types of infection, depending on the specific microenvironment inhabited by the pathogen [131]. All five regions (GRs 8, 9, 12, 22, and 26) contained previously reported ABC transporters and correlated metabolic enzymes: the sit operon (GR 8; related to Fe+2, but mainly Mn+2 uptake [98]), the prrA, modD, fepC, yc73 cluster (GR 9; possibly mediating iron uptake; [111]), the yersiniabactin cluster (GR 12; related to Fe+3 and Zn+2 uptake by the yersiniabactin siderophore system [132]), the fit operon (GR 22; related to Fe+2, Co+2, and Cd+2 uptake [112]) and the chu operon (GR 26; related to the capture of haem through haemophores [113]). The roles of these systems for APEC infection were previously described in a number of reports [101,133,134]. For instance, it was shown that the most virulent APEC strains were able to grow in the presence of transferrin, in contrast to non-lethal strains, probably because these virulent strains contained additional iron uptake systems [135]. Also, it has been demonstrated that APEC O1 upregulated 13 genes related to metal uptake out of 20, when in contact with chicken serum and, among them were genes encoded by the chu, ybt (yersiniabactin), sit and fep operons [136].

Adhesion and Invasion (A/I).

Reported. Four GRs previously reported in E. coli were linked to bacterial adhesion and invasion (GRs 3, 4, 17, and 24, Table 4). The capacity to adhere and invade is perhaps one of the main characteristics of strain BEN2908 [14,2022]. An extensively studied factor involved in this capacity is the type 1 fimbria (T1F) encoded by the fim operon, present in most E. coli strains. This fimbria attaches to mannosylated host cell receptors, contributing directly to adhesion and invasion by various E. coli pathotypes [94]; T1F is considered a major virulence factor of ExPEC. Despite its conservation in most strains, some strains are unable to express T1F. T1F is regulated by a phase variation mechanism mediated by an invertible promoter switch, FimS, whose orientation can be flipped by a pair of recombinases called FimB and FimE [137]. While FimB inverts the fim switch in the on-to-off and off-to-on orientations with similar efficiencies, FimE inverts it rapidly in the on-to-off orientation [138]. Aside from these two recombinases, the gene fimX, present in GR 4, is an additional recombinase found in some strains. In the absence of the fimE and fimB, fimX also plays a role in phase variation, turning the E. coli fimbriae rapidly to an “ON” state in vivo [139], further increasing adhesion and invasion properties as well.

In addition to T1F, the fimbrial system encoded by the auf operon, which has been linked to UPEC pathogenesis is present in GR 24 [140]. The auf operon is structurally similar to the fim operon, also containing: an adhesin (AufG), major (AufA) and minor subunits (AufE and AufD), two chaperones (AufB and AufF), and an usher (AufC) protein. The expression of this operon, evaluated by RT-PCR, was observed in strain CFT073 in at least three different times during infection of mice (4 h, 24 h, and 48 h post-infection) [117]. Aside from fimbrial systems, other genomic features are also linked to bacterial adherence [14,16,18,141]. Two other previously reported adhesin-related loci were detected: the gene encoding the E. coli adherence factor, FdeC (GR 3; [114]), and the Intimin-like Adhesin (Ila) encoding gene cluster (GR 17; [116]). Gene fdeC codes for a single large 1,416 residue protein with 95% identity to the surface protein EaeH, which was shown to promote adhesion [142]. It was shown by confocal micrographs that a strain lacking fdeC was incapable of adhering to bladder cells from the UM-UC-3 line, appearing to be involved in urinary tract infection (UTI) [114]. Moreover, the fdeC gene belongs to a locus containing eight other CDS, five structurally related to reductases and three putative regulatory genes. The ila cluster comprises three genes (sinH, sinI, ratA), and is probably derived from the genetic island CS54 identified in Salmonella enterica, which contains five genes, and is also linked to adherence and cell invasion. Many UPEC strains harbour the ila gene cluster, and it was shown that the deletion of these genes attenuated strains in a murine UTI model, due to reduced bladder cell invasion and decreased capacity to ascend the urinary tract, depending on the gene that was deleted [116].

Defense mechanisms.

Reported. The modification of the bacterial cell surface is also a common defense strategy against host defenses and antimicrobial peptides. GR 20 harbours a genetic cluster known as the kps cluster, responsible for the synthesis and transport of components of the bacterial capsule [118]. The bacterial capsule is a polysaccharide-rich layer that surrounds the cell, playing a key role in evading host immune responses, mediating surface adhesion, and resisting desiccation. In E. coli, approximately 80 different capsule serotypes have been identified, highlighting the significance of capsule usage and adaptation [143].

Uncharacterized. Gram-negative bacteria have evolved a second external membrane that selectively allows compounds to enter through protein pores with size-exclusion properties or via diffusion across its hydrophobic lipid bilayer. Structures in the outer leaflet of this bilayer, such as lipopolysaccharides (LPS), play a critical defensive role. LPS is a modified lipid comprising three parts: Lipid A, the core oligosaccharide, and the highly variable O antigen, all of which can undergo modifications in response to environmental pressures [144]. For instance, Lipid A can be modified by the addition of 4-amino-4-deoxy-L-arabinose (L-Ara4N), a process likely mediated by ORF 219 in GR 19, which shows high sequence homology with the KdsD arabinose isomerase responsible for this modification [145]. Such alterations can enhance bacterial resistance to positively charged particles, including cationic antimicrobial peptides (CAMP), thus contributing significantly to bacterial defense and survival.

Secretion systems.

Reported. GRs encoding secretion systems belonged to the type 6 secretion system (T6SS), which are widely spread among Proteobacteria. The T6SS is a secretion apparatus that bacteria use to transport effector molecules that usually disrupt the targeted cell wall, being structurally related to a prophage injection apparatus. In E. coli, the T6SS has been described to be involved in interbacterial competition and in bacteria-host cell interactions, secreting different types of effectors, depending on the targeted structure [146]. In addition, many reports on E. coli have tested the role of T6SS in different E. coli pathotypes, being involved not only in the capacity to alter the structural actin filaments in human brain microvascular endothelial cells (HBMEC), but also to promote antibacterial effects through DNase activity [97].

To investigate the presence of T6SS genes in the strains investigated, we have employed the SecReT6 web resource [147]. Two GRs containing T6SS components could be identified: GRs 11 and 19. The latter contained more than fifteen CDS of this secretion system, being classified as T6SS-2, which is among the most prevalent sets found in APEC [97,148].

In addition to this cluster, the program SecReT6 identified two additional copies of the tssI (vgrG) gene in GR 11. One complete copy, with 1563 bp, exhibited 99.6% identity to the tssI from cluster 1 of APEC strain DE719 [149], while the smaller copy, with only 960 bp, showed 49.2% identity to the same gene. It’s not unusual to find various copies of the tssI/vgrG homologs in the same strain, being reported in different gram-negative bacteria such as P. aeruginosa and V. cholerae [120,150]. For example, it has been shown that different VgrG homologs from V. cholerae strain V52 have distinct impacts on virulence, as only one of the three copies (VgrG-1) was able to cause modifications in cell actin structure. Moreover, these homologs have been found to interact with each other, forming various multimeric complexes that may affect the targeted cell differently. So, the presence of multiple copies of the tssI/vgrG gene in E. coli may have a potential for diverse functions, as demonstrated in other gram-negative bacteria.

General Metabolism (GM).

The BRIG analysis identified four GRs associated with general or accessory metabolic genes present in the strains phylogenetically close to BEN2908 and AIEC LF82/NRG85. Of these, two (GRs 29 and 32) contained both reported and uncharacterized features, and two (GRs 21 and 31) consisted exclusively of uncharacterized features.

Reported. GR 29 is a 16 kb Genomic Island strongly associated with ExPEC from the phylogroup B2 and first identified in the APEC O1 genome [44]. In that island, a putative transketolase named Tkt1 (ORF 329) showed 68% amino acid identity to TktA from Vibrio cholerae [121]. Notably, the Tkt1 protein could not complement the function of TktA involved in L-arabinose usage as a carbon source. Instead, it showed activity as an enzyme involved in peptide nitrogen extraction, since a mutant of this gene showed defects in the use of Pro-Ala or Phe-Ala as a nitrogen source. Interestingly, in GR 32, the genes alr and tyrB are both involved in the usage of some of the same residues tested by Li et al., since alr encodes an alanine racemase involved in the interconversion of both stereoisomers of alanine while tyrB encodes an aminotransferase that uses aromatic residues to transfer its amino group to 2-oxoglutarate and vice-versa [121]. In addition to these genes, GR 32 also harbours dnaB that encodes the extensively studied DNA helicase, which is the main replicative DNA helicase, participating in initiation and elongation during chromosome replication.

Uncharacterized. Other hypothetical enzymes encoded by genes in GR 29 (Table 5) also could be related to nitrogen obtention via carbamate hydrolysis. In the Rut pathway, nitrogen is obtained by cleaving the pyrimidine ring of a nitrogenous base, initially forming ureidoacrylate (product of RutA), followed by aminoacrylate and carbamate (product of RutB). Carbamate, in turn, is hydrolysed yielding nitrogen (as ammonia) and carbon (as CO2) [151]. Notably, while ORF 1229 presented sequence homology to RutB, ORF 629 showed similarity to a cytosine permease channel named CodB [152], indicating the potential presence of a pyrimidine transporter in this region. ORFs 829 and 1029, in turn, presented homology to a carbamate kinase (YqeA) and to a oxamate carbamoyltransferase (FdrA), both capable of performing their reactions bidirectionally [153]. While FdrA transforms oxalurate to oxamate and carbamoyl phosphate, YqeA catalyses the transfer of phosphate from carbamoyl phosphate to ADP, forming ATP and leaving carbamate as a by-product, which, as mentioned earlier, is hydrolyzed.

Further expanding the network of nitrogen-related processes, the genes that encode hypothetical enzymes identified in GR 31 could be related to nucleotide processing or degradation. Despite displaying a modest (around 27%) amino acid identity to the B. subtilis 168 YfkN nucleotidase and no identity to the E. coli K-12 UshA nucleotidase, ORFs 231, 331, and 431 each exhibited highly significant CD-Search matches to the conserved domains of UshA (Table 5; S2 Table). UshA is a 5′-nucleotidase that cleaves phosphate groups from nucleotides (with secondary NAD-pyrophosphatase activity), preserving all metal binding and active sites. Further, ORF 131 exhibited high sequence identity to the nucleoside-specific channel Tsx, known to form pores for nucleoside uptake under substrate-limiting conditions [154,155]. Together, these findings could suggest the presence of a coordinated system for nucleoside degradation and transport, probably facilitating the processing of nucleosides.

Furthermore, other ORFs identified in GR 32 may have a role in the synthesis of tricarboxylic acid cycle intermediates, such as succinate. By plotting the KO number of the homologs of ORFs 1132, 1232, 1332, 1432 and 1532 in KEGG “Citrate Cycle (TCA Cycle)” map, it is noteworthy that they encode enzymes of a chain of reactions that lead to succinate production from 2-oxoglutarate (Fig 5). These ORFs exhibited high protein identity and all the conserved functional sites to SucA, SucB, LpdA, SucC, and SucD, respectively (Table 5, S2 Table). Moreover, ORF 1632 showed significant similarity to the Orf3 permease from Cupriavidus necator H16, a dicarboxylate transporter responsible for the translocation of TCA cycle intermediates, including citrate, alpha-ketoglutarate, and succinate. This ORF also displayed the 206 conserved residues of the transmembrane helices of the Solute Carrier 13 permease domain, characteristic of these transporters [156]. Other ORFs also presented homology to dicarboxylate transporters, such as ORF 16, which showed some structural similarity to CitT, a citrate transporter [157], and ORFs 521, 621, and 721, mentioned above in the Sugar Metabolism subsection, that encode a hypothetical TRAP transporter, of which the most studied, dctPQM from R. capsulatus, is involved in C4-dicarboxylate uptake [158]. Notably, the two last ORFs from GR 32 presented similarity to the two-component regulatory system responsible for the expression of some dicarboxylate transporters. ORFs 1832 and 1932 presented significant homology to DctD and DctB from P. aeruginosa PAO1, respectively. When dicarboxylates are present, DctB autophosphorylates and transfers its phosphate group to DctD. DctD, in turn, binds to the promoter regions of DctA and DctPQM, which interacts with RpoN sigma factor, enabling its transcription and the posterior uptake of TCA cycle intermediates [159].

thumbnail
Fig 5. Predicted dicarboxylate utilization pathway based on KEGG maps and KEGG Orthology.

ORFs highlighted in purple represent putative novel coding sequences. Some ORFs from GR 32 appear to be involved in the conversion of 2-oxoglutarate to succinate, forming a second gene module analogous to the canonical sucABCD from E. coli K-12 MG1655.

https://doi.org/10.1371/journal.pone.0342894.g005

Genomic rearrangements as influential forces for pathogenic E. coli adaptation

Some studies have shown that horizontal gene transfer (HGT) and gene duplication are important differentiation events in E. coli [160,161]. These events not only directly augment the arsenal of virulence genes, but also may give rise to new copies of existing genes that can evolve to acquire novel functions (neofunctionalization), complementary functions (subfunctionalization) or equal functions but with a different stimulatory network or a different expressing dosage [162].

Many GRs (6, 19, 21, 23, 29, and 32) contained ORFs that showed significant structural and sequence homology to known transporters (Table 5, S2 Table). Transporters are one of the most common categories seen in HGT and duplication events, because they are encoded by fewer genes than other systems, and possess fewer pleiotropic constraints, favouring their posterior fixation on the genome [163].

In our report, many of the GRs identified had gene modules that probably originated from HGT events and posterior gene rearrangements. In Fig 6A, for instance, the pdxA-like cluster from the pathogenic strains from this study is compared to one of the three pdxA-like clusters from S. meliloti 1021, described in [90] (Fig 2). Although the modular structure is different than any of the three from S. meliloti 1021, the sequence identity between five of the genes support the likelihood of a transfer event in the past. Curiously, S. enterica LT2 has a similar, but smaller cluster that was capable of complementing the function of the canonical pdxA gene in vitro (4PHT dehydrogenase) [90]. Notably, three of the four protein sequences from this cluster showed an even higher identity than S. meliloti 1021 (40.38% to ORF 66, 75.38% to ORF 76, and 71.17% to ORF 86)(Table 5, S2 Table). This suggests that Salmonella enterica LT2 suffered enough selection pressure to discard some of the genes acquired, although more studies would be required to understand the correct timeline of events related to this cluster.

thumbnail
Fig 6. Comparison of operons with gene organization similar to the observed in GRs 6, 29 and 32.

https://doi.org/10.1371/journal.pone.0342894.g006

Fig 6B shows the high resemblance of ORFs from GR 29 to genes from Photobacterium profundum 3TCK, a resemblance significantly higher than to the homologous genes present on the E. coli K-12 genome (S2 Table). Photobacterium is a cosmopolitan genus of gram-negative marine bacterium that belongs to the family Vibrionaceae, and is present on the oceans in different depths, being considered piezophilic (optimal growth in high hydrostatic pressures) or non-piezophilic, depending on the species or even the strains. To live in different depths, each strain requires a series of adaptations to different chemo-physical parameters, such as light, hydrostatic pressure, and organic carbon or nitrogen availability [164]. The strain P. profundum 3TCK is a non-piezophilic strain that was isolated from shallow waters in the San Diego Bay, it presents optimal growth under atmospheric pressure and at a broad range of temperatures (0 to above 20o C) [165]. The presence of this organism in shallow waters of a populous bay may have facilitated a transfer event to Escherichia at some point in the past [166]. Hypothetically, because this organism is from marine environments, the gene module shown on Fig 6B may be expressed to facilitate nitrogen acquisition through pyrimidine degradation (as mentioned in the GM subsection) when the E. coli strains are exposed to sea-like conditions (high salinity, colder temperature, etc). Curiously, it has been shown using a BIOLOG assay that another shallow water Photobacterium, named P. marinum J15, is capable of obtaining nitrogen through 41 (out of 95) different substrates, including the pyrimidines uridine and cytidine. Moreover, genomic analysis revealed that this strain also carries a carbamate kinase (similar to ORF 829), which is likely involved in the nitrogen processing pathway [167].

Different from the cases depicted in Fig 6A (GR 6)in Fig 6B (GR 29),and in Fig 6C (GR 32), only 12 of the 19 ORFs identified in ExPEC were also present in AIEC strains. Among these, three are well-known genes: the helicase dnaB (ORF 632), the alanine racemase alr (ORF 732), and the aromatic aminotransferase tyrB (ORF 1032). Five of the missing ones occur between the quinone reductase, qorA (which is the GR upper limit) and dnaB, and the other two between alr and tyrB (for more information, see S2 Table). All the remaining ORFs (with the exception of ORFs 1732 and 1932) displayed more than 50% amino acid identity to homologous genes from different bacteria, suggesting that more than one genomic rearrangement has occurred throughout time. The first portion (Fig 6C, triangle), showed significant identity to the canonical sucABCD operon from E. coli K-12, although it had even greater similarity in sequence and operon arrangement to the sucAB-lpdA2-sucCD cluster from Chromobacterium violaceum ATCC 12472. C. violaceum is a gram-negative saprophyte found in water and soil of tropical regions. Nonetheless, it can also act as an opportunistic pathogen, with infections typically arising from contact with contaminated water or exposure of skin lesions to infected soil or water. This bacterium was already isolated in cases of bacteraemia, septicaemia, and UTI, with some infections leading to death [168]. Curiously, the second (Fig 6C, circle) and third portions (Fig 6C, square) of the GR 32 gene module are absent in C. violaceum. Instead, they show significant homology to a probable dicarboxylate permease and a lactate dehydrogenase from the gram-negative soil bacterium Cupriavidus necator H16, as well as to the two-component regulatory system dctBD of the TRAP transporter dctPQM from the gram-negative opportunistic pathogen Pseudomonas aeruginosa PAO1. This suggests that the genomic rearrangements within GR 32 have formed a “mosaic” gene module [155], and although some genes are absent in AIEC, the ORFs depicted in Fig 6C are conserved, preserving key modules regarding catabolism of dicarboxylates such as genes encoding proteins for transport (ORF 1632), processing (ORFs 1132, 1232, 1332, 1432, and 1532), and regulation (ORFs 1832 and 1932).

Conclusion

Despite some differences such as the variation in phage-related regions (S6 Table), STc135 AIEC strains show strong genomic similarity to ExPEC strains, greater than with the InPEC included in the current genomic comparative analyses. This close phylogenetic relatedness (Fig 1), shared virulence gene profile (S5 Table), overlapping CRISPR spacers (S3 and S4 Tables) and greater core genome similarity support a recent common ancestry.

The ring comparison (Fig 3) revealed that of the 36 genomic regions of BEN2908 absent in E. coli K-12, 20 were also present on AIEC and ExPEC strains studied. Further investigation of these 20 regions reinforced aspects regarding ExPEC pathogenesis, such as the importance of iron and sugar uptake and metabolism (as supported by the number of GRs containing features related to those aspects, especially transporters), but also the importance of the T6SS (as effectors and cluster identified are commonly studied in APEC pathogenesis). In addition to that, the analyses of some of the GR features made with BLASTp, CD-Search, and KEGG databases indicated that ORFs related to nitrogen processing (identified in GRs 29, 31, and 32) and dicarboxylate uptake and metabolism (identified in GRs 6, 21, and 32) also appear to be relevant traits in these E. coli lineages. Although deletion of some of the ORF homologs from those GRs resulted in decreased virulence in other bacteria, such as sucA/ORF 1132 in CFT073 [169], sucABCD/ORFs 1132, 1232, 1432, 1532 in Salmonella Typhimurium [170], yqeA/ORF 829 in E. coli O102-ST405 [171], and dctP/ORF 521 in Vibrio alginolyticus [172], none of these bacteria had a second copy of the affected genes, which is the case identified here. Moreover, the architecture of some of the genomic modules explored in this paper clearly resembles the architecture described for bacteria from other genera, with several genes presenting high amino acid identity, particularly those from GRs 29 and 32 (Fig 6) to those in other bacterial species. In spite of that, this is a predictive work and experimental validation for these in silico analysis is necessary to define the true functionality of the novel ORFs identified.

Finally, it is interesting to consider that the year of isolation of BEN2908 (1977) and the phylogenetically close strains studied spans decades, so the uncharacterized ORFs identified in this work (Table 5) remained conserved in those E. coli lineages, possibly indicating relevant roles for fitness, host adaptation, and virulence.

Supporting information

S1 Table. Table with GC content, coverage and identity.

1 GC content calculated with the https://jamiemcgowan.ie/bioinf/gc_content.html web tool. 2 Bolded values have more than 70% coverage. Underscored values have between 50 and 70% coverage.

https://doi.org/10.1371/journal.pone.0342894.s001

(XLSX)

S2 Table. Table containing all GRs with reported and uncharacterized features.

1 specific hit or non-specific hit with expect value closer to 0. Commented in the cells are the conserved residues identified by CD-Blast. 2 Sequencing made by Schouler and Trotereau (2016), available at: https://www.ncbi.nlm.nih.gov/nuccore/AY395687.1. 3 Unreviewed entries selected because of their mention in the following papers: ORFs 3₆-5₆ (Thiaville et al., 2016); ORF 2₃₄ (Majumdar et al. 2004); ORF 5₃₄ (Gárcia-Sanchez et al., 2021); and ORFs 9₃₄-13₃₄ (Lim et al., 2015). 4 Unreviewed entry. No reviewed entries found. 5 PHASTEST attL (2101049.2101064) and attR (2167393.2167408) are too upstream and downstream, respectively, encompassing genes present in all strains – even those lacking phage content. Therefore, the region boundaries were defined using the same criteria applied to the other GRs: end of the first gene common to all strains, beginning of the last gene common to all strains.

https://doi.org/10.1371/journal.pone.0342894.s002

(XLSX)

S3 Table. Table with information regarding spacers and type of CRISPR/Cas system using MinCED and CRISPRCastyper.

1 CFT073, 78-Pyelo and E2348/69 doesn’t possess a CRISPR/Cas system.

https://doi.org/10.1371/journal.pone.0342894.s003

(XLSX)

S4 Table. Table containing the number of spacers shared in each strain.

1 CFT073, 78-Pyelo and E2348/69 doesn’t possess a CRISPR/Cas system. 2 Spacer sequence is present, but displayed one to four nucleotide insertions located at their 5′ or 3′ ends.

https://doi.org/10.1371/journal.pone.0342894.s004

(XLSX)

S5 Table. Table generated using VFAnalyzer (VFDB) with the 23 strains.

1 Highlighted in any color is shown the 85 genes identified in AIEC strains by VFanalyzer, of which: Yellow represents the 55 genes present among at least one InPEC, one ExPEC, one Commensal, and LF82. Green represents the 18 genes present among at least one InPEC, one ExPEC, and LF82. Orange represents the 6 genes present between at least one ExPEC and LF82. Blue represents the 3 genes present between at least one ExPEC, one Commensal, and LF82. Red represents the 3 genes present between at least one InPEC and LF82.

https://doi.org/10.1371/journal.pone.0342894.s005

(XLSX)

S6 Table. PHASTEST summary statistics of the 23 strains from this study.

https://doi.org/10.1371/journal.pone.0342894.s006

(XLSX)

S7 Table. Plasmids statistics.

1 The strains IHE3034, CFT073, 55989, SCU-397, and K-12 do not harbour any plasmids. 2 Three plasmids from the strain χ7122 and the plasmid from LF82 were not available for download. 3 pLF82 information was extracted from Miquel et al. (2010) Table 2 and Supplemental S1 Table. 4 Small plasmid size, RAST annotation failed. Information obtained from Genbank and GC content calculator: https://jamiemcgowan.ie/bioinf/gc_content.html.

https://doi.org/10.1371/journal.pone.0342894.s007

(XLSX)

Acknowledgments

We thank Raul Simon Batista for assisting in editing the preliminary results.

References

  1. 1. Guabiraba R, Schouler C. Avian colibacillosis: still many black holes. FEMS Microbiol Lett. 2015;362(15):fnv118. pmid:26204893
  2. 2. Landman WJM, van Eck JHH. The incidence and economic impact of the Escherichia coli peritonitis syndrome in Dutch poultry farming. Avian Pathol. 2015;44(5):370–8. pmid:26083823
  3. 3. Zhang Y, Wang Y, Zhu H, Yi Z, Afayibo DJA, Tao C, et al. DctR contributes to the virulence of avian pathogenic Escherichia coli through regulation of type III secretion system 2 expression. Vet Res. 2021;52(1):101. pmid:34229767
  4. 4. Jorgensen SL, Stegger M, Kudirkiene E, Lilje B, Poulsen LL, Ronco T. Diversity and population overlap between avian and human Escherichia coli belonging to Sequence Type 95. mSphere. 2019;4(1).
  5. 5. Wirth T, Falush D, Lan R, Colles F, Mensa P, Wieler LH, et al. Sex and virulence in Escherichia coli: an evolutionary perspective. Mol Microbiol. 2006;60(5):1136–51. pmid:16689791
  6. 6. Nandanwar N, Janssen T, Kühl M, Ahmed N, Ewers C, Wieler LH. Extraintestinal pathogenic Escherichia coli (ExPEC) of human and avian origin belonging to sequence type complex 95 (STC95) portray indistinguishable virulence features. Int J Med Microbiol. 2014;304(7):835–42. pmid:25037925
  7. 7. Gordon DM, Geyik S, Clermont O, O’Brien CL, Huang S, Abayasekara C, et al. Fine-scale structure analysis shows epidemic patterns of lonal Complex 95, a cosmopolitan Escherichia coli lineage responsible for extraintestinal infection. mSphere. 2017;2(3):e00168-17. pmid:28593194
  8. 8. Royer G, Darty MM, Clermont O, Condamine B, Laouenan C, Decousser J-W, et al. Phylogroup stability contrasts with high within sequence type complex dynamics of Escherichia coli bloodstream infection isolates over a 12-year period. Genome Med. 2021;13(1):77. pmid:33952335
  9. 9. Dho M, Lafont JP. Escherichia coli colonization of the trachea in poultry: comparison of virulent and avirulent strains in gnotoxenic chickens. Avian Dis. 1982;26(4):787–97. pmid:6760850
  10. 10. Porcheron G, Chanteloup NK, Trotereau A, Brée A, Schouler C. Effect of fructooligosaccharide metabolism on chicken colonization by an extra-intestinal pathogenic Escherichia coli strain. PLoS One. 2012;7(4):e35475. pmid:22514747
  11. 11. Schouler C, Taki A, Chouikha I, Moulin-Schouleur M, Gilot P. A genomic island of an extraintestinal pathogenic Escherichia coli Strain enables the metabolism of fructooligosaccharides, which improves intestinal colonization. J Bacteriol. 2009;191(1):388–93. pmid:18978057
  12. 12. Horn F, Corrêa AMR, Barbieri NL, Glodde S, Weyrauch KD, Kaspers B, et al. Infections with avian pathogenic and fecal Escherichia coli strains display similar lung histopathology and macrophage apoptosis. PLoS One. 2012;7(7):e41031. pmid:22848424
  13. 13. Pourbakhsh SA, Boulianne M, Martineau-Doizé B, Dozois CM, Desautels C, Fairbrother JM. Dynamics of Escherichia coil infection in experimentally inoculated chickens. Avian Dis. 1997;41(1):221–33. pmid:9087340
  14. 14. Pavanelo DB, Houle S, Matter LB, Dozois CM, Horn F. The periplasmic trehalase Affects Type 1 fimbria production and virulence of Extraintestinal Pathogenic Escherichia coli strain MT78. Infect Immun. 2018;86(8):e00241–18. pmid:29844238
  15. 15. Porcheron G, Kut E, Canepa S, Maurel M-C, Schouler C. Regulation of fructooligosaccharide metabolism in an extra-intestinal pathogenic Escherichia coli strain. Mol Microbiol. 2011;81(3):717–33. pmid:21692876
  16. 16. Chouikha I, Germon P, Brée A, Gilot P, Moulin-Schouleur M, Schouler C. A selC-associated genomic island of the extraintestinal avian pathogenic Escherichia coli strain BEN2908 is involved in carbohydrate uptake and virulence. J Bacteriol. 2006;188(3):977–87. pmid:16428402
  17. 17. Répérant M, Porcheron G, Rouquet G, Gilot P. The yicJI metabolic operon of Escherichia coli is involved in bacterial fitness. FEMS Microbiol Lett. 2011;319(2):180–6. pmid:21477255
  18. 18. Rouquet G, Porcheron G, Barra C, Répérant M, Chanteloup NK, Schouler C, et al. A metabolic operon in extraintestinal pathogenic Escherichia coli promotes fitness under stressful conditions and invasion of eukaryotic cells. J Bacteriol. 2009;191(13):4427–40. pmid:19376853
  19. 19. Klemberg VS, Pavanelo DB, Houle S, Dhakal S, Pokharel P, Iahnig-Jacques S, et al. The osmoregulated metabolism of trehalose contributes to production of type 1 fimbriae and bladder colonization by extraintestinal Escherichia coli strain BEN2908. Front Cell Infect Microbiol. 2024;14:1414188. pmid:38979511
  20. 20. Matter LB, Barbieri NL, Nordhoff M, Ewers C, Horn F. Avian pathogenic Escherichia coli MT78 invades chicken fibroblasts. Vet Microbiol. 2011;148(1):51–9. pmid:20850232
  21. 21. Chanteloup NK, Porcheron G, Delaleu B, Germon P, Schouler C, Moulin-Schouleur M, et al. The extra-intestinal avian pathogenic Escherichia coli strain BEN2908 invades avian and human epithelial cells and survives intracellularly. Vet Microbiol. 2011;147(3–4):435–9. pmid:20708353
  22. 22. Germon P, Chen Y-H, He L, Blanco JE, Brée A, Schouler C, et al. ibeA, a virulence factor of avian pathogenic Escherichia coli. Microbiology (Reading). 2005;151(Pt 4):1179–86. pmid:15817785
  23. 23. Martinez-Medina M, Garcia-Gil J, Barnich N, Wieler LH, Ewers C. Adherent-invasive Escherichia coli phenotype displayed by intestinal pathogenic E. coli strains from cats, dogs, and swine. Appl Environ Microbiol. 2011;77(16):5813–7. pmid:21705530
  24. 24. Cortes MAM, Gibon J, Chanteloup NK, Moulin-Schouleur M, Gilot P, Germon P. Inactivation of ibeA and ibeT results in decreased expression of type 1 fimbriae in extraintestinal pathogenic Escherichia coli strain BEN2908. Infect Immun. 2008;76(9):4129–36. pmid:18591231
  25. 25. Fléchard M, Cortes MAM, Répérant M, Germon P. New role for the ibeA gene in H2O2 stress resistance of Escherichia coli. J Bacteriol. 2012;194(17):4550–60. pmid:22730120
  26. 26. Darfeuille-Michaud A, Neut C, Barnich N, Lederman E, Di Martino P, Desreumaux P, et al. Presence of adherent Escherichia coli strains in ileal mucosa of patients with Crohn’s disease. Gastroenterology. 1998;115(6):1405–13. pmid:9834268
  27. 27. Nash JH, Villegas A, Kropinski AM, Aguilar-Valenzuela R, Konczy P, Mascarenhas M, et al. Genome sequence of adherent-invasive Escherichia coli and comparative genomic analysis with other E. coli pathotypes. BMC Genomics. 2010;11:667. pmid:21108814
  28. 28. Miquel S, Peyretaillade E, Claret L, de Vallée A, Dossat C, Vacherie B, et al. Complete genome sequence of Crohn’s disease-associated adherent-invasive E. coli strain LF82. PLoS One. 2010;5(9):e12714. pmid:20862302
  29. 29. Gibold L, Garenaux E, Dalmasso G, Gallucci C, Cia D, Mottet-Auselo B, et al. The Vat-AIEC protease promotes crossing of the intestinal mucus layer by Crohn’s disease-associated Escherichia coli. Cell Microbiol. 2016;18(5):617–31. pmid:26499863
  30. 30. Parreira VR, Gyles CL. A novel pathogenicity island integrated adjacent to the thrW tRNA gene of avian pathogenic Escherichia coli encodes a vacuolating autotransporter toxin. Infect Immun. 2003;71(9):5087–96. pmid:12933851
  31. 31. Prudent V, Demarre G, Vazeille E, Wery M, Quenech’Du N, Ravet A, et al. The Crohn’s disease-related bacterial strain LF82 assembles biofilm-like communities to protect itself from phagolysosomal attack. Commun Biol. 2021;4(1):627. pmid:34035436
  32. 32. Feldmann F, Sorsa LJ, Hildinger K, Schubert S. The salmochelin siderophore receptor IroN contributes to invasion of urothelial cells by extraintestinal pathogenic Escherichia coli in vitro. Infect Immun. 2007;75(6):3183–7. pmid:17353289
  33. 33. Huang SH, Chen YH, Kong G, Chen SH, Besemer J, Borodovsky M, et al. A novel genetic island of meningitic Escherichia coli K1 containing the ibeA invasion gene (GimA): functional annotation and carbon-source-regulated invasion of human brain microvascular endothelial cells. Funct Integr Genomics. 2001;1(5):312–22. pmid:11793250
  34. 34. Martinez-Jéhanne V, Pichon C, du Merle L, Poupel O, Cayet N, Bouchier C, et al. Role of the vpe carbohydrate permease in Escherichia coli urovirulence and fitness in vivo. Infect Immun. 2012;80(8):2655–66. pmid:22615242
  35. 35. Schubert S, Picard B, Gouriou S, Heesemann J, Denamur E. Yersinia high-pathogenicity island contributes to virulence in Escherichia coli causing extraintestinal infections. Infect Immun. 2002;70(9):5335–7. pmid:12183596
  36. 36. Zong B, Zhang Y, Wang X, Liu M, Zhang T, Zhu Y, et al. Characterization of multiple type-VI secretion system (T6SS) VgrG proteins in the pathogenicity and antibacterial activity of porcine extra-intestinal pathogenic Escherichia coli. Virulence. 2019;10(1):118–32. pmid:30676217
  37. 37. De Coster W, D’Hert S, Schultz DT, Cruts M, Van Broeckhoven C. NanoPack: visualizing and processing long-read sequencing data. Bioinformatics. 2018;34(15):2666–9. pmid:29547981
  38. 38. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20. pmid:24695404
  39. 39. Galaxy Community. The Galaxy platform for accessible, reproducible, and collaborative data analyses: 2024 update. Nucleic Acids Res. 2024;52(W1):W83–94. pmid:38769056
  40. 40. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25(14):1754–60. pmid:19451168
  41. 41. Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017;27(5):722–36. pmid:28298431
  42. 42. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One. 2014;9(11):e112963. pmid:25409509
  43. 43. Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, et al. The RAST Server: rapid annotations using subsystems technology. BMC Genomics. 2008;9:75. pmid:18261238
  44. 44. Johnson TJ, Kariyawasam S, Wannemuehler Y, Mangiamele P, Johnson SJ, Doetkott C, et al. The genome sequence of avian pathogenic Escherichia coli strain O1:K1:H7 shares strong similarities with human extraintestinal pathogenic E. coli genomes. J Bacteriol. 2007;189(8):3228–36. pmid:17293413
  45. 45. Zhu Ge X, Jiang J, Pan Z, Hu L, Wang S, Wang H, et al. Comparative genomic analysis shows that avian pathogenic Escherichia coli isolate IMT5155 (O2:K1:H5; ST complex 95, ST140) shares close relationship with ST95 APEC O1:K1 and human ExPEC O18:K1 strains. PLoS One. 2014;9(11):e112048. pmid:25397580
  46. 46. Dziva F, Hauser H, Connor TR, van Diemen PM, Prescott G, Langridge GC, et al. Sequencing and functional annotation of avian pathogenic Escherichia coli serogroup O78 strains reveal the evolution of E. coli lineages pathogenic for poultry via distinct mechanisms. Infect Immun. 2013;81(3):838–49. pmid:23275093
  47. 47. Wijetunge DSS, Katani R, Kapur V, Kariyawasam S. Complete Genome Sequence of Escherichia coli Strain RS218 (O18:H7:K1), Associated with Neonatal Meningitis. Genome Announc. 2015;3(4):e00804-15. pmid:26205862
  48. 48. Moriel DG, Bertoldi I, Spagnuolo A, Marchi S, Rosini R, Nesta B, et al. Identification of protective and broadly conserved vaccine antigens from the genome of extraintestinal pathogenic Escherichia coli. Proc Natl Acad Sci U S A. 2010;107(20):9072–7. pmid:20439758
  49. 49. Chen SL, Hung C-S, Xu J, Reigstad CS, Magrini V, Sabo A, et al. Identification of genes subject to positive selection in uropathogenic strains of Escherichia coli: a comparative genomics approach. Proc Natl Acad Sci U S A. 2006;103(15):5977–82. pmid:16585510
  50. 50. Welch RA, Burland V, Plunkett G 3rd, Redford P, Roesch P, Rasko D, et al. Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli. Proc Natl Acad Sci U S A. 2002;99(26):17020–4. pmid:12471157
  51. 51. Biggel M, Xavier BB, Johnson JR, Nielsen KL, Frimodt-Møller N, Matheeussen V, et al. Horizontally acquired papGII-containing pathogenicity islands underlie the emergence of invasive uropathogenic Escherichia coli lineages. Nat Commun. 2020;11(1):5968. pmid:33235212
  52. 52. Strachan NJC, Rotariu O, Lopes B, MacRae M, Fairley S, Laing C, et al. Whole Genome Sequencing demonstrates that Geographic Variation of Escherichia coli O157 Genotypes Dominates Host Association. Sci Rep. 2015;5:14145. pmid:26442781
  53. 53. Ohnishi M, Terajima J, Kurokawa K, Nakayama K, Murata T, Tamura K, et al. Genomic diversity of enterohemorrhagic Escherichia coli O157 revealed by whole genome PCR scanning. Proc Natl Acad Sci U S A. 2002;99(26):17043–8. pmid:12481030
  54. 54. Delannoy S, Mariani-Kurkdjian P, Webb HE, Bonacorsi S, Fach P. The mobilome; a major contributor to Escherichia coli stx2-Positive O26:H11 Strains Intra-Serotype Diversity. Front Microbiol. 2017;8:1625. pmid:28932209
  55. 55. Ogura Y, Ooka T, Iguchi A, Toh H, Asadulghani M, Oshima K, et al. Comparative genomics reveal the mechanism of the parallel evolution of O157 and non-O157 enterohemorrhagic Escherichia coli. Proc Natl Acad Sci U S A. 2009;106(42):17939–44. pmid:19815525
  56. 56. Chaudhuri RR, Sebaihia M, Hobman JL, Webber MA, Leyton DL, Goldberg MD, et al. Complete genome sequence and comparative metabolic profiling of the prototypical enteroaggregative Escherichia coli strain 042. PLoS One. 2010;5(1):e8801. pmid:20098708
  57. 57. Ahmed SA, Awosika J, Baldwin C, Bishop-Lilly KA, Biswas B, Broomall S, et al. Genomic comparison of Escherichia coli O104:H4 isolates from 2009 and 2011 reveals plasmid, and prophage heterogeneity, including shiga toxin encoding phage stx2. PLoS One. 2012;7(11):e48228. pmid:23133618
  58. 58. Alexander DC, Hao W, Gilmour MW, Zittermann S, Sarabia A, Melano RG, et al. Escherichia coli O104:H4 infections and international travel. Emerg Infect Dis. 2012;18(3):473–6. pmid:22377016
  59. 59. Crossman LC, Chaudhuri RR, Beatson SA, Wells TJ, Desvaux M, Cunningham AF, et al. A commensal gone bad: complete genome sequence of the prototypical enterotoxigenic Escherichia coli strain H10407. J Bacteriol. 2010;192(21):5822–31. pmid:20802035
  60. 60. Rasko DA, Rosovitz MJ, Myers GSA, Mongodin EF, Fricke WF, Gajer P, et al. The pangenome structure of Escherichia coli: comparative genomic analysis of E. coli commensal and pathogenic isolates. J Bacteriol. 2008;190(20):6881–93. pmid:18676672
  61. 61. Iguchi A, Thomson NR, Ogura Y, Saunders D, Ooka T, Henderson IR, et al. Complete genome sequence and comparative genome analysis of enteropathogenic Escherichia coli O127:H6 strain E2348/69. J Bacteriol. 2009;191(1):347–54. pmid:18952797
  62. 62. Blattner FR, Plunkett G 3rd, Bloch CA, Perna NT, Burland V, Riley M, et al. The complete genome sequence of Escherichia coli K-12. Science. 1997;277(5331):1453–62. pmid:9278503
  63. 63. Stephens C, Arismendi T, Wright M, Hartman A, Gonzalez A, Gill M. F plasmids are the major carriers of antibiotic resistance genes in human-associated commensal Escherichia coli. mSphere. 2020;5(4).
  64. 64. Zhou Z, Alikhan N-F, Mohamed K, Fan Y, Agama Study Group, Achtman M. The EnteroBase user’s guide, with case studies on Salmonella transmissions, Yersinia pestis phylogeny, and Escherichia core genomic diversity. Genome Res. 2020;30(1):138–52. pmid:31809257
  65. 65. Beghain J, Bridier-Nahmias A, Le Nagard H, Denamur E, Clermont O. ClermonTyping: an easy-to-use and accurate in silico method for Escherichia genus strain phylotyping. Microb Genom. 2018;4(7):e000192. pmid:29916797
  66. 66. Joensen KG, Tetzschner AMM, Iguchi A, Aarestrup FM, Scheutz F. Rapid and easy in silico serotyping of Escherichia coli isolates by use of whole-genome sequencing data. J Clin Microbiol. 2015;53(8):2410–26. pmid:25972421
  67. 67. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10:421. pmid:20003500
  68. 68. Larsen MV, Cosentino S, Rasmussen S, Friis C, Hasman H, Marvig RL, et al. Multilocus sequence typing of total-genome-sequenced bacteria. J Clin Microbiol. 2012;50(4):1355–61. pmid:22238442
  69. 69. Roer L, Tchesnokova V, Allesøe R, Muradova M, Chattopadhyay S, Ahrenfeldt J, et al. Development of a Web Tool for Escherichia coli Subtyping Based on fimH Alleles. J Clin Microbiol. 2017;55(8):2538–43. pmid:28592545
  70. 70. Liu B, Zheng D, Zhou S, Chen L, Yang J. VFDB 2022: a general classification scheme for bacterial virulence factors. Nucleic Acids Res. 2022;50(D1):D912–7. pmid:34850947
  71. 71. Russel J, Pinilla-Redondo R, Mayo-Muñoz D, Shah SA, Sørensen SJ. CRISPRCasTyper: Automated Identification, Annotation, and Classification of CRISPR-Cas Loci. CRISPR J. 2020;3(6):462–9. pmid:33275853
  72. 72. Bland C, Ramsey TL, Sabree F, Lowe M, Brown K, Kyrpides NC, et al. CRISPR recognition tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats. BMC Bioinformatics. 2007;8:209. pmid:17577412
  73. 73. Zhang J, Guan J, Wang M, Li G, Djordjevic M, Tai C, et al. SecReT6 update: a comprehensive resource of bacterial Type VI Secretion Systems. Sci China Life Sci. 2023;66(3):626–34. pmid:36346548
  74. 74. Wishart DS, Han S, Saha S, Oler E, Peters H, Grant JR, et al. PHASTEST: faster than PHASTER, better than PHAST. Nucleic Acids Res. 2023;51(W1):W443–50. pmid:37194694
  75. 75. Page AJ, Cummins CA, Hunt M, Wong VK, Reuter S, Holden MTG, et al. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics. 2015;31(22):3691–3. pmid:26198102
  76. 76. Kanehisa M, Furumichi M, Sato Y, Matsuura Y, Ishiguro-Watanabe M. KEGG: biological systems database as a model of the real world. Nucleic Acids Res. 2025;53(D1):D672–7. pmid:39417505
  77. 77. Löytynoja A. Phylogeny-aware alignment with PRANK. Methods Mol Biol. 2014;1079:155–70. pmid:24170401
  78. 78. Emms DM, Kelly S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 2019;20(1):238. pmid:31727128
  79. 79. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32(5):1792–7. pmid:15034147
  80. 80. Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. 2017;14(6):587–9. pmid:28481363
  81. 81. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–3. pmid:24451623
  82. 82. Letunic I, Bork P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021;49(W1):W293–6. pmid:33885785
  83. 83. Kim D, Park S, Chun J. Introducing EzAAI: a pipeline for high throughput calculations of prokaryotic average amino acid identity. J Microbiol. 2021;59(5):476–80. pmid:33907973
  84. 84. Alikhan N-F, Petty NK, Ben Zakour NL, Beatson SA. BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons. BMC Genomics. 2011;12:402. pmid:21824423
  85. 85. Liu CM, Stegger M, Aziz M, Johnson TJ, Waits K, Nordstrom L, et al. Escherichia coli ST131-H22 as a Foodborne Uropathogen. mBio. 2018;9(4):e00470–18. pmid:30154256
  86. 86. Bonacorsi S, Clermont O, Houdouin V, Cordevant C, Brahimi N, Marecat A, et al. Molecular analysis and experimental virulence of French and North American Escherichia coli neonatal meningitis isolates: identification of a new virulent clone. J Infect Dis. 2003;187(12):1895–906. pmid:12792866
  87. 87. Homeier T, Semmler T, Wieler LH, Ewers C. The GimA locus of extraintestinal pathogenic E. coli: does reductive evolution correlate with habitat and pathotype? PLoS One. 2010;5(5):e10877. pmid:20526361
  88. 88. Lu S, Wang J, Chitsaz F, Derbyshire MK, Geer RC, Gonzales NR, et al. CDD/SPARCLE: the conserved domain database in 2020. Nucleic Acids Res. 2020;48(D1):D265–8. pmid:31777944
  89. 89. Wang J, Chitsaz F, Derbyshire MK, Gonzales NR, Gwadz M, Lu S, et al. The conserved domain database in 2023. Nucleic Acids Res. 2023;51(D1):D384–8. pmid:36477806
  90. 90. Thiaville JJ, Flood J, Yurgel S, Prunetti L, Elbadawi-Sidhu M, Hutinet G, et al. Members of a novel kinase family (DUF1537) can recycle toxic intermediates into an essential metabolite. ACS Chem Biol. 2016;11(8):2304–11. pmid:27294475
  91. 91. Majumdar A, Ghatak A, Ghosh RK. Identification of the gene for the monomeric alkaline phosphatase of Vibrio cholerae serogroup O1 strain. Gene. 2005;344:251–8. pmid:15656991
  92. 92. García-Sánchez M, Souche M, Trives-Segura C, Plassard C. The grazing activity of Acrobeloides sp. drives phytate mineralisation within its trophic relationship with bacteria. J Nematol. 2021;53:e2021–21. pmid:33860247
  93. 93. Lim S, Han A, Kim D, Seo HS. Transcriptional profiling of an attenuated Salmonella Typhimurium ptsI mutant strain under low-oxygen conditions using microarray analysis. J Bacteriol Virol. 2015;45(3):200–14.
  94. 94. Dreux N, Denizot J, Martinez-Medina M, Mellmann A, Billig M, Kisiela D, et al. Point mutations in FimH adhesin of Crohn’s disease-associated adherent-invasive Escherichia coli enhance intestinal inflammatory response. PLoS Pathog. 2013;9(1):e1003141. pmid:23358328
  95. 95. Schwartz DJ, Kalas V, Pinkner JS, Chen SL, Spaulding CN, Dodson KW, et al. Positively selected FimH residues enhance virulence during urinary tract infection by altering FimH conformation. Proc Natl Acad Sci U S A. 2013;110(39):15530–7. pmid:24003161
  96. 96. Koonin EV, Makarova KS. Origins and evolution of CRISPR-Cas systems. Philos Trans R Soc Lond B Biol Sci. 2019;374(1772):20180087. pmid:30905284
  97. 97. Ma J, Sun M, Bao Y, Pan Z, Zhang W, Lu C, et al. Genetic diversity and features analysis of type VI secretion systems loci in avian pathogenic Escherichia coli by wide genomic scanning. Infect Genet Evol. 2013;20:454–64. pmid:24120694
  98. 98. Kehres DG, Janakiraman A, Slauch JM, Maguire ME. SitABCD is the alkaline Mn(2+) transporter of Salmonella enterica serovar Typhimurium. J Bacteriol. 2002;184(12):3159–66. pmid:12029031
  99. 99. Lloyd AL, Rasko DA, Mobley HLT. Defining genomic islands and uropathogen-specific genes in uropathogenic Escherichia coli. J Bacteriol. 2007;189(9):3532–46. pmid:17351047
  100. 100. Gratia A. Sur un remarquable exemple d’antagonisme entre deux souches de colibacille. C R Soc Biol. 1925;93:1040–1.
  101. 101. Caza M, Lépine F, Milot S, Dozois CM. Specific roles of the iroBCDEN genes in virulence of an avian pathogenic Escherichia coli O78 strain and in production of salmochelins. Infect Immun. 2008;76(8):3539–49. pmid:18541653
  102. 102. Chagneau CV, Payros D, Goman A, Goursat C, David L, Okuno M, et al. HlyF, an underestimated virulence factor of uropathogenic Escherichia coli. Clin Microbiol Infect. 2023;29(11):1449.e1–1449.e9. pmid:37532127
  103. 103. Murase K, Martin P, Porcheron G, Houle S, Helloin E, Pénary M, et al. HlyF produced by extraintestinal pathogenic Escherichia coli Is a virulence factor that regulates outer membrane vesicle biogenesis. J Infect Dis. 2016;213(5):856–65. pmid:26494774
  104. 104. Quackenbush RL, Falkow S. Relationship between colicin V activity and virulence in Escherichia coli. Infect Immun. 1979;24(2):562–4. pmid:378858
  105. 105. Reid CJ, Cummins ML, Börjesson S, Brouwer MSM, Hasman H, Hammerum AM, et al. A role for ColV plasmids in the evolution of pathogenic Escherichia coli ST58. Nat Commun. 2022;13(1):683. pmid:35115531
  106. 106. Mellata M, Ameiss K, Mo H, Curtiss R 3rd. Characterization of the contribution to virulence of three large plasmids of avian pathogenic Escherichia coli chi7122 (O78:K80:H9). Infect Immun. 2010;78(4):1528–41. pmid:20086082
  107. 107. Nolan LK, Horne SM, Giddings CW, Foley SL, Johnson TJ, Lynne AM, et al. Resistance to serum complement, iss, and virulence of avian Escherichia coli. Vet Res Commun. 2003;27(2):101–10. pmid:12718504
  108. 108. Kidgell C, Pickard D, Wain J, James K, Diem Nga LT, Diep TS, et al. Characterisation and distribution of a cryptic Salmonella typhi plasmid pHCM2. Plasmid. 2002;47(3):159–71. pmid:12151231
  109. 109. Denamur E, Clermont O, Bonacorsi S, Gordon D. The population genetics of pathogenic Escherichia coli. Nat Rev Microbiol. 2021;19(1):37–54. pmid:32826992
  110. 110. Schouler C, Schaeffer B, Brée A, Mora A, Dahbi G, Biet F, et al. Diagnostic strategy for identifying avian pathogenic Escherichia coli based on four patterns of virulence genes. J Clin Microbiol. 2012;50(5):1673–8. pmid:22378905
  111. 111. Ye C, Xu J. Prevalence of iron transport gene on pathogenicity-associated island of uropathogenic Escherichia coli in E. coli O157:H7 containing Shiga toxin gene. J Clin Microbiol. 2001;39(6):2300–5. pmid:11376076
  112. 112. Ouyang Z, Isaacson R. Identification and characterization of a novel ABC iron transport system, fit, in Escherichia coli. Infect Immun. 2006;74(12):6949–56. pmid:16982838
  113. 113. Suits MDL, Lang J, Pal GP, Couture M, Jia Z. Structure and heme binding properties of Escherichia coli O157:H7 ChuX. Protein Sci. 2009;18(4):825–38. pmid:19319934
  114. 114. Nesta B, Spraggon G, Alteri C, Moriel DG, Rosini R, Veggi D, et al. FdeC, a novel broadly conserved Escherichia coli adhesin eliciting protection against urinary tract infections. mBio. 2012;3(2):e00010–12. pmid:22496310
  115. 115. Bateman SL, Stapleton AE, Stamm WE, Hooton TM, Seed PC. The type 1 pili regulator gene fimX and pathogenicity island PAI-X as molecular markers of uropathogenic Escherichia coli. Microbiology (Reading). 2013;159(Pt 8):1606–17. pmid:23744903
  116. 116. Shea AE, Stocki JA, Himpsl SD, Smith SN, Mobley HLT. Loss of an intimin-like protein encoded on a uropathogenic E. coli pathogenicity island reduces inflammation and affects interactions with the urothelium. Infect Immun. 2022;90(2):e0027521. pmid:34871042
  117. 117. Buckles EL, Bahrani-Mougeot FK, Molina A, Lockatell CV, Johnson DE, Drachenberg CB, et al. Identification and characterization of a novel uropathogenic Escherichia coli-associated fimbrial gene cluster. Infect Immun. 2004;72(7):3890–901. pmid:15213132
  118. 118. Vimr ER. Map position and genomic organization of the kps cluster for polysialic acid synthesis in Escherichia coli K1. J Bacteriol. 1991;173(3):1335–8. pmid:1991726
  119. 119. Sun M, Gao X, Zhao K, Ma J, Yao H, Pan Z. Insight into the virulence related secretion systems, fimbriae, and toxins in O2:K1 Escherichia coli Isolated From Bovine Mastitis. Front Vet Sci. 2021;8:622725. pmid:33644149
  120. 120. Pukatzki S, Ma AT, Revel AT, Sturtevant D, Mekalanos JJ. Type VI secretion system translocates a phage tail spike-like protein into target cells where it cross-links actin. Proc Natl Acad Sci U S A. 2007;104(39):15508–13. pmid:17873062
  121. 121. Li G, Kariyawasam S, Tivendale KA, Wannemuehler Y, Ewers C, Wieler LH, et al. tkt1, located on a novel pathogenicity island, is prevalent in avian and human extraintestinal pathogenic Escherichia coli. BMC Microbiol. 2012;12:51. pmid:22471764
  122. 122. Nakayama N, Arai N, Bond MW, Kaziro Y, Arai K. Nucleotide sequence of dnaB and the primary structure of the dnaB protein from Escherichia coli. J Biol Chem. 1984;259(1):97–101. pmid:6323420
  123. 123. Wu D, Hu T, Zhang L, Chen J, Du J, Ding J, et al. Residues Asp164 and Glu165 at the substrate entryway function potently in substrate orientation of alanine racemase from E. coli: Enzymatic characterization with crystal structure analysis. Protein Sci. 2008;17(6):1066–76. pmid:18434499
  124. 124. Kuramitsu S, Inoue K, Ogawa T, Ogawa H, Kagamiyama H. Aromatic amino acid aminotransferase of Escherichia coli: nucleotide sequence of the tyrB gene. Biochem Biophys Res Commun. 1985;133(1):134–9. pmid:3907634
  125. 125. McAllister CH, Good AG. Alanine aminotransferase variants conferring diverse NUE phenotypes in Arabidopsis thaliana. PLoS One. 2015;10(4):e0121830. pmid:25830496
  126. 126. Zdych E, Peist R, Reidl J, Boos W. MalY of Escherichia coli is an enzyme with the activity of a beta C-S lyase (cystathionase). J Bacteriol. 1995;177(17):5035–9. pmid:7665481
  127. 127. Nobelmann B, Lengeler JW. Molecular analysis of the gat genes from Escherichia coli and of their roles in galactitol transport and metabolism. J Bacteriol. 1996;178(23):6790–5. pmid:8955298
  128. 128. Kuivanen J, Richard P. The yjjN of E. coli codes for an L-galactonate dehydrogenase and can be used for quantification of L-galactonate and L-gulonate. Appl Biochem Biotechnol. 2014;173(7):1829–35. pmid:24861318
  129. 129. Robert-Baudouy J, Portalier R, Stoeber F. Regulation of hexuronate system genes in Escherichia coli K-12: multiple regulation of the uxu operon by exuR and uxuR gene products. J Bacteriol. 1981;145(1):211–20. pmid:7007313
  130. 130. Ratledge C, Dover LG. Iron metabolism in pathogenic bacteria. Annu Rev Microbiol. 2000;54:881–941. pmid:11018148
  131. 131. Porcheron G, Garénaux A, Proulx J, Sabri M, Dozois CM. Iron, copper, zinc, and manganese transport and regulation in pathogenic Enterobacteria: correlations between strains, site of infection and the relative importance of the different metal transport systems for virulence. Front Cell Infect Microbiol. 2013;3:90. pmid:24367764
  132. 132. Perry RD, Fetherston JD. Yersiniabactin iron uptake: mechanisms and role in Yersinia pestis pathogenesis. Microbes Infect. 2011;13(10):808–17. pmid:21609780
  133. 133. Caza M, Lépine F, Dozois CM. Secretion, but not overall synthesis, of catecholate siderophores contributes to virulence of extraintestinal pathogenic Escherichia coli. Mol Microbiol. 2011;80(1):266–82. pmid:21306443
  134. 134. Li G, Laturnus C, Ewers C, Wieler LH. Identification of genes required for avian Escherichia coli septicemia by signature-tagged mutagenesis. Infect Immun. 2005;73(5):2818–27. pmid:15845486
  135. 135. Dho M, Lafont JP. Adhesive properties and iron uptake ability in Escherichia coli lethal and nonlethal for chicks. Avian Dis. 1984;28(4):1016–25. pmid:6395845
  136. 136. Li G, Tivendale KA, Liu P, Feng Y, Wannemuehler Y, Cai W, et al. Transcriptome analysis of avian pathogenic Escherichia coli O1 in chicken serum reveals adaptive responses to systemic infection. Infect Immun. 2011;79(5):1951–60. pmid:21357721
  137. 137. Klemm P. Two regulatory fim genes, fimB and fimE, control the phase variation of type 1 fimbriae in Escherichia coli. EMBO J. 1986;5(6):1389–93. pmid:2874022
  138. 138. Holden N, Totsika M, Dixon L, Catherwood K, Gally DL. Regulation of P-fimbrial phase variation frequencies in Escherichia coli CFT073. Infect Immun. 2007;75(7):3325–34. pmid:17452474
  139. 139. Hannan TJ, Mysorekar IU, Chen SL, Walker JN, Jones JM, Pinkner JS, et al. LeuX tRNA-dependent and -independent mechanisms of Escherichia coli pathogenesis in acute cystitis. Mol Microbiol. 2008;67(1):116–28. pmid:18036139
  140. 140. Behzadi P. Classical chaperone-usher (CU) adhesive fimbriome: uropathogenic Escherichia coli (UPEC) and urinary tract infections (UTIs). Folia Microbiol (Praha). 2020;65(1):45–65. pmid:31165977
  141. 141. Matter LB, Spricigo DA, Tasca C, de Vargas AC. Invasin gimB found in a bovine intestinal Escherichia coli with an adherent and invasive profile. Braz J Microbiol. 2015;46(3):875–8. pmid:26413073
  142. 142. Sheikh A, Luo Q, Roy K, Shabaan S, Kumar P, Qadri F, et al. Contribution of the highly conserved EaeH surface protein to enterotoxigenic Escherichia coli pathogenesis. Infect Immun. 2014;82(9):3657–66. pmid:24935979
  143. 143. Whitfield C. Biosynthesis and assembly of capsular polysaccharides in Escherichia coli. Annu Rev Biochem. 2006;75:39–68. pmid:16756484
  144. 144. Bertani B, Ruiz N. Function and biogenesis of lipopolysaccharides. EcoSal Plus. 2018;8(1).
  145. 145. Sommaruga S, Gioia LD, Tortora P, Polissi A. Structure prediction and functional analysis of KdsD, an enzyme involved in lipopolysaccharide biosynthesis. Biochem Biophys Res Commun. 2009;388(2):222–7. pmid:19664604
  146. 146. Russell AB, Peterson SB, Mougous JD. Type VI secretion system effectors: poisons with a purpose. Nat Rev Microbiol. 2014;12(2):137–48. pmid:24384601
  147. 147. Li J, Yao Y, Xu HH, Hao L, Deng Z, Rajakumar K, et al. SecReT6: a web-based resource for type VI secretion systems found in bacteria. Environ Microbiol. 2015;17(7):2196–202. pmid:25640659
  148. 148. Barret M, Egan F, O’Gara F. Distribution and diversity of bacterial secretion systems across metagenomic datasets. Environ Microbiol Rep. 2013;5(1):117–26. pmid:23757140
  149. 149. Wang S, Dai J, Meng Q, Han X, Han Y, Zhao Y, et al. DotU expression is highly induced during in vivo infection and responsible for virulence and Hcp1 secretion in avian pathogenic Escherichia coli. Front Microbiol. 2014;5:588. pmid:25426107
  150. 150. Hachani A, Lossi NS, Hamilton A, Jones C, Bleves S, Albesa-Jové D, et al. Type VI secretion system in Pseudomonas aeruginosa: secretion and multimerization of VgrG proteins. J Biol Chem. 2011;286(14):12317–27. pmid:21325275
  151. 151. Kim K-S, Pelton JG, Inwood WB, Andersen U, Kustu S, Wemmer DE. The Rut pathway for pyrimidine degradation: novel chemistry and toxicity problems. J Bacteriol. 2010;192(16):4089–102. pmid:20400551
  152. 152. Danielsen S, Kilstrup M, Barilla K, Jochimsen B, Neuhard J. Characterization of the Escherichia coli codBA operon encoding cytosine permease and cytosine deaminase. Mol Microbiol. 1992;6(10):1335–44. pmid:1640834
  153. 153. Kim NY, Kim OB. The ybcF Gene of Escherichia coli encodes a local orphan enzyme, catabolic carbamate kinase. J Microbiol Biotechnol. 2022;32(12):1527–36. pmid:36384810
  154. 154. Wang L, Zhou YJ, Ji D, Lin X, Liu Y, Zhang Y, et al. Identification of UshA as a major enzyme for NAD degradation in Escherichia coli. Enzyme Microb Technol. 2014;58–59:75–9. pmid:24731828
  155. 155. Alber A, Morris KM, Bryson KJ, Sutton KM, Monson MS, Chintoan-Uta C, et al. Avian Pathogenic Escherichia coli (APEC) strain-dependent immunomodulation of respiratory granulocytes and mononuclear phagocytes in CSF1R-reporter transgenic chickens. Front Immunol. 2020;10:3055. pmid:31998322
  156. 156. Pajor AM. Molecular properties of the SLC13 family of dicarboxylate and sulfate transporters. Pflugers Arch. 2006;451(5):597–605. pmid:16211368
  157. 157. Pos KM, Dimroth P, Bott M. The Escherichia coli citrate carrier CitT: a member of a novel eubacterial transporter family related to the 2-oxoglutarate/malate translocator from spinach chloroplasts. J Bacteriol. 1998;180(16):4160–5. pmid:9696764
  158. 158. Rabus R, Jack DL, Kelly DJ, Saier MH Jr. TRAP transporters: an ancient family of extracytoplasmic solute-receptor-dependent secondary active transporters. Microbiology (Reading). 1999;145(Pt 12):3431–45. pmid:10627041
  159. 159. Valentini M, Storelli N, Lapouge K. Identification of C(4)-dicarboxylate transport systems in Pseudomonas aeruginosa PAO1. J Bacteriol. 2011;193(17):4307–16. pmid:21725012
  160. 160. Rodríguez-Beltrán J, Tourret J, Tenaillon O, López E, Bourdelier E, Costas C, et al. High Recombinant Frequency in Extraintestinal Pathogenic Escherichia coli Strains. Mol Biol Evol. 2015;32(7):1708–16. pmid:25804522
  161. 161. Didelot X, Méric G, Falush D, Darling AE. Impact of homologous and non-homologous recombination in the genomic evolution of Escherichia coli. BMC Genomics. 2012;13:256. pmid:22712577
  162. 162. Glasner ME, Truong DP, Morse BC. How enzyme promiscuity and horizontal gene transfer contribute to metabolic innovation. FEBS J. 2020;287(7):1323–42. pmid:31858709
  163. 163. Milner DS, Attah V, Cook E, Maguire F, Savory FR, Morrison M, et al. Environment-dependent fitness gains can be driven by horizontal gene transfer of transporter-encoding genes. Proc Natl Acad Sci U S A. 2019;116(12):5613–22. pmid:30842288
  164. 164. Lauro FM, Eloe-Fadrosh EA, Richter TKS, Vitulo N, Ferriera S, Johnson JH, et al. Ecotype diversity and conversion in Photobacterium profundum strains. PLoS One. 2014;9(5):e96953. pmid:24824441
  165. 165. Campanaro S, Vezzi A, Vitulo N, Lauro FM, D’Angelo M, Simonato F, et al. Laterally transferred elements and high pressure adaptation in Photobacterium profundum strains. BMC Genomics. 2005;6:122. pmid:16162277
  166. 166. Pang R, Xie T, Wu Q, Li Y, Lei T, Zhang J, et al. Comparative genomic analysis reveals the potential risk of Vibrio parahaemolyticus Isolated From Ready-To-Eat Foods in China. Front Microbiol. 2019;10:186. pmid:30792709
  167. 167. Roslan NN, Ngalimat MS, Leow ATC, Oslan SN, Baharum SN, Sabri S. Genomic and phenomic analysis of a marine bacterium, Photobacterium marinum J15. Microbiol Res. 2020;233:126410. pmid:31945517
  168. 168. Yang C-H, Li Y-H. Chromobacterium violaceum infection: a clinical review of an important but neglected infection. J Chin Med Assoc. 2011;74(10):435–41. pmid:22036134
  169. 169. Redford P, Roesch PL, Welch RA. DegS is necessary for virulence and is among extraintestinal Escherichia coli genes induced in murine peritonitis. Infect Immun. 2003;71(6):3088–96. pmid:12761086
  170. 170. Spiga L, Winter MG, Furtado de Carvalho T, Zhu W, Hughes ER, Gillis CC, et al. An oxidative central metabolism enables Salmonella to utilize microbiota-derived succinate. Cell Host Microbe. 2017;22(3):291-301.e6. pmid:28844888
  171. 171. Billard-Pomares T, Clermont O, Castellanos M, Magdoud F, Royer G, Condamine B, et al. The arginine deiminase operon is responsible for a fitness trade-off in extended-spectrum-β-lactamase-producing strains of Escherichia coli. Antimicrob Agents Chemother. 2019;63(8):e00635-19. pmid:31138573
  172. 172. Zhang Y, Tan H, Yang S, Huang Y, Cai S, Jian J, et al. The role of dctP gene in regulating colonization, adhesion and pathogenicity of Vibrio alginolyticus strain HY9901. J Fish Dis. 2022;45(3):421–34. pmid:34931326