The isolation and characterization of Stenotrophomonas maltophilia T4-like bacteriophage DLP6

Increasing isolation of the extremely antibiotic resistant bacterium Stenotrophomonas maltophilia has caused alarm worldwide due to the limited treatment options available. A potential treatment option for fighting this bacterium is ‘phage therapy’, the clinical application of bacteriophages to selectively kill bacteria. Bacteriophage DLP6 (vB_SmoM-DLP6) was isolated from a soil sample using clinical isolate S. maltophilia strain D1571 as host. Host range analysis of phage DLP6 against 27 clinical S. maltophilia isolates shows successful infection and lysis in 13 of the 27 isolates tested. Transmission electron microscopy of DLP6 indicates that it is a member of the Myoviridae family. Complete genome sequencing and analysis of DLP6 reveals its richly recombined evolutionary history, featuring a core of both T4-like and cyanophage genes, which suggests that it is a member of the T4-superfamily. Unlike other T4-superfamily phages however, DLP6 features a transposase and ends with 229 bp direct terminal repeats. The isolation of this bacteriophage is an exciting discovery due to the divergent nature of DLP6 in relation to the T4-superfamily of phages.


Introduction
The spread and increasing incidence of antibiotic resistance is imminent, with experts suggesting we will face a "post-antibiotic era" in the 21 st century [1]. The situation has recently become dire, as a new mechanism of antibiotic resistance towards one of the globally-recognized 'last-resort' antibiotics colistin has evolved [2]. The extremely antibiotic resistant bacterium Stenotrophomonas maltophilia has been increasingly identified as a causative agent in both nosocomial and community-acquired infections [3]. Infections associated with S. maltophilia include, but are not limited to pneumonia, bacteremia, meningitis, endocarditis, catheter-related bacteremia/septicemia and acute exacerbations in patients with cystic fibrosis and chronic obstructive pulmonary disease [3,4]. Due to the ubiquitous nature of S. maltophilia in the environment and the possibility of spreading this bacterium through cough-generated aerosols, infection prevention has proven to be difficult [3,5]. Once infected with S. maltophilia, treatment options are limited due to its innate resistance to a broad array of antibiotics including trimethoprim / sulfamethoxazole, β-lactams, macrolides, cephalosporins, fluoroquinolones, aminoglycosides, carbapenems, chloramphenicol, tetracyclines, and polymyxin. Due to the limited treatment options, alternative strategies are being investigated in order to combat this extremely drug resistant bacterium.
The clinical application of bacteriophages to selectively kill infecting bacteria, known as "phage therapy", is a potential solution to extremely antibiotic resistant bacteria. During the Second World War, Soviet and German armies utilized phage therapy to treat dysentery of their soldiers, while the United States military conducted classified research on it [6]. Additionally, some clinicians worldwide continued to use phage therapy from the 1920s to the early 1950s [6]. Unfortunately, with the advent of broad-spectrum antibiotics and a limited understanding of phage biology, phage therapy was largely abandoned in the West. However, with the recent significant rise in antibiotic resistance in bacterial pathogens throughout the world, interest in the efficacious use of phage therapy has been renewed. Recent studies utilizing phage therapy to treat multi-drug resistant infections in animal models [7][8][9][10][11][12][13][14][15][16] and human clinical trials [17][18][19][20] have shown that phages can be a successful treatment option. To use phages in the clinical treatment of infections, the U.S. Food and Drug Administration requires characterization of the phages to prove they do not include moronic genes encoding toxins or other undesirable proteins which could enhance bacterial virulence [21]. Therefore, all phages isolated for use in clinical therapy must be fully characterized through complete genome sequencing and functional analysis. Towards that goal, the isolation and characterization of the novel S. maltophilia phage vB_SmoM-DLP6 (DLP6) is described herein. This phage is related to T4-superfamily of phages and exhibits an interesting combination of T4 and cyanobacteriophage genes.

Bacterial strains and growth conditions
Five S. maltophilia strains were acquired from the Canadian Burkholderia cepacia complex Research and Referral Repository (CBCCRRR; Vancouver, BC). The S. maltophilia strains used for isolation of phage from soil samples were D1585, D1571, D1614, D1576 and D1568. An additional 22 S. maltophilia strains were gifted from the Provincial Laboratory for Public Health-North (Microbiology), Alberta Health Services, for host range analysis. All strains were grown aerobically overnight at 30˚C on half-strength Luria-Bertani (½ LB) solid medium or in ½ LB broth with shaking at 225 RPM.
Phage isolation, propagation, host range analysis and electron microscopy DLP6 was isolated from planter soil located at the Kinsman Sports Center in Edmonton, Alberta, Canada using strain D1571 and a previously described extraction protocol [22]. Propagation of DLP6 was performed using soft agar overlays: 100 μl liquid culture and 100 μl phage stock were incubated 20 min at room temperature, mixed with 3 ml 0.7% ½ LB top agar, overlaid on a plate of ½ LB solid medium, and incubated at 30˚C until plaque formation was complete. High titre stocks were made by overlaying plates with confluent lysis with 3 ml modified SM and the top agar was scraped into a sterile falcon tube. The top agar was pelleted by centrifugation for 5 min at 10,000 × g and the supernatant was removed and filter-sterilized using a Millex-HA 0.45 μm syringe-driven filter unit (Millipore, Billerica, MA), followed by storage at 4˚C. Titres were obtained using serial dilutions of phage stock into SM, followed by the soft agar overlay technique described above and incubation at 30˚C until plaque formation was complete.
Host range analysis was performed using a panel of 27 clinical S. maltophilia and 19 P. aeruginosa strains. Soft-agar overlays containing 100 μl liquid culture were allowed to solidify for 10 minutes at room temperature. Plates were spotted with 10 μl drops of DLP6 at multiple dilutions and assayed for clearing and/or plaque formation after incubation at 30˚C for 36 h.
For electron microscopy, phage stocks were prepared as described above with the following modifications: ½ LB agarose plates and ½ LB soft agarose were used for overlays, MilliQ-filtered water for phage recovery and a 0.22 μm filter was used for syringe-driven filtration. A carbon-coated copper grid was incubated with lysate for 2 min and stained with 4% uranyl acetate for 30 s. Transmission electron micrographs were captured using a Philips/FEI (Morgagni) transmission electron microscope with charge-coupled device camera at 80 kV (University of Alberta Department of Biological Sciences Advanced Microscopy Facility). The average capsid diameter, tail length and tail width were calculated using Microsoft Excel based on measurements from 10 individual virions.
Phage DNA isolation, RFLP analysis and sequencing DLP6 genomic DNA was isolated from bacteriophage lysate using the Wizard Lambda DNA purification system (Promega Corp., Madison, WI) with a modified protocol [23,24]. A 10 ml aliquot of high-titre filter-sterilized phage lysate was treated with 10 μl DNase I (Thermo Scientific, Waltham, MA), 100 μl 100x DNase I buffer (1 M Tris-HCl, 0.25 M MgCl2, 10 mM CaCl2), and 6 μl RNase (Thermo Scientific) and incubated 1 h at 37˚C to degrade the bacterial nucleic acids. Following incubation, 400 μl of 0.5 M EDTA and 25 μl of 20 mg/ml proteinase K (Applied Biosystems, Carlsbad, CA) was added and incubated 1 h at 55˚C to inactivate DNase I. The lysate was cooled to room temperature and added to 8.4 g of guanidine thiocyanate, along with 1 ml of 37˚C resuspended Wizard DNA Clean-Up Resin (Promega Corporation, Madison, WI). This mixture was rocked for 10 min then pelleted by centrifugation for 10 min at 5,000 x g. The supernatant was drawn off until~5 ml remained. Remaining mixture was resuspended by swirling, transferred to a syringe attached to a Wizard Minicolumn (Promega Corporation). The Wizard Minicolumn was attached to The Vac-Man 1 Jr. Laboratory Vacuum Manifold (Promega Corporation) and placed under vacuum to remove the supernatant. The column was then washed with 2 ml 80% isopropanol and dried by centrifugation for 2 min at 10,000 x g. Phage DNA was eluted from the column following a 1 min incubation of 100 μl of 80˚C nuclease-free water (Integrated DNA Technologies, Coralville, IA) followed by centrifugation for 1 min at 10,000 x g. A NanoDrop ND-1000 spectrophotometer (Thermo Scientific, Waltham, MA) was used to determine purity and concentration of eluted DNA.
Restriction fragment length polymorphism (RFLP) analysis was used with 18 FastDigest (Thermoscientific) restriction enzymes: BamHI, EcoRI, AciI, HpaII, XbaI, HindIII, KpnI, SmaI, ApaI, SaII, PstI, SpHI, SacI, ClaI, Ndel, SpeI, Xhol and HaeIII. Restriction reactions were set up using 1 μl of FastDigest enzyme, 2 μl of FastDigest restriction buffer, 1 μg of phage DNA and topped up to 20 μl with nuclease free water. Reactions were incubated at 37˚C for 20 min and separated on a 1% (wt/vol) agarose gel in 1x TAE (pH 8.0). Sequencing of DLP6 was performed at The Applied Genomics Core at the University of Alberta. Purified DLP6 DNA was prepared for sequencing using a Nextera XT library prep kit, creating a library size of 223 bp. The library was used for paired-end sequencing on a MiSeq (Illumina, San Diego, CA) platform using a MiSeq v2 reagent kit. The Q30 for all reads was 92.5%.

Lifecycle of DLP6
DLP6 resistant colonies of S. maltophilia D1571 were isolated by using the top agar overlay method using a phage stock at a titer of 1x10 5 PFU/mL. Following overnight incubation at 30˚C, a 3 mL aliquot of SM was transferred to each plate, supernatant was collected and used for serial dilutions to obtain superinfection resistant single colonies on ½ LB plates. Individual colonies were picked, washed with SM and used to produce freezer stocks. Superinfection experiments were performed using overnight cultures of the potential lysogens and DLP6 at a 1x10 5 PFU/mL titer. Single colonies for each potential lysogen or pseudolysogen were used for colony PCR to detect the presence of DLP6. Identifying the temporary presence of DLP6 in the cell during pseudolysogeny was determined using specific sets of internal primers for DLP6.

Pulse Field Gel Electrophoresis (PFGE)
Overnight cultures of wild type D1571 and five DLP6 PCR positive single colony isolates were used to generate plugs following the protocol outline by Sueh et al. 2013 [25]. A SpeI (New England Biolabs) or XbaI (Thermo Fisher Scientific) restriction digest was set up for all six samples and incubated for 45 min at 37˚C. A 1% agarose gel made with TAE was used for the PFGE. Switch time was 50-90 for 12 hrs at 6 V/cm. Ladders used were Saccharomyces cerevisiae and Lambda DNA/HindIII markers (Promega) for comparison of high and low molecular weight DNA. Gels were stained using SYBR Safe DNA Gel Stain (Thermo Fisher Scientific) and visualized with a ChemiDoc MP imaging system and the Image Lab software (Bio-Rad).

Bioinformatics analysis
A single contig was assembled using the CLC Genomics Workbench (Qiagen, Toronto, ON). Although no ambiguous regions in the contig were observed, PCR amplification and sequencing was used to confirm the assembly. All attempts by PCR amplification to identify a DNA segment between the direct repeats or to show genome circularization via direct repeat annealing were negative. Open reading frames (ORFs) were identified with the GLIMMER plugin [26] for Geneious [27] using the Bacteria and Archaea setting, as well as GeneMarkS (http:// exon.gatech.edu/GeneMark/genemarks.cgi) for phage [28] and Prodigal [29]. Conserved domain searches were performed using CD-Search [30]. Pfam was used to identify functions for hypothetical protein hits from BLASTP [31]. The contig was annotated and confirmed with an Interactive Remote Invocation Service utilizing the RAST pipeline [32][33][34]. BLASTN and BLASTP (for full genomes and individual proteins, respectively) was used to gain more information for each RAST annotation, and to identify any potential related phages [35]. BLASTP hits above 1.00E -3 were not recorded and the coding sequence (CDS) was annotated as hypothetical. Rho independent terminators were predicted using ARNold [36][37][38] searching both strands. Promoters were predicted using PHIRE [39] and plotted using WebLogo 3 [40]. tRNAs were identified using tRNAscan-SE using the general tRNA model [41]. Multiple sequence alignments were performed with the top 250 BLASTP hits for gp20 and from core T4 and cyanophage proteins [42] using the MUSCLE [43] plugin for Geneious [44]. The maximum number of iterations selected was 8, with the anchor optimization option selected. The trees from iterations 1 and 2 were not retained. The distance measure for iteration 1 was kmer6_6, and was pctid_kimura for subsequent iterations. The clustering method was UPGMB for all iterations. An unrooted tree was constructed from MUSCLE alignments with the FastTree 2.1.5 [45] plugin for Geneious [44]. The Jones-Taylor-Thornton model was used with rate category of sites set to 20. A PROmer comparison was conducted with DLP6 and FM12 with the following parameters: breaklen = 60, maxgap = 30, mincluster = 10, minmatch = 3 [46].

Results and discussion
Isolation, host range and morphology Phage DLP6 was isolated from a soil sample using clinical isolate S. maltophilia strain D1571 as the host. Propagation of DLP6 to high titre has proven difficult in liquid cultures, with liquid grown lysate concentrations remaining constant at 10 6 PFU/ml despite attempts to increase progeny numbers. DLP6 exhibits a unique plaquing inhibition that was previously observed in Burkholderia cepacia complex phages KL1 and AH2 [47], as well as S. maltophlila phages DLP1 and DLP2 [22]. Although high titer stocks (10 10 plaque forming units [PFU]/ml) can easily be obtained using the top agar plating method, use of such high titre stock inhibits the plaque formation on a bacterial lawn. Plaquing of DLP6 is inhibited at titers above 10 6 plaque forming units (PFU/ml). Plaque development occurs readily at 30˚C within 24 hr, forming diffuse plaques with irregular boarders and a mean size of 0.8 ± 0.3 mm. Host range analysis of DLP6 revealed a moderate host range within S. maltophilia clinical isolates, infecting 13 out of 27 clinical isolates (S1 Table). Whereas S. maltophilia phages DLP1 and DLP2 exhibited some cross-species infectivity 22 , extended host range analysis of DLP6 using P. aeruginosa isolates did not yield successful infections. Initially, we produced evidence to suggest that DLP6 existed in S. maltophilia D1571 as a prophage. However, after extensive experimentation it was determined that DLP6 undergoes only pseudolysogeny. PFGE analysis using SpeI or XbaI separately showed no integration of the DLP6 genome into the S. maltophilia D1571 chromosome, and DLP6-specific PCR indicated the genome's presence after 2-3 passages, but not after >5 passages. DLP6 is classified in the Myoviridae family of the Caudovirales order due to its icosahedral head and contractile tail (Fig 1). The average capsid height, tail length and width measurements for DLP6 are 99, 144, and 23 nm respectively.

Genome characterization
Purified DLP6 gDNA was isolated and exposed to a panel of 18 restriction enzymes for RFLP analysis. Even though controls were performed to detect the presence of restriction inhibitors, the DLP6 genome was assembled into a linear scaffold of 168,489 bp with a GC content of 55.8% using 43,112 reads for a mean coverage of 57 reads and an overall Q30 score of 93.1%. The direct terminal repeats are covered by up to 45 reads (S1 Fig). The contig can be found in GenBank with the accession number KU682439.2. The genome is predicted to encode 241 coding DNA sequences and 30 tRNAs of 14 different specificities (Fig 2, S2 and S3 Tables). The genome is arranged in a semi-modular format, with DNA replication/repair genes (dark blue) primarily grouped together, whereas the regulatory genes (light blue) are mainly grouped within the same region as the DNA replication/repair genes. Auxiliary metabolism (black) genes are dispersed throughout the genome, occurring in small pairs rather than a large set. The phage morphogenesis (dark purple) genes are grouped in a cluster, with the exception of gp34 (AAY80_073; long tail fiber). The 30 tRNA genes are grouped together spanning the genome from 92,962-113,834 bp. There is no lysis (red) module in DLP6, rather four genes encoding typical lysis proteins are randomly located throughout the genome.
Three interesting phage-encoded proteins are ADP-ribosyltransferases (Alt: AAY80_209, ModA: AAY80_029, ADP-ribosyltransferase: AAY80_145) that were identified using the Pfam database. Two of these proteins are orthologs to T4 proteins Alt and ModA. It is known that phage T4 encodes three ADP-ribosyltransferases (Alt, ModA and ModB), each modifying specific groups of host proteins. The Alt protein is a component of the phage head and enters the host cell during the infection process with the phage DNA, ModA and ModB [48]. Following entry into the cell, Alt immediately ADP-ribosylates the host RNA polymerase (RNAP), causing transcription of host genes to stop and transcription from the T4 "early" promoters to be carried out instead [49]. The ModA modification of the host RNAP prevents transcription from T4 early promoters and possibly primes the RNAP for T4-encoded auxiliary factors (gp55, gp33, gp45 and gp44/62) to transcribe middle and late genes [50]. Identification of the previously hypothetical proteins into the Alt and ModA families helps to provide insight into the possible role these proteins play in DLP6 phage infection and transcription initiation and regulation.
Phage promoter sequences lack the conserved structure observed in bacterial promoters with -10 and -35 regions; instead, they feature short consensus sequences that are specific to different phages [51]. These short consensus sequences were identified using the phage-specific program PHIRE, and visualized with WebLogo 3 (Fig 3). There are 25 phage promoters identified, with 22 of the phage promoters found repeating in groups of two in front of a gene cluster. A single phage promoter is located upstream of genes AAY80_058 (hypothetical protein) and AAY80_059 (peptidase protein). The next single phage promoter is located upstream of the gene cluster beginning with AAY80_139 (kinase protein) through to AAY80_150 (hypothetical protein). The final single phage promoter is located upstream of a small cluster of hypothetical proteins beginning at AAY80_217 through to AAY80_220. Two phage promoters are found upstream of AAY80_060 (hypothetical protein) in a gene cluster coding for 32 proteins, including DNA primase (AAY80_083) phage tail fiber (AAY80_073), ribonucleotide diphosphate reductase subunit alpha (AAY80_086) and beta (AAY80_089), and many hypothetical proteins (locus tags ending in 060, 063-067, 070-071, 074-077, 079-082, 084-085, 088 and 090). The next set of phage promoters is located upstream of AAY80_092 (hypothetical protein) to AAY80_112 (hypothetical protein). Annotated genes included in this gene cluster are AAY80_098 (RNase H), AAY80_104 (PhoH), AAY80_106 (exonuclease), AAY80_109 (SleB), AAY80_110 (dCMP deaminase) and AAY80_111 (deoxycytidylate deaminase). The next two sets of gene clusters, AAY80_113-120 and AAY80_121-130, encode hypothetical proteins only and both clusters are under control of two promoters each. Gene cluster AAY80_131-138 utilizes two promoter sequences and encodes mainly hypothetical proteins, but also a guanosine 3',5'-bis(diphosphate) 3'-pyrophosphohydrolase (AAY80_131) and a transposase (AAY80_133). The adjacent gene cluster under control of two promoters is upstream of AAY80_151 (hypothetical protein) through to AAY80_158 (hypothetical protein) and contains nine tRNAs. The remaining 21 tRNAs are under control of two promoters upstream of AAY80_159 (hypothetical protein) through to AAY80_180 (hypothetical protein). Two promoters control the next gene cluster spanning from AAY80_181-202 that contains genes encoding many hypothetical proteins and proteins such as DNA ligase (AAY80_192), DNA helicase loader (AAY80_201) and ssDNA binding protein (AAY80_202). Two small gene clusters encoding a total of eight hypothetical proteins (AAY80_213-216 and AAY80_221-224) are both under control of two phage promoters. The last set of double phage promoters controls the gene cluster from AAY80_225 to AAY80_057 where the phage genome is circularized. This final gene cluster contains many proteins involved in DNA replication and homologous recombination.

Phylogeny using DLP6 gp20 protein
A BLASTN search using DLP6 genome as a query revealed the closest hits are Sinorhizobium phages FN3, FM19, FM7 and FM12. All four phages have coverage of only 4% with a 72% identity. Although initial BLASTP and BLASTN searches indicated DLP6 was more closely related to members of the T4-superfamily of phages, more comparisons were required to classify DLP6 as a T4-superfamily phage. For a commonly used phylogenetic comparison, the protein sequence of portal protein gp20 was used in a BLASTP search to identify 250 of the most similar sequences [52][53][54]. The most significant hits came from cyanophages grouped into the T4-superfamily. A MUSCLE alignment was completed using the 250 BLASTP results compared to the DLP6 gp20 protein (AAY80_011). This alignment was then used to generate an unrooted tree with the FastTree plugin for Geneious (Fig 4). The generated tree positions DLP6 in a clade with Sinorhizobium phages FM12, FN3 and Caulobacter phage Cr30 (Fig 4).

DLP6 contains features from T4-superfamily enteric bacteriophages and T4-superfamily cyanophages
Genomic organization of DLP6 is similar to the Sinorhizobium transducing phage FM12, which has been classified into a new T4-superfamily fusing features of cyanophages and phages of enteric bacteria [42] (Fig 5). A set of "core universal" and "nearly universal" proteins has been determined for T4-superfamily phages [55,56]. DLP6 contains all of the core and nearly universal proteins common to all T4-superfamily phages ( Table 2). A MUSCLE comparison was used to align the T4-superfamily phage core proteins against their respective orthologs from 18 T4-superfamily phages, and their percent identity to each ortholog was determined ( Table 2). Similar to FM12, DLP6 contains a second copy of gp19 tail tube monomer that was not found in the 17 other phages studied (21% similarity). The order of the gene products presented in Table 2 corresponds to the order they are found within the DLP6 genome. This order differs from the T4-superfamily phages, which start with gp41 (DNA primase-helicase). Results from the MUSCLE alignment reveal that DLP6 core proteins are most similar to their orthologs from cyanophages, with the exception of the UvsX protein, which is most similar to T4-superfamily phages of enteric bacteria. The alignment also shows DLP6 has the highest percent identity to FM12 core proteins. The ortholog with the highest similarity to a DLP6 protein was FM12 gp23, with 65% similarity. Overall, the most highly conserved proteins between DLP6 and the T4-superfamily cyanophages are RegA (early transcriptional regulator) and gp23 (major capsid protein), with similarity rates averaging 59 and 57% respectively. This suggests that DLP6 is divergent from the T4-superfamily phages. DLP6 does share additional Table 1. Predicted Rho-independent terminators in DLP6. Rho-independent terminators were identified using the ARNold [36,37] program and putative terminators with a ΔG value of -10 kcal/mol or less were retained. DNA that is predicted to form the loop in the RNA is in emboldened, whereas DNA that is predicted to encode an RNA stem is underlined. proteins that are found within the T4-superfamily cyanophages or the T4-superfamily enteric phages. All sequenced T4-superfamily cyanophages feature an accessory core set of 25 gene clusters (T4-GCs) that are not found within enteric bacteria T4-superfamily phages [55]. Of this accessory core, DLP6 encodes six of the core proteins: CobS (porphyrin biosynthetic protein), PhoH (P-starvation inducible protein), T4-CG 313 (hypothetical protein), T4-GC 321 (hypothetical protein), VlrC (predicted structural protein) and MazG (pyrophosphatase) ( Table 3). Although DLP6 does contain these T4-superfamily cyanophage accessory core proteins, the DLP6 proteins are again divergent, with the maximum similarity found with the PhoH (P-starvation inducible protein) at 45% similar to P-HM1 cyanophage.

Start
DLP6 was found to contain nine out of the designated 30 non-cyanophage core T4-superfamily proteins. A MUSCLE alignment of these nine proteins indicates that although this phage does contain the proteins, they do not share high amino acid similarities ( Table 4). The protein sharing the highest similarity was the RnaseH protein, which had a maximum similarity of 32.6% with the RB43 phage protein. Again, these results demonstrate that DLP6,  although classified within the T4-superfamily, it is more divergent than the other accepted members.
Differences between the T4-superfamily superfamily members discussed in this paper as compared to DLP6 are interesting, given that DLP6 contains the complete set of T4-superfamily core group of proteins, six accessory core cyanophage proteins, and nine non-cyanophage accessory core proteins. Moreover, the ends of DLP6 genome feature 229 bp direct terminal repeats, unlike the genomes of other members of the T4-superfamily of phages which are cyclically permuted. This finding is unusual, and suggests that DNA circularization in the host cell occurs at cohesive sites. The direct terminal repeats are located in a region of DNA devoid of ORFs, or homology to known DNA sequences. The average number of tRNAs encoded by the T4-superfamily phages is ten, with the exception of S-CRM01, a freshwater cyanophage that encodes 33 tRNAs [57]. The DLP6 genome contains 30 tRNAs, which is in the high range for the T4-superfamily. DLP6 also encodes a transposase (AAY80_133) (Fig 2, S2 Table).

Conclusions
S. maltophilia bacteriophage DLP6 was isolated from a soil sample using S. maltophilia strain D1571 as the host bacterium. Phage DLP6 exhibits a moderate host range, infecting 13 out of 27 clinical S. maltophilia strains. A phylogenetic comparison of gp20 portal protein against the Table 3. MUSCLE alignment percent identity score of DLP6 amino acid sequences against T4-superfamily cyanophage accessory core proteins. DLP6 contains six of the 25 T4-superfamily cyanophage core proteins. Numbers indicate percent similarity to the related DLP6 protein.

Gene product: function Cyanophages
ΦM12 S-SM2 P-SSM4 Syn1 HTVC008M Syn9 Syn19 S-ShM2 Syn33 S-SM1 P-HM1 S-CRM01 top 250 BLASTP results places DLP6 in a clade with Sinorhizobium phages FM12, FN3 and Caulobacter phage Cr30. Although DLP6 does encode all of the T4-superfamily core and nearly universal core orthologs, the similarity between these proteins and their nearest neighbors is typically less than 60%. The DLP6 genome also encodes six T4-superfamily cyanophage core proteins, but again, the nearest neighbor similarity is below 40%. There are nine T4-superfamily non-cyanophage core proteins found within the DLP6 genome, though the similarity between the DLP6 proteins and the T4-superfamily enteric phage orthologs averaged less than 30% similarity. Unlike other T4-superfamily phages, DLP6 possesses 229 bp direct terminal repeats at the ends of its genome instead of circular permutation. Although DLP6 also encodes a transposon, experimental investigation has shown it does not form a stable prophage in S. maltophilia strain D1571. The results presented in this paper suggest DLP6 is a divergent T4-superfamily phage, exhibiting characteristics not yet identified in other T4-superfamily phages.
Supporting information S1 Fig. Illumina pair-end sequencing reads aligning to start (1A) and end (1B) of DLP6 contig. Geneious was used to map R1 and R2 to DLP6 reference contig. Sensitivity was set to medium sensitivity/fast. Fine-tuning was iterated up to five times. Reads were not trimmed. Coverage at ends was between one and 45. (PDF) S1