Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Whole genome sequencing, characterization and analysis of coronene degrading bacterial strain Halomonas elongata

  • Thasneema Rafic,

    Roles Formal analysis, Methodology, Software, Writing – original draft, Writing – review & editing

    Affiliation Department of Bioengineering, King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia

  • Mohammed Alarawi,

    Roles Methodology, Software

    Affiliation Comparative Genomics and Genetics, King Abdullah University of Science and Technology, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia

  • Omer Salem Alkhnbashi,

    Roles Formal analysis, Resources, Software, Supervision, Writing – review & editing

    Affiliations Department of Information Computer System, King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia, Mohammed Bin Rashid University of Medicine and Health Sciences (MBRU), Dubai Healthcare City, Dubai, United Arab Emirates

  • Assad Al-Thukair,

    Roles Methodology, Writing – review & editing

    Affiliation Department of Bioengineering, King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia

  • Ajibola H. Okeyode,

    Roles Methodology, Writing – review & editing

    Affiliation College of Petroleum Engineering & Geosciences, King Fahd University of Petroleum and Minerals Dhahran, Saudi Arabia

  • Karthikeyan G,

    Roles Formal analysis, Writing – original draft, Writing – review & editing

    Affiliation Department of Microbiology, Kasturba Medical College, Manipal, India

  • Alexis Nzila

    Roles Conceptualization, Funding acquisition, Investigation, Project administration, Writing – original draft, Writing – review & editing

    alexisnzila@kfupm.edu.sa

    Affiliations Department of Bioengineering, King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia, Interdisciplinary Research Center for Membranes and Water Security, King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia

Abstract

Polycyclic aromatic hydrocarbons (PAHs) are persistent environmental pollutants with significant ecological and health risks. Among them, coronene, a high molecular weight PAH, is particularly resistant to biodegradation due to its complex structure. This study characterizes a halophilic bacterial strain, initially identified as Halomonas caseinilytica and later reclassified as Halomonas elongata, capable of utilizing coronene as its sole carbon source under high salinity (10% NaCl). Whole genome sequencing using Oxford Nanopore technology (ONT) revealed 4,308 predicted genes, including those linked to hydrocarbon metabolism, stress adaptation, and secondary metabolite biosynthesis. Pathway analysis identified genes associated with xenobiotic degradation, although no canonical coronene specific degradative enzymes were identified, implying that the bacteria may be utilising an alternative or novel pathway. Comparative annotation uncovered operons and enzymes relevant to aromatic compound breakdown. Notably, the presence of ectoine biosynthesis genes suggests a robust osmoadaptation system. Features such as mobile genetic elements and horizontal gene transfer events were also investigated. These findings expand current knowledge on PAH-degrading halophiles and highlight the potential of H. elongata in bioremediation of saline and hypersaline environments contaminated with complex hydrocarbons. The study also emphasises the potential of long read sequencing technologies in environmental genomics and bioremediation.

Introduction

Polycyclic aromatic hydrocarbons (PAHs) are persistent pollutants that pose serious environmental and health risks due to their toxicity, mutagenicity, and carcinogenicity [1]. These pollutants, commonly originating from the incomplete combustion of organic materials and fossil fuels, are widespread in terrestrial and aquatic ecosystems [2,3].Polycyclic aromatic hydrocarbons (PAHs) can be categorized into two groups. The first group consists of low molecular weight PAHs (LMW-PAHs), which contain two or three aromatic rings. Representative compounds in this group include naphthalene, phenanthrene, and anthracene. The second group comprises high molecular weight PAHs (HMW-PAHs), which contain more than three rings; notable examples include pyrene (four rings), benzo[a]pyrene (five rings), and coronene (seven rings) [35]. The literature contains numerous reports on the biodegradation of both LMW-PAHs (e.g., naphthalene, phenanthrene, and anthracene) and HMW-PAHs (e.g., pyrene and benzo[a]pyrene), including studies conducted under thermophilic, halophilic, and anaerobic conditions [1,512]. Bacteria capable of degrading PAHs have been identified across a wide range of genera. A study summarizing research on Saudi bacterial strains identified 38 different genera capable of degrading PAHs and other petroleum-derived compounds [13].

Comparatively, limited work has been carried out on the degradation of the HMW-PAHs coronene, due to the complexity of it’s structure, which makes it recalcitrant to biodegradation [14]. Three studies have reported the degradation of coronene by strains of Stenotrophomonas maltophilia (formerly known as Burkholderia cepacia) [1517]. However, this degradation was observed only in the presence of pyrene, suggesting that these strains may not be capable of utilizing coronene as a sole carbon source. Recently, our research group identified a novel halophilic bacterial strain, Halomonas caseinilytica 10SCRN4D, isolated from fuel depots on the campus of King Fahd University of Petroleum and Minerals (Dhahran, Saudi Arabia), which was capable of degrading coronene as the sole carbon source under high salinity conditions (10% NaCl w/v). The discovery of H. caseinilytica 10SCRN4D’s unique ability to degrade coronene in highly saline environments opens new avenues for research in PAH bioremediation, particularly in marine and hypersaline ecosystems. In addition, this strain was also capable of degrading other high molecular weight PAHs, including benzo[a]pyrene, phenanthrene, and pyrene, indicating a robust and versatile metabolic potential for PAH degradation [18].

To further elucidate the genetic and metabolic mechanisms underlying this exceptional capability, we have conducted a whole genome sequencing analysis of H. caseinilytica 10SCRN4D. Whole genome sequencing has proven to be an invaluable tool in understanding the metabolic potential and genetic adaptations of microorganisms involved in biodegradation processes [19]. For instance, the genome analysis of Mycobacterium sp. strain CH2, capable of degrading pyrene and benzo[a]pyrene, revealed a complete set of genes responsible for the degradation pathways of these HMW-PAHs [20].

Previous studies relying on Short-read Sequencing Technologies (SRST), such as Illumina, faced challenges in assembling repetitive regions, structural variations, and long operonic sequences, which are critical for understanding microbial genomic architecture [21]. For example, multiple studies underlined the difficulty in assembling repetitive regions using short reads, leading to fragmented assemblies of bacterial genomes [2224]. This could be because limited read lengths and lack of paired-end reads pose impediments for assembly software in resolving repeat regions, leading to fragmented assemblies [23]. Another challenge is the inability of short reads to accurately resolve repetitive genomic regions making it arduous to detect genetic variations [21]. In context of our study, where our strain is expected to have a relatively higher GC content as an extremophile, SRST often does not permit to accurately characterize DNA and RNA with extreme GC content, repetitive homologous sequences, or epigenetic modifications, making SRST a poor choice of sequencing technology [24,25]. These shortcomings inevitably restrict functional annotation and hamper the identification of novel pathways. In contrast, Long-read Sequencing Technologies (LRST) has demonstrated superior capabilities. It enables accurate mapping of sequencing reads to reference genomes, facilitates diverse variant detection methodologies, and introduces innovative approaches for characterizing epigenetic diversity [26]. The advancements in sequencing speed and accuracy, alongside the improved quality of bioinformatics analyses, demonstrate the effectiveness of recent technological innovations and their inherent chemical kits [27]. For instance, Koren et al. [28] in 2013 demonstrated the power of long reads in resolving complete bacterial genomes, including plasmids and repetitive regions, enhancing our understanding of bacterial evolution and pathogenicity. Another study used LRST for single-cell genomics of uncultivated bacteria, providing insights into microbial dark matter and expanding our knowledge of microbial diversity [29]. In the present study, we use LRST to explore the genetic mechanisms underlying coronene degradation in H. caseinilytica 10SCRN4D, we seek to fill the knowledge gap in HMW-PAH biodegradation and offer valuable insights and tools to tackle the enduring issue of PAH contamination across various environmental contexts.

Materials and methods

DNA isolation, whole genome sequencing and quality assessment

The strain used in this study, H. caseinilytica 10SCRN4D was originally isolated from soil samples collected from a fuel station of King Fahd University of Petroleum and Minerals, as described in Okeyode et al.(2023) [18]. In brief, researchers enriched the soil samples under saline conditions using coronene as the only carbon source. This process led to the isolation of this halophilic bacterium, as detailed previously [18].

Bacterial pellet from single colony enrichment was subject to DNA isolation using qiagen MagAttract HMW DNA Kit (Qiagen, Germany). DNA was quantified using Qubit BR Assay Kits (Thermo, USA). 400–500 ng DNA was used to prepare sequencing library for Oxford nano-pore sequencing (ONT) using SQK-LSK109 Ligation Sequencing kit with R9.4.1 flowcell (Oxford Nanopore Technologies, Oxford, UK). The basecalling was performed in realtime using Guppy v5.1.

Bacterial genome assembly and analysis from ONT long reads was performed using the nf-core/bacass pipeline (v2.0.0) using nextflow (v23.04.0) [30]. Raw reads initial quality control and adapter trimming was performed using NanoPlot (v1.38.0) [31] and Porechop (v0.2.4). The de novo assembly was utilized Minimap2 (v2.21-r1071) [32] for read alignment and Miniasm (v0.3-r179) [33] and contig generation. The draft assembly was polished using Minimap2, Racon (v1.4.20) [34], and Medaka (v1.4.3) to improve the sequence accuracy. Finally, assembly quality was assessed using QUAST (v5.0.2) [35], and a comprehensive multi-tool report was generated with MultiQC [36].

The completness of the assembled genome was measured using BUSCO v 5.4.6 (Benchmarking Universal Single-Copy Orthologs) [37], with an E-value cutoff of 0.001 for BLAST searches to ensure high-confidence detection of conserved orthologs while minimizing false positives.

Strain identification

The thorough analysis of the bacterial genome began strain identification using Kraken2 (v2.1.1), for assigning taxonomic labels and detect contamination [38]. Parameters were set at a 0.5 confidence score to balance sensitivity and specificity. Additionally, a minimum hit group of 2 was used to avoid weak or ambiguous taxonomic assignments, improving the reliability of strain identification.

Gene prediction and functional annotation

To ensure comprehensive and accurate gene annotation, three distinct gene prediction tools were employed, each paired with a specific annotation tool. PROKKA [39] was the first tool employed for initial gene prediction, using an e-value cutoff of 1e-06 to ensure highly reliable functional annotations. A 1e-06 cut off was selected to balance specificity and sensitivity as zero is not a valid threshold in BLAST, and this cutoff also minimizes false positives while retaining biological meaningful homologs. The predicted genes were promptly annotated with PROKKA’s integrated annotation system. To enhance the depth of functional insights, hypothetical proteins identified by PROKKA were subjected to CDD (Conserved Domains Databases) searches [40,41]. These searches were performed with a stringent e-value threshold of 0.001 and a maximum of 500 hits to allow for the identification of conserved functional domains even in hypothetical proteins, thereby enhancing the depth and biological relevance of the genomic annotations. The functional insights gained from CDD analysis were then integrated with PROKKA’s annotations, creating a more comprehensive and detailed overview of the predicted genes’ roles and their potential biological significance.

In addition to PROKKA, two other gene prediction tools were employed: PRODIGAL [42] and GeneMarkS2 [43]. The genes predicted by these tools were subsequently annotated using EggNOG-mapper v2, a powerful functional annotation tool. [44]. MAFFT, a multiple sequence alignment program, was utilized to align the gene sequences predicted by all three tools to identify potential discrepancies between the different prediction methods and enhancing the overall accuracy of the annotation process [45]. Finally, gene ontology-based functional annotation was performed using InterProScan and Blast2GO [46].

Identification of genomic features

To gain insights into gene organization and regulatory mechanisms within the genome, Operon Mapper was utilized to identify potential operons, providing information on gene clustering and regulation [47]. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) arrays, known for their role in bacterial immunity and genome editing, were detected using CRISPRCasFinder [48]. This step was crucial to understand the adaptive immune mechanisms of the organism. Additionally, the resistance gene Identifier program of the database CARD (Comprehensive Antibiotic Resistance Database) was used to spot any genes of antibiotic resistance [49].

To further investigate the genome’s structure and evolutionary dynamics, RepeatMasker v4.1.5 [50] was employed to identify repeat elements, and RepeatModeler v2.0.5 [51] was used for de novo annotation of these repetitive sequences. Additionally, Palindrome v5.0.0.1 [52] was applied to detect inverted repeats, with parameters set as follows: lengths ranging from 10 to 100 base pairs target meaningful structural motifs; a maximum gap of 100 base pairs between repeats to accommodate typical regulatory structures and no mismatches allowed to ensure the identification of exact inverted repeats. Mobile genetic elements, which are pivotal in bacterial evolution and environmental adaptation, were identified using MobileOG-DB with e-value score of 1.0e-05 and k value of 1 that would maximize sensitivity, ensuring the identification of all potentially relevant mobile genetic sequences [53]. Furthermore, potential horizontal gene transfer events, critical for the acquisition of novel traits and rapid adaptation, were detected using Alien Hunter [54].Lastly, the presence of secondary metabolite biosynthesis genes, was identified using the antiSMASH web-based tool [54].

Pathway analysis

Two complementary approaches were used for pathway analysis: the RAST (Rapid Annotations using Subsystems Technology) server and KAAS (KEGG Automatic Annotation Server). The RAST server was employed to annotate genes based on curated subsystems and protein families [55,56]. KAAS was utilized for functional annotation of genes [57]. KAAS employed the bi-directional best hit (BBH) method, a reliable technique for identifying orthologous relationships. KO (KEGG Orthology) identifiers assigned through this process were subsequently used to automatically generate KEGG pathways and functional classifications.

Results

Whole genome sequencing and quality assessment

ONT allowed real time detection and generated long reads of 6 contigs combining to a total length of 3966854 bp, of which the largest contig made up 1702422 bp (maybe repot in Mbp or Kbp). The number of N’s per 100 kbp was reported to be zero implying that no ambiguous ‘N’ bases were in in 100,000 bp (Kbp or Mbp) of the assembly, suggesting an assembly with high sequence continuity without gaps. Table 1 summarises the quality assessment report. Overall, the statistics indicated a high-quality genome assembly with few, large, and contiguous sequences, minimal gaps, and a good representation of the genome’s total length (Fig 1). The GC content was reported to be 63.04% which, although within the desirable range of 40%−80%, is still relatively high. Higher GC content often correlates with thermal stability, which suggests that our organism is adapted to high-temperature environments, an information we can confirm from our previous study [18].

thumbnail
Fig 1. Evaluation of quality of the genome assembly.

A. An Nx plot showing the assembly continuity and indicating a high-quality genome assembly with significant coverage achieved by large contigs. B. The plot shows the cumulative length of contigs from genome assembly as a function of contig index. A steep initial slope, which then levels off, indicates that a few long contigs make up a substantial part of the genome. C. The plot shows distribution of GC content across different windows of the assembled genome having a predominant and consistent GC content around 60%. D. The plot shows the peak of GC content across the contigs in the genome assembly and implies a uniform GC content of a little over 60%.

https://doi.org/10.1371/journal.pone.0334420.g001

The completeness of the genome assembly was then analyzed with BUSCO which validates the quality of genome assemblies based on the presence of highly conserved genes. In Fig 2, our results show that out of a total of 619 BUSCOs searched, 506 were complete. This included 505 that are present as a single copy and 1 that is duplicated. This implies that 81% of orthologs that were found in the genome assembly are intact without missing any important regions. The predominance of single-copy BUSCOs and the minimal duplication suggest that our assembly is accurate and largely free from redundancy or misassembly. Additionally, 72 orthologs were fragmented while 41 were missing from the assembly. Low number of missing genes mean only a small proportion of expected genes are absent. This indicates the genome assembly is mostly comprehensive.

thumbnail
Fig 2. BUSCO assessment results displaying three categories of genomic completeness.

Complete (C), Fragmented (F), and Missing (M). The Complete (C) category dominates with approximately 81% of BUSCOs, including 505 single-copy and 1 duplicated. The Fragmented (F) category accounts for about 12%, while the Missing (M) category represents roughly 7%. The chart highlights the high quality and completeness of the genome assembly.

https://doi.org/10.1371/journal.pone.0334420.g002

Taxonomical classification

Taxonomical classification of the sequence was done using Kraken2 software [58] that reclassified the bacteria as H. elongata contradicting the previous 16s rRNA based identification of the strain as H. caseinilytica 10SCRN4D. Fig 3 represents the hierarchical taxonomical classification of the strain. S1 Table shows the output result of the taxonomical identification when the H. elongata had the highest score of association.

Gene prediction and functional annotation

PROKKA predicted a total of 4308 genes within the genome. These genes were categorized as follows: 4227 genes annotated as conserved domain sequences (CDS), 12 genes annotated as rRNA, 68 genes annotated as tRNA and 1 gene annotated as tmRNA. Additionally, 1659 CDS were annotated as hypothetical proteins, representing genes with unidentified or uncertain functions. To further characterize these hypothetical proteins, they were subjected to analysis using Conserved Domain Database (CDD), where 737 hypothetical proteins were identified as specific proteins with defined functions and 396 hypothetical proteins were linked to their respective superfamilies, providing functional insights. However, 526 hypothetical proteins remained uncharacterized, representing sequences with no detectable matches to known proteins or superfamilies (S1S4 Figs).

In addition to PROKKA, PRODIGAL and GeneMarkS2 were utilized for gene prediction (Table 2). PRODIGAL predicted 4234 genes, of which 3785 genes were annotated using EggNOG-mapper. GeneMarkS2 predicted 4280 genes, with 3861 genes annotated via EggNOG-mapper. Gene ontology (GO) assignments were carried out using InterProScan and Blast2GO. The sequence distribution based on Biological, Cellular, and Molecular functions is summarized in Fig 4. 100 proteins were categorized under GO:0006805, corresponding to xenobiotic degradation, indicating potential involvement in detoxification processes. Notably, no proteins were categorized under GO:0019439, which corresponds to aromatic compound catabolism, highlighting a lack of direct annotations related to this specific function.

thumbnail
Table 2. Comparison of the gene prediction and annotation results.

https://doi.org/10.1371/journal.pone.0334420.t002

thumbnail
Fig 4. GO analysis provides a functional snapshot of the strain’s genomic capacity.

The image displays GO results distributed across three major categories: Cellular Component, Molecular Function, and Biological Process. Cytosolic and membrane-associated proteins suggest critical metabolic and transport-related roles; Enzymes involved in transferase, hydrolase, and oxidoreductase activities support the degradation of complex organic compounds like coronene; The biological processes further underline the bacterium’s capacity to adapt, organize its cellular machinery and perform specialized functions.

https://doi.org/10.1371/journal.pone.0334420.g004

Identification of genomic features

Operon Mapper identified 2013 operons out of which at least 9 were associated with aromatic compounds degradation. Two CRISPR sites were identified, one in utg000001l contig and the other in utg000005l contig (S2 Table). No cas sites were detected.

In total, 436 repeat regions were identified. These regions mainly comprised of simple repeats. In addition to the simple repeats, LINEs, SINEs, rRNA and tRNA repeats were also detected (Fig 5). 1288 Palindrome, 121 mobile elements and 47 Horizontal transfer genes were also found. antiSMASH was able to identify three secondary metabolite regions, namely, ectoine, NRPS/ NRPS metallophore, RiPP like protein. Fig 6 shows the gene clusters of the three secondary metabolite biosynthesis. CARD [49] detected 3 antibiotic resistance genes, namely, adeF, rsmA and qacG. Fig 7 represents the whole genome of the bacteria created using Proksee web-based tool [59].

thumbnail
Fig 5. Repeat sequences found in the genome.

T-transfer DNA, IE- Integration/excision, RRR-Replication/Recombination/Repair, P- Phage, STD- Stability/Transfer/Defense.

https://doi.org/10.1371/journal.pone.0334420.g005

thumbnail
Fig 6. Secondary metabolite clusters of NRPS/NRP-metallophore, ectoine and RiPP-like protein.

Each gene clusters consist of core and additional biosynthetic genes, regulatory genes, transport-related genes and resistance gene.

https://doi.org/10.1371/journal.pone.0334420.g006

thumbnail
Fig 7. Schematic circular map of the whole genome of Halomonas elongata.

The map illustrates the location of the different genetic components of the bacterial genome. Used under creative commons license.

https://doi.org/10.1371/journal.pone.0334420.g007

Ectoine production

In recent years, ectoine has been extensively studied for commercial application due to its ability to stabilize cellular components such as DNAs and proteins [60]. From the annotation results, 3 of the enzymes required for ectoine synthesis, namely, Diaminobutyric acid acetyltransferase (ectA), L-2,4-diaminobutyrate-2-oxoglutarate transaminase (ectB) and Ectoine synthase (ectC) were identified. Additionally, Ectoine hydroxylase (ectD) involved in the conversion of ectoine to 5-hydroxyectoin was also found. EctD is not commonly found in all ectoine biosynthesizing bacteria. 5-hydroxyectoin has superior stress-relieving properties [61].

Functional and pathway analysis

The RAST analysis revealed that only 32% of the genome was associated with subsystem categories. Overall, 4393 coding sequences in the genome were Identified using RAST. Of these, 1398 coding sequences were linked to one or more subsystems in the database. Within the category of aromatic compound metabolism, 30 features/genes were identified, highlighting the organism’s potential role in degrading aromatic compounds. Additionally, 2 genes categorized under miscellaneous subsystems were associated with aromatic dioxygenase activity (S3 Table). RAST also identified other important subcategories including resistance to antibiotics and toxic compounds, Invasion and intracellular resistance, Prophage and phage packaging machinery suggesting mechanisms for survival in challenging environments, host interaction capabilities and phage related function (Fig 8).

thumbnail
Fig 8. A pie chart representing the subsystems coverage of the genome using the RAST database.

Subsystems include “Metabolism of Aromatic Compounds,” “Amino Acids and Derivatives,” “Carbohydrates,” “Phosphorus Metabolism,” “Secondary Metabolism,” “Stress Response,” “Membrane Transport,” and others. The chart highlights functional diversity with significant portions for metabolic, transport, and stress adaptation-related processes.

https://doi.org/10.1371/journal.pone.0334420.g008

Pathway analysis using InterProScan results identified 266 KEGG pathways in addition to 1866 sequences that were found to be associated to one or more pathways. KASS produced KO list which helped in mapping the pathways. A total of 2101 genes were annotated with KO numbers. Pathway mapping allowed to see the different pathways our bacterial genome aligns with in the KEGG databases. Under the category of xenobiotic degradation pathways, 13 partial KEGG pathways were identified. One reason to explain this would be the probable presence of alternative pathways which may not be a part of the standard KEGG modules. S5 Fig shows the pathway map for the degradation of PAHs mediated by cytochrome P450.

Discussion

The advent of LRST, such as ONS used in our study, has revolutionized microbial genomics by enabling high-contiguity assemblies and the resolution of complex genomic features. This study leverages the strengths of LRST to elucidate the genomic adaptations of H. elongata (previously H. caseinilytica 10SCRN4D). Transitioning from 16S rRNA sequencing to whole genome sequencing resulted in the reclassification of our strain from H. caseinilytica 10SCRN4D to H. elongata, accentuating the evolving nature of bacterial taxonomy and the limitations of 16S rRNA-based identification methods [62]. This phenomenon is not unique to our study, as similar reclassifications have been observed in other bacterial genera. For instance, WGS-based analyses led to the proposed combination of two Clostridium species in a 2021 study [63], and another research effort reclassified an Elizabethkingia miricola strain as E. bruuniana [64]. These recurring instances of species reclassification can be attributed to the insufficient resolution of 16S rRNA-based identification, particularly when distinguishing closely related species within genetically complex genera like Halomonas [65]. The genetic similarity among Halomonas species complicates accurate classification when relying solely on the 16S rRNA gene [62]. Relatively, LRST provides a comprehensive genetic landscape, enabling more precise and nuanced species determination. Our study’s findings highlight the advantages of LRST in uncovering subtle genomic differences crucial for accurate taxonomic classification [64].

Recent studies have highlighted the potential of halophilic bacteria in degrading HMW-PAHs under saline conditions. Nanca et al. [66] isolated halophilic bacteria from Philippine salt beds capable of degrading pyrene, fluorene, and fluoranthene, demonstrating the versatility of halophiles in PAH degradation. Other studies have demonstrated the ability of various Halomonas sp. to degrade aromatic hydrocarbons under hypersaline conditions. For instance, H. organivorans has been reported to degrade phenol, salicylate, and benzoate, utilizing pathways involving phenol hydroxylase and catechol 2,3-dioxygenase enzymes [67].Similarly, Halomonas sp. strain ML-15 was shown to degrade phenanthrene effectively under haloalkaliphilic conditions, emphasizing the adaptability of Halomonas species to extreme environments [68]. Halomonas sp. strain C2SS100 has exhibited the capacity to degrade hydrocarbons under high salinity, highlighting the genus’s adaptability to extreme environments [69].Our strain of study, as observed from our previous research, was capable of degrading coronene at the same rate as that of any LMW-PAHs and at a salinity ranging between 0.5% to 10% [18]. Renowned for their ectoine producing ability, H. elongata is a halophilic γ-proteobacterium that has an optimal growth at salt concentrations ranging from 3.5% to 20% NaCl [62]. Despite the Halomonas sp. remarkable capability, the degradation of HMW PAHs such as coronene by H. elongata has not been previously reported in the literature, highlighting the novelty and significance of our findings.

The gene prediction and annotation results from multiple tools (PROKKA, PRODIGAL, and GenemarkS2) provide a comprehensive view of the H. elongata strain’s genomic content. The consistent gene count across different prediction algorithms (4308, 4234, and 4280, respectively) lends credibility to the overall gene density and supports the robustness of the genome assembly. PROKKA’s annotation revealed a high proportion of protein-coding sequences (4227 CDS) and essential RNA genes, indicating a complete set of translational machinery crucial for cellular function [70]. However, high number of hypothetical proteins (1659 out of 4227 CDS) initially annotated by PROKKA highlights the current limitations in our knowledge of bacterial gene functions, particularly in less-studied genera like Halomonas. The subsequent analysis of these hypothetical proteins using CDD reduced the number of truly uncharacterized proteins from 1659 to 526. This significant reduction emphasizes the importance of using multiple annotation tools and databases to maximize functional assignments as done in this study. The remaining 526 hypothetical proteins with no identified domains or superfamilies represent potential targets for future experimental characterization. These could be genes unique to Halomonas or even strain-specific adaptations, possibly playing roles in the organism’s specific environmental niche, and in our case, the ability to degrade coronene [65,71].

Comparative genomic analyses have further elucidated the mechanisms underlying PAH degradation in halophilic bacteria. Pontibacillus chungwhensis HN14, for example, possesses gene clusters associated with PAH degradation pathways, emphasizing the genetic basis for their catabolic capabilities [72]. These findings align with our genomic analysis of H. elongata, which revealed genes involved in aromatic compound degradation, antibiotic resistance, and stress adaptation. GO analysis with InterProScan and Blast2GO provided a general overview of the genome’s functional landscape. The identification of 100 proteins categorized under xenobiotic metabolic processes (GO:0006805) potentially addresses the strain’s ability to degrade PAHs. However, the absence of proteins categorized under aromatic compound catabolism (GO:0019439) presents a contradiction that could be interpreted in several ways, including the possibility of alternative or novel pathways for aromatic compound degradation not yet captured by current GO terms or specific genes may not be well-represented in existing databases [62,73]. This can be backed by the knowledge that GO has not completely established its ontology and has limited coverage of multi-functional genes [73].

RAST and KEGG pathway analyses provide insights into the strain’s functional capabilities and metabolic potential. The relatively low percentage of genes assigned to RAST subsystems (32%) implies a substantial number of unique or poorly characterized genes [56]. The identification of 13 partial KEGG pathways related to xenobiotic degradation is of interest, although the absence of complete pathways could be due to the use of unique or modified pathways not defined in KEGG modules or the genes may have slight variations resulting in them not being assigned with a KO number [74].

The results from Operon Mapper, CRISPR analysis, repeat region identification, CARD and antiSMASH provide valuable insights into the genomic organization and functional potential. Particularly, the nine operons associated with aromatic compound degradation corroborates the earlier Gene Ontology results indicating xenobiotic degradation potential, and at the same time suggesting that H. elongata strain under study may possess specialized pathways to break down aromatic compounds. The detection of CRISPR site but the absence of cas genes is intriguing. This either means the CRISPRs identified are non-functional, orphan CRISPR arrays or the cas genes are present but not identified by the current annotation methods [75]. Antibiotic resistance to fluoroquinolone, tetracycline, diaminopyrimidine and phenolic compounds is mainly due to the presence of efflux proteins rsmA and adeF. The gene qacG confers to it’s resistance to disinfecting agents and antiseptics [49]. Additionally, the 47 genes found to be horizontally transferred could play a role in the strain’s ability to degrade coronene. This assumption is based off of Han and co-workers research in 2025 where they observed that the ability of Altererythrobacter sp. H2 to degrade PAHs was due to horizontal gene transfer [76].

Secondary metabolite regions, particularly ectoine biosynthesis cluster is consistent with the halophilic nature of the strain as ectoine is used for osmotic balance in halophilic bacteria [60,77]. The presence of NRPS clusters, including one encoding a metallophore, would suggest the capacity to produce complex secondary metabolites participating in metal acquisition or other ecological interactions that plays a prominent role in bioremediation. RiPP-like (Ribosomally synthesized and Post-translationally modified Peptide) cluster signifies the potential for the production of bioactive peptides which has the prospect to be explored in anti-microbial activity studies [78]. Most importantly, the presence of 4 genes involved in ectoine synthesis makes our strain an important candidate for research in cosmetics and medicine. On the other hand, RAST and KEGG pathway analyses provided insights into the strain’s functional capabilities and metabolic potential. The relatively low percentage of genes assigned to RAST subsystems (32%) implies a substantial number of unique or poorly characterized genes [56]. The identification of 13 partial KEGG pathways related to xenobiotic degradation is of interest, although the absence of complete pathways could be due to the use of unique or modified pathways not defined in KEGG modules or the genes may have slight variations resulting in them not being assigned with a KO number [74].

While this study provides a detailed genomic analysis of H. elongata and its potential role in coronene degradation, it is constrained by the lack of functional validation through transcriptomic or proteomic data. The genes and pathways identified here, though computationally annotated, require experimental confirmation to establish their specific roles in PAH metabolism. Given the structural complexity and limited existing knowledge regarding the biodegradation pathways for coronene, we initially hypothesized that H. elongata might utilize established degradation pathways known for other PAHs, such as naphthalene or phenanthrene. Surprisingly, genome analysis revealed that H. elongata lacks key enzymes commonly associated with these canonical PAH degradation pathways. As the degradation intermediates of coronene were not characterized, our understanding of the complete metabolic pathway is limited. Future studies incorporating gene knockout, heterologous expression, and metabolite profiling will be essential to verify the function of key enzymes and to clarify the molecular mechanisms enabling coronene degradation under high salinity conditions.

Conclusion

This study presents a comprehensive genomic analysis of H. elongata (previously classified as H. caseinilytica), revealing its exceptional potential for degrading coronene, a HMW-PAH, under saline conditions. By utilizing LRST coupled with advanced bioinformatics tools, we identified specific genetic components and pathways related to xenobiotic metabolism, production of secondary metabolites, and adaptive mechanisms such as horizontal gene transfer and CRISPR arrays. These genetic insights highlight the organism’s adaptability and underscore its significant promise for environmental applications.

Broader implications of our findings include potential utilization of H. elongata in bioremediation strategies for marine and hypersaline ecosystems contaminated with complex hydrocarbons, as well as opportunities for industrial biotechnology applications, particularly involving halotolerant secondary metabolite production like ectoine. However, this genomic study faces limitations, notably the absence of functional validation through transcriptomic, proteomic, and metabolomic analyses. Consequently, the specific biochemical mechanisms underlying coronene degradation remain hypothetical and require confirmation through experimental studies.

Future research should explicitly focus on validating the identified metabolic pathways, characterizing unannotated or hypothetical proteins, and exploring industrially relevant secondary metabolites. Targeted genetic experiments, including gene knockouts and metabolite profiling, are essential next steps to fully harness the biotechnological and environmental potentials of H. elongata. Such future studies will significantly strengthen our understanding and enable the practical deployment of this microorganism to sustainably mitigate PAH contamination in challenging environmental settings.

Supporting information

S1 Fig. Genes associated with superfamilies predicted on CDD.

https://doi.org/10.1371/journal.pone.0334420.s001

(DOCX)

S2 Fig. Genes associated with superfamilies predicted on CDD.

https://doi.org/10.1371/journal.pone.0334420.s002

(DOCX)

S3 Fig. Genes associated with superfamilies predicted on CDD.

https://doi.org/10.1371/journal.pone.0334420.s003

(DOCX)

S4 Fig. Genes associated with superfamilies predicted on CDD.

https://doi.org/10.1371/journal.pone.0334420.s004

(DOCX)

S5 Fig. Pathway mapping of metabolism of xenobiotics by cytochrome P450 from KASS The green boxes indicate the genes that are present in our gene list while the blue boxes indicate those that are absent but should have been present.

Where EC: 2.5.1.18 is glutathione S-transferase.

https://doi.org/10.1371/journal.pone.0334420.s005

(DOCX)

S1 Table. Kraken2 taxonomical classification.

https://doi.org/10.1371/journal.pone.0334420.s006

(DOCX)

S3 Table. RAST Annotation corresponding to aromatic compound metabolism.

https://doi.org/10.1371/journal.pone.0334420.s008

(DOCX)

References

  1. 1. Haritash AK, Kaushik CP. Biodegradation aspects of polycyclic aromatic hydrocarbons (PAHs): a review. J Hazard Mater. 2009;169(1–3):1–15. pmid:19442441
  2. 2. Kadri T, Rouissi T, Kaur Brar S, Cledon M, Sarma S, Verma M. Biodegradation of polycyclic aromatic hydrocarbons (PAHs) by fungal enzymes: A review. J Environ Sci (China). 2017;51:52–74. pmid:28115152
  3. 3. Abdel-Shafy HI, Mansour MSM. A review on polycyclic aromatic hydrocarbons: Source, environmental impact, effect on human health and remediation. Egyptian J Petroleum. 2016;25(1):107–23.
  4. 4. Lawal AT. Polycyclic aromatic hydrocarbons. A review. Cogent Environ Sci. 2017;3(1):1339841.
  5. 5. Patel AB, Shaikh S, Jain KR, Desai C, Madamwar D. Polycyclic Aromatic Hydrocarbons: Sources, Toxicity, and Remediation Approaches. Front Microbiol. 2020;11:562813.
  6. 6. Heker I, Samak NA, Kong Y, Meckenstock RU. Anaerobic degradation of polycyclic aromatic hydrocarbons. Appl Environ Microbiol. 2025;91(4):e0226824. pmid:40172203
  7. 7. Gupta G, Kumar V, Pal AK. Microbial Degradation of High Molecular Weight Polycyclic Aromatic Hydrocarbons with Emphasis on Pyrene. Polycyclic Aromatic Compounds. 2017;39(2):124–38.
  8. 8. Kanaly RA, Harayama S. Biodegradation of High-Molecular-Weight Polycyclic Aromatic Hydrocarbons by Bacteria. J Bacteriol. 2000;182(8):2059–67.
  9. 9. Nzila A. Biodegradation of high-molecular-weight polycyclic aromatic hydrocarbons under anaerobic conditions: Overview of studies, proposed pathways and future perspectives. Environ Pollut. 2018;239:788–802. pmid:29751337
  10. 10. Nzila A. Current Status of the Degradation of Aliphatic and Aromatic Petroleum Hydrocarbons by Thermophilic Microbes and Future Perspectives. Int J Environ Res Public Health. 2018;15(12):2782. pmid:30544637
  11. 11. Nzila A, Musa MM. Current Status of and Future Perspectives in Bacterial Degradation of Benzo[a]pyrene. Int J Environ Res Public Health. 2020;18(1):262. pmid:33396411
  12. 12. Dhar K, Subashchandrabose SR, Venkateswarlu K, Krishnan K, Megharaj M. Anaerobic Microbial Degradation of Polycyclic Aromatic Hydrocarbons: A Comprehensive Review. Rev Environ Contam Toxicol. 2020;251:25–108. pmid:31011832
  13. 13. Mordecai J, Al-Thukair A, Musa MM, Ahmad I, Nzila A. Bacterial Degradation of Petroleum Hydrocarbons in Saudi Arabia. Toxics. 2024;12(11):800. pmid:39590980
  14. 14. Ghosal D, Ghosh S, Dutta TK, Ahn Y. Current State of Knowledge in Microbial Degradation of Polycyclic Aromatic Hydrocarbons (PAHs): A Review. Front Microbiol. 2016;7.
  15. 15. Juhasz AL, Britz ML, Stanley GA. Degradation of fluoranthene, pyrene, benz[ a ]anthracene and dibenz[ a , h ]anthracene by Burkholderia cepacia . Journal of Applied Microbiology. 1997;83(2):189–98.
  16. 16. Juhasz AL, Britz ML, Stanley GA. Degradation of high molecular weight polycyclic aromatic hydrocarbons by Pseudomonas cepacia. Biotechnol Lett. 1996;18:577–82.
  17. 17. Juhasz AL, Stanley GA, Britz ML. Microbial degradation and detoxification of high molecular weight polycyclic aromatic hydrocarbons by Stenotrophomonas maltophilia strain VUN 10,003. Lett Appl Microbiol. 2000;30(5):396–401.
  18. 18. Okeyode AH, Al-Thukair A, Chanbasha B, Nazal MK, Afuecheta E, Musa MM, et al. Degradation of the highly complex polycyclic aromatic hydrocarbon coronene by the halophilic bacterial strain Halomonas caseinilytica, 10SCRN4D. Archives of Environmental Protection. 2023;:78–86.
  19. 19. Lou F, Okoye CO, Gao L, Jiang H, Wu Y, Wang Y, et al. Whole-genome sequence analysis reveals phenanthrene and pyrene degradation pathways in newly isolated bacteria Klebsiella michiganensis EF4 and Klebsiella oxytoca ETN19. Microbiol Res. 2023;273:127410. pmid:37178499
  20. 20. Qutob M, Rafatullah M, Muhammad SA, Alosaimi AM, Alorfi HS, Hussein MA. A Review of Pyrene Bioremediation Using Mycobacterium Strains in a Different Matrix. Fermentation. 2022;8(6):260.
  21. 21. Kumar KR, Cowley MJ, Davis RL. Next-generation sequencing and emerging technologies. Semin Thromb Hemost. 2019;45:661–73.
  22. 22. Treangen TJ, Salzberg SL. Repetitive DNA and next-generation sequencing: computational challenges and solutions. Nat Rev Genet. 2011;13(1):36–46. pmid:22124482
  23. 23. Pop M, Salzberg SL. Bioinformatics challenges of new sequencing technology. Trends Genet. 2008;24:142.
  24. 24. Kim J, Lee C, Ko BJ, Yoo DA, Won S, Phillippy AM, et al. False gene and chromosome losses in genome assemblies caused by GC content variation and repeats. Genome Biol. 2022;23(1):204. pmid:36167554
  25. 25. Chen X, Xu H, Shu X, Song C-X. Mapping epigenetic modifications by sequencing technologies. Cell Death Differ. 2025;32(1):56–65. pmid:37658169
  26. 26. Kaplun L, Krautz-Peterson G, Neerman N, Stanley C, Hussey S, Folwick M, et al. ONT long-read WGS for variant discovery and orthogonal confirmation of short read WGS derived genetic variants in clinical genetic testing. Front Genet. 2023;14:1145285. pmid:37152986
  27. 27. Pollard MO, Gurdasani D, Mentzer AJ, Porter T, Sandhu MS. Long reads: their purpose and place. Hum Mol Genet. 2018;27(R2):R234–41. pmid:29767702
  28. 28. Koren S, Harhay GP, Smith TPL, Bono JL, Harhay DM, Mcvey SD, et al. Reducing assembly complexity of microbial genomes with single-molecule sequencing. Genome Biol. 2013;14(9):R101. pmid:24034426
  29. 29. Sharon I, Kertesz M, Hug LA, Pushkarev D, Blauwkamp TA, Castelle CJ, et al. Accurate, multi-kb reads resolve complex populations and detect rare microorganisms. Genome Res. 2015;25(4):534–43. pmid:25665577
  30. 30. Ewels PA, Peltzer A, Fillinger S, Patel H, Alneberg J, Wilm A, et al. The nf-core framework for community-curated bioinformatics pipelines. Nat Biotechnol. 2020;38(3):276–8. pmid:32055031
  31. 31. De Coster W, Rademakers R. NanoPack2: population-scale evaluation of long-read sequencing data. Bioinformatics. 2023;39(5).
  32. 32. Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34(18):3094–100.
  33. 33. Li H. Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences. Bioinformatics. 2016;32(14):2103–10. pmid:27153593
  34. 34. Vaser R, Sović I, Nagarajan N, Šikić M. Fast and accurate de novo genome assembly from long uncorrected reads. Genome Res. 2017;27(5):737–46. pmid:28100585
  35. 35. Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013;29(8):1072–5. pmid:23422339
  36. 36. Ewels P, Magnusson M, Lundin S, Käller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics. 2016;32(19):3047–8. pmid:27312411
  37. 37. Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31(19):3210–2. pmid:26059717
  38. 38. Lu J, Rincon N, Wood DE, Breitwieser FP, Pockrandt C, Langmead B, et al. Metagenome analysis using the Kraken software suite. Nat Protoc. 2022;17(12):2815–39. pmid:36171387
  39. 39. Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30(14):2068–9. pmid:24642063
  40. 40. Marchler-Bauer A, Lu S, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C, et al. CDD: a Conserved Domain Database for the functional annotation of proteins. Nucleic Acids Res. 2011;39(Database issue):D225-9. pmid:21109532
  41. 41. Wang J, Chitsaz F, Derbyshire MK, Gonzales NR, Gwadz M, Lu S, et al. The conserved domain database in 2023. Nucleic Acids Res. 2023;51(D1):D384–8. pmid:36477806
  42. 42. Hyatt D, Chen G-L, Locascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11:119. pmid:20211023
  43. 43. Lomsadze A, Gemayel K, Tang S, Borodovsky M. Modeling leaderless transcription and atypical genes results in more accurate gene prediction in prokaryotes. Genome Res. 2018;28(7):1079–89. pmid:29773659
  44. 44. Cantalapiedra CP, Hernández-Plaza A, Letunic I, Bork P, Huerta-Cepas J. eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale. Mol Biol Evol. 2021;38(12):5825–9. pmid:34597405
  45. 45. Katoh K, Rozewicki J, Yamada KD. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. 2019;20(4):1160–6. pmid:28968734
  46. 46. Paysan-Lafosse T, Blum M, Chuguransky S, Grego T, Pinto BL, Salazar GA, et al. InterPro in 2022. Nucleic Acids Res. 2023;51(D1):D418–27. pmid:36350672
  47. 47. Taboada B, Estrada K, Ciria R, Merino E. Operon-mapper: a web server for precise operon identification in bacterial and archaeal genomes. Bioinformatics. 2018;34(23):4118–20. pmid:29931111
  48. 48. Couvin D, Bernheim A, Toffano-Nioche C, Touchon M, Michalik J, Néron B, et al. CRISPRCasFinder, an update of CRISRFinder, includes a portable version, enhanced performance and integrates search for Cas proteins. Nucleic Acids Res. 2018;46(W1):W246–51. pmid:29790974
  49. 49. Alcock BP, Huynh W, Chalil R, Smith KW, Raphenya AR, Wlodarski MA, et al. CARD 2023: expanded curation, support for machine learning, and resistome prediction at the Comprehensive Antibiotic Resistance Database. Nucleic Acids Res. 2023;51:D690–D699.
  50. 50. Tempel S. Using and understanding RepeatMasker. Methods Mol Biol. 2012;859:29–51. pmid:22367864
  51. 51. Flynn JM, Hubley R, Goubert C, Rosen J, Clark AG, Feschotte C, et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proc Natl Acad Sci U S A. 2020;117(17):9451–7. pmid:32300014
  52. 52. Rice P, Longden I, Bleasby A. EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000;16(6):276–7. pmid:10827456
  53. 53. Brown CL, Mullet J, Hindi F, Stoll JE, Gupta S, Choi M, et al. mobileOG-db: a Manually Curated Database of Protein Families Mediating the Life Cycle of Bacterial Mobile Genetic Elements. Appl Environ Microbiol. 2022;88(18):e0099122. pmid:36036594
  54. 54. Vernikos GS, Parkhill J. Interpolated variable order motifs for identification of horizontally acquired DNA: revisiting the Salmonella pathogenicity islands. Bioinformatics. 2006;22(18):2196–203.
  55. 55. Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, Disz T, et al. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res. 2014;42(Database issue):D206-14. pmid:24293654
  56. 56. Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, et al. The RAST Server: rapid annotations using subsystems technology. BMC Genomics. 2008;9:75. pmid:18261238
  57. 57. Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 2007;35(Web Server issue):W182-5. pmid:17526522
  58. 58. Wood DE, Salzberg SL. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol. 2014;15(3):R46. pmid:24580807
  59. 59. Grant JR, Enns E, Marinier E, Mandal A, Herman EK, Chen C, et al. Proksee: in-depth characterization and visualization of bacterial genomes. Nucleic Acids Research. 2023;51(W1):W484–92.
  60. 60. Schwibbert K, Marin-Sanguino A, Bagyan I, Heidrich G, Lentzen G, Seitz H, et al. A blueprint of ectoine metabolism from the genome of the industrial producer Halomonas elongata DSM 2581 T. Environ Microbiol. 2011;13(8):1973–94. pmid:20849449
  61. 61. Radjasa OK, Steven R, Humaira Z, Dwivany FM, Nugrahapraja H, Trinugroho JP, et al. Biosynthetic gene cluster profiling from North Java Sea Virgibacillus salarius reveals hidden potential metabolites. Sci Rep. 2023;13(1):19273. pmid:37935710
  62. 62. Hobmeier K, Cantone M, Nguyen QA, Pflüger-Grau K, Kremling A, Kunte HJ, et al. Adaptation to Varying Salinity in Halomonas elongata: Much More Than Ectoine Accumulation. Front Microbiol. 2022;13:846677. pmid:35432243
  63. 63. Wambui J, Cernela N, Stevens MJA, Stephan R. Whole Genome Sequence-Based Identification of Clostridium estertheticum Complex Strains Supports the Need for Taxonomic Reclassification Within the Species Clostridium estertheticum. Front Microbiol. 2021;12:727022. pmid:34589074
  64. 64. Liang C-Y, Yang C-H, Lai C-H, Huang Y-H, Lin J-N. Comparative Genomics of 86 Whole-Genome Sequences in the Six Species of the Elizabethkingia Genus Reveals Intraspecific and Interspecific Divergence. Sci Rep. 2019;9(1):19167. pmid:31844108
  65. 65. Jeong J, Yun K, Mun S, Chung W-H, Choi S-Y, Nam Y, et al. The effect of taxonomic classification by full-length 16S rRNA sequencing with a synthetic long-read technology. Sci Rep. 2021;11(1):1727. pmid:33462291
  66. 66. Nanca CL, Neri KD, Ngo ACR, Bennett RM, Dedeles GR. Degradation of Polycyclic Aromatic Hydrocarbons by Moderately Halophilic Bacteria from Luzon Salt Beds. J Health Pollut. 2018;8(19):180915. pmid:30524874
  67. 67. Fathepure BZ. Recent studies in microbial degradation of petroleum hydrocarbons in hypersaline environments. Front Microbiol. 2014;5:173. pmid:24795705
  68. 68. Wright MH, Bentley SR, Greene AC. Draft Genome Sequence of Halomonas sp. Strain ML-15, a Haloalkaliphilic, Polycyclic Aromatic Hydrocarbon-Degrading Bacterium. Microbiol Resour Announc. 2020;9(47):e01175-20. pmid:33214310
  69. 69. Mnif S, Chamkha M, Sayadi S. Isolation and characterization ofHalomonassp. strain C2SS100, a hydrocarbon-degrading bacterium under hypersaline conditions. J Appl Microbiol. 2009;107(3):785–94.
  70. 70. Lafi FF, Ramirez-Prado JS, Alam I, Bajic VB, Hirt H, Saad MM. Draft Genome Sequence of Halomonas elongata Strain K4, an Endophytic Growth-Promoting Bacterium Enhancing Salinity Tolerance In Planta. Genome Announc. 2016;4(6):e01214-16. pmid:27811099
  71. 71. Gasperotti AF, Revuelta MV, Studdert CA, Herrera Seitz MK. Identification of two different chemosensory pathways in representatives of the genus Halomonas. BMC Genomics. 2018;19(1):266. pmid:29669514
  72. 72. Yang H, Qian Z, Liu Y, Yu F, Huang T, Zhang B, et al. Comparative genomics reveals evidence of polycyclic aromatic hydrocarbon degradation in the moderately halophilic genus Pontibacillus. J Hazard Mater. 2024;462:132724. pmid:37839372
  73. 73. Tomczak A, Mortensen JM, Winnenburg R, Liu C, Alessi DT, Swamy V, et al. Interpretation of biological experiments changes with evolution of the Gene Ontology and its annotations. Sci Rep. 2018;8(1):5115. pmid:29572502
  74. 74. Vera A, Wilson FP, Cupples AM. Predicted functional genes for the biodegradation of xenobiotics in groundwater and sediment at two contaminated naval sites. Appl Microbiol Biotechnol. 2022;106(2):835–53. pmid:35015144
  75. 75. Takeuchi N, Wolf YI, Makarova KS, Koonin EV. Nature and Intensity of Selection Pressure on CRISPR-Associated Genes. J Bacteriol. 2012;194(5):1216–25.
  76. 76. Han Q, Yang M-L, Liu Z-S, Zhao Y-H, Liu X-H, Ai G-M, et al. Simultaneous high molecular weight PAHs degradation and chromate and arsenite detoxification by Altererythrobacter sp. H2. J Hazard Mater. 2025;492:138314. pmid:40250277
  77. 77. Hobmeier K, Oppermann M, Stasinski N, Kremling A, Pflüger-Grau K, Kunte HJ, et al. Metabolic engineering of Halomonas elongata: Ectoine secretion is increased by demand and supply driven approaches. Front Microbiol. 2022;13:968983. pmid:36090101
  78. 78. Zytnick AM, Gutenthaler-Tietze SM, Aron AT, Reitz ZL, Phi MT, Good NM, et al. Identification and characterization of a small-molecule metallophore involved in lanthanide metabolism. Proc Natl Acad Sci U S A. 2024;121(32):e2322096121. pmid:39078674