Sub-daily virus sampling at the Bermuda Atlantic Time Series reveals diel and depth-structured population dynamics without community-level shifts

Alfonso Carrillo; Emily Hageman; Lauren Chittick; Anna I. Mackey; Kimberley S. Ndlovu; Funing Tian; Naomi E. Gilbert; Daniel Muratore; Dean Vik; Gary R. LeCleir; Christine Sun; Ho B. Jang; Ricardo R. Pavan; Joshua S. Weitz; Steven W. Wilhelm; Matthew B. Sullivan

doi:10.1371/journal.pbio.3003474

Abstract

Ocean microbes contribute to biogeochemical cycles and ecosystem function, but they do so under top-down pressure imposed by viruses. While viruses are increasingly understood spatially and beginning to be incorporated into predictive modeling, high-frequency ocean virus dynamics remain understudied due to methodological challenges. Here we sampled stratified Bermuda Atlantic Time Series (BATS) waters for 112 hours at sub-daily 4- (surface) or 12- (deep chlorophyll maximum) hour intervals, purified viral particles from these samples, sequenced their metagenomes, and used the resulting data to characterize high-frequency virus community dynamics. Aggregated community diversity metrics changed with depth, but were not statistically significant temporally at a fixed location. However, finer-scale population-level analyses revealed both depth and temporal change, including physicochemical depth-driven differences and, in surface waters, thousands of viral populations that exhibited statistically significant diel rhythms. Statistical analyses revealed three main archetypes of temporal dynamics that themselves differed in abundance patterns, host predictions, viral taxonomy, and gene functions. Among these, highlights include viruses resembling an archetype with a night peaking pattern in activity that include an over-representation of viruses that putatively infect Prochlorococcus, a phototrophic cyanobacteria. Together, these efforts provide baseline community- and population-scale short-time-frame observations relevant to future climate state modeling.

Citation: Carrillo A, Hageman E, Chittick L, Mackey AI, Ndlovu KS, Tian F, et al. (2026) Sub-daily virus sampling at the Bermuda Atlantic Time Series reveals diel and depth-structured population dynamics without community-level shifts. PLoS Biol 24(3): e3003474. https://doi.org/10.1371/journal.pbio.3003474

Academic Editor: Jeremy J. Barr, Monash University, AUSTRALIA

Received: October 3, 2025; Accepted: February 4, 2026; Published: March 6, 2026

This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.

Data Availability: Raw viromic sequencing data is available through JGI under project ID #505733 https://genome.jgi.doe.gov/portal/Infvirtimeseries/Infvirtimeseries.info.html. Scripts for data analysis, relevant input files, and large data tables can be found on a public repository on Zenodo https://doi.org/10.5281/zenodo.18425561 and our GitHub repository https://github.com/carrillo1998/BATS. All other relevant data are within the paper and its Supporting information Files.

Funding: Funding was provided by the National Institutes of Health (https://www.nih.gov/) Grant GM141955 to A.C. This material is based upon work supported by the National Science Foundation (https://www.nsf.gov/) under Grant #2149505, OCE-1829640, OCE-1829641 to M.B.S. and OCE-1829636 to J.S.W. Along with support from the Department of Energy (https://www.energy.gov/) award DOE#DE-SC0023307 to M.B.S. The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Abbreviations: AMGs, auxiliary metabolic genes; BATS, Bermuda Atlantic Time Series; DCM, deep-chlorophyll maximum; MAGs, metagenome-assembled genomes; SOM, self-organizing map; SPOT, San Pedro Ocean Time; SUR, surface; vOTUs, virus operational taxonomic units

Introduction

The oceans play key roles in global biogeochemical cycles, including buffering against human-accelerated climate change [1]. However, this buffering is self-limiting—ocean biogeochemistry is directly impacted by climate change as warming surface waters increase seasonal stratification [2] and see reduced nutrient concentrations. These impacts are particularly important over the vast oligotrophic ocean regions that cover approximately 30%–50% of the Earth’s ocean surface [3,4]. Such altered surface ocean functions are hypothesized to impact the ocean’s ability to absorb atmospheric carbon dioxide and sink surface-produced carbon to the deep sea via the biological carbon pump. While one-third of anthropogenic carbon dioxide released into the atmosphere is absorbed into surface waters via mass action [1], its fate is dictated by plankton—including bacteria, archaea, and microbial eukaryotes that serve as the base of the food web and drive the biological carbon pump.

Viral roles in these ocean biological carbon pump processes are recently being revisited. For decades, the viruses that infect these cells were thought to keep carbon in the dissolved phase by lysis resulting in remineralization—a process known as the viral shunt [5]. However, viral lysis may also generate sticky aggregates that sink out of the photic zone. This “viral shuttle” has been hypothesized as a mechanism that reduces the retentiveness of the microbial loop [6–10]. Recent work leveraging machine learning, statistical modeling, and global ocean datasets from the Tara Oceans expeditions provides evidence in support of this hypothesis given that globally ocean carbon flux is best predicted by viral abundances—even more so than abundances of bacteria, archaea, or eukaryotes [11]. In parallel, viral infection of microbial hosts has shown viruses to reprogram infected cells (termed “virocells,” [12]) into entities that are metabolically and biogeochemically different from their uninfected sister cells [13–16]. This reprogramming can also apply when cells mutate to successfully defend themselves against virus attack. Towards this, recent experimental work shows spontaneous virus-resistant marine bacterial mutants alter their carbon substrate utilization, metabolite secretion, and aggregation and sinking rates in ways that will drastically alter these cells’ ecosystem inputs and outputs [17]. Thus, viruses can alter a cell’s ecosystem outputs well beyond simple lysis.

Early work on Kill-the-Winner models showed how viruses infecting a single microbial host could enable fluctuating host diversity through negative frequency-dependent selection [18–20]. These models have been extended to include increased complexity - spanning interaction networks, community dynamics, and feedbacks with ecosystem functioning [21–29]. However, a major obstacle to extending these models to the Earth System scale is the lack of observational data to guide model incorporation, with sampling challenges resulting in relatively little being known about marine microbial and viral temporal dynamics. Some time series sampling efforts have assessed, at monthly resolution, abundance changes in bacteria and archaea (using PCR-amplicons and/or metagenomic sequencing), and picoeukaryotes (via flow cytometry)—and this has been done for years (e.g., 7 year Bay of Banyuls [30]) or even decades (e.g., >30 years Hawaii Ocean Time (HOT) Series [31]). However, these studies largely excluded viruses, except as bycatch in early low-resolution metagenomic surveys [32,33]. Other time series similarly surveyed bacteria, archaea, and picoeukaryotes over long time scales monthly, while also including viruses (e.g., >15 years at San Pedro Ocean Time (SPOT) Series [34–36]). Although they are based on amplicon-based approaches [37], these studies nonetheless provide strong evidence of seasonality whereby bacteria, archaea, and picoeukaryote abundances cycle yearly and then return to (nearly) the same state with only slight year-to-year baseline shifts. This is additionally supported by more recent, monthly viral metagenomics analyses such as those from the SPOT Series that show seasonal dynamics in viral communities along with multi-year stability across five years. Additionally, this metagenomic lens revealed continual within-population genetic turnover consistent with Red Queen-like strain dynamics [38] thought to emerge from Kill-the-Winner ecological interactions [18–20].

At a much finer temporal resolution, some studies have explored diurnal rhythms in ocean systems for microbes and their viruses. Sub-daily timescale metatranscriptome-derived microbial and viral transcript measurements have helped identify strong diel rhythms of gene expression along with temporal relationships between gene expression and metabolite concentration [39–43]. Observations of transcript activity between viruses and their hosts have shown that viruses can often coordinate their activity with host diel cycles, responding to environmental cues in synchrony with their hosts [41–42]. Other work has taken more targeted approaches by focusing on one group, cyanobacterial viruses (or cyanophages), and revealed transcriptional rhythms and adsorption rates in the field that laboratory studies then linked to diurnal photosynthetic activity [44]. Along with this, amplicon-based marker gene fragment patterns (e.g., terminal restriction fragment length polymorphism) have similarly been used to target T4-like viruses to reveal daily dynamics over the course of 38 consecutive days that correlate virus operational taxonomic units (vOTUs) with bacterial OTUs to document short-term variation in both microbial and viral communities [35]. Together, these studies demonstrate clear sub-daily dynamics, but are limited in that they again either catch viruses as a byproduct of microbial sampling (e.g., prokaryotic metatranscriptomics) or target specific virus groups (e.g., cyanophages or T4-like viruses) rather than community-wide signals often doing so with highly degenerate primer strategies that might confound quantitative data generation.

In the decade since these studies, viral metagenomics [37,45] and our understanding of virus population biology [46–48] have matured to the point that there is opportunity to study viruses at increasingly high taxonomic and temporal resolution, even over a short timescale. The Bermuda Atlantic Time Series (BATS)—a sentinel model ecosystem for “future oceans” due to its increased stratification and acidification [49]—has been surveyed via DNA staining and epifluorescence microscopy to quantify virus and microbe abundances and found seasonality over a decade of sampling from 2000 to 2009 [50]. While BATS samples were targeted by these extensive SYBR-stained virus abundance data and early sequencing technology was applied to a single virus sample [51], no genome-resolved virus sequencing data were available at BATS. This prevents ecological inferences like the kinds of viruses, their hosts, their functions, and their metabolic reprogramming capacities. Towards this, recent work employed time-resolved metagenomics at BATS and then compared viral population-based abundances inferred from those captured in paired cellular and viral fraction metagenomes. This revealed that viral fraction population data represents the integral of cellular fraction infections and multiple days of viral turnover [52], but left open questions about viral community and population-level changes through time.

Here, we build upon these prior efforts by establishing a 112-hour time series—sampled every 4 hours in surface waters and, every 12 hours at the deep chlorophyll maximum (DCM)—of virus metagenomic data at BATS during late seasonal stratification as a proxy for future climate-change-impacted oceans. This resulted in detailed, sub-daily insight into community- and population-level viral dynamics for 48,428 viral populations at BATS.

Results

Assembly of a high quality and detailed viral reference database at BATS

To estimate sub-daily virus population-level dynamics, we established a high-resolution dataset from surface (SUR) and deep-chlorophyll maximum (DCM) waters sampled at BATS between October 12−17, 2019 (Fig 1A–1C). A total of 39 virus concentrates were prepared via chemical flocculation [53] (see Methods) of samples collected every 4 (SUR waters) or 12 (DCM waters) hours over a 112-hour time course (Fig 1C). To follow the same parcel of water through the time-course, sampling was done in a Lagrangian manner using a surface buoy with an underwater drogue (at 30 m depth) to ‘track’ the water for the length of the study. Sampling started at 16:00 local time (GMT-3) and depths were determined with each collection from water column profiles of temperature and salinity collected by the ship’s CTD system as described previously [54].

Download:

Fig 1. Field site location, depth profiles, and sampling schematic.

A. The Bermuda Atlantic Time Series is located in the Sargasso Sea portion of the Atlantic Ocean. Lagrangian sampling track for data collection is shown with each black point representing a CTD-cast starting on Oct. 12th, 2019 at 16:00 local time (GMT-3) and ending on Oct. 17th, 2019, at 08:00. The data underlying this Figure can be found in S1 Table. B. Depth profile of chlorophyll (green) and temperature (blue) down to 1,000 meters depth, and ship schematic to emphasize key depths in the water column including ship (0 m), surface (0–50 m), and DCM (105–120 m). The data underlying this Figure can be found in S1 Data. C. Sampling schematic to show the depth (surface or SUR vs. deep chlorophyll maximum or DCM), frequency (every 4 and 12 hours for surface and DCM, respectively), size fraction (virus red boxes or cellular blue boxes), and time of day (yellow or gray background shading) of each sample taken.

https://doi.org/10.1371/journal.pbio.3003474.g001

The resultant virus-fraction concentrates were resuspended in a buffer, DNA extracted, sequencing libraries that targeted double-stranded DNA viruses only were prepared, and then short-read sequenced (see Methods) to an average depth of 147M reads per sample. The resultant 39 virus metagenomes were assembled (see Methods) into 1.48B contigs (≥10 kb) and 228,013 of these were conservatively identified as viral (using the VirSorter 2 Standard Operating Procedure VS2_SOP [55,56]). These virus contigs were then clustered into 48,428 viral populations using community consensus cut-offs of 95% average nucleotide identity over 80% of the contig coverage [47,48,57] with an average genomic length captured of 10.28 kbp. These viral populations were also compared with GeNomad [58] which revealed that 94.92% of the viral populations were shared between both methods. With 99.29% of the VirSorter2 viral populations being also found in GeNomad. Given these results, we find that GeNomad is identifying the same viruses as VirSorter2 as well as additional ones, as there are no clear guidelines on how to efficiently add the results from different viral identification tools [59] we will use VirSorter2’s output alone for all downstream analyses. While other ocean datasets reflect more dispersed sampling schemes, such as the deeply sequenced GOV2 dataset that spread its sequencing across 79 sites [47], here we provide a more focused sampling of the BATS site to maximally identify virus populations at this site. Indeed, this provides us with many more captured viruses at a single site (compare 48,428 with an average of 2,083 viral populations per site in GOV2), at a similar genomic coverage per population (compared 10.28 kb versus GOV2’s average genomic length of 10.17 kbp [47]). Taxonomically, these populations are thought to represent approximately species-level taxa as prior studies inferred from (i) gene flow, selection, and genomic fixation indices [46], (ii) host range-based phenotypic variation in heterotrophic viruses [60], and (iii) natural breaks in sequence similarity for nearly 500K virus populations in GOV2 [47].

Community level diversity significantly varies with depth, but not through 112-hour time-course

We first sought to assess how community-level diversity changed with depth and across our high-frequency time course (see Methods). For depth-resolved comparisons, we hypothesized that viral diversity would differ significantly between surface (SUR—sampled from 5 m) and DCM samples, due to the inherent differences in physicochemical features such as temperature, and light penetration that also impact biology (e.g., chlorophyll as a proxy for phototrophic biomass) ([61–63]; Figs 1B, S2, and S3). Indeed, we found that virus community alpha diversity (Inverse Simpson’s Index) was significantly higher in SUR versus DCM waters (Wilcoxon Rank Sum P = 0.000028) (Fig 2A), and this remained stable for nearly all time points throughout the 112-hour time course (Fig 2A). Along with this, we found viral community beta diversity (Bray-Curtis) to be significantly different between SUR and DCM samples (S3 Fig). These findings complement other daily observations at BATS where extracellular viral abundances had clear depth-specific differences inside and outside the surface mixed layer [52]. Additionally, we evaluated Inverse Simpson’s diversity changes within SUR versus DCM depths of water columns sampled throughout the global oceans alongside our own. This revealed that BATS had higher alpha diversity in SUR as opposed to the DCM, contrary to other previously sampled regions ([47], S4 Fig).

Download:

Fig 2. Community-level statistics across water depths and times of sampling.

A. Inverse Simpson’s Index box plot showing that virus community diversities were statistically significantly different (Wilcoxon Rank Sum P = 0.000028) between SUR (blue points) vs. DCM (pink points). Line graph below depicting alpha diversity (Inverse Simpson’s) changes across time points for SUR (blue points) and DCM (pink points) samples. The data underlying this Figure can be found on our zenodo repository (see Data availability) within file all-meancov.txt. B. Inverse Simpson’s Index box plot of day (yellow points) vs. night (black points) for SUR samples showing that virus community diversities were not significantly different (Wilcoxon Rank Sum P = 0.62, denoted as NS in the figure). The data underlying this Figure is the same as in Fig 2A. C. Principal coordinate analysis of Bray-Curtis dissimilarity distances shows that day vs. night virus community dissimilarities were not significantly different (Multi-Response Permutation Procedure [MRPP], p = 0.068). Shaded polygons outline day and night groups. The data underlying this Figure is the same as in Fig 2A.

https://doi.org/10.1371/journal.pbio.3003474.g002

We also assessed the impact of the diel cycle on diversity proxies in the surface samples only. Specifically, we evaluated how alpha- and beta-diversity metrics varied between nighttime (20:00) and daytime (08:00). Although some microorganisms possess unique circadian rhythms and dependencies on the light-dark cycle [64], community-wide virus diversity metrics were not significantly different whether inferred at the level of alpha diversity (Inverse Simpson’s diversity metric; Wilcoxon Rank Sum P = 0.62; Fig 2B) or beta diversity (PCoA ordination of Bray-Curtis dissimilarity measures; MRPP P = 0.068; Fig 2C). Thus even though day/night light intensity and microbial taxa change during this 112-hour sampling [54], our findings show that aggregate viral community diversity measures do not change. Our 2019 BATS viral sampling expands upon prior 2017 BATS viromic sampling [52] by increasing resolution (4- versus 12-hour sub-daily time scale) and adding 20× fold more viruses at a single site (48K versus 2.3K). We reasoned then that these significant dataset improvements provided a better opportunity to assess high-resolution virus community dynamics at BATS.

Viral populations and potential metabolic gene repertoire vary across depth

Given that the viral depth-related differences shown above were explored at the community-level, we continued by exploring depth-related changes at the virus population level. First, we predicted hosts for our 48,428 virus populations to assess their patterns across the two depths sampled. Notably, we used aggregate in silico prediction strategies (see Methods) and maximized our ability to predict hosts (prior benchmarking suggests host prediction improvement of 25% for most systems [65] by augmenting the standard database available with 89 high-quality metagenome-assembled genomes (MAGs) from prior BATS work [52]. This resulted in predicted hosts for 11,814 (24.39%) of our 48,428 total populations, a large improvement on the 6 (0.26%) of 2,301 total viral populations whose hosts could be predicted in that prior study [52].

SUR versus DCM depth comparisons revealed some populations were shared across depths, whereas others exclusive. As expected, hosts whose virus populations were better sampled displayed abundance changes across the depths. For example, viruses that infect heterotrophic bacteria such as Pelagibacter (SAR11 clade I), Pelagibacter_A (SAR11 clade II), and AG-337-I02 (SAR11 clade V) were among the most represented host predictions in SUR, with AG-337-I02 and Pelagibacter ~4-fold or higher in abundance at the surface compared to DCM (Fig 3A and 3B). This SUR enrichment signal of SAR11 clade infecting viruses parallels SAR11 bacteria being one of the most abundant heterotrophic surface bacteria found in the Sargasso Sea, dominated by SAR11 clades I and II in BATS SUR waters [66,67]. As well, a higher proportion of viruses found in DCM samples were predicted to infect phototrophic Prochlorococcus, with Prochlorococcus_B the dominant Prochlorococcus ecotype and having greater than 6-fold abundance in DCM compared to SUR (Fig 3A). This signal is consistent with Prochlorococcus_B being a low-light ecotype that prefers lower depths, like the DCM, at BATS [50,68].

Download:

Fig 3. Viral population hosts across depths.

A. Alluvial plot of predicted hosts (genus-level ranks) for viral populations present at each depth. Height of row is based on abundance of genera and genera that make up less than or equal to 1% of the total abundances are grouped as “Others”. Alluvial is ordered from most abundant at top to least abundant at bottom. The pie chart on top of each column indicates the percent of viral populations with host predictions in each group with the total relative abundance of the category (SUR/DCM) below. The data underlying this Figure can be found in our zenodo repository (see Data availability) within file all-meancov.txt and combined_host_prediction_to_genome_m90.csv. B. Bar graph showing the Log₂ fold change of host predictions across depths. Log₂ fold change is based on abundances of genera that make up greater than 1% of the total abundances in the dataset. The data underlying this Figure is the same as that of Fig 3A.

https://doi.org/10.1371/journal.pbio.3003474.g003

Looking further, we wondered whether viruses across these depths contained any metabolic genes of interest (e.g., auxiliary metabolic genes, or AMGs), and how these would fit into measured microbial metabolisms within the Sargasso Sea. [12–15,69]. We sought to uncover any AMGs in viruses unique to either depth environment. To ensure that genes were not misattributed as AMGs, we adopted a highly conservative approach for curating our AMG catalog, as used previously (see Methods; [70], which resulted in only ~0.05% (349) of 684K annotated genes passing our conservative filtering. Beyond this conservative filtering, we did not conduct any manual curation or further validation.

Many of the putative AMGs belonged to pathways enriched in one depth over the other (Fig 4). Among these, cobalamin biosynthesis was SUR-enriched and of particular interest given that Prochlorococcus and Synechococcus require cobalt for microbial growth [71,72] and comparative Prochlorococcus genomics has documented their genetic machinery for the synthesis and use of cobalt-bearing cofactors (cobalamins) [71]. Considering this, we hypothesized it was possible that abundant phototrophic viruses in the viral community harbor AMGs that assist in cobalamin biosynthesis. Assessing our data, however, we found a single gene in the cobalamin biosynthesis pathway (K09882, cobS) in a viral population exclusively within SUR (S5 Table). Host prediction suggested that this population infects SAR86A. The SAR86A prediction is curious given that this lineage is not known to possess any genes involved in cobalamin biosynthesis.

Download:

Fig 4. Metabolic pathways found in viruses across depth gradients.

KEGG map was made on iPath (v3) under map selection “Metabolic Pathways” using data from S5 Table and shows pathways for AMGs present in viruses across depth gradients. Color of line is based on what depth the virus containing the AMG was found in, with gray lines indicating lack of AMGs relevant in that pathway for either group. This KEGG map is a qualitative analysis depicting which pathways identified AMGs belong to, with lines indicating pathways with relevant AMGs present. Thickness of the line is the same throughout figure to highlight presence of pathways.

https://doi.org/10.1371/journal.pbio.3003474.g004

Taking instead, a host-focused perspective on AMG depth-related patterns, we observed the following. First, we explored virus populations predicted to infect Pelagibacter (formerly known as SAR11), i.e., pelagiphages [73], which revealed 3 AMGs among them including phoH, K06217 (1 viral population) and 2OG-Fe(II) oxygenase, K07336 (2 viral populations). Of these, phoH is known to be present in both cyanophages [74] along with heterotrophic phages including pelagiphage [75]. Phylogenetic analysis of the phoH gene in phages has been shown to vary greatly based on the phage host while the role of phoH remains unclear as expression of the gene has been shown to vary within different phage-host pairs [76]. The 2OG-Fe(II) oxygenase AMG, however, is a much more well-known gene in pelagiphages [73]. Curiously, however, these pelagiphage AMGs were depth-specific with the viral populations possessing said genes only being present exclusively in one depth or the other. This can be seen with one of the viral populations predicted to target Pelagibacter_A that possessed a 2OG-Fe(II) oxygenase being found only in DCM, while all other pelagiphages with AMGs were found exclusively within SUR. Second, we explored AMG depth-related signals from Prochlorococus, which revealed a photosystem II gene (K02703, psbA) found in 7 viral populations exclusive to the DCM, where two populations were predicted to infect Prochlorococcus_B and the remaining five were predicted to infect TMED108, TMED70, MGIIa-K1, and 2 for unknown hosts. Within these host predictions both TMED108 and TMED70 are heterotrophic hosts while MGIIa-K1 is a member of marine group II Euryarchaea [40]. TMED108 is a member of the Marinisomatota phylum (formerly recognized as Marinimicrobia, Marine Group A, and SAR406) which has traditionally been characterized as heterotrophic microorganisms, although recently has been seen to possess members that demonstrate the capacity to harness light-dependent processes associated with Crassulacean acid metabolism [77] TMED108 and MGIIa-K1 are members of widely distributed and ecologically important clades of marine microbes [40,77–79], however, none of these genera have previously been shown to hold the photosystem II gene, psbA. Although both depths had virus populations with Prochlorococcus_B predicted as the host, this genus was a common (second most) predicted host in the DCM, 6× more abundant in DCM compared to SUR—all of which underscored its known low-light ecotype adaptations that would do well in DCM.

Finally, though a weaker signal, we highlight the finding that a SUR virus population contained a gene putatively involved in Coenzyme M biosynthesis (K05979, comB, 1 viral population), presumably relevant to methanogenic archaea. The coenzyme M biosynthesis signal might represent an understudied link to methanotrophic archaeal carbon metabolism, specifically anaerobic oxidation of methane to carbon dioxide [80,81]. Such methane cycling has previously been shown in oligotrophic North Atlantic Ocean gyres to be important with elevated methane from archaea as well as some bacteria that produce it as a byproduct [82]. Although the virus possessing this AMG had no host predicted, it is known that archaeal viruses outside extremophile conditions are incredibly challenging to predict [83]. We posit that the presence of a coenzyme M biosynthesis gene in a virus contig and its potential relevance to methane and carbon cycling at BATS, invites future targeted work once pelagic ocean archaeal viruses are more readily identifiable and/or increased genomic context becomes available for this gene.

Diel periodicity analyses reveal population-level viral population dynamics

Though virus community-level alpha and beta-diversity metrics revealed no significant sub-daily temporal dynamics, prior ocean viral community monthly sampling had revealed population-level variation underpinning such community-level stability [38]. Thus, we wondered whether specific populations were in fact dynamic, even at this sub-daily time frame, and if so, whether they had consistent diel cycles that might be detectable given the extensive sequencing available across this high-resolution time-course. To test this, we applied prior methods [84] to statistically detect diel patterns (via RAIN [85] and significance testing; see Methods) among our virus populations. This revealed 3,097 viral populations that were significantly diel rhythmic in SUR (10.68% of SUR viral populations). The lack of detected diel cycling viruses in DCM may be due to reduced sampling which may have limited statistical power to detect diel rhythmicity.

Focusing on SUR viral populations, we then compared diel versus non-diel SUR viral populations to look for differences at levels of host predictions, viral taxonomy, and gene function. For hosts, the abundances of viral populations for a given predicted host were summed and then assessed across the time course for diel versus non-diel viral populations via RAIN and significance testing as described above (Fig 5). This revealed diel versus non-diel viral abundances for many predicted hosts with Pelagibacter being the most abundant targeted taxa across the two categories. However, slight differences were observed for the proportion of viruses targeting taxa like Hyphobacterium in the diel group or those targeting Joostella in the non-diel group. Such observations are difficult to interpret given that not much is known for either genus regarding their diel behavior in oceans. Similarly when looking at family-level viral taxonomy and their abundances, we observed very little change between diel and non-diel viruses, with slight differences in the enriched proportion of viruses belonging to Caudoviricetes (new family 33 and order 140) for diel and Caudoviricetes (new family 21 and order 134) in non-diel (S5 Fig).

Download:

Fig 5. Predicted hosts for SUR viral populations with diel or non-diel rhythmic periodicities.

Alluvial plot of predicted hosts (genus-level ranks) for diel and non-diel viral populations present. Height, order, and pie chart are as described in Fig 3. The data underlying this figure can be found in our zenodo repository (see Data availability) in file all-meancov.txt and combined_host_prediction_to_genome_m90.csv.

https://doi.org/10.1371/journal.pbio.3003474.g005

Seeing largely similar host predictions and viral taxonomy for the diel and non-diel viral populations, we assessed their genes for functional gene content signatures. We hypothesized that viruses possessing diel patterns of abundance may possess AMGs that would enable these respective niches. Evaluating our conservatively filtered AMG catalog, we found 18 putative AMGs (belonging to 7 pathways) in diel viruses and 331 putative AMGs (belonging to 39 pathways) in non-diel viruses (S8 and S9 Tables). Most of the genes identified by our conservative filtering methods for diel viruses were shared with non-diel viruses, while most of the non-diel virus genes were not. Among the AMGs shared between diel and non-diel populations, many are likely involved in virus infection to produce genomes and/or particles (e.g., nucleotide metabolism, amino acid metabolism). The only one that was exclusive to the diel viral populations was Coenzyme M biosynthesis (K05979, comB, 1 viral population), whereas the non-diel exclusive AMGs included 131 genes involved in 33 pathways (S6–S9 Tables). To summarize their biology, we interpret the non-diel-unique AMGs to represent pathways that contribute to constitutive rather than diel-cycling metabolisms. These constitutive metabolisms include genes involved in carbohydrate and redox metabolism (glucoronate, galactose, glyoxylate, ascorbate), nucleotide metabolism (purine, pyrimidine, degradation), amino acid and membrane metabolism (fatty acids, phospholipids), amino acid and polyamine metabolism (methionine, arginine/polyamines, creatine), cofactor and vitamin biosynthesis (THF, BH₄, B₁₂, biotin), sulfur cycling (sulfate assimilation/reduction) and secondary metabolites (aurachin). While less clear examples are those involved in cell wall/structural sugar biosynthesis (rhamnose, KDO) as dTDP-L-rhamnose biosynthesis which is involved in biosynthesis of terpenoids and polyketides is a pathway not typically found in viruses [86] (S6–S9 Tables).

Unsupervised learning identifies day- versus night-peaking patterns in diel signals

We set out to evaluate the structure of diel rhythmic viral populations given observed variation in amplitude, shape of oscillation, and peak timing. To formally evaluate this, we built on recent efforts to identify viral archetypes in our dataset that may possess recurring temporal patterns, assign groups of viral populations to these archetypes, and assess within- and between-group differences in ecological relevance [84]. With this analytic pipeline [84], we utilized an unsupervised, self-organizing map (SOM) approach to determine whether smaller groups or ‘archetypes’ exist by evaluating the euclidean distances of their abundance across the time series (see Methods). The resultant SOM clustering approach for our 3,097 diel viral populations revealed 3 archetypes that we titled “archetype 1,” “archetype 2,” and “archetype 3” (Fig 6A). Adding to this archetype construction, we calculated the average peak time for individual viral populations as well (see Methods), revealing that most (94.09%) SUR diel viral populations peaked in abundance during the night with the most abundant time point being 04:00 hours, with much less common day peaking viral populations peaking at 12:00 hours.

Download:

Fig 6. Categorizing diel patterns into three potential archetypes using unsupervised clustering.

A. Organized pairwise distance matrix for all diel viral populations after clustering based on unsupervised, self-organizing maps (SOMs). Pixels in figure represent the Euclidean distance between the time series of two diel viral populations with purple indicating a small distance (similar time series) and orange indicating a higher distance (less similar time series). Dotted lines form boxes to separate the different cluster boundaries. The data underlying this Figure can be found on our zenodo repository (see Data availability) within file all-meancov.txt B. Line graph for time series of viral populations within each cluster. Archetype time series are a combination of all time series in their cluster as defined via the SOM algorithm (midnightblue = Archetype 1; gold = Archetype 2; purple = Archetype 3). A random sample of 145 time series pertaining to each cluster are plotted as dark lines. Data underlying this figure is the same as in Fig 6A. C. Proportion for number of viral populations by time in archetype. Columns represent archetype while color represents mean peak time for viral populations across the time series. Data underlying this Figure is the same as in Fig 6A. D. NMDS of all diel viral populations colored by calculated peak rank measurement time and given shapes by archetype. Data underlying this Figure is the same as in Fig 6A.

https://doi.org/10.1371/journal.pbio.3003474.g006

To assess the archetypes for biological differences, we first evaluated their temporal patterns to connect those possible patterns to host predictions. These temporal patterning analyses revealed that archetypes 1 and 3 possessed night-peaking viral populations and dominated the dataset (95.22% of diel viral populations), whereas archetype 2 possessed primarily day-peaking viral populations and represented only a few percent of the total populations (Fig 6B and 6C). Ordination (NMDS with euclidean distances) revealed a clear separation between viral populations that coincided with time and archetype grouping such that one sees a gradual shift from virus population archetypes from peaking at 00:00 hours to peaking at 12:00 hours (Figs 6D and S6). Because these archetypes required no a priori assumptions of sinusoidal patterns nor of preferred phase, these observations represent naturally forming groups that presumably more accurately capture diel rhythmicity across the time course (see also [84]).

Evaluating these temporally classified archetypes further, we next sought to link these to changes in host predictions, which revealed dominant host prediction abundances varying between archetypes (Fig 7). Archetype 2 consisted of viruses that predominantly targeted genera that made up ≤1% of the predictions in that archetype with only 4 different genera (Winogradskyella, Pelagibacter, Joostella, and Flavobacterium) having >1% of the host predictions. This is contrary to the other archetypes that possessed more host predictions that accounted for >1% of the viruses, perhaps because they were composed of so many more viral populations. Another notable observation was that archetype 1 held the largest proportion of phototroph targeting viruses, with viruses predicted to infect Prochlorococcus_A being the fifth most abundant host in the archetype (Fig 7). Archetype 1 mainly consisted of viruses peaking in abundance during the night with the largest proportion peaking at 20:00 hours. Our observations are further supported by bacterial data sampled during the same expedition that show Prochlorococcus bacteria having larger abundances during nighttime samples [54]. The correlated abundance of viruses with their hosts seen here with Prochlorococcus coincides with other studies that used transcripts and have similarly seen phototrophic targeting viruses being co-expressed with their associated bacteria [41,42,87]. The night peaking viral activity of SAR11 and Prochlorococcus have also previously been observed as we did at BATS [41]. Our archetype comparisons, show that viruses that target heterotrophs are the main ones showing diel patterns and that among our archetype 2 viruses that mainly peak in abundance during the day, the biggest targets were the genera Winogradskyella, Pelagibacter, Joostella, and Flavobacterium (Fig 7). As these genera are also found to be targeted in the other archetypes, it is possible that the difference here lies in a higher resolution of taxonomy for the bacteria being targeted.

Download:

Fig 7. Comparison of diel archetypes host predictions.

Alluvial plot of predicted hosts (genus-level ranks) for viral populations present within archetypes. Height, order, and pie chart are as described in Fig 3. The data underlying this figure can be found in our zenodo repository (see Data availability) in file all-meancov.txt and combined_host_prediction_to_genome_m90.csv.

https://doi.org/10.1371/journal.pbio.3003474.g007

Discussion

In this study, we leveraged a high-resolution sub-daily temporal dataset from SUR and DCM waters to investigate spatial and temporal viral dynamics at BATS, a model “future” ocean given climate predictions. With respect to depth, we observed distinct differences in viral alpha and beta diversity across the depth-gradient, which were likely shaped by physicochemical features inherent to each depth such as light, temperature, oxygen, and chlorophyll concentration [61–63]. Additionally, we expanded upon our prior work to show how host predictions for depth-structured virus populations changed at BATS [52]. Temporally, however, the picture was different—viral community-scale metrics remained stable across the 112 hour time course, whereas differences could be observed at the viral population-level. Our observations that viral community structure remained stable across the time-series while viral population abundances varied, mirrors a broader pattern seen in monthly-sampled, longer timescale studies [34–36,38]. These results reinforce the idea that viral populations are dynamic even when community level metrics appear stable, with implications for host interactions, gene flow and downstream ecological impacts. Suggesting that virus-host interactions may maintain stability but with underlying evolutionary dynamism happening at a finer resolution through continuous changes among viral populations, driven by ongoing selection pressures from hosts and the environment. Exploration of these population-level differences revealed diel rhythmicity with machine learning-enabled methods [60] resolving such populations into one of three archetypes. Biologically, these archetypes differed in hosts predicted, viral taxonomy, and gene content.

By capturing viral abundances across depth and time, we uncover a complex and temporally structured viral community that mirrors host activity and environmental variation. The variation seen at the population-scale suggests that viral–host interactions are governed by more than abundance alone, likely shaped by biogeochemistry, host genome architecture, and ecological context [33]. Though not yet accomplished anywhere, future BATS efforts would benefit from many recent innovations that could layer in additional data, measurements, and interpretive frameworks. Long-read sequencing or single virus genomics could improve the capture of niche-defining genomic islands and microdiverse populations that are often missed by short-read assembled viromes [88,89]. Viral identification could be improved with the incorporation of additional methods such as GeNomad [58] once guidelines on how to effectively combine outputs on multiple tools without increasing the risk of increased contamination or decreased precision in true-positives are completed. Host predictions could be improved in a targeted manner through viral tag and grow experiments [90] or community-wide through DNA Hi-C proximity ligation sequencing, the latter assuming appropriate controls and analyses [91]. Measurements to assess activity, via metatranscriptomes, metaproteomes, metabolomes, and/or isotopic probing [92], could distinguish integrated past virus functioning emergent from viral particle sequencing from presumably real-time virus infection conditions. With better datasets, one could also expand the biological “players” in the story, for example, by exploring the interplay between viruses and other mobile genetic elements to better inform the biological entities interrupting and/or carrying key metabolic reactions [93]. Additionally, modeling frameworks that predict viral roles in community metabolisms [94] and experiments that focus on phenotyping the ecological costs of resistance [17] could provide and test specific hypotheses about viral roles in biogeochemistry that will lead to better incorporation of viruses into predictive models. Again, such a diverse toolkit has not yet been applied anywhere, but does aspire to more holistically capture true biological integration across molecular and organismal scales that dictate ecosystem functioning.

On the whole, viruses impact marine ecosystems via lysis, gene transfer, and metabolic reprogramming during infection [95]. The BATS time series in the Sargasso Sea, with its multidecadal records of biogeochemical and physical data, provides opportunity to assess how viruses contribute to and are shaped by ocean warming, stratification, and shifting nutrient regimes [96], particularly as its gyre expands, warms, and acidifies [49]. Our work focuses on establishing baseline viral genomic datasets across two depths at BATS, where these observations provide glimpses into host-driven structuring of virus communities. Such observations show enrichment of viruses targeting Prochlorococcus_B and SAR11 clade bacteria in DCM and SUR respectively, along with genes predicted to be associated with said taxa such as psbA for Prochlorococcus_B. Along with this, we also document short-term temporal changes that manifest not in community-aggregated measures, but instead at the population-level where we revealed the diversity and potential functioning of viruses that differentially occupy day- or night-peaking niches. These short-term temporal changes not only illustrate that heterotrophic targeting viruses make the majority of our diel signal, but that 94.09% of these diel viruses peak in abundance during the night. Placing such short-term change into the context of longer-term changes already known for marine bacteria [97] will be critical to better incorporate viral ecology and roles into long-term ocean observing systems that seek to better predict future ocean functioning.

Methods

Sample collection

Samples for this project were collected under Special Permit SP190906 from the Government of Bermuda Department of Environment and Natural Resources. Sample collection took place on the RV Atlantic Explorer (cruise AE1926) and coordinates of samples can be found within S1 Table. Sea water samples were collected once every 4 hours in the surface, SUR, or 12 hours in the DCM for 112 hours resulting in 39 samples (29 SUR, 10 DCM). A surface buoy with an underwater drogue, at 30 m depth, was deployed to allow us to follow the same parcel of water for the entire study. Water samples were collected using a CTD-rosette equipped with 24 × 12-l Niskin bottles. Depths for each cast were chosen based on in situ CTD oceanographic parameters. Sampling started at 16:00 local time (GMT-3) from SUR waters at ~5 m, always in the stratified SUR mixed layer, and 105–120 m in the DCM mesopelagic layer. For each cast, the CTD was deployed to at least 500 m to collect data on physical water columns, measuring the water temperature, pH, and salinity. Measurements were binned to the nearest 0.5 m of depth to standardize across casts, then a Nadaray-Watson kernel smoothing filter with bandwidth of 5 db was applied to each variable to remove noise and spikiness. For each sample, 10 L of seawater were filtered through a 0.2 µm Millipore filter (CAT number: GPWP14250;LOG number ROBA92539) to minimize cells, and iron chloride flocculation [53] was performed to concentrate the viruses in the 0.2 µm filtrate. The virus-concentrated iron hydroxide flocs were then filter-captured using a 1 µm polycarbonate filter and stored damp on the filter at 4 °C until resuspended in ascorbate-EDTA buffer and processed to DNA.

Library preparation and sequencing

For each sample, half a filter, corresponding to ~5 L of seawater was resuspended in 7.5 mL of ascorbate-EDTA resuspension buffer to capture the concentrated viral particles. All samples were then treated with DNase diluted 1:40 in DNase reaction buffer. DNase was then inactivated by adding 0.1 M EDTA then 0.1 M EGTA. Viruses were concentrated by spinning each sample through a 15 mL 100 kDa Amicon filter and resuspended in ascorbate-EDTA buffer (0.1 M EDTA, 0.2 M Mg, 0.2 M ascorbic acid, pH 6.0) in total of less than 1 ml. DNA was then extracted using Wizard PCR Preps DNA Purification Resin and Mini-columns (Promega, Cat. #A7181 and A7211, respectively) [98]. Between 216 and 620 ng of DNA (355 ng on average between samples) was provided to the Joint Genome Institute for sequencing. Shotgun metagenomic sequencing at JGI was performed on an Illumina platform, resulting in an average of 144M reads per sample (~9.8 × 10⁷ to 2.6 × 10⁸ sequencing reads per sample).

Quality check, trimming, and assembly

Data processing and metagenomic analysis were conducted on the Ohio Supercomputer Center [99]. Reads went through a quality check using BbDuk (https://sourceforge.net/projects/bbmap/). Low-quality reads, adaptors, and Phix174 reads were removed (ktrim = r minlength = 30 k = 23 mink = 11 hdist = 1 hist2 = 1) and reads were trimmed (qtrim = rl maq = 20 maxns = 0 minlength = 30 trimq = 20). Four samples were resequenced due to low-quality reads (T4D, T52S, T88S, T28D). Fastqc identified T28D as having a low read quality, so it was removed from the study. Resequenced samples had an average sequencing depth of about 381M reads per sample, due to this, resequenced samples had their cleaned reads subsampled using bbmap 38.96 (https://sourceforge.net/projects/bbmap/) to an even depth of 62,381,078 in the SUR and 68,082,965 in the DCM which is the median depth value for clean reads in the rest of the samples in these regions. Samples were assembled using MegaHit [100] with default options.

Viral identification

Contigs from MegaHit were processed using Virsorter 2 SOP version 2.2.3 [55]. Briefly, all samples were first processed via Virsorter2 (–keep-original-seq –include-groups sdDNAphage, ssDNA –min-length 5,000 min-score 0.5). Then CheckV v0.8.1 [101] was used along with Virsorter2 outputs to curate viruses following the parameters in the Virsorter 2 SOP [55]. After curation, the pipeline identified 228,013 viruses which were then clustered with CheckV clustering v0.8.1 [101]. 48,428 viral populations greater than 5 kb and 14,634 greater than 10 kb were clustered at 95% average nucleotide identity and 80% coverage. A viral population is defined here as a group of viruses of the same species [47].

Additionally, all contigs from MegaHit were processed using GeNomad [58] end-to-end for comparison with VirSorter2 results.

Read mapping

CoverM v0.6.1 was used to map quality-trimmed reads to viral populations (https://github.com/wwood/CoverM; –output-format dense, min-read-percent-identity 0.95, min-read-alignment-percent 0.75, min-covered-fraction 0.75, -m trimmed_mean). Reads were retained if they had ≥95% identity, ≥75% alignment, ≥75% coverage, and the trimmed mean was used to remove the top 5% and bottom 5% depths [70]. CoverM generates BAM files which are then summarized into coverage values that are then normalized by dividing them with read depth from the quality-trimmed reads per sample and multiplying by 1,000,000,000.

Diversity calculations and statistical analyses

Diversity estimates were based on the normalized relative abundance tables generated via read recruitment. The alpha (Inverse Simpson) and beta (Bray-Curtis dissimilarity) diversity statistics were calculated using vegan on R (v 2.6.4) [102]. Read were log2 transformed before calculating Bray-Curtis (function vegdist, method = ‘bray’). Affinity propagation using APCluster on R was used to cluster samples into groups with similar viral relative abundance and community structure [103]. Communities were then divided into groups based on similarity for analysis and on depth and time of day for separate alpha/beta diversity analysis. Principal coordinate analysis (function cmdscale) and nonmetric multidimensional scaling (function metaMDS, permutations = 999) were ordination methods used on the Bray-Curtis dissimilarity matrices [104–106]. The statistical significance between viral communities was validated by comparing the within-community and between-community distances with MRPP and ANOSIM [107,108].

Temporal analysis with the detection of rhythmic behavior in time series

For all datasets from BATS, diel periodicity was determined using the rank-based Jonckheere-Terpstra umbrella test implemented in the R RAIN package [85]. Normalized read abundance tables of the viral populations were observed for oscillating behavior across the 112-hour time series. Samples were first detrended (linear regression with respect to time was subtracted from time series) to increase power of rhythmicity detection using the detrend function in the R pracma package v.2.4.4 [85]. RAIN was run with the options method = ‘deltat = 4, period = 24’ for the surface samples to establish four-hour sampling periods and a 24-hours time period to observe for oscillation. Viral populations within the SUR samples had the options method = ‘measure.sequence = c(1,1,1,1,1,1,1,1,1,1,1,1,1,0,1,1,1,1,1,1,1,1,0,1,1,1,1,1,1)’ to account for sample “T52_S” and “T88_S,” that were excluded from analysis. Samples from the DCM were run with the options method = ‘deltat = 12, period = 24′ to establish 12-hour sampling periods and a 24-hour time period to observe for oscillation. Viral populations within the DCM samples had the options method = ‘measure.sequence = c(0,1,0,1,1,1,1,1,1,1)’ to account for samples “T4_D” and “T28_D” that were excluded from analysis. After RAIN implementation, the Benjamini–Hochberg false discovery rate control procedure was implemented to assess significance at the P = 0.05 level for each data type.

Calculation of peak rank time

To estimate the mean peak time for viral populations, a rank-based heuristic was calculated. For a given viral population, the abundance at each time point was ranked. The ranks from all measurements were then averaged and the peak mean rank time was defined as the time with the highest average, where ties were summarized as the center between tied times [84].

Clustering analysis

Detrended diel time series were scaled to make data dimensionless to reduce impact of magnitude on euclidean distance matrices. Euclidean distance matrices had a Hopkins statistic calculated to determine the meaningfulness of clustering where a value of h = 0.71 was found, indicating data structure cannot be explained by random distribution of distances. To determine the best clustering method we employed hierarchical clustering (implemented by the hclust function in the R stats package v.4.1.1), medoid clustering [109], and training of SOMs [110]. These clustering methods were then evaluated for best fit using the Calinski-Harabasz metric and average silhouette distance. SOM was selected as the best fit clustering method on the basis of identifying the ‘elbow’ in decreasing average silhouette width to initially select three as the operational number of clusters. Fits were assessed in more detail for two and four clusters as well.

Taxonomic classification

Prodigal v2.6.3 [111] was used to predict proteins from the viral populations greater than 10 kb. From the 15,197 viral populations greater than 10 kb, prodigal predicted 360,993 proteins. vConTACT 2 version 0.11.1 [112], was used to classify viral genomic sequence data. This specific tool is designed to cluster and provide a taxonomic context of metagenomic sequencing data. VConTACT2 results were then read into cytoscape v3.10.1 for visualization of the network following the protocol published on protocols.io [112].

Functional annotation and auxiliary metabolic genes (AMGs)

AMGs were annotated using DRAMv v1.4.0 (min-contig-size = 10,000, --skip_trnascan; [113]). AMGs with a score of 1–3 and assigned to a metabolic module, and/or to a previously described AMG (flags M, K, or E) were selected as putative AMGs [70,114]. AMGs were then curated for a conservative catalog using previously established rules [70] where an AMG was only kept if located within virus regions called by checkV and/or found on a contig with a score ≥0.95 assigned by VirSorter2 [70] we screened for non-virus regions by checking for sequences adjacent to phage genome ends, including tRNA regions and inverted/direct repeats. tRNA regions were detected by tRNAscan-SE (v1.2.3; [115]) using general tRNA models. EMBOSS (v6.6.0; [116]) with standard qualifiers and -scircular1 for circular virus contigs was used to detect inverted and direct repeats in contigs. For AMGs with predicted phage ends we used the criteria mentioned in the aforementioned established rules for a conservative catalog [70] and removed AMGs found on viral contigs containing mobile genetic elements and other genes that may facilitate the random integration of microbial metabolic genes using the keywords transposons, lipopolysaccharide islands (glycosyltransferase, nucleotidyl transferase, carbohydrate kinases, and nucleotide sugar epimerase), endonucleases, integrases, or plasmid stability genes [70]. We then filtered for those containing pathway information from KEGG bringing our conservative catalog from 887 AMGs to 349 that contain a KEGG identifier. The conservative AMG catalog was mapped to metabolic pathways using Anvi’o v8 [117]. Metabolic pathways were visualized in KEGG Metabolic Pathways (map 01100) with iPath 3.0 [118] based on the presence of AMG-carrying viruses across depth layers and their diel signals. Pathways with default stepwise completeness less than 0.75 were considered incomplete.

Identifying hosts using iPHoP with the standard database

IPHoP v1.3.2 [65] was used to predict hosts from viral populations >5 kb. Hosts were first identified using the standard database using the default commands.

Identifying hosts using iPHoP with a custom database

A custom database was also created using 89 MAGs from a previous BATS study done in July of 2017 [52]. Following the protocol on iPHoP’s github page (https://bitbucket.org/srouxjgi/iphop/src/main/), GTDB [119] was used to assign taxonomy to the MAGs, which were then added to the standard iPHoP database to create a custom database. GTDB was run separately identifying bacteria with the outgroup p_Patescibacteria and archaea with the outgroup p_Altiarchaeota. Identified MAGs were then added to the standard iPHoP database using the add_to_db command. Identifying hosts from the custom database was done using the predict command.

Sample collection from which MAGs were identified

The custom database uses MAGs identified from Warwick-Dugdale, 2017 dataset [52]. These MAGS were derived from 12 samples at the BATS station (31°40′N, 151 64°10′W) during dusk (~19:00 local time) and dawn (~06:00 local time), from the depths of 80 and 200 m, over a period of four consecutive days from the 8th to 11th of July 2017.

Supporting information

S1 Fig. VirSorter2 and Genomad comparison.

Euler plot of viral identification between Genomad and VirSorter2 tools. Each tool was run on whole dataset, filtered to 5 kb sequences, then results were combined and clustered. Clustered sequences were then checked for if sequences that make up each cluster were exclusively from Genomad, VirSorter2, or shared.

https://doi.org/10.1371/journal.pbio.3003474.s001

(TIF)

S2 Fig. Affinity propagation of depth clusters through viral relative abundance.

Affinity propagation heatmap of Bray-Curtis dissimilarities compared between viral communities at each time point. Clustering generated through affinity propagation of the Bray-Curtis data was significantly different (Multi-Response Permutation Procedure [MRPP], p = 0.001). Clusters within affinity propagation heatmap are distinguished by a colored bar bordering the heatmap and dissimilarity is distinguished by the Bray-Curtis³ color scale. The data underlying this Figure can be found in the zenodo repository (see Data availability) within file all-meancov.txt.

https://doi.org/10.1371/journal.pbio.3003474.s002

(TIF)

S3 Fig. Principal Coordinates Analysis (PCoA) of viral community between SUR and DCM samples.

PCoA of viral community was performed on Bray-Curtis dissimilarities of viral populations in SUR and DCM samples. Each point represents a sample from SUR or DCM indicated by the color of the dot. Ellipses indicate the 95% confidence interval around the centroid of each region, calculated assuming a multivariate normal distribution. Communities were significantly different (Multi-Response Permutation Procedure [MRPP], p = 0.001, a PERMANOVA test p = 0.001). The data underlying this Figure can be found in the zenodo repository (see Data availability) within file all-meancov.txt.

https://doi.org/10.1371/journal.pbio.3003474.s003

(TIF)

S4 Fig. Alpha diversity compared against Global Ocean Virome 2 (GOV2) dataset ecological zones.

Difference of Inverse Simpson’s diversity indices SUR-DCM is plotted for our BATS samples (from October, 2019) along with data for the ecological zones from the GOV2 data [47] of TT (Temperate and tropical epipelagic waters), ANT (Antarctic water), and ARC (Arctic waters). The data underlying this Figure can be found in the zenodo repository (see Data availability) within file Normalized_Viral_Abundances_ALL_95ANI_5kb_ECOLOGY.txt and all-meancov.txt.

https://doi.org/10.1371/journal.pbio.3003474.s004

(TIF)

S5 Fig. Viral Families of SUR diel and non-diel viral populations.

Alluvial plot of viral taxonomy for viral populations present at the family level. Height, order, and pie chart are as described in Fig 3. The data underlying this Figure can be found in S2 Data and in the zenodo repository (see Data availability) within file all-meancov.txt.

https://doi.org/10.1371/journal.pbio.3003474.s005

(TIF)

S6 Fig. Principal Coordinates Analysis (PCoA) of beta-dispersion among three viral archetypes.

Beta-dispersion PCoA was performed on euclidean distance of surface viral populations based on abundance data across time series using S12 Table, reordered by archetype information in S13 Table. Each point represents a viral population colored by archetype, with length of line signifying the distance to median of multivariate dispersion. Archetypes are significantly different from one another using PERMANOVA and Tukey’s test (p < 0.001). The data underlying this Figure can be found in the zenodo repository (see Data availability) within file all-meancov.txt.

https://doi.org/10.1371/journal.pbio.3003474.s006

(TIF)

S1 Table. Metadata for BATS 2019 Cruise.

Table of metadata collected for dataset with regards to latitude, longitude, salinity, temperature, chlorophyll, and depth.

https://doi.org/10.1371/journal.pbio.3003474.s007

(XLSX)

S2 Table. Raw and clean read numbers for samples.

Table of number of raw reads and bases along with clean reads and bases for each sample. Sample T28_D is crossed out due to low sequencing depth from the sequencing run.

https://doi.org/10.1371/journal.pbio.3003474.s008

(XLSX)

S3 Table. Sample data across viral identification pipeline.

Table of number of reads in files throughout the VirSorter2 viral identification pipeline. Starting at sample read depth and ending with checkV clustering and filtering to 10 kb size. Outlier samples are highlighted in different colors.

https://doi.org/10.1371/journal.pbio.3003474.s009

(XLSX)

S4 Table. BATS MAGs GTDB-tk summary file.

Table containing the MAGs used to form the custom iPHoP library for host predictions.

https://doi.org/10.1371/journal.pbio.3003474.s010

(XLSX)

S5 Table. Conservative filtered AMGs annotations.

Table of annotations for all AMGs that passed the conservative filtration process and possess a KEGG identifier.

https://doi.org/10.1371/journal.pbio.3003474.s011

(XLSX)

S6 Table. Diel AMG annotations.

Table of annotations for all conservative AMGs found within viral populations classified as diel.

https://doi.org/10.1371/journal.pbio.3003474.s012

(XLSX)

S7 Table. Non-Diel AMG annotations.

Table of annotations for all conservative AMGs found within viral populations classified as non-diel.

https://doi.org/10.1371/journal.pbio.3003474.s013

(XLSX)

S8 Table. Non-Diel AMG modules.

Table containing the module and pathway information for non-diel AMGs based on Anvi’o classification using anvi-estimate-metabolism that takes the ko_id from an AMG and the gene identifier to determine the pathway and module.

https://doi.org/10.1371/journal.pbio.3003474.s014

(XLSX)

S9 Table. Diel AMG modules.

Table containing the module and pathway information for diel AMGs based on Anvi’o classification using anvi-estimate-metabolism that takes the ko_id from an AMG and the gene identifier to determine the pathway and module.

https://doi.org/10.1371/journal.pbio.3003474.s015

(XLSX)

S10 Table. Host predictions with associated AMGs for depth groups.

Table containing conservative AMGs organized by host prediction for the viral population it was found in along with the number of instances that AMG was found in that predicted host for each depth. The table also contains the ko_name, symbol, and pathway information according to KEGG.

https://doi.org/10.1371/journal.pbio.3003474.s016

(XLSX)

S11 Table. Host predictions with associated AMGs for diel groups.

Table containing conservative AMGs organized by host prediction for the viral population it was found in along with the number of instances that AMG was found in that predicted host for either diel or non-diel viral populations. The table also contains the ko_name, symbol, and pathway information according to KEGG.

https://doi.org/10.1371/journal.pbio.3003474.s017

(XLSX)

S12 Table. Diel viral populations.

Table containing the viral populations considered as diel and their trimmed_mean counts from CoverM across the dataset.

https://doi.org/10.1371/journal.pbio.3003474.s018

(XLSX)

S13 Table. Archetype information.

Table containing viral populations that were diel along with the archetype (cluster) they belong to and the time block they peaked on average throughout the study (time_rank). The x and y columns pertain to coordinates for plotting.

https://doi.org/10.1371/journal.pbio.3003474.s019

(XLSX)

S1 Data. Table of depth profile of chlorophyll and temperature down to 1,000 m depth for 2019 BATS cruise.

https://doi.org/10.1371/journal.pbio.3003474.s020

(XLSX)

S2 Data. Table of viral taxonomy from vConTACT3 using viral populations that are 10 kb or larger.

https://doi.org/10.1371/journal.pbio.3003474.s021

(XLSX)

Acknowledgments

The authors would like to thank the captain and crew of the RV Atlantic Explorer for conducting cruise AE1926 along with the marine technicians who helped in the collection of seawater samples. The authors would also like to thank Garrett Smith, Rokaiya Shatadru, James Riddell V, and James Tan for their suggestions to the research and manuscript, along with the OSU Microbiome Informatics course Microbiology 8161 for their support in the early stages of the analyses. J.S.W. is an investigator at the University of Maryland-Institute for Health Computing, which is supported by funding from Montgomery County, Maryland and The University of Maryland Strategic Partnership: MPowering the State, a formal collaboration between the University of Maryland, College Park and the University of Maryland, Baltimore.

References

1. DeVries T. The ocean carbon cycle. Annu Rev Environ Resour. 2022;47(1):317–41.
- View Article
- Google Scholar
2. IPCC. Climate change 2023 synthesis report. 2023 [cited 2024 Sept 10]; Available from: https://hdl.handle.net/10568/138481
3. Mignot A, Claustre H, Uitz J, Poteau A, D’Ortenzio F, Xing X. Understanding the seasonal dynamics of phytoplankton biomass and the deep chlorophyll maximum in oligotrophic environments: a Bio‐Argo float investigation. Global Biogeochem Cycles. 2014;28(8):856–76.
- View Article
- Google Scholar
4. Beckmann A, Hense I. A fresh look at the nutrient cycling in the oligotrophic ocean. Biogeochemistry. 2009;96(1):1–11.
- View Article
- Google Scholar
5. Wilhelm SW, Suttle CA. Viruses and nutrient cycles in the sea: viruses play critical roles in the structure and function of aquatic food webs. BioScience. 1999;49(10).
- View Article
- Google Scholar
6. Sullivan MB, Weitz JS, Wilhelm S. Viral ecology comes of age. Environ Microbiol Rep. 2017;9(1):33–5. pmid:27888577
- View Article
- PubMed/NCBI
- Google Scholar
7. Luo E, Leu AO, Eppley JM, Karl DM, DeLong EF. Diversity and origins of bacterial and archaeal viruses on sinking particles reaching the abyssal ocean. ISME J. 2022;16(6):1627–35. pmid:35236926
- View Article
- PubMed/NCBI
- Google Scholar
8. Kaneko H, Blanc-Mathieu R, Endo H, Chaffron S, Delmont TO, Gaia M, et al. Eukaryotic virus composition can predict the efficiency of carbon export in the global ocean. iScience. 2020;24(1):102002. pmid:33490910
- View Article
- PubMed/NCBI
- Google Scholar
9. Vincent F, Gralka M, Schleyer G, Schatz D, Cabrera-Brufau M, Kuhlisch C, et al. Viral infection switches the balance between bacterial and eukaryotic recyclers of organic matter during coccolithophore blooms. Nat Commun. 2023;14(1):510. pmid:36720878
- View Article
- PubMed/NCBI
- Google Scholar
10. Walde M, Camplong C, de Vargas C, Baudoux AC, Simon N. Viral infection impacts the 3D subcellular structure of the abundant marine diatom Guinardia delicatula. Front Mar Sci. 2023;9.
- View Article
- Google Scholar
11. Guidi L, Chaffron S, Bittner L, Eveillard D, Larhlimi A, Roux S, et al. Plankton networks driving carbon export in the oligotrophic ocean. Nature. 2016;532(7600):465–70. pmid:26863193
- View Article
- PubMed/NCBI
- Google Scholar
12. Forterre P. The virocell concept and environmental microbiology. ISME J. 2013;7(2):233–6. pmid:23038175
- View Article
- PubMed/NCBI
- Google Scholar
13. Howard-Varona C, Lindback MM, Bastien GE, Solonenko N, Zayed AA, Jang H, et al. Phage-specific metabolic reprogramming of virocells. ISME J. 2020;14(4):881–95. pmid:31896786
- View Article
- PubMed/NCBI
- Google Scholar
14. Howard-Varona C, Lindback MM, Fudyma JD, Krongauz A, Solonenko NE, Zayed AA, et al. Environment-specific virocell metabolic reprogramming. ISME J. 2024;18(1):wrae055. pmid:38552150
- View Article
- PubMed/NCBI
- Google Scholar
15. Howard-Varona C, Roux S, Bowen BP, Silva LP, Lau R, Schwenck SM. Protist impacts on marine cyanovirocell metabolism. ISME Commun. 2022;2(1):1–14.
- View Article
- Google Scholar
16. Lindback MM, Howard-Varona C, Fudyma J, Tfaily M, Sullivan MB, Duhaime MB. Virocell resource manipulation under nutrient limitation. mSystems. 2025;10(7):e00521–25.
- View Article
- Google Scholar
17. Urvoy M, Howard-Varona C, Owusu-Ansah C, Stai AJ, Bouranis JA, Burris M, et al. Phage resistance mutations in a marine bacterium impact biogeochemically relevant cellular processes. Nat Microbiol. 2026;11(1):195–210. https://doi.org/10.1038/s41564-025-02202-5
- View Article
- Google Scholar
18. Thingstad TF. Elements of a theory for the mechanisms controlling abundance, diversity, and biogeochemical role of lytic bacterial viruses in aquatic systems. Limnol Oceanogr. 2000;45(6):1320–8.
- View Article
- Google Scholar
19. Winter C, Bouvier T, Weinbauer MG, Thingstad TF. Trade-offs between competition and defense specialists among unicellular planktonic organisms: the “killing the winner” hypothesis revisited. Microbiol Mol Biol Rev. 2010;74(1):42–57. pmid:20197498
- View Article
- PubMed/NCBI
- Google Scholar
20. Thingstad T, Lignell R. Theoretical models for the control of bacterial growth rate, abundance, diversity and carbon demand. Aquat Microb Ecol. 1997;13:19–27.
- View Article
- Google Scholar
21. Jover LF, Cortez MH, Weitz JS. Mechanisms of multi-strain coexistence in host-phage systems with nested infection networks. J Theor Biol. 2013;332:65–77. pmid:23608631
- View Article
- PubMed/NCBI
- Google Scholar
22. Weitz JS, Stock CA, Wilhelm SW, Bourouiba L, Coleman ML, Buchan A, et al. A multitrophic model to quantify the effects of marine viruses on microbial food webs and ecosystem processes. ISME J. 2015;9(6):1352–64. pmid:25635642
- View Article
- PubMed/NCBI
- Google Scholar
23. Krishna S, Peterson V, Listmann L, Hinners J. Modelling the interactive effects of viral presence and global warming on Baltic Sea ecosystem dynamics. Biogeosciences Discussions. 2023;:1–23.
- View Article
- Google Scholar
24. Marantos A, Mitarai N, Sneppen K. From kill the winner to eliminate the winner in open phage-bacteria systems. PLoS Comput Biol. 2022;18(8):e1010400. pmid:35939510
- View Article
- PubMed/NCBI
- Google Scholar
25. Beckett SJ, Demory D, Coenen AR, Casey JR, Dugenne M, Follett CL, et al. Disentangling top-down drivers of mortality underlying diel population dynamics of Prochlorococcus in the North Pacific Subtropical Gyre. Nat Commun. 2024;15(1):2105. pmid:38453897
- View Article
- PubMed/NCBI
- Google Scholar
26. Talmy D, Beckett SJ, Taniguchi DAA, Brussaard CPD, Weitz JS, Follows MJ. An empirical model of carbon flow through marine viruses and microzooplankton grazers. Environ Microbiol. 2019;21(6):2171–81. pmid:30969467
- View Article
- PubMed/NCBI
- Google Scholar
27. Hinson A, Papoulis S, Fiet L, Knight M, Cho P, Szeltner B, et al. A model of algal‐virus population dynamics reveals underlying controls on material transfer. Limnol Oceanogr. 2022;68(1):165–80.
- View Article
- Google Scholar
28. Weitz J. Quantitative viral ecology: dynamics of viruses and their microbial hosts. Oxford: Princeton University Press; 2015.
29. Record NR, Talmy D, Våge S. Quantifying tradeoffs for marine viruses. Front Mar Sci. 2016;3.
- View Article
- Google Scholar
30. Lambert S, Tragin M, Lozano J-C, Ghiglione J-F, Vaulot D, Bouget F-Y, et al. Rhythmicity of coastal marine picoeukaryotes, bacteria and archaea despite irregular environmental perturbations. ISME J. 2019;13(2):388–401. pmid:30254323
- View Article
- PubMed/NCBI
- Google Scholar
31. Karl DM, Church MJ. Microbial oceanography and the Hawaii Ocean Time-series programme. Nat Rev Microbiol. 2014;12(10):699–713. pmid:25157695
- View Article
- PubMed/NCBI
- Google Scholar
32. DeLong EF, Preston CM, Mincer T, Rich V, Hallam SJ, Frigaard NU. Community genomics among stratified microbial assemblages in the ocean’s interior. Science. 2006;311(5760):496–503.
- View Article
- Google Scholar
33. Luo E, Eppley JM, Romano AE, Mende DR, DeLong EF. Double-stranded DNA virioplankton dynamics and reproductive strategies in the oligotrophic open ocean water column. ISME J. 2020;14(5):1304–15. pmid:32060418
- View Article
- PubMed/NCBI
- Google Scholar
34. Chow C-ET, Kim DY, Sachdeva R, Caron DA, Fuhrman JA. Top-down controls on bacterial community structure: microbial network analysis of bacteria, T4-like viruses and protists. ISME J. 2014;8(4):816–29. pmid:24196323
- View Article
- PubMed/NCBI
- Google Scholar
35. Needham DM, Chow C-ET, Cram JA, Sachdeva R, Parada A, Fuhrman JA. Short-term observations of marine bacterial and viral communities: patterns, connections and resilience. ISME J. 2013;7(7):1274–85. pmid:23446831
- View Article
- PubMed/NCBI
- Google Scholar
36. Yeh Y-C, Fuhrman JA. Effects of phytoplankton, viral communities, and warming on free-living and particle-associated marine prokaryotic community structure. Nat Commun. 2022;13(1):7905. pmid:36550140
- View Article
- PubMed/NCBI
- Google Scholar
37. Sullivan MB. Viromes, not gene markers, for studying double-stranded DNA virus communities. J Virol. 2015;89(5):2459–61. pmid:25540374
- View Article
- PubMed/NCBI
- Google Scholar
38. Ignacio-Espinoza JC, Ahlgren NA, Fuhrman JA. Long-term stability and Red Queen-like strain dynamics in marine viruses. Nat Microbiol. 2020;5(2):265–71. pmid:31819214
- View Article
- PubMed/NCBI
- Google Scholar
39. Boysen AK, Carlson LT, Durham BP, Groussman RD, Aylward FO, Ribalet F, et al. Particulate metabolites and transcripts reflect diel oscillations of microbial activity in the surface ocean. mSystems. 2021;6(3):e00896-20. pmid:33947808
- View Article
- PubMed/NCBI
- Google Scholar
40. Ottesen EA, Young CR, Eppley JM, Ryan JP, Chavez FP, Scholin CA, et al. Pattern and synchrony of gene expression among sympatric marine microbial populations. Proc Natl Acad Sci U S A. 2013;110(6):E488-97. pmid:23345438
- View Article
- PubMed/NCBI
- Google Scholar
41. Kolody BC, McCrow JP, Allen LZ, Aylward FO, Fontanez KM, Moustafa A, et al. Diel transcriptional response of a California current plankton microbiome to light, low iron, and enduring viral infection. ISME J. 2019;13(11):2817–33. pmid:31320727
- View Article
- PubMed/NCBI
- Google Scholar
42. Aylward FO, Boeuf D, Mende DR, Wood-Charlson EM, Vislova A, Eppley JM, et al. Diel cycling and long-term persistence of viruses in the ocean’s euphotic zone. Proc Natl Acad Sci U S A. 2017;114(43):11446–51. pmid:29073070
- View Article
- PubMed/NCBI
- Google Scholar
43. Hevroni G, Flores-Uribe J, Béjà O, Philosof A. Seasonal and diel patterns of abundance and activity of viruses in the Red Sea. Proc Natl Acad Sci U S A. 2020;117(47):29738–47. pmid:33172994
- View Article
- PubMed/NCBI
- Google Scholar
44. Liu R, Liu Y, Chen Y, Zhan Y, Zeng Q. Cyanobacterial viruses exhibit diurnal rhythms during infection. Proc Natl Acad Sci U S A. 2019;116(28):14077–82. pmid:31235591
- View Article
- PubMed/NCBI
- Google Scholar
45. Brum JR, Sullivan MB. Rising to the challenge: accelerated pace of discovery transforms marine virology. Nat Rev Microbiol. 2015;13(3):147–59. pmid:25639680
- View Article
- PubMed/NCBI
- Google Scholar
46. Gregory AC, Solonenko SA, Ignacio-Espinoza JC, LaButti K, Copeland A, Sudek S, et al. Genomic differentiation among wild cyanophages despite widespread horizontal gene transfer. BMC Genomics. 2016;17(1):930. pmid:27852226
- View Article
- PubMed/NCBI
- Google Scholar
47. Gregory AC, Zayed AA, Conceição-Neto N, Temperton B, Bolduc B, Alberti A. Marine DNA viral macro- and microdiversity from pole to pole. Cell. 2019;177(5):1109-1123.e14.
- View Article
- Google Scholar
48. Roux S. A viral ecogenomics framework to uncover the secrets of nature’s “Microbe Whisperers”. mSystems. 2019;4(3):e00111-19. pmid:31120025
- View Article
- PubMed/NCBI
- Google Scholar
49. Bates NR, Johnson RJ. Forty years of ocean acidification observations (1983–2023) in the Sargasso Sea at the Bermuda Atlantic Time-series Study site. Frontiers in Marine Science. 2023.
- View Article
- Google Scholar
50. Parsons RJ, Breitbart M, Lomas MW, Carlson CA. Ocean time-series reveals recurring seasonal patterns of virioplankton dynamics in the northwestern Sargasso Sea. ISME J. 2012;6(2):273–84. pmid:21833038
- View Article
- PubMed/NCBI
- Google Scholar
51. Angly FE, Felts B, Breitbart M, Salamon P, Edwards RA, Carlson C. The marine viromes of four oceanic regions. PLOS Biology. 2006;4(11):e368.
- View Article
- Google Scholar
52. Warwick-Dugdale J, Tian F, Michelsen ML, Cronin DR, Moore K, Farbos A, et al. Long-read powered viral metagenomics in the oligotrophic Sargasso Sea. Nat Commun. 2024;15(1):4089. pmid:38744831
- View Article
- PubMed/NCBI
- Google Scholar
53. John SG, Mendez CB, Deng L, Poulos B, Kauffman AKM, Kern S, et al. A simple and efficient method for concentration of ocean viruses by chemical flocculation. Environ Microbiol Rep. 2011;3(2):195–202. pmid:21572525
- View Article
- PubMed/NCBI
- Google Scholar
54. Gilbert NE, Muratore D, Gochev CS, LeCleir GR, Cagle SM, Pound HL, et al. Seasonal enhancement of the viral shunt catalyzes a subsurface oxygen maximum in the Sargasso Sea. Nat Commun. 2025;17(1):352. pmid:41353448
- View Article
- PubMed/NCBI
- Google Scholar
55. Guo J, Bolduc B, Zayed AA, Varsani A, Dominguez-Huerta G, Delmont TO, et al. VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA viruses. Microbiome. 2021;9(1):37. pmid:33522966
- View Article
- PubMed/NCBI
- Google Scholar
56. Guo J, Vik D, Pratama AA, Roux S, Sullivan M. Viral sequence identification SOP with VirSorter2. 2021 July 19 [cited 2025 June 16]; Available from: https://www.protocols.io/view/viral-sequence-identification-sop-with-virsorter2-bwm5pc86
57. Roux S, Adriaenssens EM, Dutilh BE, Koonin EV, Kropinski AM, Krupovic M, et al. Minimum Information about an Uncultivated Virus Genome (MIUViG). Nat Biotechnol. 2019;37(1):29–37. pmid:30556814
- View Article
- PubMed/NCBI
- Google Scholar
58. Camargo AP, Roux S, Schulz F, Babinski M, Xu Y, Hu B, et al. Identification of mobile genetic elements with geNomad. Nat Biotechnol. 2024;42(8):1303–12. pmid:37735266
- View Article
- PubMed/NCBI
- Google Scholar
59. Hegarty B, Riddell V J, Bastien E, Langenfeld K, Lindback M, Saini JS, et al. Benchmarking informatics approaches for virus discovery: caution is needed when combining in silico identification methods. mSystems. 2024;9(3):e0110523. pmid:38376167
- View Article
- PubMed/NCBI
- Google Scholar
60. Duhaime MB, Solonenko N, Roux S, Verberkmoes NC, Wichels A, Sullivan MB. Comparative omics and trait analyses of marine pseudoalteromonas phages advance the phage OTU concept. Front Microbiol. 2017;8:1241.
- View Article
- Google Scholar
61. Cox JL, Wiebe PH, Ortner P, Boyd S. Seasonal development of subsurface chlorophyll maxima in slope water and northern Sargasso Sea of the northwestern Atlantic Ocean. Biol Oceanogr. 1982;1(3):271–85.
- View Article
- Google Scholar
62. Gill JG, Hill-Spanik KM, Whittaker KA, Jones ML, Plante C. Sargasso Sea bacterioplankton community structure and drivers of variance as revealed by DNA metabarcoding analysis. PeerJ. 2022;10:e12835.
- View Article
- Google Scholar
63. Michaels AF, Knap AH. Overview of the U.S. JGOFS Bermuda Atlantic Time-series Study and the Hydrostation S program. Deep Sea Res Part II Topic Stud Oceanogr. 1996;43(2–3):157–98.
- View Article
- Google Scholar
64. Liu R, Liu Y, Chen Y, Zhan Y, Zeng Q. Cyanobacterial viruses exhibit diurnal rhythms during infection. Proc Natl Acad Sci U S A. 2019;116(28):14077–82.
- View Article
- Google Scholar
65. Roux S, Camargo AP, Coutinho FH, Dabdoub SM, Dutilh BE, Nayfach S, et al. iPHoP: an integrated machine learning framework to maximize host prediction for metagenome-derived viruses of archaea and bacteria. PLoS Biol. 2023;21(4):e3002083. pmid:37083735
- View Article
- PubMed/NCBI
- Google Scholar
66. Bolaños LM, Tait K, Somerfield PJ, Parsons RJ, Giovannoni SJ, Smyth T, et al. Influence of short and long term processes on SAR11 communities in open ocean and coastal systems. ISME Commun. 2022;2(1):116. pmid:37938786
- View Article
- PubMed/NCBI
- Google Scholar
67. Giovannoni SJ. SAR11 bacteria: the most abundant plankton in the oceans. Ann Rev Mar Sci. 2017;9:231–55. pmid:27687974
- View Article
- PubMed/NCBI
- Google Scholar
68. Malmstrom RR, Coe A, Kettler GC, Martiny AC, Frias-Lopez J, Zinser ER, et al. Temporal dynamics of Prochlorococcus ecotypes in the Atlantic and Pacific oceans. ISME J. 2010;4(10):1252–64. pmid:20463762
- View Article
- PubMed/NCBI
- Google Scholar
69. Breitbart M, Bonnain C, Malki K, Sawaya NA. Phage puppet masters of the marine microbial realm. Nat Microbiol. 2018;3(7):754–66. pmid:29867096
- View Article
- PubMed/NCBI
- Google Scholar
70. Tian F, Wainaina JM, Howard-Varona C, Domínguez-Huerta G, Bolduc B, Gazitúa MC. Prokaryotic-virus-encoded auxiliary metabolic genes throughout the global oceans. Microbiome. 2024;12:159.
- View Article
- Google Scholar
71. Hawco NJ, McIlvin MM, Bundy RM, Tagliabue A, Goepfert TJ, Moran DM, et al. Minimal cobalt metabolism in the marine cyanobacterium Prochlorococcus. Proc Natl Acad Sci U S A. 2020;117(27):15740–7. pmid:32576688
- View Article
- PubMed/NCBI
- Google Scholar
72. Saito MA, Moffett JW, Chisholm SW, Waterbury JB. Cobalt limitation and uptake in Prochlorococcus. Limnol Oceanogr. 2002;47(6):1629–36.
- View Article
- Google Scholar
73. Wittmers F, Needham DM, Hehenberger E, Giovannoni SJ, Worden AZ. Genomes from uncultivated pelagiphages reveal multiple phylogenetic clades exhibiting extensive auxiliary metabolic genes and cross-family multigene transfers. mSystems. 2022;7(5):e01522-21.
- View Article
- Google Scholar
74. Sullivan MB, Huang KH, Ignacio-Espinoza JC, Berlin AM, Kelly L, Weigele PR, et al. Genomic analysis of oceanic cyanobacterial myoviruses compared with T4-like myoviruses from diverse hosts and environments. Environ Microbiol. 2010;12(11):3035–56. pmid:20662890
- View Article
- PubMed/NCBI
- Google Scholar
75. Goldsmith DB, Parsons RJ, Beyene D, Salamon P, Breitbart M. Deep sequencing of the viral phoH gene reveals temporal variation, depth-specific composition, and persistent dominance of the same viral phoH genes in the Sargasso Sea. PeerJ. 2015;3:e997.
- View Article
- Google Scholar
76. Goldsmith DB, Crosti G, Dwivedi B, McDaniel LD, Varsani A, Suttle CA, et al. Development of phoH as a novel signature gene for assessing marine phage diversity. Appl Environ Microbiol. 2011;77(21):7730–9. pmid:21926220
- View Article
- PubMed/NCBI
- Google Scholar
77. Xiang S, Li J, Chen Z, Cheng R, Wang L, Yu L, et al. Ecological diversity and metabolic strategies of widespread Marinisomatota in global oceans. Mar Life Sci Technol. 2025;7(3):523–36.
- View Article
- Google Scholar
78. Prabhu A, Zaugg J, Chan CX, McIlroy SJ, Rinke C. Insights into phylogeny, diversity and functional potential of Poseidoniales viruses. Environ Microbiol. 2025;27(1):e70017. pmid:39777783
- View Article
- PubMed/NCBI
- Google Scholar
79. Chen S, Tao J, Chen Y, Wang W, Fan L, Zhang C. Interactions between marine group II archaea and phytoplankton revealed by population correlations in the northern coast of South China Sea. Front Microbiol. 2022;12:785532.
- View Article
- Google Scholar
80. Wu H-H, Pun MD, Wise CE, Streit BR, Mus F, Berim A, et al. The pathway for coenzyme M biosynthesis in bacteria. Proc Natl Acad Sci U S A. 2022;119(36):e2207190119. pmid:36037354
- View Article
- PubMed/NCBI
- Google Scholar
81. Hall N, Wong WW, Lappan R, Ricci F, Glud RN, Kawaichi S. Aerotolerant methanogens use seaweed and seagrass metabolites to drive marine methane emissions. bioRxiv. 2024. 2024.10.14.618369. https://doi.org/10.1101/2024.10.14.618369
82. Kolomijeca A, Marx L, Reynolds S, Cariou T, Mawji E, Boulart C. An update on dissolved methane distribution in the subtropical North Atlantic Ocean. Ocean Sci. 2022;18(5):1377–88.
- View Article
- Google Scholar
83. Vik D, Bolduc B, Roux S, Sun CL, Pratama AA, Krupovic M, et al. MArVD2: a machine learning enhanced tool to discriminate between archaeal and bacterial viruses in viral datasets. ISME Commun. 2023;3(1):87. pmid:37620369
- View Article
- PubMed/NCBI
- Google Scholar
84. Muratore D, Boysen AK, Harke MJ, Becker KW, Casey JR, Coesel SN, et al. Complex marine microbial communities partition metabolism of scarce resources over the diel cycle. Nat Ecol Evol. 2022;6(2):218–29. pmid:35058612
- View Article
- PubMed/NCBI
- Google Scholar
85. Thaben PF, Westermark PO. Detecting rhythms in time series with RAIN. J Biol Rhythms. 2014;29(6):391–400. pmid:25326247
- View Article
- PubMed/NCBI
- Google Scholar
86. Mrudulakumari Vasudevan U, Lee EY. Flavonoids, terpenoids, and polyketide antibiotics: Role of glycosylation and biocatalytic tactics in engineering glycosylation. Biotechnol Adv. 2020;41:107550. pmid:32360984
- View Article
- PubMed/NCBI
- Google Scholar
87. Mruwat N, Carlson MCG, Goldin S, Ribalet F, Kirzner S, Hulata Y, et al. A single-cell polony method reveals low levels of infected Prochlorococcus in oligotrophic waters despite high cyanophage abundances. ISME J. 2021;15(1):41–54. pmid:32918065
- View Article
- PubMed/NCBI
- Google Scholar
88. Zablocki O, Michelsen M, Burris M, Solonenko N, Warwick-Dugdale J, Ghosh R, et al. VirION2: a short- and long-read sequencing and informatics workflow to study the genomic diversity of viruses in nature. PeerJ. 2021;9:e11088. pmid:33850654
- View Article
- PubMed/NCBI
- Google Scholar
89. Martinez-Hernandez F, Fornas O, Lluesma Gomez M, Bolduc B, de la Cruz Peña MJ, Martínez JM, et al. Single-virus genomics reveals hidden cosmopolitan and abundant viruses. Nat Commun. 2017;8:15892. pmid:28643787
- View Article
- PubMed/NCBI
- Google Scholar
90. Jang HB, Chittick L, Li Y-F, Zablocki O, Sanderson CM, Carrillo A, et al. Viral tag and grow: a scalable approach to capture and characterize infectious virus-host pairs. ISME Commun. 2022;2(1):12. pmid:37938680
- View Article
- PubMed/NCBI
- Google Scholar
91. Shatadru RN, Solonenko NE, Sun CL, Sullivan MB. Synthetic community Hi-C benchmarking provides a baseline for virus-host inferences. bioRxiv. 2025. 2025.02.12.637985. https://www.biorxiv.org/content/10.1101/2025.02.12.637985v2
92. Trubl G, Roux S, Kellom M, Vyshenska D, Tomatsu A, Singh K. Tracking persistence and dynamics of active soil viruses with SIP-Viromics. bioRxiv. 2025. 2025.05.25.655894. https://www.biorxiv.org/content/10.1101/2025.05.25.655894v1
93. Guo J, Aroney S, Dominguez-Huerta G, Smith D, Vik D, Ansah CO. Mobile genetic elements that shape microbial diversity and functions in thawing permafrost soils. bioRxiv. 2025. 2025.02.12.637893. https://www.biorxiv.org/content/10.1101/2025.02.12.637893v1
94. Régimbeau A, Tian F, Smith G, Riddell J, Andreani C, Bordron P. Planetary-scale marine community modeling predicts metabolic synergy and viral impacts. bioRxiv. 2025. 2025.02.13.638167. https://www.biorxiv.org/content/10.1101/2025.02.13.638167v1
95. Brum JR, Hurwitz BL, Schofield O, Ducklow HW, Sullivan MB. Seasonal time bombs: dominant temperate viruses affect Southern Ocean microbial dynamics. ISME J. 2016;10(2):437–49. pmid:26296067
- View Article
- PubMed/NCBI
- Google Scholar
96. D’Alelio D, Rampone S, Cusano LM, Morfino V, Russo L, Sanseverino N, et al. Machine learning identifies a strong association between warming and reduced primary productivity in an oligotrophic ocean gyre. Sci Rep. 2020;10(1):3287. pmid:32098970
- View Article
- PubMed/NCBI
- Google Scholar
97. Corno G, Karl DM, Church MJ, Letelier RM, Lukas R, Bidigare RR, et al. Impact of climate forcing on ecosystem processes in the North Pacific Subtropical Gyre. J Geophys Res. 2007;112(C4).
- View Article
- Google Scholar
98. Henn MR, Sullivan MB, Stange-Thomann N, Osburne MS, Berlin AM, Kelly L, et al. Analysis of high-throughput sequencing and annotation strategies for phage genomes. PLoS One. 2010;5(2):e9083. pmid:20140207
- View Article
- PubMed/NCBI
- Google Scholar
99. Ohio Supercomputer Center [Internet]. 2014 [cited 2024 Sept 10]. Available from: https://www.osc.edu/resources/getting_started/citation
100. Li D, Liu C-M, Luo R, Sadakane K, Lam T-W. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics. 2015;31(10):1674–6. pmid:25609793
- View Article
- PubMed/NCBI
- Google Scholar
101. Nayfach S, Camargo AP, Schulz F, Eloe-Fadrosh E, Roux S, Kyrpides NC. CheckV assesses the quality and completeness of metagenome-assembled viral genomes. Nat Biotechnol. 2021;39(5):578–85. pmid:33349699
- View Article
- PubMed/NCBI
- Google Scholar
102. Simpson EH. Measurement of diversity. Nature. 1949;163(4148):688.
- View Article
- Google Scholar
103. Frey BJ, Dueck D. Clustering by passing messages between data points. Science. 2007;315(5814):972–6. pmid:17218491
- View Article
- PubMed/NCBI
- Google Scholar
104. Gower JC. Some distance properties of latent root and vector methods used in multivariate analysis. Biometrika. 1966;53(3–4):325–38.
- View Article
- Google Scholar
105. Field J, Clarke K, Warwick R. A practical strategy for analysing multispecies distribution patterns. Mar Ecol Prog Ser. 1982;8:37–52.
- View Article
- Google Scholar
106. Oksanen J, Simpson GL, Blanchet FG, Kindt R, Legendre P, Minchin PR, et al. vegan: community ecology package [Internet]. 2001 [cited 2024 Sept 10]. p. 2.6–8. Available from: https://CRAN.R-project.org/package=vegan
107. Mielke PW Jr, Berry KJ, Johnson ES. Multi-response permutation procedures for a priori classifications. Commun Stat Theory Methods. 1976;5(14):1409–24.
- View Article
- Google Scholar
108. Clarke K, Ainsworth M. A method of linking multivariate community structure to environmental variables. Mar Ecol Prog Ser. 1993;92:205–19.
- View Article
- Google Scholar
109. Maechler M, Rousseeuw PJ. Finding groups in data: cluster analysis extended. 2025. [cited 2025 Apr 19]. Available from: https://cran.r-project.org/web/packages/cluster/index.html
110. Wehrens R, Buydens LMC. Self- and super-organizing maps in R: the kohonen package. J Stat Softw. 2007;21:1–19.
- View Article
- Google Scholar
111. Hyatt D, Chen G-L, Locascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11:119. pmid:20211023
- View Article
- PubMed/NCBI
- Google Scholar
112. Bolduc B. Applying vContact to viral sequences and visualizing the output (Cyverse). 2019 Feb 14 [cited 2024 Sept 10]; Available from: https://www.protocols.io/view/applying-vcontact-to-viral-sequences-and-visualizi-x5xfq7n
113. Shaffer M, Borton MA, McGivern BB, Zayed AA, La Rosa SL, Solden LM, et al. DRAM for distilling microbial metabolism to automate the curation of microbiome function. Nucleic Acids Res. 2020;48(16):8883–900. pmid:32766782
- View Article
- PubMed/NCBI
- Google Scholar
114. Vik D, Gazitúa MC, Sun CL, Zayed AA, Aldunate M, Mulholland MR, et al. Genome-resolved viral ecology in a marine oxygen minimum zone. Environ Microbiol. 2021;23(6):2858–74. pmid:33185964
- View Article
- PubMed/NCBI
- Google Scholar
115. Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25(5):955–64. pmid:9023104
- View Article
- PubMed/NCBI
- Google Scholar
116. Rice P, Longden I, Bleasby A. EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000;16(6):276–7.
- View Article
- Google Scholar
117. Eren AM, Kiefl E, Shaiber A, Veseli I, Miller SE, Schechter MS, et al. Community-led, integrated, reproducible multi-omics with anvi’o. Nat Microbiol. 2021;6(1):3–6. pmid:33349678
- View Article
- PubMed/NCBI
- Google Scholar
118. Darzi Y, Letunic I, Bork P, Yamada T. iPath3.0: interactive pathways explorer v3. Nucleic Acids Research. 2018;46(W1):W510-3.
- View Article
- Google Scholar
119. Chaumeil PA, Mussig AJ, Hugenholtz P, Parks DH. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics. 2019;36(6):1925–7.
- View Article
- Google Scholar

[ref1] 1. DeVries T. The ocean carbon cycle. Annu Rev Environ Resour. 2022;47(1):317–41.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. IPCC. Climate change 2023 synthesis report. 2023 [cited 2024 Sept 10]; Available from: https://hdl.handle.net/10568/138481

[ref3] 3. Mignot A, Claustre H, Uitz J, Poteau A, D’Ortenzio F, Xing X. Understanding the seasonal dynamics of phytoplankton biomass and the deep chlorophyll maximum in oligotrophic environments: a Bio‐Argo float investigation. Global Biogeochem Cycles. 2014;28(8):856–76.
View Article
Google Scholar

[6] View Article

[7] Google Scholar

[ref4] 4. Beckmann A, Hense I. A fresh look at the nutrient cycling in the oligotrophic ocean. Biogeochemistry. 2009;96(1):1–11.
View Article
Google Scholar

[9] View Article

[10] Google Scholar

[ref5] 5. Wilhelm SW, Suttle CA. Viruses and nutrient cycles in the sea: viruses play critical roles in the structure and function of aquatic food webs. BioScience. 1999;49(10).
View Article
Google Scholar

[12] View Article

[13] Google Scholar

[ref6] 6. Sullivan MB, Weitz JS, Wilhelm S. Viral ecology comes of age. Environ Microbiol Rep. 2017;9(1):33–5. pmid:27888577
View Article
PubMed/NCBI
Google Scholar

[15] View Article

[16] PubMed/NCBI

[17] Google Scholar

[ref7] 7. Luo E, Leu AO, Eppley JM, Karl DM, DeLong EF. Diversity and origins of bacterial and archaeal viruses on sinking particles reaching the abyssal ocean. ISME J. 2022;16(6):1627–35. pmid:35236926
View Article
PubMed/NCBI
Google Scholar

[19] View Article

[20] PubMed/NCBI

[21] Google Scholar

[ref8] 8. Kaneko H, Blanc-Mathieu R, Endo H, Chaffron S, Delmont TO, Gaia M, et al. Eukaryotic virus composition can predict the efficiency of carbon export in the global ocean. iScience. 2020;24(1):102002. pmid:33490910
View Article
PubMed/NCBI
Google Scholar

[23] View Article

[24] PubMed/NCBI

[25] Google Scholar

[ref9] 9. Vincent F, Gralka M, Schleyer G, Schatz D, Cabrera-Brufau M, Kuhlisch C, et al. Viral infection switches the balance between bacterial and eukaryotic recyclers of organic matter during coccolithophore blooms. Nat Commun. 2023;14(1):510. pmid:36720878
View Article
PubMed/NCBI
Google Scholar

[27] View Article

[28] PubMed/NCBI

[29] Google Scholar

[ref10] 10. Walde M, Camplong C, de Vargas C, Baudoux AC, Simon N. Viral infection impacts the 3D subcellular structure of the abundant marine diatom Guinardia delicatula. Front Mar Sci. 2023;9.
View Article
Google Scholar

[31] View Article

[32] Google Scholar

[ref11] 11. Guidi L, Chaffron S, Bittner L, Eveillard D, Larhlimi A, Roux S, et al. Plankton networks driving carbon export in the oligotrophic ocean. Nature. 2016;532(7600):465–70. pmid:26863193
View Article
PubMed/NCBI
Google Scholar

[34] View Article

[35] PubMed/NCBI

[36] Google Scholar

[ref12] 12. Forterre P. The virocell concept and environmental microbiology. ISME J. 2013;7(2):233–6. pmid:23038175
View Article
PubMed/NCBI
Google Scholar

[38] View Article

[39] PubMed/NCBI

[40] Google Scholar

[ref13] 13. Howard-Varona C, Lindback MM, Bastien GE, Solonenko N, Zayed AA, Jang H, et al. Phage-specific metabolic reprogramming of virocells. ISME J. 2020;14(4):881–95. pmid:31896786
View Article
PubMed/NCBI
Google Scholar

[42] View Article

[43] PubMed/NCBI

[44] Google Scholar

[ref14] 14. Howard-Varona C, Lindback MM, Fudyma JD, Krongauz A, Solonenko NE, Zayed AA, et al. Environment-specific virocell metabolic reprogramming. ISME J. 2024;18(1):wrae055. pmid:38552150
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref15] 15. Howard-Varona C, Roux S, Bowen BP, Silva LP, Lau R, Schwenck SM. Protist impacts on marine cyanovirocell metabolism. ISME Commun. 2022;2(1):1–14.
View Article
Google Scholar

[50] View Article

[51] Google Scholar

[ref16] 16. Lindback MM, Howard-Varona C, Fudyma J, Tfaily M, Sullivan MB, Duhaime MB. Virocell resource manipulation under nutrient limitation. mSystems. 2025;10(7):e00521–25.
View Article
Google Scholar

[53] View Article

[54] Google Scholar

[ref17] 17. Urvoy M, Howard-Varona C, Owusu-Ansah C, Stai AJ, Bouranis JA, Burris M, et al. Phage resistance mutations in a marine bacterium impact biogeochemically relevant cellular processes. Nat Microbiol. 2026;11(1):195–210. https://doi.org/10.1038/s41564-025-02202-5
View Article
Google Scholar

[56] View Article

[57] Google Scholar

[ref18] 18. Thingstad TF. Elements of a theory for the mechanisms controlling abundance, diversity, and biogeochemical role of lytic bacterial viruses in aquatic systems. Limnol Oceanogr. 2000;45(6):1320–8.
View Article
Google Scholar

[59] View Article

[60] Google Scholar

[ref19] 19. Winter C, Bouvier T, Weinbauer MG, Thingstad TF. Trade-offs between competition and defense specialists among unicellular planktonic organisms: the “killing the winner” hypothesis revisited. Microbiol Mol Biol Rev. 2010;74(1):42–57. pmid:20197498
View Article
PubMed/NCBI
Google Scholar

[62] View Article

[63] PubMed/NCBI

[64] Google Scholar

[ref20] 20. Thingstad T, Lignell R. Theoretical models for the control of bacterial growth rate, abundance, diversity and carbon demand. Aquat Microb Ecol. 1997;13:19–27.
View Article
Google Scholar

[66] View Article

[67] Google Scholar

[ref21] 21. Jover LF, Cortez MH, Weitz JS. Mechanisms of multi-strain coexistence in host-phage systems with nested infection networks. J Theor Biol. 2013;332:65–77. pmid:23608631
View Article
PubMed/NCBI
Google Scholar

[69] View Article

[70] PubMed/NCBI

[71] Google Scholar

[ref22] 22. Weitz JS, Stock CA, Wilhelm SW, Bourouiba L, Coleman ML, Buchan A, et al. A multitrophic model to quantify the effects of marine viruses on microbial food webs and ecosystem processes. ISME J. 2015;9(6):1352–64. pmid:25635642
View Article
PubMed/NCBI
Google Scholar

[73] View Article

[74] PubMed/NCBI

[75] Google Scholar

[ref23] 23. Krishna S, Peterson V, Listmann L, Hinners J. Modelling the interactive effects of viral presence and global warming on Baltic Sea ecosystem dynamics. Biogeosciences Discussions. 2023;:1–23.
View Article
Google Scholar

[77] View Article

[78] Google Scholar

[ref24] 24. Marantos A, Mitarai N, Sneppen K. From kill the winner to eliminate the winner in open phage-bacteria systems. PLoS Comput Biol. 2022;18(8):e1010400. pmid:35939510
View Article
PubMed/NCBI
Google Scholar

[80] View Article

[81] PubMed/NCBI

[82] Google Scholar

[ref25] 25. Beckett SJ, Demory D, Coenen AR, Casey JR, Dugenne M, Follett CL, et al. Disentangling top-down drivers of mortality underlying diel population dynamics of Prochlorococcus in the North Pacific Subtropical Gyre. Nat Commun. 2024;15(1):2105. pmid:38453897
View Article
PubMed/NCBI
Google Scholar

[84] View Article

[85] PubMed/NCBI

[86] Google Scholar

[ref26] 26. Talmy D, Beckett SJ, Taniguchi DAA, Brussaard CPD, Weitz JS, Follows MJ. An empirical model of carbon flow through marine viruses and microzooplankton grazers. Environ Microbiol. 2019;21(6):2171–81. pmid:30969467
View Article
PubMed/NCBI
Google Scholar

[88] View Article

[89] PubMed/NCBI

[90] Google Scholar

[ref27] 27. Hinson A, Papoulis S, Fiet L, Knight M, Cho P, Szeltner B, et al. A model of algal‐virus population dynamics reveals underlying controls on material transfer. Limnol Oceanogr. 2022;68(1):165–80.
View Article
Google Scholar

[92] View Article

[93] Google Scholar

[ref28] 28. Weitz J. Quantitative viral ecology: dynamics of viruses and their microbial hosts. Oxford: Princeton University Press; 2015.

[ref29] 29. Record NR, Talmy D, Våge S. Quantifying tradeoffs for marine viruses. Front Mar Sci. 2016;3.
View Article
Google Scholar

[96] View Article

[97] Google Scholar

[ref30] 30. Lambert S, Tragin M, Lozano J-C, Ghiglione J-F, Vaulot D, Bouget F-Y, et al. Rhythmicity of coastal marine picoeukaryotes, bacteria and archaea despite irregular environmental perturbations. ISME J. 2019;13(2):388–401. pmid:30254323
View Article
PubMed/NCBI
Google Scholar

[99] View Article

[100] PubMed/NCBI

[101] Google Scholar

[ref31] 31. Karl DM, Church MJ. Microbial oceanography and the Hawaii Ocean Time-series programme. Nat Rev Microbiol. 2014;12(10):699–713. pmid:25157695
View Article
PubMed/NCBI
Google Scholar

[103] View Article

[104] PubMed/NCBI

[105] Google Scholar

[ref32] 32. DeLong EF, Preston CM, Mincer T, Rich V, Hallam SJ, Frigaard NU. Community genomics among stratified microbial assemblages in the ocean’s interior. Science. 2006;311(5760):496–503.
View Article
Google Scholar

[107] View Article

[108] Google Scholar

[ref33] 33. Luo E, Eppley JM, Romano AE, Mende DR, DeLong EF. Double-stranded DNA virioplankton dynamics and reproductive strategies in the oligotrophic open ocean water column. ISME J. 2020;14(5):1304–15. pmid:32060418
View Article
PubMed/NCBI
Google Scholar

[110] View Article

[111] PubMed/NCBI

[112] Google Scholar

[ref34] 34. Chow C-ET, Kim DY, Sachdeva R, Caron DA, Fuhrman JA. Top-down controls on bacterial community structure: microbial network analysis of bacteria, T4-like viruses and protists. ISME J. 2014;8(4):816–29. pmid:24196323
View Article
PubMed/NCBI
Google Scholar

[114] View Article

[115] PubMed/NCBI

[116] Google Scholar

[ref35] 35. Needham DM, Chow C-ET, Cram JA, Sachdeva R, Parada A, Fuhrman JA. Short-term observations of marine bacterial and viral communities: patterns, connections and resilience. ISME J. 2013;7(7):1274–85. pmid:23446831
View Article
PubMed/NCBI
Google Scholar

[118] View Article

[119] PubMed/NCBI

[120] Google Scholar

[ref36] 36. Yeh Y-C, Fuhrman JA. Effects of phytoplankton, viral communities, and warming on free-living and particle-associated marine prokaryotic community structure. Nat Commun. 2022;13(1):7905. pmid:36550140
View Article
PubMed/NCBI
Google Scholar

[122] View Article

[123] PubMed/NCBI

[124] Google Scholar

[ref37] 37. Sullivan MB. Viromes, not gene markers, for studying double-stranded DNA virus communities. J Virol. 2015;89(5):2459–61. pmid:25540374
View Article
PubMed/NCBI
Google Scholar

[126] View Article

[127] PubMed/NCBI

[128] Google Scholar

[ref38] 38. Ignacio-Espinoza JC, Ahlgren NA, Fuhrman JA. Long-term stability and Red Queen-like strain dynamics in marine viruses. Nat Microbiol. 2020;5(2):265–71. pmid:31819214
View Article
PubMed/NCBI
Google Scholar

[130] View Article

[131] PubMed/NCBI

[132] Google Scholar

[ref39] 39. Boysen AK, Carlson LT, Durham BP, Groussman RD, Aylward FO, Ribalet F, et al. Particulate metabolites and transcripts reflect diel oscillations of microbial activity in the surface ocean. mSystems. 2021;6(3):e00896-20. pmid:33947808
View Article
PubMed/NCBI
Google Scholar

[134] View Article

[135] PubMed/NCBI

[136] Google Scholar

[ref40] 40. Ottesen EA, Young CR, Eppley JM, Ryan JP, Chavez FP, Scholin CA, et al. Pattern and synchrony of gene expression among sympatric marine microbial populations. Proc Natl Acad Sci U S A. 2013;110(6):E488-97. pmid:23345438
View Article
PubMed/NCBI
Google Scholar

[138] View Article

[139] PubMed/NCBI

[140] Google Scholar

[ref41] 41. Kolody BC, McCrow JP, Allen LZ, Aylward FO, Fontanez KM, Moustafa A, et al. Diel transcriptional response of a California current plankton microbiome to light, low iron, and enduring viral infection. ISME J. 2019;13(11):2817–33. pmid:31320727
View Article
PubMed/NCBI
Google Scholar

[142] View Article

[143] PubMed/NCBI

[144] Google Scholar

[ref42] 42. Aylward FO, Boeuf D, Mende DR, Wood-Charlson EM, Vislova A, Eppley JM, et al. Diel cycling and long-term persistence of viruses in the ocean’s euphotic zone. Proc Natl Acad Sci U S A. 2017;114(43):11446–51. pmid:29073070
View Article
PubMed/NCBI
Google Scholar

[146] View Article

[147] PubMed/NCBI

[148] Google Scholar

[ref43] 43. Hevroni G, Flores-Uribe J, Béjà O, Philosof A. Seasonal and diel patterns of abundance and activity of viruses in the Red Sea. Proc Natl Acad Sci U S A. 2020;117(47):29738–47. pmid:33172994
View Article
PubMed/NCBI
Google Scholar

[150] View Article

[151] PubMed/NCBI

[152] Google Scholar

[ref44] 44. Liu R, Liu Y, Chen Y, Zhan Y, Zeng Q. Cyanobacterial viruses exhibit diurnal rhythms during infection. Proc Natl Acad Sci U S A. 2019;116(28):14077–82. pmid:31235591
View Article
PubMed/NCBI
Google Scholar

[154] View Article

[155] PubMed/NCBI

[156] Google Scholar

[ref45] 45. Brum JR, Sullivan MB. Rising to the challenge: accelerated pace of discovery transforms marine virology. Nat Rev Microbiol. 2015;13(3):147–59. pmid:25639680
View Article
PubMed/NCBI
Google Scholar

[158] View Article

[159] PubMed/NCBI

[160] Google Scholar

[ref46] 46. Gregory AC, Solonenko SA, Ignacio-Espinoza JC, LaButti K, Copeland A, Sudek S, et al. Genomic differentiation among wild cyanophages despite widespread horizontal gene transfer. BMC Genomics. 2016;17(1):930. pmid:27852226
View Article
PubMed/NCBI
Google Scholar

[162] View Article

[163] PubMed/NCBI

[164] Google Scholar

[ref47] 47. Gregory AC, Zayed AA, Conceição-Neto N, Temperton B, Bolduc B, Alberti A. Marine DNA viral macro- and microdiversity from pole to pole. Cell. 2019;177(5):1109-1123.e14.
View Article
Google Scholar

[166] View Article

[167] Google Scholar

[ref48] 48. Roux S. A viral ecogenomics framework to uncover the secrets of nature’s “Microbe Whisperers”. mSystems. 2019;4(3):e00111-19. pmid:31120025
View Article
PubMed/NCBI
Google Scholar

[169] View Article

[170] PubMed/NCBI

[171] Google Scholar

[ref49] 49. Bates NR, Johnson RJ. Forty years of ocean acidification observations (1983–2023) in the Sargasso Sea at the Bermuda Atlantic Time-series Study site. Frontiers in Marine Science. 2023.
View Article
Google Scholar

[173] View Article

[174] Google Scholar

[ref50] 50. Parsons RJ, Breitbart M, Lomas MW, Carlson CA. Ocean time-series reveals recurring seasonal patterns of virioplankton dynamics in the northwestern Sargasso Sea. ISME J. 2012;6(2):273–84. pmid:21833038
View Article
PubMed/NCBI
Google Scholar

[176] View Article

[177] PubMed/NCBI

[178] Google Scholar

[ref51] 51. Angly FE, Felts B, Breitbart M, Salamon P, Edwards RA, Carlson C. The marine viromes of four oceanic regions. PLOS Biology. 2006;4(11):e368.
View Article
Google Scholar

[180] View Article

[181] Google Scholar

[ref52] 52. Warwick-Dugdale J, Tian F, Michelsen ML, Cronin DR, Moore K, Farbos A, et al. Long-read powered viral metagenomics in the oligotrophic Sargasso Sea. Nat Commun. 2024;15(1):4089. pmid:38744831
View Article
PubMed/NCBI
Google Scholar

[183] View Article

[184] PubMed/NCBI

[185] Google Scholar

[ref53] 53. John SG, Mendez CB, Deng L, Poulos B, Kauffman AKM, Kern S, et al. A simple and efficient method for concentration of ocean viruses by chemical flocculation. Environ Microbiol Rep. 2011;3(2):195–202. pmid:21572525
View Article
PubMed/NCBI
Google Scholar

[187] View Article

[188] PubMed/NCBI

[189] Google Scholar

[ref54] 54. Gilbert NE, Muratore D, Gochev CS, LeCleir GR, Cagle SM, Pound HL, et al. Seasonal enhancement of the viral shunt catalyzes a subsurface oxygen maximum in the Sargasso Sea. Nat Commun. 2025;17(1):352. pmid:41353448
View Article
PubMed/NCBI
Google Scholar

[191] View Article

[192] PubMed/NCBI

[193] Google Scholar

[ref55] 55. Guo J, Bolduc B, Zayed AA, Varsani A, Dominguez-Huerta G, Delmont TO, et al. VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA viruses. Microbiome. 2021;9(1):37. pmid:33522966
View Article
PubMed/NCBI
Google Scholar

[195] View Article

[196] PubMed/NCBI

[197] Google Scholar

[ref56] 56. Guo J, Vik D, Pratama AA, Roux S, Sullivan M. Viral sequence identification SOP with VirSorter2. 2021 July 19 [cited 2025 June 16]; Available from: https://www.protocols.io/view/viral-sequence-identification-sop-with-virsorter2-bwm5pc86

[ref57] 57. Roux S, Adriaenssens EM, Dutilh BE, Koonin EV, Kropinski AM, Krupovic M, et al. Minimum Information about an Uncultivated Virus Genome (MIUViG). Nat Biotechnol. 2019;37(1):29–37. pmid:30556814
View Article
PubMed/NCBI
Google Scholar

[200] View Article

[201] PubMed/NCBI

[202] Google Scholar

[ref58] 58. Camargo AP, Roux S, Schulz F, Babinski M, Xu Y, Hu B, et al. Identification of mobile genetic elements with geNomad. Nat Biotechnol. 2024;42(8):1303–12. pmid:37735266
View Article
PubMed/NCBI
Google Scholar

[204] View Article

[205] PubMed/NCBI

[206] Google Scholar

[ref59] 59. Hegarty B, Riddell V J, Bastien E, Langenfeld K, Lindback M, Saini JS, et al. Benchmarking informatics approaches for virus discovery: caution is needed when combining in silico identification methods. mSystems. 2024;9(3):e0110523. pmid:38376167
View Article
PubMed/NCBI
Google Scholar

[208] View Article

[209] PubMed/NCBI

[210] Google Scholar

[ref60] 60. Duhaime MB, Solonenko N, Roux S, Verberkmoes NC, Wichels A, Sullivan MB. Comparative omics and trait analyses of marine pseudoalteromonas phages advance the phage OTU concept. Front Microbiol. 2017;8:1241.
View Article
Google Scholar

[212] View Article

[213] Google Scholar

[ref61] 61. Cox JL, Wiebe PH, Ortner P, Boyd S. Seasonal development of subsurface chlorophyll maxima in slope water and northern Sargasso Sea of the northwestern Atlantic Ocean. Biol Oceanogr. 1982;1(3):271–85.
View Article
Google Scholar

[215] View Article

[216] Google Scholar

[ref62] 62. Gill JG, Hill-Spanik KM, Whittaker KA, Jones ML, Plante C. Sargasso Sea bacterioplankton community structure and drivers of variance as revealed by DNA metabarcoding analysis. PeerJ. 2022;10:e12835.
View Article
Google Scholar

[218] View Article

[219] Google Scholar

[ref63] 63. Michaels AF, Knap AH. Overview of the U.S. JGOFS Bermuda Atlantic Time-series Study and the Hydrostation S program. Deep Sea Res Part II Topic Stud Oceanogr. 1996;43(2–3):157–98.
View Article
Google Scholar

[221] View Article

[222] Google Scholar

[ref64] 64. Liu R, Liu Y, Chen Y, Zhan Y, Zeng Q. Cyanobacterial viruses exhibit diurnal rhythms during infection. Proc Natl Acad Sci U S A. 2019;116(28):14077–82.
View Article
Google Scholar

[224] View Article

[225] Google Scholar

[ref65] 65. Roux S, Camargo AP, Coutinho FH, Dabdoub SM, Dutilh BE, Nayfach S, et al. iPHoP: an integrated machine learning framework to maximize host prediction for metagenome-derived viruses of archaea and bacteria. PLoS Biol. 2023;21(4):e3002083. pmid:37083735
View Article
PubMed/NCBI
Google Scholar

[227] View Article

[228] PubMed/NCBI

[229] Google Scholar

[ref66] 66. Bolaños LM, Tait K, Somerfield PJ, Parsons RJ, Giovannoni SJ, Smyth T, et al. Influence of short and long term processes on SAR11 communities in open ocean and coastal systems. ISME Commun. 2022;2(1):116. pmid:37938786
View Article
PubMed/NCBI
Google Scholar

[231] View Article

[232] PubMed/NCBI

[233] Google Scholar

[ref67] 67. Giovannoni SJ. SAR11 bacteria: the most abundant plankton in the oceans. Ann Rev Mar Sci. 2017;9:231–55. pmid:27687974
View Article
PubMed/NCBI
Google Scholar

[235] View Article

[236] PubMed/NCBI

[237] Google Scholar

[ref68] 68. Malmstrom RR, Coe A, Kettler GC, Martiny AC, Frias-Lopez J, Zinser ER, et al. Temporal dynamics of Prochlorococcus ecotypes in the Atlantic and Pacific oceans. ISME J. 2010;4(10):1252–64. pmid:20463762
View Article
PubMed/NCBI
Google Scholar

[239] View Article

[240] PubMed/NCBI

[241] Google Scholar

[ref69] 69. Breitbart M, Bonnain C, Malki K, Sawaya NA. Phage puppet masters of the marine microbial realm. Nat Microbiol. 2018;3(7):754–66. pmid:29867096
View Article
PubMed/NCBI
Google Scholar

[243] View Article

[244] PubMed/NCBI

[245] Google Scholar

[ref70] 70. Tian F, Wainaina JM, Howard-Varona C, Domínguez-Huerta G, Bolduc B, Gazitúa MC. Prokaryotic-virus-encoded auxiliary metabolic genes throughout the global oceans. Microbiome. 2024;12:159.
View Article
Google Scholar

[247] View Article

[248] Google Scholar

[ref71] 71. Hawco NJ, McIlvin MM, Bundy RM, Tagliabue A, Goepfert TJ, Moran DM, et al. Minimal cobalt metabolism in the marine cyanobacterium Prochlorococcus. Proc Natl Acad Sci U S A. 2020;117(27):15740–7. pmid:32576688
View Article
PubMed/NCBI
Google Scholar

[250] View Article

[251] PubMed/NCBI

[252] Google Scholar

[ref72] 72. Saito MA, Moffett JW, Chisholm SW, Waterbury JB. Cobalt limitation and uptake in Prochlorococcus. Limnol Oceanogr. 2002;47(6):1629–36.
View Article
Google Scholar

[254] View Article

[255] Google Scholar

[ref73] 73. Wittmers F, Needham DM, Hehenberger E, Giovannoni SJ, Worden AZ. Genomes from uncultivated pelagiphages reveal multiple phylogenetic clades exhibiting extensive auxiliary metabolic genes and cross-family multigene transfers. mSystems. 2022;7(5):e01522-21.
View Article
Google Scholar

[257] View Article

[258] Google Scholar

[ref74] 74. Sullivan MB, Huang KH, Ignacio-Espinoza JC, Berlin AM, Kelly L, Weigele PR, et al. Genomic analysis of oceanic cyanobacterial myoviruses compared with T4-like myoviruses from diverse hosts and environments. Environ Microbiol. 2010;12(11):3035–56. pmid:20662890
View Article
PubMed/NCBI
Google Scholar

[260] View Article

[261] PubMed/NCBI

[262] Google Scholar

[ref75] 75. Goldsmith DB, Parsons RJ, Beyene D, Salamon P, Breitbart M. Deep sequencing of the viral phoH gene reveals temporal variation, depth-specific composition, and persistent dominance of the same viral phoH genes in the Sargasso Sea. PeerJ. 2015;3:e997.
View Article
Google Scholar

[264] View Article

[265] Google Scholar

[ref76] 76. Goldsmith DB, Crosti G, Dwivedi B, McDaniel LD, Varsani A, Suttle CA, et al. Development of phoH as a novel signature gene for assessing marine phage diversity. Appl Environ Microbiol. 2011;77(21):7730–9. pmid:21926220
View Article
PubMed/NCBI
Google Scholar

[267] View Article

[268] PubMed/NCBI

[269] Google Scholar

[ref77] 77. Xiang S, Li J, Chen Z, Cheng R, Wang L, Yu L, et al. Ecological diversity and metabolic strategies of widespread Marinisomatota in global oceans. Mar Life Sci Technol. 2025;7(3):523–36.
View Article
Google Scholar

[271] View Article

[272] Google Scholar

[ref78] 78. Prabhu A, Zaugg J, Chan CX, McIlroy SJ, Rinke C. Insights into phylogeny, diversity and functional potential of Poseidoniales viruses. Environ Microbiol. 2025;27(1):e70017. pmid:39777783
View Article
PubMed/NCBI
Google Scholar

[274] View Article

[275] PubMed/NCBI

[276] Google Scholar

[ref79] 79. Chen S, Tao J, Chen Y, Wang W, Fan L, Zhang C. Interactions between marine group II archaea and phytoplankton revealed by population correlations in the northern coast of South China Sea. Front Microbiol. 2022;12:785532.
View Article
Google Scholar

[278] View Article

[279] Google Scholar

[ref80] 80. Wu H-H, Pun MD, Wise CE, Streit BR, Mus F, Berim A, et al. The pathway for coenzyme M biosynthesis in bacteria. Proc Natl Acad Sci U S A. 2022;119(36):e2207190119. pmid:36037354
View Article
PubMed/NCBI
Google Scholar

[281] View Article

[282] PubMed/NCBI

[283] Google Scholar

[ref81] 81. Hall N, Wong WW, Lappan R, Ricci F, Glud RN, Kawaichi S. Aerotolerant methanogens use seaweed and seagrass metabolites to drive marine methane emissions. bioRxiv. 2024. 2024.10.14.618369. https://doi.org/10.1101/2024.10.14.618369

[ref82] 82. Kolomijeca A, Marx L, Reynolds S, Cariou T, Mawji E, Boulart C. An update on dissolved methane distribution in the subtropical North Atlantic Ocean. Ocean Sci. 2022;18(5):1377–88.
View Article
Google Scholar

[286] View Article

[287] Google Scholar

[ref83] 83. Vik D, Bolduc B, Roux S, Sun CL, Pratama AA, Krupovic M, et al. MArVD2: a machine learning enhanced tool to discriminate between archaeal and bacterial viruses in viral datasets. ISME Commun. 2023;3(1):87. pmid:37620369
View Article
PubMed/NCBI
Google Scholar

[289] View Article

[290] PubMed/NCBI

[291] Google Scholar

[ref84] 84. Muratore D, Boysen AK, Harke MJ, Becker KW, Casey JR, Coesel SN, et al. Complex marine microbial communities partition metabolism of scarce resources over the diel cycle. Nat Ecol Evol. 2022;6(2):218–29. pmid:35058612
View Article
PubMed/NCBI
Google Scholar

[293] View Article

[294] PubMed/NCBI

[295] Google Scholar

[ref85] 85. Thaben PF, Westermark PO. Detecting rhythms in time series with RAIN. J Biol Rhythms. 2014;29(6):391–400. pmid:25326247
View Article
PubMed/NCBI
Google Scholar

[297] View Article

[298] PubMed/NCBI

[299] Google Scholar

[ref86] 86. Mrudulakumari Vasudevan U, Lee EY. Flavonoids, terpenoids, and polyketide antibiotics: Role of glycosylation and biocatalytic tactics in engineering glycosylation. Biotechnol Adv. 2020;41:107550. pmid:32360984
View Article
PubMed/NCBI
Google Scholar

[301] View Article

[302] PubMed/NCBI

[303] Google Scholar

[ref87] 87. Mruwat N, Carlson MCG, Goldin S, Ribalet F, Kirzner S, Hulata Y, et al. A single-cell polony method reveals low levels of infected Prochlorococcus in oligotrophic waters despite high cyanophage abundances. ISME J. 2021;15(1):41–54. pmid:32918065
View Article
PubMed/NCBI
Google Scholar

[305] View Article

[306] PubMed/NCBI

[307] Google Scholar

[ref88] 88. Zablocki O, Michelsen M, Burris M, Solonenko N, Warwick-Dugdale J, Ghosh R, et al. VirION2: a short- and long-read sequencing and informatics workflow to study the genomic diversity of viruses in nature. PeerJ. 2021;9:e11088. pmid:33850654
View Article
PubMed/NCBI
Google Scholar

[309] View Article

[310] PubMed/NCBI

[311] Google Scholar

[ref89] 89. Martinez-Hernandez F, Fornas O, Lluesma Gomez M, Bolduc B, de la Cruz Peña MJ, Martínez JM, et al. Single-virus genomics reveals hidden cosmopolitan and abundant viruses. Nat Commun. 2017;8:15892. pmid:28643787
View Article
PubMed/NCBI
Google Scholar

[313] View Article

[314] PubMed/NCBI

[315] Google Scholar

[ref90] 90. Jang HB, Chittick L, Li Y-F, Zablocki O, Sanderson CM, Carrillo A, et al. Viral tag and grow: a scalable approach to capture and characterize infectious virus-host pairs. ISME Commun. 2022;2(1):12. pmid:37938680
View Article
PubMed/NCBI
Google Scholar

[317] View Article

[318] PubMed/NCBI

[319] Google Scholar

[ref91] 91. Shatadru RN, Solonenko NE, Sun CL, Sullivan MB. Synthetic community Hi-C benchmarking provides a baseline for virus-host inferences. bioRxiv. 2025. 2025.02.12.637985. https://www.biorxiv.org/content/10.1101/2025.02.12.637985v2

[ref92] 92. Trubl G, Roux S, Kellom M, Vyshenska D, Tomatsu A, Singh K. Tracking persistence and dynamics of active soil viruses with SIP-Viromics. bioRxiv. 2025. 2025.05.25.655894. https://www.biorxiv.org/content/10.1101/2025.05.25.655894v1

[ref93] 93. Guo J, Aroney S, Dominguez-Huerta G, Smith D, Vik D, Ansah CO. Mobile genetic elements that shape microbial diversity and functions in thawing permafrost soils. bioRxiv. 2025. 2025.02.12.637893. https://www.biorxiv.org/content/10.1101/2025.02.12.637893v1

[ref94] 94. Régimbeau A, Tian F, Smith G, Riddell J, Andreani C, Bordron P. Planetary-scale marine community modeling predicts metabolic synergy and viral impacts. bioRxiv. 2025. 2025.02.13.638167. https://www.biorxiv.org/content/10.1101/2025.02.13.638167v1

[ref95] 95. Brum JR, Hurwitz BL, Schofield O, Ducklow HW, Sullivan MB. Seasonal time bombs: dominant temperate viruses affect Southern Ocean microbial dynamics. ISME J. 2016;10(2):437–49. pmid:26296067
View Article
PubMed/NCBI
Google Scholar

[325] View Article

[326] PubMed/NCBI

[327] Google Scholar

[ref96] 96. D’Alelio D, Rampone S, Cusano LM, Morfino V, Russo L, Sanseverino N, et al. Machine learning identifies a strong association between warming and reduced primary productivity in an oligotrophic ocean gyre. Sci Rep. 2020;10(1):3287. pmid:32098970
View Article
PubMed/NCBI
Google Scholar

[329] View Article

[330] PubMed/NCBI

[331] Google Scholar

[ref97] 97. Corno G, Karl DM, Church MJ, Letelier RM, Lukas R, Bidigare RR, et al. Impact of climate forcing on ecosystem processes in the North Pacific Subtropical Gyre. J Geophys Res. 2007;112(C4).
View Article
Google Scholar

[333] View Article

[334] Google Scholar

[ref98] 98. Henn MR, Sullivan MB, Stange-Thomann N, Osburne MS, Berlin AM, Kelly L, et al. Analysis of high-throughput sequencing and annotation strategies for phage genomes. PLoS One. 2010;5(2):e9083. pmid:20140207
View Article
PubMed/NCBI
Google Scholar

[336] View Article

[337] PubMed/NCBI

[338] Google Scholar

[ref99] 99. Ohio Supercomputer Center [Internet]. 2014 [cited 2024 Sept 10]. Available from: https://www.osc.edu/resources/getting_started/citation

[ref100] 100. Li D, Liu C-M, Luo R, Sadakane K, Lam T-W. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics. 2015;31(10):1674–6. pmid:25609793
View Article
PubMed/NCBI
Google Scholar

[341] View Article

[342] PubMed/NCBI

[343] Google Scholar

[ref101] 101. Nayfach S, Camargo AP, Schulz F, Eloe-Fadrosh E, Roux S, Kyrpides NC. CheckV assesses the quality and completeness of metagenome-assembled viral genomes. Nat Biotechnol. 2021;39(5):578–85. pmid:33349699
View Article
PubMed/NCBI
Google Scholar

[345] View Article

[346] PubMed/NCBI

[347] Google Scholar

[ref102] 102. Simpson EH. Measurement of diversity. Nature. 1949;163(4148):688.
View Article
Google Scholar

[349] View Article

[350] Google Scholar

[ref103] 103. Frey BJ, Dueck D. Clustering by passing messages between data points. Science. 2007;315(5814):972–6. pmid:17218491
View Article
PubMed/NCBI
Google Scholar

[352] View Article

[353] PubMed/NCBI

[354] Google Scholar

[ref104] 104. Gower JC. Some distance properties of latent root and vector methods used in multivariate analysis. Biometrika. 1966;53(3–4):325–38.
View Article
Google Scholar

[356] View Article

[357] Google Scholar

[ref105] 105. Field J, Clarke K, Warwick R. A practical strategy for analysing multispecies distribution patterns. Mar Ecol Prog Ser. 1982;8:37–52.
View Article
Google Scholar

[359] View Article

[360] Google Scholar

[ref106] 106. Oksanen J, Simpson GL, Blanchet FG, Kindt R, Legendre P, Minchin PR, et al. vegan: community ecology package [Internet]. 2001 [cited 2024 Sept 10]. p. 2.6–8. Available from: https://CRAN.R-project.org/package=vegan

[ref107] 107. Mielke PW Jr, Berry KJ, Johnson ES. Multi-response permutation procedures for a priori classifications. Commun Stat Theory Methods. 1976;5(14):1409–24.
View Article
Google Scholar

[363] View Article

[364] Google Scholar

[ref108] 108. Clarke K, Ainsworth M. A method of linking multivariate community structure to environmental variables. Mar Ecol Prog Ser. 1993;92:205–19.
View Article
Google Scholar

[366] View Article

[367] Google Scholar

[ref109] 109. Maechler M, Rousseeuw PJ. Finding groups in data: cluster analysis extended. 2025. [cited 2025 Apr 19]. Available from: https://cran.r-project.org/web/packages/cluster/index.html

[ref110] 110. Wehrens R, Buydens LMC. Self- and super-organizing maps in R: the kohonen package. J Stat Softw. 2007;21:1–19.
View Article
Google Scholar

[370] View Article

[371] Google Scholar

[ref111] 111. Hyatt D, Chen G-L, Locascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11:119. pmid:20211023
View Article
PubMed/NCBI
Google Scholar

[373] View Article

[374] PubMed/NCBI

[375] Google Scholar

[ref112] 112. Bolduc B. Applying vContact to viral sequences and visualizing the output (Cyverse). 2019 Feb 14 [cited 2024 Sept 10]; Available from: https://www.protocols.io/view/applying-vcontact-to-viral-sequences-and-visualizi-x5xfq7n

[ref113] 113. Shaffer M, Borton MA, McGivern BB, Zayed AA, La Rosa SL, Solden LM, et al. DRAM for distilling microbial metabolism to automate the curation of microbiome function. Nucleic Acids Res. 2020;48(16):8883–900. pmid:32766782
View Article
PubMed/NCBI
Google Scholar

[378] View Article

[379] PubMed/NCBI

[380] Google Scholar

[ref114] 114. Vik D, Gazitúa MC, Sun CL, Zayed AA, Aldunate M, Mulholland MR, et al. Genome-resolved viral ecology in a marine oxygen minimum zone. Environ Microbiol. 2021;23(6):2858–74. pmid:33185964
View Article
PubMed/NCBI
Google Scholar

[382] View Article

[383] PubMed/NCBI

[384] Google Scholar

[ref115] 115. Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25(5):955–64. pmid:9023104
View Article
PubMed/NCBI
Google Scholar

[386] View Article

[387] PubMed/NCBI

[388] Google Scholar

[ref116] 116. Rice P, Longden I, Bleasby A. EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000;16(6):276–7.
View Article
Google Scholar

[390] View Article

[391] Google Scholar

[ref117] 117. Eren AM, Kiefl E, Shaiber A, Veseli I, Miller SE, Schechter MS, et al. Community-led, integrated, reproducible multi-omics with anvi’o. Nat Microbiol. 2021;6(1):3–6. pmid:33349678
View Article
PubMed/NCBI
Google Scholar

[393] View Article

[394] PubMed/NCBI

[395] Google Scholar

[ref118] 118. Darzi Y, Letunic I, Bork P, Yamada T. iPath3.0: interactive pathways explorer v3. Nucleic Acids Research. 2018;46(W1):W510-3.
View Article
Google Scholar

[397] View Article

[398] Google Scholar

[ref119] 119. Chaumeil PA, Mussig AJ, Hugenholtz P, Parks DH. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics. 2019;36(6):1925–7.
View Article
Google Scholar

[400] View Article

[401] Google Scholar

Figures

Abstract

Introduction

Results

Assembly of a high quality and detailed viral reference database at BATS

Community level diversity significantly varies with depth, but not through 112-hour time-course

Viral populations and potential metabolic gene repertoire vary across depth

Diel periodicity analyses reveal population-level viral population dynamics

Unsupervised learning identifies day- versus night-peaking patterns in diel signals

Discussion

Methods

Sample collection

Library preparation and sequencing

Quality check, trimming, and assembly

Viral identification

Read mapping

Diversity calculations and statistical analyses

Temporal analysis with the detection of rhythmic behavior in time series

Calculation of peak rank time

Clustering analysis

Taxonomic classification

Functional annotation and auxiliary metabolic genes (AMGs)

Identifying hosts using iPHoP with the standard database

Identifying hosts using iPHoP with a custom database

Sample collection from which MAGs were identified

Supporting information

S1 Fig. VirSorter2 and Genomad comparison.

S2 Fig. Affinity propagation of depth clusters through viral relative abundance.

S3 Fig. Principal Coordinates Analysis (PCoA) of viral community between SUR and DCM samples.

S4 Fig. Alpha diversity compared against Global Ocean Virome 2 (GOV2) dataset ecological zones.

S5 Fig. Viral Families of SUR diel and non-diel viral populations.

S6 Fig. Principal Coordinates Analysis (PCoA) of beta-dispersion among three viral archetypes.

S1 Table. Metadata for BATS 2019 Cruise.

S2 Table. Raw and clean read numbers for samples.

S3 Table. Sample data across viral identification pipeline.

S4 Table. BATS MAGs GTDB-tk summary file.

S5 Table. Conservative filtered AMGs annotations.

S6 Table. Diel AMG annotations.

S7 Table. Non-Diel AMG annotations.

S8 Table. Non-Diel AMG modules.

S9 Table. Diel AMG modules.

S10 Table. Host predictions with associated AMGs for depth groups.

S11 Table. Host predictions with associated AMGs for diel groups.

S12 Table. Diel viral populations.

S13 Table. Archetype information.

S1 Data. Table of depth profile of chlorophyll and temperature down to 1,000 m depth for 2019 BATS cruise.

S2 Data. Table of viral taxonomy from vConTACT3 using viral populations that are 10 kb or larger.

Acknowledgments

References