Standing on the Shoulders of Giant Viruses: Five Lessons Learned about Large Viruses Infecting Small Eukaryotes and the Opportunities They Create


Introduction
Viruses are generally considered to be among the smallest bioactive particles. Dating back to the original observations, including those of luminaries such as Ivanovsky and Beijerinck, size has been central to the definition of a virus, a tradition that continued for many years [1]. It was thus a surprise to the scientific community in the early 2000s when French scientists demonstrated that a particle previously thought to be a bacterium was in fact a virus [2]. The discovery of Mimivirus and the other "giants" that have followed, including Mamavirus, Pandoravirus, Faustovirus, and Mollivirus, has blurred the definition of what constitutes a virus and, indeed, the boundaries between viral particles and cellular life [3].
What Are "Giant Viruses"?
In general, the term "giant virus" is now commonly used to refer to viruses that have a large genome (>200,000 base pairs) and/or a large particle size (>0.2 μm). While a variety of arguments can be made for altering these metrics, what is clear is that these viruses bring with them a genetic potential (in terms of genes that are transcribed and translated) that is historically associated with cellular life forms. This includes members of the Mimiviridae that infect amoebas, as well as the "extended" phylogenetic group that infects algae [4]. These viruses fall into the nucleocytoplasmic large DNA virus (NCLDV) group, which also includes the Phycodnaviridae, Iridoviridae, Poxviridae, Marseilleviridae, Asfarviridae, and Ascoviridae. To date, these viruses have been shown to infect organisms including algae and protists (although members of the Poxviridae, Asfarviridae, and Iridoviridae infect humans and animals). And while their size is constantly surprising, it is their novel genetic collection that is of interest to many researchers. A research opportunity is to question why these particles need to carry so many different genetic blueprints (i.e., genes), and how these extra costs provide benefits with respect to viral fitness and selection in a complex microbial and viral world.
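The working definition above can be written as a simple predicate. The following is a minimal sketch, assuming only the two thresholds quoted in the text; the function name, parameter names, and the example values are hypothetical and are not drawn from any published tool or data set.

```python
from typing import Optional

# Thresholds quoted in the text; the constant names are illustrative.
GENOME_BP_THRESHOLD = 200_000   # genome size, base pairs
PARTICLE_UM_THRESHOLD = 0.2     # particle size, micrometers


def is_giant_virus(genome_bp: Optional[int] = None,
                   particle_um: Optional[float] = None) -> bool:
    """Return True if either known measurement exceeds its threshold.

    The definition is "and/or", so one qualifying measurement is enough.
    A measurement that is unknown (None) never qualifies on its own.
    """
    large_genome = genome_bp is not None and genome_bp > GENOME_BP_THRESHOLD
    large_particle = (particle_um is not None
                      and particle_um > PARTICLE_UM_THRESHOLD)
    return large_genome or large_particle


# Hypothetical measurements, for illustration only.
print(is_giant_virus(genome_bp=350_000))                  # genome alone qualifies
print(is_giant_virus(particle_um=0.39))                   # particle alone qualifies
print(is_giant_virus(genome_bp=50_000, particle_um=0.1))  # neither qualifies
```

Because the comparison is strict, a genome of exactly 200,000 base pairs does not qualify in this sketch; the text notes that arguments can be made for altering these metrics.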
Have We Really Only Known about Giant Viruses for Little More Than a Decade?
As noted above, the study of giant viruses emerged with the confluence of observation (the ability to see Mimivirus in a light microscope and the realization that it was a virus and not a bacterium) and opportunity (the emergence of modern techniques in molecular biology that have rapidly advanced the ability to study these viruses). In retrospect, scientists have observed many other large viruses over the years. Several algal viruses such as Ectocarpus siliculosus virus (EsV-1) [5] and Micromonas pusilla virus (MpV) [6] were observed decades ago and have been studied sporadically since. Others, highlighted by Emiliania huxleyi virus (EhV) [7] and perhaps the best-studied of the large virus systems, the Chlorella-infecting virus group [8], have been characterized in recent years in studies ranging from the biochemical to the ecological. But a survey of the literature reveals many more opportunities for research with completely new giant virus-host systems. From the early 1970s through the 1990s, more broadly available imaging tools allowed researchers to observe giant virus-like particles in hosts that have yet to be deeply studied, including 240- and 390-nm particles found in the chlorophytes Oedogonium spp. "L" [9] and Uronema gigas [10], respectively, as well as a 385-nm virus particle in the dinoflagellate Gymnodinium uberrimum [11]. Indeed, an opportunity exists for researchers to use this information to identify new virus-host models for laboratory study. As is clear from the foundational work on Chloroviruses and Mimiviruses, there are rich prospects to advance science through the thorough isolation and study of new virus-host systems.

How Broadly Are Giant Viruses Distributed in Nature?
While the discovery of Mimivirus has spurred extensive research into the origins and capabilities of giant viruses, little is currently known about their global distribution and diversity. Giant viruses have been isolated largely from aquatic samples, often using Acanthamoeba spp. to enrich for virus populations. This technique has led to the discovery of novel giant viruses in marine and freshwater samples and, unexpectedly, ancient amoeba-infecting viruses that have persisted in ~30,000-year-old Siberian permafrost [12]. Giant viruses have even recently been isolated from humans and may be linked to various disease/disorder states [13]. That the discovery of so many new giant viruses continues to surprise suggests that classical methods need to be reconsidered for their study. One tool that may help to address the question of giant virus diversity is the analysis of molecular sequencing data. Researchers have already found markers for giant viruses and, using shotgun and targeted metagenomes, have shown these particles to be distributed across a broad spectrum of environments (e.g., [14]). Indeed, publicly available metagenomic and metatranscriptomic sequences from environmental samples may serve the same purpose as the archived images of virus infection processes. An emergent opportunity is that existing knowledge of giant virus genomic sequences, such as the conserved major capsid protein (MCP) (Fig 1), can be used to probe the wealth of available sequencing data. In addition, transcriptomic (i.e., RNA sequencing) data for these giant viruses may provide information on active infections, again providing opportunities to identify specific virus-host relationships in future work. As we move forward in this arena, a second opportunity also emerges: the development of in situ approaches to study giant virus impacts on ecosystem-scale ecology.
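As an illustration of probing sequencing data with the MCP marker, the sketch below applies the selection rule stated in the Fig 1 caption: keep contigs longer than 300 bp whose best database hit is an NCLDV. It is only a hedged outline of that one filtering step. The `ContigHit` record, the `select_mcp_candidates` function, and the example contigs are hypothetical. The family list is taken from the NCLDV families named in this article, and the sketch assumes that the assembly and similarity search have already been run with other tools.

```python
from dataclasses import dataclass
from typing import List

# NCLDV families named in this article (assumed to be the taxonomic
# labels attached to best hits; real search output may label hits differently).
NCLDV_FAMILIES = {
    "Mimiviridae", "Phycodnaviridae", "Iridoviridae", "Poxviridae",
    "Marseilleviridae", "Asfarviridae", "Ascoviridae",
}
MIN_CONTIG_BP = 300  # caption: MCP contigs >300 bp were retained


@dataclass
class ContigHit:
    """Hypothetical summary of one assembled contig and its best hit."""
    contig_id: str
    length_bp: int
    best_hit_family: str


def select_mcp_candidates(hits: List[ContigHit]) -> List[ContigHit]:
    """Keep contigs longer than MIN_CONTIG_BP whose best hit is an NCLDV."""
    return [
        h for h in hits
        if h.length_bp > MIN_CONTIG_BP and h.best_hit_family in NCLDV_FAMILIES
    ]


# Made-up example records, for illustration only.
example_hits = [
    ContigHit("contig_1", 450, "Mimiviridae"),
    ContigHit("contig_2", 250, "Phycodnaviridae"),  # too short
    ContigHit("contig_3", 900, "Siphoviridae"),     # best hit is not an NCLDV
]
for hit in select_mcp_candidates(example_hits):
    print(hit.contig_id, hit.length_bp, hit.best_hit_family)
```

Contigs that pass a filter of this kind would then be aligned against a reference MCP database and placed on a phylogenetic tree, as described for Fig 1; those steps are outside the scope of this sketch.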

What Can Giant Viruses Do That Makes Them Special?
Giant viruses exhibit diverse morphologies, lifestyles, and even genomic structure, but there are some shared features that set them apart from other systems. Most obvious is the "nucleocytoplasmic" distinction of replicative strategies: for example, in some of these giant viruses, a cytoplasmic, organelle-like virion factory quickly forms in the infected host as the site for virion morphogenesis, a feature previously witnessed only in RNA viruses [15]. This organization is thought to optimize control of intracellular resources and may be particularly important where viral infection has been shown to alter metabolic processes. Another hallmark of giant viruses is the aforementioned gene content previously observed only in cellular organisms. Indeed, this includes (but is not limited to) central components of protein translation, parts of DNA repair pathways, polysaccharide synthesis enzymes, genes containing inteins, and, more recently, evidence for a genetic system that may offer protection against virion factory-infecting virophage [16].

Fig 1. Phylogenetic reconstruction of NCLDV major capsid protein sequences from environmental metatranscriptomes generated from an alkaline soil sample (NCBI ID: SRP043976), the Amazon River and River Delta (SRP037995, SRP039544 [21]), the North Pacific Ocean (SRP052554 [22]), Station ALOHA in the tropical Pacific Ocean (CAM_SMPL_000824 at iMicrobe.us [23]), and the North Sea (ERP004582 [24]). Public metatranscriptomes were assembled and searched for NCLDV-like major capsid protein coding transcripts. MCP contigs >300 bp with best hits to NCLDVs were aligned with an MCP reference database and placed on a maximum likelihood tree with a Shimodaira-Hasegawa-like approximate likelihood ratio test branch validation using pplacer (http://matsen.fhcrc.org/pplacer/). The broad spectrum of samples demonstrates that active giant virus infections can be observed in many different environments.
These features indicate that giant viruses, unlike smaller lytic phages, encode much more than the blueprints for generating new viruses. Interestingly, the location of these elements on the viral genome appears to influence gene conservation. For example, after being subcultured for several generations in a germ-free amoebal host, Mimivirus experienced a 17% genome reduction, with most gene losses occurring at the ends of the genome [17]. Aureococcus anophagefferens virus (AaV) may also use this mechanism, as recent studies indicate that horizontally acquired genes often occur in the terminal regions [4]. Indeed, an emergent opportunity is to characterize the mechanics of how these viruses gain and lose genetic information: for example, it would be interesting to see whether the "genetic accordion" theory [18] is at play in other large viruses, and whether the mechanisms involved can result not just in duplications but also in the horizontal transfer of material from hosts to viruses.

How Do Studies of Giant Viruses Shape Scientific Knowledge in a Larger Context?
One thing that has become clear since the discovery of the Mimivirus (and reinforced by the subsequent isolation of Pandoravirus, Pithovirus, and others) is that the "rules" are changing. In recent months, we have seen a series of researchers working to redefine "What is a virus?" [19] and even "What is life?" [20]. Apparent, but not always at the forefront of this debate, is how these viruses have changed life as we understand it. The point here is not to reopen the debate on what life is, but to ask how life has evolved and continues to evolve in the presence of these viruses. Within the genomes of these so-called giant viruses are genes that have been attributed to a variety of lineages: indeed, in just one example (AaV), we find a collection of genes that are phylogenetically most closely related to other giant viruses, to the host, to other picoeukaryotes, to bacteria and archaea, and even to phage [4]. The presence of these viruses and their often unique genomic architecture suggest that the horizontal transfer of genes between viruses and hosts, as well as from organisms consumed by hosts (e.g., during phagotrophy) to viruses, is likely rampant. These observations and the availability of a growing number of virus-host systems create an opportunity to study these dynamics in the laboratory. Just as researchers have now established long-term evolution experiments with microbes, there is an opportunity to follow the long-term evolution of these virus-host systems.