The Aggregated Gut Viral Catalogue (AVrC): A unified resource for exploring the viral diversity of the human gut
Fig 4
Aggregated Viral Catalogue (AVrC) structure and interface AVrC database schematical structure.
The AVrC database contains a fasta sequence catalogue containing the viral sequences in a Fasta format. The annotations of the sequences are grouped in three types of tables: [1] the raw output of each annotation tools, [2] the merged and harmonized annotations recapitulating the information concerning the sequence’s quality, taxonomy lifestyles and the predicted host information, and [3] a summary table containing the merged information for the vOTU representative sequences. The database is made available as csv files and a relational sql database in Zenodo (https://doi.org/10.5281/zenodo.11426064) This summary table is searchable through the AVrC toolkit, allowing users to select and search and select subsets of the dataset (https://github.com/aponsero/AVrC_toolkit).