Characterisation of the canine faecal virome in healthy dogs and dogs with acute diarrhoea using shotgun metagenomics

The virome has been increasingly investigated in numerous animal species and in different sites of the body, facilitating the identification and discovery of a variety of viruses. In spite of this, the faecal virome of healthy dogs has not been investigated. In this study we describe the faecal virome of healthy dogs and dogs with acute diarrhoea in Australia, using a shotgun metagenomic approach. Viral sequences from a range of different virus families, including both RNA and DNA families, and known pathogens implicated in enteric disease were documented. Twelve viral families were identified, of which four were bacteriophages. Eight eukaryotic viral families were detected: Astroviridae, Coronaviridae, Reoviridae, Picornaviridae, Caliciviridae, Parvoviridae, Adenoviridae and Papillomaviridae. Families Astroviridae, Picornaviridae and Caliciviridae were found only in dogs with acute diarrhoea, with Astroviridae being the most common family identified in this group. Due to its prevalence, characterisation the complete genome of a canine astrovirus was performed. These studies indicate that metagenomic analyses are useful for the investigation of viral populations in the faeces of dogs. Further studies to elucidate the epidemiological and biological relevance of these findings are warranted.


Introduction
Interest in the virome, or the entire population of viruses present in a biological sample, has increased recently due to improved availability of high throughput sequencing or next generation sequencing (NGS) technologies, and improved metagenomic analytical methods [1,2]. a1111111111 a1111111111 a1111111111 a1111111111 a1111111111 The virome comprises all types of viruses, including those that infect prokaryotic and eukaryotic organisms, DNA or RNA viruses, and viruses that cause acute or chronic infections. Many of these viruses are difficult or impossible to propagate in cell culture, and molecular detection is difficult as no common gene such as the ribosomal 16S gene that is present in bacterial species exists in viruses. These limitations have hindered the identification and characterisation of uncultured viruses [3,4]. Recently, due to the advent of molecular enrichment protocols, high throughput sequencing and new metagenomic analytical methods we are now able to explore, identify and characterise viruses from different biological and environmental samples with a greater capacity [2,[5][6][7][8][9][10][11] In studies of human faeces, the virome has been shown to include viruses that infect eukaryotic organisms and viruses that infect prokaryotes (bacteriophages) [2,5,[12][13][14][15][16][17][18]. Bacteriophages have been reported in many studies to be the most frequently detected viral constituent in the gut of humans [1,2,5,8,16,19,20]. The faecal virome has been characterised for several animal species including pigs, bats, cats, pigeons, horses and ferrets [2,6,7,[9][10][11][21][22][23][24][25][26][27][28][29][30][31]. In dogs, the presence of enteric viral pathogens such as canine parvovirus, coronavirus, rotavirus and distemper virus (Paramyxoviridae) have been identified only through targeted studies [32][33][34][35]. To date, only one published study has used high throughput sequencing to investigate the faecal viral population in diarrhoeic dogs [6]. These investigators analysed faeces from dogs with acute diarrhoea and detected two new virus species, canine sapovirus and canine kobuvirus; known canine enteric viruses such as canine coronavirus, canine parvovirus, canine rotavirus as well as plant and insect viruses were also reported [6].
The aim of this study was to describe the faecal virome of samples collected from healthy dogs, and compare these findings to the faecal virome of dogs with acute diarrhoea in Australia, using an Illumina MiSeq shotgun metagenomic sequencing approach.

Overview of the canine faecal virome
A total of 16 faecal samples (8 from healthy and 8 from diarrhoeic dogs) were subjected to viral nucleic acid extraction, followed by nucleic acid enrichment, reverse transcription, random amplification and the creation of two libraries for each sample (DNA and cDNA), before being sequenced by Illumina MiSeq platform (Table 1). After sequencing, a total of 93,744,624 raw sequences were generated. All raw sequences are available in NCBI, (Bioproject ID: PRJNA380672). After trimming by quality 80,414,313 high quality reads (HQRs) were available. All sequences corresponding to dog and cellular organisms (383,785 and 27,825,631 respectively) were removed and the resultant reads were de novo assembled (Fig 1) generating in total 1,672,615 contigs and singletons (reads). From these contigs/singletons 1,285,171 (76.8%) had no hits in the database (S1 Table). Further analysis of contigs/singletons with no hits confirmed most sequences had no hits, while a limited number matched bacterial, human or animal sequences with a very low coverage. In addition to the contigs/singletons with no hit in the database, some contigs/singletons matching to cellular organisms and some with low complexity were identified, however, were not analysed any further (S1 Fig).
Sequences similar to twelve viral families were identified in faecal samples from healthy and diarrhoeic dogs after analyses with two different bioinformatic pipelines and comparison against viral and NCBI databases (Fig 1). Eight of these viral families infect eukaryotic organisms, and the remaining four infect prokaryotes.
Despite the known bias of SISPA in the resultant sequences after de novo assembly [36], we report the number of contigs/singletons matching viral families and the subsequent analysis of alignments with the lowest common ancestor according to MEGAN V5.2.1 [37].

Virome of healthy dogs
Faecal samples were collected from eight healthy dogs (Table 1). Genetic analyses identified 659,696 contigs/singletons with no hits and 3968 contigs/singletons were classified as viral, matching to five viral families that infect eukaryotes and four that infect prokaryotes. 75.9% (3012 contigs/singletons) of the total number of viral contigs/singletons were classified as bacteriophages in the healthy canine faecal virome. Bacteriophages were detected in the faeces of all dogs in this group and belonged to Caudovirales order and Microviridae family. Viral contigs/singletons from five eukaryotic virus families were identified in faecal samples from 4 of the 8 healthy dogs (Table 1). Three out of five viral families detected were DNA viruses. Adenoviridae and Papillomaviridae were detected in a single sample containing only  (Table 1), however, these were all detected in one sample. After analysis, only 76 contigs/singletons matched the reference sequence of Alphacoronavirus 1 (Feline infectious peritonitis virus, NC_002306.3) and covered only 0.5% of the complete genome (minimum match: 75% and minimum overlap: 50), which represented 3.2% of FIPV_gp02 (receptor binding molecule) region.
Contigs/singletons belonging to the Reoviridae family were found in only one sample. Genetic analysis revealed they covered between 11%-35.5% of VP1, VP2, VP3 and VP4 genes of reference sequences of Rotavirus A (NC_011506-NC_011510).
Another eukaryotic viral family found in one healthy dog sample was Parvoviridae, genetic analysis of the 3 contigs/singletons showed a coverage of approximately 3.5% of the complete genome of canine parvovirus reference sequence (NC_001539), or 9.3% of the polyprotetin Ns1-Ns2.

Virome of dogs with acute diarrhoea
In eight faecal samples from dogs with acute diarrhoea, a total of 625,475 contigs/singletons had no hits and 17,242 were identified as viral contigs/singletons comprising 6 eukaryotic and 4 prokaryotic viral families (Table 1). Bacteriophages comprised 98.19% of the total of viral contigs/singletons and they were present in all individuals and were identified as belonging to order Caudovirales and Microviridae family. Eukaryotic families found in this group were Coronaviridae, Parvoviridae, Reoviridae, Caliciviridae, Astroviridae, and Picornaviridae ( Table 1). The most common eukaryotic viruses identified were RNA viruses (5/6 viral families). Interestingly, all 8 samples in this group contained at least one eukaryotic family each. Co-infection was identified in 6 individual dog samples from this group. From the 8 samples from dogs with acute diarrhoea, 2 different eukaryotic virus families were detected in five samples and 3 eukaryotic families were detected in one sample ( Table 1). The most prevalent family identified was Astroviridae, present in 5 dogs followed by Reoviridae present in 4 of 8 dogs with acute diarrhoea (Table 1).
Astroviridae contigs/singletons from 5 dogs were compared with reference sequence of canine astrovirus (NC_026814.1), the lowest common ancestor according to MEGAN, and they covered between 2.4% and 5% of compete genome and between 6.3% and 12.9% of the ORF2. The sample with the most contigs/singletons was later characterised.
Reoviridae contigs/singletons found in 4 dogs were compared with reference sequence of Rotavirus A (NC_011503.2) and 3 of them covered between 10.5% and 23.4% of the VP4 gene and the other sample covered 23.6% of the VP7 gene.
Furthermore, contigs/singletons matching to canine parvovirus, were found in 2 dog samples with acute diarrhoea. One of the samples covered approximately 5.8% of the complete genome of canine parvovirus reference sequence (NC_001539), corresponding to 17.6% of VP2. The other sample contained 41 contigs/singletons that matched to this same reference sequence and in total they cover 100% of VP1, 66.2% of polyprotein NS1 and NS2 (CPVgp1) and 87.9% of VP2 genes.
In this group were also found contigs/singletons matching to the Coronaviridae family, covering between 0.6% and 1.7% of the complete genome of reference sequence Alphacoronavirus 1 (FIPV, NC_002306.3).
One dog with acute diarrhoea contained contigs/singletons similar to a canine norovirus (JF930689.1), covering approximately 7.9% of the complete genome. Other dog sample had contigs/singletons similar to a canine kobuvirus (JN387133.1), covering 2.2% of complete genome.

Canine astrovirus characterisation
To further explore the high abundance of contigs/singletons from Astroviridae family in dogs with acute diarrhoea and their absence in healthy dogs, a near complete full genome of a representative canine astrovirus was generated through Sanger sequencing (DD1, Table 1). The genome encoded the complete three open reading frames (ORFs): ORF1a, ORF1b and ORF2. The total length was 6513 nucleotides, excluding the 3' poly (A) tail and the nucleotide composition was 28% A, 22% G, 26% T, 23% C. The G/C composition was 45%. The GenBank accession number for the canine astrovirus sequence is KX756441.
A phylogenetic tree was constructed using the protein alignment from the conserved region of the capsid (ORF2) of the astrovirus characterised in this study (DD1) and other canine astrovirus ORF2, together with Mamastrovirus sequences from different mammalian species, including a chicken astrovirus as an outgroup. The phylogenetic analysis grouped our canine astrovirus within the canine astrovirus clade. The closest canine astroviruses to our Australian sample were from UK and China with an identity between 98.82%-99.41% (Fig 2).

Discussion
Using next generation sequencing and metagenomics analysis, the virome in faecal samples from 8 healthy dogs and 8 dogs with acute diarrhoea is described. Only a single previous shotgun metagenomic study investigating the faecal virome of dogs with diarrhoea has been reported. In that study, mammalian viruses were found in 15 samples and two new virus species were described [6]. Our study analysed 16 faecal samples from dogs (8 healthy and 8 diarrhoeic), and identified eukaryotic viruses in 12 samples, including all diarrhoeic samples and 50% of the healthy samples. Thus, 70% of canine faeces contained eukaryotic viruses, suggesting that mammalian viruses are a common component of the enteric microbial population in dogs.
Our results must be interpreted with caution, due to bias created by SISPA. Areas of exaggerated depth appear when the SISPA method is used, creating artefacts during de novo assembly. This results in regions of repetitive sequences [36]. In order to overcome this bias, all contigs/singletons were analysed at family level and only for viral eukaryotic families were the results further analysed to evaluate what percentage they covered to some specific viral species.
The most common viral contigs/singletons identified in both groups were bacteriophages, similar to previous findings from human and other animal faecal virome studies [2, 5, [11].
Bacteriophages modify diversity of bacterial populations due to their lytic life cycle and also promote different characteristics in the bacterial population due to their lysogenic life cycle transferring genes such as encoding toxins or resistance to antibiotic [38]. This life-cycle may lead to bacteriophages conferring advantage to some bacterial species in the environmental niche [39]. Therefore, it is possible that the greater amount of contigs/singletons corresponding to bacteriophages identified in the group of dogs with acute diarrhoea, when compared to healthy dogs, means a higher amount of bacteriophages. If so, bacteriophages could have generated a change in the normal balance of bacterial population resulting in dysbiosis, and ultimately causing diarrhoea. Conversely, it could be that an initial change in the bacterial population in these dogs resulted from the acute diarrhoea [40,41] is the cause of the variation in the bacteriophage population. In our sample population, the latter explanation is most likely, because in a shelter environment a higher number of circulating pathogens, changes in diet and a stressful environment could contribute to the dysbiosis associated with acute diarrhoea [42]. The bacterial microbiome and the analysis of contigs/singletons matching specific bacteriophages were not assessed in this study, therefore further microbiome/virome cross analysis is necessary to elucidate the association between bacteria and bacteriophages in dogs. However, even this analysis would be unlikely to determine the cause or effect relationship between bacteriophages and dysbiosis at a single point in time.
The analysis of the lowest common ancestor of eukaryotic viral families, according to MEGAN, identified eight eukaryotic virus species (Table 1). However, each of these results require validation by targeted PCR, or whole genome characterisation of each species, as the NGS results after SISPA amplification may be biased and not accurate depiction at a species level [36].
Sequences matching those of human viruses (adenovirus and papillomavirus) were found in one sample from a healthy dog. Only one contig from each virus covering a very small percentage of the genome in both cases. This finding could suggest contamination during collection or processing.
Known enteric pathogenic families Parvoviridae and Coronaviridae were identified in samples from both healthy dogs and dogs with acute diarrhoea. Interestingly, almost all positive samples were from puppies (between 4-8 months) that had been vaccinated less than one month prior to sampling. The lowest common ancestry analysis in MEGAN of the contigs/singletons matching Parvoviridae family, suggested they were canine parvovirus (CPV), but as CPV positive dogs (as tested by faecal antigen tests) were not included in this study it is highly likely these results represent vaccine derived sequences not detected by the CPV antigen detection kit, or represent a virus load below the level of detection. Previous studies have demonstrated that modified live vaccine virus can be detected in faecal samples for extended periods of time after vaccination [43]. Further genome characterisation of these canine parvovirus is warranted to confirm this hypothesis.
Three individual samples contained Coronaviridae contigs/singletons, two of which were from puppies [ND10 and DD8] ( Table 1) and one from an adult dog [DD1] ( Table 1). Our results are consistent with Li et al 2011, who also reported the highest number of Coronaviridae reads in one sample collected from a puppy [6]. Canine coronavirus can be shed in faeces in high numbers for up to 156 days [44,45]. These findings validate the affinity of the Coronaviridae viral family to infect young individuals [46], and present as a common enteric pathogen in a shelter environment [42,45,47].
The uncommon viruses, canine kobuvirus and canine norovirus, were identified only in samples from dogs with acute diarrhoea. Previous studies have suggested these viruses may have some association with enteric disease in dogs, however, both viral species have been detected in both healthy dogs and dogs with diarrhoea [6,48] Our shotgun metagenomic sequence data indicated that the most frequent RNA viral family in dog samples with acute diarrhoea was Astroviridae, being identified in more than half of the diarrheal samples. [49]. In dogs, astrovirus has been previously detected mainly in puppies with diarrhoea, but has also been occasionally reported in healthy dogs [50][51][52][53][54]. The only previous report of a possible canine astrovirus in Australia was described in canine faeces in the 1984, where astrovirus-like particles were detected using electronic microscopy in healthy dogs [55].
To date, canine astrovirus has been reported in USA [56], China [51], Italy [50,57,58], UK [52], France [53], Brazil [59], Korea [60] and Japan [54]. The first description of the complete genome of two canine astroviruses was reported by a group of researchers from the UK in 2015 [52]. The current study contributes the first description of the complete genome of a canine astrovirus identified in Australia. In our study, using Sanger sequencing a near complete genome of a canine astrovirus was assembled from one dog with acute diarrhoea. A phylogenetic tree, analysing the capsid region (ORF2) of this Australian canine astrovirus and other astrovirus sequences present in GenBank, determined that it belonged to the canine astrovirus clade, very closely related to the canine astrovirus strains from the UK and China (Fig 2).
It is interesting to note that all canine astrovirus positive samples, were collected from the same shelter and obtained within a short period of time (Sept-Nov 2012). We could infer that this virus was endemic at that time in that shelter, and or maybe could represent an outbreak of diarrhoea in the shelter within that period of time. A more sensitive test (i.e.: quantitative PCR) in a larger number of samples from cases and controls may be useful to better understand the potential role of astroviruses as an aetiological agent in acute diarrhoea of dogs.

Conclusion
In this study we analysed the faecal virome in healthy dogs and compared these findings with the faecal virome of dogs with acute diarrhoea. Known DNA and RNA viruses were found, together with different proportions of bacteriophages in each group. In addition, we described and characterised the first complete genome of a canine astrovirus in Australia. Future longitudinal studies analysing viruses, bacteria and other potential pathogens should be performed to assess the aetiology of diarrhoea in dogs and further elucidate the pathological importance of viruses found in dog intestines.

Animals and sample collection
Faecal samples from a total of 16 dogs were obtained between September 2012 and March 2013. All dogs were aged between 2.5 months and 7 years; and comprised 5 females and 11 males of various breeds ( Table 1).
All faecal samples were collected from a single shelter in Melbourne (Lost Dogs Home), Australia. All samples were maintained at 4˚C for up to 24 hrs, then were transported on dry ice before storing up to five aliquots of 500 mg of faeces each at -80˚C until further analyses. Information about age, sex, breed, diet, vaccination and deworming status was recorded for each dog (University of Melbourne Animal Ethics Committee approval IDs 1413272.2 and 1112035.1).
Animals were determined to be healthy based on physical examination by a veterinarian and absence of any clinical signs of disease. Faecal consistency was considered normal as per published criteria (Faecal scoring chart, PURINA), and all dogs had been treated with deworming drugs for prophylaxis (Ilium Pyraquantal, TROY or Milbemax, Novartis). All samples were lifted from the floor, first thing in the morning before cleaning, during November 2012.
Faecal samples from 8 dogs with an acute onset of diarrhoea (less than 3 days of duration), were collected by a veterinarian from within the animal's enclosure. All dogs with acute diarrhoea were tested for the presence of canine parvovirus antigen in faeces using the Anigen rapid CPV/CCV Ag Test kit, (Bionote). Positive samples were excluded from the study.
None of the dogs had been treated with antimicrobial drugs within the previous 8 weeks of sample collection. The majority of healthy dogs were receiving commercial dry food and some of the dogs with diarrhoea were being fed a high-fibre prescription veterinary diet (Hill's i/d diet).

Sample preparation and faecal extract preparation
Faecal samples were processed as described previously [6,12]. Briefly, aliquots of 500 mg of faecal sample were thawed and re-suspended in saline buffer (0.01M Tris solution (pH7.5), 0.15M NaCl, 0.01M CaCl 2 ) at 3:1 ratio of solid mass. One mm zirconia/silica beads were added to the stool solution, filling around 150μL of an Eppendorf tube, and vortexed vigorously for 3 minutes. The samples were then centrifuged at 17900 x g for 5 min, collecting the supernatant and repeating this step three more times. To reduce solid faecal matter and bacterial contamination, 500 μl of this solution was filtered through a 0.45 μm tube filter (Corning Costar Spin X) by centrifugation at 3800 x g for 5 minutes, then the filtrate was transferred to 2 mL tubes.

Pre-extraction nucleic acid digestion
To enrich for viral DNA and RNA, a DNase/RNase step was incorporated using a modified protocol described previously [6,12]. Each sample was treated with a cocktail of DNases (Turbo DNase, from Ambion, Baseline-ZERO from Epicentre, Benzonase from Novagen and DNase I from Roche) and RNase A (QIAGEN). This mixture was incubated in a water bath at 37˚C for 3 hours. To stop the enzymatic activity, EDTA (AMRESCO) was added in a final concentration of 15 mM to each sample and incubated at 75˚C for 10 min.

Nucleic acid extraction and reverse transcription
Viral DNA/RNA protected from digestion within viral capsids were extracted using QIAamp Viral RNA mini kit (QIAGEN), according to manufacturer's recommendations. A second DNase/RNase step was performed on the extracted viral RNA for elimination of genomic DNA, using DNase I recombinant, RNase free (10U/μl) (Roche) and Protector RNase inhibitor (40 U/μl) (Roche). After digestion of the DNA, the viral RNA was transcribed with Sensiscript Reverse Transcriptase kit (QIAGEN; Sensiscript RT kit) to generate cDNA, according to manufacturer's instructions with minor modifications. Briefly, for a more sensitive detection in the subsequent PCR, a mixture of oligo-dt primers (Oligo (dT)15 primer, Promega) and random primers (Random hexamers, TaqMan Reverse Transcription Reagents, Roche, Applied Biosystems) were used and a RNA denaturation step (95˚for 3 minutes) was added.

Random amplification, Sequence-Independent Single Primer Amplification (SISPA) method
Viral cDNA and genomic DNA were randomly amplified using a modified SISPA protocol [61,62]. Briefly, a second strand synthesis was performed with Large (Klenow) Fragment (New England Biolabs) and random hexamers (Roche, Biosystems, 50μM) followed by digestion of the second strand product with the restriction enzyme CviQI (Csp6.1), (New England Biolabs). Then a CSp11/NBam24 adaptor was ligated to the digested DNA using T4 DNA ligase (Invitrogen) followed by PCR amplification of the adaptor-ligated product with NBam24 PCR primers. An aliquot of the PCR product was validated on a 1% agarose TBE gel, where a positive smear with multiple bands confirmed the random SISPA amplification of nucleic acid products.

Viral library preparation and sequencing
The amplified PCR products were cleaned up using WIZARD SV Gel and PCR clean-up system (Promega) following manufacturer's recommendations and two libraries with dual indexing for each sample were generated (DNA and cDNA) with Illumina Nextera XT DNA Sample Preparation kit, according to manufacturer indications. After visualise it with Agilent 2200 Tape Station System (Agilent Technologies), the libraries were submitted to the Australian Genome Research Facility (AGRF) for a 250 bases paired-end sequencing on the MiSeq Illumina platform.

Bioinformatic analyses
All raw sequences were deposited under Bioproject ID: PRJNA380672 at NCBI database. Raw sequences were trimmed by quality score with PrinSeq software (v0.20.3) [63], filtering for low quality reads from both ends using the DUST score [64] with a threshold of 7. Poly A/T tails in both ends (ten nucleotides of each end) and SISPA primers sequences were also removed using this software. The Mothur software v.1.31.2 [65] was applied and the sequences were trimmed again, eliminating homopolymers, ambiguous bases and sequences less than 100bp. After these trimming steps, high quality reads (HQR) were obtained and all bad quality reads were removed from the group file (Fig 1). Subsequently, these dog free sequences were compared against a bacterial database (CAM-ERA prokaryotic nucleotide database 10572.V7, Nov 2012; http://camera.calit2.net/) [66] to eliminate bacterial sequences, using the BLASTn (Blast 2.2.29+ standalone) algorithm with an 80% identity cut off. To extract cellular organism sequences from the group file, MEGAN V5.2.1 and Mothur software were used as described above (Fig 1).
All contigs and singletons clusters were analysed through two pipelines. (1) Contigs and singletons clusters were compared against the CAMERA Viral Nucleotide Sequence database 10570.V9, using tBLASTx search with an E-value cut off 10 −5 ; (2) Contigs and singletons clusters were compared against the NCBI nucleotide database (2012) using BLASTn search with an E value cut off 1. All blast searches were performed in Blast 2.2.29+ standalone. These files were then analysed by MEGAN V5.2.1 [37] and the lowest common ancestor of known viral sequences were identified (Fig 1).
Finally, all viral contigs and singletons of eukaryotic organisms present in both analyses were aligned and compared with the NCBI reference sequence of the lowest common ancestor given by MEGAN V5.2.1. All alignments were made using Sequencher version 5.0.1 sequence analysis software (Gene Codes Corporation, Ann Arbor, MI USA) with minimum match percentage of 70%-80% and minimum overlap 50 as assembly parameters, evaluating the percentage of coverage of the genome.
All contigs and singletons with no hits were re-evaluated using online BLASTn with an E value cut off 10 and visualised with MEGAN V5.2.1 to evaluate the alignment with its lowest common ancestor.

Sequencing of canine astrovirus genome
In order to acquire the complete genome of canine astrovirus, multiple sets of primers were selected from the literature or designed based on sequences obtained from Illumina reads (S2 Table). Nucleic acids from a single faecal sample from a dog with acute diarrhoea (DD1), which had 18 contigs/singletons of canine astrovirus (after tBLASTx analysis) was used to determine the complete genome sequence. RNA was extracted directly from the centrifuged sample after faecal extraction, previous to enrichment of viral nucleic acids as outlined before.
RT-PCR was performed with SuperScript III One-Step RT-PCR System with Platinum Taq (Invitrogen™). PCR conditions used were: 45˚C for 60 min and 95˚C for 5 min, 35 cycle of 94˚C for 40 sec, 55˚C for 1min and 72˚C for 5 min, and a final elongation step of 72˚C for 5 min, followed by final hold at 4˚C. PCR products were run on a 1.2% agarose TBE gel stained with RedSafe™ nucleic acid staining solution (iNtRON Biotechnology). All PCR products were excised and cleaned up with WIZARD SV Gel and PCR clean-up system (Promega) following manufacturer's protocol and sequenced using Sanger sequencing at the AGRF.
The near complete genome of the canine astrovirus was assembled using Sequencher version 5.0.1 sequence analysis software (Gene Codes Corporation, Ann Arbor, MI USA) with minimum match percentage 80 and minimum overlap 50 as assembly parameters.

Phylogenetic analysis
Phylogenetic analysis of this canine astrovirus was performed aligning protein sequences of the 172 conserved amino acids of the capsid region (ORF2) from different species (S2 Fig), using CLUSTAL W, from MEGA version 6.0 [69] with default settings. A phylogenetic tree with 1000 bootstrap was generated using the Maximum likelihood method based on the JTT matrix-based model [70], using MEGA version 6.0. The percentage of identity was calculated with CLUSTALO 1.2.4 [71] Supporting information S1 Fig. MEGAN