Fig 1.
Flowchart of de novo assemble in swamp buffalo transcriptome and SSR discovery.
Table 1.
Summary of results of sequence analysis.
Fig 2.
Characteristics of the results of homology search for swamp buffalo unigenes.
(A) Venn diagram of BLAST hits for unigenes against protein databases (E-value ≤1.0e-05). Numbers in the circles indicate the number of unigenes annotated by single or multiple databases, (B) E-value distribution of BLAST hits for each unique sequence (E-value ≤1.0e-05). (C) Identity distribution of BLAST hits for each sequence. (D) Species distribution of the top BLAST hits for the assembled unigenes (E-value ≤1.0e-05).
Fig 3.
Gene ontology (GO) classification of assembled unigenes.
A total of 14,439 unigenes with significant similarity in NR protein databases were assigned to GO classifications.
Fig 4.
EuKaryotic orthologous group (KOG) classification.
Approximately 15,813 of the 86,017 unigenes with NR hits were grouped into 25 KOG classifications.
Fig 5.
Pathway assignment based on the Kyoto Encyclopedia of Genes and Genomes (KEGG).
(A) Classification based on cellular processes categories, (B) Classification based on environmental information processing categories, (C) Classification based on genetic information processing categories, (D) Classification based on human diseases categories, (E) Classification based on metabolism categories, and (F) Classification based on organismal systems categories.
Table 2.
Summary of SSR mining results.
Table 3.
Repeat motif type distribution in SSRs ≥ 15 bp in length.
Fig 6.
Frequency of classified repeat types of SSRs.
The most abundant dinucleotide and trinucleotide motifs were AC/GT and AGC/CTG.
Table 4.
Characterization of 69 SSRs in the swamp buffalo.
Fig 7.
UPGMA dendrogram of the genetic relationships among seven buffalo breeds.
The dendrogram was generated using the Nei similarity coefficient based on 69 polymorphic primer pairs.