Finding, visualizing, and quantifying latent structure across diverse animal vocal repertoires
Fig 3
Comparison between dimensionality reduction on spectrograms versus computed features of syllables.
Each plot shows 20 syllables of Cassin’s vireo song. (A) UMAP projections of 18 features (see S2 Table) of syllables generated using BioSound. (B) UMAP applied to spectrograms of syllables. (C) UMAP of spectrograms where color is the syllable’s average fundamental frequency (D) The same as (C) where pitch saliency of each syllable, which corresponds to the relative size of the first auto-correlation peak represents color.