Finding, visualizing, and quantifying latent structure across diverse animal vocal repertoires
Fig 20
Continuous projections from vocalizations.
(A) A spectrogram of each vocalization is computed. (B) Rolling windows are taken from each spectrogram at a set window length (here 5 ms) and a step size of one time frame of the short-time Fourier transform (STFT). (C) Windows are projected into a latent space (e.g., with UMAP or PCA).
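Steps (B) and (C) can be illustrated with a minimal sketch. It is not the authors' implementation: the spectrogram is synthetic random data, the function names `rolling_windows` and `pca_project` are hypothetical, and the window is assumed to span 5 STFT frames (which would correspond to 5 ms only if the STFT hop were 1 ms). PCA is computed directly from an SVD; UMAP could be substituted at the projection step.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def rolling_windows(spec, win_frames):
    """Cut a spectrogram (n_freq, n_time) into overlapping windows.

    Step size is one STFT time frame. Returns an array of shape
    (n_time - win_frames + 1, n_freq * win_frames), one flattened
    window per row.
    """
    # Result has shape (n_freq, n_windows, win_frames).
    windows = sliding_window_view(spec, win_frames, axis=1)
    # Put the window index first: (n_windows, n_freq, win_frames).
    windows = np.moveaxis(windows, 1, 0)
    return windows.reshape(windows.shape[0], -1)


def pca_project(X, n_components=2):
    """Project rows of X onto their first principal components via SVD."""
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:n_components].T


# Synthetic stand-in for one vocalization's spectrogram (assumption):
# 32 frequency bins by 100 STFT time frames.
rng = np.random.default_rng(0)
spec = rng.random((32, 100))

win_frames = 5  # assumed window length in frames
X = rolling_windows(spec, win_frames)  # one row per window
Z = pca_project(X, n_components=2)     # one 2-D point per window
```

Each row of `Z` is one point of the continuous trajectory traced by a vocalization through the latent space; consecutive rows correspond to windows shifted by a single time frame.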