Segregating Complex Sound Sources through Temporal Coherence
Figure 4
Segregation of speech mixtures.
(A) Mixture of two sample utterances (left panel), one spoken by a female speaker (middle panel) and one by a male speaker (right panel); the pitch tracks of the utterances are shown below each panel. (B) Speech segregated using all C-matrix columns. (C) Speech segregated using only coincidences among the frequency-scale channels (no pitch information). (D) Speech segregated using the channels surrounding the pitch channels of the female speaker as the anchor.
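The anchor-based segregation in panel (D) can be pictured with a toy sketch. This is not the authors' implementation: it assumes that each channel is reduced to a short envelope sequence, that the Pearson correlation between envelopes stands in for the coincidence measure, and that a fixed threshold turns one C-matrix column into a binary mask. The function names (`coincidence_matrix`, `anchor_mask`, `apply_mask`) and the threshold value are hypothetical.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / math.sqrt(vx * vy)

def coincidence_matrix(channels):
    """Toy stand-in for the C-matrix: pairwise correlation of channel envelopes."""
    return [[pearson(a, b) for b in channels] for a in channels]

def anchor_mask(c_matrix, anchor, threshold=0.5):
    """Binary mask from the anchor's column (threshold is an assumed value)."""
    column = [row[anchor] for row in c_matrix]
    return [1.0 if c >= threshold else 0.0 for c in column]

def apply_mask(channels, mask):
    """Keep channels coherent with the anchor and zero out the rest."""
    return [[m * v for v in ch] for ch, m in zip(channels, mask)]

# Two hypothetical sources with uncorrelated envelopes.
src_a = [1, 0, 1, 0, 1, 0, 1, 0]
src_b = [0, 0, 1, 1, 0, 0, 1, 1]

# Channels 0 and 1 follow source A; channels 2 and 3 follow source B.
channels = [
    src_a,
    [2 * v for v in src_a],
    src_b,
    [0.5 * v for v in src_b],
]

c_matrix = coincidence_matrix(channels)
mask = anchor_mask(c_matrix, anchor=0)   # anchor on a channel driven by source A
segregated = apply_mask(channels, mask)
print(mask)
```

Anchoring on a channel driven by the other source (for example `anchor=2`) selects the complementary set of channels, which mirrors the idea of choosing an anchor near one speaker's pitch channels.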