Table 1.
Recall, precision, and F1-score in the binary case.
Table 2.
Emotions considered in bilingual emotion recognition with a common model set.
Fig 1.
Computation of shifted delta cepstral (SDC) coefficients.
Fig 2.
Architecture of the proposed convolutional neural network (CNN)-based classifier.
Table 3.
Spoken language identification rates [%] using English and German emotional speech data.
Table 4.
Recalls for speech emotion recognition using IEMOCAP and DNN.
Table 5.
Recalls for speech emotion recognition using IEMOCAP and CNN.
Table 6.
Precisions for speech emotion recognition using IEMOCAP and DNN.
Table 7.
Precisions for speech emotion recognition using IEMOCAP and CNN.
Table 8.
F1-scores for speech emotion recognition using IEMOCAP and DNN.
Table 9.
F1-scores for speech emotion recognition using IEMOCAP and CNN.
Table 10.
Confusion matrix [%] using IEMOCAP and DNN with MFCC/SDC features.
Table 11.
Confusion matrix [%] using IEMOCAP and CNN with MFCC/SDC features.
Table 12.
Recalls for speech emotion recognition using FAU Aibo and DNN.
Table 13.
Recalls for speech emotion recognition using FAU Aibo and CNN.
Table 14.
Precisions for speech emotion recognition using FAU Aibo and DNN.
Table 15.
Precisions for speech emotion recognition using FAU Aibo and CNN.
Table 16.
F1-scores for speech emotion recognition using FAU Aibo and DNN.
Table 17.
F1-scores for speech emotion recognition using FAU Aibo and CNN.
Table 18.
Confusion matrix [%] using FAU Aibo and DNN with MFCC/SDC features.
Table 19.
Confusion matrix [%] using FAU Aibo and CNN with MFCC/SDC features.
Table 20.
Recalls for speech emotion recognition using a common model set and DNN.
Table 21.
Recalls for speech emotion recognition using a common model set and CNN.
Table 22.
Precisions for speech emotion recognition using a common model set and DNN.
Table 23.
Precisions for speech emotion recognition using a common model set and CNN.
Table 24.
F1-scores for speech emotion recognition using a common model set and DNN.
Table 25.
F1-scores for speech emotion recognition using a common model set and CNN.
Table 26.
Training and test instances for the IEMOCAP corpus.
Table 27.
Confusion matrix [%] for spoken language identification in the first pass.
Fig 3.
UARs for multilingual and monolingual emotion recognition for three languages.