Fig 1.
Advantages of conversational agents.
The figure summarizes the key benefits of conversational agents in enhancing user interaction, personalization, and accessibility.
Table 1.
Comparative analysis of existing methods.
Fig 2.
Proposed architecture of the EAC-Agent framework.
The figure presents the overall workflow of the proposed system, including multimodal feature extraction, self- and cross-attention-based fusion, and emotion-aware response generation.
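To make the Fig 2 workflow concrete, the sketch below wires the stages together as plain Python functions. Every function body is a hypothetical placeholder for a component of EAC-Agent (the encoders and fusion module are detailed in Figs 3-5); only the data flow mirrors the figure.

```python
import numpy as np

rng = np.random.default_rng(0)

def audio_branch(waveform: np.ndarray) -> np.ndarray:
    # Placeholder acoustic encoder; the actual steps are shown in Fig 3.
    return rng.standard_normal(128)

def video_branch(frames: np.ndarray) -> np.ndarray:
    # Placeholder ViT-based visual encoder; the actual steps are shown in Fig 4.
    return rng.standard_normal(128)

def fuse(a: np.ndarray, v: np.ndarray) -> np.ndarray:
    # Stand-in for the self-/cross-attention fusion of Fig 5;
    # plain concatenation replaces the learned fusion here.
    return np.concatenate([a, v])

def generate_response(fused: np.ndarray) -> str:
    # Placeholder emotion-aware generator conditioned on the fused feature.
    return "<emotion-aware reply>"

waveform = rng.standard_normal(16000)           # 1 s of 16 kHz audio
frames = rng.standard_normal((8, 224, 224, 3))  # 8 video frames
reply = generate_response(fuse(audio_branch(waveform), video_branch(frames)))
```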
Fig 3.
Audio feature extraction process.
The figure illustrates the steps involved in noise reduction, MFCC computation, and statistical modeling for generating acoustic representations.
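A minimal sketch of the Fig 3 pipeline, assuming librosa for MFCC computation. The pre-emphasis filter and mean/std pooling below are common stand-ins for the noise-reduction and statistical-modeling steps; the exact variants used by EAC-Agent are not specified in the caption.

```python
import numpy as np
import librosa

def acoustic_features(wav_path: str, n_mfcc: int = 13) -> np.ndarray:
    y, sr = librosa.load(wav_path, sr=16000)
    # Pre-emphasis: a simple high-pass step that suppresses low-frequency noise.
    y = np.append(y[0], y[1:] - 0.97 * y[:-1])
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)  # (n_mfcc, frames)
    # Statistical modeling: summarize frame-level MFCCs by mean and std,
    # yielding a fixed-length acoustic representation of size 2 * n_mfcc.
    return np.concatenate([mfcc.mean(axis=1), mfcc.std(axis=1)])
```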
Fig 4.
Video feature extraction using a Vision Transformer (ViT).
The figure illustrates the process of patch extraction, embedding, and self-attention-based feature learning for visual representation.
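A minimal PyTorch sketch of the Fig 4 steps: patch extraction via a strided convolution, patch embedding with learned positional encodings, and self-attention-based encoding. All dimensions, depths, and the mean pooling are illustrative assumptions rather than the paper's configuration.

```python
import torch
import torch.nn as nn

class TinyViT(nn.Module):
    def __init__(self, img_size=224, patch=16, dim=192, heads=3, layers=2):
        super().__init__()
        # Patch extraction + linear embedding in one strided convolution.
        self.patchify = nn.Conv2d(3, dim, kernel_size=patch, stride=patch)
        n_patches = (img_size // patch) ** 2
        self.pos = nn.Parameter(torch.zeros(1, n_patches, dim))
        layer = nn.TransformerEncoderLayer(dim, heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, layers)

    def forward(self, x):                                # x: (B, 3, 224, 224)
        p = self.patchify(x).flatten(2).transpose(1, 2)  # (B, N, dim) patch tokens
        z = self.encoder(p + self.pos)                   # self-attention over patches
        return z.mean(dim=1)                             # pooled visual feature

features = TinyViT()(torch.randn(2, 3, 224, 224))        # (2, 192)
```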
Fig 5.
Self- and cross-attention mechanism.
The figure illustrates the interaction between self-attention and cross-attention for multimodal feature fusion.
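The interaction in Fig 5 can be sketched with standard attention modules: self-attention first refines each modality, then cross-attention lets each modality query the other before the two views are fused. The module below uses torch.nn.MultiheadAttention as a hedged illustration; the paper's exact layer arrangement and dimensions may differ.

```python
import torch
import torch.nn as nn

class SelfCrossFusion(nn.Module):
    def __init__(self, dim=192, heads=4):
        super().__init__()
        self.self_a = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.self_v = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_av = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_va = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, a, v):              # a: (B, Ta, dim), v: (B, Tv, dim)
        a, _ = self.self_a(a, a, a)       # intra-modal self-attention (audio)
        v, _ = self.self_v(v, v, v)       # intra-modal self-attention (video)
        a2v, _ = self.cross_av(a, v, v)   # audio queries video context
        v2a, _ = self.cross_va(v, a, a)   # video queries audio context
        # Pool over time and concatenate the two cross-attended views.
        return torch.cat([a2v.mean(dim=1), v2a.mean(dim=1)], dim=-1)

fused = SelfCrossFusion()(torch.randn(2, 50, 192), torch.randn(2, 30, 192))  # (2, 384)
```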
Table 2.
Multimodal datasets and their descriptions.
Table 3.
Statistics of the datasets.
Table 4.
Results on the IEMOCAP dataset.
Table 5.
Results on the MELD dataset.
Fig 6.
Performance comparison of different fusion methods on the two datasets.
The figure compares the classification performance of various fusion strategies on the IEMOCAP and MELD datasets.
Fig 7.
Confusion matrices for emotion classification.
The figure illustrates the classification performance of the proposed model on the IEMOCAP and MELD datasets.
Table 6.
Model performance across different test sets.
Table 7.
Perplexity, BLEU, and ROUGE scores of EAC-Agent across modalities.
Table 8.
Ablation study on the IEMOCAP and MELD datasets.