AFR-BERT: Attention-based mechanism feature relevance fusion multimodal sentiment analysis model

doi:10.1371/journal.pone.0273936

Table 1.

CMU-MOSI dataset information.

More »

Expand

Table 2.

CMU-MOSEI dataset information.

More »

Expand

Fig 1.

Structure of AFR-BERT multimodal sentiment analysis model.

AFR-BERT is divided into four network modules, which correspond to data input, data fusion, data analysis, and data output.

More »

Expand

Fig 2.

BiLSTM model structure.

(Forward) means forward propagation of the model. (Backward) means model backward propagation.

More »

Expand

Fig 3.

Cross-modal fusion attention mechanism structure.

(K_t) represents text feature data. (K_a) represents audio feature data. (Relu, Row Softmax, softmax, concat) are all function calculations. (Mask) is a matrix.

More »

Expand

Fig 4.

Scaled dot product attention structure.

(Q) means the query matrix. (K) means the key matrix. (V) means the value matrix. (Mask) represents matrix operations for processing non-fixed-length sequences. () is the scale factor for scaling.