Enhanced sentiment analysis in tourism reviews via multimodal graph convolutional networks

doi:10.1371/journal.pone.0333628

Table 1.

Comparative analysis of work related to multimodal sentiment analysis.

More »

Expand

Fig 1.

The proposed multimodal sentiment analysis model.

More »

Expand

Fig 2.

Tourism reviews data collection and preprocessing.

More »

Expand

Fig 3.

The process of constructing the text graph.

More »

Expand

Fig 4.

The process of constructing the image graph.

More »

Expand

Fig 5.

The process of constructing the fusion graph [37].

More »

Expand

Fig 6.

The principle of the GraphSAGE algorithm.

More »

Expand

Table 2.

Parameter settings for BERT, ResNet, and GraphSAGE in the proposed model.

More »

Expand

Table 3.

Results of text single-modal comparison of the proposed model with each of the six SOTA models on the Yelp dataset, TripAdvisor dataset and Ctrip dataset.

More »

Expand

Table 4.

Results of multi-modal comparison of the proposed model with each of the six SOTA models on the Yelp dataset, TripAdvisor dataset and Ctrip dataset.

More »

Expand

Fig 7.

The heat map of the experimental results of the model proposed in this study on the Yelp dataset.

More »

Expand

Table 5.

Performance comparison of the proposed model with different loss function weight settings (α, β) across four datasets (TripAdvisor, Amazon, Ctrip, MM-CPC) in text-only modality.

More »

Expand

Table 6.

Results of comparison experiment on each of the other four datasets in multi-modal mode.

More »

Expand

Table 7.

Computational efficiency comparison between the proposed model and TETFN in terms of training time, inference time, and memory usage.

More »

Expand

Fig 8.

Accuracy and F1-score comparison between the proposed model and baseline methods (TFN, LMF, MulT) on the TripAdvisor and Yelp datasets across three sentiment classes.

More »

Expand

Fig 9.

Performance impact of removing individual components (BERT text encoder, ResNet visual encoder, or GraphSAGE aggregator) measured by relative change in accuracy (Δ Acc) and macro-F1 score (Δ F1) across all test datasets.

More »

Expand

Table 8.

Ablation study results comparing BERT and Word2Vec for text feature extraction (Accuracy, Precision, Recall, F1-score, AUC-ROC).

More »

Expand

Table 9.

Ablation study results comparing ResNet and plain CNN for image feature extraction (Accuracy, Precision, Recall, F1-score, AUC-ROC).

More »

Expand

Table 10.

Ablation study results comparing GraphSAGE and standard GCN for multimodal graph aggregation (Accuracy, Precision, Recall, F1-score, AUC-ROC).

More »

Expand