Towards sentiment aided dialogue policy learning for multi-intent conversations using hierarchical reinforcement learning

doi:10.1371/journal.pone.0235367

Fig 1.

A sample chat transcript from the annotated dataset.

Table 1.

Statistics of the developed dataset.

Table 2.

Sentiment label distribution across the annotated dataset.

Fig 2.

End-to-end framework of the proposed two-level hierarchical dialogue manager fused with sentiment (ss).

Fig 3.

Architectural diagram of the Intent Classifier (IC) module.

Table 3.

Quantitative analysis of the intent classification module.

Fig 4.

Architectural diagram of the Slot-Filling (SF) module.

Table 4.

Quantitative analysis of the sentiment classification module.

Fig 5.

Learning curves of TR-based policies during training for different algorithms.

Fig 6.

Learning curves of various policies during training.

Fig 7.

Performance of the VAs during testing on different measures: (a) user satisfaction, (b) average number of turns.

Table 5.

p-values from Welch’s t-test comparing our proposed SR+TR model with the other models.

Fig 8.

Performance of the VAs tested with human evaluators: (a) success rate based on the binary marking schema, (b) distribution of user ratings based on the variable marking schema for SR+TR.

Fig 9.

Performance of the VAs during testing: (a) SR+TR, (b) TR.

Table 6.

Quantitative analysis of the slot-filling module.
