Towards sentiment aided dialogue policy learning for multi-intent conversations using hierarchical reinforcement learning
Fig 5
Learning curve of TR based policies during training for different algorithms.
Click through the PLOS taxonomy to find articles in your field.
For more information about PLOS Subject Areas, click here.
Learning curve of TR based policies during training for different algorithms.