Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management

09/22/2020
by Zhi Chen, et al.

A task-oriented spoken dialogue system (SDS) aims to assist a human user in accomplishing a specific task (e.g., hotel booking). Dialogue management is a core part of an SDS and has two main tasks: dialogue belief state tracking (summarising the conversation history) and dialogue decision-making (deciding how to reply to the user). In this work, we focus only on devising a policy that chooses which dialogue action to take in response to the user. The sequential system decision-making process can be abstracted as a partially observable Markov decision process (POMDP). Under this framework, reinforcement learning approaches can be used for automated policy optimization. In the past few years, many deep reinforcement learning (DRL) algorithms, which use neural networks (NN) as function approximators, have been investigated for dialogue policy.
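As a rough illustration of the decision-making step described above, the sketch below maps a belief state to a probability distribution over dialogue actions and samples one. It is not the method of the paper: the action names, the three-dimensional belief vector, and the linear scoring weights are all hypothetical, and a DRL policy would replace the linear scorer with a trained neural network.

```python
import math
import random

# Hypothetical dialogue actions; the source does not list an action set.
ACTIONS = ["request_area", "confirm_price", "inform_hotel", "bye"]


def softmax(scores):
    """Turn raw scores into probabilities (shifted by the max for stability)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def policy(belief, weights):
    """Return a distribution over ACTIONS given a belief-state vector.

    Each row of `weights` scores one action as a dot product with the belief.
    This linear scorer is a stand-in for a neural network (an assumption).
    """
    scores = [sum(w * b for w, b in zip(row, belief)) for row in weights]
    return softmax(scores)


def select_action(belief, weights, rng):
    """Sample one dialogue action from the policy distribution."""
    probs = policy(belief, weights)
    idx = rng.choices(range(len(ACTIONS)), weights=probs, k=1)[0]
    return ACTIONS[idx], probs


rng = random.Random(0)
belief = [0.7, 0.2, 0.1]  # made-up belief over three slot hypotheses
weights = [[rng.uniform(-1, 1) for _ in belief] for _ in ACTIONS]  # untrained
action, probs = select_action(belief, weights, rng)
print(action, [round(p, 3) for p in probs])
```

In a reinforcement learning setup, the weights would be adjusted from dialogue rewards (for example, task success) rather than drawn at random as they are here.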

research
05/27/2019

AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning

Dialogue policy plays an important role in task-oriented spoken dialogue...
research
09/22/2020

Deep Reinforcement Learning for On-line Dialogue State Tracking

Dialogue state tracking (DST) is a crucial module in dialogue management...
research
11/29/2017

A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management

Dialogue assistants are rapidly becoming an indispensable daily aid. To ...
research
02/11/2018

Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces

In spoken dialogue systems, we aim to deploy artificial intelligence to ...
research
01/10/2013

Planning and Acting under Uncertainty: A New Model for Spoken Dialogue Systems

Uncertainty plays a central role in spoken dialogue systems. Some stocha...
research
11/30/2017

Uncertainty Estimates for Efficient Neural Network-based Dialogue Policy Optimisation

In statistical dialogue management, the dialogue manager learns a policy...
research
02/16/2020

A Multimodal Dialogue System for Conversational Image Editing

In this paper, we present a multimodal dialogue system for Conversationa...
