Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning

06/08/2016
by   Tiancheng Zhao, et al.
0

This paper presents an end-to-end framework for task-oriented dialog systems using a variant of Deep Recurrent Q-Networks (DRQN). The model is able to interface with a relational database and jointly learn policies for both language understanding and dialog strategy. Moreover, we propose a hybrid algorithm that combines the strength of reinforcement learning and supervised learning to achieve faster learning speed. We evaluated the proposed model on a 20 Question Game conversational game simulator. Results show that the proposed method outperforms the modular-based baseline and learns a distributed representation of the latent dialog state.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/18/2017

Iterative Policy Learning in End-to-End Trainable Task-Oriented Neural Dialog Models

In this paper, we present a deep reinforcement learning (RL) framework f...
research
02/10/2017

Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning

End-to-end learning of recurrent neural networks (RNNs) is an attractive...
research
08/02/2017

Deep Reinforcement Learning for Inquiry Dialog Policies with Logical Formula Embeddings

This paper is the first attempt to learn the policy of an inquiry dialog...
research
05/08/2018

Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog

Creating an intelligent conversational system that understands vision an...
research
08/29/2018

Decoupling Strategy and Generation in Negotiation Dialogues

We consider negotiation settings in which two agents use natural languag...
research
05/19/2020

Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption

Spoken dialog systems have seen applications in many domains, including ...
research
04/07/2019

Unsupervised Dialog Structure Learning

Learning a shared dialog structure from a set of task-oriented dialogs i...

Please sign up or login with your details

Forgot password? Click here to reset