Dialogue manager domain adaptation using Gaussian process reinforcement learning

09/09/2016
by   Milica Gašić, et al.

Spoken dialogue systems allow humans to interact with machines using natural speech. As such, they have many benefits. By using speech as the primary communication medium, a computer interface can facilitate swift, human-like acquisition of information. In recent years, speech interfaces have become ever more popular, as is evident from the rise of personal assistants such as Siri, Google Now, Cortana and Amazon Alexa. Recently, data-driven machine learning methods have been applied to dialogue modelling and the results achieved for limited-domain applications are comparable to or outperform traditional approaches. Methods based on Gaussian processes are particularly effective as they enable good models to be estimated from limited training data. Furthermore, they provide an explicit estimate of the uncertainty which is particularly useful for reinforcement learning. This article explores the additional steps that are necessary to extend these methods to model multiple dialogue domains. We show that Gaussian process reinforcement learning is an elegant framework that naturally supports a range of methods, including prior knowledge, Bayesian committee machines and multi-agent learning, for facilitating extensible and adaptable dialogue systems.
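As a rough illustration of how a Bayesian committee machine can pool Gaussian predictions from several domain-specific models, the sketch below applies the standard committee combination rule to scalar predictions. It is not taken from the paper: the function name `bcm_combine`, the domain names, the zero-mean prior and all numbers are assumptions made for illustration only.

```python
def bcm_combine(expert_predictions, prior_var):
    """Combine Gaussian predictions with the Bayesian committee machine rule.

    expert_predictions: list of (mean, variance) pairs, one per expert.
    prior_var: prior variance at the query point (a zero prior mean is assumed).
    Returns the combined (mean, variance).
    """
    if not expert_predictions:
        raise ValueError("need at least one expert prediction")
    m = len(expert_predictions)
    # Sum the expert precisions, then subtract the prior precision that
    # would otherwise be counted once per expert (m times instead of once).
    precision = sum(1.0 / var for _, var in expert_predictions) - (m - 1) / prior_var
    if precision <= 0:
        raise ValueError("combined precision must be positive; check the variances")
    combined_var = 1.0 / precision
    # Each expert's mean is weighted by its precision, so confident
    # (low-variance) experts dominate the combined estimate.
    combined_mean = combined_var * sum(mean / var for mean, var in expert_predictions)
    return combined_mean, combined_var


# Hypothetical value estimates for one state-action pair from two
# domain-specific models (illustrative numbers only).
predictions = {
    "restaurants": (1.0, 0.5),  # (mean, variance)
    "hotels": (3.0, 0.5),
}
mean, var = bcm_combine(list(predictions.values()), prior_var=1.0)
print(f"combined mean={mean:.3f}, variance={var:.3f}")
```

The combined variance is smaller than that of either expert, which is how an explicit uncertainty estimate lets a committee of domain models act with more confidence than any single member.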


research
11/26/2016

Deep Reinforcement Learning for Multi-Domain Dialogue Systems

Standard deep reinforcement learning methods such as Deep Q-Networks (DQ...
research
06/19/2017

Sub-domain Modelling for Dialogue Management with Hierarchical Reinforcement Learning

Human conversation is inherently complex, often spanning many different ...
research
06/01/2011

An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email

This paper describes a novel method by which a spoken dialogue system ca...
research
11/22/2019

Fleet Control using Coregionalized Gaussian Process Policy Iteration

In many settings, as for example wind farms, multiple machines are insta...
research
03/31/2018

Towards Learning Transferable Conversational Skills using Multi-dimensional Dialogue Modelling

Recent statistical approaches have improved the robustness and scalabili...
research
05/24/2016

On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems

The ability to compute an accurate reward function is essential for opti...
research
08/01/2019

Reinforcement Learning for Personalized Dialogue Management

Language systems have been of great interest to the research community a...
