Robust Dialogue State Tracking with Weak Supervision and Sparse Data

02/07/2022
by   Michael Heck, et al.
0

Generalising dialogue state tracking (DST) to new data is especially challenging due to the strong reliance on abundant and fine-grained supervision during training. Sample sparsity, distributional shift and the occurrence of new concepts and topics frequently lead to severe performance degradation during inference. In this paper we propose a training strategy to build extractive DST models without the need for fine-grained manual span labels. Two novel input-level dropout methods mitigate the negative impact of sample sparsity. We propose a new model architecture with a unified encoder that supports value as well as slot independence by leveraging the attention mechanism. We combine the strengths of triple copy strategy DST and value matching to benefit from complementary predictions without violating the principle of ontology independence. Our experiments demonstrate that an extractive DST model can be trained without manual span labels. Our architecture and training strategies improve robustness towards sample sparsity, new concepts and topics, leading to state-of-the-art performance on a range of benchmarks. We further highlight our model's ability to effectively learn from non-dialogue data.

READ FULL TEXT
research
04/19/2021

Temporal Query Networks for Fine-grained Video Understanding

Our objective in this work is fine-grained classification of actions in ...
research
01/28/2021

Attention Guided Dialogue State Tracking with Sparse Supervision

Existing approaches to Dialogue State Tracking (DST) rely on turn level ...
research
10/21/2020

Multi-Domain Dialogue State Tracking based on State Graph

We investigate the problem of multi-domain Dialogue State Tracking (DST)...
research
09/22/2020

CREDIT: Coarse-to-Fine Sequence Generation for Dialogue State Tracking

In dialogue systems, a dialogue state tracker aims to accurately find a ...
research
10/29/2017

Path-Based Attention Neural Model for Fine-Grained Entity Typing

Fine-grained entity typing aims to assign entity mentions in the free te...
research
09/16/2019

Domain Transfer in Dialogue Systems without Turn-Level Supervision

Task oriented dialogue systems rely heavily on specialized dialogue stat...

Please sign up or login with your details

Forgot password? Click here to reset