Multilingual and Cross-Lingual Intent Detection from Spoken Data

04/17/2021
by   Daniela Gerz, et al.
13

We present a systematic study on multilingual and cross-lingual intent detection from spoken data. The study leverages a new resource put forth in this work, termed MInDS-14, a first training and evaluation resource for the intent detection task with spoken data. It covers 14 intents extracted from a commercial system in the e-banking domain, associated with spoken examples in 14 diverse language varieties. Our key results indicate that combining machine translation models with state-of-the-art multilingual sentence encoders (e.g., LaBSE) can yield strong intent detectors in the majority of target languages covered in MInDS-14, and offer comparative analyses across different axes: e.g., zero-shot versus few-shot learning, translation direction, and impact of speech recognition. We see this work as an important step towards more inclusive development and evaluation of multilingual intent detectors from spoken data, in a much wider spectrum of languages compared to prior work.

READ FULL TEXT
research
10/23/2022

Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings

Zero-resource cross-lingual transfer approaches aim to apply supervised ...
research
05/15/2021

From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding

The lack of publicly available evaluation data for low-resource language...
research
05/03/2023

Plug-and-Play Multilingual Few-shot Spoken Words Recognition

As technology advances and digital devices become prevalent, seamless hu...
research
09/02/2021

ConQX: Semantic Expansion of Spoken Queries for Intent Detection based on Conditioned Text Generation

Intent detection of spoken queries is a challenging task due to their no...
research
03/10/2020

Efficient Intent Detection with Dual Sentence Encoders

Building conversational systems in new domains and with added functional...
research
09/12/2020

Intent Detection with WikiHow

Modern task-oriented dialog systems need to reliably understand users' i...
research
01/04/2022

A Hierarchical Model for Spoken Language Recognition

Spoken language recognition (SLR) refers to the automatic process used t...

Please sign up or login with your details

Forgot password? Click here to reset