Oh, Jeez! or Uh-huh? A Listener-aware Backchannel Predictor on ASR Transcriptions

04/10/2023
by   Daniel Ortega, et al.
0

This paper presents our latest investigation on modeling backchannel in conversations. Motivated by a proactive backchanneling theory, we aim at developing a system which acts as a proactive listener by inserting backchannels, such as continuers and assessment, to influence speakers. Our model takes into account not only lexical and acoustic cues, but also introduces the simple and novel idea of using listener embeddings to mimic different backchanneling behaviours. Our experimental results on the Switchboard benchmark dataset reveal that acoustic cues are more important than lexical cues in this task and their combination with listener embeddings works best on both, manual transcriptions and automatically generated transcriptions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/28/2018

Multimodal Speaker Segmentation and Diarization using Lexical and Acoustic Cues via Sequence to Sequence Neural Networks

While there has been substantial amount of work in speaker diarization r...
research
03/02/2018

Lexico-acoustic Neural-based Models for Dialog Act Classification

Recent works have proposed neural models for dialog act classification i...
research
09/11/2023

Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach

Large language models (LLMs) have shown great promise for capturing cont...
research
09/14/2022

Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings

Models of acoustic word embeddings (AWEs) learn to map variable-length s...
research
05/17/2020

Multi-modal Automated Speech Scoring using Attention Fusion

In this study, we propose a novel multi-modal end-to-end neural approach...
research
04/08/2019

Giving Attention to the Unexpected: Using Prosody Innovations in Disfluency Detection

Disfluencies in spontaneous speech are known to be associated with proso...
research
07/04/2016

Modelling Context with User Embeddings for Sarcasm Detection in Social Media

We introduce a deep neural network for automated sarcasm detection. Rece...

Please sign up or login with your details

Forgot password? Click here to reset