Exploring Speaker-Related Information in Spoken Language Understanding for Better Speaker Diarization

05/22/2023
by   Luyao Cheng, et al.
0

Speaker diarization(SD) is a classic task in speech processing and is crucial in multi-party scenarios such as meetings and conversations. Current mainstream speaker diarization approaches consider acoustic information only, which result in performance degradation when encountering adverse acoustic conditions. In this paper, we propose methods to extract speaker-related information from semantic content in multi-party meetings, which, as we will show, can further benefit speaker diarization. We introduce two sub-tasks, Dialogue Detection and Speaker-Turn Detection, in which we effectively extract speaker information from conversational semantics. We also propose a simple yet effective algorithm to jointly model acoustic and semantic information and obtain speaker-identified texts. Experiments on both AISHELL-4 and AliMeeting datasets show that our method achieves consistent improvements over acoustic-only speaker diarization systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/19/2023

Improving Speaker Diarization using Semantic Information: Joint Pairwise Constraints Propagation

Speaker diarization has gained considerable attention within speech proc...
research
02/04/2022

The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge

This paper describes our speaker diarization system submitted to the Mul...
research
05/09/2018

Improving End-of-turn Detection in Spoken Dialogues by Detecting Speaker Intentions as a Secondary Task

This work focuses on the use of acoustic cues for modeling turn-taking i...
research
02/21/2019

Incremental Transfer Learning in Two-pass Information Bottleneck based Speaker Diarization System for Meetings

The two-pass information bottleneck (TPIB) based speaker diarization sys...
research
03/13/2023

A two-stage speaker extraction algorithm under adverse acoustic conditions using a single-microphone

In this work, we present a two-stage method for speaker extraction under...
research
10/30/2020

Comparison of Speaker Role Recognition and Speaker Enrollment Protocol for conversational Clinical Interviews

Conversations between a clinician and a patient, in natural conditions, ...
research
04/10/2023

Modeling Speaker-Listener Interaction for Backchannel Prediction

We present our latest findings on backchannel modeling novelly motivated...

Please sign up or login with your details

Forgot password? Click here to reset