Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement

09/03/2023
by   Yu-Wen Chen, et al.
0

Speech emotion recognition (SER) often experiences reduced performance due to background noise. In addition, making a prediction on signals with only background noise could undermine user trust in the system. In this study, we propose a Noise Robust Speech Emotion Recognition system, NRSER. NRSER employs speech enhancement (SE) to effectively reduce the noise in input signals. Then, the signal-to-noise-ratio (SNR)-level detection structure and waveform reconstitution strategy are introduced to reduce the negative impact of SE on speech signals with no or little background noise. Our experimental results show that NRSER can effectively improve the noise robustness of the SER system, including preventing the system from making emotion recognition on signals consisting solely of background noise. Moreover, the proposed SNR-level detection structure can be used individually for tasks such as data selection.

READ FULL TEXT
research
10/29/2020

UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-noise Ratio Condition

Speech enhancement at extremely low signal-to-noise ratio (SNR) conditio...
research
11/19/2019

Distributed Microphone Speech Enhancement based on Deep Learning

Speech-related applications deliver inferior performance in complex nois...
research
05/17/2021

Dual-Stage Low-Complexity Reconfigurable Speech Enhancement

This paper proposes a dual-stage, low complexity, and reconfigurable tec...
research
01/21/2019

Learning sound representations using trainable COPE feature extractors

Sound analysis research has mainly been focused on speech and music proc...
research
10/21/2020

Dynamic Layer Customization for Noise Robust Speech Emotion Recognition in Heterogeneous Condition Training

Robustness to environmental noise is important to creating automatic spe...
research
03/02/2021

Investigations on Audiovisual Emotion Recognition in Noisy Conditions

In this paper we explore audiovisual emotion recognition under noisy aco...
research
01/29/2021

Speech Enhancement for Wake-Up-Word detection in Voice Assistants

Keyword spotting and in particular Wake-Up-Word (WUW) detection is a ver...

Please sign up or login with your details

Forgot password? Click here to reset