Emotion Profile Refinery for Speech Emotion Classification

08/12/2020
by   Shuiyang Mao, et al.
0

Human emotions are inherently ambiguous and impure. When designing systems to anticipate human emotions based on speech, the lack of emotional purity must be considered. However, most of the current methods for speech emotion classification rest on the consensus, e.g., one single hard label for an utterance. This labeling principle imposes challenges for system performance considering emotional impurity. In this paper, we recommend the use of emotional profiles (EPs), which provides a time series of segment-level soft labels to capture the subtle blends of emotional cues present across a specific speech utterance. We further propose the emotion profile refinery (EPR), an iterative procedure to update EPs. The EPR method produces soft, dynamically-generated, multiple probabilistic class labels during successive stages of refinement, which results in significant improvements in the model accuracy. Experiments on three well-known emotion corpora show noticeable gain using the proposed method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/30/2021

Enhancing Segment-Based Speech Emotion Recognition by Deep Self-Learning

Despite the widespread utilization of deep neural networks (DNNs) for sp...
research
08/15/2020

EigenEmo: Spectral Utterance Representation Using Dynamic Mode Decomposition for Speech Emotion Classification

Human emotional speech is, by its very nature, a variant signal. This re...
research
08/15/2020

Advancing Multiple Instance Learning with Attention Modeling for Categorical Speech Emotion Recognition

Categorical speech emotion recognition is typically performed as a seque...
research
06/23/2015

Detection and Analysis of Emotion From Speech Signals

Recognizing emotion from speech has become one the active research theme...
research
08/30/2018

Contribution of Glottal Waveform in Speech Emotion: A Comparative Pairwise Investigation

In this work, we investigated the contribution of the glottal waveform i...
research
05/10/2022

Bridging the prosody GAP: Genetic Algorithm with People to efficiently sample emotional prosody

The human voice effectively communicates a range of emotions with nuance...
research
11/22/2019

Decision Making guided by Emotion A computational architecture

A computational architecture is presented, in which "swift and fuzzy" em...

Please sign up or login with your details

Forgot password? Click here to reset