Clinical Depression and Affect Recognition with EmoAudioNet

11/01/2019
by   Emna Rejaibi, et al.
0

Automatic analysis of emotions and affects from speech is an inherently challenging problem with a broad range of applications in Human-Computer Interaction (HCI), health informatics, assistive technologies and multimedia retrieval. Understanding human's specific and basic emotions and reacting accordingly can improve HCI. Besides, giving machines skills to understand human's emotions when interacting with other humans can help humans with a socio-affective intelligence. In this paper, we present a deep Neural Network-based architecture called EmoAudioNet which studies the time-frequency representation of the audio signal and the visual representation of its spectrum of frequencies. Two applications are performed using EmoAudioNet : automatic clinical depression recognition and continuous dimensional emotion recognition from speech. The extensive experiments showed that the proposed approach significantly outperforms the state-of-art approaches on RECOLA and DAIC-WOZ databases. The competitive results call for applying EmoAudioNet on others affects and emotions recognition from speech applications.

READ FULL TEXT
research
10/27/2017

Detection and Analysis of Human Emotions through Voice and Speech Pattern Processing

The ability to modulate vocal sounds and generate speech is one of the f...
research
11/14/2022

Describing emotions with acoustic property prompts for speech emotion recognition

Emotions lie on a broad continuum and treating emotions as a discrete nu...
research
09/06/2019

Towards Multimodal Emotion Recognition in German Speech Events in Cars using Transfer Learning

The recognition of emotions by humans is a complex process which conside...
research
11/29/2022

Analysis of constant-Q filterbank based representations for speech emotion recognition

This work analyzes the constant-Q filterbank-based time-frequency repres...
research
05/30/2011

Neural Networks for Emotion Classification

It is argued that for the computer to be able to interact with humans, i...
research
05/07/2023

Learning Robust Self-attention Features for Speech Emotion Recognition with Label-adaptive Mixup

Speech Emotion Recognition (SER) is to recognize human emotions in a nat...
research
01/18/2022

A Study on the Ambiguity in Human Annotation of German Oral History Interviews for Perceived Emotion Recognition and Sentiment Analysis

For research in audiovisual interview archives often it is not only of i...

Please sign up or login with your details

Forgot password? Click here to reset