A breakthrough in Speech emotion recognition using Deep Retinal Convolution Neural Networks

07/12/2017
by   Yafeng Niu, et al.
0

Speech emotion recognition (SER) is to study the formation and change of speaker's emotional state from the speech signal perspective, so as to make the interaction between human and computer more intelligent. SER is a challenging task that has encountered the problem of less training data and low prediction accuracy. Here we propose a data augmentation algorithm based on the imaging principle of the retina and convex lens, to acquire the different sizes of spectrogram and increase the amount of training data by changing the distance between the spectrogram and the convex lens. Meanwhile, with the help of deep learning to get the high-level features, we propose the Deep Retinal Convolution Neural Networks (DRCNNs) for SER and achieve the average accuracy over 99 previous studies in terms of both the number of emotions and the accuracy of recognition. Predictably, our results will dramatically improve human-computer interaction.

READ FULL TEXT

page 2

page 7

research
04/28/2022

Emotion Recognition In Persian Speech Using Deep Neural Networks

Speech Emotion Recognition (SER) is of great importance in Human-Compute...
research
04/27/2017

End-to-End Multimodal Emotion Recognition using Deep Neural Networks

Automatic affect recognition is a challenging task due to the various mo...
research
09/15/2021

FSER: Deep Convolutional Neural Networks for Speech Emotion Recognition

Using mel-spectrograms over conventional MFCCs features, we assess the a...
research
10/28/2022

GM-TCNet: Gated Multi-scale Temporal Convolutional Network using Emotion Causality for Speech Emotion Recognition

In human-computer interaction, Speech Emotion Recognition (SER) plays an...
research
09/18/2021

Hybrid Data Augmentation and Deep Attention-based Dilated Convolutional-Recurrent Neural Networks for Speech Emotion Recognition

Speech emotion recognition (SER) has been one of the significant tasks i...
research
02/15/2018

Speech Emotion Recognition with Data Augmentation and Layer-wise Learning Rate Adjustment

In this work, we design a neural network for recognizing emotions in spe...
research
03/04/2021

Speech Emotion Recognition using Semantic Information

Speech emotion recognition is a crucial problem manifesting in a multitu...

Please sign up or login with your details

Forgot password? Click here to reset