All-for-One and One-For-All: Deep learning-based feature fusion for Synthetic Speech Detection

07/28/2023
by   Daniele Mari, et al.

Recent advances in deep learning and computer vision have made the synthesis and counterfeiting of multimedia content more accessible than ever, creating new threats from malicious users. In the audio field, speech deepfake generation techniques are growing rapidly, which calls for synthetic speech detection algorithms to counter malicious uses such as fraud or identity theft. In this paper, we consider three different feature sets proposed in the literature for the synthetic speech detection task and present a model that fuses them, achieving better overall performance than state-of-the-art solutions. The system was tested on different scenarios and datasets to demonstrate its robustness to anti-forensic attacks and its generalization capabilities.

research
09/16/2022

TIMIT-TTS: a Text-to-Speech Dataset for Multimodal Synthetic Media Detection

With the rapid development of deep learning techniques, the generation a...
research
09/28/2022

Deepfake audio detection by speaker verification

Thanks to recent advances in deep learning, sophisticated generation too...
research
09/20/2021

"Hello, It's Me": Deep Learning-based Speech Synthesis Attacks in the Real World

Advances in deep learning have introduced a new wave of voice synthesis ...
research
06/15/2023

Multi-modal Hate Speech Detection using Machine Learning

With the continuous growth of internet users and media content, it is ve...
research
10/06/2022

The Sound of Silence: Efficiency of First Digit Features in Synthetic Audio Detection

The recent integration of generative neural strategies and audio process...
research
09/15/2022

Open Challenges in Synthetic Speech Detection

In this paper the current status and open challenges of synthetic speech...
research
09/15/2022

Detecting Synthetic Speech Manipulation in Real Audio Recordings

Recent advances in artificial speech and audio technologies have improve...