Spec-ResNet: A General Audio Steganalysis scheme based on Deep Residual Network of Spectrogram

by   Yanzhen Ren, et al.

The widespread application of audio and video communication technology make the compressed audio data flowing over the Internet, and make it become an important carrier for covert communication. There are many steganographic schemes emerged in the mainstream audio compression data, such as AAC and MP3, followed by many steganalysis schemes. However, these steganalysis schemes are only effective in the specific embedded domain. In this paper, a general steganalysis scheme Spec-ResNet (Deep Residual Network of Spectrogram) is proposed to detect the steganography schemes of different embedding domain for AAC and MP3. The basic idea is that the steganographic modification of different embedding domain will all introduce the change of the decoded audio signal. In this paper, the spectrogram, which is the visual representation of the spectrum of frequencies of audio signal, is adopted as the input of the feature network to extract the universal features introduced by steganography schemes; Deep Neural Network Spec-ResNet is well-designed to represent the steganalysis feature; and the features extracted from different spectrogram windows are combined to fully capture the steganalysis features. The experiment results show that the proposed scheme has good detection accuracy and generality. The proposed scheme has better detection accuracy for three different AAC steganographic schemes and MP3Stego than the state-of-arts steganalysis schemes which are based on traditional hand-crafted or CNN-based feature. To the best of our knowledge, the audio steganalysis scheme based on the spectrogram and deep residual network is first proposed in this paper. The method proposed in this paper can be extended to the audio steganalysis of other codec or audio forensics.


page 1

page 4

page 5


Multi-Stage Residual Hiding for Image-into-Audio Steganography

The widespread application of audio communication technologies has speed...

A Deep Learning-based Audio-in-Image Watermarking Scheme

This paper presents a deep learning-based audio-in-image watermarking sc...

High Resilience Diverse Domain Multilevel Audio Watermarking with Adaptive Threshold

A novel diverse domain (DCT-SVD & DWT-SVD) watermarking scheme is propos...

Calibrated Audio Steganalysis

Calibration is a common practice in image steganalysis for extracting pr...

Forensic Analysis and Localization of Multiply Compressed MP3 Audio Using Transformers

Audio signals are often stored and transmitted in compressed formats. Am...

Detection of Cross-Dataset Fake Audio Based on Prosodic and Pronunciation Features

Existing fake audio detection systems perform well in in-domain testing,...

Automatic Localization and Detection Applicable to Robust Image Watermarking Resisting against Camera Shooting

Robust image watermarking that can resist camera shooting has become an ...

Please sign up or login with your details

Forgot password? Click here to reset