Spatio-Temporal Representation Learning Enhanced Source Cell-phone Recognition from Speech Recordings

08/25/2022
by Chunyan Zeng, et al.

Existing source cell-phone recognition methods lack a long-term characterization of the source device, so the features related to the source cell-phone are represented inaccurately and recognition accuracy suffers. In this paper, we propose a source cell-phone recognition method based on spatio-temporal representation learning, which consists of two main parts: extraction of sequential Gaussian mean matrix features, and construction of a recognition model based on spatio-temporal representation learning. In the feature extraction part, based on an analysis of the time-series representation of recording source signals, we exploit the sensitivity of the Gaussian mixture model to data distribution to extract a sequential Gaussian mean matrix with both long-term and short-term representation ability. In the model construction part, we design a structured spatio-temporal representation learning network, C3D-BiLSTM, to fully characterize the spatio-temporal information. It combines a 3D convolutional network and a bidirectional long short-term memory network to learn representations of short-term spectral information and long-term fluctuation information, and achieves accurate cell-phone recognition by fusing the spatio-temporal feature information of recording source signals. The method achieves an average accuracy of 99.03% in recognizing 45 cell-phones on the CCNU_Mobile dataset and 98.18% in small-sample-size experiments, outperforming existing state-of-the-art methods. The experimental results show that the method performs excellently in multi-class cell-phone recognition.
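The abstract does not spell out how the sequential Gaussian mean matrix is computed, so the following is only a rough sketch under stated assumptions. It assumes a diagonal-covariance background GMM is already available, that each fixed-length segment of frames adapts the GMM means by standard MAP mean adaptation, and that the per-segment mean matrices are stacked in time order. All names (`map_adapt_means`, `sequential_mean_matrices`, `seg_len`, `relevance`) and the random stand-in data are hypothetical and not taken from the paper.

```python
import numpy as np

def log_gauss_diag(frames, means, variances):
    """Log-density of each frame under each diagonal Gaussian: (N, D) -> (N, K)."""
    diff = frames[:, None, :] - means[None, :, :]            # (N, K, D)
    return -0.5 * (np.sum(diff ** 2 / variances, axis=2)
                   + np.sum(np.log(2 * np.pi * variances), axis=1))

def map_adapt_means(frames, weights, means, variances, relevance=16.0):
    """MAP-adapt the GMM means to one segment; returns a (K, D) mean matrix."""
    log_p = np.log(weights) + log_gauss_diag(frames, means, variances)
    log_p -= log_p.max(axis=1, keepdims=True)                # numerical stability
    resp = np.exp(log_p)
    resp /= resp.sum(axis=1, keepdims=True)                  # responsibilities (N, K)
    n_k = resp.sum(axis=0)                                   # soft counts (K,)
    e_k = (resp.T @ frames) / np.maximum(n_k, 1e-10)[:, None]
    alpha = (n_k / (n_k + relevance))[:, None]               # data-vs-prior weight
    return alpha * e_k + (1.0 - alpha) * means

def sequential_mean_matrices(frames, weights, means, variances, seg_len):
    """Split frames into consecutive segments and stack the adapted mean matrices."""
    n_seg = len(frames) // seg_len                           # trailing frames are dropped
    return np.stack([
        map_adapt_means(frames[i * seg_len:(i + 1) * seg_len],
                        weights, means, variances)
        for i in range(n_seg)
    ])                                                       # (n_seg, K, D)

# Stand-in data: 200 frames of 13-dim features and a 4-component GMM (assumed sizes).
rng = np.random.default_rng(0)
frames = rng.normal(size=(200, 13))
weights = np.full(4, 0.25)
means = rng.normal(size=(4, 13))
variances = np.ones((4, 13))
feats = sequential_mean_matrices(frames, weights, means, variances, seg_len=50)
```

Under these assumptions, `feats` is a time-ordered stack of mean matrices: each slice summarizes the short-term distribution of one segment, and the leading axis carries the long-term variation that a network such as C3D-BiLSTM could then model.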

