Deep Convolutional Pooling Transformer for Deepfake Detection

by   Tianyi Wang, et al.

Recently, Deepfake has drawn considerable public attention due to security and privacy concerns in social media digital forensics. As the wildly spreading Deepfake videos on the Internet become more realistic, traditional detection techniques have failed in distinguishing between the real and fake. Most existing deep learning methods mainly focus on local features and relations within the face image using convolutional neural networks as a backbone. However, local features and relations are insufficient for model training to learn enough general information for Deepfake detection. Therefore, the existing Deepfake detection methods have reached a bottleneck to further improving the detection performance. To address this issue, we propose a deep convolutional Transformer to incorporate the decisive image features both locally and globally. Specifically, we apply convolutional pooling and re-attention to enrich the extracted features and enhance the efficacy. Moreover, we employ the barely discussed image keyframes in model training for performance improvement and visualize the feature quantity gap between the key and normal image frames caused by video compression. We finally illustrate the transferability with extensive experiments on several Deepfake benchmark datasets. The proposed solution consistently outperforms several state-of-the-art baselines on both within- and cross-dataset experiments.


page 1

page 3

page 8


On Improving Cross-dataset Generalization of Deepfake Detectors

Facial manipulation by deep fake has caused major security risks and rai...

Local Relation Learning for Face Forgery Detection

With the rapid development of facial manipulation techniques, face forge...

One Detector to Rule Them All: Towards a General Deepfake Attack Detection Framework

Deep learning-based video manipulation methods have become widely access...

Infrared Small-Dim Target Detection with Transformer under Complex Backgrounds

The infrared small-dim target detection is one of the key techniques in ...

KESDT: knowledge enhanced shallow and deep Transformer for detecting adverse drug reactions

Adverse drug reaction (ADR) detection is an essential task in the medica...

Comment on "No-Reference Video Quality Assessment Based on the Temporal Pooling of Deep Features"

In Neural Processing Letters 50,3 (2019) a machine learning approach to ...

Automatic Detection of Rail Components via A Deep Convolutional Transformer Network

Automatic detection of rail track and its fasteners via using continuous...

Please sign up or login with your details

Forgot password? Click here to reset