GLFF: Global and Local Feature Fusion for Face Forgery Detection

by Yan Ju, et al.

With the rapid development of deep generative models (such as Generative Adversarial Networks and Auto-encoders), AI-synthesized images of the human face are now of such high quality that humans can hardly distinguish them from pristine ones. Although existing detection methods perform well in specific evaluation settings, e.g., on images from seen models or on images without real-world post-processing, they tend to suffer serious performance degradation in real-world scenarios, where test images may be generated by more powerful generation models or subjected to various post-processing operations. To address this issue, we propose Global and Local Feature Fusion (GLFF), a framework that learns rich and discriminative representations for face forgery detection by combining multi-scale global features from the whole image with refined local features from informative patches. GLFF fuses information from two branches: a global branch that extracts multi-scale semantic features, and a local branch that selects informative patches and extracts detailed local artifacts from them. Because no existing face forgery dataset simulates real-world applications for evaluation, we further create a challenging face forgery dataset, named DeepFakeFaceForensics (DF^3), which contains 6 state-of-the-art generation models and a variety of post-processing techniques to approximate real-world scenarios. Experimental results demonstrate the superiority of our method over state-of-the-art methods on the proposed DF^3 dataset and three other open-source datasets.
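
The two-branch idea can be illustrated with a toy sketch. This is not the GLFF implementation: the abstract does not specify the backbone, the patch-selection criterion, or the fusion module, so everything below is an assumption made for illustration. Mean pooling over grids of different sizes stands in for the multi-scale global features, patch variance stands in for a learned "informativeness" score, and plain concatenation stands in for the fusion step. All function names (`grid_cells`, `global_features`, `local_features`, `fuse`) are hypothetical, and the input is assumed to be a square grayscale image whose side is divisible by the grid and patch sizes.

```python
from statistics import mean, pvariance


def grid_cells(image, size):
    """Split a 2D list into non-overlapping size x size cells, each flattened."""
    h, w = len(image), len(image[0])
    cells = []
    for top in range(0, h - size + 1, size):
        for left in range(0, w - size + 1, size):
            cells.append([image[r][c]
                          for r in range(top, top + size)
                          for c in range(left, left + size)])
    return cells


def global_features(image, scales=(1, 2)):
    """Global branch (toy): mean intensity over an s x s grid for each scale s."""
    side = len(image)
    feats = []
    for s in scales:
        feats.extend(mean(cell) for cell in grid_cells(image, side // s))
    return feats


def local_features(image, patch=2, k=2):
    """Local branch (toy): keep the k highest-variance patches.

    Variance is only a placeholder for a learned patch-selection score.
    Each selected patch contributes its mean and variance as features.
    """
    cells = grid_cells(image, patch)
    selected = sorted(cells, key=pvariance, reverse=True)[:k]
    feats = []
    for cell in selected:
        feats.extend([mean(cell), pvariance(cell)])
    return feats


def fuse(image):
    """Fusion (toy): concatenate global and local feature vectors."""
    return global_features(image) + local_features(image)
```

Calling `fuse` on a 4x4 image with the default settings yields a 9-element vector: 5 global values (one at scale 1, four at scale 2) followed by 4 local values (mean and variance for each of the two selected patches). In a real detector this fused vector would feed a classifier, and each hand-written statistic here would be replaced by learned features.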




