🔍 SOTDeepfakeDetection_Mechanisms

May 3, 2025

A curated collection of GitHub repositories and papers covering state-of-the-art Deepfake Detection techniques, as categorized in our systematic review:

📄 Paper: Unmasking Synthetic Realities in the GAI Era: A Systematic Review of Uni-Modal and Multi-Modal Deepfake Detection for Enhanced Adversarial Robustness
🔗 [Link to paper (coming soon)]
🔗 GitHub Repository for Paper

Threats posed by Generative Artificial Intelligence (GAI)

One of the most pressing challenges associated with GAI is the proliferation of synthetic media, commonly known as DeepFakes. DeepFakes leverage generative models to manipulate visual, auditory, and textual content, posing substantial threats to digital security, democratic stability, and public trust.

Multi-modal Deepfake Detection — **Figure 1:** Illustration of a multi-stage pipeline in which a threat actor manipulates a source image using a text-conditional image generator, guided by identity-altering instructions, to produce a synthetic target image with modified personal attributes.

Broad taxonomy of Deepfake generation and detection strategies

This systematic review provides a comprehensive synthesis of deepfake detection methodologies, encompassing all primary modalities—image, video, audio, text, and multi-modal systems. It bridges critical gaps in the literature by integrating uni-modal and multi-modal approaches while offering a structured evaluation of their resilience to adversarial threats and their adaptability across diverse synthetic media contexts.

Deepfake Taxonomy — **Figure 2:** The broad taxonomy of Deepfake generation and detection strategies.

📚 Repository Structure

Uni-modal methods focus on domain-specific artifacts within images, audio, or text, leveraging spatial inconsistencies, frequency distortions, or linguistic anomalies to detect manipulations.
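As a rough illustration of what a "frequency distortion" cue can look like, the sketch below measures how much of an image's spectral energy lies above a radial frequency cutoff. It is a toy example and not the method of any paper listed here; the function name `high_freq_energy_ratio`, the `cutoff` value, and the synthetic checkerboard standing in for an up-sampling artifact are all assumptions made for illustration.

```python
import numpy as np


def high_freq_energy_ratio(gray: np.ndarray, cutoff: float = 0.25) -> float:
    """Return the fraction of spectral energy above `cutoff` cycles/pixel.

    Hypothetical helper for illustration only; real detectors learn far
    richer frequency features than this single scalar.
    """
    gray = np.asarray(gray, dtype=np.float64)
    gray = gray - gray.mean()  # drop the DC component
    power = np.abs(np.fft.fft2(gray)) ** 2
    # Radial frequency of every FFT bin, in cycles per pixel.
    fy = np.fft.fftfreq(gray.shape[0])[:, None]
    fx = np.fft.fftfreq(gray.shape[1])[None, :]
    radius = np.sqrt(fx**2 + fy**2)
    total = power.sum()
    if total == 0.0:
        return 0.0
    return float(power[radius > cutoff].sum() / total)


# Toy data: a smooth pattern, and the same pattern with a faint
# period-2 checkerboard added (an assumed stand-in for a generator artifact).
y, x = np.mgrid[0:64, 0:64]
smooth = np.sin(2 * np.pi * x / 32.0) + np.cos(2 * np.pi * y / 16.0)
checker = 0.5 * ((x + y) % 2)

ratio_smooth = high_freq_energy_ratio(smooth)
ratio_artifact = high_freq_energy_ratio(smooth + checker)
print(f"smooth: {ratio_smooth:.4f}  with artifact: {ratio_artifact:.4f}")
```

The image with the added checkerboard yields a visibly larger ratio than the smooth one. On real images a single threshold on this value would be far too crude, which is one reason the frequency-based methods listed below learn their features instead.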

📷 Image-Based Detection

2023

Learning on Gradients: Generalized Artifacts Representation for GAN-Generated Images Detection (CVPR 2023): GitHub
SepMark (ACM International Conference on Multimedia 2023): GitHub
DIRE for Diffusion-Generated Image Detection (ICCV 2023): GitHub
Interpretable-through-prototypes deepfake detection for diffusion models (ICCV workshop 2023): GitHub
PTW: Pivotal Tuning Watermarking for Pre-Trained Image Generators (USENIX 2023): GitHub
UniversalFakeDetect (CVPR 2023): GitHub

2024

DistilDIRE (ICML 2024): GitHub
An Analysis of Recent Advances in Deepfake Image Detection in an Evolving Threat Landscape (IEEE S&P 2024): GitHub
FreqNet-DeepfakeDetection (AAAI 2024): GitHub
LampMark: Proactive Deepfake Detection via Training-Free Landmark Perceptual Watermarks (ACM MM'24): GitHub
Rethinking the Up-Sampling Operations in CNN-based Generative Network for Generalizable Deepfake Detection (CVPR 2024): GitHub
FatFormer (CVPR 2024): GitHub
Frequency Masking for Universal Deepfake Detection (ICASSP 2024): GitHub
On the Exploitation of DCT-Traces in the Generative-AI Domain (IEEE ICIP 2024): GitHub
Faster Than Lies (CVPR Workshop 2024): GitHub
AI-assisted Deepfake Detection Using Adaptive Blind Image Watermarking (JVCIR 2024): GitHub
Deepfake Detection without Deepfakes (arXiv 2024): GitHub
Data-Independent Operator (arXiv 2024): GitHub
Adaptive Meta-Learning for Robust Deepfake Detection (arXiv 2024): GitHub

2025

DeepFeatureX Net (International Conference on Pattern Recognition 2025): GitHub
Optimized Frequency Collaborative Strategy Drives AI Image Detection (IEEE Internet of Things Journal 2025): GitHub
Wavelet-Driven Generalizable Framework for Deepfake Face Forgery Detection (WACV 2025): GitHub

🎞️ Video-Based Detection

2022

SeqDeepFake: Detecting and Recovering Sequential DeepFake Manipulation (ECCV 2022): GitHub
SeeABLE: Soft Discrepancies and Bounded Contrastive Learning for Exposing Deepfakes (arXiv 2022): GitHub

2023

Masked Relation Learning for DeepFake Detection (TIFS 2023): GitHub
MARLIN: Masked Autoencoder for facial video Representation LearnINg (CVPR 2023): GitHub
Contrastive Pseudo Learning for Open-World DeepFake Attribution (ICCV 2023): GitHub
Face Forgery Detection by 3D Decomposition and Composition Search (IEEE Transactions on Pattern Analysis and Machine Intelligence 2023): GitHub
DeepFidelity: Perceptual Forgery Fidelity Assessment for Deepfake Detection (arXiv 2023): GitHub
Implicit Identity Leakage: The Stumbling Block to Improving Deepfake Detection Generalization (CVPR 2023): GitHub
Unmasking Deepfake Faces from Videos Using An Explainable Cost-Sensitive Deep Learning Approach (ICCIT 2023): GitHub

2024

Exposing Lip-syncing Deepfakes from Mouth Inconsistencies (ICME 2024): GitHub
Learning Spatiotemporal Inconsistency via Thumbnail Layout for Face Deepfake Detection (IJCV 2024): GitHub
Open-Set Deepfake Detection: A Parameter-Efficient Adaptation Method with Forgery Style Mixture (arXiv 2024): GitHub
Deepfake Detection Using Spatiotemporal Transformer (ACM Trans. Multimedia Comput. Commun. Appl. 2024): GitHub
Improved Deepfake Video Detection Using Convolutional Vision Transformer (GEM 2024): GitHub
Exposing the Deception: Uncovering More Forgery Clues for Deepfake Detection (AAAI 2024): GitHub
LAA-Net: Localized Artifact Attention Network for Quality-Agnostic and Generalizable Deepfake Detection (CVPR 2024): GitHub
Preserving Fairness Generalization in Deepfake Detection (CVPR 2024): GitHub

2025

Deepfake detection using Enhanced Self Blended Images using DWT features (Image and Vision Computing 2025): GitHub
GrDT: Towards Robust Deepfake Detection using Geometric Representation Distribution and Texture (WACV workshop 2025): GitHub
DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection (IJCV 2025): GitHub
C2P-CLIP: Injecting Category Common Prompt in CLIP to Enhance Generalization in Deepfake Detection (AAAI 2025): GitHub
Robust face forgery detection integrating local texture and global texture information (Journal of Information Security 2025): GitHub
Improving the Perturbation-Based Explanation of Deepfake Detectors Through the Use of Adversarially-Generated Samples (WACVW 2025): GitHub

🔊 Audio-Based Detection

2023

BTS-E: Audio Deepfake Detection Using Breathing-Talking-Silence Encoder (ICASSP 2023): GitHub
A Conformer-based audio deepfake detection system with hierarchical pooling and multi-level classification token aggregation methods (arXiv 2023): GitHub

2024

What to remember: Self-adaptive continual learning for audio deepfake detection (AAAI 2024): GitHub
RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection (arXiv 2024): GitHub
Prompt Tuning for Audio Deepfake Detection (INTERSPEECH 2024): GitHub
A Robust Audio Deepfake Detection System via Multi-View Feature (ICASSP 2024): GitHub
Audio Deepfake Detection with XLS-R and SLS classifier (ACM International Conference on Multimedia 2024): GitHub
SafeEar: Content Privacy-Preserving Audio Deepfake Detection (ACM CCS 2024): GitHub
Cross-Domain-Audio-Deepfake-Detection-Dataset-and-Analysis (Conference on Empirical Methods in Natural Language Processing 2024): GitHub
CLAD: Robust Audio Deepfake Detection Against Manipulation Attacks with Contrastive Learning (arXiv 2024): GitHub

2025

Region-Based Optimization in Continual Learning for Audio Deepfake Detection (AAAI 2025): GitHub
SONICS: Synthetic Or Not - Identifying Counterfeit Songs (ICLR 2025): GitHub
Dual-Channel-Audio-Deepfake-Detection (IEEE Access 2025): GitHub

📝 Text-Based Detection

2023

Deepfake Text Detection: Limitations and Opportunities (IEEE S&P 2023): GitHub

2024

TopFormer: Topology-Aware Authorship Attribution of Deepfake Texts with Diverse Writing Styles (ECAI 2024): GitHub
MAGE: Machine-generated Text Detection in the Wild (Association for Computational Linguistics 2024): GitHub

2. Multi-modal Detection

2023

DE-FAKE: Detection and Attribution of Fake Images Generated by Text-to-Image Generation Models (CCS 2023): GitHub
Detecting and Grounding Multi-Modal Media Manipulation (CVPR 2023): GitHub
Audio-Visual Person-of-Interest DeepFake Detection (CVPR 2023): GitHub
Self-Supervised Video Forensics by Audio-Visual Anomaly Detection (CVPR 2023): GitHub
Detecting Deepfakes Without Seeing Any (arXiv 2023): GitHub
AltFreezing for More General Video Face Forgery Detection (CVPR 2023): GitHub
AVoiD-DF: Audio-Visual Joint Learning for Detecting Deepfake (IEEE 2023): GitHub

2024

AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors (arXiv 2024): GitHub
Raising the Bar of AI-generated Image Detection with CLIP (arXiv 2024): GitHub
Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics (arXiv 2024): GitHub
Harnessing the Power of Large Vision Language Models for Synthetic Image Detection (arXiv 2024): GitHub
GM-DF: Generalized Multi-Scenario Deepfake Detection (arXiv 2024): GitHub
Conditioned Prompt-Optimization for Continual Deepfake Detection (arXiv 2024): GitHub
Robust Diffusion Model-Generated Image Detection with CLIP (arXiv 2024): GitHub
On Learning Multi-Modal Forgery Representation for Diffusion Generated Video Detection (NeurIPS 2024): GitHub
Common Sense Reasoning for Deepfake Detection (arXiv 2024): GitHub
SpeechForensics: Audio-Visual Speech Representation Learning for Face Forgery Detection (NeurIPS 2024): GitHub
Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-syncing DeepFakes (NeurIPS 2024): GitHub
Cross-Modality and Within-Modality Regularization for Audio-Visual DeepFake Detection (arXiv 2024): GitHub
A Multi-Stream Fusion Approach with One-Class Learning for Audio-Visual Deepfake Detection (arXiv 2024): GitHub
AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection (CVPR 2024): GitHub
AVT2-DWF: Improving Deepfake Detection with Audio-Visual Fusion and Dynamic Weighting Strategies (arXiv 2024): GitHub
Audio Multi-View Spoofing Detection Framework Based on Audio-Text-Emotion Correlations (IEEE 2024): GitHub
A Hitchhiker's Guide to Fine-Grained Face Forgery Detection Using Common Sense Reasoning (NeurIPS 2024): GitHub
Where Deepfakes Gaze at? Spatial-Temporal Gaze Inconsistency Analysis for Video Face Forgery Detection (IEEE 2024): GitHub
FakeBench: Probing Explainable Fake Image Detection via Large Multimodal Models (arXiv 2024): GitHub
Vulnerability-Aware Spatio-Temporal Learning for Generalizable and Interpretable Deepfake Video Detection: GitHub
Masked Autoencoders enable strong Audio-Visual Early Fusion (CVPR 2024): GitHub

2025

GrDT: Towards Robust Deepfake Detection using Geometric Representation Distribution and Texture (WACV 2025): GitHub
Circumventing shortcuts in audio-visual deepfake detection datasets with unsupervised learning (arXiv 2025): GitHub

3. ✨ Subtle Edits Detection & Localization

2023

TruFor: Leveraging all-round clues for trustworthy image forgery detection and localization (CVPR 2023): GitHub
Hierarchical Fine-Grained Image Forgery Detection and Localization (CVPR 2023): GitHub
IML-ViT: Benchmarking Image Manipulation Localization by Vision Transformer (arXiv 2023): GitHub
Pre-training-free Image Manipulation Localization through Non-Mutually Exclusive Contrastive Learning (ICCV 2023): GitHub
Detect Any Deepfakes: Segment Anything Meets Face Forgery Detection and Localization (CCBR 2023): GitHub
Undercover Deepfakes: Detecting Fake Segments in Videos (ICCV 2023): GitHub
Locate and Verify: A Two-Stream Network for Improved Deepfake Detection (ACM MM'23): GitHub
Interpretable Spatial-Temporal Video Transformer for Deepfake Detection (IEEE 2023): GitHub
Glitch in the Matrix: A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization (CVIU 2023): GitHub
UMMAFormer: A Universal Multimodal-adaptive Transformer Framework For Temporal Forgery Localization (ACM MM'23): GitHub

2024

Exploring Multi-Modal Fusion for Image Manipulation Detection and Localization (MMM 2024): GitHub
Localization of Diffusion Model-Based Inpainting through Inter-Intra Similarity of Frequency Features (Elsevier 2024): GitHub
Weakly-supervised deepfake localization in diffusion-generated images (WACV 2024): GitHub
SAMIF: Adapting Segment Anything Model for Image Inpainting Forensics (ACCV 2024): GitHub
EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection (CVPR 2024): GitHub
OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking (arXiv 2024): GitHub
PIM-Net: Progressive Inconsistency Mining Network for Image Manipulation Localization (Pattern Recognition 2024): GitHub
HDF-Net: Capturing Homogeny Difference Features to Localize the Tampered Image (IEEE 2024): GitHub
FFAA: Face Forgery Analysis Assistant (arXiv 2024): GitHub
Toward real text manipulation detection: New dataset and new solution (Pattern Recognition 2024): GitHub
Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel Methods (CVPR 2024): GitHub
ObjectFormer for Image Manipulation Detection and Localization (CVPR 2024): GitHub
Spatio-Temporal Co-Attention Fusion Network for Video Splicing Localization (JEI 2024): GitHub
1M-Deepfakes Detection Challenge (MM 2024): GitHub
WWW: Where, Which and Whatever Enhancing Interpretability in Multimodal Deepfake Detection (arXiv 2024): GitHub
Contextual Cross-Modal Attention for Audio-Visual Deepfake Detection and Localization (IEEE 2024): GitHub
AVT2-DWF: Improving Deepfake Detection with Audio-Visual Fusion and Dynamic Weighting Strategies (arXiv 2024): GitHub
AudioSeal: Proactive Detection of Voice Cloning with Localized Watermarking (ICML 2024): GitHub
Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization (ACM MM'24): GitHub
Multi-spectral Class Center Network for Face Manipulation Detection and Localization (arXiv 2024): GitHub

2025

Is JPEG AI going to change image forensics? (arXiv 2025): GitHub
A Lightweight and Effective Image Tampering Localization Network with Vision Mamba (arXiv 2025): GitHub
Exploring Multi-View Pixel Contrast for General and Robust Image Forgery Localization (IEEE 2025): GitHub
DeCLIP: Decoding CLIP representations for deepfake localization (WACV 2025): GitHub
Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization through Sparse-Coding Transformer (AAAI 2025): GitHub
Dense Feature Interaction Network for Image Inpainting Localization (IEEE 2025): GitHub
FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models (ICLR 2025): GitHub
Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation Localization (AAAI 2025): GitHub
Localization and detection of deepfake videos based on self-blending method (nature 2025): GitHub
VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs (NAACL 2025): GitHub
LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation (IEEE 2025): GitHub
Adversarial Training and Gradient Optimization for Partially Deepfake Audio Localization (ICASSP 2025): GitHub
HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection (WACV 2025): GitHub

4. 🧪 Adversarial Attack Methods on Deepfake Detection Models

2023

Evading DeepFake Detectors via Adversarial Statistical Consistency (CVPR 2023): GitHub
Imperceptible Adversarial Attack via Invertible Neural Networks (AAAI 2023): GitHub
AVA: Inconspicuous Attribute Variation-based Adversarial Attack bypassing DeepFake Detection (arXiv 2023): GitHub
DiffProtect: Generate Adversarial Examples with Diffusion Models for Facial Privacy Protection (arXiv 2023): GitHub

2024

Towards Quantitative Evaluation of Explainable AI Methods for Deepfake Detection (ICASSP 2024): GitHub
Robustness of AI-Image Detectors: Fundamental Limits and Practical Attacks (ICLR 2024): GitHub
2D-Malafide: Adversarial Attacks Against Face Deepfake Detection Systems (BIOSIG 2024): GitHub
StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model (ACM MM'24): GitHub
DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection (CVPR 2024): GitHub

2025

Efficient Generation of Targeted and Transferable Adversarial Examples for Vision-Language Models Via Diffusion Models (IEEE 2025): GitHub
Unlocking The Potential of Adaptive Attacks on Diffusion-Based Purification (arXiv 2025): GitHub

5. 🧪 Adversarial Robustness-Focused Methods

2023

Defense against Adversarial Attacks on Audio DeepFake Detection (Proc. INTERSPEECH 2023): GitHub

2024

XAI-Based Detection of Adversarial Attacks on Deepfake Detectors (TMLR 2024): GitHub

2025

CapsFake: A Multimodal Capsule Network for Detecting Instruction-Guided Deepfakes (arXiv 2025): GitHub

6. 🚀 Datasets & Benchmarks

A list of datasets used or referenced in the reviewed papers.

Flickr-Faces-HQ Dataset (FFHQ): GitHub
COCOFake: Parents and Children: Distinguishing Multimodal DeepFakes from Natural Images (ACM MCCA 2024): GitHub
ForenSynths (CVPR 2020): GitHub
FaceForensics++: Learning to Detect Manipulated Facial Images (ICCV 2019): GitHub
DFDC: DeepFake Detection Challenge Dataset (arXiv 2020): Link to Download
Celeb-DF: A Large-scale Challenging Dataset for DeepFake Forensics (CVPR 2020): GitHub
OpenLLMText Dataset: Link to Download
TweepFake: about Detecting Deepfake Tweets (arXiv 2021): GitHub
fake-and-real-news-dataset: Link to Download
FakeAVCeleb: A Novel Audio-Video Multimodal Deepfake Dataset (arXiv 2021): GitHub
AV-Deepfake1M: A large-scale LLM-driven audio-visual deepfake dataset (MM 2024): GitHub

| Modality | Dataset Examples |
| --- | --- |
| Image | FFHQ, ProGAN, COCOFake, ForenSynths |
| Video | FaceForensics++, DFDC, Celeb-DF |
| Audio | ASVspoof, FakeAVCeleb |
| Text | OpenLLMText, TweepFake, RealNews |
| Multi-modal | FakeAVCeleb, AV-Deepfake1M |
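For scripted use, the modality table above could be mirrored as a small lookup structure. This is only a sketch: the dictionary copies the table entries verbatim, while the names `DATASETS_BY_MODALITY` and `modalities_for` are hypothetical and not part of this repository.

```python
# Mirrors the modality/dataset table above; names are illustrative assumptions.
DATASETS_BY_MODALITY = {
    "Image": ["FFHQ", "ProGAN", "COCOFake", "ForenSynths"],
    "Video": ["FaceForensics++", "DFDC", "Celeb-DF"],
    "Audio": ["ASVspoof", "FakeAVCeleb"],
    "Text": ["OpenLLMText", "TweepFake", "RealNews"],
    "Multi-modal": ["FakeAVCeleb", "AV-Deepfake1M"],
}


def modalities_for(dataset: str) -> list[str]:
    """Return every modality row that lists `dataset` (case-insensitive)."""
    wanted = dataset.casefold()
    return [
        modality
        for modality, names in DATASETS_BY_MODALITY.items()
        if wanted in (name.casefold() for name in names)
    ]


print(modalities_for("FakeAVCeleb"))
```

A reverse lookup like this makes it easy to spot datasets that appear under more than one modality, such as FakeAVCeleb, which the table lists under both Audio and Multi-modal.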

🤝 Contributing

We welcome contributions from the community to keep this repository up-to-date with new methods, datasets, and benchmarks. Please open an issue or a pull request if you'd like to contribute.

📫 Contact

For questions or collaborations, reach out to:
📧 Naseem Khan — nakh12498@hbku.edu.qa
🏛️ Hamad Bin Khalifa University, Qatar