πŸ” SOTDeepfakeDetection_Mechanisms

May 3, 2025 Β· View on GitHub

A curated collection of GitHub repositories and papers covering state-of-the-art Deepfake Detection techniques, as categorized in our systematic review:

πŸ“„ Paper: Unmasking Synthetic Realities in the GAI Era: A Systematic Review of Uni-Modal and Multi-Modal Deepfake Detection for Enhanced Adversarial Robustness
πŸ”— [Link to paper (coming soon)]
πŸ”— GitHub Repository for Paper


Threats posed by Generative Artificial Intelligence (GAI)

The most pressing challenges associated with GAI is the proliferation of synthetic media, commonly known as DeepFakes. DeepFakes leverage generative models to manipulate visual, auditory, and textual content, posing substantial threats to digital security, democratic stability, and public trust.

Multi-modal Deepfake Detection
Figure 1: Illustration of a multi-stage pipeline in which a threat actor manipulates a source image using a text-conditional image generator, guided by identity-altering instructions, to produce a synthetic target image with modified personal attributes.

Broad taxonomy of Deepfake generation and detection strategies

This systematic review provides a comprehensive synthesis of deepfake detection methodologies, encompassing all primary modalitiesβ€”image, video, audio, text, and multi-modal systems. It bridges critical gaps in the literature by integrating uni-modal and multi-modal approaches while offering a structured evaluation of their resilience to adversarial threats and their adaptability across diverse synthetic media contexts.

Deepfake Taxonomy
Figure 2: The broad taxonomy of Deepfake generation and detection strategies.

πŸ“š Repository Structure

1. 🧠 Uni-modal Deepfake Detection

Uni-modal methods focus on domain-specific artifacts within images, audio, or text, leveraging spatial inconsistencies, frequency distortions, or linguistic anomalies to detect manipulations.

πŸ“· Image-Based Detection

2023

  • Learning on Gradients: Generalized Artifacts Representation for GAN-Generated Images Detection (CVPR 2023): GitHub
  • SepMark (ACM International Conference on Multimedia 2023): GitHub
  • DIRE for Diffusion-Generated Image Detection (ICCV 2023): GitHub
  • Interpretable-through-prototypes deepfake detection for diffusion models (ICCV workshop 2023): GitHub
  • PTW: Pivotal Tuning Watermarking for Pre-Trained Image Generators (USENIX 2023): GitHub
  • UniversalFakeDetect (CVPR 2023): GitHub

2024

  • DistilDIRE (ICML 2024): GitHub
  • An Analysis of Recent Advances in Deepfake Image Detection in an Evolving Threat Landscape (IEEE S&P 2024): GitHub
  • FreqNet-DeepfakeDetection (AAAI 2024): GitHub
  • LampMark: Proactive Deepfake Detection via Training-Free Landmark Perceptual Watermarks (ACM MM'24): GitHub
  • Rethinking the Up-Sampling Operations in CNN-based Generative Network for Generalizable Deepfake Detection: (CVPR 2024): GitHub
  • FatFormer (CVPR 2024): GitHub
  • FREQUENCY MASKING FOR UNIVERSAL DEEPFAKE DETECTION (ICASSP 2024): GitHub
  • ON THE EXPLOITATION OF DCT-TRACES IN THE GENERATIVE-AI DOMAIN (IEEE ICIP 2024): GitHub
  • Faster Than Lies (CVPR Workshop 2024): GitHub
  • AI-assisted Deepfake Detection Using Adaptive Blind Image Watermarking (Jvcir 2024): GitHub
  • Deepfake Detection without Deepfakes (arXive 2024): GitHub
  • Data-Independent Operator: (arXiv 2024): GitHub
  • Adaptive Meta-Learning for Robust Deepfake Detection:(arXive 2024): GitHub

2025

  • DeepFeatureX Net: (International Conference on Pattern Recognition 2025): GitHub
  • Optimized Frequency Collaborative Strategy Drives AI Image Detection (IEEE Internet of Things Journal 2025): GitHub
  • Wavelet-Driven Generalizable Framework for Deepfake Face Forgery Detection (WACV 2025): GitHub

🎞️ Video-Based Detection

2022

  • SeqDeepFake: Detecting and Recovering Sequential DeepFake Manipulation (ECCV 2022): GitHub
  • SeeABLE: Soft Discrepancies and Bounded Contrastive Learning for Exposing Deepfakes (arXiv 2022): GitHub

2023

  • Masked Relation Learning for DeepFake Detection (TIFS 2023): GitHub
  • MARLIN: Masked Autoencoder for facial video Representation LearnINg (CVPR 2023): GitHub
  • Contrastive Pseudo Learning for Open-World DeepFake Attribution (ICCV 2023): GitHub
  • Face Forgery Detection by 3D Decomposition and Composition Search (IEEE Transactions on Pattern Analysis and Machine Intelligence 2023): GitHub
  • DeepFidelity: Perceptual Forgery Fidelity Assessment for Deepfake Detection (arXiv 2023): GitHub
  • Implicit Identity Leakage: The Stumbling Block to Improving Deepfake Detection Generalization (CVPR 2023): GitHub
  • Unmasking Deepfake Faces from Videos An Explainable Cost-Sensitive Deep Learning Approach (ICCIT 2023): GitHub

2024

  • Exposing Lip-syncing Deepfakes from Mouth Inconsistencies (ICME 2024): GitHub
  • Learning Spatiotemporal Inconsistency via Thumbnail Layout for Face Deepfake Detection (IJCV 2024): GitHub
  • Open-Set Deepfake Detection: A Parameter-Efficient Adaptation Method with Forgery Style Mixture (arXiv 2024): GitHub
  • Deepfake Detection Using Spatiotemporal Transformer (ACM Trans. Multimedia Comput. Commun. Appl. 2024): GitHub
  • Improved Deepfake Video Detection Using Convolutional Vision Transformer (GEM 2024): GitHub
  • Exposing the Deception: Uncovering More Forgery Clues for Deepfake Detection (AAAI 2024): GitHub
  • LAA-Net: Localized Artifact Attention Network for Quality-Agnostic and Generalizable Deepfake Detection (CVPR 2024): GitHub
  • Preserving Fairness Generalization in Deepfake Detection (CVPR 2024): GitHub

2025

  • Deepfake detection using Enhanced Self Blended Images using DWT features (Image and Vision Computing 2025): GitHub
  • GrDT: Towards Robust Deepfake Detection using Geometric Representation Distribution and Texture (WACV workshop 2025): GitHub
  • DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection (IJCV 2025): GitHub
  • C2P-CLIP: Injecting Category Common Prompt in CLIP to Enhance Generalization in Deepfake Detection (AAAI 2025): GitHub
  • Robust face forgery detection integrating local texture and global texture information (Journal of Information Security 2025): GitHub
  • Improving the Perturbation-Based Explanation of Deepfake Detectors Through the Use of Adversarially-Generated Samples (WACVW 2025): GitHub

πŸ”Š Audio-Based Detection

2023

  • BTS-E: Audio Deepfake Detection Using Breathing-Talking-Silence Encoder (ICASSP 2023): GitHub
  • A Conformer-based audio deepfake detection system with hierarchical pooling and multi-level classification token aggregation methods (arXiv 2023): GitHub

2024

  • What to remember: Self-adaptive continual learning for audio deepfake detection (AAAI 2024): GitHub
  • RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection (arXiv 2024): GitHub
  • Prompt Tuning for Audio Deepfake Detection: (INTERSPEECH 2024): GitHub
  • A ROBUST AUDIO DEEPFAKE DETECTION SYSTEM VIA MULTI-VIEW FEATURE (ACASSP 2024): GitHub
  • Audio Deepfake Detection with XLS-R and SLS classfier (ACM International Conference on Multimedia 2024): GitHub
  • SafeEar: Content Privacy-Preserving Audio Deepfake Detection (ACM CCS 2024): GitHub
  • Cross-Domain-Audio-Deepfake-Detection-Dataset-and-Analysis (Conference on Empirical Methods in Natural Language Processing 2024): GitHub
  • CLAD: Robust Audio Deepfake Detection Against Manipulation Attacks with Contrastive Learning (arXiv 2024): GitHub

2025

  • Region-Based Optimization in Continual Learning for Audio Deepfake Detection (AAAI 2025): GitHub
  • SONICS: Synthetic Or Not - Identifying Counterfeit Songs (ICLR 2025): GitHub
  • Dual-Channel-Audio-Deepfake-Detection (IEEE Access 2025): GitHub

πŸ“ Text-Based Detection

2023

  • Deepfake Text Detection: Limitations and Opportunities (IEEE S&P 2023): GitHub

2024

  • TopFormer: Topology-Aware Authorship Attribution of Deepfake Texts with Diverse Writing Styles(ECAI 2024): GitHub
  • MAGE: Machine-generated Text Detection in the Wild (Association for Computational Linguistics 2024): GitHub

2. πŸ”€ Multi-modal Deepfake Detection

2023

  • DE-FAKE: Detection and Attribution of Fake Images Generated by Text-to-Image Generation Models (CCS 2023): GitHub
  • Detecting and Grounding Multi-Modal Media Manipulation (CVPR 2023): GitHub
  • Audio-Visual Person-of-Interest DeepFake Detection (CVPR 2023): GitHub
  • Self-Supervised Video Forensics by Audio-Visual Anomaly Detection (CVPR 2023): GitHub
  • Detecting Deepfakes Without Seeing Any (arXiv 2023): GitHub
  • AltFreezing for More General Video Face Forgery Detection (CVPR 2023): GitHub
  • AVoiD-DF: Audio-Visual Joint Learning for Detecting Deepfake (IEEE 2023): GitHub

2024

  • AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors (arXive 2024): GitHub
  • Raising the Bar of AI-generated Image Detection with CLIP (arXiv 2024): GitHub
  • Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics (arXiv 2024): GitHub
  • Harnessing the Power of Large Vision Language Models for Synthetic Image Detection (arXiv 2024): GitHub
  • GM-DF: Generalized Multi-Scenario Deepfake Detection (arXiv 2024): GitHub
  • Conditioned Prompt-Optimization for Continual Deepfake Detection (arXiv 2024): GitHub
  • Robust Diffusion Model-Generated Image Detection with CLIP (arXiv 2024): GitHub
  • On Learning Multi-Modal Forgery Representation for Diffusion Generated Video Detection (NeurIPS 2024): GitHub
  • Common Sense Reasoning for Deepfake Detection (arXiv 2024): GitHub
  • SpeechForensics: Audio-Visual Speech Representation Learning for Face Forgery Detection (NeurIPS 2024): GitHub
  • Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-syncing DeepFakes (NeruIPS 2024): GitHub
  • Cross-Modality and Within-Modality Regularization for Audio-Visual DeepFake Detection (arXiv 2024): GitHub
  • A Multi-Stream Fusion Approach with One-Class Learning for Audio-Visual Deepfake Detection (arXiv 2024): GitHub
  • AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection (CVPR 2024): GitHub
  • AVT2-DWF: Improving Deepfake Detection with Audio-Visual Fusion and Dynamic Weighting Strategies (arXiv 2024): GitHub
  • Audio Multi-View Spoofing Detection Framework Based on Audio-Text-Emotion Correlations (IEEE 2024): GitHub
  • A Hitchhikers Guide to Fine-Grained Face Forgery Detection Using Common Sense Reasoning (NeurIPS 2024): GitHub
  • Where Deepfakes Gaze at? Spatial-Temporal Gaze Inconsistency Analysis for Video Face Forgery Detection (IEEE 2024): GitHub
  • FakeBench: Probing Explainable Fake Image Detection via Large Multimodal Models (arXiv 2024): GitHub
  • Vulnerability-Aware Spatio-Temporal Learning for Generalizable and Interpretable Deepfake Video Detection: GitHub
  • Masked Autoencoders enable strong Audio-Visual Early Fusion (CVPR 2024): GitHub

2025

  • GrDT: Towards Robust Deepfake Detection using Geometric Representation Distribution and Texture (WACV 2025): GitHub
  • Circumventing shortcuts in audio-visual deepfake detection datasets with unsupervised learning (arXiv 2025): GitHub

3. ✨ Subtle Edits Detection & Localization

2023

  • TruFor: Leveraging all-round clues for trustworthy image forgery detection and localization (CVPR 2023): GitHub
  • Hierarchical Fine-Grained Image Forgery Detection and Localization (CVPR 2023): GitHub
  • IML-ViT: Benchmarking Image Manipulation Localization by Vision Transformer (arXiv 2023): GitHub
  • Pre-training-free Image Manipulation Localization through Non-Mutually Exclusive Contrastive Learning (ICCV 2023): GitHub
  • Detect Any Deepfakes: Segment Anything Meets Face Forgery Detection and Localization (CCBR 2023): GitHub
  • Undercover Deepfakes: Detecting Fake Segments in Videos (ICCV 2023): GitHub
  • Locate and Verify: A Two-Stream Network for Improved Deepfake Detection (ACM MM'23): GitHub
  • Interpretable Spatial-Temporal Video Transformer for Deepfake Detection (IEEE 2023): GitHub
  • Glitch in the Matrix: A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization (CVIU 2023): GitHub
  • UMMAFormer: A Universal Multimodal-adaptive Transformer Framework For Temporal Forgery Localization (ACM MM'23): GitHub

2024

  • Exploring Multi-Modal Fusion for Image Manipulation Detection and Localization (MMM 2024): GitHub
  • Localization of Diffusion Model-Based Inpainting through Inter-Intra Similarity of Frequency Features (Elsevier 2024): GitHub
  • Weakly-supervised deepfake localization in diffusion-generated images (WACV 2024): GitHub
  • SAMIF: Adapting Segment Anything Model for Image Inpainting Forensics (ACCV 2024): GitHub
  • EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection (CVPR 2024): GitHub
  • OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking (arXiv 2024): GitHub
  • PIM-Net: Progressive Inconsistency Mining Network for Image Manipulation Localization (Pattern Recognition 2024): GitHub
  • HDF-Net: Capturing Homogeny Difference Features to Localize the Tampered Image (IEEE 2024): GitHub
  • FFAA: Face Forgery Analysis Assistant (arXiv 2024): GitHub
  • Toward real text manipulation detection: New dataset and new solution (Pattern Recogntion, 2024): GitHub
  • Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel Methods(CVPR 2024): GitHub
  • ObjectFormer for Image Manipulation Detection and Localization (CVPR 2024): GitHub
  • Spatio-Temporal Co-Attention Fusion Network for Video Splicing Localization (JEI 2024): GitHub
  • 1M-Deepfakes Detection Challenge (MM 2024): GitHub
  • WWW: Where, Which and Whatever Enhancing Interpretability in Multimodal Deepfake Detection (arXiv 2024): GitHub
  • Contextual Cross-Modal Attention for Audio-Visual Deepfake Detection and Localization (IEEE 2024): GitHub
  • AVT2-DWF: Improving Deepfake Detection with Audio-Visual Fusion and Dynamic Weighting Strategies (arXiv 2024): GitHub
  • AudioSeal: Proactive Detection of Voice Cloning with Localized Watermarking (ICML 2024): GitHub
  • Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization (ACM MM'24): GitHub
  • Multi-spectral Class Center Network for Face Manipulation Detection and Localization (arXiv 2024): GitHub

2025

  • Is JPEG AI going to change image forensics? (arXiv 2025): GitHub
  • A Lightweight and Effective Image Tampering Localization Network with Vision Mamba (arXiv 2025): GitHub
  • Exploring Multi-View Pixel Contrast for General and Robust Image Forgery Localization (IEEE 2025): GitHub
  • DeCLIP: Decoding CLIP representations for deepfake localization (WACV 2025): GitHub
  • Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization through Spare-Coding Transformer (AAAI 2025): GitHub
  • Dense Feature Interaction Network for Image Inpainting Localization (IEEE 2025): GitHub
  • FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models (ICLR 2025): GitHub
  • Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation Localization (AAAI 2025): GitHub
  • Localization and detection of deepfake videos based on self-blending method (nature 2025): GitHub
  • VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs (NAACL 2025): GitHub
  • LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation (IEEE 2025): GitHub
  • Adversarial Training and Gradient Optimization for Partially Deepfake Audio Localization (ICASSP 2025): GitHub
  • HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection (WACV 2025): GitHub

4. πŸ§ͺ Adversarial attacks Methods on Deepfake detection models

2023

  • Evading DeepFake Detectors via Adversarial Statistical Consistency (CVPR 2023): GitHub
  • Imperceptible Adversarial Attack via Invertible Neural Networks (AAAI 2023): GitHub
  • AVA: Inconspicuous Attribute Variation-based Adversarial Attack bypassing DeepFake Detection (arXiv 2023): GitHub
  • DiffProtect: Generate Adversarial Examples with Diffusion Models for Facial Privacy Protection (arXiv 2023): GitHub

2024

  • Towards Quantitative Evaluation of Explainable AI Methods for Deepfake Detection (ICASSP 2024): GitHub
  • ROBUSTNESS OF AI-IMAGE DETECTORS: FUNDAMENTAL LIMITS AND PRACTICAL ATTACKS (ICLR 2024): GitHub
  • 2D-Malafide: Adversarial Attacks Against Face Deepfake Detection Systems (BIOSIG 2024): GitHub
  • StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model (ACM MM'24): GitHub
  • DiffAM: Diffusion-based Adversarial Makeup Transfer for Facial Privacy Protection (CVPR 2024): GitHub

2025

  • Efficient Generation of Targeted and Transferable Adversarial Examples for Vision-Language Models Via Diffusion Models (IEEE 2025): GitHub
  • Unlocking The Potential of Adaptive Attacks on Diffusion-Based Purification (arXiv 2025): GitHub

5. πŸ§ͺ Adversarial Robustness-Focused Methods

2023

  • Defense against Adversarial Attacks on Audio DeepFake Detection (Proc. INTERSPEECH 2023): GitHub

2024

  • XAI-Based Detection of Adversarial Attacks on Deepfake Detectors (TMLR 2024): GitHub

2025

  • CapsFake: A Multimodal Capsule Network for Detecting Instruction-Guided Deepfakes (arXiv 2025): GitHub

6. πŸš€ Datasets & Benchmarks

A list of datasets used or referenced in the reviewed papers.

  • Flickr-Faces-HQ Dataset (FFHQ): GitHub

  • COCOFake: Parents and Children: Distinguishing Multimodal DeepFakes from Natural Images (ACM MCCA 2024): GitHub

  • ForenSynths (CVPR 2020): GitHub

  • FaceForensics++: Learning to Detect Manipulated Facial Images (ICCV 2019): GitHub

  • DFDC: DeepFake Detection Challenge Dataset (arXiv 2020): Link to Download

  • Celeb-DF: A Large-scale Challenging Dataset for DeepFake Forensics (CVPR 2020): GitHub

  • OpenLLMText Dataset Link to Download

  • TweepFake: about Detecting Deepfake Tweets (arXiv 2021): GitHub

  • fake-and-real-news-dataset: Link to Download

  • FakeAVCeleb: A Novel Audio-Video Multimodal Deepfake Dataset (arXiv 2021): GitHub

  • AV-Deepfake1M: A large-scale LLM-driven audio-visual deepfake dataset (MM 2024): GitHub

ModalityDataset Examples
ImageFFHQ, ProGAN, COCOFake, ForenSynths
VideoFaceForensics++, DFDC, Celeb-DF
AudioASVspoof, FakeAVCeleb
TextOpenLLMText, TweepFake, RealNews
Multi-modalFakeAVCeleb, AV-Deepfake1M

🀝 Contributing

We welcome contributions from the community to keep this repository up-to-date with new methods, datasets, and benchmarks. Please open an issue or a pull request if you'd like to contribute.


πŸ“« Contact

For questions or collaborations, reach out to:
πŸ“§ Naseem Khan β€” nakh12498@hbku.edu.qa
πŸ›οΈ Hamad Bin Khalifa University, Qatar


Β© 2025 | For academic and non-commercial use only.