[ICCV 2025] Beyond Spatial Frequency: Pixel-wise Temporal Frequency-based Deepfake Video Detection
December 23, 2025
This repository contains the official implementation of our ICCV 2025 paper, "Beyond Spatial Frequency: Pixel-wise Temporal Frequency-based Deepfake Video Detection" (arXiv page).
Abstract
We introduce a deepfake video detection approach that exploits pixel-wise temporal inconsistencies, which traditional spatial frequency-based detectors often overlook. Traditional detectors represent temporal information merely by stacking spatial frequency spectra across frames, resulting in the failure to detect temporal artifacts in the pixel plane. Our approach performs a 1D Fourier transform on the time axis for each pixel, extracting features highly sensitive to temporal inconsistencies, especially in areas prone to unnatural movements. To precisely locate regions containing the temporal artifacts, we introduce an attention proposal module trained in an end-to-end manner. Additionally, our joint transformer module effectively integrates pixel-wise temporal frequency features with spatio-temporal context features, expanding the range of detectable forgery artifacts. Our framework represents a significant advancement in deepfake video detection, providing robust performance across diverse and challenging detection scenarios.
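The core idea in the abstract, a 1D Fourier transform along the time axis for each pixel, can be illustrated with a small NumPy sketch. This is not the repository's implementation: the function name `pixelwise_temporal_spectrum`, the grayscale `(T, H, W)` input layout, and the mean-subtraction step are assumptions made for illustration only.

```python
import numpy as np


def pixelwise_temporal_spectrum(video: np.ndarray) -> np.ndarray:
    """Hypothetical helper: amplitude spectrum over time for every pixel.

    video: array of shape (T, H, W), grayscale frames stacked along axis 0.
    Returns an array of shape (T // 2 + 1, H, W).
    """
    video = np.asarray(video, dtype=np.float64)
    # Assumption: subtract each pixel's temporal mean so the DC term
    # does not dominate the spectrum.
    centered = video - video.mean(axis=0, keepdims=True)
    # 1D real FFT along the time axis, computed independently per pixel.
    spectrum = np.fft.rfft(centered, axis=0)
    return np.abs(spectrum)


# Toy clip: 16 frames of 4x4 pixels; a single pixel oscillates
# with 3 full cycles over the clip, all other pixels stay constant.
T, H, W = 16, 4, 4
t = np.arange(T)
clip = np.zeros((T, H, W))
clip[:, 1, 2] = np.sin(2 * np.pi * 3 * t / T)

amp = pixelwise_temporal_spectrum(clip)
print(amp.shape)                    # (9, 4, 4)
print(int(amp[:, 1, 2].argmax()))   # 3
```

In this toy clip only the oscillating pixel has energy, and it peaks at temporal frequency bin 3. Stacking per-frame spatial spectra would not isolate this per-pixel temporal pattern, which is the gap the abstract describes.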
Environment Setting
System Setting
apt-get update
apt-get install -y libgl1-mesa-glx
apt-get install -y libglib2.0-0
apt-get install -y libsm6
apt-get install -y libxext6
apt-get install -y libxrender-dev
apt-get install -y libx11-6
Python Dependencies
pip install opencv-python sympy timm simplejson fvcore
pip install torchmetrics pytorch-losses
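After installing, a quick import check can confirm that the Python dependencies are visible to the interpreter. The sketch below is an optional convenience, not part of the repository. The helper name `missing_packages` is hypothetical, and `pytorch-losses` is left out because its import name is not stated here.

```python
import importlib.util

# pip package name -> import name.
# "pytorch-losses" is intentionally omitted: its import name is not
# given in this README, so it is not guessed here.
REQUIRED = {
    "opencv-python": "cv2",
    "sympy": "sympy",
    "timm": "timm",
    "simplejson": "simplejson",
    "fvcore": "fvcore",
    "torchmetrics": "torchmetrics",
}


def missing_packages(required=REQUIRED):
    """Return the pip names whose import module cannot be found."""
    return sorted(
        pkg
        for pkg, module in required.items()
        if importlib.util.find_spec(module) is None
    )


missing = missing_packages()
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All listed packages are importable.")
```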
Updates
- inference:
./inference/test_on_raw_video.py --video [video_path] --out_dir [output_path] --model_path [model_path]
- model weights: Google Drive
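To run the inference script on several videos, the command can be assembled programmatically. The following is a minimal sketch that uses only the three flags shown above. The helper names, the `*.mp4` pattern, the example file names, and launching the script through the current Python interpreter are all assumptions, not documented behavior of the repository.

```python
import sys
from pathlib import Path

# Script path as listed in this README.
SCRIPT = "./inference/test_on_raw_video.py"


def build_inference_command(video_path, out_dir, model_path):
    """Hypothetical helper: argument list for one inference run."""
    return [
        sys.executable,  # assumption: run the script with the current interpreter
        SCRIPT,
        "--video", str(video_path),
        "--out_dir", str(out_dir),
        "--model_path", str(model_path),
    ]


def commands_for_directory(video_dir, out_dir, model_path, pattern="*.mp4"):
    """Hypothetical helper: one command per video file matching `pattern`."""
    videos = sorted(Path(video_dir).glob(pattern))
    return [build_inference_command(v, out_dir, model_path) for v in videos]


# Example with placeholder paths (these files are hypothetical).
cmd = build_inference_command("sample.mp4", "results", "weights.pth")
print(" ".join(cmd[1:]))
```

Each returned list can be passed to `subprocess.run(cmd, check=True)` from the repository root.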
Key References for Video Deepfake Detection
The following works have significantly influenced our understanding and design choices for video deepfake detection.
FTCN: Exploring Temporal Coherence for More General Video Face Forgery Detection (ICCV 2021)
- GitHub: https://github.com/yinglinzheng/FTCN
- Paper: arXiv:2108.06693
AltFreezing: Alternating Freezing for More General Video Face Forgery Detection (CVPR 2023)
- GitHub: https://github.com/ZhendongWang6/AltFreezing
- Paper: CVPR 2023 Paper
StyleFlow: Exploiting Style Latent Flows for Generalizing Deepfake Video Detection (CVPR 2024)
- GitHub: https://github.com/jongwook-Choi/StyleFlow
- Paper: arXiv:2403.06592