[CVPR 2026 Highlight] CDA-VSR: Compressed-Domain-Aware Online Video Super-Resolution
June 14, 2026 · View on GitHub
CDA-VSR is a compressed-domain-aware framework for online video super-resolution.
It explicitly exploits motion vectors, residual maps, and frame types from the compressed bitstream to improve both reconstruction quality and inference efficiency.
News • Overview • Results • Repository Structure • How to use • Citation
News
- [2026/03] Repository initialized.
- [2026/03] README released.
- [2026/04] Pretrained model and dataset released.
Overview
Online video super-resolution (VSR) aims to recover high-resolution (HR) frames from low-resolution (LR) inputs under strict latency constraints, where only the current and previous frames are available. Most existing online VSR methods operate only on decoded LR frames and often suffer from expensive motion estimation or limited temporal modeling efficiency.
To address this issue, CDA-VSR explicitly introduces compressed-domain priors into online VSR, including:
- Motion Vectors (MV) for efficient coarse alignment
- Residual Maps for selective temporal fusion
- Frame Types (I/P frames) for adaptive reconstruction
Based on these priors, CDA-VSR contains three key modules:
- MVGDA: Motion-Vector-Guided Deformable Alignment
- RMGF: Residual Map Gated Fusion
- FTAR: Frame-Type-Aware Reconstruction
This design enables CDA-VSR to achieve a favorable trade-off between restoration quality and runtime efficiency.
Results
Quantitative Highlights
- On REDS4, CDA-VSR achieves approximately 90 FPS while maintaining competitive or superior reconstruction quality.
Qualitative Comparison
CDA-VSR reconstructs clearer edges and finer textures than representative online VSR baselines, especially in regions with large motion and compression artifacts.
Repository Structure
CDA-VSR/
├── basicsr/ # Core framework
│ ├── archs/ # Network architectures
│ ├── data/ # Dataset definitions
│ ├── losses/ # Loss functions
│ ├── metrics/ # Evaluation metrics
│ ├── models/ # Model wrappers
│ ├── train.py # Training entry
│ └── test.py # Testing entry
├── options/
│ ├── train/
│ │ └── train_CDA-VSR.yaml
│ └── test/
│ ├── test_CDA-VSR_REDS4.yaml
│ └── test_CDA-VSR_Inter4K.yaml
├──pretrained_models/
└── best.pth
├── figures/ # README figures
├── requirement.txt
├── LICENSE.txt
└── README.md
Dataset and Pretrained Models
Dataset
We provide the processed REDS compressed-domain dataset used in this work.
| Dataset | Description | Download |
|---|---|---|
| REDS_CRF18_23_28_new | Processed REDS dataset with compressed-domain information, including LR frames, motion vectors, residual maps, and frame-type information under different CRF settings. | Baidu Netdisk |
Extraction code: zc8e
After downloading, please organize the dataset as follows:
REDS_CRF18_23_28_new/
├── train/
└── test/
Pretrained Models
pretrained_models/
└── best.pth
How to use
Requirements
- Python 3.8+
- PyTorch 1.10+
- CUDA 11.x
Training
Run training with:
python basicsr/train.py -opt options/train/train_CDA-VSR.yaml
Testing
Test on REDS4
Run training with:
python basicsr/test.py -opt options/test/test_CDA-VSR_REDS4.yaml
Test on REDS4
Run evaluation with:
python basicsr/test.py -opt options/test/test_CDA-VSR_Inter4K.yaml
Citation
If you find this repository useful in your research, please cite:
@article{wang2026cdavsr,
title={Compressed-Domain-Aware Online Video Super-Resolution},
author={Wang, Yuhang and Li, Hai and Hou, Shujuan and Dong, Zhetao and Yang, Xiaoyao},
journal={arXiv preprint arXiv:2603.07694},
year={2026}
}
Contact
Please leave a issue or contact Yuhang Wang with bitwangyuhang@163.com
License and Acknowledgement
This project is built upon BasicSR and TMP. We sincerely thank the authors for making their code publicly available.