ArbDR
February 27, 2026 ยท View on GitHub
Cascaded Robust Rectification for Arbitrary Document Images
Official implementation of "Cascaded Robust Rectification for Arbitrary Document Images"
Accepted by IEEE Transactions on Multimedia (TMM) 2026 ๐
Authors: Chaoyun Wang, Quanxin Huang, I-Chao Shen, Takeo Igarashi, Nanning Zheng, Caigui Jiang
Institutes: Xi'an Jiaotong University, The University of Tokyo
๐ข News
- [2026.02.27] ๐ We have released the paper results! You can download them via Baidu Cloud (Access Code:
mdqd). - [2026.01.01] ๐ Our paper has been accepted by IEEE Transactions on Multimedia (TMM)!
- [Update] Code is being organized and will be fully released soon.
๐ Introduction
Document rectification in real-world scenarios poses significant challenges due to extreme variations in camera perspectives and physical distortions. Driven by the insight that complex transformations can be decomposed and resolved progressively, we introduce a novel multi-stage framework that progressively reverses distinct distortion types in a coarse-to-fine manner. Specifically, our framework first performs a global affine transformation to correct perspective distortions arising from the camera's viewpoint, then rectifies geometric deformations resulting from physical paper curling and folding, and finally employs a content-aware iterative process to eliminate fine-grained content distortions. To address limitations in existing evaluation protocols, we also propose two enhanced metrics: layout-aligned OCR metrics (AED/ACER) for a stable assessment that decouples geometric rectification quality from the layout analysis errors of OCR engines, and masked AD/AAD (AD-M/AAD-M) tailored for accurately evaluating geometric distortions in documents with incomplete boundaries. Extensive experiments show that our method establishes new state-of-the-art performance on multiple challenging benchmarks, yielding a substantial reduction of 14.1%--34.7% in the AAD metric and demonstrating superior efficacy in real-world applications.
๐ Citation
If you find our work useful in your research, please consider citing:
@article{wang2025cascaded,
title={Cascaded Robust Rectification for Arbitrary Document Images},
author={Wang, Chaoyun and Huang, Quanxin and Shen, I and Igarashi, Takeo and Zheng, Nanning and Jiang, Caigui and others},
journal={arXiv preprint arXiv:2511.23150},
year={2025}
}