README.md
February 27, 2026 · View on GitHub
Flow3r: Factored Flow Prediction for Scalable Visual Geometry Learning
Zhongxiao Cong Qitao Zhao Minsik Jeon Shubham Tulsiani
Carnegie Mellon University
Overview
Flow3r augments visual geometry learning with dense 2D correspondences (`flow') as supervision, enabling scalable training from unlabeled monocular videos. Flow3r achieves state-of-the-art results across eight benchmarks spanning static and dynamic scenes, with its largest gains on in-the-wild dynamic videos where labeled data is most scarce.
Quick Start
1. Create the environment
conda create -n flow3r python=3.11
conda activate flow3r
pip install -r requirements.txt
2. Download and place checkpoint
flow3r.bin: Flow3r trained on ~834k video sequences.
Please fetch the checkpoint manually from Google Drive and drop the file into checkpoints/.
3. Launch the Gradio app
python gradio_app.py
Acknowledgements
- Our work builds upon several fantastic open-source projects. We would like to acknowledge and thank the authors of:
- We also thank the members of the Physical Perception Lab at CMU for their valuable discussions.
Citation
If you find our work useful, please cite:
@inproceedings{cong2026flow3r,
title={Flow3r: Factored Flow Prediction for Scalable Visual Geometry Learning},
author={Cong, Zhongxiao and Zhao, Qitao and Jeon, Minsik and Tulsiani, Shubham},
booktitle={CVPR},
year={2026}
}