๐Ÿ† [ICCV 2025] ZeroStereo: Zero-shot Stereo Matching from Single Images ๐Ÿ†

May 21, 2026 ยท View on GitHub

Xianqi Wang, Hao Yang, Gangwei Xu, Junda Cheng, Min Lin, Yong Deng, Jinliang Zang, Yurui Chen, Xin Yang

Huazhong University of Science and Technology, Autel Robotics, Optics Valley Laboratory

arxiv

Used title: StereoGen: High-quality Stereo Image Generation from a Single Image

ZeroStereo

๐Ÿ”„ Update

  • 09/16/2025: Update MfS35K to Hugging Face.
  • 09/16/2025: Update MfS35K to Baidu Netdisk.
  • 07/29/2025: Update the fine-tuning code for SDv2I.
  • 07/14/2025: Update the generation code to improve the quality of the right image edges.

โš™๏ธ Environment

conda create -n zerostereo python=3.12
conda activate zerostereo

pip install tqdm numpy wandb opt_einsum hydra-core
pip install scipy torch torchvision diffusers transformers opencv-python matplotlib
pip install xformers accelerate scikit-image timm==0.5.4

๐Ÿ“‚ Required Data

Data for fine-tuning:

Data for generation:

The filepath format should be consistent with the filelist.

Data for training:

Data for evaluation:

๐ŸŽ Pre-Trained Model

ModelLink
SDv2IDownload ๐Ÿค—
StereoGenDownload ๐Ÿค—
Zero-RAFT-StereoDownload ๐Ÿค—
Zero-IGEV-StereoDownload ๐Ÿค—

The link to the original SDv2I is invalid. Please use the copy from others, like Download ๐Ÿค—.

๐Ÿš€ Fine-tuning

To fine-tune SDv2I, run:

accelerate launch train_stereogen.py

๐Ÿ› ๏ธ Generation

To generate MfS35K, run:

accelerate launch generate_mono.py
accelerate launch generate_stereo.py

๐Ÿš€ Training

To train Zero-RAFT-Stereo and Zero-IGEV-Stereo, run:

CUDA_VISIBLE_DEVICES='0,1' accelerate launch train_stereo.py

๐Ÿ“Š Evaluation

To evaluate Zero-RAFT-Stereo, run:

accelerate launch evaluate_stereo.py

To evaluate Zero-IGEV-Stereo, modify config/evaluate_stereo.yaml or run:

accelerate launch evaluate_stereo.py model=igev_stereo checkpoint=checkpoint/igev_stereo/model_700.safetensors  

๐ŸŽฅ Demo

To save disparity, run:

accelerate launch save_disparity.py

๐Ÿ”” Notification

๐Ÿ™ Acknowledgement

This project is based on MfS-Stereo, Depth Anything V2, Marigold, RAFT-Stereo, and IGEV-Stereo. We thank the original authors for their excellent works.