
July 16, 2026

GPOcc: Generalizing Visual Geometry Priors to Sparse Gaussian Occupancy Prediction

Changqing Zhou¹, Yueru Luo², Changhao Chen¹✉

¹The Hong Kong University of Science and Technology (Guangzhou)
²The Chinese University of Hong Kong, Shenzhen

✉ Corresponding author.

GPOcc leverages generalizable visual geometry priors, such as VGGT, and represents volumetric evidence as sparse 3D Gaussians for efficient monocular 3D occupancy prediction. It further supports streaming embodied perception with an incremental fusion strategy for online scene understanding.

News

[2026.07] GPOcc++ is released.
[2026.05] Code is released.
[2026.02] :tada: GPOcc was accepted to CVPR 2026.

Overview

GPOcc generalizes powerful visual geometry priors to sparse Gaussian occupancy prediction. The core idea is to lift monocular observations into sparse 3D Gaussian scene elements and aggregate them into occupancy-aware scene representations for downstream prediction.
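To make the "aggregate sparse Gaussians into occupancy" idea concrete, here is a minimal, illustrative sketch in plain Python. It is not the GPOcc implementation: the names `Gaussian`, `new_free_grid`, `fuse_gaussians`, and `occupancy` are hypothetical, the Gaussians are assumed to be axis-aligned, and combining them as independent evidence is an assumption made only for this example.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence, Tuple

Vec3 = Tuple[float, float, float]
Grid = List[List[List[float]]]


@dataclass
class Gaussian:
    """Hypothetical axis-aligned 3D Gaussian scene element (illustration only)."""
    mean: Vec3      # centre in metres
    scale: Vec3     # per-axis standard deviation in metres
    opacity: float  # peak occupancy evidence in [0, 1]


def new_free_grid(shape: Tuple[int, int, int]) -> Grid:
    """Create a grid where every voxel starts fully free (P(free) = 1)."""
    nx, ny, nz = shape
    return [[[1.0] * nz for _ in range(ny)] for _ in range(nx)]


def fuse_gaussians(free: Grid, gaussians: Sequence[Gaussian],
                   grid_min: Vec3, voxel_size: float) -> None:
    """Fold a set of Gaussians into the free-space grid, in place."""
    shape = (len(free), len(free[0]), len(free[0][0]))
    for g in gaussians:
        # Only visit voxels that overlap the 3-sigma box around the centre.
        lo = [max(0, math.floor((g.mean[a] - 3 * g.scale[a] - grid_min[a]) / voxel_size))
              for a in range(3)]
        hi = [min(shape[a] - 1,
                  math.floor((g.mean[a] + 3 * g.scale[a] - grid_min[a]) / voxel_size))
              for a in range(3)]
        for i in range(lo[0], hi[0] + 1):
            for j in range(lo[1], hi[1] + 1):
                for k in range(lo[2], hi[2] + 1):
                    idx = (i, j, k)
                    # Squared normalized distance from the voxel centre to the Gaussian.
                    d2 = sum(
                        ((grid_min[a] + (idx[a] + 0.5) * voxel_size - g.mean[a])
                         / g.scale[a]) ** 2
                        for a in range(3)
                    )
                    evidence = g.opacity * math.exp(-0.5 * d2)
                    # Assumption: each Gaussian is independent evidence of occupancy.
                    free[i][j][k] *= 1.0 - evidence


def occupancy(free: Grid) -> Grid:
    """Convert free-space probabilities into occupancy probabilities."""
    return [[[1.0 - v for v in row] for row in plane] for plane in free]


# Usage: one Gaussian centred in the middle voxel of a 3x3x3 grid of 1 m voxels.
free = new_free_grid((3, 3, 3))
frame = [Gaussian(mean=(1.5, 1.5, 1.5), scale=(0.3, 0.3, 0.3), opacity=0.9)]
fuse_gaussians(free, frame, grid_min=(0.0, 0.0, 0.0), voxel_size=1.0)
print(occupancy(free)[1][1][1])
```

Because the free-space probability is updated multiplicatively, Gaussians from later frames can be folded into the same grid by calling `fuse_gaussians` again. That is one simple way to picture online fusion; the incremental fusion strategy GPOcc actually uses is not specified in this README.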

Getting Started

Follow docs/install.md to prepare the environment.
Follow docs/data.md to organize datasets.
Follow docs/train_eval.md to launch training and evaluation.

Demos

Citation

If you find this work useful, please consider citing:

@inproceedings{zhou2026generalizing,
  title={Generalizing Visual Geometry Priors to Sparse Gaussian Occupancy Prediction},
  author={Zhou, Changqing and Luo, Yueru and Chen, Changhao},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={28578--28587},
  year={2026}
}

Related Projects

We recommend checking out the following related projects:
