StreamOcc

Streaming Dense Voxel Representations for 3D Occupancy Prediction

Seokha Moon¹,⁵,† · Janghyun Baek¹ · Yujin Jeong² · Daewon Chae³ · Giseop Kim⁴,⁵,‡ · Jungbeom Lee¹ · Jinkyu Kim¹,* · Sunwook Choi⁵,*

¹Korea University · ²TU Darmstadt & hessian.AI · ³University of Michigan · ⁴DGIST · ⁵NAVER LABS

†Work done during an internship at NAVER LABS · ‡Work done while at NAVER LABS

*Corresponding authors

🚀 News

2026.06.18 — StreamOcc has been accepted to ECCV 2026.
2025.11.29 — Code released.
2025.11.27 — StreamOcc paper has been updated on arXiv.

Overview

StreamOcc is a real-time 3D occupancy prediction framework that streams dense voxel representations across time. It addresses two key failure modes of naive dense voxel streaming: warping distortion from temporal alignment and degraded dynamic-object representations from image-to-voxel projection.

✨ Highlights

- Introduces a dual aggregation strategy that combines StreamAgg, for temporal dense voxel accumulation, with QueryAgg, for targeted dynamic-object refinement.
- Achieves state-of-the-art performance:
  - Occ3D-nuScenes: 41.9 mIoU (+2.3 over the prior SOTA in the real-time setting)
  - SurroundOcc benchmark: 23.4 mIoU / 21.0 mIoU_D (+1.5 / +2.0 over the prior SOTA)
  - RayIoU: 41.1 (+0.8 over the prior SOTA), with 34.2 / 41.9 / 47.1 at the 1 m / 2 m / 4 m thresholds
- Runs within real-time constraints (83.3 ms) and requires only 2.8 GB of GPU memory.
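
As a quick sanity check on the figures above, the short sketch below recomputes two derived numbers. It assumes that the headline RayIoU is the plain mean over the three distance thresholds (the usual convention for this metric) and that 83.3 ms is the per-frame latency; neither assumption is stated explicitly here, and the variable names are illustrative only.

```python
# Figures copied from the Highlights list; the interpretation is an assumption.
ray_iou_at = {1: 34.2, 2: 41.9, 4: 47.1}  # threshold in metres -> RayIoU
latency_ms = 83.3                         # assumed to be per-frame latency

# Assumed convention: headline RayIoU = mean over the three thresholds.
mean_ray_iou = sum(ray_iou_at.values()) / len(ray_iou_at)

# Per-frame latency converted to throughput.
frames_per_second = 1000.0 / latency_ms

print(f"mean RayIoU: {mean_ray_iou:.1f}")
print(f"throughput:  {frames_per_second:.1f} FPS")
```

Under these assumptions the mean matches the reported 41.1 RayIoU, and the latency corresponds to roughly 12 frames per second.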

💡 Method

StreamOcc predicts voxel occupancy in a streaming manner through two complementary aggregation stages:

StreamAgg: Rectified Voxel Streaming Aggregation

- Propagates dense voxel features through a recurrent streaming buffer.
- Aligns past voxel features to the current ego frame using motion-aware warping.
- Rectifies interpolation artifacts with adaptive residual refinement.
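
To illustrate why warping a streamed voxel buffer introduces interpolation artifacts, here is a minimal sketch and not the authors' implementation. It simplifies the problem to a single 2D bird's-eye-view slice of scalar occupancy values and uses plain bilinear resampling; `warp_bev`, `bilinear`, and the pose convention are hypothetical names and choices made for this example. The learned residual refinement that StreamAgg applies afterwards is not reproduced.

```python
import math


def bilinear(grid, u, v):
    """Sample grid[row][col] at fractional (col=u, row=v); outside counts as 0."""
    h, w = len(grid), len(grid[0])
    i0, j0 = math.floor(u), math.floor(v)
    fu, fv = u - i0, v - j0
    total = 0.0
    for jj, wj in ((j0, 1.0 - fv), (j0 + 1, fv)):
        for ii, wi in ((i0, 1.0 - fu), (i0 + 1, fu)):
            if 0 <= ii < w and 0 <= jj < h:
                total += wi * wj * grid[jj][ii]
    return total


def warp_bev(prev, dx, dy, yaw, voxel=1.0):
    """Resample a past BEV grid into the current ego frame.

    Assumed convention: a point p in the current frame lies at
    R(yaw) @ p + (dx, dy) in the past frame (metres, radians),
    and the ego vehicle sits at the grid centre.
    """
    h, w = len(prev), len(prev[0])
    cx, cy = (w - 1) / 2.0, (h - 1) / 2.0
    c, s = math.cos(yaw), math.sin(yaw)
    out = [[0.0] * w for _ in range(h)]
    for j in range(h):
        for i in range(w):
            # Metric position of this voxel centre in the current frame.
            x, y = (i - cx) * voxel, (j - cy) * voxel
            # The same physical point expressed in the past frame.
            px = c * x - s * y + dx
            py = s * x + c * y + dy
            # Back to fractional indices of the past grid, then interpolate.
            out[j][i] = bilinear(prev, px / voxel + cx, py / voxel + cy)
    return out


prev = [[0.0] * 5 for _ in range(5)]
prev[2][3] = 1.0  # one occupied cell, one voxel ahead of the ego

whole = warp_bev(prev, dx=1.0, dy=0.0, yaw=0.0)  # ego moved one full voxel
half = warp_bev(prev, dx=0.5, dy=0.0, yaw=0.0)   # ego moved half a voxel
print(whole[2])
print(half[2])
```

With a whole-voxel shift the occupied cell moves cleanly and keeps its value of 1.0. With a half-voxel shift the same cell is smeared into two neighbours of 0.5 each, and repeating this every frame in a recurrent buffer compounds the blur. This is the kind of distortion the rectification step is described as correcting.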

QueryAgg: Query-Guided Aggregation

- Extracts instance-level dynamic-object semantics from image features.
- Propagates object queries over time and injects them into corresponding occupied voxel regions.
- Complements dense voxel streaming for distant, occluded, and overlapping dynamic objects.
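
The idea of injecting object queries into occupied voxel regions can be sketched as follows. This is an illustrative simplification, not the method's actual mechanism: it assumes each query carries an axis-aligned 3D box and a feature vector, and it simply adds that feature to every occupied voxel whose centre falls inside the box. The function `inject_queries` and the data layout are hypothetical and stand in for whatever learned aggregation the model uses.

```python
def inject_queries(voxel_feats, occupied, queries, voxel=1.0):
    """Add each query feature to occupied voxels inside the query's box.

    voxel_feats: dict mapping (i, j, k) -> list of floats
    occupied:    set of (i, j, k) indices predicted as occupied
    queries:     list of dicts with 'center', 'size' (metres) and 'feat'
    """
    out = {idx: list(feat) for idx, feat in voxel_feats.items()}
    for q in queries:
        lo = [c - s / 2.0 for c, s in zip(q["center"], q["size"])]
        hi = [c + s / 2.0 for c, s in zip(q["center"], q["size"])]
        for idx in occupied:
            # Metric centre of the voxel with integer index idx.
            centre = [(n + 0.5) * voxel for n in idx]
            inside = all(a <= p <= b for a, p, b in zip(lo, centre, hi))
            if inside:
                base = out.get(idx, [0.0] * len(q["feat"]))
                out[idx] = [x + y for x, y in zip(base, q["feat"])]
    return out


voxel_feats = {
    (0, 0, 0): [1.0, 0.0],  # occupied, inside the query box
    (1, 0, 0): [2.0, 2.0],  # inside the box but not occupied
    (5, 5, 0): [0.0, 1.0],  # occupied, far from the box
}
occupied = {(0, 0, 0), (5, 5, 0)}
queries = [{"center": (1.0, 0.5, 0.5), "size": (2.0, 1.0, 1.0), "feat": [0.5, 0.5]}]

refined = inject_queries(voxel_feats, occupied, queries)
print(refined)
```

Only the voxel that is both occupied and inside the box is modified. Free space and occupied voxels belonging to other regions keep their streamed features, which reflects the complementary role described above.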

StreamAgg and QueryAgg jointly produce a fast, memory-efficient, and high-fidelity 3D occupancy representation.

🏋️ Training & Inference

# Train
bash local_train.sh StreamOcc
# Test
bash local_test.sh StreamOcc path/to/checkpoint

🙏 Acknowledgement

This project would not be possible without several great open-source codebases. We list some notable examples below.

📃 Bibtex

If this work is helpful for your research, please consider citing the following BibTeX entry.

@article{moon2025streamocc,
  title={Streaming Dense Voxel Representations for 3D Occupancy Prediction},
  author={Moon, Seokha and Baek, Janghyun and Jeong, Yujin and Chae, Daewon and Kim, Giseop and Lee, Jungbeom and Kim, Jinkyu and Choi, Sunwook},
  journal={arXiv preprint arXiv:2503.22087},
  year={2025}
}
