DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model
April 12, 2026 ยท View on GitHub
DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model
https://github.com/user-attachments/assets/7da724d5-acbc-40f7-b5f8-dac38cfbe24a
https://github.com/user-attachments/assets/9ab4237c-67c0-4718-8576-32068a74bccf
https://github.com/user-attachments/assets/f29149f3-a749-4c55-9777-3d05912bebe1
Our Occupancy World Model can generate long-duration occupancy forecasts and can be effectively controlled by trajectory conditions.
๐ Overview
Our method consists of two components: (a) Occ-VAE Pipeline encodes occupancy frames into a continuous latent space, enabling efficient data compression. (b)DOME Pipeline learns to predict 4D occupancy based on historical occupancy observations.
๐๏ธ News
- [2025.1.1] We release the code and checkpoints.
- [2024.11.18] Project page is online!
๐๏ธ TODO
- Code release.
- Checkpoint release.
๐ Setup
clone the repo
git clone https://github.com/gusongen/DOME.git
cd DOME
environment setup
conda env create --file environment.yml
data preparation
-
Create soft link from
data/nuscenesto your_nuscenes_path -
Prepare the gts semantic occupancy introduced in Occ3d
-
Download our generated train/val pickle files and put them in
data/
The dataset should be organized as follows:
.
โโโ data/
โโโ nuscenes # downloaded from www.nuscenes.org/
โ โโโ lidarseg
โ โโโ maps
โ โโโ samples
โ โโโ sweeps
โ โโโ v1.0-trainval
โ โโโ gts # download from Occ3d
โโโ nuscenes_infos_train_temporal_v3_scene.pkl
โโโ nuscenes_infos_val_temporal_v3_scene.pkl
ckpt preparation
Download the pretrained weights from here and put them in ckpts folder.
๐ Run the code
(optional) Preprocess resampled data
cd resample
python launch.py \
--dst ../data/resampled_occ \
--imageset ../data/nuscenes_infos_train_temporal_v3_scene.pkl \
--data_path ../data/nuscenes
OCC-VAE
# train
sh tools/train_vae.sh
# eval
sh tools/eval_vae.sh
# visualize
sh tools/vis_vae.sh
DOME
# train
sh tools/train_diffusion.sh
# eval
sh tools/eval.sh
# visualize
sh tools/vis_diffusion.sh
๐ซ Acknowledgment
This code draws inspiration from their work. We sincerely appreciate their excellent contribution.
๐๏ธ Citation
@article{gu2024dome,
title={Dome: Taming diffusion model into high-fidelity controllable occupancy world model},
author={Gu, Songen and Yin, Wei and Jin, Bu and Guo, Xiaoyang and Wang, Junming and Li, Haodong and Zhang, Qian and Long, Xiaoxiao},
journal={arXiv preprint arXiv:2410.10429},
year={2024}
}