README.md
June 8, 2026 ยท View on GitHub
Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving
Yinan Zheng*, Tianyi Tan*, Bin Huang*, Enguang Liu, Ruiming Liang, Jianlin Zhang, Jianwei Cui, Guang Chen, Kun Ma, Hangjun Ye, Long Chen, Ya-Qin Zhang, Xianyuan Zhan, Jingjing Liu
Check out the real-vehicle demo on our project page
The official implementation of Hyper Diffusion Planner. Our work demonstrates that diffusion models, when properly designed and trained, can serve as effective and scalable E2E AD planners for complex, real-world autonomous driving tasks.
Real-world urban scenario testing uses model output, with only simple smoothness post-refinement.
To Do List
Note: In this repository, we will release implementation details from the paper and provide benchmark implementations for NAVSIM and NuPlan to support community research. Since our design is derived from real-world vehicle experiments, its performance on simulated benchmarks may not fully align with its real-world efficacy, which is a known limitation of such benchmarks as discussed in our paper.
- Real Vehicle Demo
- Nuplan Implementation
- NAVSIM Implementation
- initial repo & paper
NuPlan Implementation
We provide the NuPlan implementation in HDP-nuplan/, based on Diffusion Planner with several modifications for HDP.
Main Modifications
- Diffusion loss space:
support flexible combinations of model prediction target and supervision target via
--diffusion_model_typeand--diffusion_supervision_typeintrain_predictor.py. The conversion logic is implemented inhdp_nuplan/model/diffusion_utils/sde.pyand used inhdp_nuplan/loss.py. Current options:x_start,noise,v,score. - Hybrid planning loss:
we use
L_hybrid = L_velocity + w * L_waypointswherewis set by--planning_hybrid_loss(default0.01). The detached integration used by the waypoint term is implemented inhdp_nuplan/utils/traj_kinematics.py.
For additional notes, see HDP-nuplan/README.md.
NAVSIM Implementation
We provide the NAVSIM implementation in HDP-navsim/, based on the NAVSIM devkit with diffusion-based VLA agents for supervised pretraining and reward-based fine-tuning.
Main Features
- Supervised diffusion VLA:
DpVlaAgentimplements the base and HDP models in the paper, using a Florence-2 encoder and a DiT trajectory decoder for supervised diffusion training on NAVSIM. - Reward-based fine-tuning:
DpVlaRlAgentimplements HDP-RL by fine-tuning a pretrainedDpVlaAgentwith NAVSIM's PDM simulator and scorer. - Pretrained checkpoint: we provide the supervised DP-VLA base model checkpoint, which reaches 88.6 PDMS.
- Devkit and cache updates: the implementation reuses upstream NAVSIM packages where possible, replaces YAML data configs with JSON for faster startup, and stores image paths instead of raw image arrays in the data cache for simpler caching and augmentation.
For setup, cache preparation, training, RL fine-tuning, evaluation, and checkpoint links, see HDP-navsim/README.md.
Bibtex
If you find our code and paper can help, please cite our paper as:
@article{
zheng2026unleash,
title={Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving},
author={Yinan Zheng and Tianyi Tan and Bin Huang and Enguang Liu and Ruiming Liang and Jianlin Zhang and Jianwei Cui and Guang Chen and Kun Ma and Hangjun Ye and Long Chen and Ya-Qin Zhang and Xianyuan Zhan and Jingjing Liu},
journal={arXiv preprint arXiv:2602.22801},
year={2026}
}
If you find our diffusion and flow-matching designs useful, please cite our papers as:
@inproceedings{
zheng2025diffusionplanner,
title={Diffusion-Based Planning for Autonomous Driving with Flexible Guidance},
author={Yinan Zheng and Ruiming Liang and Kexin ZHENG and Jinliang Zheng and Liyuan Mao and Jianxiong Li and Weihao Gu and Rui Ai and Shengbo Eben Li and Xianyuan Zhan and Jingjing Liu},
booktitle={The Thirteenth International Conference on Learning Representations},
year={2025}
}
@inproceedings{
tan2025flow,
title={Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling},
author={Tianyi Tan and Yinan Zheng and Ruiming Liang and Zexu Wang and Kexin Zheng and Jinliang Zheng and Jianxiong Li and Xianyuan Zhan and Jingjing Liu},
booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems},
year={2025}
}
If you find our diffusion-based RL designs useful, please cite our papers as:
@inproceedings{liang2026dipole,
title={Dichotomous Diffusion Policy Optimization},
author={Ruiming Liang and Yinan Zheng and Kexin Zheng and Tianyi Tan and Jianxiong Li and Liyuan Mao and Zhihao Wang and Guang Chen and Hangjun Ye and Jingjing Liu and Jinqiao Wang and Xianyuan Zhan},
booktitle={The Fourteenth International Conference on Learning Representations},
year={2026}
}
@inproceedings{
zheng2024safe,
title={Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model},
author={Yinan Zheng and Jianxiong Li and Dongjie Yu and Yujie Yang and Shengbo Eben Li and Xianyuan Zhan and Jingjing Liu},
booktitle={The Twelfth International Conference on Learning Representations},
year={2024}
}