SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation

April 1, 2026 ยท View on GitHub

https://github.com/swc-17/SparseDrive/assets/64842878/867276dc-7c19-4e01-9a8e-81c4ed844745

News

  • 1 April, 2026: Our following work SparseDriveV2 is released. arxiv, Code.
  • 17 March, 2025: SparseDrive is accepted by ICRA 2025.
  • 24 June, 2024: We reorganize code for better readability. Code & Models are released.
  • 31 May, 2024: We release the SparseDrive paper on arXiv. Code & Models will be released in June, 2024. Please stay tuned!

Introduction

SparseDrive is a Sparse-Centric paradigm for end-to-end autonomous driving.

  • We explore the sparse scene representation for end-to-end autonomous driving and propose a Sparse-Centric paradigm named SparseDrive, which unifies multiple tasks with sparse instance representation.
  • We revise the great similarity shared between motion prediction and planning, correspondingly leading to a parallel design for motion planner. We further propose a hierarchical planning selection strategy incorporating a collision-aware rescore module to boost the planning performance.
  • On the challenging nuScenes benchmark, SparseDrive surpasses previous SOTA methods in terms of all metrics, especially the safety-critical metric collision rate, while keeping much higher training and inference efficiency.

Overview of SparseDrive. SparseDrive first encodes multi-view images into feature maps, then learns sparse scene representation through symmetric sparse perception, and finally perform motion prediction and planning in a parallel manner. An instance memory queue is devised for temporal modeling.

Model architecture of symmetric sparse perception, which unifies detection, tracking and online mapping in a symmetric structure.

Model structure of parallel motion planner, which performs motion prediction and planning simultaneously and outputs safe planning trajectory.

Results in paper

  • Comprehensive results for all tasks on nuScenes.
MethodNDSAMOTAminADE (m)L2 (m) AvgCol. (%) AvgTraining Time (h)FPS
UniAD0.4980.3590.710.730.611441.8
SparseDrive-S0.5250.3860.620.610.08209.0
SparseDrive-B0.5880.5010.600.580.06307.3
  • Open-loop planning results on nuScenes.
MethodL2 (m) 1sL2 (m) 2sL2 (m) 3sL2 (m) AvgCol. (%) 1sCol. (%) 2sCol. (%) 3sCol. (%) AvgFPS
UniAD0.450.701.040.730.620.580.630.611.8
VAD0.410.701.050.720.030.190.430.214.5
SparseDrive-S0.290.580.960.610.010.050.180.089.0
SparseDrive-B0.290.550.910.580.010.020.130.067.3

Results of released checkpoint

We found that some collision cases were not taken into consideration in our previous code, so we re-implement the evaluation metric for collision rate in released code and provide updated results.

Main results

Modelconfigckptlogdet: NDSmapping: mAPtrack: AMOTAtrack: AMOTPmotion: EPA_carmotion: minADE_carmotion: minFDE_carmotion: MissRate_carplanning: CRplanning: L2
Stage1cfgckptlog0.52600.56890.3851.260
Stage2cfgckptlog0.52570.56560.3721.2480.4920.610.950.1330.097%0.61

Detailed results for planning

MethodL2 (m) 1sL2 (m) 2sL2 (m) 3sL2 (m) AvgCol. (%) 1sCol. (%) 2sCol. (%) 3sCol. (%) Avg
UniAD0.450.701.040.730.660.660.720.68
UniAD-wo-post-optim0.320.580.940.610.170.270.420.29
VAD0.410.701.050.720.030.210.490.24
SparseDrive-S0.300.580.950.610.010.050.230.10

Quick Start

Quick Start

Citation

If you find SparseDrive useful in your research or applications, please consider giving us a star ๐ŸŒŸ and citing it by the following BibTeX entry.

@inproceedings{sun2025sparsedrive,
  title={Sparsedrive: End-to-end autonomous driving via sparse scene representation},
  author={Sun, Wenchao and Lin, Xuewu and Shi, Yining and Zhang, Chuang and Wu, Haoran and Zheng, Sifa},
  booktitle={2025 IEEE International Conference on Robotics and Automation (ICRA)},
  pages={8795--8801},
  year={2025},
  organization={IEEE}
}

Acknowledgement