README.md

January 3, 2026 · View on GitHub

DistillDrive

End-to-End Multi-Mode Autonomous Driving Distillation by Isomorphic Hetero-Source Planning Model

Accepted to ICCV 2025

Rui Yu1, Xianghang Zhang2, Runkai Zhao3, Huaicheng Yan1, Meng Wang1,

1 East China University of Science and Technology, 2 SenseAuto Research, 3 The University of Sydney

DistillDrive  DistillDrive  huggingface weights 

News

  • Sep. 7th, 2025: We reorganize code for better readability. Code & Models are released.
  • Aug. 08, 2025: We release the DistillDrive paper on arXiv.
  • Jun. 26, 2025: DistillDrive is accepted to ICCV 2025!
  • Oct. 26, 2025: DistillDrive is published at ICCV 2025 on CVF Open Access Paper.

Introduction

We introduce DistillDrive, an end-to-end knowledge distillation-based autonomous driving model that leverages diversified instance imitation to enhance multi-mode motion feature learning

  • We propose a distillation architecture for multi-mode instance supervision in end-to-end planning, tackling single-target imitation learning limitations.
  • We introduce reinforcement learning-based state optimization to enhance state-to-decision space understanding and mitigate ego motion state leakage.
  • To address missing motion-guided attributes, we use a generative model for distribution-wise interaction between expert trajectories and instance features.
  • We conduct open- and closed-loop planning experiments on the nuScenes and NAVSIM datasets, achieving a 50% reduction in collision rate and a 3-point increase in both EP and PDMS over the baseline.

Overview

The Overview of our proposed DistillDrive. Initially, we train a teacher model with scene-structured annotation data, integrating reinforcement and imitation learning to enhance multi-mode planning. Subsequently, we constructed an end-to-end student model and used a generative model to implement motion-oriented distribution interactions in latent space. Ultimately, multi-stage knowledge distillation and multi-mode supervision synergistically enhance the planning diversity and safety margins of autonomous driving models.

  • Open-loop planning results on nuScenes.
ModelInput1s2s3sAvg.1s2s3sAvg.FPS ↑
L2 (m) ↓Collision (%) ↓
FFLiDAR0.551.202.541.430.060.171.070.43-
EOLiDAR0.671.362.781.600.040.090.880.33-
ST-P3Camera1.332.112.902.110.230.631.270.711.6
UniADCamera0.450.701.040.730.620.580.630.611.8
VADCamera0.410.701.050.720.030.190.430.214.5
SparseDriveCamera0.310.601.000.630.010.080.300.136.5
DistillDrive (Teacher)Annotation0.270.510.820.530.010.040.100.0531.6
DistillDrive (Student)Camera0.280.540.830.570.000.030.170.066.0
  • Close-loop planning results on NAVSIM.
ModelInputBackboneNC ↑DAC ↑TTCComf. ↑EP ↑PDMS ↑
Const Velocity--69.057.858.010019.420.6
Ego Status MLP--93.077.383.610062.865.6
UniADCResNet-3497.891.992.210078.883.4
VADV2CResNet-3492.289.191.610076.080.9
TransfuserC & LResNet-3497.892.392.910078.683.5
Hydra-MDPC & LResNet-3497.991.792.910077.683.0
DistillDrive (Teacher)GT-97.596.092.810081.086.5
DistillDrive (Student)C & LResNet-3498.194.693.610081.086.2
Modle NameStageWeight Download
distilldrive_stage0_distribution.pthstage0huggingface
distilldrive_stage0_label.pthstage0huggingface
distilldrive_stage1_adamax.pthstage1huggingface
distilldrive_stage1_soap.pthstage1huggingface
distilldrive_stage2_distribution.pthstage2huggingface
distilldrive_stage2_label.pthstage2huggingface

Getting Started

Video Demo on Real-world Application

https://huggingface.co/RuiYuStudying/DistillDrive/blob/main/demo.mp4

Acknowledgement

Citation

If you find DistillDrive is useful in your research or applications, please consider giving us a star 🌟 and citing it by the following BibTeX entry.

@inproceedings{yu2025distilldrive,
  title={Distilldrive: End-to-end multi-mode autonomous driving distillation by isomorphic hetero-source planning model},
  author={Yu, Rui and Zhang, Xianghang and Zhao, Runkai and Yan, Huaicheng and Wang, Meng},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  pages={26188--26197},
  year={2025}
}