DeepInteraction & DeepInteraction++

April 16, 2025

DeepInteraction: 3D Object Detection via Modality Interaction,
Zeyu Yang, Jiaqi Chen, Zhenwei Miao, Wei Li, Xiatian Zhu, Li Zhang
NeurIPS 2022

DeepInteraction++: Multi-Modality Interaction for Autonomous Driving,
Zeyu Yang, Nan Song, Wei Li, Xiatian Zhu, Li Zhang, Philip H.S. Torr
TPAMI 2025

News

  • (2022/6/27) DeepInteraction-e ranks first on nuScenes among all solutions.
  • (2022/6/26) DeepInteraction-large ranks first on nuScenes among all non-ensemble solutions.
  • (2022/5/18) DeepInteraction-base ranks first on nuScenes among all solutions that do not use test-time augmentation and model ensemble.

Results

3D Object Detection (on nuScenes test)

| Model | Modality | mAP | NDS |
| --- | --- | --- | --- |
| DeepInteraction-e | C+L | 75.74 | 76.34 |
| DeepInteraction-large | C+L | 74.12 | 75.52 |
| DeepInteraction-base | C+L | 70.78 | 73.43 |

3D Object Detection (on nuScenes val)

| Model | Modality | mAP | NDS | Checkpoint |
| --- | --- | --- | --- | --- |
| DeepInteraction | C+L | 69.85 | 72.63 | Fusion_0075_refactor.pth (GoogleDrive/HuggingFace) |
| DeepInteraction++ | C+L | 70.63 | 73.27 | Fusion_0075_plusplus.pth (GoogleDrive/HuggingFace) |

Get Started

Environment

This implementation is built upon mmdetection3d. Please follow the steps in install.md to prepare the environment.

Data

Please follow the official mmdetection3d instructions to process the nuScenes dataset: https://mmdetection3d.readthedocs.io/en/latest/datasets/nuscenes_det.html
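
Before generating the info files, it can help to confirm that the raw dataset is laid out as expected. The sketch below is illustrative only. The sub-directory names (maps, samples, sweeps, v1.0-trainval) and the data/nuscenes location follow the mmdetection3d nuScenes guide and are assumptions about your setup. `check_nuscenes_layout` is a hypothetical helper, not part of this repository. The commented `create_data.py` call is the one documented by mmdetection3d, and it is assumed, not confirmed, that this repository ships the same script.

```shell
#!/usr/bin/env bash
# Sketch: verify the raw nuScenes layout before generating the info files.
# Directory names are assumptions taken from the mmdetection3d nuScenes guide.

check_nuscenes_layout() {
    local root="$1"
    local sub
    local missing=0
    for sub in maps samples sweeps v1.0-trainval; do
        if [ ! -d "${root}/${sub}" ]; then
            # Report every missing directory instead of stopping at the first.
            echo "missing: ${root}/${sub}" >&2
            missing=1
        fi
    done
    return "${missing}"
}

# Example usage from the repository root (paths are assumptions):
# check_nuscenes_layout data/nuscenes && \
#   python tools/create_data.py nuscenes --root-path ./data/nuscenes \
#       --out-dir ./data/nuscenes --extra-tag nuscenes
```

The helper only inspects directories and never modifies data, so it is safe to run repeatedly.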

Pretrained

Download the pretrained backbone weights (GoogleDrive/HuggingFace) to pretrained/.

Train & Test

# train DeepInteraction with 8 GPUs
tools/dist_train.sh projects/configs/nuscenes/Fusion_0075_refactor.py 8
# test DeepInteraction with 8 GPUs
tools/dist_test.sh projects/configs/nuscenes/Fusion_0075_refactor.py ${CHECKPOINT_FILE} 8 --eval=bbox
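
The test command expects `${CHECKPOINT_FILE}` to point at an existing checkpoint. As a minimal sketch, the wrapper below checks that the file exists and prints the resulting command so it can be reviewed before launching. `build_test_cmd` is a hypothetical helper, not part of this repository. The config path and the `--eval=bbox` flag are copied from the command above, and the default of 8 GPUs is an assumption mirroring it.

```shell
#!/usr/bin/env bash
# Sketch: guard the evaluation command against a missing checkpoint.
# CONFIG is copied from the README command; the wrapper itself is hypothetical.

CONFIG="projects/configs/nuscenes/Fusion_0075_refactor.py"

build_test_cmd() {
    local checkpoint="$1"
    local gpus="${2:-8}"   # default mirrors the 8-GPU example
    if [ ! -f "${checkpoint}" ]; then
        echo "checkpoint not found: ${checkpoint}" >&2
        return 1
    fi
    # Print the command instead of running it, so it can be inspected first.
    echo "tools/dist_test.sh ${CONFIG} ${checkpoint} ${gpus} --eval=bbox"
}

# Example usage (the checkpoint path is a placeholder):
# cmd="$(build_test_cmd path/to/Fusion_0075_refactor.pth 8)" && bash -c "${cmd}"
```

Printing the command keeps the helper free of side effects, which makes it easy to adapt to a different GPU count or checkpoint.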

Acknowledgement

Many thanks to the following open-source projects:

Reference

@inproceedings{yang2022deepinteraction,
  title={DeepInteraction: 3D Object Detection via Modality Interaction},
  author={Yang, Zeyu and Chen, Jiaqi and Miao, Zhenwei and Li, Wei and Zhu, Xiatian and Zhang, Li},
  booktitle={NeurIPS},
  year={2022}
}
@article{yang2024deepinteractionpp,
  title={DeepInteraction++: Multi-Modality Interaction for Autonomous Driving},
  author={Yang, Zeyu and Song, Nan and Li, Wei and Zhu, Xiatian and Zhang, Li and Torr, Philip H.S.},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)},
  year={2025}
}