EIMC: Efficient Instance-aware Multi-modal Collaborative Perception

February 24, 2026 · View on GitHub

EIMC: Efficient Instance-aware Multi-modal Collaborative Perception

ICRA 2026 Python 3.8+ PyTorch License

EIMC achieves state-of-the-art multi-modal collaborative 3D detection while reducing bandwidth by 87.98% compared with the best published multi-modal collaborative detector.

Paper | Getting Started

Highlights

  • Early Collaborative Paradigm: Injects lightweight collaborative voxels into the ego's local modality-fusion step, yielding compact yet informative 3D collaborative priors.
  • Heatmap-driven Instance Communication: Only Top-K instance vectors from low-confidence, high-discrepancy regions are queried from peers — drastically reducing redundancy.
  • Instance Completion & Refinement: Cross-attention completion recovers occluded objects; self-attention refinement enhances instance features across agents.
  • 87.98% Bandwidth Reduction: Instance-centric messaging achieves superior performance with minimal communication overhead.

Results

Multi-modal Collaborative Detection

MethodOPV2V AP50OPV2V AP70DAIR-V2X AP50DAIR-V2X AP70Bandwidth (log2)
No Coll63.7458.3265.0253.820.00
V2VNet93.1389.0072.2252.9525.43
V2X-ViT93.6686.0671.8755.4624.00
CoBEVT93.0384.6471.7055.8524.00
BM2CP93.0488.9472.3756.1823.18
EIMC94.7189.1673.0158.3720.16

Installation

This project is built upon the HEAL framework.

Step 1: Environment Setup

conda create -n eimc python=3.8
conda activate eimc
conda install pytorch==1.12.0 torchvision==0.13.0 torchaudio==0.12.0 cudatoolkit=11.6 -c pytorch -c conda-forge
pip install -r requirements.txt
python setup.py develop

Step 2: Install Spconv

pip install spconv-cu116  # match your CUDA version

Step 3: Compile CUDA Extensions

python opencood/utils/setup.py build_ext --inplace

Data Preparation

Download and organize datasets under dataset/:

EIMC/dataset
├── OPV2V
│   ├── train
│   ├── validate
│   └── test
└── my_dair_v2x
    ├── v2x_c
    ├── v2x_i
    └── v2x_v

Usage

Training

# OPV2V
python opencood/tools/train.py -y opencood/hypes_yaml/opv2v/MM/DSfusion_V2.yaml

# DAIR-V2X
python opencood/tools/train.py -y opencood/hypes_yaml/dairv2x/MM/DSfusion_V2.yaml

Evaluation

python opencood/tools/inference.py --model_dir ${CHECKPOINT_FOLDER} --fusion_method intermediate

Citation

If you find this work useful, please cite:

@inproceedings{eimc2026icra,
  title={EIMC: Efficient Instance-aware Multi-modal Collaborative Perception},
  author={Kang Yang, Peng Wang, Lantao Li, Tianci Bu, Chen Sun, Deying Li, Yongcai Wang*},
  booktitle={IEEE International Conference on Robotics and Automation (ICRA)},
  year={2026}
}

Acknowledgements

This project is built upon HEAL and OpenCOOD. We thank the authors for their excellent work.

License

This project is released under the Academic Software License.