[NeurIPS 2024] UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles

November 3, 2024 · View on GitHub

website | paper

UAV3D is a public large-scale benchmark designed for 3D perception tasks from Unmanned Aerial Vehicle (UAV) platforms. This benchmark comprises the synthetic data and 3D perception algorithms, aiming to facilitate research in both single UAV and collaborative UAVs 3D perception tasks.

News

Installation

Prepare dataset

Training and evaluation

Main Results

3D Object Detection (UAV3D val)

ModelBackboneSizemAP↑NDS↑mATE↓mASE↓mAOE↓CheckpointLog
PETRRes-50704×2560.5120.5710.7410.1730.072linklink
BEVFusionRes-50704×2560.4870.4580.6150.1521.000linklink
DETR3DRes-50704×2560.4300.5090.7910.1870.100linklink
PETRRes-50800×4500.5810.6320.6250.1600.064linklink
BEVFusionRes-101800×4500.5360.5820.5210.1540.343linklink
DETR3DRes-101800×4500.6180.6710.4940.1580.070linklink

3D Object Tracking (UAV3D val)

ModelBackboneSizeAMOTA↑AMOTP↓MOTA↑MOTP↓TID↓LGD↓det_resultLog
PETRRes-50704×2560.1991.2940.1950.7941.2802.970linklink
BEVFusionRes-50704×2560.5661.1370.5010.6950.7901.600linklink
DETR3DRes-50704×2560.0891.3820.1210.8001.5403.530linklink
PETRRes-50800×4500.2911.1560.2560.6771.0902.550linklink
BEVFusionRes-101800×4500.6061.0060.5400.6270.7001.390linklink
DETR3DRes-101800×4500.2621.1230.2380.5611.1402.720linklink

Collaborative 3D Object Detection (UAV3D val)

ModelmAP↑NDS↑mATE↓mASE↓mAOE↓AP@IoU=0.5↑AP@IoU=0.7↑CheckpointLog
Lower-bound0.5440.5560.5400.1470.5780.4570.140linklink
When2com0.5500.5070.5340.1560.6790.4610.166linklink
Who2com0.5460.5970.5410.1500.2630.4530.141linklink
V2VNet0.6470.6280.5080.1670.5330.5450.141linklink
DiscoNet0.7000.6890.4230.1430.4220.6490.247linklink
Upper-bound0.7200.7480.3910.1060.1170.6730.316linklink

Collaborative 3D Object Tracking (UAV3D val)

ModelAMOTA↑AMOTP↓MOTA↑MOTP↓TID↓LGD↓det_resultLog
Lower-bound0.6441.0180.5930.6110.6201.280linklink
When2com0.6461.0120.5950.6180.5901.200linklink
Who2com0.6481.0120.6020.6230.5801.200linklink
V2VNet0.7820.8030.7350.5870.3600.710linklink
DiscoNet0.8090.7030.7660.5160.3000.590linklink
Upper-bound0.8120.6720.7810.4760.3000.570linklink

Citation

If you find this repository useful, please consider giving a star :star: and citation :blue_book::

@inproceedings{uav3d2024,
  title={UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles},
  author={Hui Ye and Raj Sunderraman and Shihao Ji},
  booktitle={The 38th Conference on Neural Information Processing Systems (NeurIPS)},
  year={2024}
}

Acknowledgement

In collecting UAV3D, we received valuable help and suggestions from the authors of CoPerception-UAV and Where2comm.

For 3D object detection task, our implementation is based on PETR, BEVFusion, and DETR3D.

For Collaborative 3D object detection task, our implementation is based on BEVFusion and CoPerception.

For object tracking task, our implementation is based on CenterPoint.

The software and data were created by Georgia State University Research Foundation under Army Research Laboratory (ARL) Award Numbers W911NF-22-2-0025 and W911NF-23-2-0224. ARL, as the Federal awarding agency, reserves a royalty-free, nonexclusive and irrevocable right to reproduce, publish, or otherwise use this software for Federal purposes, and to authorize others to do so in accordance with 2 CFR 200.315(b).