Detection Model Zoo

December 2, 2022 ยท View on GitHub

Inference default use V100 16G.

YOLOX-PAI

Pretrained on COCO2017 dataset. (The result has been optimized with PAI-Blade, and only computes the model inference time. To learn about end2end inference time, you can refer to export.md.)

AlgorithmConfigParamsSpeedV100
fp16 b32
mAPval
0.5:0.95
APval
50
Download
YOLOX-syolox_s_8xb16_300e_coco9M0.68ms40.058.9model - log
PAI-YOLOXsyoloxs_pai_8xb16_300e_coco16M0.71ms41.460.0model - log
PAI-YOLOXs-ASFFyoloxs_pai_asff_8xb16_300e_coco21M0.87ms42.861.8model - log
PAI-YOLOXs-ASFF-TOOD3yoloxs_pai_asff_tood3_8xb16_300e_coco24M1.15ms43.962.1model - log
YOLOX-myolox_m_8xb16_300e_coco25M1.52ms46.364.9model - log
YOLOX-lyolox_l_8xb8_300e_coco54M2.47ms48.967.5model - log
YOLOX-xyolox_x_8xb8_300e_coco99M4.74ms50.969.2model - log
YOLOX-tinyyolox_tiny_8xb16_300e_coco5M0.28ms31.549.2model - log
YOLOX-nanoyolox_nano_8xb16_300e_coco2.2M0.19ms26.542.6model - log

ViTDet

AlgorithmConfigParams
(backbone/total)
Train memory
(GB)
inference time(V100)
(ms/img)
bbox_mAPval
0.5:0.95
mask_mAPval
0.5:0.95
Download
ViTDet_MaskRCNNvitdet_maskrcnn86M/111M13.3 (fp16)138ms50.6545.41model - log

FCOS

AlgorithmConfigParams
(backbone/total)
Train memory
(GB)
inference time(V100)
(ms/img)
mAPval
0.5:0.95
APval
50
Download
FCOS-r50(caffe)fcos-r5023M/32M5.085.8ms38.5857.18model - log
FCOS-r50(torch)fcos-r5023M/32M4.0 (fp16)105.3ms38.8858.01model - log

DETR

AlgorithmConfigParams
(backbone/total)
Train memory
(GB)
inference time(V100)
(ms/img)
bbox_mAPval
0.5:0.95
APval
50
Download
DETR-r50detr-r5023M/41M8.548.5ms39.9260.52model - log
DAB-DETR-r50dab-detr-r5023M/43M2.658.5ms42.5263.03model - log
DN-DETR-r50dab-detr-r5023M/43M7.858.5ms44.3964.66model - log

DINO

AlgorithmConfigParams
(backbone/total)
inference time(V100)
(ms/img)
bbox_mAPval
0.5:0.95
APval
50
DownloadComment
DINO_4sc_r50_12eDINO_4sc_r50_12e23M/47M184ms48.7166.27model - logInference use V100 32G
DINO_4sc_r50_36eDINO_4sc_r50_36e23M/47M184ms50.6968.60model - logInference use V100 32G
DINO_4sc_swinl_12eDINO_4sc_swinl_12e195M/217M155ms56.8675.61model - logInference use V100 32G
DINO_4sc_swinl_36eDINO_4sc_swinl_36e195M/217M155ms58.0476.76model - logInference use V100 32G
DINO_5sc_swinl_36eDINO_5sc_swinl_36e195M/217M235ms58.4777.10model - logInference use V100 32G
DINO++_5sc_swinl_18eDINO++_5sc_swinl_18e195M/218M325ms63.3980.25model - logInference use A100 80G