SimpleDet Model Zoo

June 2, 2020 ยท View on GitHub

Introduction

This file documents a large collection of baselines trained with SimpleDet.

Common Settings

  • All models were trained on train2014+valminusminival2014, and tested on minival2014.
  • We adopt the same training schedules as Detectron. 1x indicates 6 epochs and 2x indicates 12 epochs since we append flipped images into training data.
  • We report the training GPU memory as what nvidia-smi shows.

ImageNet Pretrained Models

We provide the ImageNet pretrained models used by SimpleDet. Unless otherwise noted, these models are trained on the standard ImageNet-1k dataset.

ResNetV1b Baselines

All config files can be found in config/resnet_v1b. Pretrains are converted from GluonCV. All AP results are reported on minival2014 of the COCO dataset.

ModelBackboneHeadTrain ScheduleAPAP50AP75APsAPmAPl
FasterR50v1b-C4C5-512ROI1X35.756.737.918.640.448.1
FasterR50v1b-C4C5-512ROI2X36.957.939.319.941.450.2
FasterR101v1b-C4C5-512ROI1X40.061.343.121.544.854.3
FasterR101v1b-C4C5-512ROI2X40.561.243.822.544.855.4
FasterR152v1b-C4C5-512ROI1X41.362.644.623.446.255.6
FasterR152v1b-C4C5-512ROI2X41.862.445.223.246.056.9
FasterR50v1b-FPN2MLP1X37.259.440.422.341.347.6
FasterR50v1b-FPN2MLP2X38.059.741.522.241.648.8
FasterR101v1b-FPN2MLP1X39.962.143.523.144.451.1
FasterR101v1b-FPN2MLP2X40.462.144.023.244.452.7
FasterR152v1b-FPN2MLP1X41.563.545.724.746.053.3
FasterR152v1b-FPN2MLP2X42.063.645.924.845.955.0
Mask(BBox)R50v1b-FPN2MLP1X37.859.940.922.941.548.0
Mask(BBox)R50v1b-FPN2MLP2X38.660.341.822.642.449.8
Mask(BBox)R101v1b-FPN2MLP1X40.462.244.124.044.452.1
Mask(BBox)R101v1b-FPN2MLP2X41.362.845.023.945.453.7
Mask(BBox)R152v1b-FPN2MLP1X41.863.746.125.346.353.6
Mask(BBox)R152v1b-FPN2MLP2X42.863.846.824.647.155.9
Mask(Inst)R50v1b-FPN2MLP1X34.456.536.218.737.946.4
Mask(Inst)R50v1b-FPN2MLP2X34.956.937.118.338.447.8
Mask(Inst)R101v1b-FPN2MLP1X36.358.838.619.439.749.7
Mask(Inst)R101v1b-FPN2MLP2X36.959.339.419.140.751.0
Mask(Inst)R152v1b-FPN2MLP1X37.460.139.820.041.650.7
Mask(Inst)R152v1b-FPN2MLP2X38.060.640.619.841.952.8
TridentR50v1b-C4C5-128ROI1X38.459.741.521.443.652.4
TridentR50v1b-C4C5-128ROI2X39.660.942.922.544.553.9
TridentR101v1b-C4C5-128ROI1X42.263.645.324.547.257.7
TridentR101v1b-C4C5-128ROI2X43.064.346.325.347.958.4
TridentR152v1b-C4C5-128ROI1X43.764.148.026.947.958.9
TridentR152v1b-C4C5-128ROI2X44.465.448.326.449.459.6
TridentFastR50v1b-C4C5-128ROI1X37.758.740.319.542.452.7
TridentFastR50v1b-C4C5-128ROI2X39.060.241.820.843.653.8
TridentFastR101v1b-C4C5-128ROI1X41.162.543.922.145.757.7
TridentFastR101v1b-C4C5-128ROI2X42.563.746.023.346.759.3
TridentFastR152v1b-C4C5-128ROI1X42.764.045.623.447.559.1
TridentFastR152v1b-C4C5-128ROI2X43.965.147.025.148.160.4
RetinaR50v1b-FPN4Conv1X36.656.939.020.340.747.2
RetinaR101v1b-FPN4Conv1X39.259.542.222.844.051.1
RetinaR152v1b-FPN4Conv1X40.461.143.423.645.052.3
FasterR50v1b-C4-DCNv1C5-512ROI1X38.860.041.320.643.353.2
FasterR101v1b-C4-DCNv1C5-512ROI1X41.463.044.722.746.156.8
FasterR50v1b-C4-DCNv2C5-512ROI1X39.660.842.720.843.954.2
FasterR50v1b-C4-DCNv2C5-512ROI2X41.262.244.721.745.357.0
FasterR101v1b-C4-DCNv2C5-512ROI1X41.763.044.722.846.157.3
FasterR101v1b-C4-DCNv2C5-512ROI2X42.763.746.024.946.957.9
RetinaR50v1b-FPN-TR152v1b1X4Conv1X38.959.041.621.443.352.1
RetinaR50v1b-FPN-TR152v1b1X4Conv2X40.160.643.121.844.554.3
FasterR50v1b-FPN-TR152v1b2X2MLP1X39.961.343.622.744.252.7
FasterR50v1b-FPN-TR152v1b2X2MLP2X40.562.243.923.144.753.9

Box, and Mask Detection Baselines

All AP results are reported on minival2014 of the COCO dataset.

ModelBackboneHeadTrain ScheduleGPUImage/GPUFP16Train MEMTrain SpeedBox AP(Mask AP)Link
FasterR50v1-C4C5-512ROI1X8X 1080Ti2no5.9G(4.5G)20 img/s34.2model
FasterR50v1-C4C5-512ROI1X8X TitanV2yes6.1G49 img/s34.4model
FasterR50v2-C4C5-256ROI1X8X 1080Ti2no5.1G33 img/s32.8model
CascadeR50v2-C52MLP1X8X 1080Ti2no5.9G25 img/s38.8model
CascadeR50v1-FPN2MLP1X8X 1080Ti2no6.6G21 img/s40.3model
FasterR50v1-FPN2MLP1X8X 1080Ti2no4.2G(2.6G)43 img/s36.5model
MaskR50v1-FPN2MLP+4CONV1X8X 1080Ti2no5.7G(3.6G)35 img/s37.1(33.7)model
RetinaR50v1-FPN4Conv1X8X 1080Ti2no4.7G(2.2G)44 img/s35.6model
TridentR50v2-C4C5-128ROI1X8X 1080Ti2no7.0G(5.3G)20 img/s37.1model
FasterR101v2-C4C5-256ROI1X8X 1080Ti2no6.7G25 img/s37.6model
Faster-SyncBNR101v2-C4C5-256ROI1X8X 1080Ti2no7.8G17 img/s38.6model
FasterR101v1-C4C5-512ROI1X8X 1080Ti2no10.2G16 img/s38.3model
FasterR101v1-C4C5-512ROI1X8X TitanV2yes7.0G35 img/s38.1model
FasterR101v1-FPN2MLP1X8X 1080Ti2no5.3G(3.4G)31 img/s38.7model
CascadeR101v2-C52MLP1X8X 1080Ti2no7.6G22 img/s41.0model
CascadeR101v1-FPN2MLP1X8X 1080Ti2no8.7G19 img/s42.3model
TridentR101v2-C4C5-128ROI1X8X 1080Ti1no6.6G9 img/s40.6model
Trident-FastR101v2-C4C5-128ROI1X8X 1080Ti1no6.6G9 img/s39.9model
RetinaR101v1-FPN4Conv1X8X 1080Ti2no5.9G(3.0G)31 img/s37.8model

FP16 Speed Benchmark

Here we provide the FP16 speeed benchmark results of several models.

ModelBackboneHeadTrain ScheduleGPUImage/GPUFP16Train MEMTrain Speed
FasterR50v1-C4C5-512ROI1X8X 1080Ti2no8.4G20 img/s
FasterR50v1-C4C5-512ROI1X8X TitanV2yes6.1G49 img/s
FasterR50v1-C4C5-512ROI1X8X TitanV4yes11.2G55 img/s
FasterR50v2-C4C5-256ROI1X8X 1080Ti2no5.1G33 img/s
FasterR50v2-C4C5-256ROI1X8X TitanV2yes3.8G61 img/s
FasterR50v2-C4C5-256ROI1X8X TitanV4yes6.6G73 img/s
FasterR101v1-C4C5-512ROI1X8X 1080Ti2no10.2G16 img/s
FasterR101v1-C4C5-512ROI1X8X TitanV2yes7.0G35 img/s
FasterR50v1-FPN2MLP1X8X 1080Ti2no4.2G(2.6G)43 img/s
FasterR50v1-FPN2MLP1X8X 2080Ti2yes3.7G(3.1G)65 img/s
FasterR50v1-FPN2MLP1X8X 2080Ti4yes6.2G(6.4G)77 img/s