BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation

December 11, 2023 ยท View on GitHub

BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation;
Hao Chen, Kunyang Sun, Zhi Tian, Chunhua Shen, Yongming Huang, and Youliang Yan;
In: Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2020.

[Paper] [BibTeX]

This project contains training BlendMask for instance segmentation and panoptic segmentation on COCO and configs for segmenting persons on PIC.

Quick Start

Demo

wget -O blendmask_r101_dcni3_5x.pth https://cloudstor.aarnet.edu.au/plus/s/vbnKnQtaGlw8TKv/download
python demo/demo.py \
    --config-file configs/BlendMask/R_101_dcni3_5x.yaml \
    --input datasets/coco/val2017/000000005992.jpg \
    --confidence-threshold 0.35 \
    --opts MODEL.WEIGHTS blendmask_r101_dcni3_5x.pth

Training and evaluation

To train a model with "train_net.py", first setup the corresponding datasets following datasets/README.md,

Then follow these steps to generate blendmask format annotations for instance segmentation.

then run:

OMP_NUM_THREADS=1 python tools/train_net.py \
    --config-file configs/BlendMask/R_50_1x.yaml \
    --num-gpus 4 \
    OUTPUT_DIR training_dir/blendmask_R_50_1x

To evaluate the model after training, run:

OMP_NUM_THREADS=1 python tools/train_net.py \
    --config-file configs/BlendMask/R_50_1x.yaml \
    --eval-only \
    --num-gpus 4 \
    OUTPUT_DIR training_dir/blendmask_R_50_1x \
    MODEL.WEIGHTS training_dir/blendmask_R_50_1x/model_final.pth

Models

COCO Instance Segmentation Baselines

ModelNameinf. timebox APmask APdownload
Mask R-CNNR_50_1x13 FPS38.635.2
BlendMaskR_50_1x14 FPS39.935.8model
Mask R-CNNR_50_3x13 FPS41.037.2
BlendMaskR_50_3x14 FPS42.737.8model
Mask R-CNNR_101_3x10 FPS42.938.6
BlendMaskR_101_3x11 FPS44.839.5model
BlendMaskR_101_dcni3_5x10 FPS46.841.1model

BlendMask Real-time Models

ModelNameinf. timebox APmask APdownload
Mask R-CNN550_R_50_3x16 FPS39.135.3
BlendMask550_R_50_3x28 FPS38.734.5model
BlendMaskRT_R_50_4x_syncbn_shtw31 FPS39.335.1model
BlendMaskRT_R_50_4x_bn-head_syncbn_shtw31 FPS39.335.1model
BlendMaskDLA_34_4x32 FPS40.836.3model

COCO Panoptic Segmentation Baselines with BlendMask

ModelNamePQPQThPQStdownload
Panoptic FPNR_50_3x41.548.331.2
BlendMaskR_50_3x42.549.532.0model
Panoptic FPNR_101_3x43.049.732.9
BlendMaskR_101_3x44.351.633.2model
BlendMaskR_101_dcni3_5x46.052.935.5model

Citing BlendMask

If you use BlendMask in your research or wish to refer to the baseline results, please use the following BibTeX entries.

@inproceedings{chen2020blendmask,
  title     =  {{BlendMask}: Top-Down Meets Bottom-Up for Instance Segmentation},
  author    =  {Chen, Hao and Sun, Kunyang and Tian, Zhi and Shen, Chunhua and Huang, Yongming and Yan, Youliang},
  booktitle =  {Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR)},
  year      =  {2020}
}