Mask Encoding for Single Shot Instance Segmentation

December 11, 2023 ยท View on GitHub

Rufeng Zhang, Zhi Tian, Chunhua Shen, Mingyu You, Youliang Yan

[arXiv] [BibTeX]

Models

COCO Instance Segmentation Baselines with MEInst

Nameinf. timebox APmask APdownload
MEInst_R_50_1x_none13 FPS39.530.7model
MEInst_R_50_1x12 FPS40.131.7model
MEInst_R_50_3x12 FPS43.634.5model
MEInst_R_50_3x_51219 FPS40.832.2model

Inference time is measured on a NVIDIA 1080Ti with batch size 1.

Quick Start

  1. Download the matrix file for mask encoding during training
  2. Symlink the matrix path to datasets/components/xxx.npz, e.g., coco/components/coco_2017_train_class_agnosticTrue_whitenTrue_sigmoidTrue_60.npz
  3. Follow AdelaiDet for install, train and inference

Step by step for Mask Encoding (Optional)

We recommend to directly download the matrix file and use it, as it can already handle most cases. And we also provide tools to generate encoding matrix yourself.

Example:

  • Generate encoding matrix

    python adet/modeling/MEInst/LME/mask_generation.py

  • Evaluate the quality of reconstruction

    python adet/modeling/MEInst/LME/mask_evaluation.py

Citing MEInst

If you use MEInst, please use the following BibTeX entry.

@inproceedings{zhang2020MEInst,
  title     =  {Mask Encoding for Single Shot Instance Segmentation},
  author    =  {Zhang, Rufeng and Tian, Zhi and Shen, Chunhua and You, Mingyu and Yan, Youliang},
  booktitle =  {Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR)},
  year      =  {2020}
}

License

For academic use, this project is licensed under the 2-clause BSD License - see the LICENSE file for details. For commercial use, please contact Chunhua Shen.