README.md

February 22, 2023 ยท View on GitHub

PWC PWC PWC

Railroad is not a Train: Saliency as Pseudo-pxiel Supervision for Weakly Supervised Semantic Segmentation (CVPR 2021)

CVPR 2021 paper

Seungho Lee1,* , Minhyun Lee1,*, Jongwuk Lee2, Hyunjung Shim1

* indicates an equal contribution

1 School of Integrated Technology, Yonsei University
2 Department of Computer Science of Engineering, Sungkyunkwan University

Introduction

EPS Existing studies in weakly-supervised semantic segmentation (WSSS) using image-level weak supervision have several limitations: sparse object coverage, inaccurate object boundaries, and co-occurring pixels from non-target objects. To overcome these challenges, we propose a novel framework, namely Explicit Pseudo-pixel Supervision (EPS), which learns from pixel-level feedback by combining two weak supervisions; the image-level label provides the object identity via the localization map and the saliency map from the off-the-shelf saliency detection model offers rich boundaries. We devise a joint training strategy to fully utilize the complementary relationship between both information. Our method can obtain accurate object boundaries and discard co-occurring pixels, thereby significantly improving the quality of pseudo-masks.

Updates

12 Jul, 2021: Initial upload

19 Aug, 2021: Minor update on information about dCRF and the pre-trained model of the segmentation networks

28 Aug, 2021: Major updates about MS-COCO 2014 dataset and minor updates (cleanup)

15 Apr, 2022: Minor update on information about the method setting up 'cls_labels.npy' the for ms-coco 17 dataset

  • Please see the issue: coco17

22 Feb, 2023: Minor update on the download link for coco dataset (Masks, Saliency maps)

Installation

  • Python 3.6
  • Pytorch >= 1.0.0
  • Torchvision >= 0.2.2
  • MXNet
  • Pillow
  • opencv-python (opencv for Python)

Execution

Dataset & pretrained model

Classification network

  • Execute the bash file for training, inference and evaluation.

    # Please see these files for the detail of execution.
    
    # PASCAL VOC 2012 
    # Baseline
    bash script/vo12_cls.sh
    # EPS
    bash script/voc12_eps.sh
    
    # MS-COCO 2014
    # Baseline
    bash script/coco_cls.sh
    # EPS
    bash script/coco_eps.sh  
    
  • We provide checkpoints, training logs, and performances for each method and each dataset.

    Please see the details from the script files.

    DatasetMETHODTrain(mIoU)CheckpointTraining log
    PASCAL VOC 2012Base47.05Downloadvoc12_cls.log
    PASCAL VOC 2012EPS69.22Downloadvoc12_eps.log
    MS-COCO 2014Base31.23Downloadcoco_cls.log
    MS-COCO 2014EPS37.15Downloadcoco_eps.log
  • dCRF hyper-parameters

    • We did not use dCRF for our pseudo-masks, but only used for the comparision in the paper.
    • We chose the hyper-parameters for dCRF used in ResNet101-based DeepLabV2 among other candidates(OAA, and PSA)
    • Please see the official deeplab website for information
    CRF parameters: bi_w = 4, bi_xy_std = 67, bi_rgb_std = 3, pos_w = 3, pos_xy_std = 1.
    

Segmentation network

Results

results

Acknowledgement

This code is highly borrowed from PSA. Thanks to Jiwoon, Ahn.