Appearance-Based Refinement for Object-Centric Motion Segmentation
October 23, 2024 · View on GitHub
Junyu Xie, Weidi Xie, Andrew Zisserman
Visual Geometry Group, Department of Engineering Science, University of Oxford
ECCV, 2024
The YouTubeVOS2018-motion Dataset
- YouTubeVOS2018-motion (short for YTVOS18-m, where "m" is for motion) is a subset selected from training split of YTVOS2018.
- These selected sequences are used for evaluation, with predominantly moving objects involved (i.e., objects can be discovered based on their motion).
- The list of selected sequences can be found in
resources/ytvos18m_seq.json. - The raw video frames can be downloaded from YouTubeVOS.
- The GT annotations are available here.
Pre-computed results and checkpoints
- The masks after refinements can be found here.
- The flow-predicted masks from self-supervised adapted OCLR models are provided as the inputs for the refinement, which can be found here. The corresponding checkpoint for self-supervised adapted OCLR models can be found here.
- The checkpoints for the mask selector and self-supervised adapted mask correctors can be found here.
Scripts
Requirements
pytorch>=2.0,
Pillow,
opencv,
einops
Inference
python main.py --save_pred --dataset DAVIS17m --ckpt_selector={} --ckpt_corrector={} \
--img_dir={} --gt_dir={} --mask_dir={} --save_dir={}
where --save_pred saves the refined masks
--ckpt_selector and --ckpt_corrector indicate the checkpoints for the mask selector and corrector
--img_dir and --gt_dir denote the directories for dataset images and corresponding gt annotations
--mask_dir denote the directory of input flow-predicted masks to be refined
--save_dir specifies the directory to save predicted masks
Citation
If you find our paper/repository helpful, please consider citing our works:
@InProceedings{xie24appearrefine,
title = {Appearance-Based Refinement for Object-Centric Motion Segmentation},
author = {Junyu Xie and Weidi Xie and Andrew Zisserman},
booktitle = {ECCV},
year = {2024}
}
@inproceedings{xie2022segmenting,
title = {Segmenting Moving Objects via an Object-Centric Layered Representation},
author = {Junyu Xie and Weidi Xie and Andrew Zisserman},
booktitle = {NeurIPS},
year = {2022}
}