README.md
June 17, 2024 ยท View on GitHub
Amodal Completion via Progressive Mixed Context Diffusion (CVPR 2024)
Project Page | Paper | arXiv | Bibtex
Katherine Xu, Lingzhi Zhang, Jianbo Shi
University of Pennsylvania, Adobe Inc.
Our method can recover the hidden pixels of objects in diverse images. Occluders may be co-occurring (a person on a surfboard), accidental (a cat in front of a microwave), the image boundary (giraffe), or a combination of these scenarios.
The pink outline indicates an occluder object.
We use pretrained diffusion inpainting models, and no additional training is required!
๐ Updates
- Stay tuned for our code release!
Table of Contents
Requirements
- Python 3.10
- Docker
Setup
-
Clone this
amodalrepository, and runcd Grounded-Segment-Anything. -
In the Dockerfile, change all instances of
/home/appuserto your path for theamodalrepository. -
Run
make build-image. -
Start and attach to a docker container from the image
gsa:v0. Then, navigate to theamodalrepository. -
Run
./install.shto finish setup and download model checkpoints.
Dataset
- Run
./download_dataset.shto download the COCO dataset.
Usage
Progressive Occlusion-aware Completion Pipeline
-
In
./main.sh, modifyinput_dirto your folder path for the images. -
Run
./main.sh. You may need to usechmodto change the file permissions first.
Citation
If you find our work useful, please cite our paper:
@inproceedings{xu2024amodal,
title={Amodal completion via progressive mixed context diffusion},
author={Xu, Katherine and Zhang, Lingzhi and Shi, Jianbo},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={9099--9109},
year={2024}
}