COCO-FUNIT: Few-Shot Unsupervised Image Translation with a Content Conditioned Style Encoder

October 13, 2021 · View on GitHub

Project | Paper | Video (3m) | Video (10m) | Two Minute Papers Video

Software Installation

For installation, please checkout INSTALL.md.

Hardware Requirement

We trained our model using an NVIDIA DGX1 with 8 V100 32GB GPUs. Training took about one week.

Training

COCO-FUNIT prefers the following file arrangement.

${DATASET_ROOT_FOLDER}
└───images_content
    └───content_001.jpg
    └───content_002.jpg
    └───content_003.jpg
    ...
└───images_style
    └───style_001.jpg
    └───style_002.jpg
    ...

Training data preparation

To ease the trouble, we provide a copy of the Animal Faces dataset for quick experiments.

Download the dataset and unzip the files. The raw images are saved in projects/coco_funit/data/training
Build the lmdbs

for f in train train_all val; do
python -m imaginaire.tools.build_lmdb \
--config  configs/projects/coco_funit/animal_faces/base64_bs8_class119.yaml \
--data_root projects/coco_funit/data/raw/training/animal_faces/${f} \
--output_root projects/coco_funit/data/lmdb/training/animal_faces/${f} \
--overwrite
done

Training command

python -m torch.distributed.launch --nproc_per_node=8 train.py \
--config configs/projects/coco_funit/animal_faces/base64_bs8_class119.yaml \
--logdir logs/projects/coco_funit/animal_faces/base64_bs8_class119.yaml

Inference

Download test data by running

python scripts/download_test_data.py --model_name coco_funit

python inference.py --single_gpu \
--config configs/projects/coco_funit/animal_faces/base64_bs8_class149.yaml \
--output_dir projects/coco_funit/output/animal_faces

The results are stored in projects/coco_funit/output/animal_faces

Below we show the expected style--content-output images.

Style	Content	Translation

Mammals dataset

python inference.py --single_gpu \
--config configs/projects/coco_funit/mammals/base64_bs8_class305.yaml \
--output_dir projects/coco_funit/output/mammals

The results are stored in projects/coco_funit/output/mammals

Below we show the expected style--content-outpt images.

Style	Content	Translation

Citation

If you use this code for your research, please cite our papers.

@inproceedings{saito2020cocofunit,
  title={COCO-FUNIT: Few-Shot Unsupervised Image Translation with a Content Conditioned Style Encoder},
  author={Kuni Saito and Kate Saenko and Ming-Yu Liu},
  booktitle={European Conference on Computer Vision (ECCV)}},
  year={2020}
}