README.md

September 20, 2024 · View on GitHub

DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion

Liao Shen    Tianqi Liu    Huiqiang Sun    Xinyi Ye    Baopu Li    Jianming Zhang    Zhiguo Cao

Corresponding Autor

Installation

git clone https://github.com/leoShen917/DreamMover.git
cd DreamMover
conda create -n mover python=3.8.5
conda activate mover
pip install -r requirement.txt

You can download the pretrained model Stable Diffusion v1.5 from Huggingface, and specify the model_path to your local directory.

[Optional] You can download the fine-tuned vae model from Huggingface for better performance.

Run Gradio UI

To start the Gradio UI of DreamMover, run the following in your environment:

python gradio_ui.py

Then, by default, you can access the UI at http://127.0.0.1:7860.

Usage

To start with, run the following command to train a Lora for image pair:

python lora/train_dreambooth_lora.py --pretrained_model_name_or_path [model_path] --instance_data_dir [img_path] --output_dir [lora_path] --instance_prompt [prompt] --lora_rank 16

After that, we now can run the main code:

python main.py \
  --prompt [prompt] --img_path [img_path] --model_path [model_path] --vae_path [vae_path] --lora_path [lora_path] --save_dir [save_dir] --Time 33

The script also supports the following options:

  • --prompt: Prompt of the image pair(default: "")
  • --img_path: Path of the image pair
  • --model_path: Pretrained model path (default: "runwayml/stable-diffusion-v1-5")
  • --vae_path: vae model path (default= "default")
  • --lora_path: lora model path (the output path of train_lora)
  • --save_dir: path of the output images (default= "./results")
  • --Time: the frames of generated video

Citation

If you find our work useful in your research, please consider to cite our paper:

@article{shen2024dreammover,
  title={DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion},
  author={Shen, Liao and Liu, Tianqi and Sun, Huiqiang and Ye, Xinyi and Li, Baopu and Zhang, Jianming and Cao, Zhiguo},
  journal={arXiv preprint arXiv:2409.09605},
  year={2024}
}

Acknowledgement

This code borrows heavily from DragDiffusion, DiffMorpher and Diffusers. We thank the respective authors for open sourcing their method.