README.md

September 20, 2024 · View on GitHub

DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion

Liao Shen Tianqi Liu Huiqiang Sun Xinyi Ye Baopu Li Jianming Zhang Zhiguo Cao✉

✉Corresponding Autor

Installation

git clone https://github.com/leoShen917/DreamMover.git
cd DreamMover
conda create -n mover python=3.8.5
conda activate mover
pip install -r requirement.txt

You can download the pretrained model Stable Diffusion v1.5 from Huggingface, and specify the model_path to your local directory.

[Optional] You can download the fine-tuned vae model from Huggingface for better performance.

Run Gradio UI

To start the Gradio UI of DreamMover, run the following in your environment:

python gradio_ui.py

Then, by default, you can access the UI at http://127.0.0.1:7860.

Usage

To start with, run the following command to train a Lora for image pair:

python lora/train_dreambooth_lora.py --pretrained_model_name_or_path [model_path] --instance_data_dir [img_path] --output_dir [lora_path] --instance_prompt [prompt] --lora_rank 16

After that, we now can run the main code:

python main.py \
  --prompt [prompt] --img_path [img_path] --model_path [model_path] --vae_path [vae_path] --lora_path [lora_path] --save_dir [save_dir] --Time 33

The script also supports the following options:

--prompt: Prompt of the image pair(default: "")
--img_path: Path of the image pair
--model_path: Pretrained model path (default: "runwayml/stable-diffusion-v1-5")
--vae_path: vae model path (default= "default")
--lora_path: lora model path (the output path of train_lora)
--save_dir: path of the output images (default= "./results")
--Time: the frames of generated video

Citation

If you find our work useful in your research, please consider to cite our paper:

@article{shen2024dreammover,
  title={DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion},
  author={Shen, Liao and Liu, Tianqi and Sun, Huiqiang and Ye, Xinyi and Li, Baopu and Zhang, Jianming and Cao, Zhiguo},
  journal={arXiv preprint arXiv:2409.09605},
  year={2024}
}

Acknowledgement

This code borrows heavily from DragDiffusion, DiffMorpher and Diffusers. We thank the respective authors for open sourcing their method.