UPDATE:
April 19, 2024 ยท View on GitHub
Tencent has released official Comfy ELLA nodes that should now be used instead of this:
https://github.com/TencentQQGYLab/ComfyUI-ELLA
ComfyUI wrapper nodes to use the Diffusers implementation of ELLA
Installing
Either use the Manager and it's install from git -feature, or clone this repo to custom_nodes and run:
pip install -r requirements.txt
or if you use portable (run this in ComfyUI_windows_portable -folder):
python_embeded\python.exe -m pip install -r ComfyUI\custom_nodes\ComfyUI-ELLA-wrapper\requirements.txt
You can use any 1.5 model, rest is auto downloaded, sources:
ELLA model (132MB):
https://huggingface.co/QQGYLab/ELLA/blob/main/ella-sd1.5-tsc-t5xl.safetensors
Also requires google-flan-t5-xl, for which I have opted to autodownload this repo, 6GB):
https://huggingface.co/ybelkada/flan-t5-xl-sharded-bf16/tree/main
Original repo:
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Official code of "ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment".
๐ Changelog
- [2024.4.9] ๐ฅ๐ฅ๐ฅ Release ELLA-SD1.5 Checkpoint! Welcome to try!
- [2024.3.11] ๐ฅ Release DPG-Bench! Welcome to try!
- [2024.3.7] Initial update
Inference
ELLA-SD1.5
# get ELLA-SD1.5 at https://huggingface.co/QQGYLab/ELLA/blob/main/ella-sd1.5-tsc-t5xl.safetensors
# comparing ella-sd1.5 and sd1.5
# will generate images at `./assets/ella-inference-examples`
python3 inference.py test --save_folder ./assets/ella-inference-examples --ella_path /path/to/ella-sd1.5-tsc-t5xl.safetensors
# build a demo for ella-sd1.5
GRADIO_SERVER_NAME=0.0.0.0 GRADIO_SERVER_PORT=8082 python3 ./inference.py demo /path/to/ella-sd1.5-tsc-t5xl.safetensors
๐ DPG-Bench
The guideline of DPG-Bench:
-
Generate your images according to our prompts.
It is recommended to generate 4 images per prompt and grid them to 2x2 format. Please Make sure your generated image's filename is the same with the prompt's filename.
-
Run the following command to conduct evaluation.
bash dpg_bench/dist_eval.sh $YOUR_IMAGE_PATH $RESOLUTION
Thanks to the excellent work of DSG sincerely, we follow their instructions to generate questions and answers of DPG-Bench.
๐ TODO
- add huggingface demo link
- release checkpoint
- release inference code
- release DPG-Bench
๐ก Others
We have also found LaVi-Bridge, another independent but similar work completed almost concurrently, which offers additional insights not covered by ELLA. The difference between ELLA and LaVi-Bridge can be found in issue 13. We are delighted to welcome other researchers and community users to promote the development of this field.
๐ Citation
If you find ELLA useful for your research and applications, please cite us using this BibTeX:
@misc{hu2024ella,
title={ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment},
author={Xiwei Hu and Rui Wang and Yixiao Fang and Bin Fu and Pei Cheng and Gang Yu},
year={2024},
eprint={2403.05135},
archivePrefix={arXiv},
primaryClass={cs.CV}
}