DreamO v1.1

June 24, 2025 · View on GitHub

After the release of DreamO v1, we received valuable feedback from the community. We're grateful for your support and suggestions.

The main issues identified were:

In version 1.1, we addressed these through post-training with SFT and DPO on high-quality datasets. Key improvements and comparisons are shown below.

In stylized scenes, v1 frequently suffered from structural and anatomical failures. v1.1 significantly improves stability, as illustrated:

In realistic scenes, v1.1 offers better hand and body rendering, with improved overall composition and aesthetics:

v1.1 reduces the excessive glossiness and plastic-like appearance in faces and scenes, resulting in more natural and appealing outputs:

Tips

Applying the FLUX LoRA Super-Realism can further enhance realism. However, it may interfere with stylization, so it's not included by default. You can experiment with it in ComfyUI. If you discover better LoRAs, we welcome contributions.
By default, reference images are resized to 512×512. For images with fine text or intricate details, consider increasing the resolution. This enables the generated image to retain more details of the reference image, see the figure below. However, only do so when necessary, as it increases inference time and may reduce editability.

Diffuser: Pull the latest code from this repo and run app.py as before. v1.1 is used by default and will automatically download from Hugging Face.
ComfyUI: Refer to our official native implementation here: ComfyUI-DreamO.
Hugging Face Demo: The online demo has been updated to use the latest v1.1 model.

Compared to v1, v1.1 may perform slightly worse in distinguishing multiple similar subjects within the same composition.