[NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation

June 22, 2026 · View on GitHub

[NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation

Huanlin Gao^1,2^*, Ping Chen^1,2^*, Fuyuan Shi^1,2, Chao Tan^1,2, Zhaoxiang Liu^1,2
Fang Zhao^1,2^†, Kai Wang^1,2, Shiguo Lian^1,2^†

¹Data Science & Artificial Intelligence Research Institute, China Unicom, ²Unicom Data Intelligence, China Unicom

(* Equal contribution. † Corresponding author.)

LeMiCa Overview

Introduction

LeMiCa is a training-free acceleration framework for diffusion-based video generation (and extendable to image generation). Instead of using local heuristic thresholds, LeMiCa formulates cache scheduling as a global path optimization problem with error-weighted edges and introduces a Lexicographic Minimax strategy to bound the worst-case global error. This global planning improves both inference speed and consistency across frames. For more details and visual results, please visit our project page.

🔥 Latest News

[2026/06/22] 🔥 Our latest work "OTCache: Optimal Transport for Geometry-Aware Caching in Diffusion Models" is accepted by ECCV 2026!
[2026/04/16] 🚀 Support ERNIE-Image text-to-image acceleration with LeMiCa.
[2026/01/29] ✨ Our latest work "MeanCache: From Instantaneous to Average Velocity for Accelerating Flow Matching Inference" is accepted by ICLR 2026! Codes are available at MeanCache! MeanCache achieves 4.12×, 4.56×, and 3.59× acceleration on FLUX.1, Qwen-Image, and HunyuanVideo, while consistently outperforming state-of-the-art caching baselines in generation quality. For more details, please refer to our latest research paper.
[2026/01/20] ✨ Added support for FLUX.1-dev and FLUX.2-Klein in LeMiCa4FLUX
[2025/12/15] ✨ ComfyUI-LeMiCa has been seamlessly integrated into ComfyUI and is fully compatible with ComfyUI’s native nodes.
[2025/12/08] ✨ Support HunyuanVideo-1.5 for both T2V and I2V.
[2025/12/02] ✨ Support Z-Image and FLUX.2.
[2025/11/14] ⭐ We have open-sourced Awesome-Acceleration-GenAI, collecting the latest generation acceleration techniques. Feel free to check it out !
[2025/11/13] ✨ Support Wan2.1 for both T2V and I2V.
[2025/11/07] ✨ Support Qwen-Image and Inference Code Released !
[2025/10/29] 🚀 Code will be released soon !
[2025/09/18] ✨ Selected as a NeurIPS 2025 Spotlight paper.
[2025/09/18] ✨ Initial public release of LeMiCa.

Demo

ComfyUI-LeMiCa

ComfyUI-LeMiCa Workflow

ERNIE-Image

Method	ERNIE-Image	LeMiCa-slow	LeMiCa-medium	LeMiCa-fast
Latency	32.168 s	16.471 s	11.432 s	7.043 s
T2I

FLUX.2 [Klein-9B]

Method	Flux.2(klein-9B)	LeMiCa-slow	LeMiCa-medium	LeMiCa-fast	LeMiCa-ultra
Latency	20.04 s	10.77 s	8.45 s	6.54 s	4.59 s
T2I

Qwen-Image-2512

Method	Qwen-Image-2512	LeMiCa-slow	LeMiCa-medium	LeMiCa-fast
Latency	32.8 s	18.83 s	14.35 s	10.41 s
T2I

HunyuanVideo1.5

T2V 720P (Up to a 2.86× speedup）

https://github.com/user-attachments/assets/ebed2e0f-87f4-408e-98e3-93bd29bbc99f

I2V 720P (Up to a 3.88× speedup）

https://github.com/user-attachments/assets/d1a83d45-579f-4174-9477-ba0b9aebb322

FLUX.2

Method	Flux.2(cpu-offload)	Flux.2	LeMiCa-slow	LeMiCa-medium	LeMiCa-fast
Latency	101.2 s	32.70 s	13.41 s	10.20 s	6.99 s
T2I

Z-Image

Method	Z-Image	LeMiCa-slow	LeMiCa-medium	LeMiCa-fast
Latency	2.55 s	2.19 s	1.94 s	1.78 s
T2I

Wan2.1

https://github.com/user-attachments/assets/3d99b959-7253-47ec-af0a-da13a66e6d49

Open-Sora

Click to expand Open-Sora example

https://github.com/user-attachments/assets/ba205856-2d77-494a-aaa9-09189ba2915c

Qwen-Image

Click to expand Qwen-Image example

Supported Models

LeMiCa currently supports and has been tested on the following diffusion-based models:

Text-to-Video

Text-to-Image

ToDo List

🗹 Public Project Page
🗹 Paper Released
🗹 Text-to-Image Forward Inference
🗹 Text-to-Video Forward Inference
☐ DAG Construction Code
☐ Support Acceleration Framework

Community Contributions & Friendly Links

Qwen-Image and CogVideo featured LeMiCa on their project homepages.
Cache-DiT A unified and flexible inference engine for DiTs, integrating and applying LeMiCa’s core insights. Welcome to try and explore. Details
ComfyUI-LeMiCa now includes Z-Image nodes. Thanks @scruffynerf.

@inproceedings{gao2025lemica,
  title     = {LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation},
  author    = {Huanlin Gao and Ping Chen and Fuyuan Shi and Chao Tan and Zhaoxiang Liu and Fang Zhao and Kai Wang and Shiguo Lian},
  journal   = {Advances in Neural Information Processing Systems (NeurIPS)},
  year      = {2025},
  url       = {https://arxiv.org/abs/2511.00090}
}

[NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation

[NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation

Introduction

🔥 Latest News

Demo

ComfyUI-LeMiCa

ERNIE-Image

FLUX.2 [Klein-9B]

Qwen-Image-2512

HunyuanVideo1.5

T2V 720P (Up to a 2.86× speedup）

I2V 720P (Up to a 3.88× speedup）

FLUX.2

Z-Image

Wan2.1

Open-Sora

Qwen-Image

Supported Models

ToDo List

Community Contributions & Friendly Links

Acknowledgement

License

📖 Citation

⭐ Star History