[NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation

June 11, 2026

📚English | 📚Read in Chinese   |   机器之心   |   百度文心

Huanlin Gao1,2* Ping Chen1,2* Fuyuan Shi1,2 Chao Tan1,2 Zhaoxiang Liu1,2
Fang Zhao1,2 Kai Wang1,2 Shiguo Lian1,2
1Data Science & Artificial Intelligence Research Institute, China Unicom,  2Unicom Data Intelligence, China Unicom
(* Equal contribution. † Corresponding author.)

LeMiCa Overview

Introduction

LeMiCa is a training-free acceleration framework for diffusion-based video generation that also extends to image generation. Instead of relying on local heuristic thresholds, LeMiCa formulates cache scheduling as a global path optimization problem over a graph with error-weighted edges, and introduces a Lexicographic Minimax strategy to bound the worst-case global error. This global planning improves both inference speed and consistency across frames. For more details and visual results, please visit our project page.
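To make the path formulation concrete, here is a minimal Python sketch of a lexicographic minimax path search on a toy graph. It is an illustration under stated assumptions, not the LeMiCa implementation (the DAG construction code is still listed as unreleased in the ToDo list). It assumes that nodes are denoising steps, that an edge (i, j) means "run the full model at step i and reuse its cache until step j", that each edge carries a precomputed error estimate, and that the budget is the number of edges allowed on the path. The function name `lexi_minimax_path` and the numbers in `toy_errors` are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

Edge = Tuple[int, int]  # (i, j): assumed to mean "compute at i, reuse cache until j"


def lexi_minimax_path(
    num_nodes: int, errors: Dict[Edge, float], budget: int
) -> Optional[List[int]]:
    """Find a path 0 -> num_nodes - 1 that uses exactly `budget` edges and whose
    edge errors, sorted in descending order, are lexicographically smallest.

    Returns the list of visited nodes, or None if no such path exists.
    """
    # best[k][j] = (descending-sorted errors, path) for reaching node j with k edges.
    best: List[Dict[int, Tuple[Tuple[float, ...], List[int]]]] = [
        {} for _ in range(budget + 1)
    ]
    best[0][0] = ((), [0])
    for k in range(1, budget + 1):
        for (i, j), err in errors.items():
            if i not in best[k - 1]:
                continue
            prev_errs, prev_path = best[k - 1][i]
            # Candidates for the same (k, j) all hold k errors, so comparing the
            # sorted tuples compares the worst error first, then the second worst, ...
            cand = tuple(sorted(prev_errs + (err,), reverse=True))
            if j not in best[k] or cand < best[k][j][0]:
                best[k][j] = (cand, prev_path + [j])
    result = best[budget].get(num_nodes - 1)
    return result[1] if result is not None else None


# Hypothetical error estimates for a 5-node toy graph (not measured values).
toy_errors: Dict[Edge, float] = {
    (0, 1): 0.0, (1, 2): 0.0, (2, 3): 0.0, (3, 4): 0.0,  # no skipping
    (0, 2): 0.5, (1, 3): 0.2, (2, 4): 0.3,               # skip one step
    (0, 3): 0.9, (1, 4): 0.6,                            # skip two steps
}

print(lexi_minimax_path(5, toy_errors, budget=2))  # [0, 2, 4]
print(lexi_minimax_path(5, toy_errors, budget=3))  # [0, 1, 3, 4]
```

With a budget of 2, a minimum-sum criterion would choose 0 → 1 → 4 (total error 0.6), whereas the lexicographic minimax criterion chooses 0 → 2 → 4 because its worst edge (0.5) is smaller than 0.6. This is the sense in which the worst-case error is bounded first.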

🔥 Latest News

  • [2026/04/16] ✨ Support ERNIE-Image text-to-image acceleration with LeMiCa.
  • [2026/01/29] 🔥 Our latest work, "MeanCache: From Instantaneous to Average Velocity for Accelerating Flow Matching Inference", has been accepted to ICLR 2026! Code is available at MeanCache. MeanCache achieves 4.12×, 4.56×, and 3.59× acceleration on FLUX.1, Qwen-Image, and HunyuanVideo, respectively, while consistently outperforming state-of-the-art caching baselines in generation quality. For more details, please refer to our latest research paper.
  • [2026/01/20] 🔥 Added support for FLUX.1-dev and FLUX.2-Klein in LeMiCa4FLUX
  • [2025/12/15] ✨ ComfyUI-LeMiCa has been seamlessly integrated into ComfyUI and is fully compatible with ComfyUI’s native nodes.
  • [2025/12/08] ✨ Support HunyuanVideo-1.5 for both T2V and I2V.
  • [2025/12/02] ✨ Support Z-Image and FLUX.2.
  • [2025/11/14] ⭐ We have open-sourced Awesome-Acceleration-GenAI, a collection of the latest generation acceleration techniques. Feel free to check it out!
  • [2025/11/13] ✨ Support Wan2.1 for both T2V and I2V.
  • [2025/11/07] ✨ Support Qwen-Image; inference code released!
  • [2025/10/29] 🚀 Code will be released soon!
  • [2025/09/18] ✨ Selected as a NeurIPS 2025 Spotlight paper.
  • [2025/09/18] ✨ Initial public release of LeMiCa.

Demo

ComfyUI-LeMiCa

ComfyUI-LeMiCa Workflow

ERNIE-Image

| Method | ERNIE-Image | LeMiCa-slow | LeMiCa-medium | LeMiCa-fast |
| --- | --- | --- | --- | --- |
| Latency | 32.168 s | 16.471 s | 11.432 s | 7.043 s |
| T2I | ERNIE-Image | LeMiCa-slow | LeMiCa-medium | LeMiCa-fast |
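The latencies above translate into speedup factors by dividing the baseline latency by each accelerated latency. A short sketch of that arithmetic follows; the variable names are illustrative and the numbers are copied from the ERNIE-Image table.

```python
# Latencies in seconds, taken from the ERNIE-Image table above.
latency_s = {
    "ERNIE-Image": 32.168,
    "LeMiCa-slow": 16.471,
    "LeMiCa-medium": 11.432,
    "LeMiCa-fast": 7.043,
}

baseline = latency_s["ERNIE-Image"]
# Speedup = baseline latency / accelerated latency.
speedup = {name: baseline / t for name, t in latency_s.items()}

for name, s in speedup.items():
    print(f"{name}: {s:.2f}x")
# ERNIE-Image: 1.00x
# LeMiCa-slow: 1.95x
# LeMiCa-medium: 2.81x
# LeMiCa-fast: 4.57x
```

The same calculation applies to the other latency tables in this section.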

FLUX.2 [Klein-9B]

| Method | Flux.2(klein-9B) | LeMiCa-slow | LeMiCa-medium | LeMiCa-fast | LeMiCa-ultra |
| --- | --- | --- | --- | --- | --- |
| Latency | 20.04 s | 10.77 s | 8.45 s | 6.54 s | 4.59 s |
| T2I | Flux.2(klein-9B) | LeMiCa-slow | LeMiCa-medium | LeMiCa-fast | LeMiCa-ultra |

Qwen-Image-2512

| Method | Qwen-Image-2512 | LeMiCa-slow | LeMiCa-medium | LeMiCa-fast |
| --- | --- | --- | --- | --- |
| Latency | 32.8 s | 18.83 s | 14.35 s | 10.41 s |
| T2I | Qwen-Image-2512 | LeMiCa-slow | LeMiCa-medium | LeMiCa-fast |

HunyuanVideo-1.5

T2V 720P (Up to a 2.86× speedup)

https://github.com/user-attachments/assets/ebed2e0f-87f4-408e-98e3-93bd29bbc99f

I2V 720P (Up to a 3.88× speedup)

https://github.com/user-attachments/assets/d1a83d45-579f-4174-9477-ba0b9aebb322

FLUX.2

| Method | Flux.2(cpu-offload) | Flux.2 | LeMiCa-slow | LeMiCa-medium | LeMiCa-fast |
| --- | --- | --- | --- | --- | --- |
| Latency | 101.2 s | 32.70 s | 13.41 s | 10.20 s | 6.99 s |
| T2I | Flux2 CPU-offload | Flux2 | LeMiCa-slow | LeMiCa-medium | LeMiCa-fast |

Z-Image

| Method | Z-Image | LeMiCa-slow | LeMiCa-medium | LeMiCa-fast |
| --- | --- | --- | --- | --- |
| Latency | 2.55 s | 2.19 s | 1.94 s | 1.78 s |
| T2I | Z-Image | LeMiCa-slow | LeMiCa-medium | LeMiCa-fast |

Wan2.1

https://github.com/user-attachments/assets/3d99b959-7253-47ec-af0a-da13a66e6d49

Open-Sora

https://github.com/user-attachments/assets/ba205856-2d77-494a-aaa9-09189ba2915c

Qwen-Image

Qwen-Image visual result

Supported Models

LeMiCa currently supports and has been tested on the following diffusion-based models:

Text-to-Video

Text-to-Image

ToDo List

  • 🗹 Public Project Page
  • 🗹 Paper Released
  • 🗹 Text-to-Image Forward Inference
  • 🗹 Text-to-Video Forward Inference
  • ☐ DAG Construction Code
  • ☐ Support Acceleration Framework

Acknowledgement

This repository builds on or is inspired by the following open-source projects: Diffusers, TeaCache, and VideoSys. We sincerely thank these communities for their open contributions and inspiration.

License

The majority of this project is released under the Apache 2.0 license as found in the LICENSE file.

📖 Citation

If you find LeMiCa useful in your research or applications, please consider giving us a star ⭐ and citing it with the following BibTeX entry:

@inproceedings{gao2025lemica,
  title     = {LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation},
  author    = {Huanlin Gao and Ping Chen and Fuyuan Shi and Chao Tan and Zhaoxiang Liu and Fang Zhao and Kai Wang and Shiguo Lian},
  booktitle = {Advances in Neural Information Processing Systems (NeurIPS)},
  year      = {2025},
  url       = {https://arxiv.org/abs/2511.00090}
}
