Foresight: Adaptive Layer Reuse for Accelerated and High-Quality Text-to-Video Generation

November 26, 2025 · View on GitHub

Foresight proposes adaptive coarse grained reuse framework for accelerating text-to-video generation models while maintaining video quality.

This repository contains the source code implementation of Foresight.

This source code is available under the Apache 2.0 License.

BaselineStatic ReuseForesight (Adaptive Reuse)

⚙️ Environment Setup

conda Environment

You can create a new conda environment using script.

conda env create -n foresight-env python=3.10 -y
conda activate foresight-env
pip install -e .

💻 System Requirements

Right now, Foresight has been tested on a 1xA100 node for Open-Sora, Latte and CogVideoX models on single GPU.

We welcome contributions to evaluate Foresight across different models.

🏁 Using Foresight

Foresight Configuration

Foresight requires configuring below parameters to control the warmup phase and reuse phase reuse.

Parameters

  • warmup: No of denoising steps used during warmup phase.

    • Type: Integer
  • recalculate: Mandatory computation interval.

    • Format: Integer
  • threshold: Scaling factor for threshold.

    • Type: Float

Example Configuration

warmup: 5
recalculate: 2
threshold: 0.5

Example Runs

cd examples/open_sora
python sample.py
cd examples/latte
python sample.py
cd examples/cogvideox
python sample.py

Thank You

Foresight has been implemented on top of VideoSys, an easy and efficient system for video generation.

📝 Citation

@article{foresight,
  title={Foresight: Adaptive Layer Reuse for Accelerated and High-Quality Text-to-Video Generation},
  author={Adnan, Muhammad and Kurella, Nithesh and Arunkumar, Akhil and Nair, Prashant},
  year={2025},
  booktitle = {Proceedings of the 39th International Conference on Neural Information Processing Systems},
  location = {San Diego, CA, USA}
}