GuideDiffuSeq - Understanding the Quality-Diversity Trade-off in Diffusion Language Models
March 17, 2025 ยท View on GitHub
Access the paper here.
This is the codebase for GuideDiffuSeq, which proposes several methods for controlling quality and diversity in token-level diffusion language models, such as the effect of classifier-free guidance (CFG).
The code is based on the diffusers library.
Setup
git clone https://github.com/zzbuzzard/GuideDiffuSeq
cd GuideDiffuSeq
pip install -r requirements.txt
The codebase uses wandb, which you may disable with the environment variable WANDB_MODE=offline.
Usage
Create an empty directory models/[my_model_name], and add config.json and train_config.json files
which specify the parameters required in config.json.
Training is then run via
python train.py -m models/[my_model_name]
Following training, evaluation can be run with eval.py. See python eval.py --help
for usage information.
We include the QQP dataset in this repo (datasets/QQP) as it is of a reasonably small size.
Citation
@article{buzzard2025guidediffuseq,
title={Understanding the Quality-Diversity Trade-off in Diffusion Language Models},
author={Zak Buzzard},
year={2025},
url={https://arxiv.org/abs/2503.10683},
}