RemeDi: Remasking-enabled Diffusion Language Model

January 28, 2026 ยท View on GitHub

weixin RemeDiย 

Static Badgeย  Static Badgeย 

Official inference implementation of the paper "DON'T SETTLE TOO EARLY: SELF-REFLECTIVE REMASKING FOR DIFFUSION LANGUAGE MODELS".

๐Ÿšง Update Progress

[2026-1-28]

  • Model weights for RemeDi have been uploaded to HuggingFace (see resource links below).
  • Inference code is released.

๐Ÿ”ฌ Method Overview

RemeDi lets every token be revised at every diffusion step. Instead of fixing in an early guess, the model evaluates the quality of each token and can remask low-confidence positions, allowing later steps to resample them with richer contextโ€”built-in self-correction.

RemeDi extends the original model with a dual-stream transformer:

  • Token Prediction Stream (TPS) predicts masked tokens as usual.

  • Unmasking Policy Stream (UPS) outputs per-token confidence scores, deciding which tokens to unmask or remask.

At each denoising step, tokens with low confidence can be remasked and resampled, enabling iterative refinement. For the training and RL algorithms, see the Methods section of the paper.

RemeDi architecture and performance radar

๐Ÿ“ˆ Key Results

RemeDi performance table

๐Ÿ“‚ Repository Structure

โ”œโ”€โ”€ inference.py     # inference scripts
โ”œโ”€โ”€ remedi/          # networks configs 
โ””โ”€โ”€ README.md

๐Ÿš€ Inference

To run inference, execute: python inference.py

๐Ÿ“ฅ Citation

@article{huang2025don,
  title={Don't Settle Too Early: Self-Reflective Remasking for Diffusion Language Models},
  author={Huang, Zemin and Wang, Yuhang and Chen, Zhiyang and Qi, Guo-Jun},
  journal={arXiv preprint arXiv:2509.23653},
  year={2025}
}