Rethinking Text Generation: T2M's Edge Over T2T

By Signe EriksenMay 27, 2026

Token-to-Mask (T2M) offers a cleaner approach to text generation by resetting suspected errors to mask state. It outperforms Token-to-Token (T2T) editing, especially in math tasks.

Discrete masked diffusion language models, like LLaDA, have transformed text generation through iterative denoising. LLaDA2.1, for example, introduced a Token-to-Token (T2T) editing mechanism, promising faster generation. But there's a catch.

The Problem with T2T

T2T editing has its drawbacks, primarily because it merges error detection with replacement. This mix taints the generation context with potentially incorrect tokens. Another problem is the mismatch between training and inference noise. While systematic errors arise during inference, training typically involves random noise. The result? A skewed context that affects the output quality.

T2M: A Cleaner Alternative

Enter Token-to-Mask (T2M) remasking. This training-free, drop-in solution offers a fresh take by resetting suspected erroneous tokens back to the mask state. This allows the diffusion process to re-predict under a cleaner context. The paper's key contribution: T2M not only purifies the generation context but also aligns systematic inference errors with the model's native mask noise type, enhancing token-level precision.

Why This Matters

Why should we care? The ablation study reveals that T2M improves performance across various benchmarks, including knowledge, reasoning, mathematics, coding, and instruction-following. It's especially notable in mathematics tasks, with a 5.92% performance gain on CMATH. That kind of improvement isn't just incremental. It's transformative in fields where precision is critical.

The Key Takeaway

The analysis identified that a significant failure mode is last-mile token corruption. Correct reasoning leads to a corrupted final answer. Remarkably, T2M repairs 59.4% of such cases. Is it time to reevaluate how we handle errors in text generation? The evidence suggests so. T2M's approach offers a cleaner, more efficient path forward. It builds on prior work by addressing fundamental issues with current models. Code and data are available at the project's repository, inviting further exploration and validation.

Share this article:

Get AI news in your inbox

Daily digest of what matters in AI.