LEAF: Revolutionizing Speech-Aware Language Models

By Signe EriksenJune 9, 2026

LEAF introduces a novel method to improve speech-aware language models by addressing coarse credit assignment, outperforming current SOTA models.

refining speech-aware large language models, traditional GRPO-style methods have hit a snag. They often struggle with coarse credit assignment, applying the same terminal-reward advantage uniformly across tokens. This overlooks the inherent structure within rollout batches, especially as speech-conditioned completions tend to share prefixes before forking into critical decisions.

Introducing LEAF

Enter Low-rank Exploration with Adaptive Forking (LEAF), a breakthrough in retrospective tree-based reinforcement learning. LEAF sidesteps the need for online branching or additional decoding by sampling complete responses, identifying high-surprisal boundaries, grouping these by shared prefixes, and assigning span-level advantages through descendant rewards. The method's theoretical underpinnings justify its span-level credit assignment approach and boundary-selection design.

Why LEAF Matters

LEAF isn't just a theoretical improvement. Empirically, it outperforms GRPO across benchmarks in speech question answering and speech translation. And it does this all while operating within the same rollout and low-rank adaptation budget. Notably, even smaller LEAF-trained models surpass the current state-of-the-art, full-parameter baselines. This raises a critical question: are we nearing the end of an era for conventional post-training methods?

The Significance of LEAF's Approach

LEAF's ability to harness structure within rollout batches addresses a long-standing gap in language model training. This retrospective method captures nuanced decision points, leading to more precise and efficient model training. If smaller models can indeed outshine their full-parameter counterparts, the implications for computational efficiency and resource allocation are significant. Could this herald a new standard for speech-aware models?

The paper's key contribution? A shift towards more nuanced credit assignment. This has profound implications for how we approach model refinement in speech-aware applications. With LEAF, we're not just tweaking models. We're fundamentally rethinking how they learn and adapt.

Share this article:

Get AI news in your inbox

Daily digest of what matters in AI.

LEAF: Revolutionizing Speech-Aware Language Models

Introducing LEAF

Why LEAF Matters

The Significance of LEAF's Approach

Key Terms Explained