Taiji: A New Force in Recommender Systems

The digital advertising landscape is witnessing a potential transformation with the introduction of Taiji, a pioneering framework designed for recommender systems. Currently deployed on Kuaishou's advertising platform, Taiji serves over 400 million users daily, promising vast improvements in how recommendations are tailored and delivered.

The Challenge and the Solution

Recommender systems have grappled with a fundamental challenge: aligning the semantic space of large language models (LLMs) with the practical ID space used by these systems. This has historically involved intricate post-training processes like supervised fine-tuning (SFT) and reinforcement learning (RL), each fraught with its own set of issues.

Taiji, however, brings a fresh approach. By employing reverse-engineered reasoning and open-ended rejection sampling, the framework generates high-quality, domain-specific chain-of-thought (CoT) data. This tackles one of the major hurdles in open-domain recommendation during SFT, the difficulty in measuring and enhancing CoT quality.

Redefining Reward Structures

Another significant innovation of Taiji lies in its handling of reward structures. Traditional LLM4Rec paradigms often struggle with balancing semantic rewards from the LLM with recommendation preference rewards. Enter Pareto Optimal Policy Optimization (POPO), a method that flexibly adjusts cross-domain reward weights to achieve an optimal balance. This ensures that the rich semantic world knowledge inherent in LLMs is effectively paired with the collaborative ID features highlighting user preferences.

In essence, Taiji offers a more nuanced approach to optimizing rewards, potentially setting a new standard in the industry. But why should this matter to the average user or developer? Because it's about making recommendations not only smarter but more aligned with user needs, leading to enhanced user satisfaction and engagement.

The Commercial Impact

Beyond the technical intricacies, the commercial implications of Taiji's deployment are noteworthy. Since its launch on Kuaishou's platform in May 2026, Taiji hasn't only demonstrated impressive scalability, but it has also contributed significantly to commercial revenue. For an industry relentlessly seeking efficiency and better user engagement, this represents a breakthrough.

One might ask, in a world saturated with recommendation algorithms, do we really need another one? The answer is a resounding yes, if it means creating a more intuitive and effortless experience for users while driving revenue for businesses. According to two people familiar with the negotiations, Taiji is poised to set a new benchmark, challenging existing paradigms and potentially reshaping the future of online recommendations.

Taiji: A New Force in Recommender Systems

The Challenge and the Solution

Redefining Reward Structures

The Commercial Impact

Key Terms Explained