A model that generates output one piece at a time, with each new piece depending on all the previous ones. GPT and other large language models work this way — they predict the next token based on everything that came before it. Great for text generation, but inherently sequential.
Autoregressive models generate output one piece at a time, where each new piece depends on everything that came before it. In language models, this means predicting one token, appending it to the sequence, then predicting the next token, and so on. It's like writing a sentence where each word choice constrains what comes next.
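The predict-append-repeat loop described above can be sketched in a few lines of Python. This is a minimal illustration, not a real language model: a toy bigram table (counting which word follows which in a tiny corpus) stands in for the neural network, but the autoregressive structure of the loop is the same.

```python
import random

# Toy corpus used to build a bigram "model": for each token,
# record which tokens have been observed to follow it.
corpus = "the cat sat on the mat the cat ran".split()
follows = {}
for prev, nxt in zip(corpus, corpus[1:]):
    follows.setdefault(prev, []).append(nxt)

def generate(start, max_tokens=6, seed=0):
    """Autoregressive loop: predict one token, append it, repeat."""
    random.seed(seed)
    tokens = [start]
    for _ in range(max_tokens):
        candidates = follows.get(tokens[-1])
        if not candidates:  # no known continuation: stop early
            break
        # "Predict" the next token given what came before, then append it.
        tokens.append(random.choice(candidates))
    return " ".join(tokens)

print(generate("the"))
```

A real model replaces the bigram lookup with a neural network that conditions on the entire sequence so far, but the generation loop is identical, which is why the sequential bottleneck discussed below applies to both.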
This approach is what makes chatbots feel like they're "thinking" as they type — they really are generating one token at a time. GPT, Claude, LLaMA, and virtually every modern language model works this way. The tradeoff is speed: because each token depends on all the previous ones, generation can't easily be parallelized. That's why responses take time to stream in.
The alternative approaches — like masked language models (BERT) or diffusion models — work differently. BERT predicts missing words in the middle of sentences (good for understanding, not great for generation). Diffusion models start with noise and refine it into output. But for open-ended text generation, autoregressive models still dominate because they naturally capture the left-to-right flow of language.
"Claude is an autoregressive model — it generates each word based on everything before it, which is why you see responses appear token by token."
The neural network architecture behind virtually all modern AI language models.
The fundamental task that language models are trained on: given a sequence of tokens, predict what comes next.
An AI model that understands and generates human language.
A mathematical function applied to a neuron's output that introduces non-linearity into the network.
An optimization algorithm that combines the strengths of two earlier methods, AdaGrad and RMSProp.
Artificial General Intelligence: a hypothetical AI system with human-level competence across a wide range of tasks.