A value the model learns during training — specifically, the weights and biases in neural network layers. When we say GPT-4 reportedly has more than a trillion parameters, we mean that many individual numbers were learned. More parameters generally means more capacity to learn complex patterns, but also greater compute and memory costs.
Parameters are the internal numerical values that a neural network adjusts during training to learn patterns. Think of them as the model's "knowledge" encoded as numbers. When people say GPT-4 has trillions of parameters or LLaMA has 70 billion, they're talking about how many adjustable values the model contains.
Each parameter is a single number — a weight on a connection between neurons or a bias term. During training, the model processes examples and adjusts these numbers to minimize errors. After training, the parameters are fixed and determine how the model responds to any input. More parameters generally means more capacity to learn complex patterns, which is why the AI industry has been racing to build larger models.
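Because each connection carries one weight and each neuron one bias, a network's parameter count follows directly from its layer sizes. A minimal sketch for a fully connected network (the layer sizes here are illustrative, not taken from any real model):

```python
# Count learnable parameters (weights + biases) in a fully connected network.
# layer_sizes lists the number of neurons in each layer, input to output.
def count_params(layer_sizes):
    total = 0
    for n_in, n_out in zip(layer_sizes, layer_sizes[1:]):
        total += n_in * n_out  # one weight per connection between the layers
        total += n_out         # one bias per neuron in the receiving layer
    return total

print(count_params([784, 128, 10]))  # → 101770
```

Even this toy three-layer network has over 100,000 parameters; billion-parameter language models apply the same counting to vastly larger layers.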
But parameter count alone doesn't determine model quality. Training data quality, architecture choices, and training techniques all matter enormously. A well-trained 7B parameter model (like Mistral 7B) can outperform a poorly trained 13B model. And some researchers argue we've been training models that are too large for their data — the Chinchilla scaling laws suggest that many models should have been smaller but trained on more data. Parameter count is one dimension of capability, not the whole story.
"LLaMA 3 comes in 8B and 70B parameter versions — the larger one is smarter but needs significantly more GPU memory to run."
Weight: A numerical value in a neural network that determines the strength of the connection between neurons.
Training: The process of teaching an AI model by exposing it to data and adjusting its parameters to minimize errors.
Hyperparameter: A setting you choose before training begins, as opposed to parameters the model learns during training.
Activation function: A mathematical function applied to a neuron's output that introduces non-linearity into the network.
Adam: An optimization algorithm that combines the strengths of two other methods, AdaGrad and RMSProp.
AGI: Artificial General Intelligence.