Your guide to understanding AI and machine learning terminology. From transformers and attention to RLHF and fine-tuning — every term explained in plain language.
Agent-to-Agent (A2A): A protocol developed by Google that allows AI agents from different vendors to communicate and collaborate with each other.
Activation Function: A mathematical function applied to a neuron's output that introduces non-linearity into the network.
Adam: An optimization algorithm that combines the best parts of two other methods, AdaGrad and RMSProp.
Agentic AI: AI systems that can autonomously plan, execute multi-step tasks, use tools, and make decisions with minimal human oversight.
AGI: Artificial General Intelligence; a hypothetical AI with human-level ability across virtually all cognitive tasks.
AI Agent: An autonomous AI system that can perceive its environment, make decisions, and take actions to achieve goals.
AI Alignment: The research field focused on making sure AI systems do what humans actually want them to do.
AI Safety: The broad field studying how to build AI systems that are safe, reliable, and beneficial.
Anthropic: An AI safety company founded in 2021 by former OpenAI researchers, including Dario and Daniela Amodei.
Artificial Intelligence (AI): The science of creating machines that can perform tasks requiring human-like intelligence: reasoning, learning, perception, language understanding, and decision-making.
ASI: Artificial Superintelligence; a hypothetical AI that exceeds human ability across all domains.
Attention: A mechanism that lets neural networks focus on the most relevant parts of their input when producing output.
Attention Mechanism: The technique implementing attention: the network learns weights over its input and focuses on the most relevant parts when producing output.
Autoencoder: A neural network trained to compress input data into a smaller representation and then reconstruct it.
Autonomous AI: AI systems capable of operating independently for extended periods without human intervention.
Autoregressive Model: A model that generates output one piece at a time, with each new piece depending on all the previous ones.
Backpropagation: The algorithm that makes neural network training possible: it propagates the loss gradient backward through the layers via the chain rule.
Batch Normalization: A technique that normalizes the inputs to each layer in a neural network, making training faster and more stable.
Batch Size: The number of training examples processed together before the model updates its weights.
Beam Search: A decoding strategy that keeps track of multiple candidate sequences at each step instead of just picking the single best option.
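To make the idea concrete, here is a minimal sketch of beam search in Python over a toy three-token vocabulary. The `PROBS` table is invented for illustration; a real decoder would query a language model for context-dependent probabilities.

```python
import math

# Toy "language model": next-token probabilities for a 3-token vocabulary.
# These numbers are made up for illustration only.
PROBS = {"a": 0.5, "b": 0.3, "c": 0.2}

def beam_search(steps, beam_width=2):
    """Keep the beam_width highest-scoring sequences at every step."""
    beams = [([], 0.0)]  # (token sequence, cumulative log-probability)
    for _ in range(steps):
        candidates = []
        for seq, score in beams:
            for tok, p in PROBS.items():
                candidates.append((seq + [tok], score + math.log(p)))
        # Prune to the best beam_width candidates.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:beam_width]
    return beams

best = beam_search(steps=3, beam_width=2)
```

With a beam width of 1 this degenerates to greedy decoding; wider beams explore more alternatives at higher cost.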
Benchmark: A standardized test used to measure and compare AI model performance.
BERT: Bidirectional Encoder Representations from Transformers; a Google encoder-only transformer model pre-trained with masked language modeling.
Bias: In AI, bias has two meanings: a learnable offset term added to a neuron's weighted input, and systematic unfairness in a model's outputs, often inherited from its training data.
BPE: Byte Pair Encoding; a subword tokenization algorithm that builds a vocabulary by repeatedly merging the most frequent pair of adjacent symbols.
Catastrophic Forgetting: When a neural network trained on new data suddenly loses its ability to perform well on previously learned tasks.
Chain-of-Thought Prompting: A prompting technique where you ask an AI model to show its reasoning step by step before giving a final answer.
Chatbot: An AI system designed to have conversations with humans through text or voice.
Chinchilla: A research paper from DeepMind showing that most large language models were over-sized and under-trained for their compute budgets.
Classification: A machine learning task where the model assigns input data to predefined categories.
Claude: Anthropic's family of AI assistants, including Claude Haiku, Sonnet, and Opus.
CLIP: Contrastive Language-Image Pre-training; an OpenAI model that learns a shared embedding space for images and text.
CNN: Convolutional Neural Network; an architecture that learns convolutional filters, long the workhorse of computer vision.
Compute: The processing power needed to train and run AI models.
Computer Vision: The field of AI focused on enabling machines to interpret and understand visual information from images and video.
Constitutional AI: An approach developed by Anthropic where an AI system is trained to follow a set of principles (a 'constitution') rather than relying solely on human feedback for every decision.
Context Window: The maximum amount of text a language model can process at once, measured in tokens.
Contrastive Learning: A self-supervised learning approach where the model learns by comparing similar and dissimilar pairs of examples.
Conversational AI: AI systems designed for natural, multi-turn dialogue with humans.
Cross-Attention: An attention mechanism where one sequence attends to a different sequence, such as a decoder attending to an encoder's output.
CUDA: NVIDIA's parallel computing platform that lets developers use GPUs for general-purpose computing.
DALL-E: OpenAI's text-to-image generation model.
Data Augmentation: Techniques for artificially expanding training datasets by creating modified versions of existing data.
Data Poisoning: Deliberately corrupting training data to manipulate a model's behavior.
Decoder: The part of a neural network that generates output from an internal representation.
Deep Learning: A subset of machine learning that uses neural networks with many layers (hence 'deep') to learn complex patterns from large amounts of data.
Deepfake: AI-generated media that realistically depicts a person saying or doing something they never actually did.
DeepMind: A leading AI research lab, now part of Google.
Diffusion Model: A generative AI model that creates data by learning to reverse a gradual noising process.
Distillation: A technique where a smaller 'student' model learns to mimic a larger 'teacher' model.
DPO: Direct Preference Optimization; a method for aligning language models with human preferences directly, without training a separate reward model.
Dropout: A regularization technique that randomly deactivates a percentage of neurons during training.
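As a small sketch of the idea, here is inverted dropout in NumPy (the shapes and the drop probability are arbitrary choices for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)

def dropout(x, p=0.5, training=True):
    """Zero each activation with probability p; scale survivors by 1/(1-p)
    (inverted dropout) so the expected activation is unchanged."""
    if not training or p == 0.0:
        return x
    mask = rng.random(x.shape) >= p
    return x * mask / (1.0 - p)

acts = np.ones(10_000)
dropped = dropout(acts, p=0.5)
```

At inference time (`training=False`) the function is a no-op, which is why the survivors are rescaled during training.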
Edge AI: Running AI models directly on local devices (phones, laptops, IoT devices) instead of in the cloud.
Embedding: A dense numerical representation of data (words, images, etc.) as a vector, arranged so that similar items end up close together.
Emergent Abilities: Capabilities that appear suddenly as language models reach certain sizes.
Emergent Capabilities: Capabilities that appear in AI models at scale without being explicitly trained for.
Encoder: The part of a neural network that processes input data into an internal representation.
Encoder-Decoder: A neural network architecture with two parts: an encoder that processes the input into a representation, and a decoder that generates the output from that representation.
Epoch: One complete pass through the entire training dataset.
Ethical AI: The practice of developing AI systems that are fair, transparent, accountable, and respect human rights.
Evaluation: The process of measuring how well an AI model performs on its intended task.
Explainability: The ability to understand and explain why an AI model made a particular decision.
Feature Extraction: The process of identifying and pulling out the most important characteristics from raw data.
Federated Learning: A training approach where the model learns from data spread across many devices without that data ever leaving those devices.
Few-Shot Learning: The ability of a model to learn a new task from just a handful of examples, often provided in the prompt itself.
Fine-Tuning: The process of taking a pre-trained model and continuing to train it on a smaller, specific dataset to adapt it for a particular task or domain.
FlashAttention: An optimized attention algorithm that's mathematically equivalent to standard attention but runs much faster and uses less GPU memory.
Foundation Model: A large AI model trained on broad data that can be adapted for many different tasks.
Function Calling: A capability that lets language models interact with external tools and APIs by generating structured function calls.
GAN: Generative Adversarial Network; two networks, a generator and a discriminator, trained against each other until the generator produces realistic data.
GELU: Gaussian Error Linear Unit; a smooth activation function widely used in transformers.
Gemini: Google's flagship multimodal AI model family, developed by Google DeepMind.
Generative AI: AI systems that create new content (text, images, audio, video, or code) rather than just analyzing or classifying existing data.
GPT: Generative Pre-trained Transformer; OpenAI's family of decoder-only language models.
GPU: Graphics Processing Unit; the massively parallel processor that powers most AI training and inference.
Gradient Accumulation: A technique that simulates larger batch sizes by accumulating gradients over multiple forward passes before updating weights.
Gradient Descent: The fundamental optimization algorithm used to train neural networks: repeatedly nudge the parameters in the direction that most reduces the loss.
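A minimal worked example: gradient descent on the one-parameter loss L(w) = (w - 3)^2, whose minimum is at w = 3. The starting point and learning rate are arbitrary illustration values.

```python
# dL/dw = 2 * (w - 3); step repeatedly against this gradient.

def grad(w):
    return 2.0 * (w - 3.0)

w = 0.0    # initial guess
lr = 0.1   # learning rate (step size)
for _ in range(100):
    w -= lr * grad(w)   # move downhill on the loss surface
```

Training a neural network is the same loop in millions of dimensions, with backpropagation supplying the gradient.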
Grounding: Connecting an AI model's outputs to verified, factual information sources.
Guardrails: Safety measures built into AI systems to prevent harmful, inappropriate, or off-topic outputs.
Hallucination: When an AI model generates confident-sounding but factually incorrect or completely fabricated information.
Hallucination Detection: Methods for identifying when an AI model generates false or unsupported claims.
Hugging Face: The leading platform for sharing and collaborating on AI models, datasets, and applications.
Hyperparameter: A setting you choose before training begins, as opposed to parameters the model learns during training.
Image Classification: The task of assigning a label to an image from a set of predefined categories.
ImageNet: A massive image dataset containing over 14 million labeled images across 20,000+ categories.
In-Context Learning: A model's ability to learn new tasks simply from examples provided in the prompt, without any weight updates.
Inference: Running a trained model to make predictions on new data.
Instruction Tuning: Fine-tuning a language model on datasets of instructions paired with appropriate responses.
Language Model: An AI model that understands and generates human language.
Large Language Model (LLM): An AI model with billions of parameters trained on massive text datasets.
Latent Space: The compressed, internal representation space where a model encodes data.
Layer Normalization: A technique that normalizes activations across the features of each training example, rather than across the batch.
Learning Rate: A hyperparameter that controls how much the model's weights change in response to each update.
Llama: Meta's family of open-weight large language models.
LLM: Short for Large Language Model.
LoRA: Low-Rank Adaptation; a parameter-efficient fine-tuning method that trains small low-rank update matrices instead of modifying all of a model's weights.
Loss Function: A mathematical function that measures how far the model's predictions are from the correct answers.
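Two common loss functions, sketched in plain Python with made-up numbers: mean squared error for regression and cross-entropy for classification.

```python
import math

def mse(preds, targets):
    """Mean squared error: average squared gap between prediction and truth."""
    return sum((p - t) ** 2 for p, t in zip(preds, targets)) / len(preds)

def cross_entropy(probs, target_index):
    """Negative log-probability the model assigned to the correct class."""
    return -math.log(probs[target_index])

mse_val = mse([2.5, 0.0], [3.0, -0.5])      # (0.25 + 0.25) / 2 = 0.25
ce_val = cross_entropy([0.7, 0.2, 0.1], 0)  # penalty shrinks as p(correct) -> 1
```

Both return 0 for a perfect prediction and grow as predictions drift from the truth, which is exactly what gradient descent needs to minimize.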
LSTM: Long Short-Term Memory; a recurrent architecture whose gating mechanism lets it retain information over long sequences.
Machine Learning: A branch of AI where systems learn patterns from data instead of following explicitly programmed rules.
Masked Language Modeling: A pre-training technique where random words in text are hidden (masked) and the model learns to predict them from context.
MCP (Model Context Protocol): An open standard created by Anthropic that lets AI models connect to external tools, data sources, and APIs through a unified interface.
Meta-Learning: Training models that learn how to learn: after training on many tasks, they can quickly adapt to new tasks with very little data.
Midjourney: A popular AI image generation service known for its distinctive artistic style.
Mistral AI: A French AI company that builds efficient, high-performance language models.
Mixture of Experts (MoE): An architecture where multiple specialized sub-networks (experts) share a model, but only a few activate for each input.
MMLU: Massive Multitask Language Understanding; a benchmark spanning 57 subjects used to compare language models.
Model Collapse: A degradation that happens when AI models are trained on data generated by other AI models.
Multi-Head Attention: An extension of the attention mechanism that runs multiple attention operations in parallel, each with different learned projections.
Multimodal AI: AI models that can understand and generate multiple types of data: text, images, audio, video.
Narrow AI: AI systems designed for a specific task, as opposed to general intelligence.
Natural Language Processing (NLP): The field of AI focused on enabling computers to understand, interpret, and generate human language.
Neural Network: A computing system loosely inspired by biological brains, consisting of interconnected nodes (neurons) organized in layers.
Next-Token Prediction: The fundamental task that language models are trained on: given a sequence of tokens, predict what comes next.
NLP: Short for Natural Language Processing.
NVIDIA: The dominant provider of AI hardware.
Object Detection: A computer vision task that identifies and locates objects within an image, drawing bounding boxes around each one.
Open-Source AI: AI models whose weights, code, and sometimes training data are publicly released for anyone to use, modify, and build upon.
OpenAI: The AI company behind ChatGPT, GPT-4, DALL-E, and Whisper.
Optimization: The process of finding the best set of model parameters by minimizing a loss function.
Overfitting: When a model memorizes the training data so well that it performs poorly on new, unseen data.
Parameter: A value the model learns during training; specifically, the weights and biases in neural network layers.
Perplexity: A measurement of how well a language model predicts text: the exponential of the average negative log-likelihood per token, so lower is better.
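The computation is short enough to show directly. The per-token probabilities below are invented for illustration; in practice they come from a model evaluated on held-out text.

```python
import math

# Probabilities a model assigned to the actual next tokens (made-up numbers).
token_probs = [0.25, 0.5, 0.125]

# Perplexity = exp(mean negative log-likelihood per token).
nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
perplexity = math.exp(nll)
```

A perplexity of 4 means the model was, on average, as uncertain as if it were choosing uniformly among 4 tokens at each step.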
Positional Encoding: Information added to token embeddings to tell a transformer the order of elements in a sequence.
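One classic scheme is the sinusoidal encoding from the original transformer paper, sketched here in NumPy (the sequence length and model width are arbitrary illustration values):

```python
import numpy as np

def sinusoidal_positions(seq_len, d_model):
    """Even dimensions get sine, odd dimensions cosine, at geometrically
    spaced wavelengths, giving every position a unique pattern."""
    pos = np.arange(seq_len)[:, None]        # (seq_len, 1)
    i = np.arange(d_model // 2)[None, :]     # (1, d_model/2)
    angles = pos / (10000 ** (2 * i / d_model))
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angles)
    pe[:, 1::2] = np.cos(angles)
    return pe

pe = sinusoidal_positions(seq_len=16, d_model=8)
```

These vectors are simply added to the token embeddings before the first attention layer; many modern models instead learn positions or use rotary embeddings (RoPE).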
Pre-training: The initial, expensive phase of training where a model learns general patterns from a massive dataset.
Prompt Engineering: The art and science of crafting inputs to AI models to get the best possible outputs.
Prompt: The text input you give to an AI model to direct its behavior.
PyTorch: The most popular deep learning framework, developed by Meta.
RAG: Retrieval-Augmented Generation; retrieving relevant documents and adding them to the prompt so the model can ground its answer in them.
Reasoning: The ability of AI models to draw conclusions, solve problems logically, and work through multi-step challenges.
Reasoning Models: AI systems specifically designed to "think" through problems step-by-step before giving an answer.
Recurrent Neural Network (RNN): A neural network architecture where connections form loops, letting the network maintain a form of memory across sequences.
Red Teaming: Systematically testing an AI system by trying to make it produce harmful, biased, or incorrect outputs.
Regression: A machine learning task where the model predicts a continuous numerical value.
Regularization: Techniques that prevent a model from overfitting by adding constraints during training.
Reinforcement Learning: A learning approach where an agent learns by interacting with an environment and receiving rewards or penalties.
ReLU: Rectified Linear Unit; an activation function that outputs its input when positive and zero otherwise.
Representation Learning: The idea that useful AI comes from learning good internal representations of data.
Responsible AI: The practice of developing and deploying AI systems with careful attention to fairness, transparency, safety, privacy, and social impact.
Reward Model: A model trained to predict how helpful, harmless, and honest a response is, based on human preferences.
RLHF: Reinforcement Learning from Human Feedback; fine-tuning a model with reinforcement learning against a reward model trained on human preference data.
RNN: Short for Recurrent Neural Network.
RoPE: Rotary Position Embedding; a technique that encodes token positions by rotating query and key vectors in attention.
Sampling: The process of selecting the next token from the model's predicted probability distribution during text generation.
Scaling Laws: Mathematical relationships showing how AI model performance improves predictably with more data, compute, and parameters.
Self-Attention: An attention mechanism where a sequence attends to itself: each element looks at all other elements to understand relationships.
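A compact sketch of single-head scaled dot-product self-attention in NumPy. The token count, embedding width, and random projection matrices are stand-ins; a trained model learns `wq`, `wk`, `wv`.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def self_attention(x, wq, wk, wv):
    """Every position queries every other; outputs are weighted sums of values."""
    q, k, v = x @ wq, x @ wk, x @ wv
    scores = q @ k.T / np.sqrt(k.shape[-1])  # pairwise similarity, scaled
    weights = softmax(scores)                # each row is a distribution
    return weights @ v, weights

rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))                  # 4 tokens, 8-dim embeddings
wq, wk, wv = (rng.normal(size=(8, 8)) for _ in range(3))
out, attn = self_attention(x, wq, wk, wv)
```

Multi-head attention runs several copies of this in parallel with different projections and concatenates the results.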
Self-Supervised Learning: A training approach where the model creates its own labels from the data itself.
Semantic Search: Search that understands meaning and intent rather than just matching keywords.
Sentiment Analysis: Automatically determining whether a piece of text expresses positive, negative, or neutral sentiment.
Softmax: A function that converts a vector of numbers into a probability distribution: all values between 0 and 1 that sum to 1.
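The function is a one-liner in practice; this sketch uses the standard max-subtraction trick for numerical stability (the input numbers are arbitrary):

```python
import math

def softmax(xs):
    """Exponentiate and normalize; subtracting max(xs) first avoids overflow."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

probs = softmax([2.0, 1.0, 0.1])
```

Language models apply exactly this to their output logits to get next-token probabilities.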
Speech-to-Text: Converting spoken audio into written text.
Stable Diffusion: An open-source image generation model released by Stability AI.
Structured Output: Getting a language model to generate output in a specific format like JSON, XML, or a database schema.
Supervised Learning: The most common machine learning approach: training a model on labeled data where each example comes with the correct answer.
Synthetic Data: Artificially generated data used for training AI models.
System Prompt: Instructions given to an AI model that define its role, personality, constraints, and behavior rules.
Temperature: A parameter that controls the randomness of a language model's output.
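Mechanically, temperature just divides the logits before softmax. A small sketch with invented logits shows the effect:

```python
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def apply_temperature(logits, temperature):
    """T < 1 sharpens the distribution toward the top token;
    T > 1 flattens it toward uniform."""
    return softmax([x / temperature for x in logits])

logits = [2.0, 1.0, 0.0]
cold = apply_temperature(logits, 0.5)  # more deterministic
hot = apply_temperature(logits, 2.0)   # more diverse
```

As T approaches 0, sampling becomes greedy decoding; very high T makes every token nearly equally likely.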
TensorFlow: Google's open-source deep learning framework.
Text-to-Image: AI models that generate images from text descriptions.
Text-to-Speech: AI systems that convert written text into natural-sounding spoken audio.
Token: The basic unit of text that language models work with.
Tokenizer: The component that converts raw text into tokens that a language model can process.
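A toy word-level tokenizer illustrates the interface (text in, integer IDs out, and back). Real systems use subword schemes like BPE, but the `ToyTokenizer` class here, its vocabulary, and the sample sentence are all invented for illustration.

```python
class ToyTokenizer:
    def __init__(self, corpus):
        # Assign one ID per unique word, in sorted order for determinism.
        words = sorted(set(corpus.split()))
        self.vocab = {w: i for i, w in enumerate(words)}
        self.inverse = {i: w for w, i in self.vocab.items()}

    def encode(self, text):
        return [self.vocab[w] for w in text.split()]

    def decode(self, ids):
        return " ".join(self.inverse[i] for i in ids)

tok = ToyTokenizer("the cat sat on the mat")
ids = tok.encode("the cat sat")
roundtrip = tok.decode(ids)
```

Subword tokenizers exist precisely because a word-level scheme like this one cannot handle words outside its vocabulary.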
Tool Use: The ability of AI models to interact with external tools and systems: browsing the web, running code, querying APIs, reading files.
Top-P Sampling: A text generation method (also called nucleus sampling) that samples from only the smallest set of most-probable tokens whose cumulative probability reaches a threshold P.
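A minimal sketch of nucleus filtering, using an invented four-token probability table:

```python
import random

def top_p_filter(probs, p):
    """probs: {token: probability}. Keep the highest-probability tokens until
    their cumulative mass reaches p, then renormalize that nucleus."""
    nucleus, cum = {}, 0.0
    for tok, prob in sorted(probs.items(), key=lambda kv: kv[1], reverse=True):
        nucleus[tok] = prob
        cum += prob
        if cum >= p:
            break
    total = sum(nucleus.values())
    return {tok: prob / total for tok, prob in nucleus.items()}

probs = {"the": 0.5, "a": 0.3, "dog": 0.15, "zebra": 0.05}
nucleus = top_p_filter(probs, p=0.9)
token = random.choices(list(nucleus), weights=list(nucleus.values()))[0]
```

Here "zebra" falls outside the 0.9 nucleus and can never be sampled, which is how top-p trims the unreliable low-probability tail while keeping diversity among plausible tokens.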
TPU: Tensor Processing Unit; Google's custom accelerator chip for machine learning workloads.
Training: The process of teaching an AI model by exposing it to data and adjusting its parameters to minimize errors.
Transfer Learning: Using knowledge learned from one task to improve performance on a different but related task.
Transformer: The neural network architecture behind virtually all modern AI language models.
Turing Test: A test proposed by Alan Turing in 1950: if a human can't reliably tell whether they're talking to a machine or another human, the machine passes.
VAE: Variational Autoencoder; an autoencoder that learns a probabilistic latent space, so new data can be generated by sampling from it.
Vector Database: A database optimized for storing and searching high-dimensional vectors (embeddings).
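At its core, a vector search is a nearest-neighbor query by similarity. This brute-force NumPy sketch uses tiny 3-dimensional made-up "embeddings"; production systems use hundreds of dimensions and approximate indexes, but the idea is the same.

```python
import numpy as np

def cosine_search(query, vectors, top_k=2):
    """Return the indices and cosine similarities of the top_k nearest vectors."""
    q = query / np.linalg.norm(query)
    m = vectors / np.linalg.norm(vectors, axis=1, keepdims=True)
    sims = m @ q                       # cosine similarity to every stored vector
    order = np.argsort(-sims)[:top_k]  # highest similarity first
    return order, sims[order]

db = np.array([[1.0, 0.0, 0.0],
               [0.9, 0.1, 0.0],
               [0.0, 1.0, 0.0]])
idx, scores = cosine_search(np.array([1.0, 0.05, 0.0]), db)
```

This is the retrieval step behind semantic search and RAG: embed the query, find the closest stored embeddings, return their source documents.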
Vision Transformer (ViT): A transformer architecture adapted for image processing.
Voice Cloning: Using AI to create a synthetic copy of someone's voice from a small sample of their speech.
Weight: A numerical value in a neural network that determines the strength of the connection between neurons.
Whisper: OpenAI's open-source speech recognition model.
Word2Vec: One of the earliest successful word embedding models, from Google in 2013.
World Model: An AI system's internal representation of how the world works: understanding physics, cause and effect, and spatial relationships.