A computing system loosely inspired by biological brains, consisting of interconnected nodes (neurons) organized in layers.
A computing system loosely inspired by biological brains, consisting of interconnected nodes (neurons) organized in layers. Each connection has a weight that's adjusted during training. Simple in concept but capable of learning incredibly complex patterns when scaled up. The foundation of modern AI.
A neural network is a computing system loosely inspired by biological brains, built from layers of interconnected nodes (neurons) that process information. Each connection has a weight that determines its strength, and the network learns by adjusting these weights during training. Modern AI — from image recognition to language models — is built on neural networks.
The architecture varies by task. Convolutional neural networks (CNNs) excel at image processing by detecting spatial patterns. Recurrent neural networks (RNNs) were used for sequential data like text before transformers took over. Transformers, the architecture behind modern LLMs, use attention mechanisms to process sequences in parallel. All of these are neural networks — they just wire their neurons differently.
What makes neural networks powerful is their ability to learn complex patterns from data without being explicitly programmed. You don't tell a neural network the rules for recognizing cats — you show it millions of cat photos and it figures out the patterns. This learning ability, combined with massive scale and enormous datasets, is what produced the AI revolution we're living through. The networks used today have billions or trillions of parameters, far beyond anything researchers imagined when the field started decades ago.
"At its core, every LLM is just a very large neural network — billions of simple mathematical operations that, together, produce remarkably intelligent-seeming behavior."
A subset of machine learning that uses neural networks with many layers (hence 'deep') to learn complex patterns from large amounts of data.
The algorithm that makes neural network training possible.
A mathematical function applied to a neuron's output that introduces non-linearity into the network.
An optimization algorithm that combines the best parts of two other methods — AdaGrad and RMSProp.
Artificial General Intelligence.
An autonomous AI system that can perceive its environment, make decisions, and take actions to achieve goals.
Browse our complete glossary or subscribe to our newsletter for the latest AI news and insights.