Low-Rank Adaptation. An efficient fine-tuning method that freezes the original model weights and only trains small adapter matrices. Drastically reduces the compute and memory needed for fine-tuning — you can customize a 70B model on a single GPU. QLoRA adds quantization for even more savings.
LoRA (Low-Rank Adaptation) is a technique that makes fine-tuning large models dramatically cheaper and faster. Instead of updating all billions of parameters during fine-tuning, LoRA freezes the original model and injects small trainable matrices into each layer. These adapters capture task-specific knowledge with a tiny fraction of the parameters — often less than 1% of the original model size.
The math behind LoRA is based on the observation that weight updates during fine-tuning tend to have low rank — meaning they can be decomposed into two small matrices multiplied together. Instead of a huge weight update matrix, you learn two thin matrices whose product approximates the full update. A 7-billion parameter model might need only 10-50 million trainable parameters with LoRA.
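The decomposition above can be sketched in a few lines of NumPy. This is a minimal illustration with hypothetical dimensions (a 1024×1024 layer, rank 8, scaling factor alpha=16), not the implementation from the LoRA paper; the initialization convention shown (A small and random, B zero, so training starts from the unmodified base model) follows common practice.

```python
import numpy as np

# Hypothetical layer: frozen weight W is d_out x d_in; the LoRA update is
# the product of two thin matrices B (d_out x r) and A (r x d_in),
# with rank r much smaller than d_in and d_out.
rng = np.random.default_rng(0)
d_in, d_out, r = 1024, 1024, 8

W = rng.standard_normal((d_out, d_in))      # frozen pretrained weight
A = rng.standard_normal((r, d_in)) * 0.01   # trainable, initialized small
B = np.zeros((d_out, r))                    # trainable, initialized to zero

def lora_forward(x, alpha=16):
    # Base path uses the frozen W; the adapter path adds the low-rank
    # update B @ A, scaled by alpha / r.
    return x @ W.T + (alpha / r) * (x @ A.T @ B.T)

x = rng.standard_normal((1, d_in))
y = lora_forward(x)

# Parameter comparison: a full update to this layer would train
# d_out * d_in values; LoRA trains only r * (d_in + d_out).
full = d_out * d_in           # 1,048,576 trainable values
adapter = r * (d_in + d_out)  # 16,384 trainable values (~1.6%)
```

Because B starts at zero, the adapter initially contributes nothing and the model behaves exactly like the frozen base; gradient updates to A and B then carry all of the task-specific change.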
The practical impact has been massive. Before LoRA, fine-tuning a large model required multiple expensive GPUs and significant memory. With LoRA, you can fine-tune a 7B model on a single consumer GPU with 24GB of VRAM. You can also swap LoRA adapters without reloading the base model — so one server can serve multiple specialized versions. QLoRA (quantized LoRA) pushes this further by combining LoRA with 4-bit quantization, making fine-tuning possible on even more modest hardware.
"We trained a LoRA adapter for our medical chatbot — only 50MB on top of the base model, but it dramatically improved accuracy on clinical terminology."
Fine-tuning: The process of taking a pre-trained model and continuing to train it on a smaller, specific dataset to adapt it for a particular task or domain.
Quantization: Reducing the precision of a model's numerical values, for example from 32-bit floating-point to 4-bit numbers.
Activation function: A mathematical function applied to a neuron's output that introduces non-linearity into the network.
Adam: An optimization algorithm that combines the strengths of two earlier methods, AdaGrad and RMSProp.
AGI: Artificial General Intelligence.
AI agent: An autonomous AI system that can perceive its environment, make decisions, and take actions to achieve goals.