Introduction
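Training large AI models from scratch is very expensive and slow, and even fine-tuning every weight of an existing model is costly. LoRA (Low-Rank Adaptation) and QLoRA (Quantized LoRA) make fine-tuning faster and cheaper. LoRA freezes the original weights and trains only a small set of added low-rank matrices. QLoRA does the same while also storing the frozen weights in a compressed 4-bit format to save memory.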
Imagine you have a huge book that contains all the knowledge you need. Instead of rewriting the whole book to add new information, you just add small sticky notes with updates. LoRA is like adding these sticky notes. QLoRA is like also keeping the book itself in a compact edition so it takes less shelf space, while the sticky notes stay at full detail.
┌─────────────────────────────┐
│       Large AI Model        │
│     ┌────────────────┐      │
│     │ Original Model │      │
│     └────────────────┘      │
│              │              │
│              ▼              │
│   ┌─────────────────────┐   │
│   │    LoRA Matrices    │   │
│   │  (small additions)  │   │
│   └─────────────────────┘   │
│              │              │
│              ▼              │
│   ┌─────────────────────┐   │
│   │  QLoRA Compression  │   │
│   │ (smaller data size) │   │
│   └─────────────────────┘   │
└─────────────────────────────┘
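To make the sticky-note picture concrete, here is a minimal sketch in plain Python. It is not taken from any library. The layer size (4096 x 4096), the rank (8), and the helper names lora_param_count, weight_bytes, and lora_forward are assumptions chosen for illustration, and the memory estimate ignores the small overhead that real 4-bit quantization adds for its scaling constants.

```python
def lora_param_count(d_out, d_in, rank):
    """Trainable values in a LoRA update B @ A (B: d_out x rank, A: rank x d_in)."""
    return d_out * rank + rank * d_in


def weight_bytes(num_params, bits_per_param):
    """Approximate storage for num_params weights at the given bit width."""
    return num_params * bits_per_param // 8


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of numbers)."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def lora_forward(W, A, B, x, scale=1.0):
    """Frozen base output W @ x plus the low-rank correction scale * B @ (A @ x)."""
    base = matvec(W, x)                # the "book": never modified
    update = matvec(B, matvec(A, x))   # the "sticky notes": the only trained part
    return [b + scale * u for b, u in zip(base, update)]


# Hypothetical layer size and rank, for illustration only.
d_out, d_in, rank = 4096, 4096, 8
full_params = d_out * d_in
lora_params = lora_param_count(d_out, d_in, rank)

print(f"full fine-tuning trains {full_params:,} values")  # 16,777,216
print(f"LoRA trains {lora_params:,} values")              # 65,536
print(f"fraction trained: {lora_params / full_params:.2%}")

# QLoRA-style storage of the frozen base weights: 4 bits instead of 16.
base_16bit = weight_bytes(full_params, 16)
base_4bit = weight_bytes(full_params, 4)
print(f"base weights: {base_16bit:,} bytes at 16-bit, {base_4bit:,} bytes at 4-bit")

# Tiny worked example: a 2x2 frozen weight with a rank-1 update.
W = [[1, 2], [3, 4]]
A = [[1, 1]]        # rank x d_in
B = [[1], [2]]      # d_out x rank
x = [1, 1]
y = lora_forward(W, A, B, x)
```

For this hypothetical layer, LoRA trains well under 1% of the values that full fine-tuning would, and storing the frozen weights in 4 bits takes a quarter of the space of 16-bit storage. If B is all zeros, lora_forward returns exactly the base model's output, so an adapter that starts that way changes nothing until it is trained.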