LoRA — Low-Rank Decomposition

Advertisement

d 20 r 4

Full ΔW: d×d params. LoRA A·B: 2·d·r params. Savings = d²/(2dr) = d/(2r).

r=8-32 captures most fine-tuning gains. Up to ~250× fewer trainable params.

★ KEY TAKEAWAY

LoRA: replace full ΔW (d×d) with A·B (d×r and r×d). Up to 250× fewer trainable params. Standard fine-tuning recipe in 2026.

▶ WHAT TO TRY