← all repositories

Guitaricet/relora

Official implementation of ReLoRA, a parameter-efficient fine-tuning method for training LLaMA models with low-rank updates.

473 stars Jupyter Notebook Language ModelsML FrameworksLLMOps · Eval
relora
Velocity · 7d
+0.4
★ / day
Trend
steady
star history

This repository contains the official code for the ReLoRA paper, which proposes training high-rank networks through low-rank update matrices. It implements a PEFT (Parameter Efficient Fine-Tuning) approach specifically designed for LLaMA-style transformer models. The method resets low-rank components periodically and scales learning rates to enable full-rank training through efficient parameter updates, with support for distributed training across multiple GPUs.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.