microsoft/LoRA
An implementation of Low-Rank Adaptation (LoRA) for efficiently fine-tuning large language models by learning rank-decomposition matrices.

This repository contains loralib, a Python package implementing LoRA for adapting large language models. LoRA freezes the original weights and learns pairs of rank-decomposition matrices, which reduces the number of trainable parameters. Because only these small matrices need to be stored per task, storage requirements drop sharply and task-switching during deployment is efficient; the learned update can be merged into the frozen weights, so inference incurs no added latency. loralib integrates with PyTorch models, including those from Hugging Face, and supports RoBERTa and DeBERTa architectures.
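To make the idea above concrete, here is a minimal NumPy sketch of a single linear layer with a low-rank update. It is not loralib's API: the layer shape, the rank `r`, the scaling factor `alpha`, and the helper names `lora_forward` and `merge` are all assumptions chosen for illustration. It shows how few parameters the low-rank pair adds and why folding the update into the frozen weight leaves the output unchanged.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical layer shape, rank, and scaling factor (illustrative values only).
d_out, d_in, r = 64, 128, 4
alpha = 8
scale = alpha / r

# Frozen pretrained weight; random values stand in for a real checkpoint.
W = rng.standard_normal((d_out, d_in))

# Trainable low-rank pair. B starts at zero, so the adapted layer
# initially computes exactly what the frozen layer computes.
A = rng.standard_normal((r, d_in)) * 0.01
B = np.zeros((d_out, r))


def lora_forward(x, W, A, B, scale):
    """Frozen path plus the scaled low-rank path, kept separate (training-time form)."""
    return x @ W.T + scale * (x @ A.T @ B.T)


def merge(W, A, B, scale):
    """Fold the low-rank update into a single dense weight (deployment-time form)."""
    return W + scale * (B @ A)


x = rng.standard_normal((5, d_in))

# Before any training, the adapter contributes nothing.
y_frozen = x @ W.T
y_initial = lora_forward(x, W, A, B, scale)

# Stand-in for training: only A and B would change, W stays frozen.
B = rng.standard_normal((d_out, r)) * 0.01

y_unmerged = lora_forward(x, W, A, B, scale)
y_merged = x @ merge(W, A, B, scale).T  # one matmul, same cost as the original layer

full_params = W.size            # 64 * 128 = 8192
lora_params = A.size + B.size   # 4 * 128 + 64 * 4 = 768

print(f"trainable parameters: {lora_params} of {full_params}")
print("merged output matches:", np.allclose(y_unmerged, y_merged))
```

In this sketch, switching tasks would amount to swapping one `(A, B)` pair for another while `W` stays shared, which is why only the small matrices need to be stored per task.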