← all repositories

mallorbc/Finetune_LLMs

A repository for fine-tuning causal LLMs with support for LoRA, QLoRA, and DeepSpeed training methods.

465 stars Python Language ModelsML Frameworks
Finetune_LLMs
Velocity · 7d
+0.3
★ / day
Trend
steady
star history

This repository provides code to fine-tune Large Language Models on custom datasets, specifically using LoRA, QLoRA, and DeepSpeed techniques. It supports multiple model architectures including GPT-J, Falcon, LLaMA, LLaMA2, and MPT. The repo includes Docker-based GPU workflows for managing the training environment and a quotes dataset formatted for fine-tuning. It also incorporates code from the finetune-gpt2xl project with modifications to support additional models and training methods.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.