← all repositories
unslothai/unsloth

A local LLM workshop that actually fits on your GPU

Unsloth Studio wraps training, inference, and RL into a single web UI with aggressive memory optimizations.

unsloth
Velocity · 7d
+72
★ / day
Trend
steady
star history

What it does Unsloth Studio is a browser-based control panel for running and fine-tuning open models locally. It handles model search, download, chat, export, and training—including reinforcement learning and multimodal tasks—through a visual interface. A code-first “Unsloth Core” variant exists for script addicts.

The interesting bit The project leans hard into kernel-level optimization: custom Triton kernels, FP8 support, and “padding free + packing” batching that claims 7× longer context windows for RL compared to standard setups. They also collaborate directly with model teams (Gemma, Qwen, Llama, Mistral) to fix upstream bugs before they hit users.

Key highlights

  • Supports 500+ models with claimed 2× training speedup and up to 70% VRAM reduction
  • One-liner install via curl/powershell on Windows, Linux, macOS, and WSL
  • Built-in “Data Recipes” auto-generate training datasets from PDFs, CSVs, DOCX via a node workflow
  • Self-healing tool calling, code execution sandbox, and API endpoint for Claude Code/Codex integration
  • Multi-GPU training supported; AMD/Intel GPU support requires falling back to Core version

Caveats

  • Studio training on AMD GPUs is “out soon”; current AMD support is chat and data recipes only
  • macOS training runs but relies on MLX/GGUF paths, not the full CUDA stack
  • Multi-GPU has a “major upgrade on the way,” suggesting current implementation is functional but not final

Verdict Worth a spin if you want to fine-tune or run local LLMs without writing PyTorch boilerplate or renting A100s. Skip it if you need mature enterprise orchestration or are allergic to beta software.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.