← all repositories

henrywoo/chatllama

Open-source RLHF-based implementation for building ChatGPT-style assistants from LLaMA models.

chatllama
Velocity · 7d
+1.0
★ / day
Trend
steady
star history

This repository provides a complete open-source implementation for fine-tuning LLaMA models using Reinforcement Learning from Human Feedback (RLHF) to create ChatGPT-style assistants. It supports all LLaMA model sizes (7B to 65B parameters) with built-in DeepSpeed ZERO acceleration for faster training. The library enables single-GPU inference and claims 15x faster training compared to the original ChatGPT while maintaining cost-effectiveness.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.