henrywoo/chatllama
Open-source RLHF-based implementation for building ChatGPT-style assistants from LLaMA models.

Velocity (7d): +1.0 ★/day · Trend: steady
This repository provides an open-source implementation for fine-tuning LLaMA models with Reinforcement Learning from Human Feedback (RLHF) to build ChatGPT-style assistants. It supports all LLaMA model sizes (7B to 65B parameters) and includes built-in DeepSpeed ZeRO support to speed up training. The library supports single-GPU inference, and the project claims a training process 15x faster than the original ChatGPT's while remaining cost-effective.
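As a rough illustration of what the DeepSpeed ZeRO acceleration mentioned above involves, the sketch below builds a minimal DeepSpeed-style configuration in Python. The key names follow DeepSpeed's documented JSON configuration schema. The specific values (micro-batch size, ZeRO stage 2, CPU optimizer offload) and the helper name `build_zero_config` are assumptions for illustration, not settings taken from this repository.

```python
import json


def build_zero_config(micro_batch_size: int = 1,
                      stage: int = 2,
                      offload_optimizer: bool = True) -> dict:
    """Hypothetical helper: return a DeepSpeed-style config dict with ZeRO enabled."""
    if stage not in (0, 1, 2, 3):
        raise ValueError("ZeRO stage must be 0, 1, 2, or 3")

    # ZeRO partitions optimizer state (stage 1), gradients (stage 2),
    # and parameters (stage 3) across data-parallel workers.
    zero = {"stage": stage}
    if offload_optimizer and stage >= 1:
        # Assumed choice: keep optimizer state in CPU memory to reduce GPU usage.
        zero["offload_optimizer"] = {"device": "cpu"}

    return {
        "train_micro_batch_size_per_gpu": micro_batch_size,
        "gradient_accumulation_steps": 1,
        "fp16": {"enabled": True},
        "zero_optimization": zero,
    }


config = build_zero_config()
# Serialize to JSON, the format DeepSpeed reads from a config file.
config_json = json.dumps(config, indent=2)
print(config_json)
```

A dict like this is typically written to a JSON file and passed to the training launcher. How this repository wires its own configuration may differ.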