← all repositories

mshumer/gpt-llm-trainer

An automated pipeline that fine-tunes LLaMA 2 and GPT-3.5 models from task descriptions by generating datasets and handling the full training workflow.

4.2k stars Jupyter Notebook ML FrameworksLanguage ModelsLLMOps · Eval
gpt-llm-trainer
Velocity · 7d
+4.0
★ / day
Trend
steady
star history

The repository provides a notebook-based system that abstracts away the complexity of training task-specific models. Users input a task description, and the system uses Claude 3 or GPT-4 to generate training prompts and responses, creates appropriate system messages, splits the data into training and validation sets, and fine-tunes either LLaMA 2 7B or GPT-3.5 models. The pipeline is designed to streamline the process of going from an idea to a deployed fine-tuned model.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.