mshumer/gpt-llm-trainer
An automated pipeline that fine-tunes LLaMA 2 and GPT-3.5 models from task descriptions by generating datasets and handling the full training workflow.

The repository provides a notebook-based system that abstracts away the complexity of training task-specific models. Users input a task description, and the system uses Claude 3 or GPT-4 to generate training prompts and responses, creates appropriate system messages, splits the data into training and validation sets, and fine-tunes either LLaMA 2 7B or GPT-3.5 models. The pipeline is designed to streamline the process of going from an idea to a deployed fine-tuned model.