← all repositories

minimaxir/aitextgen

A Python package for fine-tuning and generating text with GPT-2 and GPT Neo models using PyTorch and Hugging Face Transformers.

1.8k stars Python Language ModelsML Frameworks
aitextgen
Velocity · 7d
+0.8
★ / day
Trend
steady
star history

The package provides tools for fine-tuning pretrained GPT-2 (124M-774M) and GPT Neo (125M-350M) models on custom datasets, or training from scratch. It leverages pytorch-lightning for distributed training across multiple GPUs with optimized memory usage and fast generation. The project also supports model upload/download from the HuggingFace model hub and includes caching, dataset merging, and cross-training capabilities.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.