alpa-projects/alpa
Compiler system that automates distributed training and serving of large neural networks across compute clusters.

Velocity · 7d
+1.6
★ / day
Trend
→steady
star history
Alpa provides automatic parallelization for single-device neural network code across distributed clusters, supporting data, operator, and pipeline parallelism. It integrates with Jax, XLA, and Ray to deliver linear scaling for models with billions of parameters. The project offers a Hugging Face transformers-compatible interface for large model inference on distributed backends.