vllm-project/aibrix
AIBrix provides infrastructure components for cost-efficient GenAI inference deployment and scaling.

Velocity · 7d
+6.7
★ / day
Trend
→steady
star history
AIBrix is an open-source, cloud-native platform for deploying and managing large language model inference at enterprise scale. It offers pluggable infrastructure components designed to optimize resource utilization and reduce operational costs for generative AI workloads. The project includes gateway routing, load balancing, and scaling mechanisms specifically tuned for LLM serving.