xusenlinzy/api-for-open-llm
Unified OpenAI-style API gateway for serving multiple open-source LLMs including LLaMA, ChatGLM, Qwen, and CodeLLaMA.

Velocity · 7d
+2.2
★ / day
Trend
→steady
star history
This project provides a REST API interface that wraps open-source large language models, offering OpenAI-compatible endpoints for inference. It supports a wide range of models including LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM variants, CodeLLaMA, and SQLCoder. The project includes Docker deployment support and integrates with langchain for building LLM applications.