scaleapi/llm-engine

A Python client for hosted LLMs with self-hosting ambitions

An open-source Python client for Scale’s hosted LLM API, with Helm charts for teams that eventually want to bring inference in-house.

★ 834 stars | Python | Inference · Serving | Language Models | LLMOps · Eval

What it does

LLM Engine is a Python library and CLI for interacting with large language models. It offers APIs for completion and fine-tuning against open-source models like LLaMA, MPT, and Falcon, plus streaming inference and dynamic batching. The repository also packages Helm charts for running inference and fine-tuning in your own Kubernetes infrastructure, though the project currently directs new users to Scale’s hosted Spellbook service.
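A minimal sketch of what a hosted completion call might look like, assuming the client is installed with `pip install scale-llm-engine` and reads its key from a `SCALE_API_KEY` environment variable. The model name `llama-2-7b`, the helper names `resolve_api_key` and `complete`, and the default parameter values are illustrative assumptions, not something this page confirms.

```python
import os
from typing import Mapping, Optional


def resolve_api_key(env: Optional[Mapping[str, str]] = None) -> str:
    """Return the Scale API key from the environment, or raise a clear error.

    Hypothetical helper: the variable name SCALE_API_KEY is an assumption.
    """
    env = os.environ if env is None else env
    key = env.get("SCALE_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "SCALE_API_KEY is not set; the hosted quickstart needs a Scale API key."
        )
    return key


def complete(
    prompt: str,
    model: str = "llama-2-7b",  # assumed model name, for illustration only
    max_new_tokens: int = 100,
    temperature: float = 0.2,
) -> str:
    """Send one completion request to the hosted API and return the text."""
    resolve_api_key()  # fail early with a readable message if the key is missing

    # Imported lazily so this module still loads without the package installed.
    from llmengine import Completion

    response = Completion.create(
        model=model,
        prompt=prompt,
        max_new_tokens=max_new_tokens,
        temperature=temperature,
    )
    return response.output.text
```

Calling `complete("Suggest three names for a pancake restaurant.")` would then return the generated text, provided the key is valid and the chosen model is available on the hosted service.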

The interesting bit

The project tries to split the difference between managed SaaS and DIY infrastructure: you can call Scale’s API today with a single Python import and an API key, or in principle deploy the same stack yourself via Helm if you already know your way around a cluster. That dual identity is the whole pitch, even if the self-hosting documentation is still marked “coming soon.”

Key highlights

Python library and CLI for calling open-source models (LLaMA, MPT, Falcon) through a unified API
Fine-tuning and streaming inference with dynamic batching
Hugging Face integration to deploy any compatible model with one command
Helm charts included for Kubernetes self-hosting, though documentation is still pending
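Dynamic batching, mentioned in the highlights above, generally means grouping requests that arrive close together so the model processes them in one pass. The sketch below illustrates the general idea on a list of simulated arrival times; it is not taken from llm-engine's code, and `form_batches`, `max_batch_size`, and `max_wait` are hypothetical names chosen for the example.

```python
from typing import List, Tuple


def form_batches(
    arrivals: List[Tuple[float, str]],
    max_batch_size: int,
    max_wait: float,
) -> List[List[str]]:
    """Group time-ordered (arrival_time, prompt) pairs into batches.

    A batch is closed when it reaches max_batch_size, or when a new request
    arrives after the oldest request in the batch has waited max_wait seconds.
    """
    if max_batch_size < 1:
        raise ValueError("max_batch_size must be at least 1")

    batches: List[List[str]] = []
    current: List[str] = []
    deadline = 0.0  # time by which the open batch must be flushed

    for arrival_time, prompt in arrivals:
        # Flush the open batch if its oldest request has waited too long.
        if current and arrival_time > deadline:
            batches.append(current)
            current = []
        # The first request in a new batch starts the wait timer.
        if not current:
            deadline = arrival_time + max_wait
        current.append(prompt)
        # Flush as soon as the batch is full.
        if len(current) == max_batch_size:
            batches.append(current)
            current = []

    # Whatever is left forms the final, possibly partial, batch.
    if current:
        batches.append(current)
    return batches
```

With `max_batch_size=2` and `max_wait=0.05`, requests arriving at 0.00, 0.01, 0.02, 0.50, and 0.51 seconds fall into three batches: the first two together, the third alone after its wait expires, and the last two together. A real server would make this decision on a timer rather than on the next arrival, but the trade-off between latency (`max_wait`) and throughput (`max_batch_size`) is the same.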

Caveats

The only documented quickstart requires a Scale Spellbook account and API key; self-hosting instructions are explicitly marked “coming soon”
Scale-to-zero, fast cold-start scaling, and cost optimizations are listed as future features, so their current availability in the open-source charts is unclear

Verdict

Grab it if you want a streamlined Python client for Scale’s hosted LLM endpoints and are willing to wait for the self-hosting docs to mature. If you need a fully documented, standalone inference platform on your own infrastructure today, keep looking.

Frequently asked

What is scaleapi/llm-engine? An open-source Python client for Scale’s hosted LLM API, with Helm charts for teams that eventually want to bring inference in-house.
Is llm-engine open source? Yes, scaleapi/llm-engine is open source, released under the Apache-2.0 license.
What language is llm-engine written in? scaleapi/llm-engine is primarily written in Python.
How popular is llm-engine? scaleapi/llm-engine has 834 stars on GitHub.
Where can I find llm-engine? scaleapi/llm-engine is on GitHub at https://github.com/scaleapi/llm-engine.