Is serve open source?

Yes — pytorch/serve is open source, released under the Apache-2.0 license.

What language is serve written in?

pytorch/serve is primarily written in Java.

How popular is serve?

pytorch/serve has 4.3k stars on GitHub.

Where can I find serve?

pytorch/serve is on GitHub at https://github.com/pytorch/serve.

← all repositories

pytorch/serve

The PyTorch model server still works, but nobody is home

Built to serve PyTorch models in production at scale, now frozen in place with no planned updates or security patches.

★4.3k stars Java Inference · Serving LLMOps · Eval

View on GitHub ↗ Homepage ↗

Not currently ranked — collecting fresh signals.

star history

What it does

TorchServe wraps trained PyTorch models behind REST and gRPC endpoints, handling batching, model management, and multi-model workflows. It targets production deployments across diverse hardware—CPUs, GPUs, AWS Inferentia/Trainium, Google Cloud TPUs, and NVIDIA MPS—and integrates with Kubernetes, KServe, SageMaker, and Vertex AI. The project also recently added LLM-serving capabilities via vLLM and TensorRT-LLM backends.

The interesting bit

Despite being the default serving layer for major cloud ML platforms, the project carries a blunt “Limited Maintenance” notice: no planned updates, bug fixes, or security patches. That makes its extensive feature set—complete with Grafana metrics, token authorization, and dynamic batching—something of a time capsule.

Key highlights

Broad hardware support: runs on CPU, GPU, AWS Inferentia/Trn1, Google TPU, and NVIDIA MPS
Native integrations with SageMaker, Vertex AI, KServe, Kubeflow, and MLflow
LLM deployment support through vLLM and TRT-LLM engines with continuous batching
Built-in performance tooling: TorchScript, ONNX, IPEX, TensorRT, and torch.compile previews
Workflow DAGs for chaining multiple interdependent models

Caveats

Explicitly unmaintained: the README states there are “no planned updates, bug fixes, new features, or security patches” and unaddressed vulnerabilities may exist
Token authorization and model API restrictions are now enforced by default, which may break existing unattended deployments

Verdict

Evaluate it if you need a battle-tested, feature-rich PyTorch model server and can accept a frozen codebase. Avoid it for new greenfield projects where active maintenance and security response are non-negotiable.

Frequently asked

What is pytorch/serve?: Built to serve PyTorch models in production at scale, now frozen in place with no planned updates or security patches.
Is serve open source?: Yes — pytorch/serve is open source, released under the Apache-2.0 license.
What language is serve written in?: pytorch/serve is primarily written in Java.
How popular is serve?: pytorch/serve has 4.3k stars on GitHub.
Where can I find serve?: pytorch/serve is on GitHub at https://github.com/pytorch/serve.