pytorch/serve
PyTorch's official framework for serving and scaling deep learning models in production environments.

Velocity (7d): +1.8 ★/day · Trend: steady
TorchServe is a production model serving tool from the PyTorch project that handles model deployment, scaling, and inference. It exposes a REST API for predictions, runs on CPU or GPU, and integrates with Kubernetes for containerized deployments. It also ships with built-in metrics, batch inference, and model versioning for operating models in production.
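As a rough illustration of the REST prediction API mentioned above, the sketch below builds a prediction URL and posts a payload to it. It assumes a TorchServe instance on its default inference port (8080) and the `/predictions/{model_name}` route; the model name `resnet18`, the file `kitten.jpg`, and the helper names `prediction_url` and `predict` are hypothetical, chosen only for the example.

```python
import urllib.request


def prediction_url(model_name, version=None, host="localhost", port=8080):
    """Build the inference URL for a model, optionally pinned to a version.

    Assumes the default inference port (8080); adjust if the server
    is configured differently.
    """
    base = f"http://{host}:{port}/predictions/{model_name}"
    return f"{base}/{version}" if version else base


def predict(model_name, payload, version=None, timeout=10.0):
    """POST raw bytes (for example an image file) and return the raw response body."""
    request = urllib.request.Request(
        prediction_url(model_name, version),
        data=payload,
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()


if __name__ == "__main__":
    # Hypothetical usage: requires a running server with a model
    # registered under the name "resnet18".
    with open("kitten.jpg", "rb") as f:
        print(predict("resnet18", f.read()))
```

Passing a `version` pins the request to one registered version of the model, which is how the versioning capability would typically be exercised from a client; leaving it out lets the server pick its default version.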