ray-project/kuberay
A Kubernetes operator that manages Ray clusters for distributed ML and deep-learning workloads.

Velocity · 7d
+1.2
★ / day
Trend
→steady
star history
KubeRay provides three custom resource definitions—RayCluster, RayJob, and RayService—to automate the deployment, scaling, and fault-tolerant management of Ray applications on Kubernetes. It handles cluster lifecycle management, autoscaling, and zero-downtime upgrades. The toolkit includes a kubectl plugin for simplified workflows and an APIServer for simplified configuration.