← all repositories
kubeflow/arena

A CLI that hides Kubernetes from data scientists

Arena lets ML researchers submit distributed TensorFlow jobs without learning kubectl first.

811 stars Go LLMOps · Eval
arena
Velocity · 7d
+0.3
★ / day
Trend
steady
star history

What it does

Arena is a Go CLI that wraps Kubernetes, Helm, and Kubeflow into commands data scientists already understand: submit, list, logs, top. It handles solo and distributed TensorFlow training jobs, plus a top command for checking GPU availability across the cluster. The pitch is single-machine ergonomics backed by cluster-scale hardware.

The interesting bit

The project bets that “very little knowledge about kubernetes” is a feature, not a bug. That’s a harder abstraction to maintain than it sounds — every leaked pod name or RBAC error undermines the illusion.

Key highlights

  • Submit and monitor TensorFlow training without writing YAML
  • GPU resource visibility via arena top
  • Distributed training support out of the box
  • CPU profiling built in (--pprof flag)
  • Separate Chinese documentation maintained

Caveats

  • TensorFlow-only for training frameworks; PyTorch, JAX, etc. not mentioned in README
  • Go 1.8+ prerequisite suggests the build docs may be stale
  • 811 stars is modest for the Kubeflow ecosystem; check if active maintenance matches your timeline

Verdict

Worth a look if your team is drowning in Helm charts and your data scientists refuse to learn them. Skip if you need multi-framework flexibility or already have a polished internal platform.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.