A pattern book for ML that outgrew your laptop
Companion repo for Yuan Tang's Manning book on scaling machine learning systems with Kubernetes, Kubeflow, and Argo.

What it does This repository holds the code and references for Distributed Machine Learning Patterns, a Manning book by Yuan Tang (Senior Principal Engineer at Red Hat AI, maintainer in Argo, Kubeflow, and Kubernetes communities). It covers taking ML models from a single laptop to distributed Kubernetes clusters—data ingestion, distributed training, model serving, workflow automation, and monitoring.
The interesting bit The patterns angle is the hook. Rather than tool documentation, Tang frames each chapter around recurring problems—distributed training, failure handling, dynamic serving traffic—and walks through trade-offs for each approach. The author credentials are unusually deep: he’s not just writing about these tools, he’s maintaining them.
Key highlights
- Covers TensorFlow, Kubernetes, Kubeflow, and Argo Workflows from a practitioner’s view
- Each pattern includes real-world scenarios and explicit trade-off discussions
- Target reader: data scientist or engineer who knows Bash, Python, Docker, and basic ML already
- Also available in Korean and Chinese translations
- Tang has authored three technical books and holds leadership roles in multiple CNCF projects
Caveats
- The repo itself appears to be primarily a landing page and reference collection; the actual code volume and structure aren’t visible from the README alone
- Published 2023 (based on ISBN and Manning listing), so some Kubernetes/ML tool versions may have moved on
Verdict Worth bookmarking if you’re designing ML pipelines on Kubernetes and want pattern-level thinking rather than just tool tutorials. Skip if you’re looking for a hands-on framework or if your ML workloads aren’t cluster-scale yet.