databricks/spark-deep-learning
HorovodRunner enables distributed deep learning training jobs on Apache Spark clusters.

Velocity (7d): +0.6 ★/day · Trend: steady
This repository provides deep learning pipelines for Apache Spark. Its main component is HorovodRunner, which runs distributed deep learning training as Spark jobs and uses Horovod to scale that training across multiple GPUs and nodes within the Spark ecosystem. The open-source version is intended for local development; full distributed training requires Databricks Runtime ML.
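As a rough illustration of how a training function is handed to HorovodRunner, here is a minimal sketch. It assumes the `sparkdl` package exposes `HorovodRunner(np=...)` with a `run(main, **kwargs)` method, and that Horovod with PyTorch support is installed wherever `train` executes. The names `train`, `launch`, and `runner_cls` are hypothetical helpers introduced for this example and are not part of the repository.

```python
def train(learning_rate=0.1):
    """Hypothetical training entry point, executed once per Horovod process."""
    # Imported inside the function so the import is resolved in the process
    # that actually runs the training code.
    import horovod.torch as hvd

    hvd.init()
    # A real job would build a model here, wrap its optimizer with
    # hvd.DistributedOptimizer, and run the training loop.
    return {"rank": hvd.rank(), "size": hvd.size(), "learning_rate": learning_rate}


def launch(main, np=-1, runner_cls=None, **kwargs):
    """Run `main` through a HorovodRunner-style object.

    `runner_cls` is a hypothetical injection point so the wiring can be
    exercised without sparkdl installed; by default the real class is used.
    """
    if runner_cls is None:
        from sparkdl import HorovodRunner  # assumed import path
        runner_cls = HorovodRunner
    runner = runner_cls(np=np)
    # Keyword arguments are forwarded to `main`.
    return runner.run(main, **kwargs)


# Example call (requires sparkdl and Horovod to be installed):
# result = launch(train, np=-1, learning_rate=0.05)
```

The training function is kept self-contained because it may run in a different process from the one that defines it. With the open-source package this is expected to run locally only; on Databricks Runtime ML the `np` argument is what controls how many processes the job uses.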