← all repositories
Delta-ML/delta

Didi's TensorFlow toolkit for NLP and speech, minus the hype

A production-proven platform that bundles training, benchmarking, and deployment for text and voice models behind YAML configs and shell scripts.

delta
Velocity · 7d
+0.6
★ / day
Trend
steady
star history

What it does

DELTA is an end-to-end TensorFlow platform for natural language and speech processing. It covers training, evaluation, inference, and model export through a single CLI entry point (delta/main.py) driven by YAML configuration files. The project also includes a deployment stack (DELTA-NN) with Docker images, a graph compiler via TensorFlow Grappler, and benchmark scripts for reproducible results.

The interesting bit

The “what you see in training is what you get in serving” pitch: data preprocessing and feature extraction are baked into the model graph itself, so the serving path doesn’t drift from the training path. For speech tasks, it wraps Kaldi rather than pretending to replace it.

Key highlights

  • One-command training for NLP (text classification, NER, summarization, etc.) and speech (ASR, speaker verification, emotion recognition)
  • Benchmark tables comparing DELTA implementations against published baselines on standard datasets (CoNLL 2003, HKUST, VoxCeleb, etc.)
  • Modular component design with pre-built examples in egs/ directory
  • Deployment pipeline via dpl/ with Docker support and SavedModel export
  • Apache 2.0 license, backed by Didi and hosted under LF AI & Data Foundation

Caveats

  • Full speech support requires Kaldi installation; the pip package (delta-nlp) is NLP-only
  • TensorFlow 2.x dependency; the conda environment pins Python 3.6 and TF 2.3.0 in examples, which are aging
  • Benchmarks show mixed results: some tasks match or beat baselines, others lag slightly (e.g., BERT NER at 94.6 vs. 94.9 baseline F1)

Verdict

Worth a look if you need a unified, opinionated pipeline for both NLP and speech research with deployment baked in. Skip it if you’re already invested in PyTorch, Hugging Face, or newer speech frameworks like ESPnet — this is a 2019-era TensorFlow stack with the maintenance profile to match.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.