Didi's TensorFlow toolkit for NLP and speech, minus the hype
A production-proven platform that bundles training, benchmarking, and deployment for text and voice models behind YAML configs and shell scripts.

What it does
DELTA is an end-to-end TensorFlow platform for natural language and speech processing. It covers training, evaluation, inference, and model export through a single CLI entry point (delta/main.py) driven by YAML configuration files. The project also includes a deployment stack (DELTA-NN) with Docker images, a graph compiler via TensorFlow Grappler, and benchmark scripts for reproducible results.
The interesting bit
The “what you see in training is what you get in serving” pitch: data preprocessing and feature extraction are baked into the model graph itself, so the serving path doesn’t drift from the training path. For speech tasks, it wraps Kaldi rather than pretending to replace it.
Key highlights
- One-command training for NLP (text classification, NER, summarization, etc.) and speech (ASR, speaker verification, emotion recognition)
- Benchmark tables comparing DELTA implementations against published baselines on standard datasets (CoNLL 2003, HKUST, VoxCeleb, etc.)
- Modular component design with pre-built examples in
egs/directory - Deployment pipeline via
dpl/with Docker support and SavedModel export - Apache 2.0 license, backed by Didi and hosted under LF AI & Data Foundation
Caveats
- Full speech support requires Kaldi installation; the pip package (
delta-nlp) is NLP-only - TensorFlow 2.x dependency; the conda environment pins Python 3.6 and TF 2.3.0 in examples, which are aging
- Benchmarks show mixed results: some tasks match or beat baselines, others lag slightly (e.g., BERT NER at 94.6 vs. 94.9 baseline F1)
Verdict
Worth a look if you need a unified, opinionated pipeline for both NLP and speech research with deployment baked in. Skip it if you’re already invested in PyTorch, Hugging Face, or newer speech frameworks like ESPnet — this is a 2019-era TensorFlow stack with the maintenance profile to match.