← all repositories
Conchylicultor/DeepQA

A chatbot that will call you Debbie and mean it

TensorFlow seq2seq implementation of Google's neural conversational model, warts and all.

2.9k stars Python Chat AssistantsLanguage Models
DeepQA
Velocity · 7d
+0.8
★ / day
Trend
steady
star history

What it does

DeepQA trains an encoder-decoder LSTM on movie subtitles, Supreme Court transcripts, Ubuntu support chats, or your own text, then lets you talk to it. It ships with a Django web interface, TensorBoard integration, and enough command-line flags to keep a small airport busy.

The interesting bit

The author posts the actual, unfiltered output — including the bot telling users to “Fuck you,” confusing genders, and declaring “The man are obsolete.” It’s a refreshing antidote to polished demo videos, and a useful reminder of what 2016-era seq2seq actually looked like in the wild.

Key highlights

  • Supports multiple corpora out of the box: Cornell Movie Dialogs (included), OpenSubtitles, Supreme Court data, Ubuntu Dialogue Corpus, or custom lightweight format
  • Optional pre-trained word embeddings to speed up training
  • Web interface built on Django + Channels + Redis, with Docker setup available
  • Can get “some results” in 1–2 hours on a GeForce GT 740M if you cripple the sequence length to 5 words
  • Includes explicit suggestions for improvements: add SoftMax sampling, swap in attention mechanism, monitor test curves

Caveats

  • Pretrained model links are dead; the README strikes them through with a blunt “Note: Links do not work anymore”
  • Locked to TensorFlow 1.0–1.2 and Python 3.5, which are now archaeological artifacts
  • Deterministic predictions mean the bot always gives the same answer to the same question — no sampling mechanism implemented
  • The author openly notes overfitting, gender confusion, and a “thought vector” too small to track basic facts across sentences

Verdict Worth cloning if you’re studying the history of neural dialogue systems or need a concrete baseline to beat. Skip it if you want a chatbot that actually works; modern retrieval-augmented or instruction-tuned models have left this approach in the dust.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.