A voice assistant that learns, searches Wikipedia, and chats like a movie character
Dragonfire is an open-source Linux virtual assistant that tries to understand you through DeepSpeech, falls back to Google Speech, and will literally learn facts about you if you tell it.

What it does Dragonfire listens for wake words (“DRAGONFIRE”, “HEY”, “WAKE UP”) and processes commands through a four-step pipeline: built-in actions first, then learning from your statements, then Wikipedia search via its open-domain QA engine, and finally a seq2seq neural network trained on movie dialog for conversational fallback. It can open apps, search the web, control basic UI, and remember facts you teach it.
The interesting bit The learning module is the unusual part: tell it “THE SUN IS HOT” and “THE SUN IS YELLOW,” then ask “DESCRIBE THE SUN,” and it synthesizes “THE SUN IS HOT AND YELLOW.” This is simple fact accumulation via NLP, not a vector database or RAG pipeline — more like a very patient, very literal secretary.
Key highlights
- Uses Mozilla DeepSpeech (1.31 GB model) for on-device speech recognition, with optional Google Speech fallback
- Text-to-speech via Festival, with a 3D animated avatar (can be disabled with
--headless) - Ships as
.debpackage, Docker image, ormake installfor Ubuntu/KDE neon/elementary OS - Server mode exposes a RESTful spaCy API and Twitter-integrated chatbot
- Built-in commands cover app launching, web search, media control, and even “SHUT DOWN THE COMPUTER”
Caveats
- README shows last release as 1.1.1 but commit activity and CI status suggest maintenance has slowed; unclear if DeepSpeech 0.4.1 model is current
- Requires “preferably a CUDA supported GPU” and 2GB RAM, but no specific performance numbers are given
- English only; the learning examples show rigid pattern matching that may not generalize well
Verdict Worth a look if you want a hackable, fully local voice assistant on Ubuntu and don’t mind updating dependencies yourself. Skip it if you need modern LLM reasoning or active project maintenance — this is pre-transformer architecture with a 2018-era feel.