← all repositories
xming521/WeClone

Clone your chat persona with your own Telegram history

Fine-tune a local LLM on your actual message history, then wire it to a chatbot so your friends can text... well, 'you'.

WeClone
Velocity · 7d
+21
★ / day
Trend
steady
star history

What it does WeClone is an end-to-end pipeline that ingests your Telegram chat exports, scrubs them for PII with Microsoft Presidio (plus a manual blocklist), fine-tunes a local LLM via LoRA, and deploys the result as a chatbot through Telegram, Discord, Slack, or WeChat. It defaults to Qwen2.5-VL-7B-Instruct and wraps LLaMA Factory for the actual training.

The interesting bit The project treats your chat history as training data for style, not just facts. It supports image-modal fine-tuning too, so your digital twin can theoretically reference the visual context of your conversations. The README is refreshingly honest that 7B models are “average” and you need 14B+ for decent results.

Key highlights

  • Single CLI entrypoint (weclone-cli) for dataset prep, training, inference, and serving
  • LoRA/QLoRA support means you can train on a 6GB GPU with 4-bit quantization, or scale to multi-GPU with DeepSpeed
  • Privacy scrubbing is built-in but explicitly not guaranteed—manual blocklist required for anything sensitive
  • Deploys via AstrBot or LangBot; exposes an OpenAI-compatible API for integration
  • Image data support added June 2025; Telegram export support added July 2025

Caveats

  • Windows is untested; WSL recommended
  • WhatsApp, Discord, and Slack data sources are all marked 🚧 (not yet implemented)
  • Tool calling breaks after fine-tuning and must be manually disabled in your bot framework
  • The authors are actively soliciting English-speaking test cases, suggesting the project has been primarily validated on Chinese-language data so far

Verdict Worth a weekend project if you have years of Telegram history and a GPU with 16GB+ VRAM. Skip it if you expect plug-and-play quality from a 7B model, or if your chat history is mostly WhatsApp.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.