mlc-ai/web-llm-chat
A private, server-free chat interface that runs open-source LLMs directly in the browser using WebGPU.

Velocity · 7d
+1.4
★ / day
Trend
→steady
star history
WebLLM Chat is a client-side chat application that leverages WebLLM and WebGPU to execute large language models entirely in the browser without server dependencies. It supports multiple open models including Llama, Gemma, Mistral, Phi, and Qwen, with vision capabilities for image-based conversations. All processing occurs locally, ensuring user privacy as no data leaves the browser.