Mesh-LLM/mesh-llm
A decentralized mesh network for running and serving LLM inference across distributed GPU resources.

Velocity · 7d
+9.6
★ / day
Trend
→steady
star history
Mesh LLM creates a distributed computing network that pools GPUs and memory from multiple machines to serve LLM inference. It exposes an OpenAI-compatible REST API, automatically selecting whether to run models locally, route to peer nodes, or use stage-split execution for large models. The system supports joining public meshes for shared compute or setting up private distributed clusters.