HazyResearch/minions
A protocol enabling small on-device LLMs to collaborate with frontier cloud LLMs for cost-efficient inference.

Velocity · 7d
+2.7
★ / day
Trend
→steady
star history
Minions provides a communication protocol where lightweight on-device models work alongside powerful cloud models. On-device models read long contexts locally and decide what information to send to the cloud frontier model, reducing cloud API costs while maintaining output quality. The repository includes demo applications, CLI tooling, WebGPU integration, Docker support, and a secure end-to-end encrypted chat variant.