← all repositories
alichherawalla/off-grid-mobile-ai

Your phone as a bunker for AI models

A React Native app that runs LLMs, image generation, vision, and voice transcription entirely offline.

off-grid-mobile-ai
Velocity · 7d
+18
★ / day
Trend
steady
star history

What it does Off Grid is a mobile AI suite that downloads and runs models locally on Android, iOS, and macOS. It handles text generation, Stable Diffusion image creation, vision analysis, Whisper speech-to-text, document RAG, and tool calling — all without network access. You can also fall back to local-network OpenAI-compatible servers (Ollama, LM Studio) if you want bigger models without the cloud.

The interesting bit The project bundles a surprising amount of native plumbing: llama.cpp and whisper.cpp via React Native bindings, Core ML diffusion on iOS, NPU acceleration on Snapdragon, on-device MiniLM embeddings for knowledge base search, and a full tool-calling loop with runaway prevention. The “Pro” tier is being crowdfunded with a refreshingly small ask — $10 from 100 people to fund 12 weeks of full-time work.

Key highlights

  • Runs GGUF models (Qwen 3, Llama 3.2, Gemma 3, Phi-4, custom) at 15-30 tok/s on flagship devices
  • Stable Diffusion with real-time preview: 5-10s on NPU, ~15s on CPU
  • Vision inference (~7s) with SmolVLM, Qwen3-VL, Gemma 3n
  • On-device RAG: chunking, MiniLM embeddings, SQLite storage, cosine similarity retrieval
  • Tool calling with built-in search, calculator, datetime, device info, knowledge base
  • Cross-platform test coverage: Jest/RNTL, JUnit, XCTest, plus Maestro E2E on BrowserStack

Caveats

  • Performance drops significantly on mid-range hardware (5-15 tok/s, ~30s image gen)
  • macOS support is via Mac Catalyst/iPad compatibility, not a native Mac app
  • The “Pro” features (custom personas, Kokoro TTS, MCP servers, larger models) are pre-orders shipping in 12 weeks, with refund promise but no working code yet

Verdict Worth a look if you want a reference implementation for running multiple AI modalities on-device, or if you actually need offline AI on your phone. Skip if you’re expecting ChatGPT-level quality from a 3B parameter model — the tradeoffs are real and documented.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.