Is mlx-omni-server open source?

Yes — madroidmaq/mlx-omni-server is open source, released under the MIT license.

What language is mlx-omni-server written in?

madroidmaq/mlx-omni-server is primarily written in Python.

How popular is mlx-omni-server?

madroidmaq/mlx-omni-server has 734 stars on GitHub.

Where can I find mlx-omni-server?

madroidmaq/mlx-omni-server is on GitHub at https://github.com/madroidmaq/mlx-omni-server.

← all repositories

madroidmaq/mlx-omni-server

Your Mac, Dressed Up as OpenAI and Anthropic

It reroutes OpenAI and Anthropic SDK calls to an Apple Silicon Mac for local MLX inference.

★734 stars Python Inference · Serving Language Models

View on GitHub ↗ Homepage ↗

Not currently ranked — collecting fresh signals.

star history

What it does

MLX Omni Server is a FastAPI-based inference backend built on Apple’s MLX framework. It exposes local models through endpoints that mirror the OpenAI and Anthropic API shapes, so existing client libraries can talk to your M-series Mac instead of the cloud. The server handles chat completions, speech-to-text, text-to-speech, image generation, and embeddings.

The interesting bit

Most local inference projects pick one API dialect; this one translates both. That means you can keep using openai or anthropic Python SDKs and simply swap the base_url to localhost. The server auto-discovers MLX models in your local Hugging Face cache and handles on-demand loading.

Key highlights

Dual API compatibility: supports OpenAI /v1/* and Anthropic /anthropic/v1/* endpoints including streaming, function calling, and structured output.
Runs entirely offline on Apple Silicon (M1–M4) with hardware-accelerated MLX inference.
Covers a full stack: chat, embeddings, audio transcription/synthesis, and image generation.
Auto-discovers and caches models from the Hugging Face mlx-community hub.

Caveats

Requires an Apple Silicon Mac and Python 3.11+; there is no fallback for Intel Macs or other platforms.
The README notes it is not affiliated with OpenAI, Anthropic, or Apple.

Verdict

Worth a look if you already build against OpenAI or Anthropic APIs and want a private, local staging environment on a modern Mac. Skip it if you are not on Apple Silicon or need production-grade multi-user serving.

Frequently asked

What is madroidmaq/mlx-omni-server?: It reroutes OpenAI and Anthropic SDK calls to an Apple Silicon Mac for local MLX inference.
Is mlx-omni-server open source?: Yes — madroidmaq/mlx-omni-server is open source, released under the MIT license.
What language is mlx-omni-server written in?: madroidmaq/mlx-omni-server is primarily written in Python.
How popular is mlx-omni-server?: madroidmaq/mlx-omni-server has 734 stars on GitHub.
Where can I find mlx-omni-server?: madroidmaq/mlx-omni-server is on GitHub at https://github.com/madroidmaq/mlx-omni-server.