← all repositories
mem0ai/mem0

A memory layer that actually remembers, without the agentic loop tax

Mem0 gives AI assistants long-term recall with a single LLM call—no update/delete churn, no multi-step reasoning overhead.

mem0
Velocity · 7d
+53
★ / day
Trend
steady
star history

What it does Mem0 is a memory layer for AI agents and assistants. You feed it conversations or facts; it extracts, embeds, and stores them. Later, when a user asks something, it retrieves the relevant context and injects it into the prompt. The April 2026 algorithm claims 91.6 on LoCoMo and 94.8 on LongMemEval—benchmarks for long-context memory—using around 7K tokens and ~1s latency per query.

The interesting bit The new algorithm is add-only. One LLM call extracts facts; nothing gets updated or deleted, memories just accumulate. This sounds wasteful, but they fuse semantic search, BM25 keyword matching, entity linking, and temporal reasoning to surface the right memory anyway. It is the opposite of the usual CRUD approach: accept the bloat, optimize the retrieval.

Key highlights

  • Single-pass extraction: no agentic loops, no UPDATE/DELETE operations
  • Multi-signal retrieval: semantic + BM25 + entity matching scored in parallel
  • Temporal reasoning: ranks memories by time relevance (current state vs. past events vs. future plans)
  • Three deployment modes: pip library, self-hosted Docker stack, or managed cloud
  • CLI and SDKs in Python and Node; agent sign-up without email or dashboard

Caveats

  • Self-hosted auth is now mandatory by default; upgrading from older builds requires explicit configuration
  • The BEAM 10M benchmark score (48.6) drops notably from the 1M score, suggesting scale still stresses the system
  • Default setup leans on OpenAI (gpt-5-mini, text-embedding-3-small); swapping LLMs or embedders is supported but requires extra config

Verdict Worth a look if you are building persistent AI assistants and tired of hand-rolling RAG pipelines. Skip it if your use case is stateless or your memory horizon is a single session.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.