← all repositories

Victorwz/LongMem

A research project implementing a NeurIPS 2023 paper that augments language models with long-term memory using a side network and memory bank retrieval.

826 stars Python Language ModelsRAG · Search
LongMem
Velocity · 7d
+0.8
★ / day
Trend
steady
star history

This project implements a language model architecture capable of utilizing long-term memory. It augments a frozen pre-trained LLM backbone with a trainable side network and joint attention mechanism to retrieve and integrate relevant past context from a memory bank. The system uses Faiss-GPU for efficient vector similarity search over encoded memory chunks, enabling the model to attend to information from extended context windows.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.