OpenMemory is an embeddable memory engine for LLM agents that stores, retrieves, and manages conversational context and agent state using semantic indexing and temporal facts. It functions as a semantic memory store backed by vector indexing, where memories are organized by meaning rather than by exact key, and includes a tiered decay engine that gradually reduces the salience of unused memories while compressing cold vectors and fingerprinting dormant entries to conserve storage. The system also maintains a temporal fact database that records factual statements with subject-predicate-object structure and explicit validity windows for time-aware knowledge queries.
The engine differentiates itself through several integrated memory management capabilities. It applies emotional salience boosting to prioritize emotionally charged content, generates reflection-based summaries by periodically clustering related memories and synthesizing condensed summaries, and supports fingerprint compression for cold storage that can restore dormant memories to full representation on access. A stateful agent loop orchestrates a cycle of observation, recall, planning, tool execution, and storage to maintain persistent conversational context, while standalone embedding mode allows the memory engine to run as an embedded library within a host application without requiring a separate backend server.
The system provides full CRUD operations for memory records, semantic search with filtering by user, sector, or minimum salience, and temporal analysis including entity timelines and state transition tracking. It supports document and webpage ingestion to extract and store content as memories, user summary generation and retrieval, and memory compression that condenses verbose content while preserving core meaning. Deployment options include Railway, Render, and Vercel standalone modes, with monitoring capabilities for decay processing and engine state inspection.