Memary

Memary is a memory-augmented agent framework that stores and retrieves contextual information from a knowledge graph to personalize responses and maintain long-term memory across interactions. It automatically captures all agent interactions and stores them as structured memories without requiring explicit instrumentation, then injects top-ranked user entities and themes into the active context window to tailor agent responses dynamically.
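The injection step described above can be pictured with a minimal sketch. It assumes entities have already been scored elsewhere; the function name `build_context`, the `top_k` parameter, and the prompt wording are hypothetical and are not taken from Memary's code.

```python
from typing import Dict


def build_context(base_prompt: str, entity_scores: Dict[str, float], top_k: int = 3) -> str:
    """Append the top-ranked entities to a base prompt (illustrative only)."""
    # Sort by score descending; break ties alphabetically so output is stable.
    ranked = sorted(entity_scores.items(), key=lambda kv: (-kv[1], kv[0]))
    top = [name for name, _ in ranked[:top_k]]
    if not top:
        # Nothing known about the user yet: leave the prompt unchanged.
        return base_prompt
    return f"{base_prompt}\nUser's most relevant entities and themes: {', '.join(top)}"
```

Calling `build_context` with a handful of scored entities returns the base prompt followed by one extra line naming the highest-scoring ones, which is the text an agent would then see in its context window.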

The framework distinguishes itself through a multi-retriever memory search that combines ColBERT reranking with recursive graph queries across databases, giving agents more precise recall. It decomposes complex user queries into sub-questions to retrieve more targeted information from memory stores, and supports switching between locally downloaded LLMs via Ollama integration for on-device inference without external API dependencies. Memary also provides a conversational memory interface that lets users query and review specific agent memories, which helps with debugging and with understanding past reasoning.
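As a rough illustration of a recursive, multi-hop graph query, the sketch below expands a set of starting entities hop by hop and collects the triples it meets. The in-memory dictionary stands in for a graph database, and `collect_subgraph`, the relation labels, and the hop limit are all assumptions made for this example rather than Memary's API.

```python
from typing import Dict, List, Optional, Set, Tuple

# entity -> list of (relation, neighbor) edges; a stand-in for a graph database
Graph = Dict[str, List[Tuple[str, str]]]
Triple = Tuple[str, str, str]


def collect_subgraph(
    graph: Graph,
    frontier: List[str],
    max_hops: int,
    seen: Optional[Set[str]] = None,
) -> List[Triple]:
    """Recursively gather (subject, relation, object) triples within max_hops."""
    seen = set(frontier) if seen is None else seen
    if max_hops == 0 or not frontier:
        return []
    triples: List[Triple] = []
    next_frontier: List[str] = []
    for entity in frontier:
        for relation, neighbor in graph.get(entity, []):
            triples.append((entity, relation, neighbor))
            if neighbor not in seen:
                # Expand each entity at most once to avoid cycles.
                seen.add(neighbor)
                next_frontier.append(neighbor)
    # Recurse one hop further out from the newly discovered entities.
    return triples + collect_subgraph(graph, next_frontier, max_hops - 1, seen)
```

The returned triples form the contextual subgraph that could be passed to an LLM alongside the user's question; raising `max_hops` widens the context at the cost of more, and less relevant, facts.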

Beyond core memory management, the system includes a multi-agent memory orchestrator that manages separate memory stores and knowledge graphs for multiple agents, enabling personalized context per user. It tracks entity frequency and recency to infer a user's depth of knowledge, and can inject custom data into memory by combining multiple parsers for advanced ingestion. The framework also supports registering user-defined Python functions as tools that agents can call during task execution, and provides memory benchmarking capabilities to test and compare different memory strategies.
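One way to combine entity frequency and recency into a single familiarity score is sketched below. The scoring rule (mention count multiplied by an exponential decay on time since the last mention) and the names `EntityTracker` and `half_life_s` are assumptions for illustration; the source states only that both signals are tracked.

```python
from collections import defaultdict
from typing import Dict, List


class EntityTracker:
    """Tracks how often and how recently each entity was mentioned."""

    def __init__(self, half_life_s: float = 86400.0) -> None:
        self.half_life_s = half_life_s  # assumed decay half-life, in seconds
        self.count: Dict[str, int] = defaultdict(int)
        self.last_seen: Dict[str, float] = {}

    def record(self, entity: str, timestamp: float) -> None:
        self.count[entity] += 1
        previous = self.last_seen.get(entity, timestamp)
        self.last_seen[entity] = max(previous, timestamp)

    def score(self, entity: str, now: float) -> float:
        if entity not in self.last_seen:
            return 0.0
        age = max(0.0, now - self.last_seen[entity])
        # The score halves every half_life_s seconds since the last mention.
        return self.count[entity] * 0.5 ** (age / self.half_life_s)

    def top(self, now: float, k: int = 3) -> List[str]:
        # Highest score first; ties broken alphabetically.
        return sorted(self.last_seen, key=lambda e: (-self.score(e, now), e))[:k]
```

An entity mentioned often but long ago and one mentioned once but recently can end up with similar scores, so the half-life controls how quickly old interests fade from the ranking.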

Features

Agentic LLM Frameworks - Provides a memory-augmented agent framework that stores and retrieves contextual information from a knowledge graph.

Agent Memory Management - Automatically captures, stores, and retrieves agent interactions and user knowledge using knowledge graphs and multiple memory stores.

Multi-Agent Memory Stores - Manages separate memory stores and knowledge graphs for multiple agents to enable personalized context per user.

Agent Memory Layers - Provides a memory layer that works with locally downloaded LLMs via Ollama to capture, update, and retrieve agent memories.

Automatic Memory Extractors - Captures all agent interactions and automatically stores them as memories without explicit instrumentation.

Dynamic Injections - Injects top-ranked user entities and themes into the context window to tailor agent responses dynamically.

Memory Retrieval Systems - Retrieves stored memories and combines multiple retrievers for advanced memory lookup.

Multi-Retriever Search - Combines multiple retrievers to search across different memory databases for more precise agent recall.

Multi-Agent Systems - Configures separate agents for different users, each with its own memory and knowledge graph.

Personalized Agent Systems - Runs multiple personal agents with individual memory and knowledge graphs to tailor responses to each user's expertise.

Knowledge Graphs - Stores entities and their relationships in a graph database for structured retrieval across interactions.

Agent Memory Stores - Stores and retrieves contextual information using a graph database with entities and relationships for structured recall.

Natural Language Memory Queries - Searches a knowledge graph using recursive retrieval and multi-hop reasoning to find related entities.

Knowledge Graph Management - Stores and organizes contextual information in a knowledge graph structure for structured recall.

Knowledge Graph Retrieval - Queries knowledge graphs with recursive and multi-hop reasoning to find relevant entities.

Agent Interaction Memory Stores - Captures all agent interactions and automatically stores them as memories without explicit instrumentation.

Recursive Graph Queries - Queries a knowledge graph using recursive and multi-hop reasoning to build contextual subgraphs for responses.

Response Personalization - Injects the user's top-ranked entities and themes into the context window to tailor agent replies.

Shared Knowledge Graph Memory - Organizes agent interactions and user knowledge into a structured graph database for entity-based recall.

Memory Inspection Interfaces - Provides a chat-based interface to query and review specific agent memories for debugging.

Custom Tool Registrations - Extends agent capabilities by registering user-defined functions as tools that can be invoked during task execution.

Ollama Engine Integrations - Switches between downloaded models via Ollama integration, including llama3 and LLaVA, for offline agent operation.

Ollama Model Runners - Switches between downloaded local models via Ollama integration for flexible on-device inference.

Runtime Switchers - Switches between downloaded models using Ollama integration, including llama3 and LLaVA as defaults.

Context Summarizations - Compresses past conversations into a short summary to fit within the LLM's context window.

Dynamic Updates - Adjusts the active context window to include relevant memories as a conversation progresses.

Result Reranking - Uses ColBERT reranking to improve the relevance of retrieved memory items.

ColBERT Rerankers - Uses ColBERT to rerank retrieved memory items, improving the relevance of search results.

Contextual Response Tailoring - Uses categorized entities and themes from memory to adjust agent responses to match current interests.

Recency and Frequency Rankers - Tracks entity mention frequency and recency to identify the user's most familiar concepts.

Custom Data Injectors - Ingests proprietary data into agent memory using multiple parsers for advanced ingestion.

Response Injectors - Writes final agent outputs directly into an existing knowledge graph to enrich it with new information.

Fallback Graph Retrievers - Queries a knowledge graph and falls back to an external LLM search when no related nodes exist.

Recursive Query Decomposition - Decomposes complex user queries into sub-questions to retrieve more targeted information from memory.

Sub-Query Decomposers - Splits user queries into sub-questions to retrieve more targeted information from memory stores.

Multi-Database Searchers - Combines multiple memory databases to improve how an agent recalls information.

Entity Frequency and Recency Trackers - Tracks entity frequency and recency to infer a user's depth of knowledge for personalized responses.

Conversational Memory Accessors - Accesses specific agent memories on demand through a conversational interface.

Memory Trace Reviews - Provides a conversational interface to review stored agent memories for debugging and understanding past reasoning.

Agent Frameworks - Provides long-term memory for autonomous agents.

Agent Orchestration - Provides a memory layer built specifically for autonomous agent systems.

Memory Management - Provides an open-source memory layer for autonomous agent architectures.
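The fallback behaviour listed under Fallback Graph Retrievers can be sketched as follows: look up the query's entities in the graph first, and call an external search only when nothing related is found. The function `answer_with_fallback`, the dictionary-based graph, and the `external_search` callable are hypothetical stand-ins, not Memary's interfaces.

```python
from typing import Callable, Dict, List


def answer_with_fallback(
    graph: Dict[str, List[str]],
    entities: List[str],
    query: str,
    external_search: Callable[[str], str],
) -> str:
    """Answer from the knowledge graph, or fall back to an external search."""
    # Gather every stored fact attached to the entities found in the query.
    facts = [fact for entity in entities for fact in graph.get(entity, [])]
    if facts:
        return "From memory: " + "; ".join(facts)
    # No related nodes exist, so defer to the external source.
    return external_search(query)
```

Passing the external search in as a callable keeps the sketch independent of any particular search provider or LLM client.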

kingjulio8238/Memary
