6 Repos
Techniques for reducing the footprint of stored agent context.
Distinguishing note: Focuses on data optimization.
Explore 6 awesome GitHub repositories matching artificial intelligence & ml · Memory Compression. Refine with filters or upvote what's useful.
Caveman is a set of tools and configurations designed for large language model token optimization. It focuses on reducing the amount of data processed during AI interactions to lower costs and maximize the available context window. The project implements a fragmented communication style that replaces full grammatical sentences with concise technical keywords. This approach extends to AI context optimization by condensing memory files and tool descriptions, and includes a specialized configuration for generating terse, one-line code reviews and short conventional commit messages. The system i
Condenses project documentation and memory files into a shorter format to reduce input tokens.
Mem0 is an agent-agnostic memory layer designed to provide intelligent agents with long-term persistence and cross-session state management. By acting as a centralized service, it allows diverse AI agents to recall user preferences, past interactions, and historical context, ensuring continuity across multiple workflows and independent agent systems. The platform distinguishes itself through a multi-signal retrieval engine that combines semantic vectors, keyword matching, and entity-linked metadata to surface the most relevant information. It employs an adaptive memory engine that automatical
Reduces stored data size to optimize retrieval speed while preserving context accuracy.
AgentMemory is a persistent knowledge store and memory server designed to provide AI coding agents with long-term memory. It functions as a knowledge graph engine and vector database store that saves and recalls project context, architectural decisions, and patterns across different sessions. The system distinguishes itself by using a tiered-memory consolidation pipeline that compresses raw observations into episodic, semantic, and procedural layers to optimize token usage. It employs a hybrid retrieval strategy combining keyword matching, vector embeddings, and graph traversal to surface rel
Summarizes incoming observations into condensed memories using language models to minimize token usage.
CS-Base is a comprehensive educational platform and technical repository designed to support software engineers in mastering backend architecture, artificial intelligence engineering, and career development. It functions as a centralized knowledge hub that combines illustrated theoretical tutorials with practical, project-based learning to bridge the gap between foundational computer science concepts and professional industry requirements. The project distinguishes itself by integrating a robust career mentorship framework with advanced AI engineering resources. It provides users with tools f
Optimizes token consumption and maintains conversational coherence through tiered memory storage and summarization.
Context-Engineering is a prompt engineering framework and cognitive architecture for large language models. It provides a set of patterns and methodologies for designing structured prompts and modular reasoning flows that decompose complex tasks into specialized, step-by-step problem solving templates. The project distinguishes itself through stateful prompt management and context window optimization. It maintains persistent memory across multiple interaction turns by compressing conversation history into compact internal state cells and employs techniques to maximize information density per
Persists conversation history by condensing interaction data into compact internal state structures.
OpenMemory is an embeddable memory engine for LLM agents that stores, retrieves, and manages conversational context and agent state using semantic indexing and temporal facts. It functions as a semantic memory store backed by vector indexing, where memories are organized by meaning rather than by exact key, and includes a tiered decay engine that gradually reduces the salience of unused memories while compressing cold vectors and fingerprinting dormant entries to conserve storage. The system also maintains a temporal fact database that records factual statements with subject-predicate-object s
Automatically condenses verbose memory content while preserving its core semantic meaning.