Open-source frameworks and libraries for building AI agents that maintain long-term memory across user sessions.
CopilotKit is an agentic framework designed to integrate large language models into application frontends, enabling natural language control over software features and data. It provides the infrastructure to build intelligent assistants that manage conversation history, track application state, and execute complex workflows through conversational prompts. The framework distinguishes itself by its ability to render dynamic, interactive user interface components in real time based on model outputs. By utilizing a standardized communication protocol, it maps natural language intents to executable tool functions and synchronizes application state between the frontend and the agentic backend. This allows users to manipulate data and perform tasks directly within the chat interface. The system includes a declarative configuration layer for defining agent capabilities and a persistent orchestration layer that manages bidirectional message streams. These components ensure that language models maintain the necessary context for accurate task execution across long sessions. The toolkit is distributed as a set of components for developers to integrate into their existing application environments.
This framework provides the necessary infrastructure for building AI agents with persistent conversation history, state synchronization, and tool-calling capabilities, making it a strong fit for developers looking to integrate stateful agents into their applications.
The agent-framework is an LLM agent orchestration framework and multi-agent workflow engine designed for building autonomous AI agents. It provides a tool integration layer for binding external functions, APIs, and sandboxed code as executable tools for language models. The framework distinguishes itself through a graph-based system for designing sequential and parallel task flows, featuring state management and checkpointing for long-running processes. It implements comprehensive conversational state management and an observability suite that uses telemetry to trace execution flows and monitor tool calls. The project covers a broad range of capabilities, including retrieval augmented generation via vector database integration, human-in-the-loop approval gating for tool use, and a middleware-based request pipeline for security and telemetry. It also supports structured output enforcement, session-based context restoration, and standardized protocols for remote agent connectivity.
This framework provides a robust orchestration engine for building autonomous agents with built-in support for persistent session state, long-term memory via vector integration, and complex multi-turn workflows.
This project is a comprehensive framework for building AI-powered applications, providing a unified toolkit for orchestrating language models, autonomous agents, and interactive user interfaces. It serves as a central library for managing the entire lifecycle of AI interactions, from initial prompt generation and model provider abstraction to complex, multi-step reasoning and tool execution. The framework distinguishes itself through its deep integration with frontend development, specifically by enabling generative user interfaces that render dynamic components directly from model outputs. It features a robust agentic execution engine that manages recursive reasoning loops, allowing developers to define custom stopping conditions, delegate tasks to subagents, and enforce structured workflows. By providing a standardized interface for streaming data and state management, it ensures that backend model responses and frontend UI components remain synchronized in real time. Beyond its core orchestration capabilities, the project covers a broad surface of AI integration features, including schema-driven data extraction, multi-modal input processing, and middleware-based request interception. It supports a wide range of operational needs such as persistent conversation history, retrieval-augmented generation, and comprehensive observability tools for monitoring token usage and execution flows. The library is designed for TypeScript environments and provides a collection of hooks and utilities that simplify the implementation of chat interfaces and agentic workflows.
This framework provides a comprehensive toolkit for building AI agents with built-in support for persistent conversation history, retrieval-augmented generation, and complex agentic workflows, making it a robust choice for stateful AI applications.
This project is a comprehensive suite of AI tools and frameworks, featuring an LLM multi-agent orchestrator, an autonomous agent runtime, and a stateful application framework. It provides the infrastructure to build and manage specialized AI agents capable of coordinating complex tasks through graph-based workflows and shared state. The system is distinguished by its implementation of the Model Context Protocol, allowing for standardized resource discovery and communication between AI clients and servers. It further includes an AI-powered documentation generator designed to analyze source code repositories and transform them into instructional tutorials. The codebase covers a broad range of capabilities, including web browser automation, sandboxed code execution, and asynchronous task processing. It provides tools for state management through conversation history tracking and progress checkpointing, as well as high-performance data storage using key-value and multi-dimensional array systems. The framework integrates API development utilities, including JSON-RPC communication, automated OpenAPI documentation, and a pub-sub message exchange for background job management.
This framework provides a comprehensive infrastructure for building autonomous agents with graph-based workflows, built-in stateful session persistence, and long-term memory capabilities, making it a direct match for your requirements.
Flowise is a low-code platform designed for building and deploying complex language model workflows through a visual, node-based interface. It functions as an orchestrator for autonomous multi-agent systems, allowing users to construct conversational pipelines by connecting language models, memory stores, and external tools on a drag-and-drop canvas. The platform distinguishes itself through its support for sophisticated agentic patterns, including supervisor-worker delegation and iterative reasoning strategies. Users can design directed acyclic graphs to manage conditional branching, state persistence, and complex task distribution. It also provides a robust framework for retrieval-augmented generation, enabling the creation of self-correcting systems that can index document data and validate information autonomously. Beyond its visual design capabilities, the project serves as a comprehensive backend for AI applications. It includes a secure credential management layer for third-party API keys, role-based access controls, and a RESTful API that allows for programmatic management of chat sessions, workflows, and assistant configurations. The application is designed for flexible deployment, supporting containerized environments for consistent operation across local and cloud infrastructure. Detailed documentation and tutorials are available to guide users through the lifecycle of building, testing, and scaling production-ready AI agents.
Flowise is a low-code platform that provides a visual interface for building agentic workflows, featuring built-in support for persistent memory, tool calling, and multi-turn context, all while being fully self-hostable.
Agent Squad is a multi-agent system orchestrator and language model agent orchestration framework. It serves as an AI workflow automation engine and tool integration layer designed to coordinate teams of specialized agents to solve complex tasks through routing, parallel execution, and state management. The project is distinguished by its ability to dynamically compose purpose-specific agents on-demand and route requests based on intent, language, or domain expertise. It supports advanced coordination patterns, including parallel subtask distribution, sequential task pipelines, and the ability to escalate complex interactions to human operators. The framework covers a broad range of capabilities, including retrieval-augmented generation, conversation state preservation across storage backends, and the integration of external APIs and serverless functions. It also provides utilities for automatic language detection, text translation, content moderation filtering, and incremental response streaming. The orchestration logic is designed for cross-environment deployment, supporting local machines, cloud platforms, and event-driven serverless environments.
This framework provides a comprehensive orchestration layer for multi-agent systems, featuring built-in support for persistent conversation state, long-term memory backends, and complex agentic workflows that are fully self-hostable.
Leon is a framework for building personal AI assistants that integrates large language models with local tool execution and persistent memory. It functions as an agentic workflow orchestrator and modular skill engine, enabling the creation of autonomous assistants capable of planning and executing multi-step tasks. The system features a retrieval-augmented generation memory architecture that indexes conversation history and user facts for context-aware grounding. It utilizes a modular skill system to interact with external binaries and APIs, supported by a loop that handles tool calling, schema validation, and failure recovery. The project covers several broad capability areas, including voice interaction through speech-to-text and text-to-speech synthesis, natural language understanding for intent parsing, and a dynamic persona engine that adapts communication tone. It also includes administrative interfaces for assistant information management and security layers for HTTP API and client socket access. The application is provided as a dockerized AI server to ensure consistent deployment and hosting.
Leon is a self-hostable AI agent framework that provides persistent memory through RAG-based indexing and supports complex agentic workflows, tool execution, and multi-turn context management.
Swarm is a framework for building conversational systems that coordinate multi-agent workflows. It functions as an orchestration engine that manages persistent, multi-turn dialogues by routing tasks between specialized agents and executing local functions. The system is designed to handle complex, multi-step processes by maintaining shared state and context across agent interactions. The framework distinguishes itself through its approach to dynamic task delegation and execution control. It enables agents to hand off tasks to one another by returning agent objects, allowing for modular, domain-specific handling of user requests. The runtime manages these transitions through a synchronous execution loop that resolves structured function calls and maintains persistent variables, ensuring that session context remains consistent as control shifts between agents. Beyond core orchestration, the system provides capabilities for integrating external tools and data sources to inform agent responses. It supports real-time visibility into multi-agent workflows through incremental stream processing, which emits updates and control signals as tasks are executed. The framework also includes tools for monitoring and validating agent decision-making performance through automated testing of conversation inputs.
Swarm is an orchestration framework designed for multi-agent coordination and stateful conversation management, providing the necessary tools for agentic workflows and function calling in a self-hostable Python environment.
Agentscope is a comprehensive toolkit for developing and orchestrating autonomous multi-agent systems. It provides a unified framework for building agents that can reason, execute tools, and manage memory, enabling the creation of complex, collaborative workflows where multiple specialized agents interact to solve multi-step objectives. The platform distinguishes itself through a robust orchestration engine that supports both sequential and concurrent agent pipelines. It utilizes a centralized event bus for real-time telemetry, allowing developers to track agent reasoning, tool usage, and system performance. By employing a provider-agnostic interface, the framework abstracts diverse language model APIs, while its middleware-based execution hooks allow for the injection of custom logic to intercept, validate, or transform agent behavior at runtime. Beyond core orchestration, the project includes extensive capabilities for tool integration, including dynamic schema parsing from function docstrings and support for secure, sandboxed code execution. It also features built-in support for retrieval-augmented generation, long-term memory management, and systematic performance evaluation, providing a complete environment for the lifecycle management of agentic applications. The library is designed for extensibility, offering base classes for custom memory backends, prompt formats, and tool providers. It is distributed as a Python package, with documentation and interactive development tools available to assist in prototyping and managing multi-agent projects.
Agentscope is a comprehensive framework for building multi-agent systems that natively supports long-term memory, tool calling, and complex agentic workflows, making it a complete solution for your requirements.
Langroid is a multi-agent orchestration framework and tool integration suite designed for building complex AI applications. It serves as a multi-modal integration layer that connects diverse local and remote language models with an agentic retrieval-augmented generation system. The project distinguishes itself through a collaborative message-exchange paradigm, allowing specialized agents to delegate tasks hierarchically and coordinate via structured communication. It features an advanced state management system for conversational AI, including the ability to rewind and prune conversation history to correct errors and optimize token usage. The framework provides a broad set of capabilities for grounding model responses in factual data using vector databases, graph databases, and tabular datasets. It includes a schema-driven tool execution system that binds models to Python functions and external protocol servers, as well as a comprehensive observability suite for tracing message lineage and monitoring reasoning paths. The library provides installation guidance via import errors when optional dependencies are missing.
Langroid is a multi-agent orchestration framework that natively supports persistent state management, tool calling, and RAG-based long-term memory, making it a comprehensive solution for building complex, self-hostable agentic workflows.
This project is a cross-platform chatbot framework designed to integrate generative artificial intelligence models into messaging services. It provides a unified architecture for building and deploying automated bots that maintain consistent conversation state, user identity, and interaction logic across multiple messaging platforms from a single codebase. The framework distinguishes itself through a modular adapter system that normalizes platform-specific webhooks and events into a standardized internal schema. It includes a comprehensive toolkit for constructing rich, interactive user interfaces—such as modal forms and dynamic cards—that adapt to native platform formats. Furthermore, it supports complex automation workflows by implementing human-in-the-loop oversight, allowing for manual approval of AI-driven actions before they are executed within a workspace. Beyond its core integration capabilities, the project manages the full lifecycle of bot operations, including distributed concurrency control, persistent state management for conversation history, and real-time content streaming. It also provides diagnostic tools for validating bot logic and monitoring message processing to ensure reliable performance in distributed environments.
This framework provides the necessary persistent state management and conversation history tracking required for building AI agents, though it is primarily structured as a messaging integration platform rather than a general-purpose agentic workflow engine.
LangChain is an orchestration framework designed for building, managing, and deploying applications powered by large language models. It provides a unified integration layer that normalizes disparate model provider APIs into a consistent set of primitives, enabling developers to build complex, multi-step AI workflows that manage state, memory, and tool execution. The project distinguishes itself through a durable execution runtime that maintains persistent state across long-running processes by checkpointing progress to external storage. It models agent workflows as directed graphs, allowing for explicit node-to-node routing and state management. Furthermore, it includes a human-in-the-loop control layer that enables developers to pause execution at defined breakpoints, allowing for manual inspection, modification, and approval of agent actions during runtime. Beyond its core orchestration capabilities, the framework supports a tiered memory architecture that separates short-term conversation context from long-term persistent data. It also provides comprehensive observability tools for tracing and monitoring execution flows, alongside security features for managing authentication and fine-grained access control. The platform is supported by extensive documentation and standardized interfaces for models, embeddings, and data sources to facilitate the development of production-grade agentic systems.
LangChain provides a comprehensive orchestration framework for building AI agents, featuring robust support for persistent memory, multi-turn state management, complex agentic workflows, and tool calling in a self-hostable environment.
This framework provides a development environment for building collaborative systems where autonomous agents interact to solve complex tasks through conversational workflows. It functions as a conversational workflow engine and event-driven runtime, coordinating multi-step processes by translating high-level goals into structured dialogue sequences between specialized agents. The system distinguishes itself through its message-passing orchestration, which manages state transitions and task delegation between independent participants. It supports dynamic conversation state management to provide persistent memory during multi-turn interactions, and it incorporates human-in-the-loop capabilities that allow for review or modification of agent outputs at specific message boundaries. Beyond core orchestration, the framework enables the integration of pluggable tools, allowing agents to invoke external functions and APIs through natural language requests. This architecture supports the construction of scalable, event-driven systems that automate sequences of tasks across digital tools and connect large language models to external data sources for autonomous reasoning.
This framework is a comprehensive platform for building multi-agent systems that natively supports persistent conversation state, tool calling, and complex agentic workflows, making it a flagship solution for your requirements.
Context-Engineering is a prompt engineering framework and cognitive architecture for large language models. It provides a set of patterns and methodologies for designing structured prompts and modular reasoning flows that decompose complex tasks into specialized, step-by-step problem solving templates. The project distinguishes itself through stateful prompt management and context window optimization. It maintains persistent memory across multiple interaction turns by compressing conversation history into compact internal state cells and employs techniques to maximize information density per token to reduce inference latency and cost. The framework covers several capability areas, including agentic workflow orchestration, retrieval augmented generation patterns for factual grounding, and the use of symbolic formats and protocol shells to standardize model output. It further incorporates multi-agent reasoning flows and the pruning of contextual noise to optimize the delivery of information within the context window.
This framework provides a cognitive architecture for managing stateful prompts and persistent memory across interaction turns, making it a specialized tool for building agentic workflows with long-term context.
Parlant is an agentic workflow engine and orchestration framework designed for building conversational AI that adheres to strict behavioral guidelines. It provides a platform for managing multi-turn interactions through state-machine-based logic, allowing developers to define complex, hierarchical conversational flows that can adapt, skip, or revisit steps based on real-time user input. The framework distinguishes itself through its focus on behavioral governance and observability. It enables developers to define precise domain terminology and enforce instruction compliance through prioritized guidelines, ensuring that agents remain consistent and brand-aligned. To maintain transparency, the system includes built-in reasoning audits and decision tracing, which log internal decision paths and guideline matches to help developers troubleshoot agent behavior and refine instructions. Beyond core orchestration, the platform supports a wide range of operational capabilities, including tool execution middleware, dynamic data injection, and event-driven hooks for external integrations. It manages the full interaction lifecycle, from intent disambiguation and session context maintenance to frontend metadata attachment and response streaming. These features allow for the creation of context-aware interfaces that remain grounded in current information while providing a responsive user experience.
Parlant is an agentic orchestration framework that provides robust session state management and multi-turn context handling, making it a strong fit for building conversational agents with persistent interaction flows.
Eino is an AI agent development kit and LLM application framework designed for building autonomous agents and orchestrating complex language model workflows. It serves as a multi-agent orchestration engine and workflow orchestrator, providing a graph-based execution model to route data between models, tools, and retrievers. The framework distinguishes itself through a robust set of multi-agent coordination patterns, including supervisor-led management, sequential flows, and autonomous reasoning loops like ReAct. It features advanced agent execution controls such as active turn preemption, checkpoint-based state persistence for pausing and resuming workflows, and human-in-the-loop interrupt mechanisms for manual approvals. The project covers a wide range of capability areas, including RAG pipeline implementation with semantic tool retrieval and document processing. It provides standardized component abstractions for model integration, a middleware-based interception system for observability and tracing, and tool integration for filesystem and shell command execution. Agent runtimes can be exposed as external services using HTTP and Server-Sent Events for real-time streaming communication.
Eino is a comprehensive AI agent framework that provides graph-based orchestration, tool calling, and robust state management through checkpoint-based persistence, making it a direct fit for building stateful, self-hostable agents.
Deepagents is an LLM agent orchestration platform and stateful application server designed for deploying and managing AI agents built with computational graphs. It provides a containerized runtime environment that handles agent execution, state persistence, and the versioning of AI assistants. The platform distinguishes itself through deep integration with the Model Context Protocol, allowing agents to function as servers that expose tools and capabilities to external clients. It features a sophisticated observability suite for capturing execution traces, performing LLM-based evaluations against datasets, and conducting side-by-side model output comparisons. The system covers a broad range of operational capabilities, including cron-based task scheduling, multi-tenant workspace isolation, and human-in-the-loop review workflows. It also manages long-term memory through semantic search and provides automated scaling of compute resources across cloud environments. A command-line interface is provided for local agent validation, graph packaging, and rapid testing via a local development server.
Deepagents is a comprehensive AI agent orchestration platform that natively supports stateful execution, long-term memory via semantic search, and complex agentic workflows, making it a direct fit for your requirements.
OpenHands is an autonomous agent framework designed for software engineering workflows. It provides a modular platform for orchestrating AI agents that reason, plan, and execute tasks within isolated, containerized development environments. By integrating with standard version control and development tools, the system enables agents to autonomously navigate codebases, implement features, and resolve issues through iterative reasoning and tool execution. The platform distinguishes itself through a model-agnostic orchestrator that connects diverse language models to a unified tool registry. It supports complex, multi-agent collaboration via hierarchical task delegation, allowing parent agents to spawn and manage independent sub-agents for parallelized workflows. Security is managed through configurable action approval policies and real-time risk evaluation, ensuring that autonomous operations remain within defined safety boundaries. The system covers a broad capability surface including persistent conversation state management, automated code review, and web research automation. It features an event-driven architecture that serializes interactions into immutable logs, facilitating observability and time-travel debugging. Developers can extend agent functionality through custom skill definitions, plugin packages, and integration with external services via standardized protocols. The project provides a command-line interface for managing agent sessions, remote server deployments, and containerized workspace lifecycles. It is designed for extensibility, allowing users to configure agent behavior through structured objects, markdown-based definitions, and environment-specific settings.
OpenHands is a comprehensive agent framework that supports persistent state management, multi-turn reasoning, and complex agentic workflows, making it a robust choice for autonomous software engineering tasks.
Mem0 is an agent-agnostic memory layer designed to provide intelligent agents with long-term persistence and cross-session state management. By acting as a centralized service, it allows diverse AI agents to recall user preferences, past interactions, and historical context, ensuring continuity across multiple workflows and independent agent systems. The platform distinguishes itself through a multi-signal retrieval engine that combines semantic vectors, keyword matching, and entity-linked metadata to surface the most relevant information. It employs an adaptive memory engine that automatically extracts, compresses, and updates data, while applying temporal decay logic to prioritize recent information and reduce noise. To support enterprise requirements, the system provides hierarchical multi-tenancy, enforcing strict data isolation and access control boundaries between different organizations, projects, and user groups. Beyond its core storage capabilities, the project offers a comprehensive suite of tools for managing the information lifecycle, including asynchronous event orchestration, webhook integration, and schema-based data structuring. It supports both self-hosted and cloud-based deployments, allowing developers to maintain full control over their infrastructure and data privacy. The project provides a Python-based initialization process and a command-line interface for managing memory records and configuring agent environments. Detailed documentation and integration guides are available to assist with implementation across various technology stacks.
Mem0 is a dedicated memory layer that provides long-term persistence, cross-session state management, and tool-like retrieval for AI agents, making it a comprehensive solution for the requested features.
langchaingo is an LLM application framework for Go designed for building language model-powered applications and autonomous agents. It serves as an orchestration library and tool integration framework that allows developers to link prompt sequences and model calls into complex, multi-step workflows. The project provides a toolkit for implementing retrieval-augmented generation pipelines by processing unstructured documents and retrieving relevant context via vector search. It includes a dedicated integration layer for indexing high-dimensional embeddings and performing similarity searches across various vector database backends. Its broader capabilities cover AI workflow automation, the creation of autonomous agents that use reasoning to execute external tools, and the management of conversation state to maintain context across multi-turn dialogues. The framework also supports integrating external search tools, executing database queries, and triggering third-party workflows.
This is a Go-based orchestration framework that provides the necessary tools for building autonomous agents, including support for conversation state management, vector-based long-term memory, and tool-calling capabilities.