31 dépôts
Sandboxed or managed environments for agents to execute generated code.
Distinguishing note: Focuses on the execution of agent-generated logic rather than general-purpose code runners.
Explore 31 awesome GitHub repositories matching artificial intelligence & ml · Code Execution Environments. Refine with filters or upvote what's useful.
This project serves as a centralized directory and interoperability hub for the Model Context Protocol, providing a curated collection of standardized service connectors that bridge artificial intelligence models with external software, databases, and APIs. It facilitates the integration of AI agents with diverse ecosystems by offering a registry of machine-readable interface definitions that enable dynamic tool discovery and structured context injection. The directory distinguishes itself by focusing on the protocol-based interoperability required for autonomous AI agents to interact with he
Lists server implementations that provide isolated environments for models to safely execute and evaluate source code.
Deer-flow is an autonomous agent orchestration platform designed to manage multi-step workflows where AI agents reason, plan, and execute tasks. It functions as a development framework for building agents that utilize various large language models to solve complex problems through structured, sequential, and parallel reasoning. The platform distinguishes itself through a secure, sandboxed execution engine that isolates generated code and system operations from the host environment. This architecture allows agents to safely test and validate solutions within ephemeral containers, ensuring that
Ships a secure, sandboxed runtime environment for agents to safely execute and validate generated code and system operations.
Open Interpreter is a local language model agent framework that enables the deployment of autonomous agents capable of controlling a local operating system and its applications. It provides an execution environment where language models can run code and scripts directly on a computer to automate system tasks. The framework includes a computer control interface that allows language models to interact with web browsers and native user interfaces through programmatic commands. To ensure system stability, it utilizes a secure sandbox environment for the execution of model-generated code. The sys
Provides a managed environment for agents to execute commands and scripts across multiple operating systems.
Open Interpreter is a coding agent that uses large language models to write and execute code directly on a local host machine. It functions as a system for performing operating system tasks and file manipulations through a natural language interface. The project features a model orchestrator that allows switching between different language model providers and emulation harnesses. It employs a loop-based reasoning process to iteratively generate code and process execution output until a goal is achieved. Its capabilities include cross-platform system automation, local model integration for da
Executes agent-generated code within a restricted environment on the host machine for secure system manipulation.
AstrBot is an orchestration framework designed for building and managing autonomous agents that integrate multimodal artificial intelligence with secure, isolated execution environments. It serves as a platform for coordinating complex agentic workflows, allowing users to connect diverse language, speech, and vision models while maintaining personalized agent personas and domain-specific knowledge bases. The platform distinguishes itself through a modular plugin architecture and a centralized visual dashboard, which together enable users to extend agent capabilities and manage operational set
Provides secure, isolated environments for agents to execute generated code and shell commands safely.
This framework provides a development toolkit for building autonomous agents that utilize language models to solve complex, non-deterministic tasks. Its core design centers on a code-executing architecture where agents generate and run Python code snippets to perform logic, data manipulation, and tool interactions. By moving beyond structured data formats, the system enables agents to manage program flow and object state through iterative reasoning cycles. The project distinguishes itself through its focus on code-based agent implementation and secure execution environments. Developers can ch
Enables agents to generate and execute code snippets to perform complex logic and data manipulation.
This project is a Python framework for building autonomous, event-driven agent systems. It provides a unified runtime for orchestrating multi-agent workflows, managing persistent conversation state, and executing code within secure, isolated sandbox environments. The framework is designed to handle complex task delegation, allowing agents to invoke other agents as tools while maintaining context across multi-turn interactions. The framework distinguishes itself through its deep integration with the Model Context Protocol, enabling agents to connect to external data sources and remote services
Executes agent-driven code, shell commands, and filesystem operations within secure, isolated, and ephemeral sandbox environments.
Agentscope is a comprehensive toolkit for developing and orchestrating autonomous multi-agent systems. It provides a unified framework for building agents that can reason, execute tools, and manage memory, enabling the creation of complex, collaborative workflows where multiple specialized agents interact to solve multi-step objectives. The platform distinguishes itself through a robust orchestration engine that supports both sequential and concurrent agent pipelines. It utilizes a centralized event bus for real-time telemetry, allowing developers to track agent reasoning, tool usage, and sys
Executes agent-driven code within secure, isolated environments.
AgenticSeek is a multi-agent orchestration system designed to decompose complex user objectives into granular, actionable tasks. By coordinating a team of specialized autonomous workers, the platform manages end-to-end workflows, ensuring that each component of a project is assigned to the most capable agent for execution. The system operates as a local-first runtime, executing all artificial intelligence models directly on user hardware to maintain data sovereignty and privacy. It integrates a browser automation engine for autonomous web research and interaction, alongside a sandboxed enviro
Write, debug, and run scripts in various programming languages directly within the environment to perform complex tasks without needing external dependencies or constant human supervision.
Forem is an open-source platform designed for building and managing technical communities. It functions as a social publishing engine that enables members to share long-form content, participate in threaded discussions, and engage through social interactions. The platform provides tools for organizations to maintain branded profiles, host community hackathons, and facilitate collaborative learning through structured educational tracks. Beyond its social features, Forem integrates advanced capabilities for AI agent workflow orchestration and codebase knowledge graphing. It allows developers to
Runs agent-generated code within isolated, secure environments to protect host infrastructure.
Qwen-7B is a pretrained causal language model designed for natural language generation, text processing, and complex reasoning tasks. It is available as an instruction-tuned model optimized for conversational interactions and a tool-use model capable of executing function calls and interacting with external APIs. The project provides a quantized version of the model to reduce GPU memory usage and supports the development of autonomous agents that can execute code and perform functions to complete complex goals. The system covers a wide range of capabilities including model fine-tuning throug
Ships an environment where the model can write and execute code as an interpreter to solve problems.
PydanticAI is a Python framework designed for building production-grade autonomous agents. It provides a unified interface for interacting with diverse language models, enabling developers to construct agents that perform complex tasks through structured data validation, tool execution, and multi-turn conversation management. The library centers on type-safe schema enforcement, ensuring that model inputs and outputs remain consistent and reliable throughout the agent's lifecycle. The framework distinguishes itself through a robust architecture that emphasizes modularity and testability. It ut
Runs arbitrary code snippets in a secure environment to perform calculations, data analysis, or logic verification during the agent reasoning process.
This project is a comprehensive framework for building and managing autonomous agent systems. It provides a unified architecture for orchestrating multi-agent societies, where specialized agents collaborate through roleplay to decompose and solve complex tasks. The system integrates language models with external environments, enabling agents to perform real-world actions through a standardized tool-calling abstraction layer. The framework distinguishes itself through its focus on iterative reasoning and data reliability. It employs automated feedback loops to refine agent outputs and self-eva
Provides sandboxed environments for agents to execute generated Python code securely.
Kilocode is an autonomous engineering platform designed to orchestrate AI agents for complex software development tasks. It functions as a comprehensive system for automating coding, testing, and repository management by integrating directly with your codebase and terminal. The platform provides a unified gateway for model orchestration, allowing for the management of agentic workflows, event-driven automation, and persistent session state across distributed development environments. The platform distinguishes itself through its federated task management and policy-based access control, which
Automates code modifications and shell script execution within managed agent environments.
ChatGLM3 is a comprehensive framework for deploying, fine-tuning, and serving large language models. It functions as a high-performance inference engine designed to support conversational AI, enabling developers to build interactive agents capable of multi-turn dialogue, autonomous code execution, and structured tool invocation. The project distinguishes itself through its focus on hardware-agnostic deployment and resource optimization. It supports distributed model parallelism across multiple graphics cards, paged key-value caching for concurrent request processing, and weight quantization t
Runs generated code within a sandboxed environment to perform complex tasks like data visualization or symbolic computation.
ChatGLM3 is an open-weights large language model designed for bilingual conversational interactions in English and Chinese. It functions as a tool-augmented system capable of calling external functions and executing internal code to resolve complex tasks. The model utilizes four-bit quantization to reduce memory requirements, enabling inference on consumer hardware and diverse processing units including GPUs and CPUs. It features an expanded context window for processing and summarizing long documents and includes a supervised fine-tuning pipeline for adapting the model to specialized domains
Executes agent-generated code within a sandboxed environment to solve complex reasoning and mathematical tasks.
oh-my-pi is an agentic workflow automation platform and AI coding agent orchestrator designed for autonomous software engineering. It functions as a multi-model LLM router and an LSP-integrated development environment, coordinating specialized AI agents to perform codebase analysis, automated refactoring, and complex task execution. The system distinguishes itself through the use of subagent coordination to execute parallel tasks within isolated environments and an auto-research framework for iterative experiments. It employs AST-driven structural search for code discovery and content-hash an
Provides managed environments where AI agents can execute generated logic and process data.
LangChain4j is a framework and library for building applications powered by large language models on the JVM. It provides a unified API for developing AI agents, implementing retrieval augmented generation, and integrating generative AI capabilities into professional software built with frameworks like Spring Boot or Quarkus. The project enables the creation of autonomous agents that can reason through tasks, manage memory, and execute external tools to achieve specific goals. It differentiates itself through a unified model interface that allows developers to switch between multiple model pr
Provides managed environments for AI agents to execute dynamically generated code for computations or analysis.
RD-Agent is an autonomous framework designed to orchestrate multi-step software engineering and data science workflows. By leveraging large language models, the system decomposes complex technical requirements into actionable research, planning, and execution phases, ultimately generating and running code to solve specific development tasks. The platform distinguishes itself through a containerized execution sandbox that ensures secure dependency management and system stability for all autonomously generated code. It employs multi-agent orchestration to manage iterative feedback loops, allowi
Provides sandboxed environments for agents to securely execute generated code and manage dependencies.
Moltworker is an AI agent sandbox and model orchestrator designed for the secure execution of untrusted code and shell commands generated by large language models. It functions as a gateway proxy that routes requests to multiple AI providers through a unified interface, integrating a container runtime backed by S3-compatible object storage to persist state across ephemeral lifecycles. The system distinguishes itself by combining an AI model orchestrator with a headless browser controller for automated web scraping and screenshot capture. It manages the full lifecycle of AI agents, including m
Provides sandboxed environments specifically designed for AI agents to safely execute generated code and shell commands.