30 open-source projects similar to datapuporg/datapup, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best DataPup alternative.
PraisonAI is an autonomous AI agent platform that coordinates multiple LLM-powered agents for research, planning, and execution of complex workflows. It functions as a multi-agent orchestration framework, a workflow builder, and a Model Context Protocol server, while also providing retrieval-augmented generation through vector knowledge bases. Agents can interact via CLI, web, or standardized protocols with sandboxed code execution. The platform distinguishes itself with a rich set of agent communication protocols, including A2A, REST, WebSocket, voice and telephony integration, and MCP, allo
Bloop is an AI code analysis tool and semantic search engine designed for understanding and querying large-scale codebases. It utilizes a high-performance indexing system written in Rust to enable fast symbol and text retrieval across multiple programming languages. The project differentiates itself by using on-device embeddings for semantic code search, allowing users to locate logic based on meaning and intent rather than exact keywords. It combines a language model with a retrieval-augmented generation approach to provide a natural language interface for conversational querying and the gen
whodb is a multi-database management interface and notebook client designed for exploring and managing data across various engines, including Postgres, MySQL, MongoDB, and Redis. It functions as a graphical interface for managing database connections, records, and schemas through a unified layer. The project features a natural language query interface that uses large language models to translate plain English into executable SQL or NoSQL queries. This is supported by schema-aware prompting that injects database metadata into the model context to ensure generated queries match actual table def
TMLR: A curated list of language modeling research for code (and other software engineering activities), plus related datasets.
An intelligent assistant for the entire software development lifecycle, powered by a multi-agent framework and working with DevOps toolkits, code and documentation repository RAG, and more.
Open Source, Language Agnostic Mutation Testing
PR-Agent is an AI-powered code review tool and developer assistant designed to automate pull request workflows. It functions as an automated reviewer and git workflow automation tool that uses language models to analyze code diffs and provide technical feedback. The project distinguishes itself through the ability to generate automated pull request descriptions and project changelogs based on code changes. It also enables contextual querying of a codebase, allowing users to ask questions about specific lines of code or change sets within a pull request. The system includes capabilities for A
Continue is an automated code review platform that integrates AI agents directly into the software development lifecycle. By executing custom validation rules against pull request diffs, it provides immediate feedback through repository status checks, allowing teams to enforce quality, security, and documentation standards before manual review begins. The system distinguishes itself through a file-based configuration model where validation logic is defined in version-controlled markdown files. These files act as system prompts that guide autonomous agents in evaluating code changes. This appr
DeepSeek-Coder is a large language model and foundational neural network architecture designed specifically for software development tasks. It functions as an artificial intelligence assistant capable of interpreting complex programming instructions to generate, transpile, and structure source code. The system distinguishes itself through its ability to perform project-level code generation, analyzing broader context and patterns across entire software projects rather than isolated files. It supports multimodal input processing, allowing for the integration of text and visual data to inform i
E2B is a cloud-based infrastructure platform designed to provide secure, isolated execution environments for code and shell commands. It functions as an ephemeral orchestrator that provisions lightweight virtual machines, allowing developers and autonomous agents to run untrusted processes within a sandbox that is completely separated from the host system. The platform distinguishes itself through its focus on programmable, serverless workspaces that support the full lifecycle of cloud-based development. By utilizing hardware-level isolation and snapshot-based resumption, it enables the near-
The open-source React.js Autonomous LLM Agent
MetaGPT is an agentic workflow orchestrator and multi-agent framework designed to transform natural language requirements into complete software deliverables. It functions as an AI software engineering suite that automates the creation of technical documentation, data structures, and source code by treating natural language as a programming environment. The system distinguishes itself by assigning professional roles to large language models, creating specialized agent teams that collaborate through a shared communication structure. It utilizes standard operating procedures to convert organiza
Free, Micro AI Code Reviews That Run on Git Commit
Open Interpreter is a coding agent that uses large language models to write and execute code directly on a local host machine. It functions as a system for performing operating system tasks and file manipulations through a natural language interface. The project features a model orchestrator that allows switching between different language model providers and emulation harnesses. It employs a loop-based reasoning process to iteratively generate code and process execution output until a goal is achieved. Its capabilities include cross-platform system automation, local model integration for da
AI Code Review with Full Control Over Model Choice and Costs.
BondAI is an open-source tool for developing AI agent systems. It handles implementation complexities such as memory/context management, error handling, and vector/semantic search, and includes a set of out-of-the-box tools and integrations.
Explore the Linux kernel source code with AI-generated summaries
Manifest is a language model provider unification system that standardizes access to multiple AI backends through a single interface. It functions as a centralized management layer for integrating various cloud-based and local model providers to simplify how applications request completions. The system provides intelligent model routing and high availability infrastructure by directing queries based on complexity and automatically triggering model fallbacks when a primary provider fails. It distinguishes itself through multi-tenant AI management, organizing agents into isolated groups with de
An AI-powered GitHub code review tool that uses LLMs to detect high-confidence, high-impact issues—such as security vulnerabilities, bugs, and maintainability concerns.
ChatDev is an automated software engineering platform that orchestrates the end-to-end development lifecycle through a multi-agent framework. It functions as a programmable engine that coordinates specialized autonomous agents to handle design, coding, testing, and documentation tasks by transitioning through predefined phases of a software project. The system distinguishes itself by using role-based agent specialization to simulate a professional engineering team, assigning distinct personas and knowledge bases to individual agents. It employs prompt-driven task decomposition to break high-l
OpenDevin is an autonomous software engineering agent and orchestrator designed to execute coding tasks and manage development workflows using large language models. It functions as a centralized control center for managing and switching between various local and cloud artificial intelligence backends. The system utilizes a Docker sandbox environment to isolate autonomous agents in containers, protecting the host filesystem during code execution. It includes an automated engineering workflow tool that integrates with version control and chat services to trigger tasks via webhooks or scheduled
🕹️ Open-source, developer-first LLMOps platform designed to streamline prompt design, version management, instant delivery, collaboration, troubleshooting, observability and more.
Scalene is a high-performance diagnostic utility designed to measure resource consumption during the execution of Python applications. It functions as a line-level monitor, providing granular insights that pinpoint the specific source code responsible for performance overhead. The tool distinguishes itself through statistical profiling that captures stack traces and resource usage without requiring manual instrumentation of the source code. It tracks CPU, GPU, and memory consumption by intercepting library-level calls and hardware driver commands, allowing for the analysis of both managed and
SWE-agent is a collection of autonomous agents designed for software engineering, competitive programming, and offensive cybersecurity operations. These agents utilize large language models to navigate codebases, interact with file systems, and use terminal interfaces to resolve GitHub issues or complete technical challenges. The system employs specialized agent modes that switch prompting strategies based on whether the task is a software bug, an algorithmic programming problem, or a security vulnerability. It includes dedicated capabilities for automated repository maintenance and offensive
A supplement to Copilot and Cursor that uses AI to batch-process an entire codebase.
Turbopilot is an open-source, large-language-model-based code completion engine that runs locally on CPU.
An open-source implementation of OpenAI's ChatGPT Code Interpreter.