30 open-source projects similar to business-science/ai-data-science-team, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Ai Data Science Team alternative.
PraisonAI is an autonomous AI agent platform that coordinates multiple LLM-powered agents for research, planning, and execution of complex workflows. It functions as a multi-agent orchestration framework, a workflow builder, and a Model Context Protocol server, while also providing retrieval-augmented generation through vector knowledge bases. Agents can interact via CLI, web, or standardized protocols with sandboxed code execution. The platform distinguishes itself with a rich set of agent communication protocols, including A2A, REST, WebSocket, voice and telephony integration, and MCP, allo
The GenAI Toolbox is a framework designed to integrate large language models with structured databases, enabling autonomous data analysis and information retrieval. It functions as an agentic orchestrator that translates natural language prompts into executable database queries, allowing users to interact with complex data sources through conversational interfaces. The system distinguishes itself by utilizing schema-driven metadata serialization, which maps database structures into formats that language models can interpret to perform autonomous reasoning. By maintaining stateful conversation
This project provides a comprehensive framework for building, deploying, and orchestrating autonomous agents within a decentralized network. It serves as a collection of patterns and examples for developing intelligent software entities capable of performing complex tasks, making decisions, and interacting with other agents to achieve shared goals. The framework distinguishes itself through its focus on multi-agent orchestration and decentralized communication. It enables the coordination of specialized agent teams that collaborate on workflows through structured messaging protocols, allowing
Eigent is a comprehensive platform for developing, configuring, and orchestrating autonomous AI agents. It functions as an agent development environment and workflow automation engine, enabling users to build modular agents equipped with custom toolsets, domain-specific skill packages, and external API connections to perform targeted operational tasks. The framework distinguishes itself through a robust multi-agent orchestration layer that coordinates teams of specialized agents to execute complex workflows. By utilizing hierarchical task decomposition, the system breaks high-level goals into
ZenML is an extensible machine learning orchestration framework designed to manage the end-to-end lifecycle of data pipelines and AI agent workflows. It functions as a durable orchestrator that executes machine learning tasks as directed acyclic graphs, ensuring that every step is containerized for consistent performance across local, cloud, and hybrid infrastructure. By decoupling pipeline code from underlying compute and storage backends, the platform allows developers to define infrastructure-agnostic stacks that remain portable across diverse environments. The project distinguishes itself
Oh-my-agent is a vendor-agnostic orchestration framework designed to manage autonomous agent teams and automate complex engineering workflows. It functions as a multi-agent development tool that synchronizes agent behavior, skills, and project-specific rules across diverse development environments and command-line interfaces. The platform distinguishes itself through configuration-based projection, which maintains a single source of truth for agent definitions that are mapped into various vendor-specific runtime formats. By utilizing cross-platform symlink bridging and a vendor-agnostic skill
AG2 is a multi-agent large language model orchestration framework, agentic workflow automation tool, and RAG-enabled agent platform. It functions as a communication protocol and framework for coordinating multiple AI agents to solve complex tasks through shared state and standardized messaging. The project distinguishes itself through flexible coordination strategies, including hierarchical agent organization, hub-and-spoke models, and dynamic routing that analyzes conversation context to distribute work. It implements multi-stage feedback loops for iterative refinement and uses schema-constr
CrewAI is a multi-agent orchestration framework and autonomous agent workflow engine. It provides a system for coordinating autonomous AI agents with specific roles and goals to solve complex tasks through collaborative intelligence. The framework distinguishes itself through a collaborative AI agent system that enables multiple language model instances to share intelligence and execute multi-step objectives via role-playing. It incorporates human-in-the-loop mechanisms, allowing for manual review checkpoints to validate decisions and refine outcomes within autonomous execution paths. The pl
This project is a comprehensive framework for building and managing autonomous agent systems. It provides a unified architecture for orchestrating multi-agent societies, where specialized agents collaborate through roleplay to decompose and solve complex tasks. The system integrates language models with external environments, enabling agents to perform real-world actions through a standardized tool-calling abstraction layer. The framework distinguishes itself through its focus on iterative reasoning and data reliability. It employs automated feedback loops to refine agent outputs and self-eva
Miroflow is an agent orchestration framework designed to coordinate multiple large language models and autonomous agents to perform complex research and reasoning tasks. It functions as a hierarchical workflow manager that distributes workloads across specialized agents using intent recognition and structured planning to gather deep information and solve challenging queries. The system distinguishes itself through a multi-model integration gateway and a provider-agnostic interface, allowing it to unify various language model providers. It extends these models via a tool-augmented framework th
gptme is an autonomous AI agent server and framework designed for local system automation, software development, and code execution. It operates as a local execution engine that enables language models to run shell commands, modify local files, and interact with the operating system. The project functions as a Model Context Protocol client, integrating with external servers to expand agent capabilities with standardized tools and data sources. It features a provider-agnostic routing system to orchestrate tasks across multiple proprietary cloud APIs and local AI backends. The system includes
Antigravity-kit is a multi-agent orchestrator and routing engine designed to coordinate specialized large language model agents. It functions as a conversational workflow automation tool and a context management system that executes complex tasks through a chat interface. The system utilizes a routing engine to classify user requests and dispatch them to domain-expert agents. It employs a multi-agent orchestration model that allows specialist workers to operate in parallel and combine their outputs. To manage operational efficiency, the kit includes a memory layer for storing project convent
SQL Chat is a Docker-deployed chat interface that translates natural language questions into SQL queries and executes them against connected databases. It uses a large language model to generate SQL from plain English instructions, supporting both querying and record modification through INSERT, UPDATE, and DELETE statements within the chat conversation flow. The application connects to MySQL, PostgreSQL, MSSQL, TiDB Cloud, and OceanBase databases through a unified driver abstraction layer, allowing users to interact with multiple database types from a single chat interface. Users provide the
DataFlow is an agent-based workflow orchestrator and data pipeline designed to synthesize, clean, and augment large-scale datasets for training large language models. It functions as a synthetic data generator and text curation tool, utilizing an intelligent assistant to assemble modular processing operators into functional pipelines based on user requirements. The project distinguishes itself through a low-code approach, providing a web-based visual interface for designing and monitoring multi-stage execution flows. It features an operator-based registry system that allows for the integratio
whodb is a multi-database management interface and notebook client designed for exploring and managing data across various engines, including Postgres, MySQL, MongoDB, and Redis. It functions as a graphical interface for managing database connections, records, and schemas through a unified layer. The project features a natural language query interface that uses large language models to translate plain English into executable SQL or NoSQL queries. This is supported by schema-aware prompting that injects database metadata into the model context to ensure generated queries match actual table def
This is an open-source Python SDK for building and orchestrating production-grade AI agents. It provides a unified framework for creating conversational agents that can use tools, maintain state, and coordinate across multiple language model providers including OpenAI, Anthropic, Google, Amazon Bedrock, and locally-hosted models. The SDK supports multi-agent orchestration through graphs, teams, and swarms, allowing several specialized agents to collaborate on complex tasks. Agents can be composed as callable tools that other agents invoke, and the framework includes policy handlers that inspe
This project is a Java-based framework integration that provides an AI agent runtime, a graph-based AI workflow engine, and an LLM orchestration framework for Spring applications. It enables the development of stateful autonomous agents and the implementation of retrieval-augmented generation systems using document processing and vector databases. The framework distinguishes itself through a graph-based workflow runtime for designing complex AI pipelines with conditional routing and persistent state. It supports multi-agent orchestration via service-discovery coordination and provides human-i
Kiro is an AI-powered development tool and multi-agent workflow orchestrator. It functions as a context-aware code generator and coding assistant that transforms natural language requirements into structured implementation plans and production-grade code. The system distinguishes itself through multi-agent task decomposition, where complex requirements are broken into sequenced tasks and assigned to specialized agents. It features multi-model orchestration to select specific language models based on reasoning complexity, cost, and latency, and includes a headless command-line interface for id
gh-aw is a GitHub automation platform and orchestration framework that uses an agentic workflow engine to automate repository management and code reviews. It translates natural language markdown and configuration files into secure, automated task sequences driven by large language models. The system integrates a Model Context Protocol gateway to route calls between AI agents and external tools. It distinguishes itself through a comprehensive security guardrail system that provides sandboxed execution for protocol servers, network egress controls via domain allowlists, and human-in-the-loop ap
Apache NiFi is a flow-based programming platform that enables the visual design, monitoring, and management of data pipelines. At its core, it provides a web-based visual dataflow designer where users build directed graphs of processors to route, transform, and mediate data movement between any source and destination without writing custom code. The system records fine-grained data provenance for every data item from ingestion to delivery, supporting audit, debugging, and replay of data lineage. The platform distinguishes itself through a zero-master cluster architecture that distributes proc
Sqlcoder is a text-to-SQL large language model specialized in converting natural language questions into structured, executable database queries. It functions as a database interface and query generator that allows for data retrieval without requiring manual code. The system utilizes an instruction-tuned model combined with schema-aware prompting and dynamic context injection. By ingesting database metadata and using in-context learning with example query pairs, it generates syntactically valid queries that match the specific schema of a connected database. The project covers a broader range
Langroid is a multi-agent orchestration framework and tool integration suite designed for building complex AI applications. It serves as a multi-modal integration layer that connects diverse local and remote language models with an agentic retrieval-augmented generation system. The project distinguishes itself through a collaborative message-exchange paradigm, allowing specialized agents to delegate tasks hierarchically and coordinate via structured communication. It features an advanced state management system for conversational AI, including the ability to rewind and prune conversation hist
Kilocode is an autonomous engineering platform designed to orchestrate AI agents for complex software development tasks. It functions as a comprehensive system for automating coding, testing, and repository management by integrating directly with your codebase and terminal. The platform provides a unified gateway for model orchestration, allowing for the management of agentic workflows, event-driven automation, and persistent session state across distributed development environments. The platform distinguishes itself through its federated task management and policy-based access control, which
PydanticAI is a Python framework designed for building production-grade autonomous agents. It provides a unified interface for interacting with diverse language models, enabling developers to construct agents that perform complex tasks through structured data validation, tool execution, and multi-turn conversation management. The library centers on type-safe schema enforcement, ensuring that model inputs and outputs remain consistent and reliable throughout the agent's lifecycle. The framework distinguishes itself through a robust architecture that emphasizes modularity and testability. It ut
Letta is a framework for building, deploying, and managing autonomous AI agents that maintain persistent state across long-term interactions. It provides a comprehensive suite of primitives for defining agents with configurable personas, modular memory blocks, and tool-use capabilities, enabling them to retain user preferences and conversation history over extended sessions. The platform distinguishes itself through its advanced memory management and orchestration capabilities. It allows agents to autonomously update their own memory, perform retrieval-augmented generation, and coordinate com
Youtu Agent is an open-source framework for building, running, and evaluating autonomous agents powered by large language models. It provides the core infrastructure for creating agents that follow reasoning loops, use toolkits, and coordinate with other agents to solve complex tasks, all managed through YAML-driven configuration files. The framework distinguishes itself through its support for multi-agent orchestration, where a planner agent decomposes tasks and coordinates specialized worker agents, and through its integration with the Model Context Protocol for connecting to external toolk
The agent-framework is an LLM agent orchestration framework and multi-agent workflow engine designed for building autonomous AI agents. It provides a tool integration layer for binding external functions, APIs, and sandboxed code as executable tools for language models. The framework distinguishes itself through a graph-based system for designing sequential and parallel task flows, featuring state management and checkpointing for long-running processes. It implements comprehensive conversational state management and an observability suite that uses telemetry to trace execution flows and monit
This project is an autonomous software engineering platform and orchestration framework designed to manage specialized artificial intelligence agents. It provides a suite of tools for coordinating autonomous entities to execute complex development tasks, ranging from architectural planning and code reviews to performance optimization. The platform distinguishes itself through its multi-agent orchestration layer, which dynamically assigns roles based on an analysis of a project's technology stack. By utilizing a modular agent registry, the system scales capabilities across different software m
This project serves as a dual-purpose platform that functions both as a comprehensive software engineering learning resource and an autonomous agent orchestration framework. It provides a structured curriculum focused on the Java ecosystem, offering technical roadmaps, interview preparation materials, and career mentorship. Simultaneously, it acts as a technical foundation for building intelligent systems, enabling developers to construct complex, multi-step agent pipelines. The framework distinguishes itself by integrating advanced automation capabilities directly into its educational missio
CS-Base is a comprehensive educational platform and technical repository designed to support software engineers in mastering backend architecture, artificial intelligence engineering, and career development. It functions as a centralized knowledge hub that combines illustrated theoretical tutorials with practical, project-based learning to bridge the gap between foundational computer science concepts and professional industry requirements. The project distinguishes itself by integrating a robust career mentorship framework with advanced AI engineering resources. It provides users with tools f