LLM Tools

LLM tools and development frameworks for building, orchestrating, and deploying AI-powered applications, including agentic systems, RAG pipelines, and runtimes.

Find the best repos with AI.We'll search the best matching repositories with AI.

microsoft/autogen
microsoft/autogen
59,002View on GitHub
This framework provides a development environment for building collaborative systems where autonomous agents interact to solve complex tasks through conversational workflows. It functions as a conversational workflow engine and event-driven runtime, coordinating multi-step processes by translating high-level goals into structured dialogue sequences between specialized agents. The system distinguishes itself through its message-passing orchestration, which manages state transitions and task delegation between independent participants. It supports dynamic conversation state management to provide persistent memory during multi-turn interactions, and it incorporates human-in-the-loop capabilities that allow for review or modification of agent outputs at specific message boundaries. Beyond core orchestration, the framework enables the integration of pluggable tools, allowing agents to invoke external functions and APIs through natural language requests. This architecture supports the construction of scalable, event-driven systems that automate sequences of tasks across digital tools and connect large language models to external data sources for autonomous reasoning.
A leading framework for building collaborative systems where autonomous agents interact via conversational workflows.
PythonMulti-Agent Orchestration Frameworks
View on GitHub59,002
bentoml/openllm
bentoml/OpenLLM
12,115View on GitHub
OpenLLM is a framework for deploying, managing, and scaling open-source large language models
A robust framework focused on the deployment, serving, and performance monitoring of large language models.
PythonAI Workflow OrchestratorsLarge Language ModelsModel Gateways
View on GitHub12,115
open-webui/open-webui
open-webui/open-webui
141,777View on GitHub
Open WebUI is a self-hosted, web-based platform designed for interacting with local and remote artificial intelligence models. It functions as a unified interface and orchestration suite, enabling users to build, deploy, and manage specialized AI agents equipped with custom instructions, external tool access, and private knowledge bases. The platform distinguishes itself through a modular architecture that supports complex AI workflows. It features a plugin-based framework for custom logic and pipeline-based request processing, allowing developers to filter or transform data streams before they reach a model. For enterprise environments, it provides centralized model management, role-based access control, and integration with standard identity providers like LDAP and SSO. It also includes sandboxed code execution and vector-database-based retrieval, enabling models to perform secure computations and semantic searches across private document collections. Beyond its core chat capabilities, the platform offers extensive administrative and operational tools. It supports multi-node deployments, horizontal scaling, and comprehensive system observability to ensure reliability in production settings. Users can further customize the interface, manage API access via personal tokens, and utilize persistent workspaces for collaborative knowledge management. The software is packaged for container-orchestrated deployment, allowing for consistent execution across diverse cloud and local infrastructure.
A self-hosted interface and orchestration suite for interacting with and managing local or remote LLMs.
PythonAgent FrameworksAI Agent DevelopmentModel Gateways
View on GitHub141,777
h2oai/h2ogpt
h2oai/h2ogpt
12,016View on GitHub
h2oGPT is a self-hosted platform designed for running large language models and executing retrieval-augmented generation workflows locally. It provides a comprehensive web interface that allows users to index private document collections into searchable databases, enabling context-aware question answering and summarization without exposing sensitive data to external services. The platform distinguishes itself by offering a modular architecture that supports both local model execution and connections to external inference servers. It facilitates the development of autonomous agents capable of performing multi-step tasks by delegating actions to various tools and models. Beyond simple chat, the system includes capabilities for fine-tuning models on local hardware and managing the full lifecycle of predictive assets, from data ingestion and feature engineering to model deployment and performance monitoring. The software covers a broad range of enterprise-grade requirements, including document intelligence for extracting structured data from unstructured files, multi-GPU training support, and robust access control mechanisms. It provides tools for model explainability, compliance tracking, and collaborative experiment management to ensure transparency and reproducibility in machine learning workflows. The project is designed for containerized deployment, utilizing standard configuration files to ensure consistent execution across local and cloud environments.
A self-hosted platform for running LLMs and executing RAG workflows with a comprehensive web interface.
PythonAgentic Workflow AutomationAgentic Workflow OrchestrationAutonomous Agent Orchestration
View on GitHub12,016

Explore further

LLM Tools

microsoft/autogen

bentoml/OpenLLM

open-webui/open-webui

h2oai/h2ogpt