LLM tools and development frameworks for building, orchestrating, and deploying AI-powered applications, including agentic systems, RAG pipelines, and runtimes.
This framework provides a development environment for building collaborative systems where autonomous agents interact to solve complex tasks through conversational workflows. It functions as a conversational workflow engine and event-driven runtime, coordinating multi-step processes by translating high-level goals into structured dialogue sequences between specialized agents. The system distinguishes itself through its message-passing orchestration, which manages state transitions and task delegation between independent participants. It supports dynamic conversation state management to provide persistent memory during multi-turn interactions, and it incorporates human-in-the-loop capabilities that allow for review or modification of agent outputs at specific message boundaries. Beyond core orchestration, the framework enables the integration of pluggable tools, allowing agents to invoke external functions and APIs through natural language requests. This architecture supports the construction of scalable, event-driven systems that automate sequences of tasks across digital tools and connect large language models to external data sources for autonomous reasoning.
A leading framework for building collaborative systems where autonomous agents interact via conversational workflows.
OpenLLM is a framework for deploying, managing, and scaling open-source large language models
A robust framework focused on the deployment, serving, and performance monitoring of large language models.
Open WebUI is a self-hosted, web-based platform designed for interacting with local and remote artificial intelligence models. It functions as a unified interface and orchestration suite, enabling users to build, deploy, and manage specialized AI agents equipped with custom instructions, external tool access, and private knowledge bases. The platform distinguishes itself through a modular architecture that supports complex AI workflows. It features a plugin-based framework for custom logic and pipeline-based request processing, allowing developers to filter or transform data streams before they reach a model. For enterprise environments, it provides centralized model management, role-based access control, and integration with standard identity providers like LDAP and SSO. It also includes sandboxed code execution and vector-database-based retrieval, enabling models to perform secure computations and semantic searches across private document collections. Beyond its core chat capabilities, the platform offers extensive administrative and operational tools. It supports multi-node deployments, horizontal scaling, and comprehensive system observability to ensure reliability in production settings. Users can further customize the interface, manage API access via personal tokens, and utilize persistent workspaces for collaborative knowledge management. The software is packaged for container-orchestrated deployment, allowing for consistent execution across diverse cloud and local infrastructure.
A self-hosted interface and orchestration suite for interacting with and managing local or remote LLMs.
h2oGPT is a self-hosted platform designed for running large language models and executing retrieval-augmented generation workflows locally. It provides a comprehensive web interface that allows users to index private document collections into searchable databases, enabling context-aware question answering and summarization without exposing sensitive data to external services. The platform distinguishes itself by offering a modular architecture that supports both local model execution and connections to external inference servers. It facilitates the development of autonomous agents capable of performing multi-step tasks by delegating actions to various tools and models. Beyond simple chat, the system includes capabilities for fine-tuning models on local hardware and managing the full lifecycle of predictive assets, from data ingestion and feature engineering to model deployment and performance monitoring. The software covers a broad range of enterprise-grade requirements, including document intelligence for extracting structured data from unstructured files, multi-GPU training support, and robust access control mechanisms. It provides tools for model explainability, compliance tracking, and collaborative experiment management to ensure transparency and reproducibility in machine learning workflows. The project is designed for containerized deployment, utilizing standard configuration files to ensure consistent execution across local and cloud environments.
A self-hosted platform for running LLMs and executing RAG workflows with a comprehensive web interface.
Explore further