This project is a containerized local AI infrastructure stack designed to deploy large language models and vector databases on private hardware. It functions as an orchestration platform that combines AI runners, knowledge graphs, and a visual workflow builder for creating agentic chatflows and automating tasks via tool integration.
The platform distinguishes itself through a low-code approach to agent orchestration, utilizing a visual interface to design complex sequences and connect agents to external tools and search engines. It includes a dedicated local observability stack to track prompts, traces, and application performance, as well as hardware-specific optimization profiles to maximize inference speed on graphics processors and central processing units.
The system covers a broad range of operational capabilities, including retrieval-augmented generation via vector database storage, centralized traffic routing with reverse proxy encryption, and shared-volume filesystem mounting for local data synchronization. It also manages network exposure to toggle between private and public web traffic configurations.
The infrastructure is deployed as a pre-configured set of Docker-based services.