Youtu Agent is an open-source framework for building, running, and evaluating autonomous agents powered by large language models. It provides the core infrastructure for creating agents that follow reasoning loops, use toolkits, and coordinate with other agents to solve complex tasks, all managed through YAML-driven configuration files.
The framework distinguishes itself through its support for multi-agent orchestration, where a planner agent decomposes tasks and coordinates specialized worker agents, and through its integration with the Model Context Protocol for connecting to external toolkits. It includes a sandboxed code execution environment that supports over 20 programming languages, browser automation capabilities for web research, and a trajectory-based performance distillation method that improves agent performance without fine-tuning model parameters.
Beyond core agent development, the framework offers a comprehensive evaluation pipeline with database-backed experiment tracking, configurable judging using language models or rule-based matching, and support for resuming interrupted evaluations. It provides tooling for defining custom reward functions, running multi-phase benchmarks, and comparing experiment results. The system also includes web-based interfaces for interacting with agents, Docker deployment options, and support for multimodal inputs including images and video.