1 repo
Systems for benchmarking and scoring the performance of automated agents against expected outcomes.
Distinguishing note: Focuses on accuracy scoring and performance benchmarking rather than standard unit testing.
Explore 1 awesome GitHub repository matching testing & quality assurance · Automated Evaluation Frameworks. Refine with filters or upvote what's useful.
Agno is an agent operating system designed to manage the lifecycle, tool execution, and persistent state of autonomous agents across distributed infrastructure. It provides a unified runtime environment that wraps diverse agent frameworks into a consistent, interoperable protocol, allowing developers to build and deploy complex multi-agent systems that coordinate tasks and delegate sub-processes. The platform distinguishes itself through a robust governance and orchestration layer that includes human-in-the-loop approval gates, role-based access control, and a centralized API gateway. It feat
AgentOS measures agent or team performance by comparing actual responses against expected outputs using an automated judge to score accuracy.