awesome-repositories.comBlog
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPBlogSitemapPrivacyTerms
Agent Evaluation Tools · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesAgent Evaluation Tools

Specialized testing suites for assessing the reasoning, tool usage, and output quality of autonomous AI agents.

Distinguishing note: Distinct from general model evaluation: focuses on multi-step agentic workflows and tool-use verification.

Explore 1 awesome GitHub repository matching artificial intelligence & ml · Agent Evaluation Tools. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Agent Evaluation Tools

Awesome Agent Evaluation Tools GitHub Repositories

Describe the repository you're looking for…
Find the best repos with AI.We'll search the best matching repositories with AI.
  • mlflow/mlflow

    mlflow/mlflow

    24,319View on GitHub↗

    Analyze agent performance by defining test datasets and custom scorers to assess both final outputs and intermediate tool usage.

    Pythonagentopsagentsai
    24,319View on GitHub↗