2 repos

Awesome GitHub RepositoriesAgent Evaluation Frameworks

Systems for assessing agent decision-making, action success, and conversation quality through automated scoring and feedback loops.

Explore 2 awesome GitHub repositories matching artificial intelligence & ml · Agent Evaluation Frameworks. Refine with filters or upvote what's useful.

We'll search the best matching repositories with AI.

dair-ai/Prompt-Engineering-Guide
dair-ai/Prompt-Engineering-Guide
70,526GitHubView on GitHub
This project is a comprehensive educational resource and knowledge base dedicated to the development and application of large language models and autonomous agentic systems. It provides a structured framework for understanding prompt engineering, context management, and the architectural patterns required to build task
Tracks key performance indicators such as task completion rates to measure the reliability of autonomous agent workflows.
MDXagentagentsai-agents
OpenHands/OpenHands
OpenHands/OpenHands
67,974GitHubView on GitHub
OpenHands is an autonomous agent framework designed for software engineering workflows. It provides a modular platform for orchestrating AI agents that reason, plan, and execute tasks within isolated, containerized development environments. By integrating with standard version control and development tools, the system
Triggers iterative refinement cycles whenever agent output quality falls below defined success thresholds.
Pythonagentartificial-intelligencechatgpt

Explore sub-tags

2 repos

Awesome GitHub RepositoriesAgent Evaluation Frameworks

Systems for assessing agent decision-making, action success, and conversation quality through automated scoring and feedback loops.

Explore 2 awesome GitHub repositories matching artificial intelligence & ml · Agent Evaluation Frameworks. Refine with filters or upvote what's useful.

We'll search the best matching repositories with AI.

dair-ai/Prompt-Engineering-Guide
dair-ai/Prompt-Engineering-Guide
70,526GitHubView on GitHub
This project is a comprehensive educational resource and knowledge base dedicated to the development and application of large language models and autonomous agentic systems. It provides a structured framework for understanding prompt engineering, context management, and the architectural patterns required to build task
Tracks key performance indicators such as task completion rates to measure the reliability of autonomous agent workflows.
MDXagentagentsai-agents
OpenHands/OpenHands
OpenHands/OpenHands
67,974GitHubView on GitHub
OpenHands is an autonomous agent framework designed for software engineering workflows. It provides a modular platform for orchestrating AI agents that reason, plan, and execute tasks within isolated, containerized development environments. By integrating with standard version control and development tools, the system
Triggers iterative refinement cycles whenever agent output quality falls below defined success thresholds.
Pythonagentartificial-intelligencechatgpt

Awesome Agent Evaluation Frameworks GitHub Repositories

dair-ai/Prompt-Engineering-Guide

OpenHands/OpenHands

Explore sub-tags

Awesome Agent Evaluation Frameworks GitHub Repositories

dair-ai/Prompt-Engineering-Guide

OpenHands/OpenHands

Explore sub-tags