awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
Model-Based Evaluation · Awesome GitHub Repositories

1 repo

Automated testing and verification of model outputs using secondary models or benchmarks.

Distinguishing note: Focuses on using AI to evaluate AI, distinct from traditional software unit testing.
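
As a rough illustration of what "using AI to evaluate AI" can look like, here is a minimal sketch of an evaluate-and-refine loop. Everything in it is hypothetical: `toy_generate` and `toy_judge` are stand-ins for real model calls, and the keyword-coverage score and the 0.7 threshold are assumptions chosen only to keep the example runnable. It is not taken from any repository listed here.

```python
from typing import Callable, Dict, List

# Hypothetical signatures: a generator takes (prompt, feedback) and returns text;
# a judge takes (prompt, candidate) and returns a score in [0, 1].
Generator = Callable[[str, str], str]
Judge = Callable[[str, str], float]


def refine_until_pass(
    prompt: str,
    generate: Generator,
    judge: Judge,
    threshold: float = 0.7,  # assumed pass mark, not a standard value
    max_rounds: int = 3,
) -> List[Dict[str, object]]:
    """Generate, score with a secondary 'judge', and retry with feedback."""
    history: List[Dict[str, object]] = []
    feedback = ""
    for round_no in range(1, max_rounds + 1):
        candidate = generate(prompt, feedback)
        score = judge(prompt, candidate)
        passed = score >= threshold
        history.append(
            {"round": round_no, "candidate": candidate, "score": score, "passed": passed}
        )
        if passed:
            break
        # Feed the judge's verdict back to the generator for the next attempt.
        feedback = f"Previous answer scored {score:.2f}; be more complete."
    return history


def toy_generate(prompt: str, feedback: str) -> str:
    """Stand-in for a model call: gives a fuller answer once it gets feedback."""
    return "Paris" if not feedback else "The capital of France is Paris."


def toy_judge(prompt: str, candidate: str) -> float:
    """Stand-in for a judge model: fraction of required keywords present."""
    required = ["capital", "france", "paris"]
    hits = sum(word in candidate.lower() for word in required)
    return hits / len(required)


if __name__ == "__main__":
    for entry in refine_until_pass("What is the capital of France?", toy_generate, toy_judge):
        print(entry)
```

Unlike a unit test, the pass/fail decision here comes from a scoring function that would itself be a model in practice, so the threshold and the judge's reliability are design choices that need their own validation.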

Explore 1 awesome GitHub repository matching testing & quality assurance · Model-Based Evaluation. Refine with filters or upvote what's useful.

Awesome Model-Based Evaluation GitHub Repositories

  • dair-ai/Prompt-Engineering-Guide

    70,526 · View on GitHub↗

    This project is a comprehensive educational resource and technical guide focused on the development, optimization, and application of large language models. It provides a structured curriculum for mastering prompt engineering, ranging from foundational principles of instruction design to advanced techniques for improving model reasoning, accuracy, and reliability. The guide distinguishes itself by offering deep technical insights into agentic workflows and autonomous system design. It covers the implementation of multi-step reasoning chains, tool integration through function calling, and stat

    Uses automated evaluation loops to validate and refine model outputs against predefined criteria.

    MDX · agent · agents · ai-agents