awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
AI Performance Evaluation Harnesses · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesAI Performance Evaluation Harnesses

Tools for measuring, tracking, and optimizing the performance of AI programs using metrics and model-based judges.

Distinguishing note: Focuses on programmatic evaluation and optimization of AI outputs, distinct from general application performance monitoring (APM).

Explore 1 awesome GitHub repository matching testing & quality assurance · AI Performance Evaluation Harnesses. Refine with filters or upvote what's useful.

  1. Home
  2. Testing & Quality Assurance
  3. AI Performance Evaluation Harnesses

Awesome AI Performance Evaluation Harnesses GitHub Repositories

Describe the repository you're looking for…
Find the best repos with AI.We'll search the best matching repositories with AI.
  • stanfordnlp/dspy

    stanfordnlp/dspy

    32,291View on GitHub↗

    DSPy is a declarative programming framework designed for building complex language model applications. It treats model interactions as modular, composable programs, allowing developers to define task logic through typed class schemas rather than relying on manually written prompts. By organizing workflows into hierarchical, reusable Python objects, the framework enables the construction of sophisticated AI systems that manage state and execution flow independently. The framework distinguishes itself through an automated optimization engine that iteratively refines prompt instructions and few-

    DSPy tracks internal execution metrics and inspects runtime behavior using integrated diagnostic tools to identify bottlenecks and improve overall software reliability during development.

    Python
    32,291View on GitHub↗