1 repo
Tools for measuring, tracking, and optimizing the performance of AI programs using metrics and model-based judges.
Distinguishing note: Focuses on programmatic evaluation and optimization of AI outputs, distinct from general application performance monitoring (APM).
Explore 1 awesome GitHub repository matching testing & quality assurance · AI Performance Evaluation Harnesses. Refine with filters or upvote what's useful.
DSPy is a declarative programming framework designed for building complex language model applications. It treats model interactions as modular, composable programs, allowing developers to define task logic through typed class schemas rather than relying on manually written prompts. By organizing workflows into hierarchical, reusable Python objects, the framework enables the construction of sophisticated AI systems that manage state and execution flow independently. The framework distinguishes itself through an automated optimization engine that iteratively refines prompt instructions and few-
DSPy tracks internal execution metrics and inspects runtime behavior using integrated diagnostic tools to identify bottlenecks and improve overall software reliability during development.