1 repo
Comprehensive suites for assessing the quality and safety of non-deterministic AI systems.
Distinguishing note: Focuses on end-to-end AI evaluation rather than unit testing.
Explore 1 awesome GitHub repository matching testing & quality assurance · AI Application Evaluation. Refine with filters or upvote what's useful.
Provides a complete framework for evaluating AI application quality using automated and human-in-the-loop methods.