1 repo
Ready-to-use automated judges for assessing common AI quality concerns like hallucination and relevance.
Distinguishing note: Focuses on pre-configured evaluation models rather than custom metric definitions.
Explore 1 awesome GitHub repository matching testing & quality assurance · Pre-built Evaluation Judges. Refine with filters or upvote what's useful.
Ships pre-built judges to instantly assess safety, hallucination, and retrieval quality.