1 repo
Collections of tests and metrics specifically designed to evaluate the reliability, safety, and ethical alignment of large language models.
Explore 1 awesome GitHub repository matching testing & quality assurance · Trustworthiness Benchmarks. Refine with filters or upvote what's useful.
This project is a comprehensive educational resource and knowledge base dedicated to the development and application of large language models and autonomous agentic systems. It provides a structured framework for understanding prompt engineering, context management, and the architectural patterns required to build task