1 repo
Standardized tests and metrics used to measure the performance, reasoning capabilities, and context handling of artificial intelligence models.
Explore 1 awesome GitHub repository matching artificial intelligence & ml · Artificial Intelligence Benchmarks. Refine with filters or upvote what's useful.
This project is a comprehensive educational resource and knowledge base dedicated to the development and application of large language models and autonomous agentic systems. It provides a structured framework for understanding prompt engineering, context management, and the architectural patterns required to build task
Lists benchmarks and evaluation metrics for assessing how well models recall information across large context windows.