2 repos
Standardized metrics and comparative evaluations used to assess the performance of machine learning models.
Explore 2 awesome GitHub repositories matching artificial intelligence & ml · Model Benchmarks. Refine with filters or upvote what's useful.
DeepSeek-V3 is a large language model that provides comprehensive resources for model utilization, including technical specifications, pre-trained weights, and evaluation benchmarks. The project details the core transformer architecture, including parameter counts and multi-token prediction modules, while supporting na
This project is a comprehensive educational resource and knowledge base dedicated to the development and application of large language models and autonomous agentic systems. It provides a structured framework for understanding prompt engineering, context management, and the architectural patterns required to build task