9 repos

Awesome GitHub RepositoriesEvaluation & Validation

Explore 9 awesome GitHub repositories matching artificial intelligence & ml · Evaluation & Validation. Refine with filters or upvote what's useful.

We'll search the best matching repositories with AI.

langchain-ai/langchain
langchain-ai/langchain
127,015GitHubView on GitHub
LangChain is an orchestration framework designed for building, managing, and deploying applications powered by large language models. It provides a unified integration layer that normalizes disparate model provider APIs into a consistent set of primitives, enabling developers to build complex, multi-step AI workflows t
Benchmarks model performance to assist in selecting appropriate providers for specific requirements.
Pythonagentsaiai-agents
microsoft/generative-ai-for-beginners
microsoft/generative-ai-for-beginners
106,618GitHubView on GitHub
This project is a comprehensive, open-source educational curriculum designed to guide developers through the mastery of generative artificial intelligence. It provides a structured learning path that covers foundational concepts, prompt engineering, and the practical application of large language models. The repository
Presents methodologies for systematically evaluating and comparing the performance of various large language models.
Jupyter Notebookaiazurechatgpt
fighting41love/funNLP
fighting41love/funNLP
78,999GitHubView on GitHub
This project is a community-driven knowledge base and curated repository focused on natural language processing and large language model development. It serves as a centralized index for high-quality tools, libraries, and research materials, organizing technical resources into structured, version-controlled documentati
Aggregates comparative data and interaction tools to help users evaluate the capabilities of diverse conversational agents.
Python
mlabonne/llm-course
mlabonne/llm-course
75,340GitHubView on GitHub
This project is a comprehensive educational curriculum and engineering handbook focused on the lifecycle of large language models. It serves as a structured knowledge base for machine learning practitioners, covering the fundamental mathematical and architectural principles of transformer-based sequence modeling, as we
Establishes systematic workflows for benchmarking model performance to inform iterative training improvements and data adjustments.
courselarge-language-modelsllm
dair-ai/Prompt-Engineering-Guide
dair-ai/Prompt-Engineering-Guide
70,526GitHubView on GitHub
This project is a comprehensive educational resource and knowledge base dedicated to the development and application of large language models and autonomous agentic systems. It provides a structured framework for understanding prompt engineering, context management, and the architectural patterns required to build task
Provides frameworks for using language models as automated evaluators to assess the quality and accuracy of other system outputs.
MDXagentagentsai-agents
scikit-learn/scikit-learn
scikit-learn/scikit-learn
65,178GitHubView on GitHub
Scikit-learn is a machine learning library for predictive data analysis that provides a collection of algorithms for supervised and unsupervised learning. It functions as a comprehensive toolkit for data preprocessing, dimensionality reduction, and model selection, allowing users to classify data objects, predict conti
Compares algorithm configurations and tunes hyperparameters to identify the most accurate approach for specific predictive tasks.
Pythondata-analysisdata-sciencemachine-learning
karpathy/nanoGPT
karpathy/nanoGPT
53,461GitHubView on GitHub
nanoGPT is a lightweight engine for training and fine-tuning transformer-based language models from scratch. It provides a minimalist codebase designed for educational exploration and rapid experimentation with neural network architectures, utilizing self-attention and feed-forward layers to process sequences and predi
Measures training speed and iteration throughput to identify performance bottlenecks.
Python
ultralytics/ultralytics
ultralytics/ultralytics
53,426GitHubView on GitHub
Ultralytics is a comprehensive computer vision framework designed for training, validating, and deploying deep learning models across a wide range of visual recognition tasks. It provides a unified interface for core operations including object detection, instance segmentation, pose estimation, and image classification
Validates segmentation accuracy by calculating performance metrics such as mean average precision for masks and boxes.
Pythonclicomputer-visiondeep-learning
unslothai/unsloth
unslothai/unsloth
52,461GitHubView on GitHub
Unsloth is a high-performance training and inference platform designed to optimize the lifecycle of large language and multimodal models. It provides a comprehensive engine for fine-tuning, executing, and managing models locally, with a focus on reducing memory consumption and increasing compute speed on consumer-grade
Monitors training progress via loss metrics and validates output quality through manual chat sessions or automated test sets.
Pythonagentdeepseekdeepseek-r1

Explore sub-tags

9 repos

Awesome GitHub RepositoriesEvaluation & Validation

Explore 9 awesome GitHub repositories matching artificial intelligence & ml · Evaluation & Validation. Refine with filters or upvote what's useful.

We'll search the best matching repositories with AI.

langchain-ai/langchain
langchain-ai/langchain
127,015GitHubView on GitHub
LangChain is an orchestration framework designed for building, managing, and deploying applications powered by large language models. It provides a unified integration layer that normalizes disparate model provider APIs into a consistent set of primitives, enabling developers to build complex, multi-step AI workflows t
Benchmarks model performance to assist in selecting appropriate providers for specific requirements.
Pythonagentsaiai-agents
microsoft/generative-ai-for-beginners
microsoft/generative-ai-for-beginners
106,618GitHubView on GitHub
This project is a comprehensive, open-source educational curriculum designed to guide developers through the mastery of generative artificial intelligence. It provides a structured learning path that covers foundational concepts, prompt engineering, and the practical application of large language models. The repository
Presents methodologies for systematically evaluating and comparing the performance of various large language models.
Jupyter Notebookaiazurechatgpt
fighting41love/funNLP
fighting41love/funNLP
78,999GitHubView on GitHub
This project is a community-driven knowledge base and curated repository focused on natural language processing and large language model development. It serves as a centralized index for high-quality tools, libraries, and research materials, organizing technical resources into structured, version-controlled documentati
Aggregates comparative data and interaction tools to help users evaluate the capabilities of diverse conversational agents.
Python
mlabonne/llm-course
mlabonne/llm-course
75,340GitHubView on GitHub
This project is a comprehensive educational curriculum and engineering handbook focused on the lifecycle of large language models. It serves as a structured knowledge base for machine learning practitioners, covering the fundamental mathematical and architectural principles of transformer-based sequence modeling, as we
Establishes systematic workflows for benchmarking model performance to inform iterative training improvements and data adjustments.
courselarge-language-modelsllm
dair-ai/Prompt-Engineering-Guide
dair-ai/Prompt-Engineering-Guide
70,526GitHubView on GitHub
This project is a comprehensive educational resource and knowledge base dedicated to the development and application of large language models and autonomous agentic systems. It provides a structured framework for understanding prompt engineering, context management, and the architectural patterns required to build task
Provides frameworks for using language models as automated evaluators to assess the quality and accuracy of other system outputs.
MDXagentagentsai-agents
scikit-learn/scikit-learn
scikit-learn/scikit-learn
65,178GitHubView on GitHub
Scikit-learn is a machine learning library for predictive data analysis that provides a collection of algorithms for supervised and unsupervised learning. It functions as a comprehensive toolkit for data preprocessing, dimensionality reduction, and model selection, allowing users to classify data objects, predict conti
Compares algorithm configurations and tunes hyperparameters to identify the most accurate approach for specific predictive tasks.
Pythondata-analysisdata-sciencemachine-learning
karpathy/nanoGPT
karpathy/nanoGPT
53,461GitHubView on GitHub
nanoGPT is a lightweight engine for training and fine-tuning transformer-based language models from scratch. It provides a minimalist codebase designed for educational exploration and rapid experimentation with neural network architectures, utilizing self-attention and feed-forward layers to process sequences and predi
Measures training speed and iteration throughput to identify performance bottlenecks.
Python
ultralytics/ultralytics
ultralytics/ultralytics
53,426GitHubView on GitHub
Ultralytics is a comprehensive computer vision framework designed for training, validating, and deploying deep learning models across a wide range of visual recognition tasks. It provides a unified interface for core operations including object detection, instance segmentation, pose estimation, and image classification
Validates segmentation accuracy by calculating performance metrics such as mean average precision for masks and boxes.
Pythonclicomputer-visiondeep-learning
unslothai/unsloth
unslothai/unsloth
52,461GitHubView on GitHub
Unsloth is a high-performance training and inference platform designed to optimize the lifecycle of large language and multimodal models. It provides a comprehensive engine for fine-tuning, executing, and managing models locally, with a focus on reducing memory consumption and increasing compute speed on consumer-grade
Monitors training progress via loss metrics and validates output quality through manual chat sessions or automated test sets.
Pythonagentdeepseekdeepseek-r1

Awesome Evaluation & Validation GitHub Repositories

langchain-ai/langchain

microsoft/generative-ai-for-beginners

fighting41love/funNLP

mlabonne/llm-course

dair-ai/Prompt-Engineering-Guide

scikit-learn/scikit-learn

karpathy/nanoGPT

ultralytics/ultralytics

unslothai/unsloth

Explore sub-tags

Awesome Evaluation & Validation GitHub Repositories

langchain-ai/langchain

microsoft/generative-ai-for-beginners

fighting41love/funNLP

mlabonne/llm-course

dair-ai/Prompt-Engineering-Guide

scikit-learn/scikit-learn

karpathy/nanoGPT

ultralytics/ultralytics

unslothai/unsloth

Explore sub-tags