6 repos
Diagnostic and benchmarking tools designed to inspect model reasoning, trace execution flows, and validate performance metrics.
Explore 6 awesome GitHub repositories matching artificial intelligence & ml · AI Observability and Evaluation. Refine with filters or upvote what's useful.
This project is a community-maintained directory of technical resources, tools, and services that offer free tiers for developers. It serves as a centralized reference point for discovering infrastructure, software, and educational materials, helping individuals and teams minimize operational costs while building and s
This project is a comprehensive, curated collection of software resources designed for the macOS ecosystem. It serves as a centralized directory for discovering applications across a wide range of functional domains, including professional development, system management, and personal productivity. The directory distin
This repository serves as a comprehensive collection of resources, templates, and starter code for building artificial intelligence applications. It provides a centralized hub for developers to access practical implementations of common workflows, including retrieval-augmented generation pipelines and autonomous agent
This project is a community-driven knowledge base and curated repository focused on natural language processing and large language model development. It serves as a centralized index for high-quality tools, libraries, and research materials, organizing technical resources into structured, version-controlled documentati
This project is a comprehensive educational resource and knowledge base dedicated to the development and application of large language models and autonomous agentic systems. It provides a structured framework for understanding prompt engineering, context management, and the architectural patterns required to build task
OpenHands is an autonomous agent framework designed for software engineering workflows. It provides a modular platform for orchestrating AI agents that reason, plan, and execute tasks within isolated, containerized development environments. By integrating with standard version control and development tools, the system