awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
LLM Application Evaluation · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesLLM Application Evaluation

Frameworks and utilities for measuring the performance, accuracy, and reliability of language model applications.

Distinguishing note: Specifically targets the evaluation of LLM-based systems, distinct from traditional software unit or integration testing.

Explore 1 awesome GitHub repository matching testing & quality assurance · LLM Application Evaluation. Refine with filters or upvote what's useful.

  1. Home
  2. Testing & Quality Assurance
  3. LLM Application Evaluation

Awesome LLM Application Evaluation GitHub Repositories

Describe the repository you're looking for…
Find the best repos with AI.We'll search the best matching repositories with AI.
  • run-llama/llama_index

    run-llama/llama_index

    47,075View on GitHub↗

    LlamaIndex is a comprehensive development framework designed to connect private or external data sources to large language models. It functions as a data-centric toolkit that enables the construction of retrieval-augmented generation systems, allowing developers to build applications that provide context-aware answers based on specific organizational information. The project distinguishes itself through a robust agentic orchestration engine that supports the creation of autonomous agents capable of multi-step reasoning, memory management, and complex tool execution. Beyond simple retrieval, i

    LlamaIndex evaluates application performance using standardized datasets and testing patterns to iteratively improve accuracy and reliability.

    Pythonagentsapplicationdata
    47,075View on GitHub↗