awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Synthetic Data Generators · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesSynthetic Data Generators

Tools that automatically create datasets from existing content for testing and benchmarking purposes.

Distinguishing note: Focuses on automated dataset creation for evaluation, distinct from general-purpose testing frameworks.

Explore 1 awesome GitHub repository matching testing & quality assurance · Synthetic Data Generators. Refine with filters or upvote what's useful.

  1. Home
  2. Testing & Quality Assurance
  3. Synthetic Data Generators

Awesome Synthetic Data Generators GitHub Repositories

Describe the repository you're looking for…
Find the best repos with AI.We'll search the best matching repositories with AI.
  • run-llama/llama_index

    run-llama/llama_index

    47,075View on GitHub↗

    LlamaIndex is a comprehensive development framework designed to connect private or external data sources to large language models. It functions as a data-centric toolkit that enables the construction of retrieval-augmented generation systems, allowing developers to build applications that provide context-aware answers based on specific organizational information. The project distinguishes itself through a robust agentic orchestration engine that supports the creation of autonomous agents capable of multi-step reasoning, memory management, and complex tool execution. Beyond simple retrieval, i

    LlamaIndex generates synthetic questions from source documents to create datasets for testing and benchmarking pipelines without requiring manual label creation.

    Pythonagentsapplicationdata
    47,075View on GitHub↗