awesome-repositories.comBlog
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPBlogSitemapPrivacyTerms
Multi-Pass Extraction Pipelines · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesMulti-Pass Extraction Pipelines

Techniques for performing sequential extraction passes to improve data recall.

Distinguishing note: Focuses on the iterative improvement of extraction results.

Explore 1 awesome GitHub repository matching data & databases · Multi-Pass Extraction Pipelines. Refine with filters or upvote what's useful.

  1. Home
  2. Data & Databases
  3. Multi-Pass Extraction Pipelines

Awesome Multi-Pass Extraction Pipelines GitHub Repositories

Describe the repository you're looking for…
Find the best repos with AI.We'll search the best matching repositories with AI.
  • google/langextract

    google/langextract

    33,310View on GitHub↗

    Langextract is a framework designed to transform unstructured text into structured, machine-readable data using language model orchestration. It provides a high-performance pipeline that processes large volumes of narrative text by utilizing parallel execution and sequential extraction passes. The library is built to handle complex data extraction tasks, including specialized support for clinical information and medical entity relationship recognition. The project distinguishes itself through a plugin-based architecture that supports both local hardware execution and cloud-hosted model endpoi

    Performs multiple independent extraction passes over text to improve recall and capture missed entities.

    Pythongeminigemini-aigemini-api
    33,310View on GitHub↗