awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Schema-Driven Extraction · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesSchema-Driven Extraction

Tools that map unstructured web content into predefined data structures using automated path selection.

Explore 1 awesome GitHub repository matching data & databases · Schema-Driven Extraction. Refine with filters or upvote what's useful.

  1. Home
  2. Data & Databases
  3. Data Processing Pipelines
  4. Data Processing
  5. Document and Unstructured Extraction
  6. Schema-Driven Extraction

Awesome Schema-Driven Extraction GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • unclecode/crawl4ai

    unclecode/crawl4ai

    60,452GitHubView on GitHub↗

    Crawl4AI is an AI-powered web crawling and data extraction engine designed to transform complex web content into structured formats. It functions as a headless browser orchestrator, enabling the navigation of dynamic websites, the execution of custom scripts, and the capture of visual assets like screenshots and PDFs.

    Python