awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
Extensible Data Extractors · Awesome GitHub Repositories

1 repo


Modular architectures for building custom data extraction plugins and storage backends.

Distinguishing note: Focuses on the extensibility of the extraction architecture, distinct from the scrapers themselves.



Awesome Extensible Data Extractors GitHub Repositories


    NanmiCoder/MediaCrawler

    44,037 · View on GitHub↗

    MediaCrawler is an automated web scraping framework designed to extract public posts, comments, and creator metadata from various social media platforms. It functions as a headless browser automator, utilizing real browser instances to render dynamic content and execute the client-side scripts necessary for interacting with modern web interfaces. The system distinguishes itself through a focus on session persistence and network flexibility. It supports remote debugging to reuse active browser sessions and cookies, which helps minimize the risk of triggering platform security challenges. To ma

    Provides a modular architecture for integrating new platforms and custom storage backends.

    Python
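
The "modular architecture for integrating new platforms and custom storage backends" described above can be pictured with a small plugin registry. The sketch below is only an illustration of that general pattern and is not taken from MediaCrawler's code: every name in it (`Extractor`, `StorageBackend`, `register`, `DemoExtractor`, `MemoryBackend`, `run`) is hypothetical, and the demo extractor yields made-up records instead of scraping anything.

```python
from abc import ABC, abstractmethod
from typing import Callable, Dict, Iterable, List, Tuple, Type


class Extractor(ABC):
    """Hypothetical interface every platform plugin would implement."""

    @abstractmethod
    def extract(self, query: str) -> Iterable[dict]:
        ...


class StorageBackend(ABC):
    """Hypothetical interface every storage plugin would implement."""

    @abstractmethod
    def save(self, records: Iterable[dict]) -> int:
        ...


# Registries map a short name to a plugin class (assumed design, not MediaCrawler's).
EXTRACTORS: Dict[str, Type[Extractor]] = {}
BACKENDS: Dict[str, Type[StorageBackend]] = {}


def register(registry: Dict[str, type], name: str) -> Callable[[type], type]:
    """Class decorator that records a plugin class under `name`."""
    def decorator(cls: type) -> type:
        registry[name] = cls
        return cls
    return decorator


@register(EXTRACTORS, "demo")
class DemoExtractor(Extractor):
    """Stand-in platform plugin that yields three fabricated records."""

    def extract(self, query: str) -> Iterable[dict]:
        for i in range(3):
            yield {"platform": "demo", "query": query, "post_id": i}


@register(BACKENDS, "memory")
class MemoryBackend(StorageBackend):
    """Stand-in storage plugin that keeps records in a list."""

    def __init__(self) -> None:
        self.rows: List[dict] = []

    def save(self, records: Iterable[dict]) -> int:
        before = len(self.rows)
        self.rows.extend(records)
        return len(self.rows) - before  # number of records stored by this call


def run(platform: str, backend: str, query: str) -> Tuple[StorageBackend, int]:
    """Look up both plugins by name and pipe extracted records into storage."""
    extractor = EXTRACTORS[platform]()
    store = BACKENDS[backend]()
    saved = store.save(extractor.extract(query))
    return store, saved


if __name__ == "__main__":
    store, saved = run("demo", "memory", "cats")
    print(saved, store.rows)
```

Under these assumptions, adding a new platform or backend means writing one more decorated class; the `run` function and the existing plugins stay unchanged, which is the property the category description refers to as extensibility.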