awesome-repositories.comBlog
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPBlogSitemapPrivacyTerms
Web Data Extraction · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesWeb Data Extraction

Tools for programmatically scraping and processing web content.

Distinguishing note: Focuses on data collection rather than general browser automation.

Explore 1 awesome GitHub repository matching data & databases · Web Data Extraction. Refine with filters or upvote what's useful.

  1. Home
  2. Data & Databases
  3. Web Data Extraction

Awesome Web Data Extraction GitHub Repositories

Describe the repository you're looking for…
Find the best repos with AI.We'll search the best matching repositories with AI.
  • SeleniumHQ/selenium

    SeleniumHQ/selenium

    34,054View on GitHub↗

    Selenium is a comprehensive browser automation framework that provides a standardized interface for controlling web browsers to perform automated tasks, user interactions, and data extraction. It functions as a cross-browser testing tool, enabling developers to execute identical automation scripts across various browser engines and operating systems to ensure consistent application behavior. By implementing the WebDriver protocol, it maps high-level automation commands to browser-specific drivers using a standardized HTTP-based wire protocol. The project distinguishes itself through its distr

    Navigates websites to programmatically collect and process information from public sources.

    Javadotnetjavajavascript
    34,054View on GitHub↗