Pipet

Open-source alternatives to Pipet

Similar open-source projects, ranked by how many features they share with Pipet.

psf/requests-html
psf/requests-html
13,826View on GitHub
requests-html is a Python HTML parsing library and web scraping framework. It functions as an asynchronous HTTP client and a JavaScript rendering engine designed to fetch and parse web pages for structured data extraction. The project integrates a headless browser to execute JavaScript, allowing it to retrieve dynamically generated content that standard HTML parsers cannot see. It provides tools for automated data extraction using CSS selectors and XPath expressions to isolate specific text or attributes from HTML structures. The framework covers network operations including asynchronous pag
Pythonbeautifulsoupcss-selectorshtml
View on GitHub13,826
snehasishroy/leetcode-companywise-interview-questions
snehasishroy/leetcode-companywise-interview-questions
2,656View on GitHub
Javaamazon-interviewapple-interviewfacebook-interview
View on GitHub2,656
code4craft/webmagic
code4craft/webmagic
11,680View on GitHub
Webmagic is a Java web crawling framework designed for building scalable automated crawlers to download and process large volumes of web pages. It functions as a distributed web crawler and dynamic content crawler, utilizing an XPath HTML parser to locate and extract specific data points from page structures. The framework distinguishes itself through its ability to handle dynamic content by rendering JavaScript and executing asynchronous requests to extract data from non-static pages. It also allows users to define and execute crawler logic via scripting languages, enabling the update of col
Javacrawlerframeworkjava
View on GitHub11,680
apify/crawlee-python
apify/crawlee-python
8,097View on GitHub
Crawlee-python is a web crawling framework for building scalable scrapers using Python. It serves as a comprehensive tool for web scraping automation, providing a system to extract structured data from websites using both lightweight HTTP requests and headless browser automation. The framework is distinguished by its anti-bot evasion capabilities, which include browser fingerprint impersonation and tiered proxy rotation to bypass detection systems and solve challenges such as Cloudflare. It also incorporates artificial intelligence for autonomous website navigation and schema-based data extra
Pythonapifyautomationbeautifulsoup
View on GitHub8,097

See all 30 alternatives to Pipet

bjesuspipet

Features

Open-source alternatives to Pipet

psf/requests-html

snehasishroy/leetcode-companywise-interview-questions

code4craft/webmagic

apify/crawlee-python

Star history

Open-source alternatives to Pipet

psf/requests-html

snehasishroy/leetcode-companywise-interview-questions

code4craft/webmagic

apify/crawlee-python