pydoll is a Chrome DevTools Protocol automation library and headless browser controller used for web data extraction and parallel browser automation. It controls Chromium-based browsers via direct WebSocket connections, allowing it to manage isolated browser contexts and tabs while bypassing the overhead and detection associated with WebDriver.
The project features an anti-bot evasion framework that mimics natural human behavior, including mouse movements generated via Bezier curves and variable typing patterns. It provides specialized stealth capabilities to bypass behavioral analysis and automate interactions with CAPTCHA challenges.
The library covers a broad range of capabilities, including network traffic interception for mocking server responses, comprehensive DOM manipulation and shadow DOM traversal, and structured data mapping for extracting content from dynamic pages. It also includes tools for browser fingerprint spoofing, identity synchronization, and the capture of page screenshots, PDFs, and screencasts.