This project is an automation framework that connects large language models to web browsers via the Chrome DevTools Protocol for autonomous task execution. It functions as a bridge between intelligent agents and browser engines, allowing for the direct control of browser sessions and profiles.
The framework features a self-healing agent capable of generating and executing custom scripts during runtime to resolve failures and optimize browser tasks. It supports stealthy deployment through the use of integrated proxies and captcha solvers to bypass bot detection and security mitigations.
The system covers a wide range of capabilities including cloud-based headless browser management, remote session orchestration via authentication keys, and page state inspection through combined visual and DOM analysis. It also provides mechanisms for integrating site-specific logic and pre-defined skills from local workspaces to increase task reliability.