php-webdriver is a WebDriver PHP client and browser automation framework that implements the W3C WebDriver standard. It serves as a programmatic interface for controlling web browsers, executing JavaScript, and managing browser sessions in both headed and headless environments. The library functions as a Selenium protocol implementation, allowing PHP applications to communicate with browser drivers such as ChromeDriver or GeckoDriver. It provides the ability to automate user actions, navigate pages, and validate DOM elements for web UI testing. Its capabilities cover broad areas of browser i
This project is a high-level Python library and wrapper for Selenium designed for web browser automation and functional testing. It provides a simplified interface for controlling browsers to execute automated workflows and end-to-end tests across Chrome and Firefox. The library distinguishes itself by replacing technical CSS selectors and identifiers with label-based element discovery, allowing elements to be located via visible text. It further simplifies browser control by automating window management through page titles and handling nested frame interactions without requiring manual conte
LaVague is an LLM web agent framework and large action model designed to translate natural language instructions into executable browser automation scripts. It functions as a multi-modal orchestrator that reasons over web page states and HTML content to automate multi-step tasks via a Selenium-based automation engine. The framework features a modular model provider layer, allowing users to swap between different language and vision models from providers such as Anthropic, Gemini, and Azure OpenAI. It employs a multi-modal world model to process screenshots and HTML structures, utilizing retri
Open Interpreter is a local language model agent framework that enables the deployment of autonomous agents capable of controlling a local operating system and its applications. It provides an execution environment where language models can run code and scripts directly on a computer to automate system tasks. The framework includes a computer control interface that allows language models to interact with web browsers and native user interfaces through programmatic commands. To ensure system stability, it utilizes a secure sandbox environment for the execution of model-generated code. The sys