What are the best open-source alternatives to Page Agent?

30 open-source projects similar to alibaba/page-agent, ranked by shared features. Top picks: steel-dev/steel-browser, hkuds/autoagent, lavague-ai/lavague, viarotel-org/escrcpy, google-gemini/computer-use-preview, executeautomation/mcp-playwright, garrytan/gstack, m1heng/clawdbot-feishu, kilo-org/kilocode, mastra-ai/mastra.

Is steel-dev/steel-browser a good alternative to Page Agent?

Steel is a cloud browser automation platform that provides a REST API for launching and controlling remote Chrome browser sessions. It enables programmatic browsing and web scraping using standard automation tools like Puppeteer, Playwright, and Selenium, connecting to cloud-hosted browser instance…

Is hkuds/autoagent a good alternative to Page Agent?

AutoAgent is a multi-agent orchestrator and natural language workflow builder designed to connect multiple large language models with external API tools. It provides a framework for designing multi-step agent interactions and reasoning processes using plain text instead of manual code. The platfor…

Is lavague-ai/lavague a good alternative to Page Agent?

LaVague is an LLM web agent framework and large action model designed to translate natural language instructions into executable browser automation scripts. It functions as a multi-modal orchestrator that reasons over web page states and HTML content to automate multi-step tasks via a Selenium-base…

Is viarotel-org/escrcpy a good alternative to Page Agent?

escrcpy is an Android device mirroring tool and ADB device manager that enables the display and control of Android screens on a computer via USB or network connections. It functions as a multi-device screen orchestrator, providing a visual interface to arrange and control several mirrored device wi…

Is google-gemini/computer-use-preview a good alternative to Page Agent?

This project is a browser automation system that connects Google's Gemini API to a web browser, enabling an AI agent to perform tasks on a user's behalf by interpreting natural language instructions. At its core, it operates through a continuous screenshot-based action loop, where the agent capture…

Is executeautomation/mcp-playwright a good alternative to Page Agent?

This project is a Model Context Protocol server that enables Large Language Models to control Playwright browsers for web automation, scraping, and end-to-end testing. It functions as a programmable interface for executing JavaScript, capturing screenshots, and interacting with web elements across…

Is garrytan/gstack a good alternative to Page Agent?

gstack is an AI agent framework and development workflow system designed to automate the software development lifecycle. It coordinates specialized AI personas to manage tasks across product design, engineering management, and quality assurance, transforming product intent into technical specificat…

Is m1heng/clawdbot-feishu a good alternative to Page Agent?

This project is a framework for integrating Large Language Models into the Feishu messaging platform to create automated assistants. It functions as a self-hosted AI assistant and a chatbot gateway that routes messages between chat platforms and remote AI cloud providers. The system features a mul…

Is kilo-org/kilocode a good alternative to Page Agent?

Kilocode is an autonomous engineering platform designed to orchestrate AI agents for complex software development tasks. It functions as a comprehensive system for automating coding, testing, and repository management by integrating directly with your codebase and terminal. The platform provides a…

Is mastra-ai/mastra a good alternative to Page Agent?

Mastra is an orchestration framework designed for building, deploying, and managing autonomous AI agents and multi-agent systems. It provides a comprehensive suite of primitives for creating resilient AI applications, including durable workflow orchestration, event-driven agent loops, and semantic…

Back to alibaba/page-agent

Open-source alternatives to Page Agent

30 open-source projects similar to alibaba/page-agent, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Page Agent alternative.

steel-dev/steel-browser
steel-dev/steel-browser
6,450View on GitHub
Steel is a cloud browser automation platform that provides a REST API for launching and controlling remote Chrome browser sessions. It enables programmatic browsing and web scraping using standard automation tools like Puppeteer, Playwright, and Selenium, connecting to cloud-hosted browser instances via WebSocket and the Chrome DevTools Protocol. The platform supports both headless and headful browser sessions, with language-specific SDKs for TypeScript and Python. The service distinguishes itself through comprehensive anti-detection capabilities, including residential proxy rotation, CAPTCHA
TypeScriptaiai-agentsai-tools
View on GitHub6,450
hkuds/autoagent
HKUDS/AutoAgent
8,583View on GitHub
AutoAgent is a multi-agent orchestrator and natural language workflow builder designed to connect multiple large language models with external API tools. It provides a framework for designing multi-step agent interactions and reasoning processes using plain text instead of manual code. The platform functions as a tool integration gateway, linking agents to third-party platforms and authenticated browser sessions. It enables the execution of complex analytical tasks and deep research by distributing work across collaborative agent frameworks and importing browser cookies to access restricted w
Pythonagentllms
View on GitHub8,583
lavague-ai/lavague
lavague-ai/LaVague
6,374View on GitHub
LaVague is an LLM web agent framework and large action model designed to translate natural language instructions into executable browser automation scripts. It functions as a multi-modal orchestrator that reasons over web page states and HTML content to automate multi-step tasks via a Selenium-based automation engine. The framework features a modular model provider layer, allowing users to swap between different language and vision models from providers such as Anthropic, Gemini, and Azure OpenAI. It employs a multi-modal world model to process screenshots and HTML structures, utilizing retri
Pythonaibrowserlarge-action-model
View on GitHub6,374

Open-source alternatives to Page Agent

steel-dev/steel-browser

HKUDS/AutoAgent

lavague-ai/LaVague

viarotel-org/escrcpy

google-gemini/computer-use-preview

executeautomation/mcp-playwright

garrytan/gstack

m1heng/clawdbot-feishu

Kilo-Org/kilocode

mastra-ai/mastra

browserbase/stagehand

browser-use/browser-use

droidrun/droidrun

GoogleCloudPlatform/kubectl-ai

chromedp/chromedp

santinic/how2

lightpanda-io/browser

vercel-labs/agent-browser

sinaptik-ai/pandas-ai

google-gemini/cookbook

microsoft/UFO

bytebot-ai/bytebot

browser-use/browser-harness

business-science/ai-data-science-team

mendableai/firecrawl

ntegrals/openbrowser

BrowserMCP/mcp

jasperproject/jasper-client

h4ckf0r0day/obscura

microsoft/magentic-ui