27 repositorios
Mechanisms for maintaining and reusing browser sessions to bypass security challenges and detection.
Distinguishing note: Focuses on session persistence for automation, distinct from general session management.
Explore 27 awesome GitHub repositories matching web development · Browser Session Persistence. Refine with filters or upvote what's useful.
gstack is an AI agent framework and development workflow system designed to automate the software development lifecycle. It coordinates specialized AI personas to manage tasks across product design, engineering management, and quality assurance, transforming product intent into technical specifications and final releases. The project is distinguished by its deep integration of headless browser automation and semantic code memory. It utilizes a persistent Chromium daemon for web scraping and visual auditing, and implements a searchable knowledge base that logs architectural decisions and repos
Implements a long-lived headless Chromium daemon to preserve cookies and login state across separate tool invocations.
MediaCrawler is an automated web scraping framework designed to extract public posts, comments, and creator metadata from various social media platforms. It functions as a headless browser automator, utilizing real browser instances to render dynamic content and execute the client-side scripts necessary for interacting with modern web interfaces. The system distinguishes itself through a focus on session persistence and network flexibility. It supports remote debugging to reuse active browser sessions and cookies, which helps minimize the risk of triggering platform security challenges. To ma
Maintains persistent login states to minimize detection and avoid repetitive security challenges.
This project is an agentic framework designed to enable autonomous web navigation and browser automation. It functions as a controller that translates natural language instructions into deterministic browser actions, allowing agents to interact with websites, perform data extraction, and manage complex authentication flows. By leveraging accessibility trees and semantic element resolution, the framework mimics human-like navigation, moving beyond brittle DOM selectors to interact reliably with modern web interfaces. The framework distinguishes itself through its focus on secure, scalable exec
Saves and restores cookies and local storage data to maintain authentication status between agent runs.
CloakBrowser is a modified Chromium browser and anti-detect profile manager designed to bypass bot detection and automation signals. It serves as a replacement for standard browser engines in automation frameworks, using source-level binary patching to remove automation flags and spoof internal hardware signals. The project focuses on identity masking through the generation and maintenance of unique hardware and software fingerprints, including canvas noise and screen dimensions. It synchronizes browser timezones and locales with network proxy exit points and replaces programmatic input with
Maintains browser sessions and authentication state to bypass incognito detection and security challenges.
Crawlee is a web scraping framework designed for building scalable, reliable, and distributed data extraction pipelines. It provides a unified interface for managing headless browser automation and lightweight HTTP requests, allowing developers to handle complex web navigation, dynamic content rendering, and large-scale data collection within a single, modular architecture. The project distinguishes itself through its resource-aware concurrency controller, which dynamically scales task execution based on real-time CPU and memory usage to prevent host machine exhaustion. It also features a rob
Stores cookies, local storage, and cache to maintain user state across multiple scraping sessions.
Automa is a browser-based automation platform that enables users to build, schedule, and execute repetitive web tasks through a visual, no-code interface. By operating as a browser extension, it provides a canvas-based environment where users construct workflows by connecting functional blocks to interact with web elements, manage browser state, and process data. The platform distinguishes itself through its deep integration with the browser environment, allowing for complex orchestration such as event-driven triggers, cross-origin request handling, and the ability to package workflows as sta
Maintains variable state in browser storage to ensure data availability across multiple workflow executions.
Stagehand is an AI-native browser automation framework that enables developers to build reliable web automations using a hybrid of natural language instructions and deterministic TypeScript code.
Maintains cookies, local storage, and authentication state across sessions to ensure agents retain context between tasks.
This project is a comprehensive framework for building and managing autonomous agent systems. It provides a unified architecture for orchestrating multi-agent societies, where specialized agents collaborate through roleplay to decompose and solve complex tasks. The system integrates language models with external environments, enabling agents to perform real-world actions through a standardized tool-calling abstraction layer. The framework distinguishes itself through its focus on iterative reasoning and data reliability. It employs automated feedback loops to refine agent outputs and self-eva
Preserves login states and session data across multiple runs to maintain browser-based automation context.
This project is a graphical user interface for controlling, configuring, and monitoring AI agents that automate web browser interactions. It provides a visual dashboard to execute autonomous web tasks and manage the behavior of browser-based agents without requiring raw code for every operation. The system includes a browser profile manager to link agents to local executables and user data directories, which allows for persistent authenticated sessions. To support remote observation, it features a VNC streamer that provides a real-time visual feed of headless browser agents operating within a
Maintains browser sessions by linking agents to specific user data directories for persistent authentication.
Playwright for Python is a browser automation framework designed for end-to-end testing, web scraping, and user interaction simulation. It functions as a headless browser controller that enables programmatic navigation, data extraction, and the execution of complex workflows across multiple rendering engines. The framework distinguishes itself through an actionability-aware interaction engine that automatically verifies element readiness before performing actions, significantly reducing test flakiness. It utilizes isolated browser contexts to maintain separate storage and cookies for parallel
Persists and restores cookies, local storage, and tokens across browser contexts to bypass repetitive login steps.
FlareSolverr is a proxy server designed to provide programmatic access to websites protected by automated security challenges and firewall restrictions. It functions by orchestrating headless browser instances to render web pages, execute JavaScript, and retrieve the necessary cookies and content required to bypass common security hurdles. The service distinguishes itself by maintaining persistent browser sessions in memory, which allows for the reuse of authenticated states across multiple requests. It integrates with external captcha resolution services to handle interactive security challe
Retains active browser instances to preserve cookies and accelerate subsequent requests by avoiding repeated security challenges.
Undetected-chromedriver is a framework for automated browser navigation designed to bypass anti-bot security measures. It functions by patching browser drivers at the binary level to obscure automation signals, allowing scripts to interact with protected websites without being flagged or blocked by security services. The project distinguishes itself through its ability to maintain stealth during automated sessions, including those executed in headless mode. It achieves this by injecting custom configurations to mimic human user behavior and by hooking into low-level browser debugging protocol
Maintains stealthy browser sessions by patching drivers to bypass anti-bot detection and security blocks during automated navigation.
DrissionPage is a Python library designed for web automation, data scraping, and testing. It functions as a browser automation framework that communicates directly with the browser engine via the Chrome DevTools Protocol, allowing for precise control over browser instances and page states. The library distinguishes itself by providing a unified interface that combines full browser automation with raw HTTP request capabilities. This hybrid approach allows users to switch between lightweight network requests and heavy browser-based interactions within a single workflow. By wrapping asynchronous
Maintains browser state and sessions across automation runs using local user data directories.
Maestro is a declarative mobile and web UI automation framework designed for end-to-end testing. It operates by querying the native accessibility tree of an application, allowing for black-box testing without requiring source code instrumentation or platform-specific dependencies. The framework distinguishes itself through a unified command syntax that abstracts interactions across Android, iOS, and web environments. It features a dynamic synchronization engine that automatically pauses test execution to account for non-deterministic animations and network-dependent content loading, ensuring
Maintains and clears browser session data like cookies and local storage to ensure consistent test states.
Crawlee-python is a web crawling framework for building scalable scrapers using Python. It serves as a comprehensive tool for web scraping automation, providing a system to extract structured data from websites using both lightweight HTTP requests and headless browser automation. The framework is distinguished by its anti-bot evasion capabilities, which include browser fingerprint impersonation and tiered proxy rotation to bypass detection systems and solve challenges such as Cloudflare. It also incorporates artificial intelligence for autonomous website navigation and schema-based data extra
Rotates and persists user-like browser sessions to bypass bot detection and security challenges.
Supercookie is a browser fingerprinting tool that assigns persistent, cross-session tracking identifiers by exploiting the browser's favicon cache. It operates without storing any data on the server, relying entirely on the browser's cache behavior to maintain tracking state across visits, including during incognito or private browsing sessions. The technique works by encoding unique user identifiers through distinct favicon subpaths, creating separate cache entries for each tracking token. A sequence of HTTP redirects through multiple favicon subpaths writes and reads cache-based tracking pa
Assigns unique browser identifiers by exploiting the favicon cache for cross-session tracking.
pydoll is a Chrome DevTools Protocol automation library and headless browser controller used for web data extraction and parallel browser automation. It controls Chromium-based browsers via direct WebSocket connections, allowing it to manage isolated browser contexts and tabs while bypassing the overhead and detection associated with WebDriver. The project features an anti-bot evasion framework that mimics natural human behavior, including mouse movements generated via Bezier curves and variable typing patterns. It provides specialized stealth capabilities to bypass behavioral analysis and au
Implements mechanisms to maintain and reuse browser sessions for automation and bypassing security challenges.
Steel is a cloud browser automation platform that provides a REST API for launching and controlling remote Chrome browser sessions. It enables programmatic browsing and web scraping using standard automation tools like Puppeteer, Playwright, and Selenium, connecting to cloud-hosted browser instances via WebSocket and the Chrome DevTools Protocol. The platform supports both headless and headful browser sessions, with language-specific SDKs for TypeScript and Python. The service distinguishes itself through comprehensive anti-detection capabilities, including residential proxy rotation, CAPTCHA
Keeps browser sessions alive across multiple tasks to avoid repeated connection handshake overhead.
LaVague is an LLM web agent framework and large action model designed to translate natural language instructions into executable browser automation scripts. It functions as a multi-modal orchestrator that reasons over web page states and HTML content to automate multi-step tasks via a Selenium-based automation engine. The framework features a modular model provider layer, allowing users to swap between different language and vision models from providers such as Anthropic, Gemini, and Azure OpenAI. It employs a multi-modal world model to process screenshots and HTML structures, utilizing retri
Supports loading existing browser profiles and cookies to maintain authenticated sessions and bypass bot detection.
Komga is a self-hosted digital comics and eBook server that organizes, serves, and streams CBZ, CBR, PDF, and EPUB files through a web interface. Built as a Java Spring Boot application with SQLite-based metadata storage, it provides multi-user access controls with individual reading progress tracking and supports the OPDS protocol for compatibility with third-party reader applications. The server distinguishes itself through comprehensive authentication options, including OAuth2 and OpenID Connect support with automatic account creation and email verification, alongside traditional cookie an
Defines how long an inactive user session remains valid before expiring, defaulting to 30 minutes.