# getmaxun/maxun

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/getmaxun-maxun).**

15,049 stars · 1,219 forks · TypeScript · agpl-3.0

## Links

- GitHub: https://github.com/getmaxun/maxun
- Homepage: https://www.maxun.dev
- awesome-repositories: https://awesome-repositories.com/repository/getmaxun-maxun.md

## Topics

`agents` `api` `automation` `browser-automation` `crawler` `crawling` `data-extraction` `no-code` `nocode` `playwright` `robotic-process-automation` `rpa` `scraper` `self-hosted` `web-scraper` `web-scraping` `web-search` `webscraping`

## Description

Maxun is an open-source web scraping and automation platform designed to transform dynamic website content into structured data. By leveraging artificial intelligence to interpret natural language prompts, the system identifies page elements and extracts information without requiring manual selector configuration. It serves as a bridge between raw web content and intelligent workflows, providing structured outputs in formats optimized for large language model ingestion and agent-based applications.

The platform distinguishes itself through its ability to handle complex, authenticated, and dynamic web environments. It synchronizes local browser sessions to access password-protected content and employs proxy rotation and browser fingerprinting to bypass anti-scraping measures. Users can orchestrate multi-step browser interactions—such as clicking buttons and filling forms—to replicate human navigation, while the self-hosted infrastructure ensures full control over data pipelines and extraction robots.

Beyond core extraction, the platform supports a broad range of automation capabilities, including recurring task scheduling, web search integration, and visual content capture. It provides programmatic access through a command-line interface and a dedicated software development kit, allowing for seamless integration with external systems via webhooks. The platform also includes monitoring tools to track website changes and distill large volumes of information into actionable insights.

## Tags

### Web Development

- [Web Scraping and Automation](https://awesome-repositories.com/f/web-development/web-automation-scraping/web-scraping-automation.md) — Provides a platform for building and scheduling browser-based extraction workflows for AI agents. ([source](https://www.maxun.dev/blog/top-3-nocode-scrapers-2026))
- [AI-Powered Web Crawlers](https://awesome-repositories.com/f/web-development/web-automation-scraping/web-scraping-automation/web-scraping/ai-powered-web-crawlers.md) — Uses language models to interpret web pages and transform unstructured content into structured formats.
- [Browser Automation](https://awesome-repositories.com/f/web-development/web-automation-scraping/web-scraping-automation/browser-automation.md) — Records and replays browser actions like clicking buttons and filling forms to replicate manual navigation workflows. ([source](https://www.maxun.dev/pricing))
- [Dynamic Web Scrapers](https://awesome-repositories.com/f/web-development/dynamic-web-scrapers.md) — Handles complex client-side rendering and dynamic content loading to ensure consistent data extraction from modern web pages. ([source](https://www.maxun.dev/blog/youtube-trend-research))
- [Browser Task Orchestrators](https://awesome-repositories.com/f/web-development/browser-task-orchestrators.md) — Manages complex scraping workflows through a centralized control layer that schedules recurring tasks and coordinates browser interactions.
- [Browser Session Managers](https://awesome-repositories.com/f/web-development/web-automation-scraping/web-scraping-automation/browser-automation/browser-session-managers.md) — Connects local browser sessions to cloud platforms to enable authenticated data extraction. ([source](https://www.maxun.dev/blog/secure-extract))
- [Web APIs](https://awesome-repositories.com/f/web-development/web-apis.md) — Transforms dynamic website content into accessible data endpoints for consistent retrieval in automated pipelines. ([source](https://www.maxun.dev/about))

### Data & Databases

- [Structured Data Extraction](https://awesome-repositories.com/f/data-databases/structured-data-extraction.md) — Captures specific information from webpages and organizes it into structured formats for export or further processing. ([source](https://www.maxun.dev/pricing))
- [Web Data Extraction](https://awesome-repositories.com/f/data-databases/web-data-extraction.md) — Converts raw web pages into clean, structured data formats to simplify downstream processing and automated information collection. ([source](https://www.maxun.dev/products/scrape))
- [Data Pipeline Automation](https://awesome-repositories.com/f/data-databases/data-pipeline-automation.md) — Schedules recurring data extraction tasks and integrates retrieved information into external systems via webhooks. ([source](https://www.maxun.dev/talk-to-sales))
- [Data Extraction Pipelines](https://awesome-repositories.com/f/data-databases/data-extraction-pipelines.md) — Links extracted data to external applications through webhooks and command-line interfaces for seamless automation. ([source](https://www.maxun.dev/))
- [Web Crawlers](https://awesome-repositories.com/f/data-databases/web-crawlers.md) — Navigates entire websites to extract page contents and transform them into clean markdown or HTML for downstream processing. ([source](https://www.maxun.dev/))

### Development Tools & Productivity

- [Headless Browser Automation](https://awesome-repositories.com/f/development-tools-productivity/headless-browser-automation.md) — Executes real-world browser interactions using headless engines to render dynamic content and navigate complex web interfaces.
- [Browser Automation Orchestrators](https://awesome-repositories.com/f/development-tools-productivity/browser-automation-orchestrators.md) — Records and replays complex browser interactions to automate navigation across web applications.
- [Natural Language Interfaces](https://awesome-repositories.com/f/development-tools-productivity/natural-language-interfaces.md) — Uses natural language prompts to identify page elements and perform extraction tasks without manual selector configuration. ([source](https://www.maxun.dev/blog/ai-mode))
- [Python SDKs](https://awesome-repositories.com/f/development-tools-productivity/api-development-sdks/python-sdks.md) — Provides a dedicated software development kit to manage automated data extraction robots within custom Python codebases. ([source](https://www.maxun.dev/blog))
- [Automated Workflow Integration](https://awesome-repositories.com/f/development-tools-productivity/build-tooling/build-orchestration-logic/build-orchestration-configuration/build-automation-systems/workflow-orchestration/automated-workflow-integration.md) — Enables programmatic creation and management of custom scraping robots for integration into software workflows. ([source](https://www.maxun.dev/products/extract))
- [CLI Task Managers](https://awesome-repositories.com/f/development-tools-productivity/terminal-shell-cli/terminal-cli-enhancements/cli-task-managers.md) — Enables creation, execution, and monitoring of data extraction tasks directly from the terminal. ([source](https://www.maxun.dev/blog))
- [Task Scheduling](https://awesome-repositories.com/f/development-tools-productivity/task-scheduling.md) — Executes automated scraping tasks and browser workflows on a fixed timetable. ([source](https://www.maxun.dev/blog/python-sdk))

### DevOps & Infrastructure

- [Self-Hosted Infrastructure](https://awesome-repositories.com/f/devops-infrastructure/self-hosted-infrastructure.md) — Provides open-source infrastructure for hosting extraction robots on private servers. ([source](https://www.maxun.dev/about))

### Networking & Communication

- [Proxy and Fingerprint Rotation](https://awesome-repositories.com/f/networking-communication/proxy-rotation-services/proxy-and-fingerprint-rotation.md) — Distributes network traffic across multiple IP addresses and fingerprint profiles to prevent detection by anti-scraping systems.

### Security & Cryptography

- [Session-Based Authentication Proxies](https://awesome-repositories.com/f/security-cryptography/identity-access-management/authentication-strategies/session-and-credential-handling/session-credential-management/session-based-authentication-proxies.md) — Maintains access to protected web content by synchronizing local browser session state with remote extraction workers.
- [Browser Session Authentication](https://awesome-repositories.com/f/security-cryptography/identity-access-management/authentication-strategies/session-and-credential-handling/session-credential-management/browser-session-authentication.md) — Synchronizes local browser sessions to access password-protected content without storing credentials.

### Testing & Quality Assurance

- [Browser Automation Frameworks](https://awesome-repositories.com/f/testing-quality-assurance/software-testing/testing-frameworks/test-frameworks/browser-and-ui-testing/browser-automation-frameworks.md) — Provides a framework for recording and replaying navigation steps to capture data from web portals.
- [Screenshot Capture](https://awesome-repositories.com/f/testing-quality-assurance/automation-interaction-tools/screenshot-capture.md) — Generates screenshots of web pages, including full-page captures, for visual monitoring. ([source](https://www.maxun.dev/blog/python-sdk))

### Artificial Intelligence & ML

- [AI Agent Integrations](https://awesome-repositories.com/f/artificial-intelligence-ml/ai-agent-integrations.md) — Connects automated scraping pipelines directly to agent frameworks to supply structured data for intelligent workflows. ([source](https://www.maxun.dev/blog/integrations))
- [AI Agent Tool Integrations](https://awesome-repositories.com/f/artificial-intelligence-ml/ai-agent-integrations/ai-agent-tool-integrations.md) — Connects automated scraping pipelines directly to language model frameworks for intelligent processing.
- [AI Automation Workflows](https://awesome-repositories.com/f/artificial-intelligence-ml/ai-automation-workflows.md) — Integrates extraction outputs into automation platforms to enable research, data enrichment, and complex content processing. ([source](https://www.maxun.dev/blog/top-3-nocode-scrapers-2026))
- [AI Data Extraction](https://awesome-repositories.com/f/artificial-intelligence-ml/ai-data-extraction.md) — Transforms extracted web content into structured JSON, Markdown, or clean text optimized for large language model ingestion. ([source](https://www.maxun.dev/blog/top-3-nocode-scrapers-2026))
- [Web Search Tools](https://awesome-repositories.com/f/artificial-intelligence-ml/web-search-tools.md) — Executes search queries across the web and retrieves results as structured metadata for further processing. ([source](https://www.maxun.dev/blog/python-sdk))
- [Data Preparation](https://awesome-repositories.com/f/artificial-intelligence-ml/data-preparation.md) — Prepares and structures website data to facilitate the development of automated agents and data pipelines. ([source](https://www.maxun.dev/products/scrape))
- [Text Summarization](https://awesome-repositories.com/f/artificial-intelligence-ml/text-summarization.md) — Generates concise text summaries from crawled web pages to distill large volumes of information into actionable insights. ([source](https://www.maxun.dev/blog))
- [AI Model Configurations](https://awesome-repositories.com/f/artificial-intelligence-ml/ai-model-configurations.md) — Allows selection between local or cloud-based language models to balance data privacy and processing performance. ([source](https://www.maxun.dev/blog/ai-mode))

### User Interface & Experience

- [UI Element Selectors](https://awesome-repositories.com/f/user-interface-experience/ui-element-selectors.md) — Uses language models to interpret natural language prompts and identify target data elements on a page.

### Software Engineering & Architecture

- [Webhook Integrations](https://awesome-repositories.com/f/software-engineering-architecture/webhook-integrations.md) — Triggers external data pipelines and automated workflows by pushing extracted content to specified endpoints upon task completion.

### System Administration & Monitoring

- [Website Content Monitoring](https://awesome-repositories.com/f/system-administration-monitoring/website-content-monitoring.md) — Tracks specific website elements to detect content updates and trigger alerts. ([source](https://www.maxun.dev/pricing))
