# firecrawl/firecrawl-mcp-server

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/firecrawl-firecrawl-mcp-server).**

5,542 stars · 606 forks · JavaScript · mit

## Links

- GitHub: https://github.com/firecrawl/firecrawl-mcp-server
- Homepage: https://firecrawl.dev
- awesome-repositories: https://awesome-repositories.com/repository/firecrawl-firecrawl-mcp-server.md

## Topics

`batch-processing` `claude` `content-extraction` `data-collection` `firecrawl` `firecrawl-ai` `javascript-rendering` `llm-tools` `mcp` `mcp-server` `model-context-protocol` `search-api` `web-crawler` `web-scraping`

## Description

Firecrawl MCP Server is a Model Context Protocol tool server that exposes the full suite of Firecrawl’s web scraping, crawling, and automation capabilities as tools that large language models can invoke directly. It acts as a proxy to the Firecrawl cloud platform, which manages headless browser orchestration, async job queues, and rate limiting behind the scenes.

The server distinguishes itself by packaging autonomous web agents — both a research agent that browses and collects structured data from multiple pages, and a general web agent that performs multi-step browsing and extraction tasks — as callable MCP tools. It also provides LLM-guided structured extraction, allowing users to define a schema and have a language model parse unstructured web content into precise fields. Beyond scraping, the server supports live page interaction (clicking, typing, scrolling via natural language or code), web change monitoring with webhook notifications, and recursive crawling that discovers and indexes linked pages up to a configurable depth.

The broader capability surface includes single and batch URL scraping with output in markdown, HTML, JSON, or screenshot format, parsing of non-HTML documents such as PDFs and Office files, web search that returns structured results, and site link mapping to reveal page structure. All of these are registered as MCP tools, enabling any compatible language model client to orchestrate web data collection and automation tasks through a unified interface.

Setup requires installing the server (via npm or from source) and configuring it with a Firecrawl API key; the server then registers its tools with the MCP client, making each Firecrawl action available for use in prompts and agent workflows.

## Tags

### Artificial Intelligence & ML

- [MCP Servers](https://awesome-repositories.com/f/artificial-intelligence-ml/mcp-servers.md) — The server registers each Firecrawl capability as a Model Context Protocol tool, enabling language models to invoke web scraping and browsing actions directly.
- [Autonomous Web Agents](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/agent-orchestration-multi-agent/autonomous-agents/autonomous-web-agents.md) — Firecrawl's autonomous web agent that searches and extracts data from multiple web pages to fulfill a given request. ([source](https://docs.firecrawl.dev/))
- [Multi-Step Workflows](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/agent-orchestration-multi-agent/autonomous-agents/autonomous-web-agents/multi-step-workflows.md) — Firecrawl's multi-step agent workflow capability that orchestrates autonomous tasks on the web using credits per session. ([source](https://firecrawl.dev/pricing))
- [Autonomous Web Researchers](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/agent-orchestration-multi-agent/autonomous-agents/autonomous-web-researchers.md) — Running an AI agent that browses multiple web pages, searches, and collects data to fulfill a research request.
- [Autonomous Web Research Agents](https://awesome-repositories.com/f/artificial-intelligence-ml/autonomous-web-research-agents.md) — An AI-powered agent that searches, browses, and extracts structured data from multiple web pages to fulfill complex research requests.

### Part of an Awesome List

- [Web Crawling](https://awesome-repositories.com/f/awesome-lists/data/web-crawling.md) — Provides recursive website crawling that follows links to extract content from discovered pages up to a configurable depth. ([source](https://docs.firecrawl.dev/))

### Content Management & Publishing

- [Recursive Content Crawlers](https://awesome-repositories.com/f/content-management-publishing/recursive-content-crawlers.md) — Starting from a seed URL, the system automatically discovers and visits linked pages up to a configurable depth, building a site map.

### Data & Databases

- [Structured Data Extraction](https://awesome-repositories.com/f/data-databases/structured-data-extraction.md) — Uses a language model to extract specific fields like prices or names from web pages based on a schema. ([source](https://cdn.jsdelivr.net/gh/firecrawl/firecrawl-mcp-server@main/README.md))
- [LLM-to-Structured Data Converters](https://awesome-repositories.com/f/data-databases/structured-data-extraction/llm-to-structured-data-converters.md) — A language model parses web content according to a user-defined schema to extract precise fields from unstructured pages.

### Development Tools & Productivity

- [Web Scraping](https://awesome-repositories.com/f/development-tools-productivity/web-scraping.md) — Extracts content from a single URL as markdown, HTML, JSON, or screenshot, including JavaScript-rendered pages. ([source](https://docs.firecrawl.dev/))

### Networking & Communication

- [API Proxies](https://awesome-repositories.com/f/networking-communication/api-proxies.md) — Requests are forwarded to Firecrawl's API endpoints, which manage actual browser instances, crawl state, and rate limiting.

### Programming Languages & Runtimes

- [Asynchronous Extraction Job Management](https://awesome-repositories.com/f/programming-languages-runtimes/language-features-paradigms/concurrency-models/background-task-management/job-batching/asynchronous-extraction-job-management.md) — Long-running crawl and agent tasks are assigned session IDs, allowing clients to poll for completion and retrieve results later.

### Web Development

- [Recursive Crawlers](https://awesome-repositories.com/f/web-development/web-application-hosting/dynamic-content-delivery/sitemap-crawlers/recursive-crawlers.md) — Discovering site structure or crawling pages for bulk data extraction and producing a sitemap.
- [Website Crawlers and Scrapers](https://awesome-repositories.com/f/web-development/website-crawlers-and-scrapers.md) — A tool that recursively retrieves content from all pages of a website, handling navigation and converting pages to structured formats.
- [Headless Browser Orchestrators](https://awesome-repositories.com/f/web-development/web-automation-scraping/browser-orchestration-systems/headless-browser-orchestrators.md) — Firecrawl's backend spins up headless Chromium instances to render JavaScript-heavy pages and execute user interactions.
- [Programmatic Page Interactions](https://awesome-repositories.com/f/web-development/web-page-retrievers/programmatic-page-interactions.md) — Performs browser actions such as clicking, typing, and scrolling on live web pages using natural language or code. ([source](https://firecrawl.dev/pricing))

### System Administration & Monitoring

- [Web Change Monitors](https://awesome-repositories.com/f/system-administration-monitoring/web-change-monitors.md) — A service that periodically checks web pages for changes and triggers notifications or webhooks when updates are detected. ([source](https://cdn.jsdelivr.net/gh/firecrawl/firecrawl-mcp-server@main/README.md))
