1 repo
Managed or self-hosted environments for running web crawlers.
Explore 1 awesome GitHub repository matching devops & infrastructure · Crawling Infrastructure. Refine with filters or upvote what's useful.
Firecrawl is a web data extraction platform designed to convert unstructured web content into clean, LLM-ready formats like markdown or JSON. It functions as an autonomous web crawler and scraper, capable of mapping entire domains, performing recursive navigation, and executing complex data gathering tasks. By leveragi