1 repo
Automated workflows designed to extract, transform, and load data specifically from web-based sources and internet services.
Explore 1 awesome GitHub repository matching data & databases · Web Data Pipelines. Refine with filters or upvote what's useful.
Firecrawl is a web data extraction platform designed to convert unstructured web content into clean, LLM-ready formats like markdown or JSON. It functions as an autonomous web crawler and scraper, capable of mapping entire domains, performing recursive navigation, and executing complex data gathering tasks. By leveragi