This project is a distributed scraping engine designed to extract business details, customer reviews, and lead information from Google Maps. It functions as a business scraper and data extractor that can be deployed as a permanent system or as on-demand serverless functions.
The system utilizes a proxy-routed web crawler to manage request origins via SOCKS5, HTTP, and HTTPS proxies. To locate contact information, it includes an email extraction tool that recursively crawls business websites linked within map listings.
The software supports coordinate-based radius searches for efficient data retrieval and synchronizes scraping jobs across multiple machines or pods using a shared database. Extracted data can be saved through a plugin-based system into CSV, JSON, or other external database formats.