2 repos
Specialized frameworks for the initial acquisition and structured parsing of raw data, focusing on plugin orchestration and state management rather than general transformation.
Explore 2 awesome GitHub repositories matching data & databases · Extraction and Ingestion Workflows. Refine with filters or upvote what's useful.
Scrapy is a comprehensive framework designed for automated web data extraction and large-scale crawling. It operates on an asynchronous, event-driven engine that manages non-blocking network requests and data processing tasks, allowing for the efficient retrieval of structured information from web documents using path-
Faceswap is a comprehensive framework for automated media manipulation and neural face synthesis. It provides a modular pipeline that manages the entire lifecycle of facial feature extraction, deep learning model training, and image conversion. By coordinating complex computer vision workflows, the system enables users