1 repo
Libraries for building custom logic to transform web content into structured data.
Distinguishing note: Focuses on development of personalized extraction logic.
Explore 1 awesome GitHub repository matching content management & publishing · Data Extraction Frameworks. Refine with filters or upvote what's useful.
RSSHub is a headless, server-side engine designed to generate standardized RSS and Atom feeds from websites that do not natively provide them. By acting as an extensible data aggregator, it enables the automated collection of web content, allowing users to monitor updates from disparate sources through centralized feed readers or workflow automation tools. The platform distinguishes itself through a route-based data extraction framework that maps specific URL patterns to custom scraping logic. This modular architecture is supported by a middleware-driven request pipeline and declarative route
Builds and hosts personalized data extraction logic to transform web content into structured formats.