30 open-source projects similar to googlechrome/rendertron, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Rendertron alternative.
This project is a distributed headless Chrome web crawler and data extraction framework. It functions as a JavaScript rendering engine that uses a headless browser to process dynamic pages, extracting structured data from websites that require JavaScript execution. The system is designed for scalable data collection across multiple nodes, using distributed task synchronization and shared caches to prevent duplicate work. It distinguishes itself through the ability to emulate specific client environments by configuring user agents and viewport dimensions, while capturing visual evidence such a
Splash is a headless browser HTTP API and JavaScript rendering engine designed to convert dynamic web content into static HTML or images. It functions as a Lua-scriptable browser service that exposes browser automation and rendering capabilities through a RESTful interface for programmatic data extraction. The service distinguishes itself by allowing the execution of custom Lua scripts to automate complex user interaction sequences and page navigation. It provides the ability to switch rendering engines on a per-request basis to verify cross-browser compatibility and visual consistency. The
Puppeteer Sharp is a web browser automation library and a headless Chrome .NET API. It provides a type-safe C# interface for controlling headless browsers, functioning as a Chrome DevTools Protocol wrapper that translates .NET method calls into JSON-RPC messages. The project enables programmatic navigation of pages, interaction with elements, and the execution of JavaScript within a .NET environment. It serves as an end-to-end testing framework for simulating user workflows and verifying web application behavior. Additional capabilities include automated screenshot generation for visual regr
ReLaXed is a document workflow and compilation system that transforms Markdown, JSON data, and HTML/CSS into print-ready PDF files. It utilizes a headless browser engine to render web technologies into final documents, serving as a build system for technical documentation and dynamic reports. The project features a pipeline that automates asset generation, allowing scripts to produce images, charts, and tables from raw data files during the build process. It supports the integration of LaTeX formulas for mathematical equations and converts CSV files into formatted HTML tables. The system inc
Crawlee is a web scraping framework designed for building scalable, reliable, and distributed data extraction pipelines. It provides a unified interface for managing headless browser automation and lightweight HTTP requests, allowing developers to handle complex web navigation, dynamic content rendering, and large-scale data collection within a single, modular architecture. The project distinguishes itself through its resource-aware concurrency controller, which dynamically scales task execution based on real-time CPU and memory usage to prevent host machine exhaustion. It also features a rob
wkhtmltopdf is a command-line utility that renders web pages into PDF documents or image files. It functions as a headless browser engine, utilizing the Qt WebKit rendering environment to process HTML, CSS, and JavaScript into visual representations suitable for server-side tasks. The tool distinguishes itself by translating standard web styling rules into physical page dimensions and layout constraints, allowing for the creation of structured documents from web-based source files. It supports the generation of automated tables of contents and provides granular control over document layout, i
carbon-now-cli is a command line tool that renders source code into high-quality images. It serves as a terminal-based renderer and image exporter, converting raw text files or input streams into styled visual assets. The tool focuses on precise content control, allowing users to specify exact line ranges for targeted code extraction. It supports multi-format export options, enabling images to be saved as files or sent directly to the system clipboard. Visual customization is managed through a system of styling presets. Users can define and save custom colors, fonts, and resolutions via loca
Pholcus is a distributed web crawling system designed for large-scale data scraping. It employs a master-worker distribution model to coordinate high-concurrency scraping tasks across a network of remote client nodes, enabling both horizontal and vertical data collection. The system features a hot-loadable rule engine that allows extraction and navigation logic to be updated at runtime without restarting the process. It handles dynamic content through headless browser integration and bypasses bot detection using proxy rotation, automated user authentication, and simulated human behavior. The
requests-html is a Python HTML parsing library and web scraping framework. It functions as an asynchronous HTTP client and a JavaScript rendering engine designed to fetch and parse web pages for structured data extraction. The project integrates a headless browser to execute JavaScript, allowing it to retrieve dynamically generated content that standard HTML parsers cannot see. It provides tools for automated data extraction using CSS selectors and XPath expressions to isolate specific text or attributes from HTML structures. The framework covers network operations including asynchronous pag
PySpider is a Python web crawling framework designed for automated data extraction. It provides a pipeline for periodically fetching web content, processing HTML, and persisting scraped information into database backends. The system features a web-based management interface for editing scraping scripts, monitoring task progress, and reviewing collected data. It includes a headless browser JavaScript renderer to capture rendered HTML from dynamic web pages and a distributed architecture that uses message queues to scale crawling workloads across multiple nodes. The framework also covers task
Spider is a web-based platform designed for automated data extraction, providing a centralized framework to collect, process, and route structured information from websites. It functions as a comprehensive pipeline that manages the entire lifecycle of data gathering, from initial configuration to final storage in external databases or message queues. The platform distinguishes itself through a visual configuration interface that allows users to define extraction rules and manage scraping templates without writing custom code. It supports both static and dynamic content retrieval by integratin
Crawlee-python is a web crawling framework for building scalable scrapers using Python. It serves as a comprehensive tool for web scraping automation, providing a system to extract structured data from websites using both lightweight HTTP requests and headless browser automation. The framework is distinguished by its anti-bot evasion capabilities, which include browser fingerprint impersonation and tiered proxy rotation to bypass detection systems and solve challenges such as Cloudflare. It also incorporates artificial intelligence for autonomous website navigation and schema-based data extra
LapisCV is a PDF document generator and resume builder designed to convert structured Markdown text into professional curriculum vitae. It functions as a rendering pipeline that transforms simple markup and variables into print-ready documents using a headless browser engine or LaTeX templates. The project provides a collection of professional themes and visual styles to customize the typography and branding of academic and professional resumes. It utilizes variable-based style injection to allow for the adjustment of fonts, colors, and margins based on content volume and user preferences. T
Mermaid-cli is a command line interface and programmatic tool that converts text-based diagram syntax into static visual assets. It functions as a headless browser renderer and markdown diagram processor, enabling the generation of diagrams through a CLI or a JavaScript-based generator. The tool distinguishes itself by supporting the conversion of diagram definitions into multiple image formats, including SVG, PNG, and PDF. It allows for custom diagram styling by applying external CSS files to override default themes and add visual animations. The project provides capabilities for markdown i
Browsh is a text-based web browser and headless browser frontend that renders modern websites and web applications within a terminal emulator. It functions as a TTY web browser, allowing users to view and interact with complex web content directly from a command line interface. The project enables web navigation in environments where a graphical user interface is unavailable, such as when accessing a remote server via SSH or operating in low-bandwidth conditions. It translates browser pixels and colors into ANSI escape codes to simulate a graphical interface using text characters. The system
next-learn is a collection of educational resources and reference implementations for building full-stack web applications. It serves as a learning resource and tutorial for the Next.js framework, providing starter code and example projects that demonstrate server-side rendering and the React-based ecosystem. The project provides a full-stack web template that showcases a complete implementation of database integration, user authentication, and server-side logic. It includes reference examples for web performance optimization, specifically demonstrating the use of server components, server ac
luban-h5 is a low-code web framework and visual mobile page builder used to design interactive mobile web pages. It functions as a JSON-based page renderer that converts design configurations into live pages for end-user delivery and includes a visual form builder for creating interactive input fields and tracking submission statistics. The project features a PSD to HTML converter that parses Photoshop files into functional HTML pages to automate the design-to-code workflow. It distinguishes itself through a plugin-based system that allows for the development of custom UI components and speci
Waku is a minimal React framework built around React Server Components. It renders components exclusively on the server for data fetching and server-side logic, then hydrates client components in the browser for interactivity. The framework supports hybrid rendering, mixing static prerendering at build time with dynamic per-request rendering within a single application, and allows independently loading slice components that can be static or dynamic. The framework distinguishes itself through its file-based routing system that defines pages, layouts, and API endpoints by placing files in a dir
Surge is a network traffic routing and configuration system designed to categorize and steer network requests using curated rule sets, domain lists, and IP databases. It provides the data structures and filtering logic necessary to manage how application traffic is directed across proxies, direct connections, or regional gateways. The project distinguishes itself through granular traffic identification and steering. It identifies network requests by matching them against system process names, user-agent strings, and geographic IP ranges to apply specific routing policies. This allows for the
unioffice is a comprehensive document processing suite that provides a PDF document processor, an Open XML document library, a document security toolkit, and a document content extractor. It is designed to programmatically create, read, and modify Word, Excel, and PowerPoint files, as well as generate and edit PDF documents. The project is distinguished by its native language implementation of the Open XML standard, which removes native binary dependencies to simplify container deployments. It features advanced capabilities for digital document security, including hardware-based PDF signing,
likec4 is an architecture-as-code framework that transforms text-based architecture definitions into interactive diagrams, static websites, and image files. It serves as a system architecture visualizer and C4 model diagram generator, allowing users to define software components, boundaries, and deployment infrastructure using a domain-specific language. The tool distinguishes itself by providing a modeling environment with Language Server Protocol integration for real-time validation and autocomplete. It enables interactive architecture documentation where users can navigate through hierarch
miniblink49 is a stripped-down Blink browser kernel and embedded web UI framework used to integrate HTML user interfaces into native C++ applications. It functions as a C++ JavaScript interop bridge that maps high-level script calls to low-level native logic and provides a resource-optimized headless browser engine for processing web content without a graphical interface. The project enables custom browser environment control by bypassing cross-origin security policies to allow requests across different domains. It further allows for the interception of network resources to replace remote ass
Statamic is a content management system built on Laravel that utilizes a flat-file storage approach instead of a traditional relational database. It functions as a customizable administrative framework for managing site content and developing custom backend addons. The system stores website data in files and integrates with Git for versioning, allowing content changes to be tracked and rolled back through a repository. It employs a templating engine to transform structured data into dynamic HTML layouts for front-end rendering. The platform includes a control panel built as a single-page app
This project is a lightweight PHP framework designed for building web applications using the model-view-controller architectural pattern. It provides a structured environment that separates business logic, data management, and user interface presentation into distinct layers to improve code maintainability. The framework utilizes a front-controller to intercept all incoming web traffic, routing requests to specific controller actions based on defined URL patterns. It incorporates middleware-style request interception, allowing for the execution of custom logic such as authentication, security
EyeWitness is a web infrastructure mapper and reconnaissance tool designed to automate the visual mapping of exposed web services. It functions as a headless browser screenshotter and HTTP reconnaissance utility that captures visual evidence and extracts server headers from lists of web targets. The system identifies server technologies and audits for common default administrative credentials to map an organization's external attack surface. It generates searchable HTML security reports that combine screenshots, page source code, and categorized analysis results for vulnerability assessment.
This project is a Markdown static site generator and documentation server that converts files and folder structures into responsive websites. It functions as a Confluence documentation publisher, enabling the direct upload and synchronization of processed Markdown content to enterprise knowledge base servers. The system distinguishes itself through its technical rendering capabilities, transforming Mermaid syntax into visual diagrams and TeX expressions into mathematical formulas. It supports multilingual documentation by creating localized versions of a site with dedicated navigation structu
EyeWitness is a web reconnaissance framework used to identify infrastructure and software versions across multiple websites through automated headless browser scans. It functions as an HTTP website screenshotter and a security reporting tool that captures visual snapshots and server headers from a list of web targets. The system distinguishes itself by combining visual documentation with security analysis. It generates searchable HTML reports that categorize website screenshots and metadata by content system or device type, while simultaneously performing server header analysis to determine u
Marp is a presentation framework that transforms plain text files into slide decks using Markdown syntax. It functions as a converter and static site generator, enabling the creation of structured visual presentations that can be exported into formats including HTML, PDF, and PowerPoint. The project distinguishes itself through a modular architecture that supports custom themes and plugins, allowing users to extend the core conversion pipeline. It provides a command-line interface for automated file processing and includes reactive file-system watching to facilitate a live preview development
CasperJS is a headless browser testing framework and web functional testing suite. It provides a toolkit for automating web browser interactions to perform functional testing and visual verification of web applications. The project functions as a WebDriver automation tool and a browser screenshot utility, enabling the capture of images of web pages or specific elements to verify visual layout. It also serves as an XML test report generator, exporting the results of automated browser test suites into a standardized format for reporting tools. The framework covers automated browser testing, fu
Gowitness is a system for rendering web interfaces at scale to capture visual snapshots, HTTP metadata, and network scan results. It functions as a headless browser screenshot tool and a web surface mapper used to identify and visually document the attack surface of network ranges and URL lists. The tool includes a screenshot gallery server that provides a web-based interface for browsing, filtering, and managing a database of captures. It specifically serves as an Nmap target visualizer, parsing network scan results to automatically capture screenshots of discovered web services. Capabiliti