6 repository-uri
Utilities for parsing, sanitizing, and manipulating HTML documents.
Explore 6 awesome GitHub repositories matching part of an awesome list · HTML Processing. Refine with filters or upvote what's useful.
This repository contains the HTML specification, which defines the core standards for web page structuring, content organization, and document rendering. It establishes the fundamental algorithms for state-machine-based tokenization, tree construction for the document object model, and origin-based security isolation. The specification provides a framework for defining custom elements with independent lifecycles and registries. It also details the requirements for cross-document communication, session history management, and the synchronization of interface properties with content attributes.
Establishes the fundamental process of creating a document and associating it with a byte-stream parser.
Floki is a simple HTML parser that enables search for nodes using CSS selectors.
Simple HTML parser with CSS-like selectors.
An Elixir library for parsing and extracting data from HTML and XML with CSS or XPath selectors.
Parsing and extracting data with CSS or XPath selectors.
Readability is Elixir library for extracting and curating articles.
Extracting and curating article content.
An Elixir client for the Nu HTML Checker (v.Nu).
Client for HTML, CSS, and SVG validation.
Elixir/Erlang bindings for lexborisov's myhtml
Bindings for high-performance HTML parsing.