2 مستودعات
Utilities that verify character sequence compliance with specific encoding standards like UTF-8.
Distinct from String Validation and Normalization: Distinct from general string validation: focuses on character encoding integrity rather than content-based constraints or normalization.
Explore 2 awesome GitHub repositories matching software engineering & architecture · Encoding Validators. Refine with filters or upvote what's useful.
simdjson is a high-performance JSON parser that utilizes SIMD instructions to process gigabytes of data per second. It functions as a SIMD JSON parser, a multithreaded NDJSON processing library, a UTF-8 validation engine, and a tool for JSON minification and string building. The project focuses on high-throughput data processing, enabling the ingestion of massive JSON volumes and the verification of UTF-8 encoding standards. It includes dedicated capabilities for constructing JSON strings with optimized memory usage and removing unnecessary whitespace from documents to reduce file size. The
Provides a high-throughput engine to verify that documents adhere to the UTF-8 encoding standard.
simdjson is a high-performance, header-only C++ library designed for parsing, querying, and serializing JSON data with minimal memory overhead. It functions as a hardware-aware data processing engine that leverages vector instructions to achieve gigabyte-per-second parsing speeds. By detecting host processor capabilities at runtime, the library automatically selects the most efficient instruction sets to accelerate structural analysis and validation. The library distinguishes itself through a focus on extreme efficiency and resource management. It utilizes memory mapping and padded buffer ali
The library verifies that strings and documents conform to UTF-8 or Unicode standards to ensure data integrity before and during processing.