5 रिपॉजिटरी
Tools that transform structured text data into optimized binary formats for fast querying.
Distinct from Static Binary Compilers: Candidates were focused on executable code compilers, not data-to-binary format compilers.
Explore 5 awesome GitHub repositories matching data & databases · Binary Database Compilers. Refine with filters or upvote what's useful.
ip2region is an offline IP geolocation library and framework designed to resolve IPv4 and IPv6 addresses to city-level regional information using local binary data files. It functions as a binary IP database compiler and a cross-language search client, allowing for regional lookups without relying on external APIs. The project distinguishes itself through a specialized binary format that supports high-performance query optimization. It employs adjacent-segment IP merging and deduplicated region storage to minimize the database footprint, while utilizing memory-mapped file caching and vector-i
Converts raw text-based IP location records into optimized binary files for high-speed offline lookups.
LanguageTool is a multilingual grammar and style checking engine designed to detect spelling, grammar, and writing errors across multiple languages. It provides automated proofreading capabilities that can be deployed as a self-hosted server or executed as a standalone local desktop application. The project distinguishes itself through a flexible rule development framework, allowing linguistic patterns to be defined via XML or implemented as custom Java classes. It utilizes n-gram frequency modeling for confused word detection and supports neural word embeddings to improve disambiguation betw
Transforms plain text word lists into optimized binary formats to improve spell-checking performance.
This project is a community-driven routing dataset and binary routing data compiler used to define network traffic rules. It provides a collection of categorized domain rules that direct web traffic based on site identity or location. The system converts human-readable domain lists into optimized binary formats for high-performance network filtering, specifically serving as a geosite data generator for V2Ray. It manages shared lists of domains and keywords through a collaborative repository of patterns. The toolset covers domain routing rule definition using sub-domains, full domains, keywor
Converts human-readable domain lists into optimized binary formats for high-performance network filtering.
GeoLite.mmdb is a curated repository of prebuilt MaxMind GeoIP databases in the MMDB binary format, providing ready-to-use files for mapping IP addresses to geographic locations and autonomous system network providers without requiring a MaxMind account or manual compilation. The project serves as a central source for country-level, city-level, and autonomous system number (ASN) databases that enable fast, decompression-free IP lookups. The repository delivers three primary database files: the GeoLite2 Country database for resolving IP addresses to their registered countries, the GeoLite2 Cit
Compiles geographic and network data into the MaxMind binary database format for fast IP lookups.
This project is a CJK input method framework and configuration set designed for the Rime input engine. It provides a comprehensive system of schemas and dictionary packs to optimize Chinese character entry through pinyin and double-pinyin workflows. The framework is distinguished by its use of Lua-powered extensions that add dynamic utilities, such as inline mathematical calculators, automated timestamps, and text formatting, directly to the input interface. It also features refined word libraries and language models specifically tuned to improve prediction accuracy and first-choice hit rates
Transforms text-based dictionaries into optimized binary formats for high-speed candidate retrieval.