29 Repos
Libraries for converting data between binary, text, and portable interchange formats for storage or transmission.
Explore 29 awesome GitHub repositories matching data & databases · Data Encoding and Serialization. Refine with filters or upvote what's useful.
Dieses Projekt ist ein umfassender Bildungs-Lehrplan, der Softwareingenieure durch die Beherrschung der Informatik-Grundlagen und die Vorbereitung auf technische Vorstellungsgespräche führen soll. Er bietet einen strukturierten, abhängigkeitsbewussten Lernpfad, der komplexe Informatikkonzepte in einen hierarchischen Lehrplan organisiert und es Nutzern ermöglicht, durch iteratives Studium und praktische Implementierung ein professionelles Engineering-Fundament aufzubauen. Der Lehrplan zeichnet sich durch die Integration von theoretischem Wissen mit beruflicher Entwicklung aus und bietet einen einheitlichen Index von querverweisenden Ressourcen, einschließlich Büchern, wissenschaftlichen Arbeiten und Video-Tutorials. Er betont die Standardisierung der algorithmischen Effizienz durch asymptotische Komplexitätsanalyse und bietet eine granulare, modulare Themenzerlegung, um fokussiertes, inkrementelles Lernen über weite technische Bereiche hinweg zu erleichtern. Neben Kernalgorithmen und Datenstrukturen deckt das Repository ein breites Spektrum ab, einschließlich Systemarchitektur-Design, verteilten Systemen, Computersicherheit und fortgeschrittener mathematischer Modellierung. Es bietet zudem strategische Beratung für den gesamten Einstellungsprozess, von der Lebenslaufoptimierung und der Vorbereitung auf verhaltensbezogene Interviews bis hin zum langfristigen Karrierewachstum. Die gesamte Wissensdatenbank wird als versionskontrolliertes, Markdown-gesteuertes Repository gepflegt, was einen plattformunabhängigen und kollaborativen Ansatz für die technische Bildung ermöglicht.
Reduces data footprint using encoding algorithms to enhance storage efficiency and transmission performance.
Dieses Projekt ist ein umfassendes Repository verifizierter Rechenimplementierungen, das als Bildungsressource für Informatik und algorithmische Problemlösung dienen soll. Es bietet eine strukturierte Sammlung von Codebeispielen, die grundlegende Datenstrukturen, mathematische Operationen und Kernkonzepte der Programmierung abdecken und es Nutzern ermöglichen, die Logik und Komplexität hinter verschiedenen Berechnungsmethoden zu studieren. Das Repository zeichnet sich durch ein modulares, referenzbasiertes Implementierungsmuster aus, das Code in logische Namespaces organisiert. Dieser Ansatz erleichtert die unabhängige Ausführung und pädagogische Klarheit und ermöglicht es Nutzern, die Entwicklung von Berechnungsstrategien von naiven Brute-Force-Ansätzen bis hin zu optimierten Hochleistungslösungen zu erforschen. Durch die Entkopplung von Datenstruktur-Abstraktionen von algorithmischen Operationen stellt das Projekt sicher, dass Implementierungen austauschbar und leicht zu analysieren bleiben. Das Fähigkeitsspektrum umfasst eine breite Palette technischer Bereiche, einschließlich maschinellem Lernen, Kryptographie, wissenschaftlichem Rechnen und Computer Vision. Es enthält Implementierungen für prädiktive Modellierung, neuronale Netze und statistische Analysen, neben Tools für digitale Signalverarbeitung, Netzwerkflussmanagement und Finanzmodellierung. Die Sammlung adressiert zudem spezialisierte mathematische Bedürfnisse, wie lineare Algebra, geometrische Berechnungen und Bit-Manipulation, und bietet eine breite Grundlage für Forschung und Engineering-Anwendungen.
Shrink digital information streams through encoding techniques to improve storage density and transmission speeds.
Axios is an isomorphic, promise-based HTTP client designed for making asynchronous network requests across different JavaScript execution environments, including the browser and Node.js. It functions as a JSON API client that serializes JavaScript objects into JSON and parses server responses into structured data. The project features a system for managing reusable client instances with shared configurations, such as base URLs and default settings. It includes a mechanism for intercepting outgoing requests and incoming responses globally, allowing data to be transformed before it reaches the
Converts JavaScript objects into JSON, multipart, or URL-encoded formats for network transmission.
Tesseract is a neural network-based optical character recognition engine designed to convert scanned images and digital documents into machine-readable, searchable text. It functions as both a command-line utility for automating large-scale digitization workflows and a cross-platform library that can be embedded into desktop, mobile, or server-side applications. By utilizing long short-term memory networks, the engine provides robust text extraction across more than one hundred languages and dozens of scripts. The project distinguishes itself through a sophisticated document layout analysis f
Unpack language recognition assets and modify internal data structures using command-line utilities for custom configuration.
This project is a computer science educational resource and a library of common data structures and algorithms implemented in Swift. It serves as a practical reference for studying complexity and efficiency through solved algorithmic problems and conceptual guides. The collection includes implementations of linear and hierarchical data structures, such as stacks, queues, linked lists, and trees. It covers a wide range of computational patterns, including graph and pathfinding implementations, mathematical numerical methods, and data compression techniques. The project also provides implement
Implements data compression algorithms to reduce storage size and improve transmission efficiency.
This project serves as a comprehensive technical reference for the architecture and design of data-intensive applications. It provides a structured analysis of the fundamental principles required to build reliable, scalable, and maintainable software systems, covering the core trade-offs inherent in modern data infrastructure. The repository explores the mechanics of distributed data management, including strategies for replication, partitioning, and achieving consensus across multiple nodes. It details the design of storage engines, indexing techniques, and transaction management models, whi
Provides libraries for converting data between binary, text, and portable interchange formats for storage or transmission.
Vector is a high-performance observability data pipeline designed to collect, transform, and route logs, metrics, and traces across distributed infrastructure. It functions as a modular engine that decouples data ingestion from processing and transmission, utilizing a component-based architecture to connect diverse sources to multiple destinations. The project distinguishes itself through a focus on reliability and flow control. It implements backpressure-aware data movement to prevent data loss during traffic spikes and utilizes disk-backed event buffering to ensure durability during network
Controls how events are grouped, compressed, and serialized into specific formats before transmission to downstream services.
Ciphey is an automated decryption tool and cryptographic analysis framework designed to identify and reverse encryptions, encodings, and hashes without requiring a known key or cipher. It functions as a hash cracking engine and a heuristic cipher identifier to recover original plaintext from unknown data patterns. The project features a nested encoding resolver that iteratively unwraps multiple layers of encryption and encoding until readable text is reached. It employs a heuristic cryptanalysis workflow to analyze data characteristics and guess the likely encoding scheme or encryption method
Provides automated workflows for unpacking data processed through multiple layers of different encryption formats.
Ciphey is an automated decryption and data obfuscation tool designed to identify and reverse complex, multi-layered encoding schemes. By utilizing statistical analysis and probability scoring, the system automatically detects unknown data formats and recovers human-readable plaintext from obfuscated input strings without requiring manual algorithm specification. The tool distinguishes itself through a recursive pipeline that processes nested data structures and strips formatting anomalies or invisible characters to ensure consistent input. It employs a heuristic search and multithreaded execu
Resolves multi-level, nested encoding structures through automated, recursive conversion steps.
Letta is a framework for building, deploying, and managing autonomous AI agents that maintain persistent state across long-term interactions. It provides a comprehensive suite of primitives for defining agents with configurable personas, modular memory blocks, and tool-use capabilities, enabling them to retain user preferences and conversation history over extended sessions. The platform distinguishes itself through its advanced memory management and orchestration capabilities. It allows agents to autonomously update their own memory, perform retrieval-augmented generation, and coordinate com
Maintains organized records by updating names and descriptions of data archives.
crypto-js is a JavaScript cryptography library providing a collection of standard cryptographic algorithms and data transformation tools. It functions as a symmetric encryption toolset, a cryptographic hash implementation, and a password derivation tool for the JavaScript environment. The library enables the protection of sensitive data through symmetric encryption and the production of fixed-size data digests to verify integrity. It includes utilities for converting user passwords into secure cryptographic keys and a data format converter for translating information between Base64, Hex, and
Includes utilities for base-encoding and cryptographic hashing to ensure compatible data transmission.
Redox is a POSIX-compliant, microkernel-based operating system written entirely in Rust. By utilizing a memory-safe language for the kernel and all system components, the project eliminates common vulnerabilities such as buffer overflows and use-after-free errors. Its architecture relies on a minimal kernel that manages only essential hardware and process isolation, delegating all other system services to unprivileged user-space processes. The system distinguishes itself through a modular design where hardware drivers and system services run as independent user-space daemons, allowing them to
Provides utilities for compressing and extracting files using standard archiving formats.
Grav is a flat-file content management system that eliminates the need for a traditional database by storing site content and configuration in human-readable Markdown and YAML files. Built as a modular PHP web framework, it uses a hierarchical page routing system where the physical directory structure directly determines the site's URL paths. The platform is distinguished by its event-driven plugin architecture and a command-line interface that prioritizes system administration, deployment, and maintenance tasks. It utilizes a blueprint-driven system to generate administrative forms from stru
Performs cryptographic hashing and base-encoding to prepare data for secure storage.
This project provides a lossless compression algorithm and a byte-oriented compression library designed for high-speed data reduction and maximum decompression speed. It functions as a stream-oriented compression engine, a software library for encoding and decoding data blocks, and a command-line tool for managing interoperable compressed frames. The system distinguishes itself through the use of predefined pattern dictionaries to improve compression ratios for small data sets and small packets. It supports multiple processing modes, including high-speed block compression for minimal latency
Encodes data into a byte-oriented format using match-copy logic to minimize size while maintaining high throughput.
FileCentipede is a comprehensive file management and transfer application designed to handle diverse network protocols and data operations. It functions as a multi-protocol download manager, a full-featured BitTorrent client, and a remote filesystem manager, providing a unified interface for moving and organizing data across local and remote environments. The application distinguishes itself through deep browser integration, which allows for the direct capture of media streams, video, and bulk download links from web pages. It also includes a modular utility suite that enables users to perfor
Performs file merging, checksum verification, URI encoding, and regular expression testing for data processing.
AndroidNote is a technical knowledge base and reference resource for Android development. It provides comprehensive guidance on application architecture, custom view development, and advanced graphics programming. The project is distinguished by its depth in visual implementation, covering pseudo-3D perspective projections via virtual cameras and complex 2D rendering using Bézier curves and PorterDuff color blending. It also provides detailed methodologies for app modularization and the management of internal libraries through private Maven repositories and JitPack. The reference surface ext
Utilizes an encryption toolkit for performing cryptographic hashing and secure data encoding.
Fluent Bit ist ein Cloud-nativer Log-Shipper und einheitlicher Telemetrie-Collector, der als ressourceneffiziente Datenpipeline konzipiert ist. Er nimmt Logs, Metriken und Traces aus mehreren Quellen auf und verarbeitet sie in Echtzeit, bevor die Daten an externe Speicher-Backends weitergeleitet werden. Das Projekt fungiert als Echtzeit-Stream-Prozessor und OpenTelemetry-Log-Prozessor, der in der Lage ist, Daten mithilfe von SQL und bedingter Logik zu transformieren und zu filtern. Es fungiert zudem als verteilter Tracing-Agent, der Traces sampeln kann, um das Datenvolumen zu reduzieren und gleichzeitig vollständige Anforderungspfade zu bewahren. Das System bietet eine zuverlässige Datenübermittlung durch dateisystembasiertes Buffering und zustandsbehaftete Wiederholungslogik, um Datenverlust bei Ausfällen zu verhindern. Seine modulare Architektur unterstützt steckbare Eingabe- und Ausgabe-Plugins, metadatengesteuertes Routing und die Möglichkeit, Funktionen über gemeinsam genutzte Bibliotheken zu erweitern. Die Software kann als Container über verschiedene CPU-Architekturen und Betriebssysteme hinweg bereitgestellt werden.
Serializes and encodes processed telemetry data into specific binary or text protocols for remote delivery.
libhv is a high-performance C/C++ network library and event-driven I/O framework used to build TCP, UDP, SSL, HTTP, WebSocket, and MQTT clients and servers. It provides a non-blocking event loop for managing network sockets, timers, and system signals across multiple threads. The project is distinguished by its integrated support for specialized network roles, including a full HTTP web server with RESTful routing and middleware, an MQTT messaging client for IoT communication, and the ability to implement SOCKS5 and HTTP proxies. It also features a reliable UDP implementation to ensure ordered
Provides utilities for Base64 encoding and data checksumming to ensure information integrity.
HackTools is a browser extension pentesting toolkit designed for offensive security professionals. It serves as a centralized collection of tools for generating payloads, managing penetration testing workflows, and accessing security reference materials within a web-based interface. The project provides specialized utilities for generating attack strings for XSS, SQL injection, and reverse shells to identify and exploit web vulnerabilities. It includes a data encoding and hashing utility to convert information between various formats for the purpose of bypassing security filters or verifying
Provides cryptographic hashing and base-encoding utilities to transform input strings for security testing.
Encodes audio data directly into byte buffers without intermediate disk writes.