29 repository-uri
Libraries for converting data between binary, text, and portable interchange formats for storage or transmission.
Explore 29 awesome GitHub repositories matching data & databases · Data Encoding and Serialization. Refine with filters or upvote what's useful.
Acest proiect este un roadmap educațional cuprinzător conceput pentru a ghida inginerii software prin stăpânirea fundamentelor informaticii și pregătirea pentru interviurile tehnice. Oferă o cale de învățare structurată, conștientă de dependențe, care organizează concepte complexe de calcul într-un curriculum ierarhic, permițând utilizatorilor să construiască o fundație profesională de inginerie prin studiu iterativ și implementare practică. Curriculumul se distinge prin integrarea cunoștințelor teoretice cu dezvoltarea profesională, oferind un index unificat de resurse referențiate încrucișat, inclusiv cărți, lucrări academice și tutoriale video. Acesta pune accent pe standardizarea eficienței algoritmice prin analiza complexității asimptotice și oferă o descompunere granulară și modulară a subiectelor pentru a facilita învățarea concentrată și incrementală în domenii tehnice vaste. Dincolo de algoritmii de bază și structurile de date, repository-ul acoperă o suprafață largă de capabilități, inclusiv designul arhitecturii de sistem, sisteme distribuite, securitatea informatică și modelarea matematică avansată. De asemenea, oferă îndrumări strategice pentru întregul ciclu de angajare, de la optimizarea CV-ului și pregătirea interviului comportamental până la creșterea carierei pe termen lung. Întreaga bază de cunoștințe este menținută ca un repository controlat prin versiuni, bazat pe markdown, permițând o abordare agnostică față de platformă și colaborativă pentru educația tehnică.
Reduces data footprint using encoding algorithms to enhance storage efficiency and transmission performance.
Acest proiect este un repository cuprinzător de implementări computaționale verificate, conceput pentru a servi drept resursă educațională pentru informatică și rezolvarea problemelor algoritmice. Oferă o colecție structurată de exemple de cod care acoperă structuri de date fundamentale, operațiuni matematice și concepte de bază de programare, permițând utilizatorilor să studieze logica și complexitatea din spatele diferitelor metode computaționale. Repository-ul se distinge printr-un tipar de implementare modular, bazat pe referințe, care organizează codul în spații de nume logice. Această abordare facilitează execuția independentă și claritatea educațională, permițând utilizatorilor să exploreze evoluția strategiilor computaționale de la abordări naive de tip brute-force la soluții optimizate, de înaltă performanță. Prin decuplarea abstracțiilor structurilor de date de operațiunile algoritmice, proiectul asigură că implementările rămân interschimbabile și ușor de analizat. Suprafața de capabilități acoperă o gamă largă de domenii tehnice, inclusiv învățarea automată, criptografia, calculul științific și viziunea computerizată. Include implementări pentru modelare predictivă, rețele neuronale și analiză statistică, alături de instrumente pentru procesarea semnalelor digitale, gestionarea fluxului de rețea și modelarea financiară. Colecția abordează, de asemenea, nevoi matematice specializate, cum ar fi algebra liniară, calculele geometrice și manipularea biților, oferind o fundație largă pentru cercetare și aplicații de inginerie.
Shrink digital information streams through encoding techniques to improve storage density and transmission speeds.
Axios is an isomorphic, promise-based HTTP client designed for making asynchronous network requests across different JavaScript execution environments, including the browser and Node.js. It functions as a JSON API client that serializes JavaScript objects into JSON and parses server responses into structured data. The project features a system for managing reusable client instances with shared configurations, such as base URLs and default settings. It includes a mechanism for intercepting outgoing requests and incoming responses globally, allowing data to be transformed before it reaches the
Converts JavaScript objects into JSON, multipart, or URL-encoded formats for network transmission.
Tesseract is a neural network-based optical character recognition engine designed to convert scanned images and digital documents into machine-readable, searchable text. It functions as both a command-line utility for automating large-scale digitization workflows and a cross-platform library that can be embedded into desktop, mobile, or server-side applications. By utilizing long short-term memory networks, the engine provides robust text extraction across more than one hundred languages and dozens of scripts. The project distinguishes itself through a sophisticated document layout analysis f
Unpack language recognition assets and modify internal data structures using command-line utilities for custom configuration.
This project is a computer science educational resource and a library of common data structures and algorithms implemented in Swift. It serves as a practical reference for studying complexity and efficiency through solved algorithmic problems and conceptual guides. The collection includes implementations of linear and hierarchical data structures, such as stacks, queues, linked lists, and trees. It covers a wide range of computational patterns, including graph and pathfinding implementations, mathematical numerical methods, and data compression techniques. The project also provides implement
Implements data compression algorithms to reduce storage size and improve transmission efficiency.
This project serves as a comprehensive technical reference for the architecture and design of data-intensive applications. It provides a structured analysis of the fundamental principles required to build reliable, scalable, and maintainable software systems, covering the core trade-offs inherent in modern data infrastructure. The repository explores the mechanics of distributed data management, including strategies for replication, partitioning, and achieving consensus across multiple nodes. It details the design of storage engines, indexing techniques, and transaction management models, whi
Provides libraries for converting data between binary, text, and portable interchange formats for storage or transmission.
Vector is a high-performance observability data pipeline designed to collect, transform, and route logs, metrics, and traces across distributed infrastructure. It functions as a modular engine that decouples data ingestion from processing and transmission, utilizing a component-based architecture to connect diverse sources to multiple destinations. The project distinguishes itself through a focus on reliability and flow control. It implements backpressure-aware data movement to prevent data loss during traffic spikes and utilizes disk-backed event buffering to ensure durability during network
Controls how events are grouped, compressed, and serialized into specific formats before transmission to downstream services.
Ciphey is an automated decryption tool and cryptographic analysis framework designed to identify and reverse encryptions, encodings, and hashes without requiring a known key or cipher. It functions as a hash cracking engine and a heuristic cipher identifier to recover original plaintext from unknown data patterns. The project features a nested encoding resolver that iteratively unwraps multiple layers of encryption and encoding until readable text is reached. It employs a heuristic cryptanalysis workflow to analyze data characteristics and guess the likely encoding scheme or encryption method
Provides automated workflows for unpacking data processed through multiple layers of different encryption formats.
Ciphey is an automated decryption and data obfuscation tool designed to identify and reverse complex, multi-layered encoding schemes. By utilizing statistical analysis and probability scoring, the system automatically detects unknown data formats and recovers human-readable plaintext from obfuscated input strings without requiring manual algorithm specification. The tool distinguishes itself through a recursive pipeline that processes nested data structures and strips formatting anomalies or invisible characters to ensure consistent input. It employs a heuristic search and multithreaded execu
Resolves multi-level, nested encoding structures through automated, recursive conversion steps.
Letta is a framework for building, deploying, and managing autonomous AI agents that maintain persistent state across long-term interactions. It provides a comprehensive suite of primitives for defining agents with configurable personas, modular memory blocks, and tool-use capabilities, enabling them to retain user preferences and conversation history over extended sessions. The platform distinguishes itself through its advanced memory management and orchestration capabilities. It allows agents to autonomously update their own memory, perform retrieval-augmented generation, and coordinate com
Maintains organized records by updating names and descriptions of data archives.
crypto-js is a JavaScript cryptography library providing a collection of standard cryptographic algorithms and data transformation tools. It functions as a symmetric encryption toolset, a cryptographic hash implementation, and a password derivation tool for the JavaScript environment. The library enables the protection of sensitive data through symmetric encryption and the production of fixed-size data digests to verify integrity. It includes utilities for converting user passwords into secure cryptographic keys and a data format converter for translating information between Base64, Hex, and
Includes utilities for base-encoding and cryptographic hashing to ensure compatible data transmission.
Redox is a POSIX-compliant, microkernel-based operating system written entirely in Rust. By utilizing a memory-safe language for the kernel and all system components, the project eliminates common vulnerabilities such as buffer overflows and use-after-free errors. Its architecture relies on a minimal kernel that manages only essential hardware and process isolation, delegating all other system services to unprivileged user-space processes. The system distinguishes itself through a modular design where hardware drivers and system services run as independent user-space daemons, allowing them to
Provides utilities for compressing and extracting files using standard archiving formats.
Grav is a flat-file content management system that eliminates the need for a traditional database by storing site content and configuration in human-readable Markdown and YAML files. Built as a modular PHP web framework, it uses a hierarchical page routing system where the physical directory structure directly determines the site's URL paths. The platform is distinguished by its event-driven plugin architecture and a command-line interface that prioritizes system administration, deployment, and maintenance tasks. It utilizes a blueprint-driven system to generate administrative forms from stru
Performs cryptographic hashing and base-encoding to prepare data for secure storage.
This project provides a lossless compression algorithm and a byte-oriented compression library designed for high-speed data reduction and maximum decompression speed. It functions as a stream-oriented compression engine, a software library for encoding and decoding data blocks, and a command-line tool for managing interoperable compressed frames. The system distinguishes itself through the use of predefined pattern dictionaries to improve compression ratios for small data sets and small packets. It supports multiple processing modes, including high-speed block compression for minimal latency
Encodes data into a byte-oriented format using match-copy logic to minimize size while maintaining high throughput.
FileCentipede is a comprehensive file management and transfer application designed to handle diverse network protocols and data operations. It functions as a multi-protocol download manager, a full-featured BitTorrent client, and a remote filesystem manager, providing a unified interface for moving and organizing data across local and remote environments. The application distinguishes itself through deep browser integration, which allows for the direct capture of media streams, video, and bulk download links from web pages. It also includes a modular utility suite that enables users to perfor
Performs file merging, checksum verification, URI encoding, and regular expression testing for data processing.
AndroidNote is a technical knowledge base and reference resource for Android development. It provides comprehensive guidance on application architecture, custom view development, and advanced graphics programming. The project is distinguished by its depth in visual implementation, covering pseudo-3D perspective projections via virtual cameras and complex 2D rendering using Bézier curves and PorterDuff color blending. It also provides detailed methodologies for app modularization and the management of internal libraries through private Maven repositories and JitPack. The reference surface ext
Utilizes an encryption toolkit for performing cryptographic hashing and secure data encoding.
Fluent Bit este un log shipper cloud-native și un colector unificat de telemetrie conceput ca un pipeline de date eficient din punct de vedere al resurselor. Acesta ingerează log-uri, metrici și urme (traces) din surse multiple, procesându-le în timp real înainte de a direcționa datele către backend-uri de stocare externe. Proiectul funcționează ca un procesor de flux în timp real și procesor de log-uri OpenTelemetry, capabil să transforme și să filtreze datele folosind SQL și logică condițională. Acționează, de asemenea, ca un agent de tracing distribuit care poate eșantiona urmele pentru a reduce volumul de date, păstrând în același timp căile complete ale cererilor. Sistemul oferă livrarea fiabilă a datelor prin buffering bazat pe sistemul de fișiere și logică de reîncercare cu stare pentru a preveni pierderea datelor în timpul întreruperilor. Arhitectura sa modulară suportă plugin-uri de input și output interschimbabile, rutare bazată pe metadate și capacitatea de a extinde funcționalitatea prin biblioteci partajate. Software-ul poate fi implementat ca un container pe diferite arhitecturi CPU și sisteme de operare.
Serializes and encodes processed telemetry data into specific binary or text protocols for remote delivery.
libhv is a high-performance C/C++ network library and event-driven I/O framework used to build TCP, UDP, SSL, HTTP, WebSocket, and MQTT clients and servers. It provides a non-blocking event loop for managing network sockets, timers, and system signals across multiple threads. The project is distinguished by its integrated support for specialized network roles, including a full HTTP web server with RESTful routing and middleware, an MQTT messaging client for IoT communication, and the ability to implement SOCKS5 and HTTP proxies. It also features a reliable UDP implementation to ensure ordered
Provides utilities for Base64 encoding and data checksumming to ensure information integrity.
HackTools is a browser extension pentesting toolkit designed for offensive security professionals. It serves as a centralized collection of tools for generating payloads, managing penetration testing workflows, and accessing security reference materials within a web-based interface. The project provides specialized utilities for generating attack strings for XSS, SQL injection, and reverse shells to identify and exploit web vulnerabilities. It includes a data encoding and hashing utility to convert information between various formats for the purpose of bypassing security filters or verifying
Provides cryptographic hashing and base-encoding utilities to transform input strings for security testing.
Encodes audio data directly into byte buffers without intermediate disk writes.