29 dépôts
Libraries for converting data between binary, text, and portable interchange formats for storage or transmission.
Explore 29 awesome GitHub repositories matching data & databases · Data Encoding and Serialization. Refine with filters or upvote what's useful.
Ce projet est une roadmap éducative complète conçue pour guider les ingénieurs logiciels à travers la maîtrise des fondamentaux de l'informatique et la préparation aux entretiens techniques. Il fournit un parcours d'apprentissage structuré et conscient des dépendances qui organise des concepts informatiques complexes dans un programme hiérarchique, permettant aux utilisateurs de construire une base d'ingénierie professionnelle grâce à une étude itérative et une mise en œuvre pratique. Le programme se distingue en intégrant les connaissances théoriques au développement professionnel, offrant un index unifié de ressources croisées, notamment des livres, des articles académiques et des tutoriels vidéo. Il met l'accent sur la standardisation de l'efficacité algorithmique par l'analyse de la complexité asymptotique et fournit une décomposition granulaire et modulaire des sujets pour faciliter un apprentissage ciblé et progressif à travers de vastes domaines techniques. Au-delà des algorithmes et des structures de données de base, le dépôt couvre une large surface de capacités, notamment la conception d'architecture système, les systèmes distribués, la sécurité informatique et la modélisation mathématique avancée. Il fournit également des conseils stratégiques pour l'ensemble du cycle de vie de l'embauche, de l'optimisation du CV et de la préparation aux entretiens comportementaux à la croissance de carrière à long terme. L'ensemble de la base de connaissances est maintenu en tant que dépôt contrôlé par version et piloté par markdown, permettant une approche agnostique de la plateforme et collaborative de l'éducation technique.
Reduces data footprint using encoding algorithms to enhance storage efficiency and transmission performance.
Ce projet est un dépôt complet d'implémentations computationnelles vérifiées conçu pour servir de ressource éducative pour l'informatique et la résolution de problèmes algorithmiques. Il fournit une collection structurée d'exemples de code qui couvrent les structures de données fondamentales, les opérations mathématiques et les concepts de programmation de base, permettant aux utilisateurs d'étudier la logique et la complexité derrière diverses méthodes computationnelles. Le dépôt se distingue par un modèle d'implémentation modulaire basé sur des références qui organise le code dans des espaces de noms logiques. Cette approche facilite l'exécution indépendante et la clarté éducative, permettant aux utilisateurs d'explorer l'évolution des stratégies computationnelles, des approches naïves par force brute aux solutions optimisées haute performance. En découplant les abstractions de structures de données des opérations algorithmiques, le projet garantit que les implémentations restent interchangeables et faciles à analyser. La surface de capacités couvre un large éventail de domaines techniques, notamment l'apprentissage automatique, la cryptographie, le calcul scientifique et la vision par ordinateur. Il inclut des implémentations pour la modélisation prédictive, les réseaux de neurones et l'analyse statistique, aux côtés d'outils pour le traitement du signal numérique, la gestion des flux réseau et la modélisation financière. La collection répond également à des besoins mathématiques spécialisés, tels que l'algèbre linéaire, les calculs géométriques et la manipulation de bits, fournissant une base large pour la recherche et les applications d'ingénierie.
Shrink digital information streams through encoding techniques to improve storage density and transmission speeds.
Axios is an isomorphic, promise-based HTTP client designed for making asynchronous network requests across different JavaScript execution environments, including the browser and Node.js. It functions as a JSON API client that serializes JavaScript objects into JSON and parses server responses into structured data. The project features a system for managing reusable client instances with shared configurations, such as base URLs and default settings. It includes a mechanism for intercepting outgoing requests and incoming responses globally, allowing data to be transformed before it reaches the
Converts JavaScript objects into JSON, multipart, or URL-encoded formats for network transmission.
Tesseract is a neural network-based optical character recognition engine designed to convert scanned images and digital documents into machine-readable, searchable text. It functions as both a command-line utility for automating large-scale digitization workflows and a cross-platform library that can be embedded into desktop, mobile, or server-side applications. By utilizing long short-term memory networks, the engine provides robust text extraction across more than one hundred languages and dozens of scripts. The project distinguishes itself through a sophisticated document layout analysis f
Unpack language recognition assets and modify internal data structures using command-line utilities for custom configuration.
This project is a computer science educational resource and a library of common data structures and algorithms implemented in Swift. It serves as a practical reference for studying complexity and efficiency through solved algorithmic problems and conceptual guides. The collection includes implementations of linear and hierarchical data structures, such as stacks, queues, linked lists, and trees. It covers a wide range of computational patterns, including graph and pathfinding implementations, mathematical numerical methods, and data compression techniques. The project also provides implement
Implements data compression algorithms to reduce storage size and improve transmission efficiency.
This project serves as a comprehensive technical reference for the architecture and design of data-intensive applications. It provides a structured analysis of the fundamental principles required to build reliable, scalable, and maintainable software systems, covering the core trade-offs inherent in modern data infrastructure. The repository explores the mechanics of distributed data management, including strategies for replication, partitioning, and achieving consensus across multiple nodes. It details the design of storage engines, indexing techniques, and transaction management models, whi
Provides libraries for converting data between binary, text, and portable interchange formats for storage or transmission.
Vector is a high-performance observability data pipeline designed to collect, transform, and route logs, metrics, and traces across distributed infrastructure. It functions as a modular engine that decouples data ingestion from processing and transmission, utilizing a component-based architecture to connect diverse sources to multiple destinations. The project distinguishes itself through a focus on reliability and flow control. It implements backpressure-aware data movement to prevent data loss during traffic spikes and utilizes disk-backed event buffering to ensure durability during network
Controls how events are grouped, compressed, and serialized into specific formats before transmission to downstream services.
Ciphey is an automated decryption tool and cryptographic analysis framework designed to identify and reverse encryptions, encodings, and hashes without requiring a known key or cipher. It functions as a hash cracking engine and a heuristic cipher identifier to recover original plaintext from unknown data patterns. The project features a nested encoding resolver that iteratively unwraps multiple layers of encryption and encoding until readable text is reached. It employs a heuristic cryptanalysis workflow to analyze data characteristics and guess the likely encoding scheme or encryption method
Provides automated workflows for unpacking data processed through multiple layers of different encryption formats.
Ciphey is an automated decryption and data obfuscation tool designed to identify and reverse complex, multi-layered encoding schemes. By utilizing statistical analysis and probability scoring, the system automatically detects unknown data formats and recovers human-readable plaintext from obfuscated input strings without requiring manual algorithm specification. The tool distinguishes itself through a recursive pipeline that processes nested data structures and strips formatting anomalies or invisible characters to ensure consistent input. It employs a heuristic search and multithreaded execu
Resolves multi-level, nested encoding structures through automated, recursive conversion steps.
Letta is a framework for building, deploying, and managing autonomous AI agents that maintain persistent state across long-term interactions. It provides a comprehensive suite of primitives for defining agents with configurable personas, modular memory blocks, and tool-use capabilities, enabling them to retain user preferences and conversation history over extended sessions. The platform distinguishes itself through its advanced memory management and orchestration capabilities. It allows agents to autonomously update their own memory, perform retrieval-augmented generation, and coordinate com
Maintains organized records by updating names and descriptions of data archives.
crypto-js is a JavaScript cryptography library providing a collection of standard cryptographic algorithms and data transformation tools. It functions as a symmetric encryption toolset, a cryptographic hash implementation, and a password derivation tool for the JavaScript environment. The library enables the protection of sensitive data through symmetric encryption and the production of fixed-size data digests to verify integrity. It includes utilities for converting user passwords into secure cryptographic keys and a data format converter for translating information between Base64, Hex, and
Includes utilities for base-encoding and cryptographic hashing to ensure compatible data transmission.
Redox is a POSIX-compliant, microkernel-based operating system written entirely in Rust. By utilizing a memory-safe language for the kernel and all system components, the project eliminates common vulnerabilities such as buffer overflows and use-after-free errors. Its architecture relies on a minimal kernel that manages only essential hardware and process isolation, delegating all other system services to unprivileged user-space processes. The system distinguishes itself through a modular design where hardware drivers and system services run as independent user-space daemons, allowing them to
Provides utilities for compressing and extracting files using standard archiving formats.
Grav is a flat-file content management system that eliminates the need for a traditional database by storing site content and configuration in human-readable Markdown and YAML files. Built as a modular PHP web framework, it uses a hierarchical page routing system where the physical directory structure directly determines the site's URL paths. The platform is distinguished by its event-driven plugin architecture and a command-line interface that prioritizes system administration, deployment, and maintenance tasks. It utilizes a blueprint-driven system to generate administrative forms from stru
Performs cryptographic hashing and base-encoding to prepare data for secure storage.
This project provides a lossless compression algorithm and a byte-oriented compression library designed for high-speed data reduction and maximum decompression speed. It functions as a stream-oriented compression engine, a software library for encoding and decoding data blocks, and a command-line tool for managing interoperable compressed frames. The system distinguishes itself through the use of predefined pattern dictionaries to improve compression ratios for small data sets and small packets. It supports multiple processing modes, including high-speed block compression for minimal latency
Encodes data into a byte-oriented format using match-copy logic to minimize size while maintaining high throughput.
FileCentipede is a comprehensive file management and transfer application designed to handle diverse network protocols and data operations. It functions as a multi-protocol download manager, a full-featured BitTorrent client, and a remote filesystem manager, providing a unified interface for moving and organizing data across local and remote environments. The application distinguishes itself through deep browser integration, which allows for the direct capture of media streams, video, and bulk download links from web pages. It also includes a modular utility suite that enables users to perfor
Performs file merging, checksum verification, URI encoding, and regular expression testing for data processing.
AndroidNote is a technical knowledge base and reference resource for Android development. It provides comprehensive guidance on application architecture, custom view development, and advanced graphics programming. The project is distinguished by its depth in visual implementation, covering pseudo-3D perspective projections via virtual cameras and complex 2D rendering using Bézier curves and PorterDuff color blending. It also provides detailed methodologies for app modularization and the management of internal libraries through private Maven repositories and JitPack. The reference surface ext
Utilizes an encryption toolkit for performing cryptographic hashing and secure data encoding.
Fluent Bit est un collecteur de logs et de télémétrie unifié cloud-native conçu comme un pipeline de données efficace en ressources. Il ingère des logs, des métriques et des traces provenant de multiples sources, les traitant en temps réel avant d'acheminer les données vers des backends de stockage externes. Le projet fonctionne comme un processeur de flux en temps réel et un processeur de logs OpenTelemetry, capable de transformer et de filtrer les données en utilisant SQL et une logique conditionnelle. Il agit également comme un agent de traçage distribué capable d'échantillonner les traces pour réduire le volume de données tout en préservant les chemins de requête complets. Le système fournit une livraison de données fiable grâce à une mise en mémoire tampon basée sur le système de fichiers et une logique de réessai avec état pour éviter la perte de données lors des pannes. Son architecture modulaire prend en charge des plugins d'entrée et de sortie enfichables, un routage basé sur les métadonnées et la capacité d'étendre les fonctionnalités via des bibliothèques partagées. Le logiciel peut être déployé en tant que conteneur sur différentes architectures CPU et systèmes d'exploitation.
Serializes and encodes processed telemetry data into specific binary or text protocols for remote delivery.
libhv is a high-performance C/C++ network library and event-driven I/O framework used to build TCP, UDP, SSL, HTTP, WebSocket, and MQTT clients and servers. It provides a non-blocking event loop for managing network sockets, timers, and system signals across multiple threads. The project is distinguished by its integrated support for specialized network roles, including a full HTTP web server with RESTful routing and middleware, an MQTT messaging client for IoT communication, and the ability to implement SOCKS5 and HTTP proxies. It also features a reliable UDP implementation to ensure ordered
Provides utilities for Base64 encoding and data checksumming to ensure information integrity.
HackTools is a browser extension pentesting toolkit designed for offensive security professionals. It serves as a centralized collection of tools for generating payloads, managing penetration testing workflows, and accessing security reference materials within a web-based interface. The project provides specialized utilities for generating attack strings for XSS, SQL injection, and reverse shells to identify and exploit web vulnerabilities. It includes a data encoding and hashing utility to convert information between various formats for the purpose of bypassing security filters or verifying
Provides cryptographic hashing and base-encoding utilities to transform input strings for security testing.
Encodes audio data directly into byte buffers without intermediate disk writes.