76 Repos
Mechanisms for temporarily storing data to manage flow control and system stability.
Distinguishing note: Focuses on disk-backed memory buffering for queues.
Explore 76 awesome GitHub repositories matching data & databases · Data Buffering. Refine with filters or upvote what's useful.
FFmpeg is a cross-platform multimedia framework designed for the recording, conversion, and streaming of audio and video content. It functions as a comprehensive toolkit that provides both a command-line utility for direct media manipulation and a collection of low-level libraries for integration into custom applications. At its core, the project utilizes a packet-based stream engine and a format-agnostic abstraction layer to handle diverse media standards, containers, and network protocols. The framework distinguishes itself through a modular, graph-based filter execution model that allows f
Duplicates a single output stream to multiple destinations simultaneously to support diverse delivery requirements.
This project is a collection of educational resources and step-by-step tutorials for Java backend development. It provides implementation guides for building web services and applications using the Spring Boot framework, focusing on the development of data streams and concurrent tasks. The repository includes technical walkthroughs for Kubernetes cluster automation, specifically regarding the creation of custom operators and admission controllers. It also serves as a manual for cloud native integration, covering the packaging of applications into containers and the use of distributed event me
Covers the implementation of data streams for efficient reading and writing between files and memory.
NSQ is a distributed, brokerless messaging platform designed for high-throughput, fault-tolerant communication. By utilizing a decentralized topology, it eliminates single points of failure and allows for horizontal scaling across clusters. The system organizes message streams into topics and channels, effectively decoupling producers from consumers to support both streaming and job-oriented workloads. The platform distinguishes itself through a lookup-service-based discovery mechanism that enables clients to dynamically locate producers at runtime without requiring centralized coordination.
Spills message queues to disk when memory thresholds are exceeded to prevent service crashes.
fmt is a type-safe C++ text formatting library used to convert data into formatted strings and text. It serves as a high-performance string buffer utility and provided the basis for the formatting features introduced in the C++20 standard. The library shifts format string validation and size computation to the compilation phase to prevent runtime crashes and increase execution speed. It utilizes a memory-efficient interface to write formatted data directly into buffers, minimizing allocations and eliminating intermediate string overhead. The project covers a broad range of text processing ca
Writes data directly into stream buffers to eliminate intermediate memory allocations and maximize output speed.
Vector is a high-performance observability data pipeline designed to collect, transform, and route logs, metrics, and traces across distributed infrastructure. It functions as a modular engine that decouples data ingestion from processing and transmission, utilizing a component-based architecture to connect diverse sources to multiple destinations. The project distinguishes itself through a focus on reliability and flow control. It implements backpressure-aware data movement to prevent data loss during traffic spikes and utilizes disk-backed event buffering to ensure durability during network
Provides disk-backed event buffering to ensure data durability and prevent loss during network outages or service restarts.
Vitess is a database clustering system for horizontal scaling of MySQL. It functions as a middleware layer that abstracts complex sharding and physical topology, allowing applications to interact with a distributed database environment through a unified interface. By intercepting and routing SQL queries across multiple shards, it enables large-scale data management while maintaining the appearance of a single database instance. The platform distinguishes itself through its ability to perform online schema migrations and distributed transaction coordination without requiring application downti
Queues incoming queries during transient cluster instability or topology changes to prevent application errors and maintain service continuity.
Excelize is a library for reading and writing spreadsheet files in the Office Open XML format. It provides a comprehensive suite of tools for programmatically creating, modifying, and analyzing workbooks, worksheets, and cell data, ensuring compatibility across various office software suites through structured XML serialization. The library distinguishes itself with a built-in formula calculation engine that evaluates complex mathematical and logical expressions directly against workbook data. It also features a memory-mapped streaming architecture, which allows for the efficient processing o
Accumulates data in intermediate buffers to reduce expensive system calls during high-volume write operations.
TinyGo is a specialized compiler and development toolkit designed to bring the Go programming language to resource-constrained microcontrollers and WebAssembly environments. It provides a bare-metal runtime environment that enables high-level code execution without the need for a traditional operating system, utilizing an LLVM-based backend to generate efficient machine instructions. The project distinguishes itself through aggressive optimization techniques tailored for small hardware, including a static memory allocation strategy and whole-program dead code elimination that significantly re
Maintains circular memory queues to store incoming data streams and prevent overflow.
This project is a cross-platform graphics and compute framework that provides a unified, hardware-agnostic abstraction layer for rendering and parallel processing. It enables developers to build high-performance applications that execute consistently across diverse operating systems and hardware backends, including Vulkan, Metal, and DirectX. By mapping high-level graphics commands to native APIs, it serves as a portable foundation for both real-time 3D rendering and general-purpose GPU computing. The framework distinguishes itself through a robust architecture that supports both native deskt
Copies processed data from graphics memory back to system memory by mapping buffers for reading after rendering or compute operations complete.
bgfx is a cross-platform, graphics rendering abstraction layer designed for high-performance applications. It provides a unified interface that maps high-level rendering commands to native graphics APIs, allowing developers to maintain a single codebase that executes consistently across diverse operating systems and hardware architectures. The library distinguishes itself through a multi-threaded command submission model that decouples rendering logic from the main application thread, effectively minimizing CPU bottlenecks. It utilizes a backend-agnostic command buffer and a deferred resource
Allocates and updates vertex, index, and indirect draw buffers for the graphics processor.
VictoriaMetrics is a high-performance, scalable time series database and observability platform designed for long-term storage and analysis of metric, log, and trace data. It functions as a unified backend for monitoring ecosystems, offering full compatibility with industry-standard protocols and query languages. The system is built to handle massive data volumes through a distributed architecture that supports horizontal scaling and efficient data lifecycle management. The platform distinguishes itself through a storage engine that utilizes consistent hashing for data sharding and log-struct
Buffers collected metrics to local disk during network outages to ensure data durability and automatic retry.
This project is a comprehensive educational resource and programming course covering C++ language semantics and features from C++03 through C++26. It provides structured tutorials and technical guides focused on modern C++ development. The material offers specialized instruction on template metaprogramming, including the use of type traits and compile-time computations. It features detailed guides on concurrency and parallelism for multi-core execution, as well as a reference for software design applying SOLID principles and RAII. Additionally, it covers build performance optimization to redu
Teaches the use of formatted manipulators for reading and writing data with precision.
ExcelJS is a Node.js spreadsheet engine and manipulation library used for reading, writing, and modifying XLSX and CSV files. It functions as a formatting tool and asynchronous streaming parser for generating complex workbooks containing formulas, rich text, and custom styles. The library is distinguished by its ability to process large datasets using asynchronous data streaming and incremental processing, which minimizes memory usage during data extraction and file generation. Its capability surface covers comprehensive data management, including structured tables, named ranges, and cell da
Provides utilities for efficiently reading and writing binary XLSX components between memory buffers and files.
RapidJSON is a header-only C++ library designed for high-performance parsing, generation, and manipulation of JSON data. It functions as a dual-mode engine, providing both an in-memory document object model for tree-based manipulation and a stream-based interface for event-driven processing. The library is built to minimize memory footprint and maximize execution speed, making it suitable for resource-constrained environments. The library distinguishes itself through advanced memory management and optimization techniques, including in-situ parsing that modifies input buffers directly to elimi
Stores serialized data in an in-memory stream that dynamically resizes.
Sarama is a Go client library for producing and consuming messages from Apache Kafka clusters. It provides dedicated interfaces for a message producer to send typed data packets and a message consumer to read and process continuous data streams from Kafka topics. The library includes a mocking framework and simulation layer that mimics Kafka broker behavior. These tools allow for the testing of client applications and messaging logic without requiring a live cluster. The project handles cluster connectivity and integration through the implementation of the Kafka binary protocol over TCP. It
Implements internal buffering to collect multiple outgoing messages, reducing network overhead and increasing producer throughput.
This project serves as a comprehensive educational framework and reference library for mastering high-performance graphics programming and parallel compute resource management. It provides a collection of practical implementations designed to demonstrate the explicit control required by the Vulkan API, covering the fundamental mechanics of modern graphics pipelines and cross-platform hardware interaction. The repository distinguishes itself by focusing on the low-level architectural requirements of modern GPU development, including manual memory allocation, multi-threaded command recording, a
Consolidates multiple uniform blocks into a single buffer using dynamic offsets to reduce descriptor set overhead.
Libevent is a cross-platform library that provides a mechanism for executing callback functions when specific events occur on file descriptors, signals, or timers. It functions as a network event loop framework, enabling the development of applications that manage concurrent network connections and non-blocking input and output operations within a single execution context. The library distinguishes itself by providing a portable abstraction layer that automatically selects the most efficient system-level event notification mechanism available on the host operating system. It includes thread-s
Manages network input and output through internal memory buffers that trigger automatic notifications when data is ready.
This project is a memory-safe implementation of the QUIC transport protocol and HTTP/3, designed for high-throughput and efficient network communication. It provides a comprehensive toolkit for building secure, low-latency network applications by managing the full lifecycle of transport connections, including protocol negotiation, stream data exchange, and connection state management. The library distinguishes itself through a focus on performance and protocol integrity. It utilizes a formal state machine to enforce strict adherence to transport rules and employs zero-copy buffer management t
Tracks transmission buffer state to identify potential connection stalls.
StreamDiffusion is an interactive generative AI framework and inference engine designed for the low-latency delivery of image and video streams. It provides a real-time Stable Diffusion pipeline for text-to-image and image-to-image generation, enabling the creation of continuous generative image streams with minimized computational delay. The framework optimizes throughput using a pre-computed cache engine and residual-based guidance approximation to reduce the number of required model passes. It further manages GPU load through similarity-based frame skipping, which avoids redundant computat
Employs dedicated asynchronous queues to decouple input and output operations during high-frequency image generation.
F Prime ist ein komponentenbasiertes Framework für die Entwicklung und Bereitstellung von Embedded- und Raumfahrtsoftware. Es bietet eine modulare Architektur, die Softwarelogik von Kommunikationsschnittstellen entkoppelt und es Entwicklern ermöglicht, Systemstrukturen über eine domänenspezifische Modellierungssprache zu definieren. Dieser modellbasierte Ansatz ermöglicht eine automatisierte Codegenerierung, die Konsistenz über komplexe Systemtopologien hinweg gewährleistet und gleichzeitig strikte Schnittstellenverträge zwischen Softwaremodulen aufrechterhält. Das Framework zeichnet sich durch sein integriertes Build-System und eine Suite für Bodendatenoperationen aus. Es automatisiert den gesamten Lebenszyklus von Embedded-Software, von der Cross-Kompilierung und dem Abhängigkeitsmanagement bis hin zur Generierung von Telemetrie- und Befehlsschnittstellen. Durch die Bereitstellung einer einheitlichen Umgebung für Onboard-Flugsoftware und bodengestützte Überwachung erleichtert es die nahtlose Integration, das Testen sowie die Steuerung und Überwachung verteilter Embedded-Systeme über verschiedene Hardwareplattformen hinweg. Über die Kernarchitektur hinaus enthält das Projekt umfassende Werkzeuge für die Systembeobachtbarkeit, einschließlich Echtzeit-Telemetrie-Visualisierung, Ereignisprotokollierung und diagnostischer Tracing-Funktionen. Es unterstützt eine breite Palette von Bereitstellungsszenarien, von Bare-Metal-Umgebungen bis hin zu Echtzeitbetriebssystemen, und bietet Mechanismen für Speicherverwaltung, zustandsgesteuerte Verhaltensmodellierung und asynchrone Aufgabenausführung. Das Projekt wird als C++-Repository mit umfangreicher Dokumentation und Build-System-Unterstützung für die plattformübergreifende Entwicklung gepflegt.
Collects and manages telemetry snapshots using double-buffering to ensure consistent data downlinking.