13 مستودعات
Database schemas for persisting chat communications with support for advanced metadata.
Distinguishing note: Focuses on the persistence layer for chat rather than the UI or real-time transport.
Explore 13 awesome GitHub repositories matching data & databases · Message Storage. Refine with filters or upvote what's useful.
This project is an enterprise application framework designed to accelerate the construction of complex business software. It functions as a full-stack code generator that automatically produces backend logic, database operations, and frontend interface components from defined data schemas. By providing a standardized foundation for security, authentication, and administrative management, it enables developers to rapidly deploy functional, production-ready software environments. The platform distinguishes itself through its native support for multi-tenant architectures, allowing for secure dat
Organizes private and group communications in database tables supporting advanced message metadata.
Agentscope is a comprehensive toolkit for developing and orchestrating autonomous multi-agent systems. It provides a unified framework for building agents that can reason, execute tools, and manage memory, enabling the creation of complex, collaborative workflows where multiple specialized agents interact to solve multi-step objectives. The platform distinguishes itself through a robust orchestration engine that supports both sequential and concurrent agent pipelines. It utilizes a centralized event bus for real-time telemetry, allowing developers to track agent reasoning, tool usage, and sys
Labels stored messages with custom strings to enable efficient filtering and retrieval of conversation segments.
This project is a feature-rich Go client library designed for interacting with Redis. It serves as a comprehensive interface for managing remote data stores, enabling developers to execute standard database commands, handle complex data structures, and perform asynchronous operations within Go applications. The library distinguishes itself through its support for advanced Redis capabilities, including connection pooling, pipelining, and transactional integrity. It provides specialized primitives for managing distributed clusters, including automated topology updates and request routing to sha
Groups messages into sorted sets keyed by room ID, allowing for chronological storage and efficient retrieval of message history.
Vector is a high-performance observability data pipeline designed to collect, transform, and route logs, metrics, and traces across distributed infrastructure. It functions as a modular engine that decouples data ingestion from processing and transmission, utilizing a component-based architecture to connect diverse sources to multiple destinations. The project distinguishes itself through a focus on reliability and flow control. It implements backpressure-aware data movement to prevent data loss during traffic spikes and utilizes disk-backed event buffering to ensure durability during network
Reconstructs log entries split by container runtime size limits into complete messages.
NATS Server is a high-performance, lightweight messaging system designed for cloud-native applications, edge computing, and distributed microservices. It functions as a distributed publish-subscribe broker that routes messages using hierarchical, dot-separated subject strings, enabling decoupled communication between services without requiring centralized broker lookups. The system supports core messaging patterns including asynchronous publish-subscribe, request-reply, and load-balanced queue processing. The platform distinguishes itself through a decentralized architecture that eliminates t
Stores incoming messages on disk to ensure data durability and allow for later retrieval by consumers.
Buf is a toolchain for managing the full lifecycle of Protocol Buffers schemas. It provides a set of tools for schema governance, including linting to enforce style guides, a breaking change detector to ensure backward compatibility, and a system for producing language-specific source code via local or remote plugins. The project distinguishes itself through a remote schema registry that centralizes the hosting, versioning, and distribution of Protocol Buffers modules. This registry allows for server-side enforcement of governance policies, such as blocking updates that introduce backward-inc
Translates binary wire messages into JSON format to simplify debugging and interoperability.
Rerun is a multimodal data visualizer and robotics data logger designed for rendering synchronized streams of 3D spatial data, images, and time-series metrics. It functions as a tool for capturing high-frequency sensor data and AI outputs into a queryable columnar format, providing a dedicated interface for viewing MCAP recording files and analyzing physical environments. The project distinguishes itself as a machine learning dataset streamer, capable of feeding logged recordings directly into GPU buffers and PyTorch training pipelines without intermediate exports. It supports a high-performa
Decodes raw MCAP messages using reflection or archetypes to transform them into queryable components.
WildfireChat’s im-server is an open-source instant messaging backend that handles message routing, user management, and group chat for private deployments. It provides a self-hosted messaging platform that can run on private infrastructure without third-party cloud dependencies, supporting enterprise messaging with group management, user directories, and external system integration. The server synchronizes messages and state across mobile, desktop, web, and mini-program clients simultaneously, and supports real-time audio/video calls with group conferencing and live streaming. It includes enc
Persists messages in relational database tables with partitioned storage for efficient retrieval.
tdl is a command-line tool for Telegram account automation, media management, and data archiving. It provides a programmatic interface for downloading and uploading files, forwarding messages, and exporting chat history, member lists, and media into structured JSON files. The project distinguishes itself through a session management system that isolates multiple account identities using unique namespaces. It features expression-based routing for messages and uploads, allowing users to direct content to specific destination chats using custom logic and dynamic caption generation. The tool cov
Saves raw message data to facilitate analysis of the internal Telegram message format.
Kafdrop هو واجهة قائمة على الويب لمراقبة وإدارة مجموعات Apache Kafka، والمواضيع (topics)، والوسطاء (brokers)، ومجموعات المستهلكين. يعمل كمراقب للمجموعة ومدير للمواضيع يوفر تمثيلاً مرئياً لصحة الوسيط، وتعيينات الأقسام، وتأخر مجموعة المستهلكين. يتضمن النظام متصفح رسائل قادراً على قراءة وفك تشفير ونشر الرسائل إلى مواضيع Kafka باستخدام سجلات المخططات (schema registries) أو ملفات الوصف. كما يوفر واجهة برمجة تطبيقات للبيانات الوصفية تعرض معلومات المجموعة من خلال نقاط نهاية JSON للتكامل مع أدوات المراقبة الخارجية. تغطي المنصة مجالات إدارية مثل إدارة دورة حياة المواضيع، والتدقيق الأمني لقوائم التحكم في الوصول، وإنشاء اتصالات وسيط مشفرة باستخدام شهادات TLS وبيانات اعتماد SASL. تشمل قدرات المراقبة تتبع الإزاحات (offsets) لكل قسم وحساب تأخر مجموعة المستهلكين.
Uses local configuration files to decode raw binary Kafka messages into structured data for browsing.
Pinot is a distributed, columnar analytical database designed for high-concurrency, low-latency query processing. It functions as a real-time OLAP datastore, enabling interactive, user-facing analytics by ingesting and querying massive datasets from both streaming and batch sources. The system architecture relies on a centralized controller for cluster coordination and a distributed segment-based storage model to ensure horizontal scalability. The platform distinguishes itself through a hybrid ingestion pipeline that unifies real-time event streams and historical batch data into a single quer
Parses messages in formats including JSON, Avro, Protobuf, and Arrow with support for schema registries.
Emitter هو منصة موزعة للناشر والمشترك (pub-sub) ووسيط رسائل يوفر توجيهاً للبيانات في الوقت الفعلي بين الناشرين والمشتركين عبر مجموعة موزعة. يعمل كوسيط رسائل MQTT للأجهزة منخفضة الطاقة وخادم اتصالات WebSocket للعملاء المستندين إلى الويب، بينما يعمل كمنسق قنوات آمن لإدارة تدفقات البيانات المشفرة. يتميز النظام بدمج تجميع الوسطاء الموزعين لضمان التوافر العالي ونظام تشغيل رسائل مدعوم بالاستمرارية. يتيح ذلك للمنصة تخزين الرسائل التاريخية وتسليمها للمشتركين الذين كانوا غير متصلين أثناء الإرسال الأولي. تُدار صلاحيات الوصول عبر مفاتيح مؤقتة تفرض تحكماً دقيقاً في القراءة والكتابة لمسارات قنوات محددة. تغطي المنصة مجموعة واسعة من قدرات المراسلة، بما في ذلك التسليم غير المتزامن، وتخزين رسائل العملاء غير المتصلين، ونقل البيانات الثنائية والنصية بتنسيقات متعددة. كما تدعم التوجيه القائم على القنوات مع تصفية الرسائل لفصل مكونات النظام، وتوفر مراقبة تشغيلية من خلال بيانات الاتصال الوصفية ومراقبة الأداء. يمكن نشر النظام عبر حاويات Docker وتكوينه باستخدام متغيرات البيئة وملفات الإعداد الخارجية.
Persists messages with configurable expiration periods to support historical retrieval for offline clients.
This project is a reference library of architectural blueprints, study materials, and design patterns for building scalable, high-availability distributed systems. It serves as a technical guide for scalability engineering, providing structural solutions for common engineering challenges. The repository focuses on distributed systems design, covering essential patterns for data replication, consensus algorithms, and transaction management. It distinguishes itself by offering detailed blueprints for specialized domains, including real-time data streaming, large-scale data storage, and high-ava
Provides database schemas and patterns for persisting massive volumes of chat communications.