12 مستودعات
Defines configuration settings for individual compute nodes within a distributed processing cluster.
Distinguishing note: Focuses on node-level resource and identity settings, distinct from high-level cluster orchestration.
Explore 12 awesome GitHub repositories matching data & databases · Cluster Node Management. Refine with filters or upvote what's useful.
Polars is a high-performance columnar data processing library designed for efficient analytical workflows. It functions as a structured data library that organizes information into typed columns, utilizing the Apache Arrow memory format to enable zero-copy data sharing and cache-friendly, vectorized operations. The engine is built to handle large-scale tabular datasets, providing both local and distributed analytical runtimes that scale from single-machine environments to multi-node clusters. The project distinguishes itself through a sophisticated lazy query engine that constructs abstract e
Defines cluster node settings including identifiers, license paths, and memory limits for cluster deployments.
Vitess is a database clustering system for horizontal scaling of MySQL. It functions as a middleware layer that abstracts complex sharding and physical topology, allowing applications to interact with a distributed database environment through a unified interface. By intercepting and routing SQL queries across multiple shards, it enables large-scale data management while maintaining the appearance of a single database instance. The platform distinguishes itself through its ability to perform online schema migrations and distributed transaction coordination without requiring application downti
Wraps individual database instances with a sidecar process to handle health monitoring, query execution, and lifecycle state transitions.
Excelize is a library for reading and writing spreadsheet files in the Office Open XML format. It provides a comprehensive suite of tools for programmatically creating, modifying, and analyzing workbooks, worksheets, and cell data, ensuring compatibility across various office software suites through structured XML serialization. The library distinguishes itself with a built-in formula calculation engine that evaluates complex mathematical and logical expressions directly against workbook data. It also features a memory-mapped streaming architecture, which allows for the efficient processing o
Defines configuration settings for managing nodes within a distributed processing cluster.
This project is a database driver for Node.js applications designed to interface with Redis. It provides structured access to data stores, enabling the execution of commands, management of data structures, and the implementation of atomic transaction processing. The client distinguishes itself through native support for the binary-safe serialization protocol and a promise-based command pipeline that groups operations to minimize latency. It includes a dedicated manager for distributed environments that handles node discovery and request routing, alongside an event-driven messaging system that
Handles node discovery, request routing, and connection resilience across distributed cluster topologies.
The AWS Cloud Development Kit is an infrastructure-as-code framework that enables developers to define and provision cloud resources using familiar programming languages. By utilizing construct-based synthesis, it translates high-level, object-oriented code into declarative templates, allowing for the automated management of complex cloud environments through a centralized, code-driven control plane. The framework distinguishes itself through its ability to model infrastructure as a dependency-aware resource graph, ensuring that components are provisioned and updated in the correct order. It
Allows granular configuration of cache node types, scaling modes, and availability zone placement.
pysheeet هي مكتبة مرجعية تقنية توفر مجموعة مختارة من مقتطفات التعليمات البرمجية وأنماط التنفيذ لتطوير Python المتقدم، وتكامل النظام، والحوسبة عالية الأداء. تعمل كدليل شامل لتنفيذ برمجة الشبكات منخفضة المستوى، وإضافات C الأصلية، والبرمجة غير المتزامنة والمتزامنة. يوفر المشروع أطر عمل متخصصة لتطوير ونشر نماذج اللغات الكبيرة، بما في ذلك أدوات لاستنتاج GPU الموزع والخدمة عالية الأداء. يتضمن أيضاً أنماطاً مفصلة لتنظيم مجموعات الحوسبة عالية الأداء، وتغطية تخصيص موارد GPU وإدارة عبء العمل متعدد العقد. تغطي المكتبة سطحاً واسعاً من القدرات، بما في ذلك اتصالات الشبكة الآمنة والتشفير، والتعيين الكائني-العلائقي وإدارة قواعد البيانات، وتنفيذ هياكل البيانات والخوارزميات المعقدة. كما توفر أدوات لإدارة الذاكرة، وقابلية التشغيل البيني الأصلية عبر واجهات الوظائف الخارجية، وتكامل نظام التشغيل على مستوى النظام.
Provides implementation patterns for coordinating distributed workloads and resource allocation across multi-node GPU clusters.
This project is a Go client library and API wrapper for interacting with Elasticsearch clusters. It serves as a programmatic interface for managing documents, indices, and cluster health, allowing Go applications to perform search and indexing operations via the REST API. The library functions as a distributed search orchestrator, providing specialized tools for high-throughput data ingestion and cluster administration. It features a buffered bulk processor with exponential backoff retries for optimizing write performance and supports automated index lifecycle transitions and historical data
Monitors data distribution across nodes by retrieving shard allocation data from the cluster.
Metrics Server is a lightweight, single-purpose daemon that collects CPU and memory usage data from every node and pod in a Kubernetes cluster and exposes those metrics through a standard Kubernetes API endpoint. It registers as an aggregated extension API server behind the Kubernetes apiserver, making resource utilization data available to the Horizontal Pod Autoscaler and Vertical Pod Autoscaler for automatic replica count and resource request adjustments. The project distinguishes itself by operating as a focused, in-cluster resource metrics collector that polls kubelet summary endpoints a
Polls kubelet summary endpoints on each node to gather CPU and memory usage for pods and nodes.
Lettuce is a Redis client library for Java that provides synchronous, asynchronous, and reactive programming models for interacting with Redis databases. It supports standalone, cluster, sentinel, pub/sub, and search operations through a single thread-safe connection model that handles command execution without blocking the calling thread. The library distinguishes itself through its reactive streams integration with Project Reactor, enabling non-blocking, backpressure-aware data processing with Mono and Flux types. It offers cluster slot routing that transparently handles MOVED and ASK redir
Dispatches commands to a subset of cluster nodes and collects results asynchronously.
gpustack هي منصة إدارة مجموعات GPU ومنسق استدلال LLM. تعمل كنظام مركزي لتجميع وتنسيق وحدات معالجة الرسومات عبر الخوادم المحلية وبيئات السحابة، وتعمل كمدير حوسبة غير متجانس لتكوينات الأجهزة والبرامج المتنوعة. يوفر النظام بوابة نشر نموذج ذكاء اصطناعي آمنة تخدم النماذج كخدمات قابلة للتوسع باستخدام مصادقة قائمة على المفاتيح. يتضمن مجدول موارد GPU يوازن أعباء العمل عبر المسرعات وينسق محركات استدلال متعددة لرسم خرائط نماذج ذكاء اصطناعي محددة لأجهزة متوافقة. تغطي المنصة تنسيق المجموعات الشامل، بما في ذلك استعادة الفشل الآلي، ومراقبة الموارد في الوقت الفعلي، وتوسيع نطاق الاستدلال الموزع. يدمج تحسين الأداء من خلال التكميم وفك التشفير التخميني لزيادة الإنتاجية وتقليل زمن الوصول. يتم الحفاظ على تكوينات النظام وحالة المجموعة عبر استمرارية حالة قاعدة بيانات علائقية خارجية.
Provides a centralized management plane for orchestrating distributed workloads and resource allocation across multi-node GPU clusters.
TheHive is a security incident response platform and multi-tenant case management system. It functions as a Security Orchestration, Automation, and Response (SOAR) tool and a threat intelligence platform designed to coordinate security investigations by managing alerts, cases, and observables. The platform is distinguished by its multi-tenant architecture, which isolates data across different organizations while supporting selective cross-tenant sharing. It features a SOAR automation engine capable of executing sandboxed JavaScript logic to automate workflows and trigger response actions thro
Manages individual database nodes, including decommissioning healthy nodes and removing crashed ones.
This project is a distributed computing platform designed to orchestrate containerized workloads across heterogeneous hardware clusters. It functions as a centralized control plane that manages resource allocation, scheduling, and execution environments, enabling organizations to share high-performance computing infrastructure securely among multiple users and projects. The platform distinguishes itself through advanced hardware virtualization and multi-tenant management capabilities. It supports the partitioning of physical graphics processing units into fractional slices, allowing multiple
Coordinates distributed containerized workloads and resource allocation across heterogeneous hardware clusters.