24 repositories
Collections of unique elements optimized for membership testing and set-theoretic operations like unions and intersections.
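As a minimal sketch of the operations this category refers to, the snippet below uses Python's built-in `set` type. The variable names and sample values are illustrative assumptions and are not taken from any listed repository.

```python
# Illustrative data only; these names are hypothetical.
backend_langs = {"go", "python", "rust"}
scripting_langs = {"python", "ruby", "javascript"}

# Membership testing: average O(1) for a hash-based set.
has_rust = "rust" in backend_langs

# Set-theoretic operations each return a new set.
either = backend_langs | scripting_langs        # union
both = backend_langs & scripting_langs          # intersection
backend_only = backend_langs - scripting_langs  # difference
```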
Explore 24 GitHub repositories matching Data & Databases · Set Data Structures.
Dragonfly is a high-performance, multi-model in-memory data store designed to serve as a drop-in replacement for existing database infrastructures. By utilizing a multi-threaded, shared-nothing architecture and a fiber-based concurrency model, it maximizes CPU utilization and minimizes latency for read and write operations. The system supports a wide range of data structures, including strings, hashes, lists, sets, sorted sets, and JSON documents, while maintaining full compatibility with standard industry wire protocols and client libraries. What distinguishes Dragonfly is its focus on effic
Provides high-performance set operations including membership testing, unions, and intersections for unique data collections.
This project is a computer science educational resource and a library of common data structures and algorithms implemented in Swift. It serves as a practical reference for studying complexity and efficiency through solved algorithmic problems and conceptual guides. The collection includes implementations of linear and hierarchical data structures, such as stacks, queues, linked lists, and trees. It covers a wide range of computational patterns, including graph and pathfinding implementations, mathematical numerical methods, and data compression techniques. The project also provides implement
Implements hash tables and filters to store unique elements and key-value pairs with minimal latency.
This project is a comprehensive collection of common computer science algorithms and data structures implemented in Swift. It serves as an educational reference and library for studying computational complexity, algorithmic logic, and data structure engineering through practical code examples. The repository provides a wide suite of data structure implementations, including various types of linked lists, heaps, hash tables, and an extensive range of hierarchical trees such as Red-Black, B-Tree, and Splay trees. It also covers diverse sorting and searching techniques, from basic bubble sort to
A system for performing union, intersection, and difference operations to merge or isolate common elements.
Python is a high-level, interpreted programming language designed for readability and versatility. It operates via a bytecode-based virtual machine and manages memory automatically through reference counting, supplemented by a cyclic garbage collector. The language supports multiple programming paradigms, including object-oriented, imperative, and functional styles, and provides a comprehensive standard library for system operations, networking, and data handling. The language is distinguished by its dynamic nature, allowing for runtime object introspection and metaclass-driven class creation. It utilizes protocol-ba
Calculates unions, intersections, and differences between collections to analyze relationships between data groups.
This project is a comprehensive, community-maintained knowledge base and toolkit designed for competitive programming. It serves as a centralized repository for algorithmic theory, data structures, and mathematical techniques, providing a structured reference for informatics and collegiate programming competitions. The project distinguishes itself by integrating educational content with a robust suite of automation utilities. It provides a complete workflow for competitive programming, including tools for automated test case generation, solution verification, and direct interaction with onlin
Maintains unique ordered sets for efficient insertion and membership testing.
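One way to picture an ordered set is a sorted list searched with binary search. The sketch below is an assumption-laden illustration, not the approach used by the repository above: `SortedSet` is a hypothetical class, and its insertion costs O(n) because elements are shifted, whereas balanced-tree implementations typically achieve O(log n).

```python
import bisect


class SortedSet:
    """Hypothetical ordered set backed by a sorted list."""

    def __init__(self):
        self._items = []

    def add(self, value):
        # Find the leftmost insertion point; skip the insert if value exists.
        i = bisect.bisect_left(self._items, value)
        if i == len(self._items) or self._items[i] != value:
            self._items.insert(i, value)

    def __contains__(self, value):
        # Binary search gives O(log n) membership testing.
        i = bisect.bisect_left(self._items, value)
        return i < len(self._items) and self._items[i] == value

    def __iter__(self):
        # Iteration yields elements in ascending order.
        return iter(self._items)


scores = SortedSet()
for s in [42, 7, 19, 7, 42]:
    scores.add(s)
ordered = list(scores)
```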
core-js is a comprehensive compatibility layer and standard library polyfill that implements ECMAScript proposals and stable language features across diverse JavaScript runtimes. It serves as a runtime environment shim to ensure consistent execution of global objects, iteration protocols, and standard library methods in older browsers or non-browser environments. The project is distinguished by its delivery models, offering both prototype-based global polyfilling and a pure-module implementation. This allows for the integration of modern functionality without modifying global prototypes to pr
Implements mathematical set operations including intersections, unions, and differences.
This project is a feature-rich Go client library designed for interacting with Redis. It serves as a comprehensive interface for managing remote data stores, enabling developers to execute standard database commands, handle complex data structures, and perform asynchronous operations within Go applications. The library distinguishes itself through its support for advanced Redis capabilities, including connection pooling, pipelining, and transactional integrity. It provides specialized primitives for managing distributed clusters, including automated topology updates and request routing to sha
Maintains collections of unique items and computes mathematical relationships like unions between multiple distinct sets.
This project is an educational resource designed for learning the Python programming language. It serves as a tutorial repository and programming guide, providing a collection of annotated scripts, code examples, and cheatsheets to help users master syntax and core fundamentals. The resource focuses on moving from basic language syntax to advanced implementation, with a particular emphasis on object-oriented programming, the use of the Python standard library, and scripting automation for business workflows. The content covers a broad range of programming capabilities, including control flow
Implements set operations such as union, intersection, and difference to find common or unique elements.
Cayley is a graph database engine designed for storing and querying interconnected data using a quad-based data model. It functions as an RDF quad store, managing information through subjects, predicates, objects, and labels. The system features a modular graph store architecture with pluggable backends, allowing it to swap between in-memory storage and various external persistent databases. It includes a GraphQL-inspired API and a dedicated data visualizer for the interactive exploration of nodes and edges. Query capabilities cover bidirectional path traversal and multi-syntax execution usi
Implements union, intersection, and except (difference) operations to combine or subtract graph paths.
Rete is a framework for building interactive, node-based visual interfaces and dataflow programming environments. It provides a core engine that processes directed graphs, allowing developers to define modular logic where nodes represent operations and connections represent the flow of data or control. By decoupling the graph logic from the user interface, the framework enables the creation of custom visual editors that can be integrated into various frontend component libraries. The project distinguishes itself through a highly extensible, signal-driven architecture that supports complex req
Combines or modifies graph structures using union, difference, and intersection logic to manage collections of nodes and connections.
Garnet is a multi-threaded in-memory database and distributed key-value store. It functions as a high-performance remote cache store that implements the RESP wire protocol to maintain compatibility with existing Redis clients and libraries. The project is distinguished by a shared-memory architecture that enables parallel request processing across multiple cores for sub-millisecond latency. It features a tiered storage system that automatically offloads colder data from system memory to SSD or cloud storage layers, and includes a specialized vector search database for high-dimensional similar
Supports collections of unique elements with mathematical set operations like intersections and unions.
phpredis is a C-based native extension that bridges PHP applications with Redis servers for high-performance data storage and retrieval. It serves as an interface for manipulating strings, hashes, lists, sets, and sorted sets while providing a direct path for executing Redis commands and server-side scripts. The extension provides comprehensive support for distributed environments and high availability. It interfaces with Redis Cluster to distribute data across multiple nodes using hash slots and manages Redis Sentinel for service discovery and automatic failover. It also enables shared state
Implements unordered set operations, including the addition/removal of members and mathematical intersections and unions.
pysheeet is a technical reference library that offers a curated collection of code snippets and implementation patterns for advanced Python development, systems integration, and high-performance computing. It serves as a comprehensive guide to implementing low-level network programming, native C extensions, and asynchronous and concurrent programming. The project offers specialized frameworks for developing and deploying large language models, including tools for distributed GPU inference and high-performance serving. It also includes detailed patterns for orchestrating high-performance computing clusters, covering GPU resource allocation and multi-node workload management. The library covers a wide range of capabilities, including secure network communication and cryptography, object-relational mapping and database management, and the implementation of complex data structures and algorithms. It also provides utilities for memory management, native interoperability through foreign function interfaces (FFI), and operating-system-level integration.
Demonstrates building sets of unique elements using literals, constructors, and comprehensions.
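The three construction styles named above can be sketched as follows. The sample values are illustrative assumptions and are not copied from the repository.

```python
# Literal: duplicate entries collapse into one element.
from_literal = {1, 2, 3, 2}

# Constructor: builds a set from any iterable, here the characters of a string.
from_constructor = set("hello")

# Comprehension: squares of -3..3; negative and positive inputs map to the same value.
from_comprehension = {n * n for n in range(-3, 4)}
```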
This project is a comprehensive library of practical Python code examples and patterns. It provides a collection of scripts and snippets designed to demonstrate a wide range of programming tasks, from basic syntax to advanced implementation patterns. The repository focuses on several core domains, including the implementation of concurrency and multithreading examples, data analysis snippets for cleaning and manipulating tabular data, and various data visualization examples. It also covers automation scripts for file system management and a variety of general programming patterns. Additional
Demonstrates how to create an empty set using the set constructor.
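The constructor matters here because empty braces do not produce a set in Python. A short sketch, with hypothetical variable names:

```python
empty = set()   # an empty set
not_a_set = {}  # empty braces create a dict, not a set

empty.add("x")  # sets grow with add(); dicts have no such method
```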
JimuReport is an open-source reporting and dashboard engine designed to be embedded directly into Spring Boot applications. Its core identity centers on generating data reports and full-screen dashboards from natural language descriptions, eliminating the need for manual design. The platform also provides a conversational query interface that translates plain-language questions into database queries, returning results as tables and charts without requiring SQL knowledge. What distinguishes JimuReport is its integration of AI skills that can be installed with a single command, enabling report
Connects to SQL, API, JavaBean, JSON, and shared data sources to supply data for reports.
go-datastructures is a collection of thread-safe and lock-free data structures designed for high-performance concurrent applications in Go. It provides a modular library of specialized algorithmic toolsets, including a lock-free collection library and an immutable data structure library. The project distinguishes itself through a suite of persistent AVL trees and hash array mapped tries that use branch-copying to preserve previous versions. It also implements non-blocking hash maps, queues, and tries that enable linearizable snapshots and concurrent updates without the use of mutual exclusion
Provides dense and sparse bitmaps for fast bitwise comparisons and intersections between integer sets.
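To illustrate why bitmaps make integer-set intersection cheap, the sketch below stores a set of non-negative integers as the bits of a single Python integer, so intersection becomes one bitwise AND. This is a simplified dense bitmap written for illustration; `to_bitmap` and `from_bitmap` are hypothetical helpers and do not reflect the Go library's API.

```python
def to_bitmap(values):
    """Encode non-negative integers as set bits of one integer."""
    bitmap = 0
    for v in values:
        bitmap |= 1 << v
    return bitmap


def from_bitmap(bitmap):
    """Decode a bitmap back into a sorted list of integers."""
    result = []
    position = 0
    while bitmap:
        if bitmap & 1:
            result.append(position)
        bitmap >>= 1
        position += 1
    return result


a = to_bitmap([1, 3, 5, 7])
b = to_bitmap([3, 4, 5, 6])

common = from_bitmap(a & b)  # intersection via bitwise AND
merged = from_bitmap(a | b)  # union via bitwise OR
```

A dense layout like this spends one bit per possible value, which is why sparse variants exist for sets with large gaps between members.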
Pinot is a distributed, columnar analytical database designed for high-concurrency, low-latency query processing. It functions as a real-time OLAP datastore, enabling interactive, user-facing analytics by ingesting and querying massive datasets from both streaming and batch sources. The system architecture relies on a centralized controller for cluster coordination and a distributed segment-based storage model to ensure horizontal scalability. The platform distinguishes itself through a hybrid ingestion pipeline that unifies real-time event streams and historical batch data into a single quer
Creates a base64 encoded set of column values using optimized data structures like bitmaps or bloom filters for efficient membership testing.
This project provides a curated collection of high-frequency, non-informative Chinese words drawn from academic and industry standards. It serves as a reference dataset and stopword collection designed for use in natural language processing (NLP) tasks. The repository focuses on preprocessing Chinese text to reduce noise and improve the accuracy of machine learning models. It provides filtered datasets specifically for Chinese information retrieval, sentiment analysis preparation, and general data cleaning. The project uses pre-compiled lexicons and flat-file storage to enable efficient stopword filtering and vocabulary aggregation for Chinese corpora.
Uses hash set data structures to perform stopword membership checks in constant time on average.
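A minimal sketch of hash-set stopword filtering follows. The four-word `STOPWORDS` sample and the `remove_stopwords` helper are assumptions made for illustration; the repository's actual word lists are far larger and are stored in flat files.

```python
# Tiny illustrative sample; a real stopword list would be loaded from a file.
STOPWORDS = frozenset({"的", "了", "是", "在"})


def remove_stopwords(tokens):
    """Keep only tokens that are not in the stopword set."""
    # Each `in` check is an average O(1) hash lookup.
    return [t for t in tokens if t not in STOPWORDS]


filtered = remove_stopwords(["我", "在", "北京", "的", "公司"])
```

Using `frozenset` signals that the list is read-only after loading, which suits a fixed lexicon.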
This project is a TensorFlow implementation of an image-to-image translation framework based on conditional generative adversarial networks. It provides the tools needed to train models that map input images to output images based on learned visual patterns, as well as a server for processing image translation requests and serving trained model checkpoints to web clients. The framework includes a system for converting trained model weights into a portable format for browser-based inference. It also has a validation process that generates comparative reports by analyzing sets of input, output, and target images using a trained checkpoint. The source code covers the entire pipeline from data engineering, including image dataset preparation and pair-based pipelining, through conditional adversarial training. It supports specific visual transformations, such as colorizing grayscale images and generating synthetic images.
Generates comparative reports analyzing input, output, and target image sets using trained checkpoints.
dplyr is an R library for data manipulation that provides a grammar for transforming tabular data frames. It works as an in-memory data frame processor and a relational algebra tool, using a consistent set of verbs to filter, select, and summarize data. The project includes a SQL translation engine that converts high-level data manipulation expressions into optimized queries. This allows users to perform transformations directly on remote relational databases and cloud storage without downloading the data locally. The library covers a wide range of tabular operations, including mutating columns, subsetting rows, and relational data joins. It also offers capabilities for grouped data analysis, allowing datasets to be partitioned for independent aggregations and summaries.
Provides tools to reduce datasets to summary forms by computing statistics for defined groups.