15 repository-uri
Reducing collections of values into single results using cumulative functions.
Distinct from Decentralized Data Aggregators: Candidates were related to decentralized aggregators or A/B testing, not functional reduction
Explore 15 awesome GitHub repositories matching data & databases · Functional Data Aggregation. Refine with filters or upvote what's useful.
Underscore is a JavaScript utility library providing a suite of functional programming and data manipulation helpers. It serves as a framework for transforming data collections, composing functions, managing objects, and performing precise data type validation without modifying core language prototypes. The project includes a functional programming toolkit designed to control function execution timing and behavior through techniques such as debouncing, throttling, and partial application. It also provides a dedicated object manipulation utility for cloning, merging, picking, and omitting prop
Includes aggregation helpers that collapse collections into a single result via a cumulative function.
type-fest is a library of reusable utility types for performing complex transformations and validations on objects, arrays, strings, and numeric ranges in TypeScript. It provides a collection of type definitions designed to handle advanced structural changes and constraints. The project distinguishes itself by offering specialized logic for string literal processing, such as casing transformations and pattern-based modifications, and type-level arithmetic for calculating numeric ranges and absolute values. It also includes utilities for enforcing deep immutability, ensuring union mutual exclu
Checks whether all elements in a list of booleans are true at the type level.
Redis is a high-performance in-memory key-value store that functions as a distributed cache, message broker, and NoSQL database. It provides sub-millisecond read and write access to data stored in RAM and can operate as a vector database for indexing high-dimensional embeddings. The system supports a wide range of data storage and synchronization primitives, including the management of strings, hashes, lists, sets, and JSON documents. It enables real-time data operations through atomic transactions, hybrid persistence using snapshots and append-only logs, and high-availability configurations
Groups indexed documents by fields and applies reduction functions like count, sum, and average on the server.
vxe-table is a high-performance data table component and UI library for Vue, designed for building data-heavy applications. It functions as a virtualized data grid and spreadsheet UI framework capable of rendering millions of rows by mounting only the visible elements of a dataset. The project distinguishes itself through spreadsheet-like functionality, including cell selection, copy-paste support, and the generation of cross-tabulated pivot tables. It also provides specialized tools for managing complex data hierarchies using virtual trees, row grouping, and cell merging. The library covers
Calculates summaries and applies mathematical functions to provide high-level insights from raw tabular data.
Crawlee-python is a web crawling framework for building scalable scrapers using Python. It serves as a comprehensive tool for web scraping automation, providing a system to extract structured data from websites using both lightweight HTTP requests and headless browser automation. The framework is distinguished by its anti-bot evasion capabilities, which include browser fingerprint impersonation and tiered proxy rotation to bypass detection systems and solve challenges such as Cloudflare. It also incorporates artificial intelligence for autonomous website navigation and schema-based data extra
Reduces collections of dataset entries into a single accumulated result using custom reduction functions.
This is an interactive Python tutorial delivered as a collection of Jupyter notebooks. It is designed as a structured learning path for beginners, teaching fundamental language concepts through a sequence of lessons that combine explanatory text with runnable code cells and embedded practice exercises. Each notebook is a self-contained unit that introduces a topic, demonstrates it with a minimal code example, and then asks the learner to write code themselves, receiving immediate feedback from the browser-based execution environment. The curriculum is built on a progressive concept-stacking mo
Teaches built-in any() and all() functions for boolean aggregation checks.
Hazelcast is a distributed data platform that combines an in-memory data grid with a stream processing engine to support real-time analytics and event-driven applications. It functions as a partitioned, distributed key-value store that replicates data across cluster nodes to provide low-latency access and high availability. The platform also serves as a distributed SQL query engine, allowing users to execute standard SQL statements against both in-memory datasets and external data sources. What distinguishes Hazelcast is its use of a distributed consensus subsystem to maintain strongly consis
Computes distributed functions like sum or max across map entries in parallel.
Arrow is a functional programming library for Kotlin that provides tools for implementing data-oriented programming patterns. It serves as a framework for typed error handling, a concurrency toolkit, and a library for the manipulation of immutable data. The project distinguishes itself through specialized capabilities for managing application failures using explicit types instead of exceptions and implementing resilience patterns such as circuit breakers and retry policies for distributed services. It also provides optics to update and query deeply nested immutable data structures without man
Provides functional reduction and cumulative functions to aggregate sequence data into single results.
smartTable este o componentă de grilă de date și un framework de vizualizare tabelară pentru Android. Funcționează ca o bibliotecă UI bazată pe adnotări care utilizează markere de clasă și câmp pentru a mapa automat modelele de date către coloane vizuale și a defini proprietățile tabelului. Proiectul se distinge prin integrarea profundă cu foile de calcul, oferind instrumente pentru a importa și exporta date către și din fișiere Excel, păstrând în același timp formatarea celulelor, stilurile, culorile și alinierea. De asemenea, dispune de un sistem de randare bazat pe canvas care suportă layout-uri de grilă complexe cu celule îmbinate, antete înghețate și blocarea coloanelor. Biblioteca acoperă o gamă largă de capabilități de gestionare a datelor, inclusiv sortarea, agregarea și calculul statisticilor pe coloane. Gestionează seturi mari de date prin paginare virtuală, controale de zoom și formatare condiționată a celulelor pentru a menține lizibilitatea și performanța. Framework-ul este implementat în Java pentru aplicații Android.
Performs mathematical reduction of table columns into summary values within a dedicated row.
This is an interactive notebook-based course that teaches machine learning from Python fundamentals through deep learning and natural language processing. It uses real datasets and multiple frameworks within a structured, hands-on curriculum that combines concise explanations with executable code cells, built-in datasets, and embedded exercise checkpoints. Learning progresses through data preparation and exploration, classical machine learning workflows, computer vision with convolutional neural networks, and natural language processing with deep learning, all delivered as a cohesive progressi
Provides mathematical reduction of table columns into summary values using group-by operations.
H2 este un sistem de gestionare a bazelor de date relaționale compatibil JDBC, scris în Java. Funcționează ca o bază de date SQL embeddable care poate rula direct în procesul unei aplicații pentru a elimina latența rețelei, sau ca o bază de date în memorie pentru stocare volatilă de înaltă performanță. Include, de asemenea, o consolă bazată pe web pentru executarea comenzilor SQL și administrarea schemelor. Sistemul se caracterizează prin moduri de implementare flexibile, inclusiv un mod server standalone pentru acces TCP/IP la distanță și un mod mixt pentru conectivitate locală și la distanță simultană. Dispune de un strat de emulare a dialectelor și moduri de compatibilitate care îi permit să imite comportamentul și sintaxa altor sisteme de baze de date. Motorul oferă un set larg de capabilități, acoperind tranzacții ACID cu controlul concurenței multi-versiune (MVCC), suport pentru date geospațiale și JSON, precum și funcții analitice avansate de tip window. Include instrumente pentru conservarea datelor prin backup-uri comprimate, restaurarea scripturilor SQL și gestionarea memoriei off-heap pentru a manipula seturi mari de date. Baza de date se integrează cu aplicațiile folosind drivere standard Java Database Connectivity și URL-uri de conexiune.
Provides built-in logical aggregation functions to determine if any or all expressions in a group are true.
MoreLINQ is a functional programming toolkit for .NET that provides a comprehensive collection of extension methods for LINQ to Objects. It enables declarative data transformation and sequence manipulation by extending standard collection interfaces with operators that support lazy evaluation and functional composition. By leveraging the iterator pattern, the library allows for efficient, streaming-based processing of data sets while maintaining strong type safety through generic constraints. The library distinguishes itself by offering advanced capabilities for structural manipulation and co
Reduces collections of values into single results using cumulative functions in a single pass.
Xan is a command-line tool and data transformation engine for processing CSV, TSV, and JSONL datasets. It functions as a processor for compressed files, enabling random access and seeking within gzipped and Zstd files, and serves as a converter for specialized bioinformatics data formats. The tool handles large datasets without requiring full memory loads by utilizing stream-based processing. It provides capabilities for merging, sorting, and deduplicating massive files, as well as converting data between various tabular formats. The project covers a broad range of data wrangling and analysi
Implements mathematical reduction of table columns into summary values to condense large datasets.
This project is a numerical computing library designed for scientific and engineering mathematical operations. It functions as a comprehensive linear algebra framework, a statistical analysis library, and a toolkit for mathematical optimization and numerical integration. The library is distinguished by its provider-based native acceleration, which allows managed code to be swapped for platform-native binary libraries to increase the performance of computationally intensive routines. It also supports a hybrid approach to matrix storage, implementing separate strategies for dense and sparse mat
Summarizes matrix data using functional fold and reduce routines across rows and columns.
Ktorm este un framework de mapare obiect-relațională (ORM) ușor pentru Kotlin care oferă un limbaj specific domeniului (DSL) SQL „type-safe” și API-uri funcționale de secvență pentru interacțiunea cu baza de date. Permite dezvoltatorilor să definească scheme de baze de date și să mapeze tabelele relaționale la obiecte bazate pe interfețe, asigurându-se că persistența și recuperarea datelor sunt gestionate prin expresii puternic tipizate. Framework-ul se distinge prin utilizarea unei abordări funcționale, de tip colecție, pentru construcția interogărilor, permițând dezvoltatorilor să înlănțuiască operațiuni precum filtrarea, sortarea și agregarea folosind tipare idiomatice de limbaj. Prin utilizarea evaluării leneșe (lazy) a secvențelor, biblioteca asigură că operațiunile bazei de date sunt amânate până când datele sunt accesate explicit, ceea ce optimizează performanța și utilizarea memoriei în timpul recuperării înregistrărilor. Sistemul acoperă o gamă cuprinzătoare de capabilități de gestionare a bazelor de date, inclusiv operațiuni de join automatizate, paginarea rezultatelor și gestionarea ciclului de viață al entităților. Susține definiții complexe de schemă și maparea tipurilor personalizate, oferind instrumente pentru a sincroniza stările obiectelor din memorie cu înregistrările persistente din baza de date, menținând în același timp consistența schemei prin metadate declarative.
Computes summary statistics like sums, averages, or counts from filtered database records.