Why is jashkenas/underscore a recommended Functional Data Aggregation GitHub Repositories repository?

Includes aggregation helpers that collapse collections into a single result via a cumulative function.

Why is sindresorhus/type-fest a recommended Functional Data Aggregation GitHub Repositories repository?

Checks whether all elements in a list of booleans are true at the type level.

Why is tporadowski/redis a recommended Functional Data Aggregation GitHub Repositories repository?

Groups indexed documents by fields and applies reduction functions like count, sum, and average on the server.

Why is x-extends/vxe-table a recommended Functional Data Aggregation GitHub Repositories repository?

Calculates summaries and applies mathematical functions to provide high-level insights from raw tabular data.

Why is apify/crawlee-python a recommended Functional Data Aggregation GitHub Repositories repository?

Reduces collections of dataset entries into a single accumulated result using custom reduction functions.

Why is jerry-git/learn-python3 a recommended Functional Data Aggregation GitHub Repositories repository?

Teaches built-in any() and all() functions for boolean aggregation checks.

Why is hazelcast/hazelcast a recommended Functional Data Aggregation GitHub Repositories repository?

Computes distributed functions like sum or max across map entries in parallel.

Why is arrow-kt/arrow a recommended Functional Data Aggregation GitHub Repositories repository?

Provides functional reduction and cumulative functions to aggregate sequence data into single results.

Why is huangyanbin/smarttable a recommended Functional Data Aggregation GitHub Repositories repository?

Performs mathematical reduction of table columns into summary values within a dedicated row.

Why is nyandwi/machine_learning_complete a recommended Functional Data Aggregation GitHub Repositories repository?

Provides mathematical reduction of table columns into summary values using group-by operations.

15 repository-uri

Awesome GitHub RepositoriesFunctional Data Aggregation

Reducing collections of values into single results using cumulative functions.

Distinct from Decentralized Data Aggregators: Candidates were related to decentralized aggregators or A/B testing, not functional reduction

Explore 15 awesome GitHub repositories matching data & databases · Functional Data Aggregation. Refine with filters or upvote what's useful.

Găsește cele mai bune repo-uri cu AI.Vom căuta cele mai potrivite repository-uri folosind AI.

jashkenas/underscore
jashkenas/underscore
27,345Vezi pe GitHub
Underscore is a JavaScript utility library providing a suite of functional programming and data manipulation helpers. It serves as a framework for transforming data collections, composing functions, managing objects, and performing precise data type validation without modifying core language prototypes. The project includes a functional programming toolkit designed to control function execution timing and behavior through techniques such as debouncing, throttling, and partial application. It also provides a dedicated object manipulation utility for cloning, merging, picking, and omitting prop
Includes aggregation helpers that collapse collections into a single result via a cumulative function.
JavaScript
Vezi pe GitHub27,345
sindresorhus/type-fest
sindresorhus/type-fest
17,233Vezi pe GitHub
type-fest is a library of reusable utility types for performing complex transformations and validations on objects, arrays, strings, and numeric ranges in TypeScript. It provides a collection of type definitions designed to handle advanced structural changes and constraints. The project distinguishes itself by offering specialized logic for string literal processing, such as casing transformations and pattern-based modifications, and type-level arithmetic for calculating numeric ranges and absolute values. It also includes utilities for enforcing deep immutability, ensuring union mutual exclu
Checks whether all elements in a list of booleans are true at the type level.
TypeScriptnpm-packagetypestypescript
Vezi pe GitHub17,233
tporadowski/redis
tporadowski/redis
9,987Vezi pe GitHub
Redis is a high-performance in-memory key-value store that functions as a distributed cache, message broker, and NoSQL database. It provides sub-millisecond read and write access to data stored in RAM and can operate as a vector database for indexing high-dimensional embeddings. The system supports a wide range of data storage and synchronization primitives, including the management of strings, hashes, lists, sets, and JSON documents. It enables real-time data operations through atomic transactions, hybrid persistence using snapshots and append-only logs, and high-availability configurations
Groups indexed documents by fields and applies reduction functions like count, sum, and average on the server.
Credisredis-for-windowsredis-msi-installer
Vezi pe GitHub9,987
x-extends/vxe-table
x-extends/vxe-table
8,595Vezi pe GitHub
vxe-table is a high-performance data table component and UI library for Vue, designed for building data-heavy applications. It functions as a virtualized data grid and spreadsheet UI framework capable of rendering millions of rows by mounting only the visible elements of a dataset. The project distinguishes itself through spreadsheet-like functionality, including cell selection, copy-paste support, and the generation of cross-tabulated pivot tables. It also provides specialized tools for managing complex data hierarchies using virtual trees, row grouping, and cell merging. The library covers
Calculates summaries and applies mathematical functions to provide high-level insights from raw tabular data.
TypeScriptcellcheckboxeditable
Vezi pe GitHub8,595
apify/crawlee-python
apify/crawlee-python
8,097Vezi pe GitHub
Crawlee-python is a web crawling framework for building scalable scrapers using Python. It serves as a comprehensive tool for web scraping automation, providing a system to extract structured data from websites using both lightweight HTTP requests and headless browser automation. The framework is distinguished by its anti-bot evasion capabilities, which include browser fingerprint impersonation and tiered proxy rotation to bypass detection systems and solve challenges such as Cloudflare. It also incorporates artificial intelligence for autonomous website navigation and schema-based data extra
Reduces collections of dataset entries into a single accumulated result using custom reduction functions.
Pythonapifyautomationbeautifulsoup
Vezi pe GitHub8,097
jerry-git/learn-python3
jerry-git/learn-python3
6,754Vezi pe GitHub
This is an interactive Python tutorial delivered as a collection of Jupyter notebooks. It is designed as a structured learning path for beginners, teaching fundamental language concepts through a sequence of lessons that combine explanatory text with runnable code cells and embedded practice exercises. Each notebook is a self-contained unit that introduces a topic, demonstrates it with a minimal code example, and then asks the learner to write code themselves, receiving immediate feedback from the browser-based execution environment. The curriculum is built on a progressive concept-stacking mo
Teaches built-in any() and all() functions for boolean aggregation checks.
HTMLjupyter-notebooklearning-pythonpython-exercises
Vezi pe GitHub6,754
hazelcast/hazelcast
hazelcast/hazelcast
6,570Vezi pe GitHub
Hazelcast is a distributed data platform that combines an in-memory data grid with a stream processing engine to support real-time analytics and event-driven applications. It functions as a partitioned, distributed key-value store that replicates data across cluster nodes to provide low-latency access and high availability. The platform also serves as a distributed SQL query engine, allowing users to execute standard SQL statements against both in-memory datasets and external data sources. What distinguishes Hazelcast is its use of a distributed consensus subsystem to maintain strongly consis
Computes distributed functions like sum or max across map entries in parallel.
Javabig-datacachingdata-in-motion
Vezi pe GitHub6,570
arrow-kt/arrow
arrow-kt/arrow
6,558Vezi pe GitHub
Arrow is a functional programming library for Kotlin that provides tools for implementing data-oriented programming patterns. It serves as a framework for typed error handling, a concurrency toolkit, and a library for the manipulation of immutable data. The project distinguishes itself through specialized capabilities for managing application failures using explicit types instead of exceptions and implementing resilience patterns such as circuit breakers and retry policies for distributed services. It also provides optics to update and query deeply nested immutable data structures without man
Provides functional reduction and cumulative functions to aggregate sequence data into single results.
Kotlinarrowcontinuationscoroutines
Vezi pe GitHub6,558
huangyanbin/smarttable
huangyanbin/smartTable
5,226Vezi pe GitHub
smartTable este o componentă de grilă de date și un framework de vizualizare tabelară pentru Android. Funcționează ca o bibliotecă UI bazată pe adnotări care utilizează markere de clasă și câmp pentru a mapa automat modelele de date către coloane vizuale și a defini proprietățile tabelului. Proiectul se distinge prin integrarea profundă cu foile de calcul, oferind instrumente pentru a importa și exporta date către și din fișiere Excel, păstrând în același timp formatarea celulelor, stilurile, culorile și alinierea. De asemenea, dispune de un sistem de randare bazat pe canvas care suportă layout-uri de grilă complexe cu celule îmbinate, antete înghețate și blocarea coloanelor. Biblioteca acoperă o gamă largă de capabilități de gestionare a datelor, inclusiv sortarea, agregarea și calculul statisticilor pe coloane. Gestionează seturi mari de date prin paginare virtuală, controale de zoom și formatare condiționată a celulelor pentru a menține lizibilitatea și performanța. Framework-ul este implementat în Java pentru aplicații Android.
Performs mathematical reduction of table columns into summary values within a dedicated row.
Javaandroidandroid-libraryandroid-table
Vezi pe GitHub5,226
nyandwi/machine_learning_complete
Nyandwi/machine_learning_complete
4,983Vezi pe GitHub
This is an interactive notebook-based course that teaches machine learning from Python fundamentals through deep learning and natural language processing. It uses real datasets and multiple frameworks within a structured, hands-on curriculum that combines concise explanations with executable code cells, built-in datasets, and embedded exercise checkpoints. Learning progresses through data preparation and exploration, classical machine learning workflows, computer vision with convolutional neural networks, and natural language processing with deep learning, all delivered as a cohesive progressi
Provides mathematical reduction of table columns into summary values using group-by operations.
Jupyter Notebookcomputer-visiondata-analysisdata-science
Vezi pe GitHub4,983
h2database/h2database
h2database/h2database
4,607Vezi pe GitHub
H2 este un sistem de gestionare a bazelor de date relaționale compatibil JDBC, scris în Java. Funcționează ca o bază de date SQL embeddable care poate rula direct în procesul unei aplicații pentru a elimina latența rețelei, sau ca o bază de date în memorie pentru stocare volatilă de înaltă performanță. Include, de asemenea, o consolă bazată pe web pentru executarea comenzilor SQL și administrarea schemelor. Sistemul se caracterizează prin moduri de implementare flexibile, inclusiv un mod server standalone pentru acces TCP/IP la distanță și un mod mixt pentru conectivitate locală și la distanță simultană. Dispune de un strat de emulare a dialectelor și moduri de compatibilitate care îi permit să imite comportamentul și sintaxa altor sisteme de baze de date. Motorul oferă un set larg de capabilități, acoperind tranzacții ACID cu controlul concurenței multi-versiune (MVCC), suport pentru date geospațiale și JSON, precum și funcții analitice avansate de tip window. Include instrumente pentru conservarea datelor prin backup-uri comprimate, restaurarea scripturilor SQL și gestionarea memoriei off-heap pentru a manipula seturi mari de date. Baza de date se integrează cu aplicațiile folosind drivere standard Java Database Connectivity și URL-uri de conexiune.
Provides built-in logical aggregation functions to determine if any or all expressions in a group are true.
Javadatabasejavajdbc
Vezi pe GitHub4,607
morelinq/morelinq
morelinq/MoreLINQ
3,827Vezi pe GitHub
MoreLINQ is a functional programming toolkit for .NET that provides a comprehensive collection of extension methods for LINQ to Objects. It enables declarative data transformation and sequence manipulation by extending standard collection interfaces with operators that support lazy evaluation and functional composition. By leveraging the iterator pattern, the library allows for efficient, streaming-based processing of data sets while maintaining strong type safety through generic constraints. The library distinguishes itself by offering advanced capabilities for structural manipulation and co
Reduces collections of values into single results using cumulative functions in a single pass.
C#dotnetlinq
Vezi pe GitHub3,827
medialab/xan
medialab/xan
3,752Vezi pe GitHub
Xan is a command-line tool and data transformation engine for processing CSV, TSV, and JSONL datasets. It functions as a processor for compressed files, enabling random access and seeking within gzipped and Zstd files, and serves as a converter for specialized bioinformatics data formats. The tool handles large datasets without requiring full memory loads by utilizing stream-based processing. It provides capabilities for merging, sorting, and deduplicating massive files, as well as converting data between various tabular formats. The project covers a broad range of data wrangling and analysi
Implements mathematical reduction of table columns into summary values to condense large datasets.
Rustclicsvrust
Vezi pe GitHub3,752
mathnet/mathnet-numerics
mathnet/mathnet-numerics
3,717Vezi pe GitHub
This project is a numerical computing library designed for scientific and engineering mathematical operations. It functions as a comprehensive linear algebra framework, a statistical analysis library, and a toolkit for mathematical optimization and numerical integration. The library is distinguished by its provider-based native acceleration, which allows managed code to be swapped for platform-native binary libraries to increase the performance of computationally intensive routines. It also supports a hybrid approach to matrix storage, implementing separate strategies for dense and sparse mat
Summarizes matrix data using functional fold and reduce routines across rows and columns.
C#csharpdifferentiationfft
Vezi pe GitHub3,717
kotlin-orm/ktorm
kotlin-orm/ktorm
2,336Vezi pe GitHub
Ktorm este un framework de mapare obiect-relațională (ORM) ușor pentru Kotlin care oferă un limbaj specific domeniului (DSL) SQL „type-safe” și API-uri funcționale de secvență pentru interacțiunea cu baza de date. Permite dezvoltatorilor să definească scheme de baze de date și să mapeze tabelele relaționale la obiecte bazate pe interfețe, asigurându-se că persistența și recuperarea datelor sunt gestionate prin expresii puternic tipizate. Framework-ul se distinge prin utilizarea unei abordări funcționale, de tip colecție, pentru construcția interogărilor, permițând dezvoltatorilor să înlănțuiască operațiuni precum filtrarea, sortarea și agregarea folosind tipare idiomatice de limbaj. Prin utilizarea evaluării leneșe (lazy) a secvențelor, biblioteca asigură că operațiunile bazei de date sunt amânate până când datele sunt accesate explicit, ceea ce optimizează performanța și utilizarea memoriei în timpul recuperării înregistrărilor. Sistemul acoperă o gamă cuprinzătoare de capabilități de gestionare a bazelor de date, inclusiv operațiuni de join automatizate, paginarea rezultatelor și gestionarea ciclului de viață al entităților. Susține definiții complexe de schemă și maparea tipurilor personalizate, oferind instrumente pentru a sincroniza stările obiectelor din memorie cu înregistrările persistente din baza de date, menținând în același timp consistența schemei prin metadate declarative.
Computes summary statistics like sums, averages, or counts from filtered database records.
Kotlinkotlinktormorm
Vezi pe GitHub2,336