9 repository-uri
Functions for manipulating nested array structures within tabular data.
Distinguishing note: Focuses on fixed-size array column manipulation.
Explore 9 awesome GitHub repositories matching data & databases · Array Column Operations. Refine with filters or upvote what's useful.
Polars is a high-performance columnar data processing library designed for efficient analytical workflows. It functions as a structured data library that organizes information into typed columns, utilizing the Apache Arrow memory format to enable zero-copy data sharing and cache-friendly, vectorized operations. The engine is built to handle large-scale tabular datasets, providing both local and distributed analytical runtimes that scale from single-machine environments to multi-node clusters. The project distinguishes itself through a sophisticated lazy query engine that constructs abstract e
Performs operations on fixed-size array columns using specialized functions for sorting and aggregating elements.
This is an educational tutorial that walks through implementing a complete JSON library from scratch in C. The project covers the full data lifecycle of JSON, including parsing text into structured in-memory representations, validating input against the specification, serializing data back into standard JSON output, and providing structured access to elements within parsed arrays and objects. The implementation is built around a hand-written recursive descent parser that processes JSON text by matching grammar rules to build a structured data tree. Parsed values are stored in a tagged union r
Constructs nested arrays and objects by pushing and popping elements on a dynamic stack during parsing.
Lance is a versioned columnar data format and storage engine designed as a multimodal AI lakehouse. It serves as a vector database storage engine and a cloud object store dataset manager, organizing images, video, audio, and embeddings into a unified format optimized for machine learning workflows. The project distinguishes itself by combining a columnar layout for structured data with a specialized blob store for large multimodal tensors. It implements a hybrid search engine that integrates vector similarity search, full-text search, and SQL analytics on a single dataset, supported by a stor
Extracts values, checks existence, or measures array lengths within JSON columns using JSONPath syntax.
Ibis is a portable Python dataframe library and multi-backend query engine that provides a unified interface for executing data transformations across diverse compute engines. It functions as a Python SQL expression compiler and dialect transpiler, allowing users to define data logic once and execute it across cloud warehouses, embedded databases, and distributed clusters without rewriting code. The project distinguishes itself through a database backend abstraction that decouples transformation logic from the underlying execution engine. It enables polyglot data workflows by mixing raw SQL s
Identifies common elements between two array columns to determine shared values.
Just este o colecție de biblioteci de utilitare JavaScript concepute pentru manipularea datelor, programarea funcțională, optimizarea performanței, analiza statistică și procesarea șirurilor de caractere. Oferă un set de instrumente pentru deep cloning, filtrarea și transformarea obiectelor și array-urilor complexe. Proiectul este structurat ca o serie de module fără dependențe, permițând utilizarea independentă a utilitarelor pentru a minimiza dimensiunea bundle-ului. Implementează tipare de programare funcțională, inclusiv currying, piping și aplicare parțială, și oferă controlul execuției prin memoizare, debouncing și throttling. Biblioteca acoperă o gamă largă de capabilități, inclusiv manipularea profundă a obiectelor, generarea combinatorie de date și operațiuni matematice precum verificarea numerelor prime și numeric clamping. Include, de asemenea, instrumente statistice pentru calcularea metricilor precum varianța și deviația standard, precum și utilitare de procesare a textului pentru conversia cazurilor și interpolarea șirurilor.
Returns a new array containing only the elements common to two source arrays.
Carp is a statically typed Lisp compiler that compiles Lisp-like syntax directly to C source code, enabling seamless integration with existing C libraries and low-level system programming. It manages memory deterministically at compile time using ownership tracking and linear types, eliminating garbage collection pauses and runtime overhead while ensuring type safety through an inferred static type system. The language distinguishes itself through compile-time macro expansion and metaprogramming capabilities, allowing code generation and transformation before final binary output. It enforces
Stores fixed-size collections of elements allocated on the stack.
jsondiffpatch este o bibliotecă de diff și patch JSON concepută pentru a calcula diferențele dintre două obiecte JSON și a aplica acele modificări pentru a sincroniza stările. Funcționează ca un instrument de sincronizare pentru calcularea delta-urilor și aplicarea patch-urilor pentru a actualiza sau a reveni la stări anterioare ale obiectelor JavaScript complexe imbricate. Proiectul oferă o implementare conformă cu standardul RFC 6902 JSON Patch pentru actualizări atomice și un renderer vizual de diferențe care convertește delta-urile de date în vizualizări HTML lizibile pentru oameni. Include un instrument specializat de text diff pentru efectuarea unei analize fine, la nivel de caracter, asupra șirurilor lungi din valorile datelor JSON. Biblioteca acoperă o gamă largă de capabilități, inclusiv generarea recursivă de delta-uri și calcularea diferențelor de array-uri folosind algoritmi de tip Longest Common Subsequence. Suportă diffing-ul logic al array-urilor pentru a detecta mutările de elemente prin hashing personalizat și oferă opțiuni de ieșire în mai multe formate, cum ar fi formatarea colorată în consolă și o componentă React dedicată pentru comparații vizuale.
Identifies moved, added, or deleted items within JSON arrays using custom hashing instead of simple index matching.
Arroyo is a high-performance stream processing platform built in Rust. It executes continuous SQL queries on streaming data with event-time semantics, enabling accurate windowed aggregations, joins, and stateful computations on unbounded event streams. The platform uses native Rust execution for high throughput and low latency, with periodic checkpointing for exactly-once fault tolerance and horizontal scaling across distributed workers. The system integrates deeply with Kafka for reading and writing topics with exactly-once delivery and supports change data capture (CDC) from MySQL and Postg
Provides built-in SQL functions for JSONPath extraction and array transformation on streaming data.
Rueidis is a high-performance Redis client library for Go that provides a type-safe and asynchronous interface for interacting with Redis servers. It includes a full implementation of the Redis serialization protocol and a dedicated connection manager to handle pooling, multiplexing, and automatic pipelining. The library is distinguished by its support for RDMA connectivity to reduce latency and CPU overhead. It features a distributed lock manager that implements majority-based locking and optimistic concurrency control, as well as client-side caching with invalidation signals to minimize net
Includes a utility that scans JSON array results directly into slices of structured types.