9 repositories
Functions for manipulating nested array structures within tabular data.
Distinguishing note: Focuses on fixed-size array column manipulation.
Explore 9 awesome GitHub repositories matching data & databases · Array Column Operations.
Polars is a high-performance columnar data processing library designed for efficient analytical workflows. It functions as a structured data library that organizes information into typed columns, utilizing the Apache Arrow memory format to enable zero-copy data sharing and cache-friendly, vectorized operations. The engine is built to handle large-scale tabular datasets, providing both local and distributed analytical runtimes that scale from single-machine environments to multi-node clusters. The project distinguishes itself through a sophisticated lazy query engine that constructs abstract e
Performs operations on fixed-size array columns using specialized functions for sorting and aggregating elements.
This is an educational tutorial that walks through implementing a complete JSON library from scratch in C. The project covers the full data lifecycle of JSON, including parsing text into structured in-memory representations, validating input against the specification, serializing data back into standard JSON output, and providing structured access to elements within parsed arrays and objects. The implementation is built around a hand-written recursive descent parser that processes JSON text by matching grammar rules to build a structured data tree. Parsed values are stored in a tagged union r
Constructs nested arrays and objects by pushing and popping elements on a dynamic stack during parsing.
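The stack-based construction described above can be sketched roughly as follows. This is only an illustration in Python, not the tutorial's C code: the function name `parse_nested_arrays` is hypothetical, it handles only non-negative integers and nested arrays, and it keeps a stack of open arrays rather than whatever stack layout the tutorial actually uses.

```python
def parse_nested_arrays(text):
    """Parse text such as "[1,[2,3],[]]" into nested Python lists.

    Hypothetical sketch: supports only non-negative integers and arrays,
    and is deliberately lenient about separators (it is not a validator).
    """
    stack = []    # arrays that have been opened but not yet closed
    root = None   # the outermost array once it is closed
    digits = ""   # digits of the number currently being read

    for ch in text:
        if ch.isdigit():
            digits += ch
            continue
        # Any non-digit character ends the number in progress.
        if digits:
            if not stack:
                raise ValueError("number outside of an array")
            stack[-1].append(int(digits))
            digits = ""
        if ch == "[":
            stack.append([])               # push a new open array
        elif ch == "]":
            if not stack:
                raise ValueError("unbalanced ']'")
            finished = stack.pop()         # pop the array that just closed
            if stack:
                stack[-1].append(finished) # attach it to its parent
            else:
                root = finished
        elif ch in ", \t\n":
            continue
        else:
            raise ValueError(f"unexpected character {ch!r}")

    if stack or digits:
        raise ValueError("unterminated input")
    return root
```

Calling `parse_nested_arrays("[1,[2,3],[]]")` yields a list whose second element is the inner list `[2, 3]`; each `]` pops one level off the stack and hands the finished array to its parent.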
Lance is a versioned columnar data format and storage engine designed as a multimodal AI lakehouse. It serves as a vector database storage engine and a cloud object store dataset manager, organizing images, video, audio, and embeddings into a unified format optimized for machine learning workflows. The project distinguishes itself by combining a columnar layout for structured data with a specialized blob store for large multimodal tensors. It implements a hybrid search engine that integrates vector similarity search, full-text search, and SQL analytics on a single dataset, supported by a stor
Extracts values, checks existence, or measures array lengths within JSON columns using JSONPath syntax.
Ibis is a portable Python dataframe library and multi-backend query engine that provides a unified interface for executing data transformations across diverse compute engines. It functions as a Python SQL expression compiler and dialect transpiler, allowing users to define data logic once and execute it across cloud warehouses, embedded databases, and distributed clusters without rewriting code. The project distinguishes itself through a database backend abstraction that decouples transformation logic from the underlying execution engine. It enables polyglot data workflows by mixing raw SQL s
Identifies common elements between two array columns to determine shared values.
Just is a collection of JavaScript utility libraries designed for data manipulation, functional programming, performance optimization, statistical analysis, and string processing. It provides a set of tools for deep cloning, filtering, and transforming complex objects and arrays. The project is structured as a series of zero-dependency modules, allowing utilities to be used independently to minimize bundle size. It implements functional programming patterns including currying, piping, and partial application, and provides execution control through memoization, debouncing, and throttling. The
Returns a new array containing only the elements common to two source arrays.
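As a rough illustration of this kind of intersection, here is a minimal Python sketch. The function name `intersect` and the choices to preserve the order of the first array and to drop duplicates are assumptions made for the example; they are not taken from the Just library, which is written in JavaScript.

```python
def intersect(left, right):
    """Return a new list of the elements of `left` that also occur in `right`.

    Assumptions for this sketch: elements are hashable, the result keeps
    the order of `left`, and each common element appears once.
    """
    right_set = set(right)  # O(1) membership checks
    seen = set()            # elements already added to the result
    common = []
    for item in left:
        if item in right_set and item not in seen:
            seen.add(item)
            common.append(item)
    return common
```

Neither input list is modified; the result is always a fresh list.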
Carp is a statically typed Lisp compiler that compiles Lisp-like syntax directly to C source code, enabling seamless integration with existing C libraries and low-level system programming. It manages memory deterministically at compile time using ownership tracking and linear types, eliminating garbage collection pauses and runtime overhead while ensuring type safety through an inferred static type system. The language distinguishes itself through compile-time macro expansion and metaprogramming capabilities, allowing code generation and transformation before final binary output. It enforces
Stores fixed-size collections of elements allocated on the stack.
jsondiffpatch is a JSON diff and patch library designed to compute the differences between two JSON objects and apply those changes to synchronize state. It works as a synchronization tool that computes deltas and applies patches to update or restore complex, nested JavaScript objects. The project provides an implementation compatible with the RFC 6902 JSON Patch standard for atomic updates, along with a visual diff renderer that turns data deltas into human-readable HTML views. It also includes a text diff tool for precise, character-level analysis of long strings inside JSON values. The library covers a wide range of capabilities, including recursive delta generation and array diffing based on Longest Common Subsequence algorithms. It supports logical array diffing that detects moved items through custom hashing, and it offers several output formats, such as colored console output and a dedicated React component for visual comparisons.
Identifies moved, added, or deleted items within JSON arrays using custom hashing instead of simple index matching.
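The idea of matching array items by a custom hash rather than by index can be sketched as below. This is a simplified Python illustration, not jsondiffpatch's algorithm or API: `diff_by_hash` and its `object_hash` parameter are hypothetical names, hashes are assumed to be unique within each array, and any change of index is reported as a move, whereas an LCS-based diff would flag only items that fall outside the common subsequence.

```python
def diff_by_hash(before, after, object_hash):
    """Classify items as added, deleted, or moved using a caller-supplied hash.

    Hypothetical sketch: assumes `object_hash(item)` is unique per item
    within each list, and treats every index change as a move.
    """
    old_pos = {object_hash(item): i for i, item in enumerate(before)}
    new_pos = {object_hash(item): i for i, item in enumerate(after)}

    added = [key for key in new_pos if key not in old_pos]
    deleted = [key for key in old_pos if key not in new_pos]
    moved = [
        (key, old_pos[key], new_pos[key])   # (hash, old index, new index)
        for key in new_pos
        if key in old_pos and old_pos[key] != new_pos[key]
    ]
    return {"added": added, "deleted": deleted, "moved": moved}


before = [{"id": 1}, {"id": 2}, {"id": 3}]
after = [{"id": 3}, {"id": 1}, {"id": 4}]
result = diff_by_hash(before, after, object_hash=lambda item: item["id"])
```

With plain index matching, all three positions in this example would look like modified objects; keying on `id` instead identifies item 4 as added, item 2 as deleted, and items 3 and 1 as moved.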
Arroyo is a high-performance stream processing platform built in Rust. It executes continuous SQL queries on streaming data with event-time semantics, enabling accurate windowed aggregations, joins, and stateful computations on unbounded event streams. The platform uses native Rust execution for high throughput and low latency, with periodic checkpointing for exactly-once fault tolerance and horizontal scaling across distributed workers. The system integrates deeply with Kafka for reading and writing topics with exactly-once delivery and supports change data capture (CDC) from MySQL and Postg
Provides built-in SQL functions for JSONPath extraction and array transformation on streaming data.
Rueidis is a high-performance Redis client library for Go that provides a type-safe and asynchronous interface for interacting with Redis servers. It includes a full implementation of the Redis serialization protocol and a dedicated connection manager to handle pooling, multiplexing, and automatic pipelining. The library is distinguished by its support for RDMA connectivity to reduce latency and CPU overhead. It features a distributed lock manager that implements majority-based locking and optimistic concurrency control, as well as client-side caching with invalidation signals to minimize net
Includes a utility that scans JSON array results directly into slices of structured types.