3 مستودعات
Index-addressable sequences of data stored in a memory-efficient sparse format.
Distinct from Sparse Data Processing: Unlike sparse matrices or embeddings, this is a general-purpose sparse array for string sequences.
Explore 3 awesome GitHub repositories matching data & databases · Sparse Arrays. Refine with filters or upvote what's useful.
Redis is a high-performance in-memory key-value store that functions as a distributed cache, message broker, and NoSQL database. It provides sub-millisecond read and write access to data stored in RAM and can operate as a vector database for indexing high-dimensional embeddings. The system supports a wide range of data storage and synchronization primitives, including the management of strings, hashes, lists, sets, and JSON documents. It enables real-time data operations through atomic transactions, hybrid persistence using snapshots and append-only logs, and high-availability configurations
Supports storing index-addressable sequences of strings using a memory-efficient sparse format.
go-datastructures is a collection of thread-safe and lock-free data structures designed for high-performance concurrent applications in Go. It provides a modular library of specialized algorithmic toolsets, including a lock-free collection library and an immutable data structure library. The project distinguishes itself through a suite of persistent AVL trees and hash array mapped tries that use branch-copying to preserve previous versions. It also implements non-blocking hash maps, queues, and tries that enable linearizable snapshots and concurrent updates without the use of mutual exclusion
Tracks integer sets using compressed bit-fields to reduce memory overhead compared to traditional hash maps.
Scanpy is a Python library for the preprocessing, visualization, and analysis of large-scale single-cell gene expression datasets. It serves as a toolkit for single-cell RNA sequencing analysis, providing a framework to process and analyze genomic data from individual cells to identify biological markers and cell types. The library includes a scalable data processing pipeline for cleaning and preparing genomic data, a clustering framework for grouping cells with similar expression profiles, and a system for modeling transitions between cell states to reconstruct biological development and dif
Provides memory-efficient storage of high-dimensional gene expression matrices and cell metadata using sparse arrays.