LanceDB is a vector database and columnar data store designed to function as a versioned dataset manager and vector search engine. It serves as a high-performance backend for indexing and retrieving high-dimensional embeddings, providing the foundation for machine learning data pipelines. The system distinguishes itself through a combination of cloud-native object storage and immutable version tracking, allowing for data time-travel and reproducible AI experiments. It integrates hybrid search capabilities, merging dense vector similarity with BM25 full-text search and SQL-like scalar filters
jetson-inference is a set of libraries and tools for executing optimized deep learning models on embedded GPU hardware. Its primary purpose is to enable real-time computer vision and AI inference at the edge with low latency and high throughput. The project distinguishes itself through high-performance streaming analytics and the ability to execute concurrent AI pipelines on auto-grade silicon. It provides specialized support for multi-sensor stream processing, utilizing zero-copy data transport to load camera frames directly into GPU memory. The codebase covers a broad surface of capabiliti
zvec is an embedded vector database engine and indexing library designed for high-dimensional similarity search. It functions as a hybrid search engine and a retrieval-augmented generation knowledge base, allowing for the storage and retrieval of dense and sparse vectors. The system is distinguished by its hybrid retrieval pipeline, which fuses vector similarity, full-text keyword matching, and scalar metadata filtering into single query operations. It supports a plugin-based model integration system for registering custom embedding models and rerankers, as well as language bindings for nativ
Infinity is a distributed vector database and multimodal vector store designed to manage large-scale datasets for retrieval and similarity search. It serves as a backend for large language model applications and retrieval augmented generation pipelines by storing and retrieving dense vectors, sparse vectors, and full-text data. The system functions as a hybrid search engine, combining vector embeddings and full-text search with reranking algorithms to identify the most relevant documents. It supports multimodal data storage, allowing the maintenance of diverse data types including tensors, st
ruvector is a Rust-based vector store and graph database designed for local inference and nearest neighbor searches. It utilizes a vector graph database architecture and a graph neural network index to refine search rankings through structural attention. The system includes a hardware-accelerated quantum circuit simulator for executing state-vector simulations and complex search patterns, alongside a WebAssembly inference engine for running vector search and model execution…
The main features of ruvnet/ruvector are: Vector Databases, Local AI Inference, Local LLM Execution, Result Reranking, Packaged Deployments, GPU-Accelerated Vector Indexing, GNN Reranking Layers, Graph Relationship Queries.
Open-source alternatives to ruvnet/ruvector include: lancedb/lancedb — LanceDB is a vector database and columnar data store designed to function as a versioned dataset manager and vector… dusty-nv/jetson-inference — jetson-inference is a set of libraries and tools for executing optimized deep learning models on embedded GPU… alibaba/zvec — zvec is an embedded vector database engine and indexing library designed for high-dimensional similarity search. It… infiniflow/infinity — Infinity is a distributed vector database and multimodal vector store designed to manage large-scale datasets for… qdrant/qdrant — Qdrant is a high-performance vector similarity database designed to store, index, and search high-dimensional vectors… memgraph/memgraph — Memgraph is an in-memory, distributed graph database designed for high-performance labeled property graph management.…