1 repo
Techniques for mapping large data files directly into process memory for efficient access.
Distinguishing note: Focuses on I/O optimization for large-scale indices.
Explore 1 awesome GitHub repository matching data & databases · Memory-Mapped Storage. Refine with filters or upvote what's useful.
This project is a high-performance library designed for the similarity search and clustering of dense vectors across massive datasets. It functions as a vector similarity search engine, providing the necessary tools to organize complex numerical data into specialized structures that facilitate rapid retrieval and efficient querying of millions of records. The library distinguishes itself through a variety of advanced indexing and compression techniques, including hierarchical navigable small worlds for logarithmic time complexity and inverted file indexing to partition vector spaces into mana
Maps large index files directly into process address space to allow efficient access to data exceeding physical RAM capacity.