1 repo
Methods for finding relevant items in massive datasets based on mathematical proximity.
Distinguishing note: Focuses on the general similarity search domain.
Explore 1 awesome GitHub repository matching data & databases · Similarity Search. Refine with filters or upvote what's useful.
This project is a high-performance library designed for the similarity search and clustering of dense vectors across massive datasets. It functions as a vector similarity search engine, providing the necessary tools to organize complex numerical data into specialized structures that facilitate rapid retrieval and efficient querying of millions of records. The library distinguishes itself through a variety of advanced indexing and compression techniques, including hierarchical navigable small worlds for logarithmic time complexity and inverted file indexing to partition vector spaces into mana
Finds the most relevant items in massive datasets by comparing mathematical representations of data points based on their proximity.