# milvus-io/milvus

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/milvus-io-milvus).**

42,889 stars · 3,833 forks · Go · apache-2.0

## Links

- GitHub: https://github.com/milvus-io/milvus
- Homepage: https://milvus.io
- awesome-repositories: https://awesome-repositories.com/repository/milvus-io-milvus.md

## Topics

`anns` `cloud-native` `diskann` `distributed` `embedding-database` `embedding-similarity` `embedding-store` `faiss` `golang` `hnsw` `image-search` `llm` `nearest-neighbor-search` `rag` `vector-database` `vector-search` `vector-similarity` `vector-store`

## Description

Milvus is a specialized vector database engine designed for the indexing, management, and high-speed similarity retrieval of high-dimensional vector embeddings. It functions as a similarity search engine capable of identifying nearest neighbors within large-scale vector spaces, supporting the storage and retrieval of billions of data points while maintaining consistent performance.

The system utilizes a distributed architecture that decouples storage, query, and coordination into independent services, allowing for horizontal scaling across clusters. It employs a global indexing mechanism that builds specialized data structures across immutable, independently indexed segments. This design, combined with a shared-storage decoupled model, enables compute and storage resources to scale independently in cloud environments, while a log-based persistence layer ensures data durability and state recovery.

The platform supports a wide range of data retrieval patterns, including retrieval-augmented generation, hybrid search, and multimodal data retrieval for text, images, and graphs. Deployment options range from lightweight local instances for rapid prototyping to robust standalone setups and fully managed distributed clusters. Documentation includes sizing tools to assist in estimating hardware requirements based on specific data volumes and operational patterns.

## Tags

### Data & Databases

- [Similarity Search Engines](https://awesome-repositories.com/f/data-databases/similarity-search-engines.md) — Provides high-performance similarity search capabilities for high-dimensional vector data. ([source](https://milvus.io/))
- [Vector Databases](https://awesome-repositories.com/f/data-databases/vector-databases.md) — Functions as a specialized storage engine optimized for indexing and high-speed similarity retrieval of vector embeddings.
- [Vector Search Engines](https://awesome-repositories.com/f/data-databases/vector-search-engines.md) — Provides high-speed similarity search capabilities across massive high-dimensional datasets.
- [Indexing Engines](https://awesome-repositories.com/f/data-databases/indexing-engines.md) — Implements specialized indexing structures to enable high-performance similarity searches across massive vector datasets.
- [Multimodal Databases](https://awesome-repositories.com/f/data-databases/multimodal-databases.md) — Acts as a unified storage environment for organizing and retrieving complex data types like text and images.
- [Data Partitioning](https://awesome-repositories.com/f/data-databases/data-partitioning.md) — Partitions data into immutable segments to optimize memory usage and parallel search performance.
- [Standalone Databases](https://awesome-repositories.com/f/data-databases/standalone-databases.md) — Provides a standalone configuration for single-machine environments. ([source](https://milvus.io/))
- [Write-Ahead Logs](https://awesome-repositories.com/f/data-databases/write-ahead-logs.md) — Ensures data durability and consistent state recovery by recording all mutations in a distributed message log.

### DevOps & Infrastructure

- [Distributed Database Clusters](https://awesome-repositories.com/f/devops-infrastructure/distributed-database-clusters.md) — Supports distributed architecture to handle horizontal scaling across clusters for large-scale production needs. ([source](https://milvus.io/))
- [Distributed Data Architectures](https://awesome-repositories.com/f/devops-infrastructure/distributed-data-architectures.md) — Implements a distributed architecture that supports horizontal scaling and high availability across clusters.
- [Distributed Data Infrastructure](https://awesome-repositories.com/f/devops-infrastructure/distributed-data-infrastructure.md) — Manages and scales complex data storage systems across multiple server nodes for production environments.
- [Storage Decoupling](https://awesome-repositories.com/f/devops-infrastructure/storage-decoupling.md) — Separates compute and storage nodes to allow independent scaling of processing power and data capacity.

### Artificial Intelligence & ML

- [Retrieval-Augmented Generation Frameworks](https://awesome-repositories.com/f/artificial-intelligence-ml/retrieval-augmented-generation-frameworks.md) — Provides the foundational infrastructure for building retrieval-augmented generation applications. ([source](https://milvus.io/docs/manage-collections.md))
- [Retrieval-Augmented Generation](https://awesome-repositories.com/f/artificial-intelligence-ml/retrieval-augmented-generation.md) — Enhances AI models by providing contextually relevant data retrieved from vector-based knowledge bases.
- [Hybrid Search Systems](https://awesome-repositories.com/f/artificial-intelligence-ml/hybrid-search-systems.md) — Implements hybrid search capabilities to combine vector similarity with other retrieval methods. ([source](https://milvus.io/docs/manage-collections.md))
- [Multimodal Retrieval Systems](https://awesome-repositories.com/f/artificial-intelligence-ml/multimodal-retrieval-systems.md) — Enables searching across diverse media types by utilizing shared vector representations.
- [Multimodal Search Engines](https://awesome-repositories.com/f/artificial-intelligence-ml/multimodal-search-engines.md) — Supports multimodal search patterns to query across diverse data types. ([source](https://milvus.io/docs/manage-collections.md))

### Software Engineering & Architecture

- [Microservice Architectures](https://awesome-repositories.com/f/software-engineering-architecture/microservice-architectures.md) — Decouples storage, query, and coordination into independent services to enable horizontal scaling.
