# databendlabs/databend

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/databendlabs-databend).**

9,158 stars · 851 forks · Rust · other

## Links

- GitHub: https://github.com/databendlabs/databend
- Homepage: https://docs.databend.com
- awesome-repositories: https://awesome-repositories.com/repository/databendlabs-databend.md

## Topics

`ai` `bigdata` `cloud-native` `database` `elasticsearch` `geospatial` `lakehouse` `olap` `rust` `serverless` `snowflake` `sql` `vector-database` `vector-search`

## Description

Databend is a cloud-native data warehouse and OLAP database designed for large-scale analytics. It functions as a SQL-compliant engine and serverless analytics platform that separates compute from storage to allow for independent scaling.

The system integrates vector database capabilities, indexing high-dimensional embeddings to enable semantic, hybrid, and full-text searches across massive datasets. It further distinguishes itself through serverless compute management that automatically scales resources based on demand and shuts them down during idle periods.

The platform covers a broad set of analytical and management capabilities, including data versioning and branching, automatic schema evolution, and multi-tiered storage management. It also provides enterprise security management with role-based access control, data masking, and automated pipeline orchestration via stored procedures and sandboxed user-defined functions.

## Tags

### Data & Databases

- [Cloud-Native Databases](https://awesome-repositories.com/f/data-databases/database-management-systems/database-engines/cloud-native-databases.md) — Functions as a cloud-native database designed for elastic scaling and independent compute and storage management.
- [Columnar Storage Engines](https://awesome-repositories.com/f/data-databases/columnar-storage-engines.md) — Implements a storage engine that organizes data by column to optimize high-volume analytical read performance.
- [Data Analytics Engines](https://awesome-repositories.com/f/data-databases/data-analysis-visualization/analytical-platforms-engines/data-analytics-engines.md) — Executes complex analytical queries over massive datasets to generate business intelligence insights. ([source](https://cdn.jsdelivr.net/gh/databendlabs/databend@main/README.md))
- [Analytics Data Platforms](https://awesome-repositories.com/f/data-databases/data-collections-datasets/analytics-data-platforms.md) — Serves as a centralized platform for large-scale data aggregation and insight generation via a cloud-native warehouse.
- [Serverless Warehouses](https://awesome-repositories.com/f/data-databases/data-warehousing/serverless-warehouses.md) — Implements a serverless data warehouse architecture that scales compute automatically and separates it from storage.
- [Stateless Compute Scaling](https://awesome-repositories.com/f/data-databases/horizontal-database-scaling/trace-storage-scaling/stateless-compute-scaling.md) — Scales compute resources independently of storage to minimize costs during idle periods.
- [Object Storage Persistence](https://awesome-repositories.com/f/data-databases/object-storage-services/log-object-storage/object-storage-persistence.md) — Persists data in cloud object storage to decouple compute from storage for independent scaling.
- [OLAP Database Engines](https://awesome-repositories.com/f/data-databases/olap-database-engines.md) — Provides an OLAP database engine optimized for complex analytical queries and aggregate summaries.
- [SQL Engines](https://awesome-repositories.com/f/data-databases/sql-engines.md) — Implements a SQL-compliant engine that manages complex query execution over large-scale cloud storage.
- [Vector Databases](https://awesome-repositories.com/f/data-databases/vector-databases.md) — Integrates a vector database capable of indexing high-dimensional embeddings for semantic search.
- [Vector Indexing](https://awesome-repositories.com/f/data-databases/vector-indexing.md) — Provides high-dimensional indexing structures to enable fast similarity searches across embedding vectors.
- [Vector Search](https://awesome-repositories.com/f/data-databases/vector-search.md) — Provides high-dimensional vector search capabilities for semantic and hybrid retrieval across massive datasets.
- [Schema Evolution](https://awesome-repositories.com/f/data-databases/data-governance-modeling/data-modeling-schemas/schema-evolution.md) — Automatically adjusts table schemas to accommodate changes in incoming data structures without manual intervention.
- [Data Versioning](https://awesome-repositories.com/f/data-databases/data-management/table-data-managers/data-versioning.md) — Creates snapshots and branches of production data to enable experimentation and testing without affecting primary datasets. ([source](https://cdn.jsdelivr.net/gh/databendlabs/databend@main/README.md))
- [Hybrid Vector-Keyword Indexing](https://awesome-repositories.com/f/data-databases/hybrid-vector-keyword-indexing.md) — Combines semantic vector embeddings with keyword matching to provide hybrid information retrieval. ([source](https://cdn.jsdelivr.net/gh/databendlabs/databend@main/README.md))
- [Storage Tiering](https://awesome-repositories.com/f/data-databases/storage-tiering.md) — Moves data between different storage tiers based on access frequency to balance performance and cost.
- [Relational Vector Engines](https://awesome-repositories.com/f/data-databases/vector-databases/relational-vector-engines.md) — Unifies relational SQL analytics with vector similarity search to filter results using structured metadata. ([source](https://docs.databend.com/guides/query/vector-db))
- [Vector Similarity Search](https://awesome-repositories.com/f/data-databases/vector-similarity-search.md) — Performs semantic search by comparing vector embeddings using mathematical distance metrics. ([source](https://docs.databend.com/guides/query/vector-db))

### DevOps & Infrastructure

- [Serverless Databases](https://awesome-repositories.com/f/devops-infrastructure/serverless-backend-hosting/serverless-databases.md) — Provides a serverless database experience that automatically scales compute resources and supports scaling to zero. ([source](https://docs.databend.com/guides/cloud/))
- [Analytics Engines](https://awesome-repositories.com/f/devops-infrastructure/serverless-platforms/analytics-engines.md) — Offers a serverless analytics platform that automatically scales compute resources based on real-time demand.

### Security & Cryptography

- [Enterprise Data Governance](https://awesome-repositories.com/f/security-cryptography/enterprise-data-governance.md) — Enforces role-based access control and data masking to ensure security and compliance across large datasets.
- [Enterprise Security Controls](https://awesome-repositories.com/f/security-cryptography/enterprise-security-controls.md) — Enforces enterprise-grade security through role-based access control, data masking, and compliance audit logs. ([source](https://docs.databend.com/guides/cloud/))

### Software Engineering & Architecture

- [Shared-Nothing Processing Engines](https://awesome-repositories.com/f/software-engineering-architecture/shared-nothing-architectures/shared-nothing-processing-engines.md) — Distributes query processing across independent worker nodes to ensure high performance and availability.

### Part of an Awesome List

- [Database Systems](https://awesome-repositories.com/f/awesome-lists/data/database-systems.md) — Cloud-native DBMS for real-time data processing and analytics.
