# taosdata/TDengine

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/taosdata-tdengine).**

24,734 stars · 4,999 forks · C · agpl-3.0

## Links

- GitHub: https://github.com/taosdata/TDengine
- Homepage: https://tdengine.com
- awesome-repositories: https://awesome-repositories.com/repository/taosdata-tdengine.md

## Topics

`bigdata` `cloud-native` `cluster` `connected-vehicles` `database` `distributed` `financial-analysis` `industrial-iot` `iot` `metrics` `monitoring` `scalability` `sql` `tdengine` `time-series` `time-series-database` `tsdb`

## Description

TDengine is a distributed time-series database designed for the high-speed ingestion, compression, and retrieval of timestamped metrics and sensor data. It functions as a SQL-compatible analytics engine, allowing users to perform complex operations on massive volumes of time-ordered information using standard relational syntax. The platform is built to serve as a backend foundation for industrial IoT environments, managing real-time data streams and device metadata through a cluster-based architecture.

The system distinguishes itself through a distributed sharding architecture that uses consistent hashing to ensure horizontal scalability and high-throughput ingestion. It employs a log-structured write path to minimize disk seek latency and utilizes super-table virtualization to provide a unified logical view across multiple physical tables. To maintain performance and cost-efficiency, the database features automated multi-tiered lifecycle management, which migrates data between high-performance memory and low-cost storage based on age and access frequency.

Beyond its core storage capabilities, the platform provides robust tools for edge-to-cloud synchronization, ensuring consistent data states across geographically distributed infrastructure. It includes built-in support for real-time stream processing, allowing for the analysis of live data without requiring external message queues. The system also incorporates comprehensive security frameworks, including user access control, audit logging, and encrypted transport protocols to protect sensitive operational data.

Developers can interact with the database through native client libraries that support connection pooling and query parameter binding. The system is documented with comprehensive error code diagnostics and provides command-line utilities for cluster administration, health monitoring, and configuration management.

## Tags

### Data & Databases

- [Analytics Engines](https://awesome-repositories.com/f/data-databases/analytics-engines.md) — Provides a SQL-compatible query layer for performing complex operations on massive volumes of time-ordered data.
- [Columnar Storage Engines](https://awesome-repositories.com/f/data-databases/columnar-storage-engines.md) — Organizes data into time-ordered blocks and columns to maximize compression and accelerate sequential read operations.
- [Time Series Databases](https://awesome-repositories.com/f/data-databases/time-series-databases.md) — Optimizes high-speed ingestion, compression, and retrieval of timestamped metrics and sensor readings.
- [Distributed Databases](https://awesome-repositories.com/f/data-databases/distributed-databases.md) — Provides a cluster-based architecture for high availability and fault tolerance.
- [Distributed Sharding Architectures](https://awesome-repositories.com/f/data-databases/distributed-sharding-architectures.md) — Partitions data across multiple nodes using consistent hashing to ensure horizontal scalability and high-throughput ingestion.
- [High-Performance Databases](https://awesome-repositories.com/f/data-databases/high-performance-databases.md) — Handles massive volumes of timestamped data with high write throughput.
- [Time-Series Data Modeling](https://awesome-repositories.com/f/data-databases/time-series-data-modeling.md) — Models time-series data by organizing metrics into tables for efficient storage and retrieval of sensor data. ([source](https://docs.tdengine.com/basic/))
- [Time-Series SQL Querying](https://awesome-repositories.com/f/data-databases/time-series-sql-querying.md) — Supports querying time-series data using standard SQL syntax to perform analysis and retrieve metrics. ([source](https://docs.tdengine.com/basic/))
- [Data Compression Algorithms](https://awesome-repositories.com/f/data-databases/data-compression-algorithms.md) — Compresses data during storage and transmission using specialized algorithms to reduce storage footprint. ([source](https://docs.tdengine.com/tdinternal/))
- [Log-Structured Storage](https://awesome-repositories.com/f/data-databases/log-structured-storage.md) — Appends incoming data to memory-resident buffers and sequential log files to minimize disk seek latency during writes.
- [Query Languages](https://awesome-repositories.com/f/data-databases/query-languages.md) — Supports standard SQL commands for querying and modifying database objects. ([source](https://docs.tdengine.com/develop/))
- [Backup and Recovery Systems](https://awesome-repositories.com/f/data-databases/backup-and-recovery-systems.md) — Supports cluster data backup and restoration to recover from accidental loss or damage. ([source](https://docs.tdengine.com/operation/))
- [Edge-to-Cloud Synchronization](https://awesome-repositories.com/f/data-databases/edge-to-cloud-synchronization.md) — Synchronizes time-series data automatically between edge and cloud nodes to ensure consistent information. ([source](https://docs.tdengine.com/advanced/))
- [High-Throughput Ingestion Pipelines](https://awesome-repositories.com/f/data-databases/high-throughput-ingestion-pipelines.md) — Streams large volumes of information using automated batching and sharding to maximize throughput. ([source](https://docs.tdengine.com/develop/))
- [Data Lifecycle Management](https://awesome-repositories.com/f/data-databases/data-lifecycle-management.md) — Automatically migrates data between high-performance memory and low-cost disk tiers based on access frequency and age.
- [Data Stream Subscriptions](https://awesome-repositories.com/f/data-databases/data-stream-subscriptions.md) — Provides built-in interfaces to subscribe to real-time data streams for downstream processing. ([source](https://docs.tdengine.com/tdinternal/))
- [Storage Tiering](https://awesome-repositories.com/f/data-databases/storage-tiering.md) — Optimizes storage costs by automatically moving older data to lower-cost media tiers. ([source](https://docs.tdengine.com/operation/))
- [Stream Processing](https://awesome-repositories.com/f/data-databases/stream-processing.md) — Enables real-time analysis and alerting directly on incoming data streams.
- [Query Parameterization](https://awesome-repositories.com/f/data-databases/query-parameterization.md) — Increases query performance and security by binding variables to database statements. ([source](https://docs.tdengine.com/develop/))
- [Real-Time Data Caching](https://awesome-repositories.com/f/data-databases/real-time-data-caching.md) — Caches incoming device status and calculation results to support time-sensitive monitoring. ([source](https://docs.tdengine.com/advanced/))
- [Schema-Agnostic Ingestion](https://awesome-repositories.com/f/data-databases/schema-agnostic-ingestion.md) — Allows for the ingestion of unstructured data points without requiring predefined table structures or rigid schema definitions.
- [Schemaless Data Stores](https://awesome-repositories.com/f/data-databases/schemaless-data-stores.md) — Records information without defining a rigid structure beforehand to maintain flexibility. ([source](https://docs.tdengine.com/develop/))

### DevOps & Infrastructure

- [Fault Tolerance Strategies](https://awesome-repositories.com/f/devops-infrastructure/fault-tolerance-strategies.md) — Implements fault tolerance and disaster recovery strategies including real-time data synchronization across locations. ([source](https://docs.tdengine.com/operation/))
- [Data Synchronization](https://awesome-repositories.com/f/devops-infrastructure/data-synchronization.md) — Maintains consistent data across geographically distributed edge and cloud infrastructure.

### Hardware & IoT

- [Industrial IoT Platforms](https://awesome-repositories.com/f/hardware-iot/industrial-iot-platforms.md) — Provides real-time ingestion and processing for industrial sensor data.
- [IoT Infrastructure](https://awesome-repositories.com/f/hardware-iot/iot-infrastructure.md) — Provides a backend foundation for managing real-time data and device metadata.
- [IoT Integrations](https://awesome-repositories.com/f/hardware-iot/iot-integrations.md) — Serves as a backend for IoT platforms to manage devices and sensor data. ([source](https://docs.tdengine.com/third-party/))

### Security & Cryptography

- [Identity and Access Management](https://awesome-repositories.com/f/security-cryptography/identity-access-management.md) — Manages user access and permissions for system resources, databases, and tables to enforce data security. ([source](https://docs.tdengine.com/operation/))
- [Cluster Security](https://awesome-repositories.com/f/security-cryptography/cluster-security.md) — Protects sensitive data through IP whitelisting, audit logging, and database encryption. ([source](https://docs.tdengine.com/operation/))

### System Administration & Monitoring

- [Cluster Health Monitoring](https://awesome-repositories.com/f/system-administration-monitoring/cluster-health-monitoring.md) — Collects and stores cluster performance metrics into a dedicated database instance to ensure stable operation. ([source](https://docs.tdengine.com/operation/))

### User Interface & Experience

- [Data Visualization](https://awesome-repositories.com/f/user-interface-experience/data-visualization.md) — Connects to visualization software for real-time monitoring of data trends. ([source](https://docs.tdengine.com/third-party/))