# trinodb/trino

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/trinodb-trino).**

12,576 stars · 3,493 forks · Java · apache-2.0

## Links

- GitHub: https://github.com/trinodb/trino
- Homepage: https://trino.io
- awesome-repositories: https://awesome-repositories.com/repository/trinodb-trino.md

## Topics

`analytics` `big-data` `data-science` `database` `databases` `datalake` `delta-lake` `distributed-database` `distributed-systems` `hadoop` `hive` `iceberg` `java` `jdbc` `presto` `prestodb` `query-engine` `sql` `trino`

## Description

Trino is a distributed SQL query engine designed for large-scale data analytics. It functions as a data federation platform, providing a unified interface that allows users to execute complex analytical queries across multiple heterogeneous data sources simultaneously without requiring data movement or transformation.

The engine utilizes a massively parallel processing architecture to scale compute resources across clusters for high-speed data retrieval. It distinguishes itself through a cost-based query optimizer that analyzes metadata to determine efficient execution plans, alongside dynamic filtering propagation that reduces data movement by applying constraints directly to upstream sources.

The platform incorporates a comprehensive security framework that enforces granular, centralized authentication and authorization policies across all connected schemas. It also supports fault-tolerant task scheduling, which manages query state in persistent storage to enable automatic recovery from node or process failures during long-running analytical jobs.

## Tags

### Data & Databases

- [Distributed SQL Engines](https://awesome-repositories.com/f/data-databases/distributed-sql-engines.md) — Operates as a distributed SQL query engine for high-performance analytical processing across heterogeneous sources.
- [Federated Data Gateways](https://awesome-repositories.com/f/data-databases/federated-data-gateways.md) — Acts as a data federation platform that connects disparate storage systems for unified analytical access.
- [Federated Data Query Engines](https://awesome-repositories.com/f/data-databases/federated-data-query-engines.md) — Enables federated querying across multiple disparate data sources without requiring data movement or transformation. ([source](https://trino.io/docs/current/))
- [Data Analytics Engines](https://awesome-repositories.com/f/data-databases/data-analysis-visualization/analytical-platforms-engines/data-analytics-engines.md) — Functions as a high-performance engine for distributed data analytics and large-scale reporting.
- [Parallel Processing](https://awesome-repositories.com/f/data-databases/parallel-processing.md) — Utilizes a massively parallel processing engine to scale compute resources for high-speed data retrieval.
- [External Data Connectors](https://awesome-repositories.com/f/data-databases/external-data-connectors.md) — Provides a connector-based architecture to integrate and query diverse external data sources through a unified interface.
- [Cost-Based Optimizers](https://awesome-repositories.com/f/data-databases/query-optimizers/cost-based-optimizers.md) — Utilizes cost-based optimization to analyze metadata and statistics for generating efficient query execution plans.
- [Dynamic Filtering](https://awesome-repositories.com/f/data-databases/dynamic-filtering.md) — Implements dynamic filtering to push constraints to upstream sources and minimize data movement during query execution.

### Security & Cryptography

- [Access Control Systems](https://awesome-repositories.com/f/security-cryptography/identity-access-management/authentication-strategies/authorization-and-user-administration/access-control-authorization/access-control-systems.md) — Provides an enterprise access control system for centralized authentication and authorization across data schemas.
- [Access Control](https://awesome-repositories.com/f/security-cryptography/security/policies/access-control.md) — Enforces granular access control policies to restrict user access to specific data sets and schemas. ([source](https://trino.io/docs/current/))
- [Enterprise Security Controls](https://awesome-repositories.com/f/security-cryptography/enterprise-security-controls.md) — Ensures enterprise data security through centralized authentication and authorization across the organization.
- [Security Providers](https://awesome-repositories.com/f/security-cryptography/authentication-providers/security-providers.md) — Integrates pluggable security providers to enforce centralized authentication and authorization policies across data sources.

### Software Engineering & Architecture

- [Fault Tolerance](https://awesome-repositories.com/f/software-engineering-architecture/fault-tolerance.md) — Supports fault-tolerant task scheduling to ensure query resilience and automatic recovery from node failures.
- [Fault-Tolerant Workflow Execution](https://awesome-repositories.com/f/software-engineering-architecture/fault-tolerant-workflow-execution.md) — Provides fault-tolerant execution to automatically recover and resume long-running analytical queries after node interruptions. ([source](https://trino.io/docs/current/))
