# apache/hive

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/apache-hive).**

6,012 stars · 4,791 forks · Java · apache-2.0

## Links

- GitHub: https://github.com/apache/hive
- Homepage: https://hive.apache.org/
- awesome-repositories: https://awesome-repositories.com/repository/apache-hive.md

## Topics

`apache` `big-data` `database` `hadoop` `hive` `java` `sql`

## Description

Apache Hive is a SQL-on-Hadoop data warehouse that enables querying and managing petabytes of data stored in distributed storage such as HDFS and cloud storage services. It provides a familiar SQL interface for batch analytics and reporting, supported by a core set of components including the HiveServer2 Thrift service for remote query execution, the Hive Metastore Service for central metadata management, the Hive ACID Transaction Engine for concurrent read-write operations, and the Hive LLAP Interactive Engine for low-latency analytical processing. The WebHCat REST API offers an HTTP interface for submitting Hive, MapReduce, and Pig jobs and managing HCatalog metadata.

Hive distinguishes itself through its multi-engine query execution, allowing queries to run on Apache Spark, Tez, or MapReduce to balance performance and resource usage across different workloads. It supports external data federation, enabling direct querying of remote databases, Druid, HBase, and Iceberg tables without moving data. Enterprise security integration provides authentication via Kerberos, LDAP, SAML, JWT, or OAuth2, with fine-grained access control through Apache Ranger. The cost-based optimizer, materialized views, and LLAP persistent daemon work together to deliver sub-second query responses on large datasets.

The platform offers comprehensive data management capabilities including ACID transactions, multiple storage formats such as ORC, Parquet, Avro, and RCFile, and support for cloud storage on S3, Azure Data Lake, and Google Cloud Storage. It includes a pluggable SerDe abstraction layer for custom data formats and a storage handler interface for connecting to external systems like HBase, Druid, Kudu, and JDBC sources. Advanced SQL features cover windowed aggregation, grouping sets, common table expressions, and geospatial calculations, while extensibility is provided through user-defined functions, custom MapReduce scripts, and procedural SQL execution.

Hive can be deployed via stable release tarballs, Docker containers, or Amazon EMR, and includes command-line tools like Beeline and HCatalog for interactive and batch query execution. Monitoring and observability features allow inspection of query execution plans, job status tracking, and runtime metrics viewing.

## Tags

### Data & Databases

- [Hadoop](https://awesome-repositories.com/f/data-databases/data-warehousing/hadoop.md) — Provides a SQL-on-Hadoop data warehouse for querying petabytes of distributed data.
- [Distributed SQL Querying](https://awesome-repositories.com/f/data-databases/distributed-sql-querying.md) — Executes SQL queries against petabytes of data in distributed storage for batch analytics. ([source](https://cdn.jsdelivr.net/gh/apache/hive@master/README.md))
- [Hadoop Data Warehouses](https://awesome-repositories.com/f/data-databases/sql-engines/hadoop-data-warehouses.md) — Provides a SQL-on-Hadoop data warehouse that queries and manages petabytes of data stored in distributed storage.
- [Concurrent Read-Write Transactions](https://awesome-repositories.com/f/data-databases/acid-transactional-cores/concurrent-read-write-transactions.md) — Executes concurrent read and write transactions on tables with optimistic or pessimistic locking and lockless reads. ([source](https://hive.apache.org/docs/latest/overview-of-major-changes/))
- [Data Lake Transaction Managers](https://awesome-repositories.com/f/data-databases/acid-transactional-cores/data-lake-transaction-managers.md) — Implements an ACID transaction manager for concurrent reads and writes on ORC tables.
- [ORC Transaction Managers](https://awesome-repositories.com/f/data-databases/acid-transactional-cores/data-lake-transaction-managers/orc-transaction-managers.md) — Provides ACID transactions on ORC tables for concurrent read-write operations.
- [Columnar Storage Engines](https://awesome-repositories.com/f/data-databases/columnar-storage-engines.md) — Uses the Optimized Row Columnar (ORC) format to improve read, write, and compression performance. ([source](https://hive.apache.org/docs/latest/language/))
- [Metastore Integrations](https://awesome-repositories.com/f/data-databases/connection-management/metastore-integrations.md) — Provides a central metadata repository serving table schemas to Hive, Spark, and Impala.
- [Hive Metastore Services](https://awesome-repositories.com/f/data-databases/connection-management/metastore-integrations/hive-metastore-services.md) — Provides a central metadata repository that stores table schemas and partition information for Hive and other data lake tools.
- [Cloud Native Object Storage](https://awesome-repositories.com/f/data-databases/data-engineering-infrastructure/data-persistence-storage/storage-solutions/cloud-native-object-storage.md) — Natively supports S3, Azure Data Lake, and Google Cloud Storage for cloud-native data warehousing. ([source](http://hive.apache.org/))
- [Automatic Partition Insertions](https://awesome-repositories.com/f/data-databases/data-partitioning/partitioned-synchronization/automatic-partition-insertions.md) — Automatically creates partitions during data insertion, eliminating manual partition management. ([source](https://hive.apache.org/docs/latest/user/tutorial))
- [Partition Key Filters](https://awesome-repositories.com/f/data-databases/data-querying/table-item-filters/column-filters/partition-key-filters.md) — Restricts query scans to specific partitions using predicates on partition key columns. ([source](https://hive.apache.org/docs/latest/language/))
- [Data Serialization Formats](https://awesome-repositories.com/f/data-databases/data-serialization-formats.md) — Converts data between storage and in-memory representations using SerDes for multiple formats. ([source](https://hive.apache.org/docs/latest/hcatalog/))
- [SQL Data Loading and Transformation](https://awesome-repositories.com/f/data-databases/data-transformation-rules/load-time-transformations/sql-data-loading-and-transformation.md) — Loads data into tables and transforms it using INSERT, UPDATE, DELETE, and scripting operations. ([source](https://hive.apache.org/docs/latest/language/languagemanual))
- [SQL Data Analysis](https://awesome-repositories.com/f/data-databases/database-connectivity/elasticsearch-connectors/sql-data-analysis.md) — Provides a SQL interface for ad-hoc querying, summarization, and analysis of large datasets. ([source](https://hive.apache.org/docs/latest/user/))
- [Table Schemas](https://awesome-repositories.com/f/data-databases/database-management-systems/database-systems-management/database-management/schema-designers/table-schemas.md) — Creates, renames, adds columns to, and drops tables and partitions while preserving schema. ([source](https://hive.apache.org/docs/latest/user/tutorial))
- [Tez-Based Query Executions](https://awesome-repositories.com/f/data-databases/database-management-systems/database-systems-management/database-operations/sql-query-execution/tez-based-query-executions.md) — Executes SQL queries using the Tez framework for lower latency than MapReduce. ([source](https://cdn.jsdelivr.net/gh/apache/hive@master/README.md))
- [Database Object Management](https://awesome-repositories.com/f/data-databases/database-object-management.md) — Creates, alters, drops, and describes databases, tables, views, functions, indexes, and partitions. ([source](https://hive.apache.org/docs/latest/language/))
- [Data Lake Table Metadata Stores](https://awesome-repositories.com/f/data-databases/distributed-key-value-stores/centralized-service-metadata/data-lake-table-metadata-stores.md) — Stores and serves table and partition metadata via a service API for Hive, Spark, Impala, and other data lake tools. ([source](http://hive.apache.org/))
- [SQL Query Interfaces](https://awesome-repositories.com/f/data-databases/distributed-storage/sql-query-interfaces.md) — Queries files stored in HDFS, HBase, or other storage systems through a unified SQL interface. ([source](https://hive.apache.org/docs/latest/introduction-to-apache-hive/))
- [Federated Data Query Engines](https://awesome-repositories.com/f/data-databases/federated-data-query-engines.md) — Links Hive to remote databases via data connectors for federated query execution across disparate sources. ([source](https://hive.apache.org/docs/latest/user/))
- [File-Based Data Import](https://awesome-repositories.com/f/data-databases/file-based-data-import.md) — Imports data from HDFS files into Hive tables using external table definitions or load commands. ([source](https://hive.apache.org/docs/latest/user/tutorial))
- [Key-Based Partitioning](https://awesome-repositories.com/f/data-databases/indexing-and-search/index-based-pruning/key-based-partitioning.md) — Divides table data into storage units based on column values for faster query analysis. ([source](https://hive.apache.org/docs/latest/user/tutorial))
- [Low-Latency Analytical Queries](https://awesome-repositories.com/f/data-databases/low-latency-analytical-queries.md) — Delivers sub-second query responses by reusing persistent query infrastructure and optimized data caches. ([source](http://hive.apache.org/))
- [In-Memory Caching Engines](https://awesome-repositories.com/f/data-databases/low-latency-analytical-queries/in-memory-caching-engines.md) — Provides an in-memory caching engine for sub-second query responses on large datasets.
- [Parquet Data Processing](https://awesome-repositories.com/f/data-databases/parquet-scanners/parquet-data-processing.md) — Reads and writes data in the Parquet columnar format for efficient storage and querying. ([source](https://hive.apache.org/docs/latest/user/))
- [Multi-Engine Execution Backends](https://awesome-repositories.com/f/data-databases/query-execution-engines/multi-engine-execution-backends.md) — Executes queries on Spark, Tez, or MapReduce to balance performance and resource usage.
- [Cost-Based Optimizers](https://awesome-repositories.com/f/data-databases/query-optimizers/cost-based-optimizers.md) — Uses a cost-based optimizer with table statistics and materialized views for query planning.
- [Low-Latency Query Engines](https://awesome-repositories.com/f/data-databases/query-performance-tuning/low-latency-query-engines.md) — Ships the LLAP interactive engine for sub-second query responses on large datasets. ([source](https://hive.apache.org/docs/latest/introduction-to-apache-hive/))
- [Remote Table Connectivity](https://awesome-repositories.com/f/data-databases/remote-table-connectivity.md) — Maps remote databases or schemas to Hive databases, making all tables queryable without manual mapping. ([source](https://hive.apache.org/docs/latest/user/data-connectors-in-hive/))
- [Row Aggregations](https://awesome-repositories.com/f/data-databases/row-expansion/row-aggregations.md) — Combines multiple input rows into a single output row, computing statistics like count, sum, average, and percentile. ([source](https://hive.apache.org/docs/latest/language/hive-udfs/))
- [Data Lake](https://awesome-repositories.com/f/data-databases/schema-management/data-lake.md) — Provides a central metastore for managing schemas across Hive, Spark, Impala, and other tools.
- [SQL Query Optimizations](https://awesome-repositories.com/f/data-databases/sql-query-optimizations.md) — Uses a cost-based optimizer to automatically improve SQL query performance and resource usage. ([source](http://hive.apache.org/))
- [Managed and External Tables](https://awesome-repositories.com/f/data-databases/virtual-table-querying/table-creation/managed-and-external-tables.md) — Defines tables whose data is either managed by Hive or stored at a user-specified location. ([source](https://hive.apache.org/docs/latest/language/))
- [Interactive](https://awesome-repositories.com/f/data-databases/batch-query-execution/interactive.md) — Executes HiveQL statements from a shell utility in both interactive and batch script modes. ([source](https://hive.apache.org/docs/latest/language/))
- [Iceberg Table Lifecycle Management](https://awesome-repositories.com/f/data-databases/big-data-processing/table-managers/iceberg-table-lifecycle-management.md) — Manages large analytic datasets in Iceberg format with snapshot isolation, branching, tagging, and full DML support. ([source](https://hive.apache.org/docs/latest/overview-of-major-changes/))
- [Common Table Expressions](https://awesome-repositories.com/f/data-databases/common-table-expressions.md) — Defines a temporary result set within a WITH clause that can be referenced by a single SELECT, INSERT, or CREATE statement. ([source](https://hive.apache.org/docs/latest/language/))
- [ACID Table Compaction Optimizations](https://awesome-repositories.com/f/data-databases/compaction-strategies/acid-table-compaction-optimizations.md) — Rebalances and prioritizes compaction requests for ACID and Iceberg tables to improve storage and query performance. ([source](https://hive.apache.org/docs/latest/))
- [Continuous Data Imports](https://awesome-repositories.com/f/data-databases/continuous-data-imports.md) — Streams small batches of records into an existing Hive partition or table, making them immediately visible to subsequent queries. ([source](https://hive.apache.org/docs/latest/user/))
- [Custom MapReduce Script Embedding](https://awesome-repositories.com/f/data-databases/custom-mapreduce-script-embedding.md) — Embeds user-provided mapper and reducer scripts within SQL queries using the TRANSFORM clause. ([source](https://hive.apache.org/docs/latest/language/))
- [Storage Compaction Utilities](https://awesome-repositories.com/f/data-databases/data-engineering-infrastructure/data-persistence-storage/data-storage/specialized-database-engines/time-series-data-storage/storage-compaction-utilities.md) — Rebalances and prioritizes compaction of ACID and Iceberg tables to optimize storage and query performance. ([source](https://hive.apache.org/docs/latest/overview-of-major-changes/))
- [Catalog Integration](https://awesome-repositories.com/f/data-databases/data-export/catalog-integration.md) — Connects to an Iceberg REST catalog secured with OAuth2 to manage tables with snapshots and branches. ([source](https://hive.apache.org/docs/latest/))
- [Data Import and Export](https://awesome-repositories.com/f/data-databases/data-import-and-export.md) — Exports and imports table or partition data and metadata between Hive instances. ([source](https://hive.apache.org/docs/latest/language/))
- [Spark Execution Backends](https://awesome-repositories.com/f/data-databases/data-processing-configurations/execution-engines/multi-engine-execution-backends/spark-execution-backends.md) — Supports running Hive queries on Apache Spark for accelerated performance. ([source](https://hive.apache.org/docs/latest/user/))
- [S3 Querying](https://awesome-repositories.com/f/data-databases/data-querying/s3-querying.md) — Declares tables over S3 data and runs SQL queries against it using EC2 compute resources. ([source](https://hive.apache.org/docs/latest/user/))
- [Cross-Cluster Table Replication](https://awesome-repositories.com/f/data-databases/data-replication/cross-cluster-table-replication.md) — Bootstraps and checkpoints replication of tables, including external tables, with tracking metrics for consistency. ([source](https://hive.apache.org/docs/latest/))
- [Avro Table Reads and Writes](https://awesome-repositories.com/f/data-databases/data-serialization/avro-decoding/avro-table-reads-and-writes.md) — Reads and writes Avro-encoded data as Hive tables, inferring the table schema from the Avro schema and supporting nested structures. ([source](https://hive.apache.org/docs/latest/user/))
- [Beeline Query Execution](https://awesome-repositories.com/f/data-databases/database-management-systems/database-systems-management/database-operations/sql-query-execution/beeline-query-execution.md) — Connects to HiveServer2 via the Beeline CLI client to run SQL queries interactively. ([source](https://hive.apache.org/development/quickstart))
- [Map Join Optimizations](https://awesome-repositories.com/f/data-databases/database-query-joins/map-join-optimizations.md) — Automatically selects map joins for star-schema queries by loading smaller tables into memory as hash tables. ([source](https://hive.apache.org/docs/latest/language/))
- [Predicate Pushdown Joins](https://awesome-repositories.com/f/data-databases/database-query-joins/predicate-pushdown-joins.md) — Applies filter predicates to all joined tables to avoid scanning unnecessary partitions. ([source](https://hive.apache.org/docs/latest/user/))
- [Date and Time Libraries](https://awesome-repositories.com/f/data-databases/date-and-time-libraries.md) — Provides built-in SQL functions for parsing, formatting, and extracting date and time components. ([source](https://hive.apache.org/docs/latest/language/hive-udfs/))
- [Disaster Recovery Solutions](https://awesome-repositories.com/f/data-databases/disaster-recovery-solutions.md) — Performs bootstrap and incremental replication for robust backup and business continuity. ([source](http://hive.apache.org/))
- [SQL Geospatial Analysis](https://awesome-repositories.com/f/data-databases/geospatial-data-services/geospatial-and-location-services/sql-geospatial-analysis.md) — Processes spatial data using ESRI-based functions for geometry operations and analysis. ([source](https://hive.apache.org/docs/latest/language/hive-udfs/))
- [Multi-Dimensional Aggregations](https://awesome-repositories.com/f/data-databases/grouped-aggregations/multi-dimensional-aggregations.md) — Computes subtotals and grand totals across multiple dimensions using CUBE, ROLLUP, and GROUPING SETS operators. ([source](https://hive.apache.org/docs/latest/language/))
- [Tez Execution Backends](https://awesome-repositories.com/f/data-databases/mapreduce-processing-engines/tez-execution-backends.md) — Supports running queries on Apache Tez for lower-latency DAG-based execution. ([source](https://hive.apache.org/docs/latest/introduction-to-apache-hive/))
- [Materialized Views](https://awesome-repositories.com/f/data-databases/materialized-views.md) — Pre-computes and stores summary results, then rewrites incoming queries to use those summaries for faster execution. ([source](https://hive.apache.org/docs/latest/language/))
- [Pluggable Connector Frameworks](https://awesome-repositories.com/f/data-databases/pluggable-storage-drivers/pluggable-connector-frameworks.md) — Provides a pluggable connector framework for mapping external systems like HBase and Druid to Hive tables.
- [Distributed Row Distributions](https://awesome-repositories.com/f/data-databases/result-sorting/distributed-row-distributions.md) — Orders or distributes rows across reducers using ORDER BY, SORT BY, DISTRIBUTE BY, or CLUSTER BY clauses. ([source](https://hive.apache.org/docs/latest/language/))
- [Custom SQL Functions](https://awesome-repositories.com/f/data-databases/sql-aggregate-functions/custom-sql-functions.md) — Adds user-defined functions, aggregates, and table functions to SQL for custom data processing. ([source](https://hive.apache.org/docs/latest/))
- [Custom Aggregation Function Writing](https://awesome-repositories.com/f/data-databases/sql-aggregate-functions/custom-sql-functions/custom-aggregation-function-writing.md) — Extends built-in aggregation by implementing user-defined aggregation functions in Java. ([source](https://hive.apache.org/docs/latest/language/))
- [Custom Scalar Function Writing](https://awesome-repositories.com/f/data-databases/sql-aggregate-functions/custom-sql-functions/custom-scalar-function-writing.md) — Extends built-in SQL functions by implementing user-defined scalar functions in Java. ([source](https://hive.apache.org/docs/latest/language/))
- [Procedural SQL Execution](https://awesome-repositories.com/f/data-databases/sql-execution-interfaces/procedural-sql-execution.md) — Executes procedural SQL code, including PL/SQL and T-SQL, on Hive and other engines. ([source](https://hive.apache.org/docs/latest/user/))
- [Cost-Based Compilation Optimizations](https://awesome-repositories.com/f/data-databases/sql-query-optimizations/cost-based-compilation-optimizations.md) — Compiles SQL queries using materialized views, anti-joins, branch pruning, and histogram statistics for faster execution. ([source](https://hive.apache.org/docs/latest/overview-of-major-changes/))
- [Pluggable Serialization Frameworks](https://awesome-repositories.com/f/data-databases/storage-abstraction-layers/pluggable-serialization-frameworks.md) — Provides a pluggable SerDe layer that supports multiple storage formats like ORC, Parquet, and Avro.
- [Procedural SQL Execution](https://awesome-repositories.com/f/data-databases/stored-procedures/procedural-code-conversion/procedural-sql-execution.md) — Executes procedural SQL code alongside standard queries for complex data transformations. ([source](https://hive.apache.org/docs/latest/introduction-to-apache-hive/))
- [Query Result Persistence](https://awesome-repositories.com/f/data-databases/task-result-storage/query-result-persistence.md) — Writes the output of a SELECT statement into another table, a local file, or a distributed filesystem directory. ([source](https://hive.apache.org/docs/latest/user/tutorial))
- [Union All Result Combinations](https://awesome-repositories.com/f/data-databases/task-result-storage/query-result-persistence/union-all-result-combinations.md) — Merges the result sets of multiple SELECT statements into a single result set using UNION. ([source](https://hive.apache.org/docs/latest/language/))
- [Transactional Data Mutations](https://awesome-repositories.com/f/data-databases/transactional-data-mutations.md) — Inserts, updates, and deletes individual records in ACID tables without rewriting entire partitions. ([source](https://hive.apache.org/docs/latest/hcatalog/))
- [Iceberg Table Ingestion](https://awesome-repositories.com/f/data-databases/virtual-table-querying/external-table-querying/iceberg-table-ingestion.md) — Supports Apache Iceberg tables natively via a storage handler for cloud-native, high-performance data lake architectures. ([source](https://hive.apache.org/docs/latest/user/))
- [JDBC External Table Connectors](https://awesome-repositories.com/f/data-databases/virtual-table-querying/external-table-querying/jdbc-external-table-connectors.md) — Creates external tables that read from any JDBC data source with predicate pushdown support. ([source](https://hive.apache.org/docs/latest/user/))
- [Kudu Table Connectors](https://awesome-repositories.com/f/data-databases/virtual-table-querying/external-table-querying/kudu-table-connectors.md) — Provides a storage handler for querying Apache Kudu tables directly from Hive. ([source](https://hive.apache.org/docs/latest/user/))
- [Table Function Invocation](https://awesome-repositories.com/f/data-databases/virtual-table-querying/external-table-querying/table-function-invocation.md) — Applies a user-defined table-generating function to each input row and joins the output rows to the original input. ([source](https://hive.apache.org/docs/latest/language/))

### Part of an Awesome List

- [Thrift-Based Query Executions](https://awesome-repositories.com/f/awesome-lists/data/query-execution/thrift-based-query-executions.md) — Provides a Thrift-based service for executing SQL queries from remote clients with authentication. ([source](https://hive.apache.org/docs/latest/user/))
- [BI Tool Integration](https://awesome-repositories.com/f/awesome-lists/data/dashboards-and-bi/bi-tool-integration.md) — Provides JDBC and ODBC drivers that connect BI tools directly to Hive for querying. ([source](http://hive.apache.org/))
- [RCFile Data Access](https://awesome-repositories.com/f/awesome-lists/data/data-formats-and-parsers/rcfile-data-access.md) — Stores and retrieves table data in the Record Columnar File format for Hive tables. ([source](https://hive.apache.org/docs/latest/user/))
- [Druid Datasource Querying](https://awesome-repositories.com/f/awesome-lists/data/data-querying/druid-datasource-querying.md) — Indexes data from Hive into Druid and runs SQL queries against Druid datasources directly. ([source](https://hive.apache.org/docs/latest/user/))
- [Data Engineering](https://awesome-repositories.com/f/awesome-lists/devops/data-engineering.md) — Data warehouse software for managing large datasets.

### Development Tools & Productivity

- [Query Caching Daemons](https://awesome-repositories.com/f/development-tools-productivity/persistent-session-managers/session-independent-process-scopes/daemonized-session-persistence/query-caching-daemons.md) — Runs persistent daemon workers that cache data in memory for sub-second SQL query responses.
- [SQL Query Executions](https://awesome-repositories.com/f/development-tools-productivity/rest-apis/sql-query-executions.md) — Provides a REST endpoint for executing Hive queries and returning results. ([source](https://hive.apache.org/docs/latest/webhcat/))
- [WebHCat](https://awesome-repositories.com/f/development-tools-productivity/rest-apis/webhcat.md) — Ships a WebHCat REST API for submitting Hive, MapReduce, and Pig jobs and managing HCatalog metadata.
- [SerDe-Based Format Plugins](https://awesome-repositories.com/f/development-tools-productivity/custom-data-format-plugins/serde-based-format-plugins.md) — Uses a custom SerDe to serialize and deserialize table data in any format for reading and writing. ([source](https://hive.apache.org/docs/latest/user/))
- [Metadata REST Endpoints](https://awesome-repositories.com/f/development-tools-productivity/rest-apis/metadata-rest-endpoints.md) — Provides a REST API for managing table metadata and submitting Hadoop jobs. ([source](https://hive.apache.org/docs/latest/introduction-to-apache-hive/))
- [Beeline Shell Connectivity](https://awesome-repositories.com/f/development-tools-productivity/shell-command-execution/beeline-shell-connectivity.md) — Provides a JDBC-based command-line shell for connecting to HiveServer2 in embedded or remote mode. ([source](https://hive.apache.org/docs/latest/user/))

### Networking & Communication

- [Apache Thrift Integrations](https://awesome-repositories.com/f/networking-communication/apache-thrift-integrations.md) — Uses Apache Thrift for remote procedure calls to submit queries to a Hive instance. ([source](https://hive.apache.org/docs/latest/admin/))
- [Thrift RPC Servers](https://awesome-repositories.com/f/networking-communication/apache-thrift-integrations/thrift-rpc-servers.md) — Implements a Thrift RPC server that accepts SQL queries and returns results over binary or HTTP transport.
- [Thrift Service Implementations](https://awesome-repositories.com/f/networking-communication/apache-thrift-integrations/thrift-service-implementations.md) — Implements a Thrift-based server that accepts SQL queries from remote clients with authentication. ([source](https://hive.apache.org/docs/latest/admin/))

### Programming Languages & Runtimes

- [Directed Acyclic Graph Execution Engines](https://awesome-repositories.com/f/programming-languages-runtimes/runtime-execution-environments/runtime-environments/runtimes/graph-symbolic-execution-engines/directed-acyclic-graph-execution-engines.md) — Uses a DAG execution engine (Tez) for lower-latency interactive SQL queries on Hadoop.
- [Conditional Logic](https://awesome-repositories.com/f/programming-languages-runtimes/conditional-logic.md) — Provides built-in SQL functions for conditional logic like CASE, IF, and NULL handling. ([source](https://hive.apache.org/docs/latest/language/hive-udfs/))
- [Automatic Query-Time Decompression](https://awesome-repositories.com/f/programming-languages-runtimes/data-compression-and-decompression/automatic-query-time-decompression.md) — Imports text files compressed with Gzip or Bzip2 and decompresses them automatically during query execution. ([source](https://hive.apache.org/docs/latest/user/))
- [SQL Type Conversions](https://awesome-repositories.com/f/programming-languages-runtimes/string-type-conversions/sql-type-conversions.md) — Ships a comprehensive set of SQL type conversion functions for query-time data transformation. ([source](https://hive.apache.org/docs/latest/language/hive-udfs/))

### Security & Cryptography

- [Authorization Mode Managers](https://awesome-repositories.com/f/security-cryptography/access-control-managers/authorization-mode-managers.md) — Restricts user actions on data and metadata through configurable authorization modes. ([source](https://hive.apache.org/docs/latest/language/languagemanual))
- [Enterprise Security Integrations](https://awesome-repositories.com/f/security-cryptography/identity-access-management/authentication-strategies/authorization-and-user-administration/enterprise-security-integrations.md) — Integrates with Kerberos, LDAP, SAML, JWT, OAuth2, and Apache Ranger for enterprise-grade security.
- [Multi-Method API Authentication](https://awesome-repositories.com/f/security-cryptography/multi-method-api-authentication.md) — Supports SAML, JWT, Kerberos, and LDAP authentication for HiveServer2 and the metastore. ([source](https://hive.apache.org/docs/latest/))
- [Enterprise Access Controls](https://awesome-repositories.com/f/security-cryptography/security/policies/access-control/enterprise-access-controls.md) — Integrates Kerberos, Apache Ranger, and Apache Atlas for enterprise-grade access control. ([source](http://hive.apache.org/))

### Software Engineering & Architecture

- [Schema-on-Read Formats](https://awesome-repositories.com/f/software-engineering-architecture/data-formats/schema-on-read-formats.md) — Applies a schema to various file formats like CSV, Parquet, and ORC, enabling SQL queries on unstructured data. ([source](https://hive.apache.org/docs/latest/introduction-to-apache-hive/))
- [Table Bucketing](https://awesome-repositories.com/f/software-engineering-architecture/distributed-systems/distributed-data-management/distributed-hash-tables/table-bucketing.md) — Distributes data across fixed buckets using hash of a column for efficient sampling and query pruning. ([source](https://hive.apache.org/docs/latest/user/tutorial))
- [Sampling Retrievals](https://awesome-repositories.com/f/software-engineering-architecture/distributed-systems/distributed-data-management/distributed-hash-tables/table-bucketing/sampling-retrievals.md) — Returns a random sample of rows from a table by selecting a specific bucket. ([source](https://hive.apache.org/docs/latest/language/))
- [String Manipulation](https://awesome-repositories.com/f/software-engineering-architecture/string-validation-and-normalization/string-encodings/string-manipulation.md) — Provides built-in SQL functions for string concatenation, extraction, encoding, and pattern matching. ([source](https://hive.apache.org/docs/latest/language/hive-udfs/))

### Artificial Intelligence & ML

- [Window Ranking Functions](https://awesome-repositories.com/f/artificial-intelligence-ml/search-and-ranking-algorithms/window-ranking-functions.md) — Applies analytic functions like RANK, ROW_NUMBER, and SUM over a defined window of rows. ([source](https://hive.apache.org/docs/latest/language/))

### Business & Productivity Software

- [Database Namespace Organization](https://awesome-repositories.com/f/business-productivity-software/group-management/data-model-groups/model-namespace-organizers/database-namespace-organization.md) — Groups tables, views, and partitions into namespaces to avoid naming conflicts and enforce security boundaries. ([source](https://hive.apache.org/docs/latest/user/tutorial))

### DevOps & Infrastructure

- [Compaction Worker Pools](https://awesome-repositories.com/f/devops-infrastructure/worker-pool-management/local-worker-pools/named-worker-pools/compaction-worker-pools.md) — Routes compaction requests to dedicated worker pools so high-priority tables are processed without waiting in the default queue. ([source](https://hive.apache.org/docs/latest/language/compaction-pooling))

### Operating Systems & Systems Programming

- [JDBC Driver Loading](https://awesome-repositories.com/f/operating-systems-systems-programming/dynamic-library-loading/jdbc-driver-loading.md) — Ships a JDBC driver that enables external applications to connect and run queries against Hive. ([source](https://hive.apache.org/docs/latest/admin/))

### Scientific & Mathematical Computing

- [SQL Mathematical Functions](https://awesome-repositories.com/f/scientific-mathematical-computing/sql-mathematical-functions.md) — Provides a wide range of mathematical functions callable directly within SQL queries. ([source](https://hive.apache.org/docs/latest/language/hive-udfs/))

### User Interface & Experience

- [HBase Table Connectors](https://awesome-repositories.com/f/user-interface-experience/data-table-accessibility/hbase-table-connectors.md) — Accesses HBase tables with HiveQL for both reading and writing data alongside native tables. ([source](https://hive.apache.org/docs/latest/user/))

### Web Development

- [Multi-Table Insertions](https://awesome-repositories.com/f/web-development/editor-insertion-commands/table-insertion/row-insertion/multi-table-insertions.md) — Sends the results of a single query to several tables or HDFS files in one pass. ([source](https://hive.apache.org/docs/latest/user/tutorial))