1 Repo
Runs user-provided mapper and reducer scripts within a query using the TRANSFORM clause.
Distinguishing note: No candidate covers embedding MapReduce scripts in SQL queries; this is a unique Hive extensibility feature.
Explore 1 awesome GitHub repository matching data & databases · Custom MapReduce Script Embedding. Refine with filters or upvote what's useful.
Apache Hive is a SQL-on-Hadoop data warehouse that enables querying and managing petabytes of data stored in distributed storage such as HDFS and cloud storage services. It provides a familiar SQL interface for batch analytics and reporting, supported by a core set of components including the HiveServer2 Thrift service for remote query execution, the Hive Metastore Service for central metadata management, the Hive ACID Transaction Engine for concurrent read-write operations, and the Hive LLAP Interactive Engine for low-latency analytical processing. The WebHCat REST API offers an HTTP interfac
Embeds user-provided mapper and reducer scripts within SQL queries using the TRANSFORM clause.