hive ☆

« Back to VersTracker

Description:
Hadoop-based data summarization, query, and analysis

Type: Formula | Tracked Since: Dec 28, 2025

Links: Homepage | @ApacheHive | formulae.brew.sh

Category: Databases

Tags: hadoop sql data-warehouse big-data analytics

Install: brew install hive

About:
Apache Hive is a data warehouse software project built on top of Apache Hadoop for providing data summarization, query, and analysis. It enables easy SQL-like querying of large datasets stored in distributed storage by translating queries into MapReduce or Tez jobs. This allows analysts and engineers to work with big data using familiar SQL syntax without writing complex Java code.

Key Features:

SQL-like query language (HiveQL)
Schema-on-read data handling
Extensible with custom UDFs and SerDes
Integration with Hadoop ecosystem tools

Use Cases:

Batch processing and analysis of petabyte-scale datasets
Data warehousing on Hadoop clusters
ETL operations for big data pipelines

Alternatives:

Apache Spark SQL – Provides faster in-memory processing and more flexible analytics than Hive's batch-oriented approach
Presto – Offers interactive low-latency queries compared to Hive's higher-latency batch processing

Version History

Detected	Version	Rev	Change	Commit