hive
« Back to VersTracker
Description:
Hadoop-based data summarization, query, and analysis
Type: Formula  |  Tracked Since: Dec 28, 2025
Links: Homepage  |  @ApacheHive  |  formulae.brew.sh
Category: Databases
Tags: hadoop sql data-warehouse big-data analytics
Install: brew install hive
About:
Apache Hive is a data warehouse software project built on top of Apache Hadoop for providing data summarization, query, and analysis. It enables easy SQL-like querying of large datasets stored in distributed storage by translating queries into MapReduce or Tez jobs. This allows analysts and engineers to work with big data using familiar SQL syntax without writing complex Java code.
Key Features:
  • SQL-like query language (HiveQL)
  • Schema-on-read data handling
  • Extensible with custom UDFs and SerDes
  • Integration with Hadoop ecosystem tools
Use Cases:
  • Batch processing and analysis of petabyte-scale datasets
  • Data warehousing on Hadoop clusters
  • ETL operations for big data pipelines
Alternatives:
  • Apache Spark SQL – Provides faster in-memory processing and more flexible analytics than Hive's batch-oriented approach
  • Presto – Offers interactive low-latency queries compared to Hive's higher-latency batch processing
Version History
Detected Version Rev Change Commit