apache-spark
« Back to VersTracker
Description:
Engine for large-scale data processing
Type: Formula  |  Latest Version: 4.1.1@0  |  Tracked Since: Dec 17, 2025
Links: Homepage  |  @ApacheSpark  |  formulae.brew.sh
Category: Devops
Tags: big-data analytics distributed-computing etl machine-learning
Install: brew install apache-spark
About:
Apache Spark is a unified analytics engine for large-scale data processing. It provides high-level APIs in Java, Scala, Python, and R, and an optimized engine that supports general computation graphs. It is designed to scale from a single machine to large clusters, offering significant performance improvements over traditional big data frameworks.
Key Features:
  • In-memory computation for faster performance
  • Support for SQL, streaming, and machine learning
  • Fault-tolerant distributed data processing
  • Rich APIs in Java, Scala, Python, and R
Use Cases:
  • Big data ETL pipelines and data warehousing
  • Real-time stream processing and analytics
  • Large-scale machine learning model training
Alternatives:
  • hadoop – Hadoop MapReduce writes to disk, making it slower than Spark's in-memory processing.
  • flink – Flink offers lower latency for streaming, while Spark is generally easier to use for batch processing.
License: Apache-2.0
Dependencies: openjdk@21
Bottles available for: all
Version History
Detected Version Rev Change Commit
Jan 9, 2026 10:54am 4.1.1 0 VERSION_BUMP 3206653e
Dec 16, 2025 2:20pm 4.1.0 0 VERSION_BUMP 6190c704
Dec 16, 2025 1:56pm 4.1.0 0 VERSION_BUMP b4afb3a2
Aug 12, 2024 12:18pm 3.5.2 0 VERSION_BUMP fc2457ba
Sep 13, 2023 4:30pm 3.4.1 0 VERSION_BUMP 880f15e0
Sep 10, 2023 10:42am 3.4.1 0 VERSION_BUMP 5749cbbd
Jun 23, 2023 1:24pm 3.4.1 0 VERSION_BUMP 40568f7f
May 22, 2023 4:42pm 3.4.0 0 VERSION_BUMP 536ea375