datafusion
Description:
Apache Arrow DataFusion and Ballista query engines
Type: Formula  |  Latest Version: 51.0.0@0  |  Tracked Since: Dec 17, 2025
Links: Homepage  |  @ApacheArrow  |  formulae.brew.sh
Category: Databases
Tags: sql arrow query-engine rust big-data
Install: brew install datafusion
About:
DataFusion is a query engine written in Rust that uses the Apache Arrow columnar format as its in-memory representation. It provides a SQL interface and a DataFrame API for querying data sources such as CSV and Parquet files. Ballista builds on DataFusion to distribute query execution across a cluster of machines.
Key Features:
  • SQL and DataFrame API support
  • Columnar query execution using Apache Arrow
  • Extensible query optimizer and UDF support
  • Distributed execution via Ballista
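
To illustrate what "columnar query execution" means in the list above, here is a minimal sketch in plain Rust. It is not DataFusion or Arrow code: the `Trips` struct, `total_distance`, and `sample_trips` are hypothetical names, and ordinary vectors stand in for Arrow arrays. The idea it shows is that a filter-and-sum query can read only the columns it needs, one column at a time, instead of walking whole rows.

```rust
// Illustrative only: plain Rust vectors stand in for Arrow arrays.
// None of these types or functions belong to the DataFusion or Arrow APIs.

/// A tiny column-oriented table: each field is stored as its own vector.
struct Trips {
    city: Vec<&'static str>,
    distance_km: Vec<f64>,
}

/// Roughly `SELECT SUM(distance_km) FROM trips WHERE city = <city>`,
/// evaluated one column at a time.
fn total_distance(trips: &Trips, city: &str) -> f64 {
    // Step 1: scan only the `city` column and build a selection mask.
    let mask: Vec<bool> = trips.city.iter().map(|c| *c == city).collect();

    // Step 2: apply the mask to the `distance_km` column and aggregate.
    trips
        .distance_km
        .iter()
        .zip(mask.iter())
        .filter(|(_, keep)| **keep)
        .map(|(d, _)| *d)
        .sum()
}

/// Hypothetical sample data used to exercise the sketch.
fn sample_trips() -> Trips {
    Trips {
        city: vec!["Oslo", "Lima", "Oslo", "Pune"],
        distance_km: vec![2.5, 4.0, 1.5, 3.0],
    }
}
```

Calling `total_distance(&sample_trips(), "Oslo")` adds up the two Oslo rows. A real engine applies the same filter-then-aggregate pattern to Arrow arrays in batches, which this sketch does not attempt to reproduce.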
Use Cases:
  • Analyzing large datasets locally or in a cluster
  • Building custom data processing pipelines
  • Embedding a query engine into Rust applications
Alternatives:
  • DuckDB – an embedded OLAP database, whereas DataFusion is primarily a query engine and library.
  • Apache Spark – a general-purpose distributed computing framework, whereas DataFusion focuses on SQL-based data querying.
License: Apache-2.0
Bottles available for: arm64_tahoe, arm64_sequoia, arm64_sonoma, sonoma, arm64_linux, x86_64_linux
Version History
Detected  |  Version  |  Rev  |  Change  |  Commit
Nov 19, 2025 11:58am  |  -  |  0  |  VERSION_BUMP  |  7bab76d0
Oct 13, 2025 4:57am  |  -  |  0  |  VERSION_BUMP  |  905c823c
Sep 16, 2025 12:01pm  |  -  |  0  |  VERSION_BUMP  |  348be947
Sep 15, 2025 6:49am  |  -  |  0  |  VERSION_BUMP  |  73fcfaf5
Dec 15, 2024 11:25am  |  -  |  0  |  VERSION_BUMP  |  bcff5892
Oct 30, 2024 9:30pm  |  -  |  0  |  VERSION_BUMP  |  7ef24d61