odbc2parquet
« Back to VersTracker
Description:
CLI to query an ODBC data source and write the result into a Parquet file
Type: Formula  |  Tracked Since: Dec 28, 2025
Links: Homepage  |  formulae.brew.sh
Category: Databases
Tags: odbc parquet etl cli database
Install: brew install odbc2parquet
About:
Odbc2parquet is a command-line interface tool designed to execute SQL queries against any ODBC-compliant database. It efficiently streams the query results directly into Apache Parquet files, a highly optimized columnar storage format. This facilitates seamless data migration and integration into modern data analytics pipelines, such as those used by Apache Spark or DuckDB.
Key Features:
  • Supports any ODBC 3.0 compliant database driver
  • Efficient streaming of large result sets to minimize memory usage
  • Direct output to Apache Parquet format for analytics compatibility
  • Configurable batch sizes and compression options
Use Cases:
  • Migrating legacy relational data to modern data lakes
  • Creating daily snapshots of transactional databases for reporting
  • Extracting data from proprietary systems for analysis in Python/R
Alternatives:
  • odbc2csv – Similar concept but outputs to CSV, which lacks Parquet's compression and schema enforcement.
  • Python (pandas + pyodbc) – Requires writing custom scripts; Odbc2parquet offers a dedicated CLI for this specific task without coding.
Version History
Detected Version Rev Change Commit
Sep 16, 2025 8:40am 0 VERSION_BUMP 0c6c13c9