embulk
« Back to VersTracker
Description:
Data transfer between various databases, file formats and services
Type: Formula  |  Latest Version: 0.11.5@0  |  Tracked Since: Dec 17, 2025
Links: Homepage  |  formulae.brew.sh
Category: Databases
Tags: etl data-pipeline data-loader jdbc bulk-transfer
Install: brew install embulk
About:
Embulk is a plugin-based parallel bulk data loader that unifies data extraction from diverse sources like databases, files, and cloud services. It standardizes data ingestion into various data stores, handling format conversions and parallel execution automatically. This makes it ideal for building robust and scalable ETL pipelines.
Key Features:
  • Plugin-based architecture for extensibility
  • Automatic parallel processing for high throughput
  • Unified command-line interface for various data sources
  • Handles data filtering and type conversion
Use Cases:
  • Migrating data from legacy databases to modern data warehouses
  • Automating daily ingestion of application logs and metrics
  • Integrating third-party API data into internal analytics systems
Alternatives:
  • Airbyte – Airbyte is a more modern, UI-driven platform with a larger catalog of pre-built connectors, while Embulk is a lightweight, code-first CLI tool.
  • Apache NiFi – NiFi offers a powerful GUI for complex data flow automation, whereas Embulk focuses specifically on high-performance bulk loading via configuration.
License: Apache-2.0
Dependencies: openjdk@21
Bottles available for: all
Version History
Detected Version Rev Change Commit
Sep 21, 2024 2:56pm 0 VERSION_BUMP 889c2cfd