embulk ☆

« Back to VersTracker

Description:
Data transfer between various databases, file formats and services

Type: Formula | Latest Version: 0.11.5@0 | Tracked Since: Dec 17, 2025

Links: Homepage | formulae.brew.sh

Category: Databases

Tags: etl data-pipeline data-loader jdbc bulk-transfer

Install: brew install embulk

About:
Embulk is a plugin-based parallel bulk data loader that unifies data extraction from diverse sources like databases, files, and cloud services. It standardizes data ingestion into various data stores, handling format conversions and parallel execution automatically. This makes it ideal for building robust and scalable ETL pipelines.

Key Features:

Plugin-based architecture for extensibility
Automatic parallel processing for high throughput
Unified command-line interface for various data sources
Handles data filtering and type conversion

Use Cases:

Migrating data from legacy databases to modern data warehouses
Automating daily ingestion of application logs and metrics
Integrating third-party API data into internal analytics systems

Alternatives:

Airbyte – Airbyte is a more modern, UI-driven platform with a larger catalog of pre-built connectors, while Embulk is a lightweight, code-first CLI tool.
Apache NiFi – NiFi offers a powerful GUI for complex data flow automation, whereas Embulk focuses specifically on high-performance bulk loading via configuration.

License: Apache-2.0

Dependencies: openjdk@21

Bottles available for: all

Version History

Detected	Version	Rev	Change	Commit
Sep 21, 2024 2:56pm		0	VERSION_BUMP	889c2cfd