datamash
« Back to VersTracker
Description:
Tool to perform numerical, textual & statistical operations
Type: Formula  |  Latest Version: 1.9@0  |  Tracked Since: Dec 17, 2025
Links: Homepage  |  formulae.brew.sh
Category: Developer tools
Tags: cli statistics data-processing text-processing gnu
Install: brew install datamash
About:
GNU Datamash is a command-line program for basic numerical, textual, and statistical operations. It can perform calculations such as sum, average, max, min, and percentile on tabular data without requiring a database or spreadsheet. The tool is designed for processing text files directly from the command line, making it ideal for data analysis pipelines.
Key Features:
  • Supports statistical functions like percentile, median, and standard deviation
  • Performs text processing operations including sorting and collapsing
  • Handles CSV, TSV, and other delimited data formats
  • Requires no external dependencies or scripting
  • Designed for efficient processing of large text files
Use Cases:
  • Calculating summary statistics from log files or CSV exports
  • Filtering and transforming data streams in shell scripts
  • Quickly computing aggregates without loading data into spreadsheets
Alternatives:
  • awk – More general-purpose text processing, but requires more complex syntax for statistical functions
  • R – More powerful statistical analysis but heavier and less suited for simple command-line tasks
  • csvkit – CSV-specific toolkit with similar features but focused on CSV format
License: GPL-3.0-or-later
Bottles available for: arm64_tahoe, arm64_sequoia, arm64_sonoma, arm64_ventura, sonoma, ventura, arm64_linux, x86_64_linux
Version History
Detected Version Rev Change Commit
Sep 15, 2025 6:26am 0 VERSION_BUMP bb4efd4c