docx2txt
« Back to VersTracker
Description:
Converts Microsoft Office docx documents to equivalent text documents
Type: Formula  |  Latest Version: 1.4@0  |  Tracked Since: Dec 17, 2025
Links: Homepage  |  formulae.brew.sh
Category: Productivity
Tags: text-processing documents cli productivity perl
Install: brew install docx2txt
About:
docx2txt is a command-line utility that extracts plain text content from Microsoft Office Open XML (DOCX) files. It operates by parsing the underlying XML structure of the document to retrieve text, ignoring complex formatting and embedded media. This tool is invaluable for content migration, indexing, and processing documents in environments where proprietary word processors are unavailable.
Key Features:
  • Preserves paragraph structure and basic text formatting
  • Handles both .docx files and standard input streams
  • Lightweight, fast, and scriptable for automation
  • Requires only a standard Perl interpreter to run
Use Cases:
  • Batch converting documents for text-based indexing or search
  • Extracting raw content for data analysis or migration pipelines
  • Quickly reading document content on headless servers without a GUI
Alternatives:
  • pandoc – Pandoc is a universal document converter that supports many formats but is significantly larger and more complex.
  • textutil – textutil is a native macOS utility for text conversion, but it is platform-specific and not available on Linux or Windows.
License: GPL-3.0-or-later
Bottles available for: all
Version History
Detected Version Rev Change Commit