pdf2json
« Back to VersTracker
Description:
PDF to JSON and XML converter
Type: Formula  |  Tracked Since: Dec 28, 2025
Links: Homepage  |  formulae.brew.sh
Category: Developer tools
Tags: pdf json xml converter cli
Install: brew install pdf2json
About:
Pdf2json is a command-line utility that parses PDF files and converts their content into structured JSON or XML formats. It extracts text, fonts, and layout information, making it easier to programmatically process PDF data. This tool is particularly useful for developers needing to integrate PDF parsing capabilities into web applications or data pipelines.
Key Features:
  • Converts PDF to JSON and XML
  • Extracts text and layout data
  • Command-line interface
  • Web-friendly output formats
Use Cases:
  • Extracting text from PDF documents for search indexing
  • Parsing PDF form data into JSON for web APIs
  • Converting PDF reports to XML for data migration
Alternatives:
  • pdftotext – Part of Poppler utilities, focuses on plain text extraction rather than structured JSON
  • PyMuPDF – Python library offering more granular control but requires scripting, not a standalone CLI tool
Version History
Detected Version Rev Change Commit