pup
Description:
Parse HTML at the command-line
Type: Formula  |  Tracked Since: Dec 28, 2025
Links: Homepage  |  formulae.brew.sh
Category: Developer tools
Tags: html scraper cli parsing web
Install: brew install pup
About:
Pup is a command-line tool for parsing HTML. It reads HTML from standard input or a file and prints the elements that match a CSS selector. This makes quick data extraction and scraping tasks possible directly in the terminal, without writing a separate script.
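To make the two input modes concrete, here is a minimal sketch that feeds the same document to pup once over standard input and once with the -f flag. The sample file and its contents are invented for the example, and the pup calls are skipped when pup is not on the PATH.

```shell
#!/bin/sh
# Hypothetical sample page, written to a temp file for this example only.
html_file=$(mktemp)
cat > "$html_file" <<'EOF'
<html><head><title>Demo page</title></head>
<body><p class="intro">Hello</p><a href="/docs">Docs</a></body></html>
EOF

if command -v pup >/dev/null 2>&1; then
  # Read from standard input and select the <title> element.
  pup 'title' < "$html_file"
  # Read the same document from a file with -f and select by class.
  pup -f "$html_file" 'p.intro'
fi
```

Quoting the selector keeps the shell from interpreting characters such as brackets or braces before pup sees them.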
Key Features:
  • CSS selector support
  • JSON output format
  • Streaming input processing
  • Attribute and text extraction
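The features above map onto pup's display functions, which are appended after the selector. The following is a small sketch, assuming pup is installed; the HTML snippet is made up for illustration.

```shell
#!/bin/sh
# Hypothetical HTML snippet used only for this example.
html='<ul><li><a href="/a">First</a></li><li><a href="/b">Second</a></li></ul>'

if command -v pup >/dev/null 2>&1; then
  # text{} prints the text content of the matched elements.
  printf '%s\n' "$html" | pup 'li a text{}'
  # attr{href} prints the value of the href attribute for each match.
  printf '%s\n' "$html" | pup 'li a attr{href}'
  # json{} prints the matched elements as a JSON array.
  printf '%s\n' "$html" | pup 'li a json{}'
fi
```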
Use Cases:
  • Scraping data from web pages in shell scripts
  • Extracting specific elements from HTML for debugging
  • Converting HTML tables to JSON for data processing
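As an illustration of the table use case, the sketch below pairs pup with jq. It assumes both tools are installed and that pup's json{} output exposes each element's text and children under the keys "text" and "children"; the table itself is invented for the example.

```shell
#!/bin/sh
# Hypothetical two-column table used only for this example.
table='<table><tr><td>alice</td><td>30</td></tr><tr><td>bob</td><td>25</td></tr></table>'

if command -v pup >/dev/null 2>&1 && command -v jq >/dev/null 2>&1; then
  # Select every row, dump the rows as JSON, then keep only the cell text,
  # producing one compact JSON array per table row.
  printf '%s\n' "$table" | pup 'tr json{}' | jq -c '.[] | [.children[].text]'
fi
```

pup emits a generic element tree rather than a table-shaped structure, so the reshaping step is left to jq here; real pages with nested markup inside cells would likely need a more careful filter.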
Alternatives:
  • jq – Filters and transforms JSON rather than HTML
  • htmlq – Similar functionality; written in Rust, whereas pup is written in Go
  • xmllint – More general XML tool that can also parse HTML when given the --html flag
Version History
Detected  |  Version  |  Rev  |  Change  |  Commit
Sep 14, 2024 10:37pm  |    |  0  |  VERSION_BUMP  |  e77d5c6a