ekhtml
« Back to VersTracker
Description:
Forgiving SAX-style HTML parser
Type: Formula  |  Latest Version: 0.3.2@0  |  Tracked Since: Dec 17, 2025
Links: Homepage  |  formulae.brew.sh
Category: Developer tools
Tags: html-parser sax c-library web-scraping parsing
Install: brew install ekhtml
About:
Ekhtml is a forgiving, SAX-style HTML parser designed for robust processing of real-world, often malformed, HTML. It operates as a stream-based parser, emitting events for tags and text data without building a full document tree in memory. This approach makes it ideal for applications requiring low memory overhead and high-speed parsing of HTML streams.
Key Features:
  • Forgiving parser that handles malformed HTML gracefully
  • SAX-style, stream-based event processing
  • Low memory footprint compared to DOM-based parsers
  • Written in C for high performance
Use Cases:
  • Extracting data from legacy or poorly formatted HTML documents
  • Building high-performance web scrapers or crawlers
  • Processing large HTML files with minimal memory usage
Alternatives:
  • libxml2 – libxml2 is a more comprehensive XML/HTML toolkit with full DOM support, but can be heavier and less tolerant of malformed HTML than Ekhtml.
  • Gumbo – Gumbo is a modern, spec-compliant HTML5 parser that also offers a forgiving parser, but Ekhtml is a more lightweight, stream-focused solution.
License: BSD-2-Clause
Bottles available for: arm64_tahoe, arm64_sequoia, arm64_sonoma, arm64_ventura, arm64_monterey, arm64_big_sur, sonoma, ventura, monterey, big_sur, catalina, arm64_linux, x86_64_linux
Version History
Detected Version Rev Change Commit
Sep 13, 2025 4:37pm 0 VERSION_BUMP 492f2264
Sep 12, 2024 6:11pm 0 VERSION_BUMP 43cfb0b5