tesseract-lang
Description:
Enables extra language support for Tesseract
Type: Formula  |  Tracked Since: Dec 28, 2025
Links: Homepage  |  formulae.brew.sh
Category: Developer tools
Tags: tesseract ocr language text-recognition i18n
Install: brew install tesseract-lang
About:
This package provides language data files for Tesseract OCR, enabling recognition of dozens of additional languages. It ships the 'fast' traineddata models, which use integer models to keep file sizes small and speed up inference. This suits users who need broad multilingual OCR support without the storage overhead of the standard models.
Key Features:
  • Supports dozens of languages and scripts
  • Optimized 'fast' models for smaller size and speed
  • Easy installation via Homebrew
  • Seamless integration with Tesseract OCR
Use Cases:
  • Processing multilingual documents for digitization
  • Building applications that require text extraction from images in various languages
  • Setting up a server-side OCR pipeline with support for international text
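As a rough illustration of the use cases above, the sketch below runs Tesseract over one image with several languages enabled. It assumes the package is already installed; the image name scan.png, the output base name scan_out, and the helper join_langs are hypothetical, and eng, deu, and fra are only example language codes.

```shell
#!/usr/bin/env bash
# Sketch: run Tesseract with several languages at once.
# Assumptions: scan.png and the output base name scan_out are hypothetical;
# eng, deu, and fra are example language codes.

# Join language codes with '+', the separator that tesseract's -l option expects.
join_langs() {
  local IFS='+'
  printf '%s\n' "$*"
}

image="scan.png"
langs="$(join_langs eng deu fra)"   # eng+deu+fra

if command -v tesseract >/dev/null 2>&1 && [ -f "$image" ]; then
  # Show which traineddata files Tesseract can see.
  tesseract --list-langs
  # Write the recognized text to scan_out.txt.
  tesseract "$image" scan_out -l "$langs"
else
  echo "tesseract or $image not found; skipping OCR run" >&2
fi
```

Listing the languages first is a quick way to confirm that the extra traineddata files are visible to Tesseract before processing a batch of documents.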
Alternatives:
  • tessdata (standard) – Provides the standard models, which offer higher accuracy but significantly larger file sizes.
  • tessdata_best – Offers the highest-accuracy models (float LSTM), which are also the largest and slowest.
Version History
Detected Version Rev Change Commit