tesseract-lang ☆

« Back to VersTracker

Description:
Enables extra languages support for Tesseract

Type: Formula | Tracked Since: Dec 28, 2025

Links: Homepage | formulae.brew.sh

Category: Developer tools

Tags: tesseract ocr language text-recognition i18n

Install: brew install tesseract-lang

About:
This package provides language data files for Tesseract OCR, enabling recognition for dozens of additional languages. It uses the 'fast' traineddata format, which offers smaller file sizes by using integer models for faster inference. This is ideal for users who need broad multilingual OCR support without the storage overhead of standard models.

Key Features:

Supports dozens of languages and scripts
Optimized 'fast' models for smaller size and speed
Easy installation via Homebrew
Seamless integration with Tesseract OCR

Use Cases:

Processing multilingual documents for digitization
Building applications that require text extraction from images in various languages
Setting up a server-side OCR pipeline with support for international text

Alternatives:

tesseract-lang (standard) – Provides standard models with higher accuracy but significantly larger file sizes.
tessdata_best – Offers the highest accuracy models (LSTM float) but are the largest and slowest.

Version History

Detected	Change	Commit
Feb 15, 2023 5:15am	VERSION_BUMP	0df70f1e
Feb 15, 2023 4:23am	VERSION_BUMP	046f9011
Feb 15, 2023 2:30am	VERSION_BUMP	6a991e37