textract
Description: Extract text from various different types of files
Type: Formula
Tracked Since: Dec 28, 2025
Links: Homepage, formulae.brew.sh
Category: Developer tools
Tags: text-extraction document-processing python pdf ocr
Install: brew install textract
About: Textract is a Python library that extracts text from a wide variety of file formats, including PDFs, Word documents, images, and spreadsheets. It provides a single interface to file content, so callers do not need to learn a separate API for each file type. The tool selects the appropriate backend parser from the file's extension, which lets one code path handle mixed document collections.
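The single-interface idea can be illustrated with a minimal, standard-library-only sketch. This is not textract's implementation: the names `extract_text`, `PARSERS`, `_parse_txt`, and `_parse_csv` are hypothetical, only two plain-text formats are handled, and the parser is chosen from the file extension as an assumption of this sketch. With the library itself, the documented entry point is `textract.process(path)`, which returns the extracted text as bytes.

```python
import csv
from pathlib import Path


def _parse_txt(path: Path) -> str:
    """Return the contents of a plain-text file."""
    return path.read_text(encoding="utf-8")


def _parse_csv(path: Path) -> str:
    """Flatten a CSV file into lines of space-separated cell values."""
    with path.open(newline="", encoding="utf-8") as handle:
        return "\n".join(" ".join(row) for row in csv.reader(handle))


# Hypothetical registry mapping a file extension to its parser function.
PARSERS = {
    ".txt": _parse_txt,
    ".csv": _parse_csv,
}


def extract_text(path, extension=None):
    """Extract text from `path` through one entry point.

    The parser is looked up by file extension. `extension` overrides the
    suffix found in the file name, which helps when a file is misnamed.
    """
    path = Path(path)
    ext = (extension or path.suffix).lower()
    if not ext.startswith("."):
        ext = "." + ext
    try:
        parser = PARSERS[ext]
    except KeyError:
        raise ValueError(f"no parser registered for {ext!r}") from None
    return parser(path)
```

Callers use `extract_text("report.csv")` or `extract_text("notes.dat", extension="txt")` without knowing which parser runs. Supporting a new format in this sketch means adding one entry to `PARSERS`.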
Key Features:
Use Cases:
Alternatives:
Version History