Tag: document-processing
12 packages with this tag
| Package | Type | Description | Version |
| --- | --- | --- | --- |
| calabash | formula | XProc (XML Pipeline Language) implementation | 1.5.7-120 |
| cpdf | formula | PDF Command-line Tools | 2.8.1 |
| ghostscript | formula | Interpreter for PostScript and PDF | |
| html2text | formula | Advanced HTML-to-text converter | 2.4.0 |
| ocrmypdf | formula | Adds an OCR text layer to scanned PDF files | 16.12.0 |
| pdfcpu | formula | PDF processor written in Go | 0.11.1 |
| pdfsandwich | formula | Generate sandwich OCR PDFs from scanned file | |
| pdftk-java | formula | Port of pdftk in java | |
| podofo | formula | Library to work with the PDF file format | 1.0.3 |
| pymupdf | formula | Python bindings for the PDF toolkit and renderer MuPDF | 1.26.7 |
| textract | formula | Extract text from various different types of files | |
| tika | formula | Content analysis toolkit | |