
Tesseract OCR Application
Tesseract Open Source OCR Engine
Website: https://github.com/tesseract-ocr/tesseract
Tesseract 4 adds a new neural network (LSTM) based OCR engine that focuses on line recognition. It still supports the legacy Tesseract 3 engine, which works by recognizing character patterns. To use the legacy engine, select the Legacy OCR Engine mode (--oem 0) and supply traineddata files that support it, for example those from the tessdata repository.
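As a rough illustration of the legacy mode described above, the following Python sketch assembles a Tesseract command line that selects the legacy engine with --oem 0. The helper names build_tesseract_command and run_ocr, the file names, and the default language are hypothetical and chosen only for this example. It assumes the tesseract executable is on the PATH and that the traineddata files in use support the legacy engine.

```python
import subprocess
from typing import List, Optional


def build_tesseract_command(
    image_path: str,
    output_base: str,
    lang: str = "eng",
    tessdata_dir: Optional[str] = None,
) -> List[str]:
    """Build an argument list that runs Tesseract in legacy engine mode.

    Hypothetical helper for illustration; only the command-line flags
    (-l, --oem, --tessdata-dir) come from Tesseract itself.
    """
    # --oem 0 selects the legacy (Tesseract 3 style) engine.
    cmd = ["tesseract", image_path, output_base, "-l", lang, "--oem", "0"]
    if tessdata_dir is not None:
        # Point Tesseract at a directory of traineddata files that
        # include legacy engine data (assumed to exist on this machine).
        cmd += ["--tessdata-dir", tessdata_dir]
    return cmd


def run_ocr(image_path: str, output_base: str) -> None:
    """Run Tesseract; raises CalledProcessError if it exits with an error."""
    subprocess.run(build_tesseract_command(image_path, output_base), check=True)


if __name__ == "__main__":
    # Only print the command here, so the sketch runs without Tesseract installed.
    print(" ".join(build_tesseract_command("scan.png", "scan_out")))
```

Keeping the command construction separate from the subprocess call makes it possible to inspect or log the exact arguments before anything is executed.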

CmdTwain
Command line driven image scanner program
Website: http://www.gssezisoft.com/main/
CmdTwain is a free command line image scanner program. Its purpose is to allow you to integrate scanned documents into programs that record information such as tax receipts, resumes, photo albums, correspondence and so on.

Poppler
PDF Rendering library
Website: https://poppler.freedesktop.org/
Poppler is a PDF rendering library based on the xpdf-3.0 code base.

Ghostscript
Website: https://www.ghostscript.com/
Ghostscript is an interpreter for the PostScript® language and PDF files. It is available under either the GNU Affero General Public License (AGPL) or a commercial license from Artifex Software, Inc. It has been under active development for over 30 years and has been ported to several different systems during this time. Ghostscript consists of a PostScript interpreter layer and a graphics library.

FFmpeg
A complete, cross-platform solution to record, convert and stream audio and video.
Website: https://www.ffmpeg.org/

LM Studio
LM Studio is an easy-to-use, cross-platform desktop app for experimenting with local and open-source Large Language Models (LLMs).
Website: https://lmstudio.ai/

Tifcp
TIFCP (TIFF Combine Pages) is a command-line tool used to append multiple TIFF documents into a single TIFF file. It is useful for merging multiple images, such as scanned documents, into one file for easier management and sharing.