Skip to content

Latest commit

 

History

History
186 lines (135 loc) · 11 KB

AddOns.md

File metadata and controls

186 lines (135 loc) · 11 KB

For GUI interface to Tesseract and other 3rd Party projects, please see User Projects - 3rd Party

External tools, wrappers and training projects for Tesseract

Tesseract box editors and training tools

Platform support depends on used language and experience of user.

For Tesseract version 4 and up

Box file editors

jTessBoxEditor

For Tesseract 3.0x

Box file editors

Name Last update Language Multipage support
jTessBoxEditor 2023 Java yes
QT Box Editor 2019 C++, Qt4/Qt5 yes
tesseract-box-editor 2013 .NET 4 yes
Tesseract-OCR boxfile AJAX editor 2012 online tool
cowboxer 2012 C++, Qt4 no
moshPyTT 2011 Python, GTK2 no
pytesseracttrainer 2011 Python, GTK2 no

For Tesseract-OCR 2.0x

Box file editors

Name Last update Language
Tesseract-OCR boxfile AJAX editor 2012 online tool
owlboxer 2010 C++, Qt4
Tessboxer 2009 .NET
boxfilereader.php 2009 php
tessboxes 2008 C
JTesseract 2008 C#
wx-tetra 2008 perl, wx
bbtesseract 2008 VB.NET 2008

Other Training Tools

Community training projects

Ports

Tesseract wrappers

Tesseract 4.0x

Java

Python

  • tesserocr - A Python wrapper around Tesseract's C++ API
  • pytesseract - a wrapper class for Tesseract OCR (requires tesseract executable)
  • tesseract-ocr-wrapper - a python wrapper for tesseract-ocr with support for OCRing of pdf
  • aiopytesseract - asyncio tesseract wrapper for Tesseract-OCR.
  • image2text - A python wrapper for tesseract to work on large datasets and directories.

Objective-C

Swift

Flutter

R

  • tesseract Bindings to the C++ API for the R programming language

Ruby

  • rtesseract wrapper gem for Tesseract OCR (requires tesseract executable)

Rust

  • rusty-tesseract a wrapper class for Tesseract OCR (requires tesseract executable; based on pytesseract)

Elixir

Crystal

Tesseract 3.0x

C

  • Tesseract versions 3.02 and up include C API

.Net

Python

  • tesserocr - A Python wrapper around Tesseract's C++ API
  • pyocr - A Python wrapper for Tesseract (and Cuneiform)
  • tesserwrap - Python bindings to the Tesseract API
  • tesseract-sip - A python SIP wrapper for libtesseract (Apache license)
  • pytesseract - a wrapper class for Tesseract OCR (requires tesseract executable)
  • python-tesseract - A wrapper class for Tesseract OCR that allows any conventional image files (SWIG based)
  • http://code.google.com/p/pytess/ - A simple SWIG-based interface to Tesseract
  • aiopytesseract - asyncio tesseract wrapper for Tesseract-OCR.

R

  • tesseract Bindings to the C++ API for the R programming language

Ruby

Java

Node.js

  • penteract - The native node.js bindings to the Tesseract OCR project.

PHP

Objective-C

Go

Clojure

Tesseract 2.0x

Python

.NET

Java