Skip to main content

Collection of utilities for Tesseract OCR training

Project description

Python utilities for Tesseract OCR training

This module is a collection of different training utilities for Tesseract OCR. These utilities are also implemented as console scripts, hence they can be run from command line.

Requirements

This module requires the following modules to work:

  • pytesseract (Running Tesseract OCR)
  • editdistance (Calculation of error rates)

Packages

The module is split in several packages. The package pytesstrain.train contains the workhorse function run_text(). The package pytesstrain.cli contains the tolls you might run at the command line. The package pytesstrain.ambigs contains function around unicharambigs file. The package pytesstrain.text2image contains the interface to the text2image command from the Tesseract OCR; the interface relies on pytesseract module and is modelled after it as well. The package pytesstrain.metrics contains error rate calculations, as well the interface class Metrics. The package pytesstrain.utils has auxiliary functions.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pytesstrain-0.1.0.tar.gz (9.8 kB view details)

Uploaded Source

Built Distribution

pytesstrain-0.1.0-py3-none-any.whl (20.2 kB view details)

Uploaded Python 3

File details

Details for the file pytesstrain-0.1.0.tar.gz.

File metadata

  • Download URL: pytesstrain-0.1.0.tar.gz
  • Upload date:
  • Size: 9.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.5.4

File hashes

Hashes for pytesstrain-0.1.0.tar.gz
Algorithm Hash digest
SHA256 d59e84da332bc70e16ea7a59417334ee64f8752920c1c1a434e9a5e601784e20
MD5 3da7b2583852d5ae3f07c7211eb2a4ff
BLAKE2b-256 c4b02e268ad7d32ba9c56b90fc60e94668cbfa9339aa2e9535e6b353d662446d

See more details on using hashes here.

File details

Details for the file pytesstrain-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: pytesstrain-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 20.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.5.4

File hashes

Hashes for pytesstrain-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 77e04609649d674aec7cd55e9ce2e9285c78571d8db9bec0ccba285ddd401ab0
MD5 2a6801f6c0afe06cfba55495fcdbde70
BLAKE2b-256 bfa92b6f1334faf143e268871f419dcf4ff88d2943dcad717e4534edf4ab7562

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page