Skip to main content

OCR-D framework

Project description

Collection of OCR-related python tools and wrappers from the OCR-D team

https://travis-ci.org/OCR-D/core.svg?branch=master Docker Automated build https://img.shields.io/pypi/v/ocrd.svg

Installation

To bootstrap the tool, you’ll need installed (Ubuntu packages):

  • Python (python)

  • pip (python-pip)

  • exiftool (libimage-exiftool-perl)

  • libxml2-utils for xmllint (libxml2-utils)

To install system-wide:

make deps-ubuntu deps-pip install

To develop, install to a virtualenv

pip install virtualenv
virtualenv --no-site-packages venv
source venv/bin/activate
make deps-pip install

Usage

pyocrd installs a binary ocrd that can be used to invoke the processors directly (ocrd process) or start (development) webservices (ocrd server)

TODO: Update docs here.

Examples:

# List available processors
ocrd process

# Region-segment with tesserocr all files in METS INPUT fileGrp
ocrd process -m /path/to/mets.xml segment-region/tesserocr

# Chain multiple processors
ocrd process -m /path/to/mets.xml characterize/exif segment-line/tesserocr recognize/tesserocr

# Start a processor web service at port 6543
ocrd server process -p 6543
http PUT localhost:6543/characterize url==http://server/path/to/mets.xml

Testing

Download assets (make assets)

Test with local files: make test

Test with local asset server:
  • Start asset-server: make asset-server

  • make test OCRD_BASEURL='http://localhost:5001/'

Test with remote assets:
  • make test OCRD_BASEURL='https://github.com/OCR-D/assets/raw/master/data/'

See Also

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ocrd-0.4.3.tar.gz (63.2 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

ocrd-0.4.3-py3-none-any.whl (124.7 kB view details)

Uploaded Python 3

ocrd-0.4.3-py2-none-any.whl (121.5 kB view details)

Uploaded Python 2

File details

Details for the file ocrd-0.4.3.tar.gz.

File metadata

  • Download URL: ocrd-0.4.3.tar.gz
  • Upload date:
  • Size: 63.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for ocrd-0.4.3.tar.gz
Algorithm Hash digest
SHA256 a549b2cd1b5f3619529fbf280aae38c3ea5ad2967d6efc765d1b3e0822c41b35
MD5 5165eab582ba9ae03763df20801d7b18
BLAKE2b-256 622d37ce8d635d7771c6a5a4f64f4204e113fa696e8d7447fda929e21e5bf71d

See more details on using hashes here.

File details

Details for the file ocrd-0.4.3-py3-none-any.whl.

File metadata

  • Download URL: ocrd-0.4.3-py3-none-any.whl
  • Upload date:
  • Size: 124.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for ocrd-0.4.3-py3-none-any.whl
Algorithm Hash digest
SHA256 4aad67655ed3e29e96ad60bec97a293df7ec9833617e91cef3ee9e919bd999af
MD5 28e24613734b805fa686bae8074d5e61
BLAKE2b-256 f0879347209a295261b5bd6552b666af9e9bae0f3685e84365fe6e8382d0ba33

See more details on using hashes here.

File details

Details for the file ocrd-0.4.3-py2-none-any.whl.

File metadata

  • Download URL: ocrd-0.4.3-py2-none-any.whl
  • Upload date:
  • Size: 121.5 kB
  • Tags: Python 2
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for ocrd-0.4.3-py2-none-any.whl
Algorithm Hash digest
SHA256 dab6bb403a706a14c313236a6adc94125a43032872756c474fbc3ae9b3506f4e
MD5 efdeb2c878787f4d2937ae6f7ea75592
BLAKE2b-256 1a8adbee07d51d95bb7d2239744d8fcab3b303c7e58ded44130441ad0c4f844e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page