Kraken bindings

Project description

ocrd_kraken

OCR-D wrapper for the Kraken OCR engine

Introduction

This package offers OCR-D compliant workspace processors for (some of) the functionality of Kraken.

(Each processor is a parameterizable step in a configurable workflow of the OCR-D functional model. There are usually various alternative processor implementations for each step. Data is represented with METS and PAGE.)

It includes image preprocessing (binarization), layout analysis (region and line+baseline segmentation), and text recognition.

Installation

With Docker

This is the best option if you want to run the software in a container.

You need to have Docker

docker pull ocrd/kraken

To run with Docker:

docker run --rm \
-v path/to/workspaces:/data \
-v path/to/models:/usr/local/share/ocrd-resources \
ocrd/kraken ocrd-kraken-recognize ...
# or ocrd-kraken-segment or ocrd-kraken-binarize

Native, from PyPI

This is the best option if you want to use the stable, released version.

pip install ocrd_kraken

Native, from git

Use this option if you want to change the source code or install the latest, unpublished changes.

We strongly recommend to use venv.

git clone https://github.com/OCR-D/ocrd_kraken
cd ocrd_kraken
sudo make deps-ubuntu # or manually from git or via ocrd_all
make deps        # or pip install -r requirements.txt
make install     # or pip install .

Models

Kraken uses data-driven (neural) models for segmentation and recognition, but comes with no pretrained "official" models. There is a public repository of community-provided models, which can also be queried and downloaded from via kraken standalone CLI. (See Kraken docs for details.)

For the OCR-D wrapper, since all OCR-D processors must resolve file/data resources in a standardized way, there is a general mechanism for managing models, i.e. installing and using them by name. We currently manage our own list of recommended models (without delegating to the above repo).

Models always use the filename suffix .mlmodel, but are just loaded by their basename.

See the OCR-D model guide and

ocrd resmgr --help

Usage

For details, see docstrings in the individual processors and ocrd-tool.json descriptions, or simply --help.

Available OCR-D processors are:

ocrd-kraken-binarize (nlbin – not recommended)
- adds AlternativeImage files (per page, region or line) to the output fileGrp
ocrd-kraken-segment (all-in-one segmentation – recommended for handwriting and simply layouted prints, or as pure line segmentation)
- adds TextRegions to Page (if level-of-operation=page) or TableRegions (if table)
- adds TextLines (with Baseline) to TextRegions (for all level-of-operation)
- masks existing segments during detection (unless overwrite_segments)
ocrd-kraken-recognize (benefits from annotated Baselines, falls back to center-normalized bboxes)
- adds Words to TextLines
- adds Glyphs to Words
- adds TextEquiv (removing existing TextEquiv if overwrite_text)

Testing

make test

This downloads test data from https://github.com/OCR-D/assets under repo/assets, and runs some basic tests of the Python API.

Set PYTEST_ARGS="-s --verbose" to see log output (-s) and individual test results (--verbose).

Project details

Release history Release notifications | RSS feed

This version

1.0.1

Feb 12, 2025

0.4.1

Jun 3, 2024

0.3.0

Oct 23, 2022

0.1.1

Oct 21, 2019

0.1.0

Feb 28, 2019

0.0.2

Jan 4, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ocrd_kraken-1.0.1.tar.gz (26.3 kB view details)

Uploaded Feb 12, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ocrd_kraken-1.0.1-py3-none-any.whl (23.9 kB view details)

Uploaded Feb 12, 2025 Python 3

File details

Details for the file ocrd_kraken-1.0.1.tar.gz.

File metadata

Download URL: ocrd_kraken-1.0.1.tar.gz
Upload date: Feb 12, 2025
Size: 26.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.9

File hashes

Hashes for ocrd_kraken-1.0.1.tar.gz
Algorithm	Hash digest
SHA256	`c7516758d91d51a26da18571bb2918f8e047688cbf6c3e0aee375ebe6e4c32e7`
MD5	`9cc52100cc66515dc6cd853c9f9ac265`
BLAKE2b-256	`3178b538b9c588005e9f7ab84b5672b70d36c5e46f7749aa3c94de41242cd179`

See more details on using hashes here.

File details

Details for the file ocrd_kraken-1.0.1-py3-none-any.whl.

File metadata

Download URL: ocrd_kraken-1.0.1-py3-none-any.whl
Upload date: Feb 12, 2025
Size: 23.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.9

File hashes

Hashes for ocrd_kraken-1.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`48a996baeb5526082f5d744f974b86165638ed3590ee56e7b07af726c4f37670`
MD5	`df54080d4e2e4e6f05298bb8c83ec501`
BLAKE2b-256	`c24c5926185972754e269b1a5401ea2b706451002e5ad2ee77823b07f3d1627f`

See more details on using hashes here.

ocrd-kraken 1.0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

ocrd_kraken

Introduction

Installation

With Docker

Native, from PyPI

Native, from git

Models

Usage

Testing

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes