OCR/HTR engine for all the languages
Description
kraken is a turn-key OCR system optimized for historical and non-Latin script material.
kraken’s main features are:
Fully trainable layout analysis and character recognition
Right-to-Left, BiDi, and Top-to-Bottom script support
ALTO, PageXML, abbyXML, and hOCR output
Word bounding boxes and character cuts
Multi-script recognition support
Public repository of model files
Lightweight model files
Variable recognition network architectures
Installation
When using a recent version of pip, all dependencies will be installed from binary wheel packages, so installing build-essential or your distribution's equivalent is often unnecessary. kraken only runs on Linux or Mac OS X; Windows is not supported.
Install the latest development version through conda:
$ wget https://raw.githubusercontent.com/mittagessen/kraken/master/environment.yml
$ conda env create -f environment.yml
or:
$ wget https://raw.githubusercontent.com/mittagessen/kraken/master/environment_cuda.yml
$ conda env create -f environment_cuda.yml
for CUDA acceleration with the appropriate hardware.
It is also possible to install the latest stable release from pypi:
$ pip install kraken
Finally, you’ll need a model to perform the actual character recognition. To download the default model for printed English text and place it in the kraken directory for the current user:
$ kraken get 10.5281/zenodo.2577813
A list of libre models available in the central repository can be retrieved by running:
$ kraken list
Quickstart
To recognize text in an image using the default parameters, including the prerequisite steps of binarization and page segmentation:
$ kraken -i image.tif image.txt binarize segment ocr
To binarize a single image using the nlbin algorithm:
$ kraken -i image.tif bw.png binarize
To segment an image (binarized or not) with the new baseline segmenter:
$ kraken -i image.tif lines.json segment -bl
To segment and OCR an image using the default model(s):
$ kraken -i image.tif image.txt segment -bl ocr
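The segment-and-OCR command above handles one image at a time. The following is a minimal sketch, not part of kraken itself, of how that same command line could be repeated over a directory of images from Python. The function names `build_command` and `run_batch`, the `*.tif` pattern, and the `.txt` output naming are assumptions made for illustration; only the `kraken -i ... segment -bl ocr` invocation comes from the text above.

```python
import subprocess
from pathlib import Path
from typing import List


def build_command(image: Path, out_dir: Path) -> List[str]:
    """Build the segment-and-OCR command shown above for a single image.

    The output file name (image stem plus ".txt") is an assumption of this sketch.
    """
    output = out_dir / (image.stem + ".txt")
    return ["kraken", "-i", str(image), str(output), "segment", "-bl", "ocr"]


def run_batch(image_dir: Path, out_dir: Path,
              pattern: str = "*.tif", dry_run: bool = True) -> List[List[str]]:
    """Build, and optionally run, one kraken command per matching image.

    With dry_run=True the commands are only printed, so the sketch can be
    tried without kraken or a model being installed.
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    commands = [build_command(p, out_dir) for p in sorted(image_dir.glob(pattern))]
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))
        else:
            # check=True raises CalledProcessError if kraken exits with an error.
            subprocess.run(cmd, check=True)
    return commands


# Example (hypothetical directory names):
# run_batch(Path("scans"), Path("ocr_output"), dry_run=True)
```

Starting one process per image keeps the sketch simple, at the cost of loading the model again for every file.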
All subcommands and options are documented. Use the --help option of kraken or of any subcommand to get more information.
Documentation
Have a look at the docs
Funding
kraken is developed at the École Pratique des Hautes Études, Université PSL.
Built Distribution
Hashes for kraken-3.0.0.0b20.dev7-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 3f75a29deed42343adfccd385fef505d2a1e87351f028228333b2fe1cbe37a2c
MD5 | 5b85697a034087d8538c01289053be53
BLAKE2b-256 | 00793daf92bbbcb00feb5299b121aea08ef85dc84389d91b2ad60a4263150987
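A downloaded wheel can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of` and `matches` are hypothetical, and the file path in the usage comment assumes the wheel was saved to the current directory.

```python
import hashlib
from pathlib import Path

# SHA256 digest copied from the table above.
EXPECTED_SHA256 = "3f75a29deed42343adfccd385fef505d2a1e87351f028228333b2fe1cbe37a2c"


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path: Path, expected: str = EXPECTED_SHA256) -> bool:
    """Compare a file's SHA256 digest with the expected hex string."""
    return sha256_of(path) == expected.lower()


# Example (assumes the wheel is in the current directory):
# matches(Path("kraken-3.0.0.0b20.dev7-py3-none-any.whl"))
```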