Typegroups classifier for OCR

ocrd_typegroups_classifier

Installation

From PyPI

pip3 install ocrd_typegroups_classifier
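
To confirm that the installation succeeded, the installed version can be queried from Python. The following is a minimal sketch, assuming Python 3.8 or later (where `importlib.metadata` is part of the standard library); the helper name `installed_version` is hypothetical and not part of this package.

```python
from importlib import metadata  # standard library since Python 3.8


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Distribution name as published on PyPI.
    name = "ocrd_typegroups_classifier"
    version = installed_version(name)
    print(f"{name}: {version or 'not installed'}")
```

If the script reports "not installed", check that the virtual environment used for `pip3 install` is the one currently active.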

From source

If needed, create a virtual environment for Python 3 (the tool was tested successfully with Python 3.7), activate it, and install ocrd:

virtualenv -p python3 ocrd-venv3
source ocrd-venv3/bin/activate
pip3 install ocrd

Change into the folder containing the tool:

cd ocrd_typegroups_classifier/

Install the module and its dependencies:

make install

Finally, run the test:

sh test/test.sh

**Important:** The test only checks that the system works. For speed, it uses a very small neural network and applies it only to the top-left corner of the image, so the results will be of poor quality.

Models

The model classifier-1.tgc is based on a ResNet-18 with fewer neurons per layer than the standard architecture. It was briefly trained on 12 classes: Adornment, Antiqua, Bastarda, Book covers and other irrelevant data, Empty Pages, Fraktur, Griechisch, Hebräisch, Kursiv, Rotunda, Textura, and Woodcuts - Engravings.
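
To illustrate how a score vector over these 12 classes can be turned into readable labels, here is a minimal sketch. It does not use the package's API: the function `top_classes`, the ordering of `CLASSES`, and the example scores are all assumptions made for illustration, and the real model may order its outputs differently.

```python
# Class names as listed above; the order here is an assumption and may
# not match the output order of the actual model.
CLASSES = [
    "Adornment", "Antiqua", "Bastarda",
    "Book covers and other irrelevant data", "Empty Pages", "Fraktur",
    "Griechisch", "Hebräisch", "Kursiv", "Rotunda", "Textura",
    "Woodcuts - Engravings",
]


def top_classes(scores, k=3):
    """Return the k (label, score) pairs with the highest scores."""
    if len(scores) != len(CLASSES):
        raise ValueError(f"expected {len(CLASSES)} scores, got {len(scores)}")
    # Sort label/score pairs by score, highest first.
    ranked = sorted(zip(CLASSES, scores), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


# Hypothetical scores for one page, for illustration only.
example = [0.0, 0.1, 0.0, 0.0, 0.0, 0.8, 0.0, 0.0, 0.0, 0.05, 0.05, 0.0]
print(top_classes(example, k=2))
```

Keeping the non-text classes (such as Empty Pages or Woodcuts - Engravings) in the ranking makes it easy to filter out pages that should not be passed on to OCR.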

Heatmap Generation

Given a trained model, you can produce heatmaps corresponding to the classification results. Syntax:

python3 tools/heatmap.py ocrd_typegroups_classifier/models/classifier.tgc sample.jpg out
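
When heatmaps are needed for several images, the command above can be wrapped in a small Python helper. This is a sketch under stated assumptions: it only reproduces the three positional arguments shown above (model, input image, output), it must be run from the repository root so that `tools/heatmap.py` resolves, and the function names `heatmap_command` and `run_heatmap` are hypothetical. Whether the last argument is a directory or a file prefix is not specified here, so the helper passes it through unchanged.

```python
import subprocess
import sys


def heatmap_command(model_path, image_path, output, python=sys.executable):
    """Build the argument list for tools/heatmap.py (model, image, output)."""
    return [python, "tools/heatmap.py", str(model_path), str(image_path), str(output)]


def run_heatmap(model_path, image_path, output):
    """Run the script from the repository root; raises if it exits non-zero."""
    subprocess.run(heatmap_command(model_path, image_path, output), check=True)


if __name__ == "__main__":
    # Same arguments as the command line shown above.
    cmd = heatmap_command(
        "ocrd_typegroups_classifier/models/classifier.tgc", "sample.jpg", "out"
    )
    print(" ".join(cmd))
```

Passing the arguments as a list rather than a single shell string avoids quoting problems with file names that contain spaces.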
