Skip to main content

Pixelwise binarization with selectional auto-encoders in Keras

Project description

Binarization

Binarization for document images

Examples

Introduction

This tool performs document image binarization (i.e. transform colour/grayscale to black-and-white pixels) for OCR using multiple trained models.

The method used is based on Calvo-Zaragoza/Gallego, 2018. A selectional auto-encoder approach for document image binarization.

Installation

Clone the repository, enter it and run

pip install .

Models

Pre-trained models can be downloaded from here:

https://qurator-data.de/sbb_binarization/

Usage

sbb_binarize \
  --patches \
  -m <directory with models> \
  <input image> \
  <output image>

Note In virtually all cases, the --patches flag will improve results.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sbb_binarization-0.0.6.tar.gz (5.8 kB view details)

Uploaded Source

Built Distribution

sbb_binarization-0.0.6-py3-none-any.whl (11.7 kB view details)

Uploaded Python 3

File details

Details for the file sbb_binarization-0.0.6.tar.gz.

File metadata

  • Download URL: sbb_binarization-0.0.6.tar.gz
  • Upload date:
  • Size: 5.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.9

File hashes

Hashes for sbb_binarization-0.0.6.tar.gz
Algorithm Hash digest
SHA256 cccd5d1e95d55244a3d50caba15243265c493cefbd2dbfbe2bfa52fb9cfab15e
MD5 cf47886ccf11f10d099eefc7c3f8154d
BLAKE2b-256 326ac6d86a77176029514c441b424c5715d97c4fea2579ebf9d08b8ff0be3ba4

See more details on using hashes here.

File details

Details for the file sbb_binarization-0.0.6-py3-none-any.whl.

File metadata

  • Download URL: sbb_binarization-0.0.6-py3-none-any.whl
  • Upload date:
  • Size: 11.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.9

File hashes

Hashes for sbb_binarization-0.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 900c681479921e621543c9b373d67bb9900c0570c5832abbbe64736a780181c8
MD5 2d535c2c2d54580073c7e8c0829b9d59
BLAKE2b-256 dff6d320c557eac137763c6034d3b7c460e40a54afed10efa073de4aac708c75

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page