NUM Miner (Tool to create open dataset for Handwritten Text Recognition)

NUMiner

NUMiner is a Python library that creates training images for Handwritten Text Recognition (HTR) and related research.

Installation

Use the pip package manager to install numiner:

pip install numiner

How To Use

The package has two main modes: sheet and letter.

sheet - takes a <source> path, either a folder holding all the scanned sheet images or a single image file, and saves the processed images in the <result> path

$ numiner -s/--sheet <source> <result>

letter - takes a <source> path, either a folder holding all the cropped raw images or a single image file, and saves the processed images in the <result> path

$ numiner -l/--letter <source> <result>

You can also override the default sheet labels by passing a JSON file with -c:

$ numiner -c path/to/labels.json -s path/to/source path/to/result
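When numiner is driven from a script rather than typed by hand, the commands above can be assembled programmatically. The following is a minimal sketch, not part of the package: the helper names `build_numiner_command` and `run_numiner` are hypothetical, and only the flags documented in this README (`--sheet`, `--letter`, `-c`) are used.

```python
import subprocess
from typing import List, Optional


def build_numiner_command(
    mode: str, source: str, result: str, labels: Optional[str] = None
) -> List[str]:
    """Assemble a numiner command line (hypothetical helper, not numiner's API)."""
    flags = {"sheet": "--sheet", "letter": "--letter"}
    if mode not in flags:
        raise ValueError(f"mode must be 'sheet' or 'letter', got {mode!r}")
    command = ["numiner"]
    if labels is not None:
        # The README only shows -c used together with -s/--sheet.
        command += ["-c", labels]
    command += [flags[mode], source, result]
    return command


def run_numiner(
    mode: str, source: str, result: str, labels: Optional[str] = None
) -> None:
    """Run numiner as a subprocess; requires numiner to be installed and on PATH."""
    subprocess.run(build_numiner_command(mode, source, result, labels), check=True)


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch works without numiner.
    print(build_numiner_command("sheet", "path/to/source", "path/to/result"))
```

Keeping the command as a list of arguments, rather than a single shell string, avoids quoting problems when paths contain spaces.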

To list all available options, run:

$ numiner --help

usage: numiner [-h] [-v] [-s <source> <result>] [-l <source> <result>] [-c <path>]

optional arguments:
  -h, --help                    show this help message and exit
  -v, --version                 show program's version number and exit
  -s/--sheet <source> <result>  a path to a folder or file that's holding the <source>
                                sheet image(s) & a path to a folder where all <result>
                                images will be saved
  -l/--letter <source> <result> a path to a folder or a file that's holding the cropped
                                image(s) & a path to a folder where all <result> images
                                will be saved
  -c <path>                     a path to .json file that's holding top to bottom, left
                                to right labels of the sheet with their ids
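To make the argument shapes in the help text concrete, here is a sketch of how an interface like this could be declared with Python's argparse. It is an illustration only and not numiner's actual source: the function name `make_parser`, the help strings, and the placeholder version string are assumptions.

```python
import argparse


def make_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the documented options (illustrative, not numiner's code)."""
    parser = argparse.ArgumentParser(prog="numiner")
    # The version string below is a placeholder for illustration.
    parser.add_argument("-v", "--version", action="version", version="%(prog)s (sketch)")
    # Each mode takes exactly two values: a source path and a result folder.
    parser.add_argument(
        "-s", "--sheet", nargs=2, metavar=("<source>", "<result>"),
        help="sheet image(s) to process and the folder for the results",
    )
    parser.add_argument(
        "-l", "--letter", nargs=2, metavar=("<source>", "<result>"),
        help="cropped image(s) to process and the folder for the results",
    )
    parser.add_argument("-c", metavar="<path>", help="path to a .json file of sheet labels")
    return parser


if __name__ == "__main__":
    args = make_parser().parse_args(["-c", "labels.json", "-s", "scans", "out"])
    print(args.sheet, args.letter, args.c)
```

With `nargs=2`, each mode's values arrive as a two-element list, so `<source>` and `<result>` must always be given together.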

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

License

MIT
