Toolkit for document image processing

Project description

Contextualization

Work-in-progress toolkit for document image preprocessing.

Aimed at images that will be OCRed.

Main available methods

  • Auto rotate image

    Uses the left margin of a document to calculate the angle of rotation present, and corrects it accordingly.

    Can be given the rotation direction (clockwise or counter_clockwise); in auto mode, it tries to determine the side to which the document is tilted (the result can be none, in which case the image won't be rotated).

  • Calculate rotation direction

    Calculates the rotation direction of an image by finding, for each direction (clockwise, counter_clockwise and none), the biggest set of first black pixel appearances in the image, with outliers removed.

    For the none direction, the set is built from pixels that share the same 'x' coordinate and differ in height by less than 5% of the image's height.

  • Binarize document

    Normal binarization with Otsu thresholding and fastNlMeansDenoising.

    Fax binarization, following the ImageMagick command: convert "image" -colorspace Gray ( +clone -blur 15,15 ) -compose Divide_Src -composite -level 10%,90%,0.2

  • Split document into columns

    Analyzes the pixel color frequency of the document image and splits it into columns.

  • Auto crop document

    Analyzes the pixel color frequency of the document image and cuts the document margins, aiming mostly to remove possible folds in the corners.

  • Identify document images

    Identifies images within a document image, using an algorithm available in leptonica's repository that finds potential image masks.

  • Get document delimiters

    Gets the document delimiters, using image transformations.

  • Segment document

    Segments the document image into header, body and footer, using delimiters. Only the body is always guaranteed to have a value.
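
The left-margin idea behind the auto rotate and rotation direction methods can be illustrated with a minimal sketch. This is not the package's implementation: the function names, the threshold, the tolerance and the sign convention for the direction are all assumptions made for illustration. The image is a plain list of rows of grayscale values (0 is black, 255 is white), and outlier removal is left out.

```python
import math


def first_black_per_row(image, threshold=128):
    """Return (y, x) of the first dark pixel in every row that has one."""
    points = []
    for y, row in enumerate(image):
        for x, value in enumerate(row):
            if value < threshold:
                points.append((y, x))
                break
    return points


def left_margin_angle(image, threshold=128):
    """Estimate the tilt in degrees with a least-squares fit x = a*y + b
    through the first dark pixel of each row (the left margin)."""
    points = first_black_per_row(image, threshold)
    n = len(points)
    if n < 2:
        return 0.0
    mean_y = sum(y for y, _ in points) / n
    mean_x = sum(x for _, x in points) / n
    var_y = sum((y - mean_y) ** 2 for y, _ in points)
    cov = sum((y - mean_y) * (x - mean_x) for y, x in points)
    return math.degrees(math.atan(cov / var_y))


def tilt_direction(angle, tolerance=0.5):
    """Map the signed angle to a direction name.

    Assumed convention (hypothetical, may differ from the package): with the
    y axis pointing down, a margin drifting right towards the bottom means
    the page is tilted counter-clockwise on screen.
    """
    if abs(angle) <= tolerance:
        return "none"
    return "counter_clockwise" if angle > 0 else "clockwise"
```

A straight vertical margin yields an angle of 0 and the direction none; the estimated angle could then be passed to any image rotation routine to deskew the page.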

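The fax binarization command quoted above can be approximated in pure Python as a rough sketch. Several things here are assumptions rather than the package's code: a box blur stands in for ImageMagick's Gaussian blur, the composite step is taken to mean the original divided by its blurred copy (the usual background-flattening idiom), and the function names are hypothetical. Pixel values are floats in [0, 1].

```python
def box_blur(image, radius):
    """Mean filter over a (2*radius+1) square window, clipped at the borders.
    Stand-in for the Gaussian blur of the original command."""
    height, width = len(image), len(image[0])
    blurred = []
    for y in range(height):
        row = []
        for x in range(width):
            total, count = 0.0, 0
            for yy in range(max(0, y - radius), min(height, y + radius + 1)):
                for xx in range(max(0, x - radius), min(width, x + radius + 1)):
                    total += image[yy][xx]
                    count += 1
            row.append(total / count)
        blurred.append(row)
    return blurred


def level(value, black=0.10, white=0.90, gamma=0.2):
    """Stretch [black, white] to [0, 1], clamp, then apply gamma
    (as in -level 10%,90%,0.2, assuming the exponent is 1/gamma)."""
    scaled = (value - black) / (white - black)
    scaled = min(1.0, max(0.0, scaled))
    return scaled ** (1.0 / gamma)


def fax_binarize(image, radius=15):
    """Divide each pixel by its local background, then apply the level step."""
    background = box_blur(image, radius)
    result = []
    for row, bg_row in zip(image, background):
        out_row = []
        for value, bg in zip(row, bg_row):
            # A fully black neighbourhood is treated as white (assumption).
            ratio = 1.0 if bg == 0 else min(1.0, value / bg)
            out_row.append(level(ratio))
        result.append(out_row)
    return result
```

Dividing by the blurred copy flattens uneven lighting, so a uniformly gray page comes out white, while the low gamma pushes anything noticeably darker than its surroundings towards black.
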
Bash commands:

  • binarize : binarize document image.

  • rotate_document : rotate document image.

  • split_columns : split document into column images.

  • d_auto_crop : auto crop document image.

Download files

Source Distribution

document_image_utils-0.1.25.tar.gz (32.0 kB)

Built Distribution

document_image_utils-0.1.25-py3-none-any.whl (36.9 kB)
