Skip to main content

Optical character recognition (OCR) tool for printed book pages

Project description

BookOcr

Optical character recognition (OCR) tool for printed book pages.

Usage examples:

from bookocr.ocr import Ocr

ocr = Ocr()
image_path = "my_image.png"
nested_list_structure = ocr.image_ocr(image_path)  # pages > text areas > lines > words
text = ocr.get_data_as_text()  # the same result, but joined

print(text)
from bookocr.config import OcrConfig
from bookocr.stats_config import OcrStatsConfig
from bookocr.ocr import Ocr

# optional
config = OcrConfig()
# ... set config values here

# optional as well
# provides intermediate results of image processing 
stats_config = OcrStatsConfig()
stats_config.set_enabled_true("stats_folder")
# ... set stats_config values here

ocr = Ocr(config, stats_config)
image_path = "my_image.png"
nested_list_structure = ocr.image_ocr(image_path)  # pages > text areas > lines > words
text = ocr.get_data_as_text()  # the same result, but joined

print(text)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bookocr-1.0.2.tar.gz (1.9 MB view details)

Uploaded Source

Built Distribution

bookocr-1.0.2-py3-none-any.whl (1.9 MB view details)

Uploaded Python 3

File details

Details for the file bookocr-1.0.2.tar.gz.

File metadata

  • Download URL: bookocr-1.0.2.tar.gz
  • Upload date:
  • Size: 1.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.5

File hashes

Hashes for bookocr-1.0.2.tar.gz
Algorithm Hash digest
SHA256 e689992e2ef40e26394d0624f6dedd7f54cf2908cafc3d3470c66144967549d8
MD5 0b9436edfddee85ba55f16ce115d646e
BLAKE2b-256 ba7f2a76319f78d918089809c95a7ff913b41d206abe5dfd1098c6b75dc0e474

See more details on using hashes here.

File details

Details for the file bookocr-1.0.2-py3-none-any.whl.

File metadata

  • Download URL: bookocr-1.0.2-py3-none-any.whl
  • Upload date:
  • Size: 1.9 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.5

File hashes

Hashes for bookocr-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 da343cfd6cdb74cc8303d7304ac4e638156c0a380546b9acc32e8a5437f12698
MD5 abcf87eaa9ba882ea0bb252c407e1d7b
BLAKE2b-256 c46afd342cf24e295b9e29034fd03d82d52480f03c785d184a3c9eeb1eb9e20b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page