Skip to main content

Optical character recognition (OCR) tool for printed book pages

Project description

BookOcr

Optical character recognition (OCR) tool for printed book pages.

Usage examples:

from bookocr.ocr import Ocr

ocr = Ocr()
image_path = "my_image.png"
nested_list_structure = ocr.image_ocr(image_path)  # pages > text areas > lines > words
text = ocr.get_data_as_text()  # the same result, but joined

print(text)
from bookocr.config import OcrConfig
from bookocr.stats_config import OcrStatsConfig
from bookocr.ocr import Ocr

# optional
config = OcrConfig()
# ... set config values here

# optional as well
# provides intermediate results of image processing 
stats_config = OcrStatsConfig()
stats_config.set_enabled_true("stats_folder")
# ... set stats_config values here

ocr = Ocr(config, stats_config)
image_path = "my_image.png"
nested_list_structure = ocr.image_ocr(image_path)  # pages > text areas > lines > words
text = ocr.get_data_as_text()  # the same result, but joined

print(text)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bookocr-1.0.0.tar.gz (12.2 kB view details)

Uploaded Source

Built Distribution

bookocr-1.0.0-py3-none-any.whl (13.9 kB view details)

Uploaded Python 3

File details

Details for the file bookocr-1.0.0.tar.gz.

File metadata

  • Download URL: bookocr-1.0.0.tar.gz
  • Upload date:
  • Size: 12.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.5

File hashes

Hashes for bookocr-1.0.0.tar.gz
Algorithm Hash digest
SHA256 4fb55556fb36b6a688550a99b421d618f42001b0bc2e5902c3eda160291d3f18
MD5 ad3064cc600499c70491015323b6de69
BLAKE2b-256 811a2f6b81c5be23258173d7e1dd65189ccd5f7f43c64759e03117d773959fe9

See more details on using hashes here.

File details

Details for the file bookocr-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: bookocr-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 13.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.5

File hashes

Hashes for bookocr-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 458cb4c1fc833c556e43b4e54e6999a14af4febd096aeb2e6953173fa53c3a76
MD5 19117f32c034e63da5185f58c5d6334c
BLAKE2b-256 3945f592663f6cd036f9bad670682b02e4ac074b389610559f1c0122c5a72367

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page