Skip to main content

doc page extractor can identify text and format in images and return structured data.

Project description

doc page extractor

English | 中文

Introduction

doc page extractor can identify text and format in images and return structured data.

Installation

pip install doc-page-extractor
pip install onnxruntime==1.21.0

Using CUDA

Please refer to the introduction of PyTorch and select the appropriate command to install according to your operating system.

In addition, replace the command to install onnxruntime in the previous article with the following:

pip install onnxruntime-gpu==1.21.0

Example

from PIL import Image
from doc_page_extractor import DocExtractor

extractor = DocExtractor(
  model_dir_path=model_path, # Folder address where AI model is downloaded and installed
  device="cpu", # If you want to use CUDA, please change to device="cuda".
)
with Image.open("/path/to/your/image.png") as image:
  result = extractor.extract(
  image=image,
  lang="ch", # Language of image text
)
for layout in result.layouts:
  for fragment in layout.fragments:
    print(fragment.rect, fragment.text)

Acknowledgements

The code of doc_page_extractor/onnxocr in this repo comes from OnnxOCR.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

doc-page-extractor-test-0.2.2.tar.gz (62.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

doc_page_extractor_test-0.2.2-py3-none-any.whl (132.1 kB view details)

Uploaded Python 3

File details

Details for the file doc-page-extractor-test-0.2.2.tar.gz.

File metadata

  • Download URL: doc-page-extractor-test-0.2.2.tar.gz
  • Upload date:
  • Size: 62.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.13

File hashes

Hashes for doc-page-extractor-test-0.2.2.tar.gz
Algorithm Hash digest
SHA256 d7497327610e2d9c206d1993c40ec6da72a7f3d7bf05ff42ebac6d16c2cfc2e1
MD5 bf2b893da38ae5c0df831f4de53809da
BLAKE2b-256 84c00a5e06b9a0e3f38bd159047d79fcc6808a736b2029dbb3113b41b07315fd

See more details on using hashes here.

File details

Details for the file doc_page_extractor_test-0.2.2-py3-none-any.whl.

File metadata

File hashes

Hashes for doc_page_extractor_test-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 b6a46341cd2cac46cff0aedf4e3da703aa83d76c978d0b134f16032c2ce2b332
MD5 94c261866e836bcbf762a134423f02d3
BLAKE2b-256 3d4a2f716334875da51b9fd48910e3e9ef4ce7d10d806535c27a1d59abf9960c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page