OCR library with layout reconstruction, anonymization, and summarization

Project description

ocr_pdf2txt

A Python library for OCR-based text extraction with advanced features like layout reconstruction, anonymization, and summarization.

Download files

Source Distribution

ocr_pdf2txt-0.1.0.tar.gz (4.0 kB)

Uploaded Source

Built Distribution

ocr_pdf2txt-0.1.0-py3-none-any.whl (4.3 kB)

Uploaded Python 3

File details

Details for the file ocr_pdf2txt-0.1.0.tar.gz.

File metadata

  • Download URL: ocr_pdf2txt-0.1.0.tar.gz
  • Upload date:
  • Size: 4.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for ocr_pdf2txt-0.1.0.tar.gz
Algorithm    Hash digest
SHA256       38652f2b2600f2a212db2c5ab497c7c909627ab0b674bc72db5550a08085c0ce
MD5          928567c5a057b367553d9b529108e6e6
BLAKE2b-256  075419d7c05fcd5e94719811ce17a011b4f3a60ff0ff34f4c742f94d81b8d6cc

See more details on using hashes here.
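The published digests can be used to check a downloaded file before installing it. The following is a minimal sketch, not part of the package itself: it uses only Python's standard hashlib module, and the helper names (sha256_of, matches_expected) and the local file path are assumptions made for illustration.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for ocr_pdf2txt-0.1.0.tar.gz
EXPECTED_SHA256 = "38652f2b2600f2a212db2c5ab497c7c909627ab0b674bc72db5550a08085c0ce"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops when read() returns empty bytes (EOF)
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Compare a file's SHA256 digest with the expected hex string."""
    return sha256_of(path) == expected.lower()


if __name__ == "__main__":
    # Hypothetical location of the downloaded archive; adjust as needed.
    archive = Path("ocr_pdf2txt-0.1.0.tar.gz")
    if archive.exists():
        print("OK" if matches_expected(archive) else "MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

The same check works for the wheel by passing its SHA256 digest as the expected value. SHA256 is the better choice of the listed algorithms for this purpose, since MD5 is not collision-resistant.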

File details

Details for the file ocr_pdf2txt-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: ocr_pdf2txt-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 4.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for ocr_pdf2txt-0.1.0-py3-none-any.whl
Algorithm    Hash digest
SHA256       4eac792be420c750d72c15b7c16516d7b195a92bcc91d9aa4ea037cee8b2f279
MD5          2c7251df07353112d8bc30d7e4edf20a
BLAKE2b-256  583415f4a68471a2819c82d809d41f724123f97ea165a480f1a2054d2a7fd52b
