A versatile OCR and document processing command-line tool.

Project description

My OCR Tool

A versatile OCR and document processing command-line tool, built with Python and the docling library. It uses RapidOCR by default and also supports EasyOCR.

Features

  • Process various file types: PDF, PNG, JPG, DOCX, XLSX, CSV.
  • Supports rapidocr (default) and easyocr engines, configurable via YAML files.
  • Outputs processed documents into Markdown, JSON, and YAML formats.

Installation

First, ensure you have Python 3.8+ installed.

You can install the tool from PyPI. The default engine, rapidocr, is included automatically.

pip install myocr-tool

Test the Optional Engine (EasyOCR)

my-ocr-tool --ocr-engine easyocr "path/to/your/image.png"

Test with a Configuration File

my-ocr-tool --config rapidocr_config.yaml "path/to/your/image.png"
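The two invocations above can also be scripted when several files need processing. The following is a minimal sketch, not part of the tool itself: `build_command` and `run_batch` are hypothetical helper names, and the only flags used are the `--ocr-engine` and `--config` options shown above. Whether the two flags may be combined is not stated here, so the sketch passes whichever ones the caller supplies and defaults to a dry run that only prints the commands.

```python
import shlex
import subprocess
from pathlib import Path
from typing import Iterable, List, Optional


def build_command(path: Path, engine: Optional[str] = None,
                  config: Optional[Path] = None) -> List[str]:
    """Build an argument list for my-ocr-tool (hypothetical helper).

    Only the flags documented above are used: --ocr-engine and --config.
    """
    cmd = ["my-ocr-tool"]
    if engine is not None:
        cmd += ["--ocr-engine", engine]
    if config is not None:
        cmd += ["--config", str(config)]
    cmd.append(str(path))
    return cmd


def run_batch(paths: Iterable[Path], engine: Optional[str] = None,
              config: Optional[Path] = None,
              dry_run: bool = True) -> List[List[str]]:
    """Run (or, with dry_run=True, just print) one command per input file."""
    commands = []
    for path in paths:
        cmd = build_command(path, engine=engine, config=config)
        commands.append(cmd)
        if dry_run:
            # Print a shell-quoted version of the command without executing it.
            print(shlex.join(cmd))
        else:
            # check=True raises CalledProcessError if the tool exits non-zero.
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    # Assumed input files; replace with real paths before disabling dry_run.
    run_batch([Path("invoice.pdf"), Path("scan.png")], engine="easyocr")
```

Passing the arguments as a list rather than a single shell string avoids quoting problems with paths that contain spaces.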

Download files

Source Distribution

ocr_my-0.1.1.tar.gz (4.1 kB view details)

Uploaded Source

Built Distribution

ocr_my-0.1.1-py3-none-any.whl (4.4 kB view details)

Uploaded Python 3

File details

Details for the file ocr_my-0.1.1.tar.gz.

File metadata

  • Download URL: ocr_my-0.1.1.tar.gz
  • Upload date:
  • Size: 4.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.21

File hashes

Hashes for ocr_my-0.1.1.tar.gz
Algorithm Hash digest
SHA256 4eb344a368e41b74a845ecdb883991d3ef5dfb0477459748be2ff18cbc086de1
MD5 d71c487f7dd2a17c4ba30398f13e89da
BLAKE2b-256 d21ce4c99bf8b6f73fc45f1e68f324bad7e9645a30f00dedaefcfbc89991bff0
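The SHA256 digest listed above can be compared against a downloaded copy of the archive. The following is a minimal sketch using Python's standard `hashlib` module; `sha256_of_file` and `matches_expected` are hypothetical helper names, and the script assumes the archive was saved to the current directory under its published file name.

```python
import hashlib
from pathlib import Path

# SHA256 digest published above for ocr_my-0.1.1.tar.gz.
EXPECTED_SHA256 = "4eb344a368e41b74a845ecdb883991d3ef5dfb0477459748be2ff18cbc086de1"


def sha256_of_file(path, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex: str) -> bool:
    """Compare a file's digest with an expected hex string, ignoring case."""
    return sha256_of_file(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Assumed download location; adjust to wherever the file was saved.
    archive = Path("ocr_my-0.1.1.tar.gz")
    if archive.exists():
        ok = matches_expected(archive, EXPECTED_SHA256)
        print("digest matches" if ok else "digest MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

The same check works for the wheel by substituting its file name and published SHA256 value.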

File details

Details for the file ocr_my-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: ocr_my-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 4.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.21

File hashes

Hashes for ocr_my-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 6caf40a1d0e5f4de771f0e8cee31bc026cbfb814cf5fdd44a7ad4ca600fb8e35
MD5 c8c9eefae89f4d55dc27f5fb5c2f5dba
BLAKE2b-256 01850fe55b2501ce3cc190faf1ae14294efd3d15301af9016edaaf74e1af4b74
