My OCR Tool

A versatile OCR and document processing command-line tool, built with Python and the docling library. It uses RapidOCR by default and also supports EasyOCR.

Features

  • Processes PDF, PNG, JPG, DOCX, XLSX, and CSV files.
  • Supports rapidocr (default) and easyocr engines, configurable via YAML files.
  • Outputs processed documents into Markdown, JSON, and YAML formats.
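
As an illustration of the file types listed above, the following minimal sketch filters a list of paths down to those with a documented extension before handing them to the tool. The names `SUPPORTED_EXTENSIONS`, `is_supported`, and `filter_supported` are hypothetical helpers, not part of the tool, and matching on the file extension is an assumption; the tool's own type detection is not described here.

```python
from pathlib import Path

# Extensions copied from the feature list. Matching on the suffix is an
# assumption; the tool may detect file types differently.
SUPPORTED_EXTENSIONS = {".pdf", ".png", ".jpg", ".docx", ".xlsx", ".csv"}


def is_supported(path: str) -> bool:
    """Return True if the path's extension is in the documented list."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS


def filter_supported(paths):
    """Keep only the paths whose extension is in the documented list."""
    return [p for p in paths if is_supported(p)]
```

Note that `.jpeg` is not in the documented list, so this sketch rejects it; whether the tool accepts that spelling is not stated.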

Installation

First, ensure you have Python 3.8+ installed.

You can install the tool from PyPI. The default engine, rapidocr, is included automatically.

pip install myocr-tool
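
After installing, one quick way to confirm the command is reachable is to look it up on the PATH. This is a minimal sketch using only the Python standard library; `cli_available` is a hypothetical helper, and the command name `my-ocr-tool` is taken from the usage examples in this description.

```python
import shutil


def cli_available(command: str = "my-ocr-tool") -> bool:
    """Return True if the given command can be found on the current PATH."""
    # shutil.which returns the full path to the executable, or None.
    return shutil.which(command) is not None
```

Calling `cli_available()` in the environment where the package was installed should return `True` if the entry point was set up as the examples suggest.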

Test the Optional Engine (EasyOCR)

my-ocr-tool --ocr-engine easyocr "path/to/your/image.png"

Test with a Configuration File

my-ocr-tool --config rapidocr_config.yaml "path/to/your/image.png"
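
The two invocations above can also be assembled from a script, for example when processing many files. The sketch below builds the argument list using only the `--ocr-engine` and `--config` flags shown in this description; `build_command` and `run_ocr` are hypothetical helpers. Passing `rapidocr` explicitly to `--ocr-engine`, and combining both flags in one call, are assumptions that the description does not confirm.

```python
import subprocess

# Engine names as given in the feature list.
KNOWN_ENGINES = {"rapidocr", "easyocr"}


def build_command(input_path, engine=None, config=None):
    """Build the argument list for one my-ocr-tool invocation."""
    if engine is not None and engine not in KNOWN_ENGINES:
        raise ValueError(f"unknown OCR engine: {engine!r}")
    cmd = ["my-ocr-tool"]
    if engine is not None:
        cmd += ["--ocr-engine", engine]
    if config is not None:
        cmd += ["--config", config]
    cmd.append(input_path)  # the input file goes last, as in the examples
    return cmd


def run_ocr(input_path, engine=None, config=None):
    """Run the tool and raise CalledProcessError on a non-zero exit code."""
    return subprocess.run(build_command(input_path, engine, config), check=True)
```

For instance, `run_ocr("path/to/your/image.png", engine="easyocr")` runs the same command as the EasyOCR example. Passing the arguments as a list avoids shell quoting problems with paths that contain spaces.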
