Skip to main content

No project description provided

Project description

ocrmypdf-rapidocr

ocrmypdf-rapidocr is an OCRmyPDF plugin that uses RapidOCR as an OCR engine.

Status

Supported:

  • OCR engine integration via OCRmyPDF plugin hooks
  • hOCR output path (--pdf-renderer auto or --pdf-renderer fpdf2)
  • ONNXRuntime backend only
  • Single language selection from -l/--language

Not supported:

  • --pdf-renderer sandwich
  • multi-language combinations such as -l eng+fra

Installation

pip install ocrmypdf-rapidocr

Or from source:

pip install .

Usage

Load the plugin explicitly with --plugin:

ocrmypdf --plugin ocrmypdf_rapidocr -l eng input.pdf output.pdf

Optional plugin arguments:

  • --rapidocr-config-path PATH: use a custom RapidOCR YAML config

Example:

ocrmypdf \
  --plugin ocrmypdf_rapidocr \
  -l deu \
  input.pdf output.pdf

Language behavior

The plugin uses the first OCRmyPDF language code and maps it to RapidOCR language families.

  • direct mappings: eng, chi_sim, chi_tra, jpn, kor, ara, rus, ukr, tha, tam, tel, ell/gre
  • selected Latin-script codes map to RapidOCR LATIN

If a language code is unsupported, OCRmyPDF exits with a clear error message.

Runtime model downloads

RapidOCR downloads model files on first use when model paths are not pinned in config. For offline or restricted environments, provide a custom config via --rapidocr-config-path that points to local model files.

References

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ocrmypdf_rapidocr-1.0.0.tar.gz (6.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ocrmypdf_rapidocr-1.0.0-py3-none-any.whl (8.5 kB view details)

Uploaded Python 3

File details

Details for the file ocrmypdf_rapidocr-1.0.0.tar.gz.

File metadata

  • Download URL: ocrmypdf_rapidocr-1.0.0.tar.gz
  • Upload date:
  • Size: 6.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for ocrmypdf_rapidocr-1.0.0.tar.gz
Algorithm Hash digest
SHA256 d144d87e8e3caa1f4e630595f3e51dae9e5be60dbb2b7c84bd98f9ae876dedf4
MD5 46780636c0156d4b88e9ef1bb14faceb
BLAKE2b-256 692f62be7a22c4f11c97a4bfdeada94027ad94e49441331a87b33d7a6e2564f0

See more details on using hashes here.

File details

Details for the file ocrmypdf_rapidocr-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: ocrmypdf_rapidocr-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 8.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for ocrmypdf_rapidocr-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 90af147d45d1c5acb3e1ab667df927440999d93700edd4fe2beca8ebec344474
MD5 400e2e9eb8a6225fe76d2f5e87af3d10
BLAKE2b-256 b87ef9fb325afb8c79f88ba4bd1ed5ceefb4bf8d6e4065320875e1f48e596adc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page