No project description provided
Project description
ocrmypdf-rapidocr
ocrmypdf-rapidocr is an OCRmyPDF plugin that uses RapidOCR as an OCR engine.
Status
Supported:
- OCR engine integration via OCRmyPDF plugin hooks
hOCRoutput path (--pdf-renderer autoor--pdf-renderer fpdf2)- ONNXRuntime backend only
- Single language selection from
-l/--language
Not supported:
--pdf-renderer sandwich- multi-language combinations such as
-l eng+fra
Installation
pip install ocrmypdf-rapidocr
Or from source:
pip install .
Usage
Load the plugin explicitly with --plugin:
ocrmypdf --plugin ocrmypdf_rapidocr -l eng input.pdf output.pdf
Optional plugin arguments:
--rapidocr-config-path PATH: use a custom RapidOCR YAML config
Example:
ocrmypdf \
--plugin ocrmypdf_rapidocr \
-l deu \
input.pdf output.pdf
Language behavior
The plugin uses the first OCRmyPDF language code and maps it to RapidOCR language families.
- direct mappings:
eng,chi_sim,chi_tra,jpn,kor,ara,rus,ukr,tha,tam,tel,ell/gre - selected Latin-script codes map to RapidOCR
LATIN
If a language code is unsupported, OCRmyPDF exits with a clear error message.
Runtime model downloads
RapidOCR downloads model files on first use when model paths are not pinned in config.
For offline or restricted environments, provide a custom config via
--rapidocr-config-path that points to local model files.
References
- OCRmyPDF plugin API docs: https://github.com/ocrmypdf/OCRmyPDF/blob/main/docs/plugins.md
- OCRmyPDF EasyOCR reference plugin: https://github.com/ocrmypdf/OCRmyPDF-EasyOCR
- OCRmyPDF AppleOCR reference plugin: https://github.com/mkyt/OCRmyPDF-AppleOCR
- OCRmyPDF PaddleOCR reference plugin: https://github.com/clefru/ocrmypdf-paddleocr
- RapidOCR project: https://github.com/RapidAI/RapidOCR
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ocrmypdf_rapidocr-1.0.0.tar.gz.
File metadata
- Download URL: ocrmypdf_rapidocr-1.0.0.tar.gz
- Upload date:
- Size: 6.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d144d87e8e3caa1f4e630595f3e51dae9e5be60dbb2b7c84bd98f9ae876dedf4
|
|
| MD5 |
46780636c0156d4b88e9ef1bb14faceb
|
|
| BLAKE2b-256 |
692f62be7a22c4f11c97a4bfdeada94027ad94e49441331a87b33d7a6e2564f0
|
File details
Details for the file ocrmypdf_rapidocr-1.0.0-py3-none-any.whl.
File metadata
- Download URL: ocrmypdf_rapidocr-1.0.0-py3-none-any.whl
- Upload date:
- Size: 8.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
90af147d45d1c5acb3e1ab667df927440999d93700edd4fe2beca8ebec344474
|
|
| MD5 |
400e2e9eb8a6225fe76d2f5e87af3d10
|
|
| BLAKE2b-256 |
b87ef9fb325afb8c79f88ba4bd1ed5ceefb4bf8d6e4065320875e1f48e596adc
|