Fast CPU OCR — PaddleOCR PP-OCRv6 tiny (lightweight), reimplemented in Rust + ONNX Runtime. ~7x faster than PaddlePaddle, self-contained wheels with models bundled.

These details have not been verified by PyPI

Project links

Project description

faster-paddle

Fast, CPU-only OCR in Rust with Python bindings — a self-contained reimplementation of PaddleOCR's PP-OCRv6 detection + recognition pipeline powered by ONNX Runtime.

⚡ ~9× faster than paddleocr on CPU for the same models and output (parallel detection pre/post-processing + a concurrent recognition session pool).
📦 Self-contained — the tiny + small ONNX models are bundled inside the wheel. No paddlepaddle, no model downloads for tiny/small.
🎚️ Three model sizes: tiny (default, fastest), small, and medium (higher accuracy; downloaded once on first use and cached).
🦀 Pure-Rust pre/post-processing (detection DB decode, minAreaRect, perspective crop, CTC decode, reading-order text reconstruction). No OpenCV.
🖥️ Prebuilt wheels for Linux, Windows, macOS (x86-64 + arm64).

paddleocr (PaddlePaddle, CPU)        22.7 s / image
faster-paddle (Rust + ONNXRuntime)    2.5 s / image     →  ~9× faster

(test image 3157×4464, AMD Ryzen 7 5800X3D; both after warm-up, same weights.)

Install

pip install faster-paddle

Usage

import faster_paddle

# One-shot, using a shared default engine (lazily initialized):
with open("document.jpg", "rb") as f:
    result = faster_paddle.ocr(f.read())

print(result["text"])              # reading-order reconstructed text
for idx, b in result["bounds"].items():
    print(idx, b["text"], b["confidence"], b["topLeftCoord"], b["bottomRightCoord"])

Reuse an explicit engine (recommended for servers — load the models once):

from faster_paddle import OcrEngine

# model_size: "tiny" (default), "small", or "medium"
engine = OcrEngine(model_size="tiny", threads=None, det_max_side=1600)

result = engine.ocr(image_bytes)                 # raw jpeg/png/webp/bmp/tiff/gif bytes
result = engine.ocr_base64(b64_string)           # base64-encoded image

Optional preprocessing

ocr / ocr_base64 take four optional flags (all default False), applied — when enabled — in the optimal order, all in fast parallel Rust:

result = engine.ocr(
    image_bytes,
    resize=True,     # 1. downscale to ≤ 2100×3000 (aspect preserved) if larger
    denoise=True,    # 2. fast Non-Local-Means denoise (grayscale)
    deskew=True,     # 3. detect skew (Canny + Hough) and rotate to straighten
    binarize=True,   # 4. Sauvola adaptive thresholding (clean black/white)
)

Order rationale: resize first (everything downstream is then faster), denoise before angle detection and thresholding, deskew on the cleaned image, binarize last to produce the final B/W. Enabling resize typically makes OCR faster overall (less detector work). Any of denoise/deskew/binarize converts the image to grayscale.

Returned bounds are always in the original image's coordinate space — even when resize or deskew changes the working image, the boxes are mapped back so they line up with your input.

Preprocess only (no OCR)

prepare runs the same preprocessing in one pass and returns the prepared image as PNG bytes (grayscale once any of denoise/deskew/binarize is on, else color). If every option is False the original bytes are returned unchanged.

prepared = engine.prepare(image_bytes, resize=True, denoise=True, deskew=True, binarize=False)
# or module-level:  faster_paddle.prepare(image_bytes, resize=True, ...)

with open("prepared.png", "wb") as f:
    f.write(prepared)
# you can also feed it straight back in:
result = engine.ocr(prepared)

Model sizes

size	bundled	det+rec	notes
`tiny`	✅ yes	~6 MB	default, fastest, lightweight
`small`	✅ yes	~31 MB	better accuracy
`medium`	⬇️ on demand	~138 MB	best accuracy; downloaded once from the GitHub release and cached under your user cache dir

tiny and small are embedded in the wheel (offline). medium exceeds PyPI's file-size limit, so the first OcrEngine(model_size="medium") downloads it once (needs network that time only) and caches it for subsequent runs.

Result shape

{
  "text": "full reconstructed text...",
  "structured_text": "layout-preserving text (see below)",
  "bounds": {
     0: {
        "topLeftCoord":     (x1, y1),
        "bottomRightCoord": (x2, y2),
        "text":             "line text",
        "confidence":       0.97,
     },
     1: { ... },
  }
}

text and bounds match the JSON contract of the original paddle-ocr-api service, so it is a drop-in replacement.

`structured_text`

A spatial reconstruction that reads left-to-right, top-to-bottom while preserving the visual layout: vertical whitespace gaps split the page into columns/panes (each read fully before the next), and within each one the rows are laid out as a monospace grid, so indentation (tree nesting) and aligned sub-columns (key/value tables) are kept. Single-glyph UI icon noise is dropped.

Use structured_text for screenshots, forms, table/tree UIs, and code — anything where spatial structure carries meaning. Use text for dense multi-column prose: there the absolute pixel spacing of structured_text produces very wide lines, so the column-merging text reconstruction reads better. Both are always returned, so you can pick per use case.

Example structured_text for a two-pane file-tree + settings UI:

Project
 src (14)
   main.rs
   parser.rs
   utils.rs
 tests
 docs

Setting                                            Value
        max_connections                            128
        request_timeout_seconds                    30
        cache_size_mb                              512

API


`faster_paddle.ocr(image, resize=False, denoise=False, deskew=False, binarize=False) -> dict`	OCR encoded image bytes (shared default engine).
`faster_paddle.ocr_base64(image_base64, resize=False, denoise=False, deskew=False, binarize=False) -> dict`	OCR a base64 image string.
`OcrEngine(model_size="tiny", threads=None, rec_batch=None, det_max_side=None)`	Construct a reusable engine.
`OcrEngine.ocr(image, resize=False, denoise=False, deskew=False, binarize=False) -> dict`	OCR encoded image bytes.
`OcrEngine.ocr_base64(image_base64, resize=False, denoise=False, deskew=False, binarize=False) -> dict`	OCR a base64 image string.
`faster_paddle.prepare(image, resize=False, denoise=False, deskew=False, binarize=False) -> bytes`	Preprocess only; returns PNG bytes (no OCR).
`OcrEngine.prepare(image, resize=False, denoise=False, deskew=False, binarize=False) -> bytes`	Preprocess only; returns PNG bytes (no OCR).

resize/denoise/deskew/binarize: optional preprocessing (see above).
model_size: "tiny" (default), "small", or "medium".
threads: ONNX Runtime intra-op threads. Defaults to the number of physical CPU cores (SMT/logical threads tend to slow compute-bound inference down).
rec_batch: recognition batch cap (default 4; batching is otherwise adaptive).
det_max_side: cap on the detector's longer side (default 1600). Large images are downscaled to this for detection only — recognition still crops from the full-resolution image, so text stays sharp. This makes detection ~2× faster on typical documents with negligible quality loss (the tiny detector locates text just as well below its useful resolution). Never upscales. Raise toward 4000 (PaddleOCR's default) for microscopic text, or lower (e.g. 1280) for more speed.

Calls are thread-safe (serialized internally) and release the GIL during inference.

Parallelism (automatic, hardware-scaled)

All parallelism is derived from the detected hardware — nothing is hardcoded:

ONNX Runtime threads default to the number of physical cores.
Recognition runs across a pool of ONNX Runtime sessions sized to target ~4 threads per session (cores/4, capped for memory), so it scales with the core count. Crops are grouped by a pixel budget so narrow crops batch together while wide line-crops run nearly alone (no wasted padding compute).
The pre/post-processing (resize, denoise, deskew, binarize, DB decode, crop extraction) runs on rayon, scaled to the logical cores.

Override via OCR_THREADS, REC_POOL, REC_BUDGET, and RAYON_NUM_THREADS.

How it works

The pipeline faithfully mirrors PaddleOCR's lightweight path:

Detection — resize (min-side 736, cap the longer side at det_max_side = 1600 by default vs PaddleOCR's 4000, round to ×32), normalize (BGR mean/std), run the DB detector. Detecting at a lower resolution locates text just as well and is much faster; recognition still crops from the full-res image.
DB post-process — threshold 0.2, connected components, minAreaRect, box score ≥ 0.4, unclip ratio 1.4, rescale to source coordinates.
Sort boxes top-to-bottom / left-to-right; crop each via perspective warp.
Recognition — resize each crop to H=48, normalize, batch, run the CTC recognizer ([N, T, 6906]), greedy CTC decode.
Reconstruct reading-order text with dynamic column/line detection.

Detection matches PaddlePaddle at 96 % IoU>0.5 with 0.93 character-level similarity on the recognized text; the residual difference is ONNX-Runtime vs PaddlePaddle floating-point numerics, not the algorithm.

The bundled models are PP-OCRv6_tiny_det and PP-OCRv6_tiny_rec exported with paddle2onnx.

Building from source

pip install maturin
maturin develop --release      # build + install into the current environment
# or
maturin build --release        # produce a wheel in target/wheels/

Requires a Rust toolchain. ONNX Runtime is fetched automatically by the ort crate at build time and linked into the extension.

Tests

cargo test --release                 # Rust unit tests (geometry, resize, CTC)
maturin develop --release            # then the Python integration tests:
python faster_paddle/tests/test_integration.py

The integration tests check the result shape, known-text detection, that the recognition session pool is deterministic, that bounds map back to original coordinates after resize, that all preprocessing options run, and a speed regression guard.

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.0.1

Jul 2, 2026

1.0.0

Jul 2, 2026

This version

0.0.14

Jul 1, 2026

0.0.13

Jul 1, 2026

0.0.12

Jun 30, 2026

0.0.11

Jun 30, 2026

0.0.10

Jun 30, 2026

0.0.9

Jun 30, 2026

0.0.8

Jun 30, 2026

0.0.7

Jun 23, 2026

0.0.6

Jun 23, 2026

0.0.5

Jun 22, 2026

0.0.4

Jun 22, 2026

0.0.3

Jun 22, 2026

0.0.2

Jun 22, 2026

0.0.1

Jun 22, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

faster_paddle-0.0.14.tar.gz (32.3 MB view details)

Uploaded Jul 1, 2026 Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

faster_paddle-0.0.14-cp38-abi3-win_amd64.whl (72.9 MB view details)

Uploaded Jul 1, 2026 CPython 3.8+Windows x86-64

faster_paddle-0.0.14-cp38-abi3-manylinux_2_34_x86_64.whl (74.0 MB view details)

Uploaded Jul 1, 2026 CPython 3.8+manylinux: glibc 2.34+ x86-64

faster_paddle-0.0.14-cp38-abi3-manylinux_2_28_aarch64.whl (74.9 MB view details)

Uploaded Jul 1, 2026 CPython 3.8+manylinux: glibc 2.28+ ARM64

faster_paddle-0.0.14-cp38-abi3-macosx_11_0_arm64.whl (72.9 MB view details)

Uploaded Jul 1, 2026 CPython 3.8+macOS 11.0+ ARM64

File details

Details for the file faster_paddle-0.0.14.tar.gz.

File metadata

Download URL: faster_paddle-0.0.14.tar.gz
Upload date: Jul 1, 2026
Size: 32.3 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: maturin/1.14.1

File hashes

Hashes for faster_paddle-0.0.14.tar.gz
Algorithm	Hash digest
SHA256	`46644ae4e4bfab03f82a3533e9837a0e43d1424d282687780a57deb3f0b67ed3`
MD5	`a17ec490e4f6c39bf3c55a3dfb830e0b`
BLAKE2b-256	`e938a564d28a716434d983e31f0972f93c201a0a8a4d8f703f4226412f01332b`

See more details on using hashes here.

File details

Details for the file faster_paddle-0.0.14-cp38-abi3-win_amd64.whl.

File metadata

Download URL: faster_paddle-0.0.14-cp38-abi3-win_amd64.whl
Upload date: Jul 1, 2026
Size: 72.9 MB
Tags: CPython 3.8+, Windows x86-64
Uploaded using Trusted Publishing? No
Uploaded via: maturin/1.14.1

File hashes

Hashes for faster_paddle-0.0.14-cp38-abi3-win_amd64.whl
Algorithm	Hash digest
SHA256	`8089d1b91f22b09d4bb9f131d54002d1249ece8f65454f669d6277301f69707f`
MD5	`22c510f6d2abc3ab07002353901c5eab`
BLAKE2b-256	`b6a0a0761687223986bb923b000fe4f1db97a9ef8ff2097cfe186c0153b4f9a8`

See more details on using hashes here.

File details

Details for the file faster_paddle-0.0.14-cp38-abi3-manylinux_2_34_x86_64.whl.

File metadata

Download URL: faster_paddle-0.0.14-cp38-abi3-manylinux_2_34_x86_64.whl
Upload date: Jul 1, 2026
Size: 74.0 MB
Tags: CPython 3.8+, manylinux: glibc 2.34+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: maturin/1.14.1

File hashes

Hashes for faster_paddle-0.0.14-cp38-abi3-manylinux_2_34_x86_64.whl
Algorithm	Hash digest
SHA256	`4a1ee89ab142342d4a109bdb1bd9f98a307149bfa1a57f41033a8dba14aaf10b`
MD5	`1ebc6a755b85bb304d26ad02503e1d63`
BLAKE2b-256	`9a9bbe3981fcc30f18559278dee69bd3f93d3b9dcf7235937657ddf5d99de0d6`

See more details on using hashes here.

File details

Details for the file faster_paddle-0.0.14-cp38-abi3-manylinux_2_28_aarch64.whl.

File metadata

Download URL: faster_paddle-0.0.14-cp38-abi3-manylinux_2_28_aarch64.whl
Upload date: Jul 1, 2026
Size: 74.9 MB
Tags: CPython 3.8+, manylinux: glibc 2.28+ ARM64
Uploaded using Trusted Publishing? No
Uploaded via: maturin/1.14.1

File hashes

Hashes for faster_paddle-0.0.14-cp38-abi3-manylinux_2_28_aarch64.whl
Algorithm	Hash digest
SHA256	`9aa8c94e1ffbe277023021e35730b3d7c0c9f263d75015cf3e9d10d8667b3a63`
MD5	`587148dc787959b178d270d554038924`
BLAKE2b-256	`47581e25bc3aa5c92608c89d0f7e359c98b115ec386612acc6f23019d1900437`

See more details on using hashes here.

File details

Details for the file faster_paddle-0.0.14-cp38-abi3-macosx_11_0_arm64.whl.

File metadata

Download URL: faster_paddle-0.0.14-cp38-abi3-macosx_11_0_arm64.whl
Upload date: Jul 1, 2026
Size: 72.9 MB
Tags: CPython 3.8+, macOS 11.0+ ARM64
Uploaded using Trusted Publishing? No
Uploaded via: maturin/1.14.1

File hashes

Hashes for faster_paddle-0.0.14-cp38-abi3-macosx_11_0_arm64.whl
Algorithm	Hash digest
SHA256	`c5b10898813450e900e459d1cc41e48660291685e4c4f71b4e640aeeeb85a8a5`
MD5	`8491884297fd43e0adeecb0dc90f5d97`
BLAKE2b-256	`4fe226f0c9602a2068a2547635aa51db86848fd2888ce044643c84f5da780afe`

See more details on using hashes here.

faster-paddle 0.0.14

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

faster-paddle

Install

Usage

Optional preprocessing

Preprocess only (no OCR)

Model sizes

Result shape

structured_text

API

Parallelism (automatic, hardware-scaled)

How it works

Building from source

Tests

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distributions

File details

File metadata

File hashes

File details

File metadata

File hashes

File details

File metadata

File hashes

File details

File metadata

File hashes

File details

File metadata

File hashes

`structured_text`