OCR-Ops
Project description
ocr-ops
OCR-Ops is infrastructure to perform Optimal Character Recognition (OCR) at scale on a large number of images and videos. Built on top of the algo-ops framework, OCR-Ops is modular and extensible in its data processing operations.
Key Features:
- Supports building an OCRPipeline that can utilize multiple popular OCR annotation methods (e.g. PyTesseract, EasyOCR, etc.) and return the results in structured and efficient fashion within a unified framework.
- Enables multi-levels of information of the OCR application (e.g. text-only, bounding boxes, etc.)
- Allows definition of an image pre-processing pipeline (before OCR) and a text-cleaning pipeline (after OCR) of detected but noisy text to enable optimal and robust OCR performance.
- Supports several nice presets that are plug-and-play for the above purpose!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
ocr_ops-0.0.0.4.3.2.tar.gz
(453.5 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ocr_ops-0.0.0.4.3.2.tar.gz.
File metadata
- Download URL: ocr_ops-0.0.0.4.3.2.tar.gz
- Upload date:
- Size: 453.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: python-httpx/0.26.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
97e5b36cb91aa6a882fe3b9446f6cd7f7d18f1c177cc88d6913246012e5a230f
|
|
| MD5 |
498678878a2451be75b82b94a3447743
|
|
| BLAKE2b-256 |
61fe0854bb60e1ebcd7c1f8b0c75e958a15c006089681cc1a70381d3b4719532
|
File details
Details for the file ocr_ops-0.0.0.4.3.2-py2.py3-none-any.whl.
File metadata
- Download URL: ocr_ops-0.0.0.4.3.2-py2.py3-none-any.whl
- Upload date:
- Size: 460.1 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: python-httpx/0.26.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3e913a0920ee8d6a49521491b9458f52ed90c048684eb0ff22437ac2f41012fd
|
|
| MD5 |
27fa3613e0de58e957c4bebd3a5a6395
|
|
| BLAKE2b-256 |
bd62e47b2ce30aa39ea69ca722fd14a1f582b0ab055fa20fc570204ce50ba8ac
|