Unified interface to Google Vision, AWS Textract, Azure, Tesseract OCR, and EasyOCR tools.
Project description
ocrpy
Unified interface to Google Vision, AWS Textract, Azure, and Tesseract OCR tools.
Installation
pip install ocrpy
Sample Usage
from ocrpy import TextOcrPipeline

# Run a pipeline from a YAML config file.
ocr_pipeline = TextOcrPipeline.from_config("ocrpy_config.yaml")
ocr_pipeline.process()

# Alternatively, construct the pipeline directly:
pipeline = TextOcrPipeline(
    source_dir="s3://document_bucket/",
    destination_dir="gs://processed_document_bucket/outputs/",
    parser_backend="aws-textract",
    credentials={
        "AWS": "path/to/aws-credentials.env/file",
        "GCP": "path/to/gcp-credentials.json/file",
    },
)
pipeline.process()
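The direct-construction example reads from an s3:// location and writes to a gs:// location, so it needs credentials for both providers. The following is a minimal sketch of a pre-flight check you might run before building the pipeline. It is not part of ocrpy: the helper names `required_credentials` and `missing_credentials` and the scheme-to-key mapping are assumptions made for illustration, based only on the "AWS" and "GCP" keys shown above.

```python
from urllib.parse import urlparse

# Hypothetical mapping (not from ocrpy): storage URI scheme -> credentials key.
SCHEME_TO_PROVIDER = {"s3": "AWS", "gs": "GCP"}


def required_credentials(source_dir, destination_dir):
    """Return the set of credential keys implied by the two storage URIs."""
    providers = set()
    for uri in (source_dir, destination_dir):
        scheme = urlparse(uri).scheme
        # Local paths have an empty scheme and need no cloud credentials.
        if scheme in SCHEME_TO_PROVIDER:
            providers.add(SCHEME_TO_PROVIDER[scheme])
    return providers


def missing_credentials(source_dir, destination_dir, credentials):
    """Return a sorted list of credential keys that are required but absent."""
    needed = required_credentials(source_dir, destination_dir)
    return sorted(needed - set(credentials))


missing = missing_credentials(
    "s3://document_bucket/",
    "gs://processed_document_bucket/outputs/",
    {"AWS": "path/to/aws-credentials.env/file"},
)
if missing:
    print("Missing credentials for:", ", ".join(missing))
```

This check only inspects the storage URIs. A cloud parser backend such as 'aws-textract' presumably needs its provider's credentials as well, even when both directories are local, so treat the result as a lower bound.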
Download files
Source Distribution
ocrpy-0.3.7.tar.gz (43.2 MB)
Built Distribution
ocrpy-0.3.7-py3-none-any.whl (22.5 kB)