text extractor from images
Project description
pytextractor
Python OCR using Tesseract with the EAST OpenCV text detector.
Uses the EAST OpenCV text detector together with pytesseract to extract text (the default) or numbers from images.
Usage main
usage: text_detection.py [-h] [--east EAST] [-c CONFIDENCE] [-w WIDTH]
                         [-e HEIGHT] [-d] [-n] [-p PERCENTAGE] [-b MIN_BOXES]
                         [-i MAX_ITERATIONS]
                         images [images ...]

Text/Number extractor from image

positional arguments:
  images                path(s) to input image(s)

optional arguments:
  -h, --help            show this help message and exit
  --east EAST           path to input EAST text detector
  -c CONFIDENCE, --confidence CONFIDENCE
                        minimum probability required to inspect a region
  -w WIDTH, --width WIDTH
                        resized image width (should be multiple of 32)
  -e HEIGHT, --height HEIGHT
                        resized image height (should be multiple of 32)
  -d, --display         Display bounding boxes
  -n, --numbers         Detect only numbers
  -p PERCENTAGE, --percentage PERCENTAGE
                        Expand/shrink detected bound box
  -b MIN_BOXES, --min-boxes MIN_BOXES
                        minimum number of detected boxes to return
  -i MAX_ITERATIONS, --max-iterations MAX_ITERATIONS
                        max number of iterations finding min_boxes
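The help text for -w and -e says the resized dimensions should be multiples of 32. As a minimal sketch, a caller could snap an arbitrary size to the nearest valid value before passing it on the command line. The helper name `nearest_multiple_of_32` is hypothetical and is not part of pytextractor.

```python
def nearest_multiple_of_32(size: int) -> int:
    """Round a positive pixel size to the nearest multiple of 32 (minimum 32).

    Hypothetical helper for illustration; not part of pytextractor.
    """
    if size <= 0:
        raise ValueError("size must be positive")
    # Adding half the step (16) before integer division rounds to nearest.
    return max(32, ((size + 16) // 32) * 32)


if __name__ == "__main__":
    for requested in (300, 320, 1000):
        print(requested, "->", nearest_multiple_of_32(requested))
```

The result can then be supplied as the WIDTH or HEIGHT value for -w or -e.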
Usage lib
from pytextractor import pytextractor
extractor = pytextractor.PyTextractor()
Download files
Source Distribution
pytextractor-0.7.tar.gz (758.8 kB)
Built Distribution
Hashes for pytextractor-0.7-py3-none-any.whl
Algorithm | Hash digest |
---|---|
SHA256 | 3dd9890ac3df1c1c55067781732d945a476004a5c83b45c1a4a4eb68889f7f83 |
MD5 | 0bbb7143d1868cd5d20bff466efdd2c0 |
BLAKE2b-256 | cb860f1758f7419fb07e840d8854e20234e69255b30029ed85382f2d7831ccae |
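
The SHA256 digest listed above can be checked against a downloaded copy of the wheel before installing it. The following is a minimal sketch using only the standard library `hashlib` module. The helper name `sha256_of_file` and the local file path are assumptions made for illustration.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed for pytextractor-0.7-py3-none-any.whl
EXPECTED_SHA256 = "3dd9890ac3df1c1c55067781732d945a476004a5c83b45c1a4a4eb68889f7f83"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    # Assumed location of the downloaded wheel; adjust as needed.
    wheel = Path("pytextractor-0.7-py3-none-any.whl")
    if wheel.exists():
        matches = sha256_of_file(wheel) == EXPECTED_SHA256
        print("hash matches" if matches else "hash mismatch")
    else:
        print(f"{wheel} not found; download it first")
```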