Text extractor for images

Project description

pytextractor

Python OCR using Tesseract with the EAST OpenCV text detector

Uses the OpenCV EAST text detector together with pytesseract to extract text (the default) or numbers only from images.

Usage (command line)

usage: text_detection.py [-h] [--east EAST] [-c CONFIDENCE] [-w WIDTH]
                         [-e HEIGHT] [-d] [-n] [-p PERCENTAGE] [-b MIN_BOXES]
                         [-i MAX_ITERATIONS]
                         images [images ...]

Text/Number extractor from image

positional arguments:
  images                path(s) to input image(s)

optional arguments:
  -h, --help            show this help message and exit
  --east EAST           path to input EAST text detector
  -c CONFIDENCE, --confidence CONFIDENCE
                        minimum probability required to inspect a region
  -w WIDTH, --width WIDTH
                        resized image width (should be multiple of 32)
  -e HEIGHT, --height HEIGHT
                        resized image height (should be multiple of 32)
  -d, --display         Display bounding boxes
  -n, --numbers         Detect only numbers
  -p PERCENTAGE, --percentage PERCENTAGE
                        Expand/shrink detected bound box
  -b MIN_BOXES, --min-boxes MIN_BOXES
                        minimum number of detected boxes to return
  -i MAX_ITERATIONS, --max-iterations MAX_ITERATIONS
                        max number of iterations finding min_boxes
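
The help text above is enough to reconstruct the shape of the command-line interface. The following is a minimal sketch of an argparse parser that accepts the same flags; it is not taken from the package's source. The argument types (float for CONFIDENCE and PERCENTAGE, int for the sizes and counts) are assumptions, defaults are left unset because the help text does not list them, and nearest_multiple_of_32 is a hypothetical helper for the "should be multiple of 32" note.

```python
import argparse


def nearest_multiple_of_32(value: int) -> int:
    """Round a size to the nearest multiple of 32, never going below 32.

    Hypothetical helper: the help text only says width and height
    "should be multiple of 32"; it does not say how to adjust them.
    """
    return max(32, round(value / 32) * 32)


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the flags shown in the usage output."""
    parser = argparse.ArgumentParser(
        prog="text_detection.py",
        description="Text/Number extractor from image",
    )
    parser.add_argument("images", nargs="+", help="path(s) to input image(s)")
    parser.add_argument("--east", help="path to input EAST text detector")
    # Types below are assumptions; the help output does not state them.
    parser.add_argument("-c", "--confidence", type=float,
                        help="minimum probability required to inspect a region")
    parser.add_argument("-w", "--width", type=int,
                        help="resized image width (should be multiple of 32)")
    parser.add_argument("-e", "--height", type=int,
                        help="resized image height (should be multiple of 32)")
    parser.add_argument("-d", "--display", action="store_true",
                        help="Display bounding boxes")
    parser.add_argument("-n", "--numbers", action="store_true",
                        help="Detect only numbers")
    parser.add_argument("-p", "--percentage", type=float,
                        help="Expand/shrink detected bound box")
    parser.add_argument("-b", "--min-boxes", type=int,
                        help="minimum number of detected boxes to return")
    parser.add_argument("-i", "--max-iterations", type=int,
                        help="max number of iterations finding min_boxes")
    return parser
```

Calling build_parser().parse_args(["-n", "-w", "320", "-e", "320", "a.png", "b.png"]) yields a namespace with numbers set to True and both image paths collected in images. Options that are not passed come back as None in this sketch, whereas the real script presumably supplies its own defaults.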

Usage (library)

from pytextractor import pytextractor

extractor = pytextractor.PyTextractor()

Download files

Source Distribution

pytextractor-0.7.tar.gz (758.8 kB)

Uploaded Source

Built Distribution

pytextractor-0.7-py3-none-any.whl (7.3 kB)

Uploaded Python 3

File details

Details for the file pytextractor-0.7.tar.gz.

File metadata

  • Download URL: pytextractor-0.7.tar.gz
  • Upload date:
  • Size: 758.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.4

File hashes

Hashes for pytextractor-0.7.tar.gz

Algorithm    Hash digest
SHA256       e6285c380fa0505be355e2a423af60132d4034a58f22c52d6083ce19bf1d9dba
MD5          05c0433508bec0a279cf97c79d0c6375
BLAKE2b-256  7ea2b57372703125d494d092ef058f1905eacd23ee4262fc0bc1df03318b3386
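
A downloaded archive can be checked against the published SHA256 digest before installing it. The snippet below is a small sketch using only the standard library's hashlib. The function names sha256_of and matches_expected are illustrative, and the local path in the usage note is an assumption about where the file was saved.

```python
import hashlib

# SHA256 digest published for pytextractor-0.7.tar.gz (copied from the table).
EXPECTED_SDIST_SHA256 = (
    "e6285c380fa0505be355e2a423af60132d4034a58f22c52d6083ce19bf1d9dba"
)


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops once read() returns an empty bytes object.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Compare a file's digest with an expected hex string, ignoring case."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, matches_expected("pytextractor-0.7.tar.gz", EXPECTED_SDIST_SHA256) should return True for an intact copy of the source distribution saved in the current directory. The same approach works for the wheel with its own digest.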

File details

Details for the file pytextractor-0.7-py3-none-any.whl.

File metadata

  • Download URL: pytextractor-0.7-py3-none-any.whl
  • Upload date:
  • Size: 7.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.4

File hashes

Hashes for pytextractor-0.7-py3-none-any.whl

Algorithm    Hash digest
SHA256       3dd9890ac3df1c1c55067781732d945a476004a5c83b45c1a4a4eb68889f7f83
MD5          0bbb7143d1868cd5d20bff466efdd2c0
BLAKE2b-256  cb860f1758f7419fb07e840d8854e20234e69255b30029ed85382f2d7831ccae
