Text extractor for images

Project description

pytextractor

Python OCR using Tesseract with the EAST OpenCV text detector

Uses the EAST OpenCV text detector together with pytesseract to extract text (the default) or numbers from images.

usage: text_detection.py [-h] [--east EAST] [-c CONFIDENCE] [-w WIDTH]
                         [-e HEIGHT] [-d] [-n] [-p PERCENTAGE] [-b MIN_BOXES]
                         [-i MAX_ITERATIONS]
                         images [images ...]

Text/Number extractor from image

positional arguments:
  images                path(s) to input image(s)

optional arguments:
  -h, --help            show this help message and exit
  --east EAST           path to input EAST text detector
  -c CONFIDENCE, --confidence CONFIDENCE
                        minimum probability required to inspect a region
  -w WIDTH, --width WIDTH
                        resized image width (should be multiple of 32)
  -e HEIGHT, --height HEIGHT
                        resized image height (should be multiple of 32)
  -d, --display         Display bounding boxes
  -n, --numbers         Detect only numbers
  -p PERCENTAGE, --percentage PERCENTAGE
                        Expand/shrink detected bound box
  -b MIN_BOXES, --min-boxes MIN_BOXES
                        minimum number of detected boxes to return
  -i MAX_ITERATIONS, --max-iterations MAX_ITERATIONS
                        max number of iterations finding min_boxes
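The help text above can be approximated with a small argparse sketch. This is not the package's actual source: the argument types and the absence of defaults are assumptions, since the help output does not state them, and `round_to_multiple_of_32` is a hypothetical helper illustrating the "should be multiple of 32" hint for width and height.

```python
import argparse


def build_parser():
    """Rebuild the documented CLI. Types and defaults are assumptions."""
    parser = argparse.ArgumentParser(
        prog="text_detection.py",
        description="Text/Number extractor from image",
    )
    parser.add_argument("images", nargs="+", help="path(s) to input image(s)")
    parser.add_argument("--east", help="path to input EAST text detector")
    parser.add_argument("-c", "--confidence", type=float,
                        help="minimum probability required to inspect a region")
    parser.add_argument("-w", "--width", type=int,
                        help="resized image width (should be multiple of 32)")
    # -h is taken by --help, so the documented short flag for height is -e
    parser.add_argument("-e", "--height", type=int,
                        help="resized image height (should be multiple of 32)")
    parser.add_argument("-d", "--display", action="store_true",
                        help="Display bounding boxes")
    parser.add_argument("-n", "--numbers", action="store_true",
                        help="Detect only numbers")
    parser.add_argument("-p", "--percentage", type=float,
                        help="Expand/shrink detected bound box")
    parser.add_argument("-b", "--min-boxes", type=int,
                        help="minimum number of detected boxes to return")
    parser.add_argument("-i", "--max-iterations", type=int,
                        help="max number of iterations finding min_boxes")
    return parser


def round_to_multiple_of_32(value):
    """Hypothetical helper: nearest multiple of 32 (ties round up), at least 32."""
    return max(32, ((value + 16) // 32) * 32)


# Example: digits only, with a resize target snapped to multiples of 32
args = build_parser().parse_args(["-n", "-w", "320", "-e", "640", "receipt.png"])
width = round_to_multiple_of_32(args.width)
height = round_to_multiple_of_32(args.height)
```

Passing an explicit list to `parse_args` keeps the sketch independent of the real command line; the file name `receipt.png` is a placeholder.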

Download files

Source Distribution

pytextractor-0.1.tar.gz (754.4 kB)

Uploaded Source

Built Distribution

pytextractor-0.1-py3-none-any.whl (761.2 kB)

Uploaded Python 3

File details

Details for the file pytextractor-0.1.tar.gz.

File metadata

  • Download URL: pytextractor-0.1.tar.gz
  • Size: 754.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.0

File hashes

Hashes for pytextractor-0.1.tar.gz
Algorithm Hash digest
SHA256 7bcfdc4421d2615358cce0dbdf913120893b4c2e74d76e3942b05b7bc85e6a7f
MD5 7c3ff81a6d6541ea2cb26a1b6fda9697
BLAKE2b-256 2154ada78ce7fba0aca27e703700b1a10672a9c083dfd04112bf88dfee10d1cf
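A downloaded archive can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the standard library; the function names are illustrative, and it assumes the archive has already been saved locally.

```python
import hashlib

# SHA256 digest listed above for pytextractor-0.1.tar.gz
EXPECTED_SHA256 = "7bcfdc4421d2615358cce0dbdf913120893b4c2e74d76e3942b05b7bc85e6a7f"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """Compare a file's digest with the expected one, ignoring letter case."""
    return sha256_of(path) == expected.lower()
```

For example, `matches_published_hash("pytextractor-0.1.tar.gz")` should return `True` only if the local file is byte-for-byte identical to the upload described here.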

File details

Details for the file pytextractor-0.1-py3-none-any.whl.

File metadata

  • Download URL: pytextractor-0.1-py3-none-any.whl
  • Size: 761.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.0

File hashes

Hashes for pytextractor-0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 f17d75907895498d9968d9a8a2d83f4f8ac0a13dd5edb2fe2a30c0d8e5450576
MD5 a3c9c5c9edef52b0b54d35018f47ef4c
BLAKE2b-256 02e63a9195694ab36866f37c70e882d3fb5dbf712744ff1243931725167ee3d4
