Skip to main content

Extract and organize screenshot text using OCR

Project description

img-to-text

Extract and organize text from screenshots with OCR.

CI Coverage Coverage 98% PyPI Python License: MIT

Features

  • OCR from screenshots using pytesseract or easyocr
  • Automatic pre-processing for better OCR quality
  • Structured extraction into labeled sections when possible
  • Full text is always preserved in output
  • Report output as Markdown, JSON, and plain text

Install

pip install img-to-text

Local development install:

pip install -e ".[dev,all-ocr]"

Usage

img-to-text extract screenshot.png
img-to-text extract screenshot.png --format all
img-to-text extract screenshot1.png screenshot2.png --stdout --format json
img-to-text raw screenshot.png --engine easyocr

Output formats

  • md: organized markdown report
  • json: structured JSON records including full_text
  • txt: plain text grouped by record
  • both: markdown + JSON (default)
  • all: markdown + JSON + text

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

img_to_text-0.1.0.tar.gz (14.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

img_to_text-0.1.0-py3-none-any.whl (8.2 kB view details)

Uploaded Python 3

File details

Details for the file img_to_text-0.1.0.tar.gz.

File metadata

  • Download URL: img_to_text-0.1.0.tar.gz
  • Upload date:
  • Size: 14.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for img_to_text-0.1.0.tar.gz
Algorithm Hash digest
SHA256 2935f6add61c4eef59816e30f39c1be2add1e355929b5508dd03eabe43afd210
MD5 955b8b4f924cd07308e5da0d47701e29
BLAKE2b-256 f100a65508be1c90c39487f07f46e9343b090ad6062c36b45a3343e34b3cd7c0

See more details on using hashes here.

Provenance

The following attestation bundles were made for img_to_text-0.1.0.tar.gz:

Publisher: release.yml on bhayanak/image-to-text

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file img_to_text-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: img_to_text-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 8.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for img_to_text-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 00ce3d084e12b26182371c9a8ab19e74f874eca6c33ae202c291272cf8965238
MD5 c8aba90601d0ef7971c769336d9ff129
BLAKE2b-256 98e430f4a2791bee296730d86351a18351ba1b111b1059c12cd406b4d1042e80

See more details on using hashes here.

Provenance

The following attestation bundles were made for img_to_text-0.1.0-py3-none-any.whl:

Publisher: release.yml on bhayanak/image-to-text

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page