Skip to main content

Finding text in photos

Project description

Scene Text Detection and Recognition

This project packages scene text algorithms for easy usage. Scenes as in photos are much harder for text detection and recognition than doing the same for scanned documents. The latter is mostly referred to as OCR and it is a well solved problem. Note that due to the usage of Deep Learning algorithms, text detection and recognition are rather slow, in particular on CPU-only machines and for images with a large number of words. Accuracies have reached an impressive level but are still below human performance. Currently, the following algorithms are available.

EAST is used for detection, where the implementation is mostly copied from Jan Zdenek.

MORAN is used for recognition, where the implementation is mostly copied from Canjie Luo.

Installation:

pip install scene_text

Basic usage from Python:

import cv2

# The first import after installation can take a little longer as it downloads
# pre-trained models and compiles some native code.
from scene_text import AllWordsRecognizer

# initialize models
pipeline = AllWordsRecognizer()

# read an image
img = cv2.imread('path/to/my/image/file')[:, :, ::-1]

# detect and recognize all words in the image
words, boxes = pipeline.get_all_words(img)

Basic usage from command line:

scene_text my/input/image/folder my/output/folder

If you have trouble with the complex dependencies try Docker:

docker build -t scene_text .

docker run --rm -i -t -v ${PWD}:/scene_text scene_text

# ...

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scene-text-0.2.3.tar.gz (184.3 kB view details)

Uploaded Source

File details

Details for the file scene-text-0.2.3.tar.gz.

File metadata

  • Download URL: scene-text-0.2.3.tar.gz
  • Upload date:
  • Size: 184.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.4.2 requests/2.20.1 setuptools/40.7.3 requests-toolbelt/0.9.1 tqdm/4.28.1 CPython/3.6.8

File hashes

Hashes for scene-text-0.2.3.tar.gz
Algorithm Hash digest
SHA256 65cd2386c0f5d6baa7136292f0f4912f66460f9cc70fc3119ecfd28e9db6f685
MD5 5cfb1f6a0c4339a0f6b1e0ad2109062e
BLAKE2b-256 15bd79bbc079c0f1006b17867507fd3969c3ec4cc977bfe386cd6d52185e0c3a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page