Skip to main content

A ocr tool for traditional chinese

Project description

Simpleocr library

Simpleocr is a traditional chinese OCR python package that based on deep learning method.

The library consists of text localization and text recognition.

Text localization

The model is a reimplementation of CRAFT(Character-Region Awareness For Text detection) by tensorflow.

paper | github

Text recognition

The reimplementation is based on CRNN model that RNN layer is replaced with self-attention layer.

CRNN

paper

Self attention

paper

Installation

$ pip install simpleocr

or

$ git clone https://github.com/xianyuntang/simpleocr
$ cd simpleocr
$ python setup.py install

Usage

from simpleocr import ocr
ocr.get_text(['image.jpg'])

TODO

  1. English support
  2. GPU support

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

simpleocr-0.0.23.tar.gz (34.5 kB view details)

Uploaded Source

File details

Details for the file simpleocr-0.0.23.tar.gz.

File metadata

  • Download URL: simpleocr-0.0.23.tar.gz
  • Upload date:
  • Size: 34.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.6

File hashes

Hashes for simpleocr-0.0.23.tar.gz
Algorithm Hash digest
SHA256 146fd8543102973281ef753b18c97cb245c4684ab100dba5eb21623479f93c8e
MD5 6593322ed67c8dcff97f793406d1272f
BLAKE2b-256 34a9b8f62323ecb3cf6944e3b543df3ecbc9ceaf4a7a49db18fdfa14cc3cc33c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page