Skip to main content

A ocr tool for traditional chinese

Project description

Simpleocr library

Simpleocr is a traditional chinese OCR python package that based on deep learning method.

The library consists of text localization and text recognition.

Text localization

The model is a reimplementation of CRAFT(Character-Region Awareness For Text detection) by tensorflow.

paper | github

Text recognition

The reimplementation is based on CRNN model that RNN layer is replaced with self-attention layer.

CRNN

paper

Self attention

paper

Installation

$ pip install simpleocr

or

$ git clone https://github.com/xianyuntang/simpleocr
$ cd simpleocr
$ python setup.py install

Usage

from simpleocr import ocr
ocr.get_text(['image.jpg'])

TODO

  1. English support
  2. GPU support

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

simpleocr-0.0.8.tar.gz (6.5 kB view details)

Uploaded Source

File details

Details for the file simpleocr-0.0.8.tar.gz.

File metadata

  • Download URL: simpleocr-0.0.8.tar.gz
  • Upload date:
  • Size: 6.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.6

File hashes

Hashes for simpleocr-0.0.8.tar.gz
Algorithm Hash digest
SHA256 45ccdd9868ff3a880d2f5c29146630ab51a9ba03671510f4a70a6058ecbc22c4
MD5 72e59330efa757218a55d608e7e1daf9
BLAKE2b-256 84e65d535b01cd044ddbf1b352e71c0c01a5b05b338c0cbd1b9f07eb97153285

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page