An OCR tool for Traditional Chinese

Project description

Simpleocr library

Simpleocr is a Python OCR package for Traditional Chinese, based on deep learning.

The library consists of two components: text localization and text recognition.

Text localization

The model is a reimplementation of CRAFT (Character Region Awareness for Text Detection) in TensorFlow.

paper | github
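CRAFT predicts a per-pixel region score for the input image, and a common post-processing step is to threshold that score map and group the connected pixels into boxes. The sketch below illustrates only that grouping idea in plain Python. It is not simpleocr's code: the function name `boxes_from_score_map`, the `threshold` value, and the toy score map are assumptions made for illustration, and the affinity map that CRAFT also predicts is ignored.

```python
from collections import deque


def boxes_from_score_map(score_map, threshold=0.5):
    """Return (x_min, y_min, x_max, y_max) for each connected region at or above threshold."""
    height = len(score_map)
    width = len(score_map[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y in range(height):
        for x in range(width):
            if seen[y][x] or score_map[y][x] < threshold:
                continue
            # Breadth-first flood fill over 4-connected neighbours.
            queue = deque([(x, y)])
            seen[y][x] = True
            x_min = x_max = x
            y_min = y_max = y
            while queue:
                cx, cy = queue.popleft()
                x_min, x_max = min(x_min, cx), max(x_max, cx)
                y_min, y_max = min(y_min, cy), max(y_max, cy)
                for nx, ny in ((cx + 1, cy), (cx - 1, cy), (cx, cy + 1), (cx, cy - 1)):
                    if (0 <= nx < width and 0 <= ny < height
                            and not seen[ny][nx]
                            and score_map[ny][nx] >= threshold):
                        seen[ny][nx] = True
                        queue.append((nx, ny))
            boxes.append((x_min, y_min, x_max, y_max))
    return boxes


# Hypothetical 3x5 score map with two separate high-score regions.
toy_map = [
    [0.0, 0.9, 0.8, 0.0, 0.0],
    [0.0, 0.7, 0.0, 0.0, 0.6],
    [0.0, 0.0, 0.0, 0.0, 0.9],
]
boxes = boxes_from_score_map(toy_map)
print(boxes)
```

Each box could then be cropped from the image and handed to the recognition stage. A real pipeline would typically work on arrays and fit rotated rectangles rather than axis-aligned ones.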

Text recognition

The recognition model is a reimplementation based on CRNN, with the RNN layer replaced by a self-attention layer.

CRNN

paper

Self-attention

paper
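To make the substitution concrete: in a CRNN, the convolutional layers turn a text-line image into a sequence of feature vectors, and a recurrent layer then mixes information along that sequence. A self-attention layer does the mixing by letting every position take a weighted average over all positions. The following is a minimal sketch of scaled dot-product self-attention in plain Python. It is an illustration under simplifying assumptions, not the library's implementation: the learned query, key, and value projections are omitted (Q = K = V = the input), and the names `softmax`, `self_attention`, and `features` are hypothetical.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(sequence):
    """Scaled dot-product self-attention with Q = K = V = sequence (no learned weights)."""
    dim = len(sequence[0])
    scale = math.sqrt(dim)
    output = []
    for query in sequence:
        # Similarity of this position to every position in the sequence.
        scores = [sum(q * k for q, k in zip(query, key)) / scale for key in sequence]
        weights = softmax(scores)
        # Weighted average of all positions, one feature dimension at a time.
        mixed = [sum(w * value[i] for w, value in zip(weights, sequence)) for i in range(dim)]
        output.append(mixed)
    return output


# Hypothetical sequence of three 2-dimensional feature vectors.
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
attended = self_attention(features)
print(attended)
```

Unlike a recurrent layer, every output position here is computed independently of the others, so the whole sequence can be processed in parallel.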

Installation

$ pip install simpleocr

or

$ git clone https://github.com/xianyuntang/simpleocr
$ cd simpleocr
$ python setup.py install

Usage

from simpleocr import ocr
# get_text takes a list of image file paths
ocr.get_text(['image.jpg'])
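Since `get_text` takes a list, several images can be passed in one call. The sketch below gathers the image files in a folder and hands them to `ocr.get_text`. The helper names `collect_image_paths` and `run_ocr`, and the set of file suffixes, are assumptions for illustration and are not part of simpleocr. The structure of the value returned by `get_text` is not documented here, so the sketch returns it unchanged.

```python
from pathlib import Path

# Assumed set of image formats; adjust to whatever your images actually use.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def collect_image_paths(folder):
    """Return the sorted paths of image files found directly inside folder."""
    return sorted(
        str(path)
        for path in Path(folder).iterdir()
        if path.is_file() and path.suffix.lower() in IMAGE_SUFFIXES
    )


def run_ocr(folder):
    """Pass every image in folder to ocr.get_text in a single call."""
    # Imported here so collect_image_paths can be used without simpleocr installed.
    from simpleocr import ocr
    return ocr.get_text(collect_image_paths(folder))
```

With a hypothetical folder named `scans`, calling `run_ocr("scans")` would run recognition on every matching file in it.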

TODO

  1. English support
  2. GPU support

Download files

Source Distribution

simpleocr-0.0.23.tar.gz (34.5 kB)
