An OCR tool for Traditional Chinese
Project description
Simpleocr library
Simpleocr is a Python OCR package for Traditional Chinese, based on deep learning.
The library consists of two stages: text localization and text recognition.
Text localization
The model is a TensorFlow reimplementation of CRAFT (Character Region Awareness for Text Detection).
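CRAFT-style detectors predict a per-pixel score map, and text boxes are then recovered by thresholding that map and grouping connected pixels. The following is a minimal sketch of that post-processing idea only, in pure Python. It is not simpleocr's code: the function name `boxes_from_score_map` and the `threshold` default are hypothetical, and the original CRAFT method also uses an affinity map to link characters, which is omitted here.

```python
from collections import deque


def boxes_from_score_map(score_map, threshold=0.5):
    """Return inclusive (top, left, bottom, right) boxes, one per
    4-connected region whose scores are >= threshold.

    score_map is a non-empty list of equal-length rows of floats.
    """
    rows, cols = len(score_map), len(score_map[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if seen[r][c] or score_map[r][c] < threshold:
                continue
            # Breadth-first flood fill from this unvisited text pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and not seen[ny][nx]
                            and score_map[ny][nx] >= threshold):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes
```

Boxes are returned in the order their first pixel is met in a row-by-row scan. A real pipeline would typically work on arrays and scale the boxes back to the input image size.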
Text recognition
The recognition model is a reimplementation based on CRNN, with the RNN layer replaced by a self-attention layer.
CRNN
Self attention
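To illustrate what replacing the RNN layer with self-attention means, here is a minimal sketch of single-head scaled dot-product self-attention over a sequence of feature vectors (for example, the column features produced by the convolutional part of a CRNN). It is a simplification and not the library's implementation: it is written in pure Python rather than TensorFlow, it omits the learned query, key, and value projections, and the function names are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(sequence):
    """Apply scaled dot-product self-attention.

    sequence is a list of T feature vectors, each a list of d floats.
    Queries, keys, and values are all the input itself (no learned
    projections, which is an assumption made to keep the sketch short).
    """
    d = len(sequence[0])
    scale = math.sqrt(d)
    output = []
    for query in sequence:
        # Similarity of this position to every position in the sequence.
        scores = [sum(q * k for q, k in zip(query, key)) / scale
                  for key in sequence]
        weights = softmax(scores)
        # Weighted average of all positions' features.
        mixed = [sum(w * value[i] for w, value in zip(weights, sequence))
                 for i in range(d)]
        output.append(mixed)
    return output
```

Unlike an RNN, which passes information step by step along the sequence, each output position here is computed directly from all input positions, so the positions can be processed in parallel.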
Installation
$ pip install simpleocr
or
$ git clone https://github.com/xianyuntang/simpleocr
$ cd simpleocr
$ python setup.py install
Usage
from simpleocr import ocr
ocr.get_text(['image.jpg'])
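Since `ocr.get_text` takes a list of image paths, a small wrapper can filter a mixed list of files before passing it on. This is a hedged sketch: `filter_images`, `run_ocr`, and the set of accepted extensions are hypothetical names and assumptions, and the return format of `get_text` is not documented here, so the result is passed through unchanged.

```python
from pathlib import Path

# Assumed set of image extensions; the formats simpleocr actually
# accepts are not stated in this description.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def filter_images(paths):
    """Keep only paths with an image-like extension, preserving order."""
    return [str(p) for p in paths
            if Path(p).suffix.lower() in IMAGE_EXTENSIONS]


def run_ocr(paths):
    """Run simpleocr on the image files found in paths."""
    # Imported lazily so filter_images works without simpleocr installed.
    from simpleocr import ocr
    # Same call as in the usage above: a list of image paths.
    return ocr.get_text(filter_images(paths))
```

For example, `run_ocr(['image.jpg', 'notes.txt'])` would call `ocr.get_text(['image.jpg'])`.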
TODO
- English support
- GPU support
Source distribution: simpleocr-0.0.23.tar.gz (34.5 kB)