An OCR tool for Traditional Chinese
Simpleocr library
Simpleocr is a Python OCR package for Traditional Chinese, based on deep learning.
The library consists of text localization and text recognition.
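The two stages can be chained as in the minimal sketch below. It is an illustration only, not simpleocr's internal code: `crop`, `run_pipeline`, and the `localize` and `recognize` callables are hypothetical names, and the image is a toy nested list rather than a real image array.

```python
from typing import Callable, List, Sequence, Tuple

# (left, top, right, bottom) in pixel coordinates; right and bottom are exclusive.
Box = Tuple[int, int, int, int]
Image = List[List[int]]  # toy grayscale image as nested lists of pixel values


def crop(image: Image, box: Box) -> Image:
    """Return the sub-image covered by box."""
    left, top, right, bottom = box
    return [row[left:right] for row in image[top:bottom]]


def run_pipeline(
    image: Image,
    localize: Callable[[Image], Sequence[Box]],
    recognize: Callable[[Image], str],
) -> List[Tuple[Box, str]]:
    """Stage 1 finds text boxes; stage 2 reads the crop of each box."""
    results = []
    for box in localize(image):
        results.append((box, recognize(crop(image, box))))
    return results
```

Keeping the stages behind plain callables means either model can be swapped or stubbed out independently.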
Text localization
The model is a TensorFlow reimplementation of CRAFT (Character Region Awareness for Text Detection).
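CRAFT predicts per-pixel score maps, and text boxes are then derived from them by thresholding and grouping connected pixels. The sketch below shows a simplified version of that idea in plain Python. It is an assumption-laden illustration, not the code used by simpleocr: the function name `boxes_from_score_map` and the `0.5` threshold are hypothetical, only a single score map is used (the original method also uses an affinity map), and the boxes are axis-aligned.

```python
from collections import deque
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom), inclusive pixel indices


def boxes_from_score_map(scores: List[List[float]], threshold: float = 0.5) -> List[Box]:
    """Group above-threshold pixels into 4-connected blobs and box each blob."""
    height = len(scores)
    width = len(scores[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y in range(height):
        for x in range(width):
            if seen[y][x] or scores[y][x] < threshold:
                continue
            # Breadth-first flood fill starting from this unvisited text pixel.
            left, top, right, bottom = x, y, x, y
            queue = deque([(x, y)])
            seen[y][x] = True
            while queue:
                cx, cy = queue.popleft()
                left, right = min(left, cx), max(right, cx)
                top, bottom = min(top, cy), max(bottom, cy)
                for nx, ny in ((cx + 1, cy), (cx - 1, cy), (cx, cy + 1), (cx, cy - 1)):
                    inside = 0 <= nx < width and 0 <= ny < height
                    if inside and not seen[ny][nx] and scores[ny][nx] >= threshold:
                        seen[ny][nx] = True
                        queue.append((nx, ny))
            boxes.append((left, top, right, bottom))
    return boxes
```

Boxes are returned in the order their first pixel is met during a row-by-row scan.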
Text recognition
The recognition model is based on CRNN, with the RNN layer replaced by a self-attention layer.
CRNN
Self attention
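In a standard CRNN, a CNN turns the image into a sequence of column features and a recurrent layer models context along that sequence. A self-attention layer can take the recurrent layer's place by letting every time step weight every other time step directly. The sketch below shows single-head scaled dot-product self-attention in plain Python. It is a generic illustration, not this package's TensorFlow layer: the function names and the weight matrices `w_q`, `w_k`, `w_v` are assumptions made for the example.

```python
import math
from typing import List

Matrix = List[List[float]]


def softmax(values: List[float]) -> List[float]:
    """Numerically stable softmax over one row of scores."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain matrix product of two non-empty nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def self_attention(features: Matrix, w_q: Matrix, w_k: Matrix, w_v: Matrix) -> Matrix:
    """Scaled dot-product self-attention over a sequence of feature vectors.

    features holds one row per time step (one image column after the CNN).
    """
    queries = matmul(features, w_q)
    keys = matmul(features, w_k)
    values = matmul(features, w_v)
    scale = math.sqrt(len(keys[0]))
    keys_t = [list(col) for col in zip(*keys)]
    scores = matmul(queries, keys_t)
    # Each row of weights sums to 1 and says how much a step attends to the others.
    weights = [softmax([s / scale for s in row]) for row in scores]
    return matmul(weights, values)
```

Unlike a recurrent layer, every output row here is computed from the whole sequence at once, so the time steps can be processed in parallel.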
Installation
$ pip install simpleocr
or
$ git clone https://github.com/xianyuntang/simpleocr
$ cd simpleocr
$ python setup.py install
Usage
from simpleocr import ocr

# get_text takes a list of image paths.
ocr.get_text(['image.jpg'])
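Since `get_text` takes a list of paths, a whole folder can be processed in one call. The helper below is a sketch under assumptions: `collect_image_paths`, `ocr_directory`, and the set of file suffixes are hypothetical and not part of simpleocr, and the return format of `get_text` is not documented here, so the result is passed through unchanged.

```python
from pathlib import Path
from typing import List

# Assumed image formats; the README only shows a .jpg example.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def collect_image_paths(directory: str) -> List[str]:
    """Return the sorted paths of image files directly inside directory."""
    return sorted(
        str(path)
        for path in Path(directory).iterdir()
        if path.is_file() and path.suffix.lower() in IMAGE_SUFFIXES
    )


def ocr_directory(directory: str):
    """Run simpleocr on every image in directory (requires simpleocr installed)."""
    # Imported lazily so collect_image_paths works without the package.
    from simpleocr import ocr

    return ocr.get_text(collect_image_paths(directory))
```

For example, `ocr_directory('scans')` would pass every `.jpg`, `.jpeg`, and `.png` file in a `scans` folder to `ocr.get_text`.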
TODO
- English support
- GPU support
Source Distribution
simpleocr-0.0.8.tar.gz (6.5 kB)
File details
Details for the file simpleocr-0.0.8.tar.gz.
File metadata
- Download URL: simpleocr-0.0.8.tar.gz
- Upload date:
- Size: 6.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.6
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 45ccdd9868ff3a880d2f5c29146630ab51a9ba03671510f4a70a6058ecbc22c4 |
| MD5 | 72e59330efa757218a55d608e7e1daf9 |
| BLAKE2b-256 | 84e65d535b01cd044ddbf1b352e71c0c01a5b05b338c0cbd1b9f07eb97153285 |
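A downloaded archive can be checked against the SHA256 digest listed above. The snippet below is a minimal sketch using only the standard library. The helper names `sha256_of_file` and `matches_published_hash` are hypothetical, and the local file path is whatever location the archive was saved to.

```python
import hashlib

# SHA256 digest published for simpleocr-0.0.8.tar.gz.
EXPECTED_SHA256 = "45ccdd9868ff3a880d2f5c29146630ab51a9ba03671510f4a70a6058ecbc22c4"


def sha256_of_file(path: str, chunk_size: int = 1 << 16) -> str:
    """Hash a file in chunks so large archives need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str) -> bool:
    """Return True if the file at path has the published SHA256 digest."""
    return sha256_of_file(path) == EXPECTED_SHA256
```

For example, `matches_published_hash('simpleocr-0.0.8.tar.gz')` should return `True` for an intact download.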