A ocr tool for traditional chinese
Project description
Simpleocr library
Simpleocr is a traditional chinese OCR python package that based on deep learning method.
The library consists of text localization and text recognition.
Text localization
The model is a reimplementation of CRAFT(Character-Region Awareness For Text detection) by tensorflow.
Text recognition
The reimplementation is based on CRNN model that RNN layer is replaced with self-attention layer.
CRNN
Self attention
Installation
$ pip install simpleocr
or
$ git clone https://github.com/xianyuntang/simpleocr
$ cd simpleocr
$ python setup.py install
Usage
from simpleocr import ocr
ocr.get_text(['image.jpg'])
TODO
- English support
- GPU support
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
simpleocr-0.0.23.tar.gz
(34.5 kB
view details)
File details
Details for the file simpleocr-0.0.23.tar.gz
.
File metadata
- Download URL: simpleocr-0.0.23.tar.gz
- Upload date:
- Size: 34.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 146fd8543102973281ef753b18c97cb245c4684ab100dba5eb21623479f93c8e |
|
MD5 | 6593322ed67c8dcff97f793406d1272f |
|
BLAKE2b-256 | 34a9b8f62323ecb3cf6944e3b543df3ecbc9ceaf4a7a49db18fdfa14cc3cc33c |