A cross-platform framework for deep learning based text detection, recoginition and parsing
Project description
deeptext
A cross-platform framework for deep learning based text detection, recoginition and parsing
Getting started
Installation
- Install using conda for Linux, Mac and Windows (preferred):
conda install -c fcakyon deeptext
- Install using pip for Linux and Mac:
pip install deeptext
Install teserract-ocr for text recognition.
Basic Usage
# import package
import deeptext
# set image path and export folder directory
image_path = 'idcard.png'
output_dir = 'outputs/'
# apply text detection and export detected regions to output directory
detection_result = deeptext.detect_text(image_path, output_dir)
# apply text recognition to detected texts
recognition_result = deeptext.recognize_text(image_path=detection_result["text_crop_paths"])
Advanced Usage
You can pass filter parameters if you want to scrap texts from image by predefined regions.
# import package
import deeptext
# set image path and export folder directory
image_path = 'idcard.png'
output_dir = 'outputs/'
# define regions that you want to scrap, by quad (box) points
filter_params = {"type": "box"
"boxes": [[[0.1460 , 0.0395],
[0.8417, 0.0535],
[0.8412, 0.1099],
[0.1455, 0.0959]],
[[0.3467, 0.3398],
[0.5417, 0.3535],
[0.5412, 0.4099],
[0.3455, 0.3959]]],
"marigin_x": 0.05,
"marigin_y": 0.05,
"min_intersection_ratio": 0.9}
# or define regions that you want to scrap, by centroids
filter_params = {"type":"centroid",
"centers": [[0.44, 0.49],[0.49, 0.08]],
"marigin_x": 0.03,
"marigin_y": 0.05}
# apply craft text detection in predefined regions and export detected regions to output directory
detection_result = deeptext.detect_text(image_path,
output_dir,
detector="craft",
filter_params=filter_params)
# apply tesseract (eng) text recognition to detected texts
recognition_result = deeptext.recognize_text(image_path=detection_result["text_crop_paths"],
recognizer="tesseract-eng")
Updates
6 April, 2020: Conda package release
3 April, 2020: Tesseract text recoginition and positional text scraping support
30 March, 2020: Craft text detector support
TODO
- Craft text detection (inference)
- Ctpn text detection (inference)
- Psenet text detection (inference)
- Tesseract text recoginition (inference)
- Aster text recognition (training and inference)
- Moran text recognition (training and inference)
- Positional text scraping
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file deeptext-0.1.3.tar.gz.
File metadata
- Download URL: deeptext-0.1.3.tar.gz
- Upload date:
- Size: 6.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.8.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
42a1f8a878264b5acf7df1739f3ac22a20394e052d71d5a68cd608e9a3262f14
|
|
| MD5 |
439cc1503baacd8d57746a0fe09b0d0a
|
|
| BLAKE2b-256 |
a2aa24c6db61bef1018c4dd165444ed095693dbcf3c71db480013db7061d7dda
|
File details
Details for the file deeptext-0.1.3-py3-none-any.whl.
File metadata
- Download URL: deeptext-0.1.3-py3-none-any.whl
- Upload date:
- Size: 7.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.8.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
db13ef88fecfff2f29f00077c3197b8d6a44c7b88d1b19c425fff5b384e40c6c
|
|
| MD5 |
a98e238c4cbc20f7102fec5541571998
|
|
| BLAKE2b-256 |
47177900bb796b0ecbbb115ef4c33c047da5ac365d01f23b6bf9172417904c36
|