Package to make sense of images with text information
Project description
NLPKnowledge
Installation instructions
Install package:
pip install NLPKnowledge
Install basic NLP model:
python -m spacy download en_core_web_sm
Install tesseract (Mac OS), or see https://github.com/tesseract-ocr/tesseract/wiki for instructions for your platform:
brew install tesseract
Check the path where language models should be installed:
brew list tesseract
The output should include a path like "/usr/local/Cellar/tesseract/4.0.0/share/tessdata/" (the version number may differ on your machine).
Install language models to tessdata:
wget -P /usr/local/Cellar/tesseract/4.0.0/share/tessdata/ https://github.com/tesseract-ocr/tessdata/raw/4.00/eng.traineddata
wget -P /usr/local/Cellar/tesseract/4.0.0/share/tessdata/ https://github.com/tesseract-ocr/tessdata/raw/4.00/rus.traineddata
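After downloading, it can be useful to confirm that the model files actually landed in the tessdata directory. The following is a minimal sketch using only the Python standard library; the helper name `missing_language_models` is hypothetical and not part of NLPKnowledge, and the path is the example path from the steps above, so adjust it to your install.

```python
from pathlib import Path
from typing import Iterable, List


def missing_language_models(
    tessdata_dir: str, languages: Iterable[str] = ("eng", "rus")
) -> List[str]:
    """Return the language codes whose .traineddata file is absent from tessdata_dir."""
    root = Path(tessdata_dir)
    return [lang for lang in languages if not (root / f"{lang}.traineddata").is_file()]


if __name__ == "__main__":
    # Example path copied from the instructions above; the version may differ.
    tessdata = "/usr/local/Cellar/tesseract/4.0.0/share/tessdata/"
    missing = missing_language_models(tessdata)
    print("missing models:", missing or "none")
```

An empty result means both eng.traineddata and rus.traineddata were found in the given directory.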
How to
Name images:
name_screenshot <abs_file_path>
name_screenshots <abs_dir_path>
name_screenshots --ignore_named <abs_dir_path>
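How the package derives a name from the recognized text is not described here. Purely as an illustration of the idea, the sketch below turns a piece of OCR text into a short file-name stem. The function `slug_from_text`, the five-word limit, and the "untitled" fallback are all assumptions made for this example, not the package's behavior.

```python
import re


def slug_from_text(text: str, max_words: int = 5) -> str:
    """Build a file-name stem from the first few words of some text.

    Hypothetical helper: keeps word characters only (Unicode-aware, so
    Russian text works too), lowercases them, and joins with underscores.
    """
    words = re.findall(r"\w+", text.lower())
    return "_".join(words[:max_words]) or "untitled"


if __name__ == "__main__":
    print(slug_from_text("Hello, World! This is a test"))
```

A real naming step would presumably pick more informative words (for example with the spaCy model installed above) instead of simply taking the first few.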
Create text siblings for each image:
create_text_siblings <abs_file_path>
create_text_siblings <abs_dir_path>
create_text_siblings --recursive <abs_dir_path>
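To make the idea of a "text sibling" concrete, here is a rough sketch of what such a step could look like: for each image, a .txt file with the same stem is written next to it. This is not the package's implementation. The names `iter_images` and `write_text_siblings`, the list of image extensions, and the `ocr` callable (any function that maps an image path to text) are assumptions introduced for illustration.

```python
from pathlib import Path
from typing import Callable, Iterator, List

# Assumed set of image extensions; the package's real list is not documented here.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def iter_images(path: Path, recursive: bool = False) -> Iterator[Path]:
    """Yield image files for a single file, a directory, or a directory tree."""
    if path.is_file():
        if path.suffix.lower() in IMAGE_SUFFIXES:
            yield path
        return
    candidates = path.rglob("*") if recursive else path.iterdir()
    for p in sorted(candidates):
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES:
            yield p


def write_text_siblings(
    path: Path, ocr: Callable[[Path], str], recursive: bool = False
) -> List[Path]:
    """Write <stem>.txt next to every image found and return the written paths."""
    written = []
    for image in iter_images(path, recursive):
        sibling = image.with_suffix(".txt")  # photo.png -> photo.txt
        sibling.write_text(ocr(image), encoding="utf-8")
        written.append(sibling)
    return written
```

Passing the OCR step in as a function keeps the sketch independent of any particular OCR binding; in practice it would wrap a call into tesseract.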
File details
Details for the file NLPKnowledge-0.0.2.7-py3-none-any.whl.
File metadata
- Download URL: NLPKnowledge-0.0.2.7-py3-none-any.whl
- Upload date:
- Size: 10.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.7.0
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | d8e6f13a1604e1885f323c01cdee8df48f6fd0afe684b91f771973f866eafc80 |
| MD5 | 25e7b01cef096a45ea463a67ac12aa74 |
| BLAKE2b-256 | 084ef803c5dc196d3dc22cdfeed4c3f4932e863549f5a30ca99c7e26a9e87357 |