Package to make sense of images with text information
NLPKnowledge
Installation instructions
Install package:
pip install NLPKnowledge
Install basic NLP model:
python -m spacy download en_core_web_sm
Install Tesseract (macOS shown here; for other platforms, see the instructions at https://github.com/tesseract-ocr/tesseract/wiki):
brew install tesseract
Check the path where the language models should be installed:
brew list tesseract
It should look like "/usr/local/Cellar/tesseract/4.0.0/share/tessdata/"
Install language models to tessdata:
wget -P /usr/local/Cellar/tesseract/4.0.0/share/tessdata/ https://github.com/tesseract-ocr/tessdata/raw/4.00/eng.traineddata
wget -P /usr/local/Cellar/tesseract/4.0.0/share/tessdata/ https://github.com/tesseract-ocr/tessdata/raw/4.00/rus.traineddata
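After downloading, it can help to confirm that the model files actually landed in the tessdata directory. The following is a minimal sketch, not part of NLPKnowledge: the function name missing_language_models is hypothetical, and the path in the example is the one shown above, which may differ on your machine.

```python
from pathlib import Path


def missing_language_models(tessdata_dir, languages=("eng", "rus")):
    """Return the language codes whose .traineddata file is absent.

    Hypothetical helper for illustration; it only inspects the file system
    and does not call Tesseract itself.
    """
    tessdata = Path(tessdata_dir)
    return [
        lang
        for lang in languages
        if not (tessdata / f"{lang}.traineddata").is_file()
    ]


if __name__ == "__main__":
    # Path taken from the instructions above; adjust it to your installation.
    path = "/usr/local/Cellar/tesseract/4.0.0/share/tessdata/"
    print("Missing models:", missing_language_models(path))
```

An empty list means both eng.traineddata and rus.traineddata were found.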
How to
Name images:
name_screenshot <abs_file_path>
name_screenshots <abs_dir_path>
name_screenshots --ignore_named <abs_dir_path>
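The naming commands above take absolute paths. When calling them from a script, a relative path can be resolved first. This is a minimal sketch under that assumption: build_command is a hypothetical helper, and only the command names and the --ignore_named flag come from the list above.

```python
import os


def build_command(tool, target, flag=None):
    """Build an argument list for one of the commands listed above.

    Hypothetical helper: `target` is converted to an absolute path because
    the commands are documented with <abs_file_path> / <abs_dir_path>.
    """
    args = [tool]
    if flag:
        args.append(flag)
    args.append(os.path.abspath(target))
    return args


if __name__ == "__main__":
    # Prints the argument list; pass it to subprocess.run(...) to execute it.
    print(build_command("name_screenshots", "screenshots", "--ignore_named"))
```

Building a list instead of a single string avoids shell quoting problems when a path contains spaces.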
Create text siblings for each image:
create_text_siblings <abs_file_path>
create_text_siblings <abs_dir_path>
create_text_siblings --recursive <abs_dir_path>
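To see which images still lack a text sibling, a directory can be scanned before or after running the commands above. This sketch rests on two assumptions that the description does not confirm: that a sibling is a file with the same name and a .txt extension, and that the image extensions listed in IMAGE_SUFFIXES are the relevant ones. The function name is hypothetical.

```python
from pathlib import Path

# Assumption: these are the image types of interest.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def images_without_sibling(directory, recursive=False):
    """List images that have no same-named .txt file next to them.

    Assumption: a "text sibling" is <image name>.txt in the same directory.
    """
    root = Path(directory)
    candidates = root.rglob("*") if recursive else root.glob("*")
    return sorted(
        p
        for p in candidates
        if p.is_file()
        and p.suffix.lower() in IMAGE_SUFFIXES
        and not p.with_suffix(".txt").exists()
    )


if __name__ == "__main__":
    for image in images_without_sibling(".", recursive=True):
        print(image)
```

The recursive option mirrors the --recursive flag shown above by descending into subdirectories.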
Download files
Source Distributions
No source distribution files available for this release.
Built Distribution
File details
Details for the file NLPKnowledge-0.0.2.7-py3-none-any.whl.
File metadata
- Download URL: NLPKnowledge-0.0.2.7-py3-none-any.whl
- Upload date:
- Size: 10.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.7.0
File hashes
Algorithm | Hash digest
---|---
SHA256 | d8e6f13a1604e1885f323c01cdee8df48f6fd0afe684b91f771973f866eafc80
MD5 | 25e7b01cef096a45ea463a67ac12aa74
BLAKE2b-256 | 084ef803c5dc196d3dc22cdfeed4c3f4932e863549f5a30ca99c7e26a9e87357