Package to make sense of images with text information
Project description
NLPKnowledge
Installation instructions
Install package:
pip install NLPKnowledge
Install basic NLP model:
python -m spacy download en_core_web_sm
Install tesseract (Mac OS) or use https://github.com/tesseract-ocr/tesseract/wiki to find instructions for your platform
brew install tesseract
Check path where language models should be installed
brew list tesseract
*t should look like "/usr/local/Cellar/tesseract/4.0.0/share/tessdata/"
Install language models to tessdata:
wget -P /usr/local/Cellar/tesseract/4.0.0/share/tessdata/ https://github.com/tesseract-ocr/tessdata/raw/4.00/eng.traineddata
wget -P /usr/local/Cellar/tesseract/4.0.0/share/tessdata/ https://github.com/tesseract-ocr/tessdata/raw/4.00/rus.traineddata
How to
Name images:
name_raw_screenshot <file_path>
name_raw_screenshots <dir_path>
name_raw_screenshots --ignore_named <dir_path>
Create text siblings for each image:
create_text_siblings <file_path>
create_text_siblings <dir_path>
create_text_siblings <path> --recursive
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
Close
Hashes for NLPKnowledge-0.0.2.5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9e1b14b67d409914764861dbbeadac89677a33b3e1efb9245d40d8be7861e37a |
|
MD5 | f7e08d3c6f6188a6a8e32771847ed446 |
|
BLAKE2b-256 | dc040de85eb2edba30708ea888cbca87719ba72aa004991f8140064be5ccd243 |