texta-tools
TEXTA Tools Python Package
- Text Processor
- Embedding & Phraser
- MLP Analyzer
- Text Splitter
Documentation with user guide is available here.
Installation
pip install texta-tools
Testing
python -m pytest -v tests
Using TikaOCR with different languages
- Install language packs: https://cwiki.apache.org/confluence/display/tika/TikaOCR
- Override the configured language with your request:
res = parser.from_file("yourfile.png", requestOptions={"headers": {"X-Tika-OCRLanguage": "est+eng+rus"}})
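The language value in the request above is a list of language codes joined with "+". As a minimal sketch, the options dictionary could be built from a list of codes like this. The helper name `build_ocr_request_options` is hypothetical and not part of texta-tools or Tika; it only assembles the same dictionary shape shown in the example.

```python
def build_ocr_request_options(languages):
    """Build a requestOptions-style dict carrying the X-Tika-OCRLanguage header.

    `languages` is an iterable of language codes, e.g. ["est", "eng"].
    Hypothetical helper for illustration; not part of texta-tools.
    """
    # Drop empty entries and surrounding whitespace.
    codes = [code.strip() for code in languages if code and code.strip()]
    if not codes:
        raise ValueError("at least one language code is required")
    # Codes are joined with "+", matching the "est+eng+rus" example above.
    return {"headers": {"X-Tika-OCRLanguage": "+".join(codes)}}


options = build_ocr_request_options(["est", "eng", "rus"])
print(options)
```

The resulting dictionary has the same shape as the `requestOptions` argument in the call above. Each language listed still needs its language pack installed on the machine running the OCR.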