Skip to main content

texta-tools

Project description

TEXTA Tools Python Package

  • Text Processor
  • Embedding & Phraser
  • MLP Analyzer
  • Text Splitter - Documentation with user guide is available here.

Installation

pip install texta-tools

Testing

python -m pytest -v tests

Using TikaOCR with different languages

  1. Install language packs: https://cwiki.apache.org/confluence/display/tika/TikaOCR

  2. Override the configured language with your request:

    res = parser.from_file("yourfile.png", requestOptions={"headers": {"X-Tika-OCRLanguage": "est+eng+rus"}})
    

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

texta-tools-1.2.1.tar.gz (29.7 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page