Skip to main content

Collection of classes and functions for text analysis

Project description

Introduction

Text-analysis-helpers is a collection of classes and functions for text analysis.

Installation

A Python 3 interpreter is required. It is recommended to install the package in a virtual environment in order to avoid corrupting the system's Python interpeter packages.

pip install text-analysis-helpers

python -m nltk.downloader "punkt"
python -m nltk.downloader "averaged_perceptron_tagger"
python -m nltk.downloader "maxent_ne_chunker"
python -m nltk.downloader "words"

Usage

You can use the HtmlAnalyser object to analyse the contents of a url.

from text_analysis_helpers.html import HtmlAnalyser

analyser = HtmlAnalyser()
analysis_result = analyser.analyse_url("http://www.add-a-url-here.com")

analysis_result.save("analysis_result.html")

You can see the scripts in the examples folder for some usage examples.

There is also an cli utility that can be used to analyse a url. For example to analyse a url and save the analysis result to a json encoded file execute the following command in the terminal.

text-analysis-helpers-cli analyse-url --json --output analysis_result.json https://www.the-url-to-analyse.com

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

text-analysis-helpers-0.2.1.tar.gz (9.2 kB view details)

Uploaded Source

Built Distribution

text_analysis_helpers-0.2.1-py3-none-any.whl (13.1 kB view details)

Uploaded Python 3

File details

Details for the file text-analysis-helpers-0.2.1.tar.gz.

File metadata

  • Download URL: text-analysis-helpers-0.2.1.tar.gz
  • Upload date:
  • Size: 9.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.5.2

File hashes

Hashes for text-analysis-helpers-0.2.1.tar.gz
Algorithm Hash digest
SHA256 176e0937b28e4a8c0002ac8fc9f6947d2cf33813cb8f23f53807428d2425e307
MD5 9faa2e1b30a50ac2476e9a85138825ec
BLAKE2b-256 48d981eaef8e34ce53212d848d071ef15a6f046e37b2b1559f1d24b28f26c170

See more details on using hashes here.

File details

Details for the file text_analysis_helpers-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: text_analysis_helpers-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 13.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.5.2

File hashes

Hashes for text_analysis_helpers-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 918c5963d72c6558160239cf3f532f57e0ec9e1e5479400eeb2aa18d2de42416
MD5 cdf98b5145bfabaa600814643f91ac36
BLAKE2b-256 84d8698762db28fccef9451ed224952ca1608a98cd60a11708fee713fc6dc51f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page