Skip to main content

Simple text analysis from the command line

Project description

textkit:
=====

Simple text analysis from the command line.

Homepage: `http://learntextvis.github.io/textkit/ <http://learntextvis.github.io/textkit/>`_

About
-----

`textkit` is a series of small, unix-style tools that provide a suite of capabilities for
dealing with text as data.

Think of textkit as basic natural language processing capabilities - from the command line.

textkit Features
----------------

Here are some of the cool things you can do with textkit.

Convert a document to a set of word tokens and remove all punctuation from the tokens:

.. code-block:: python

textkit text2words input.txt | textkit filterpunc -

Count top used words in a text:

.. code-block:: python

textkit text2words alice.txt | textkit count - | head

Do the same, but with punctuation removed:

.. code-block:: python

textkit text2words alice.txt | textkit filterpunc - | textkit count - | head

Installation
------------

::

$ pip install -U textkit
$ textkit --help


Dev install
-----------

To test locally, clone the repo:

::
git clone git@github.com:learntextvis/textkit.git


Create a local virtual environment or `conda` environment.

Here is how I created my local `conda` environment for installing and testing textkit:

::

conda create -name textkit nltk

source activate textkit

Then I went into the `textkit` directory to install its requirements

::

cd textkit

pip install -r requirements.txt

Finally, I installed the local version of textkit using the `--editable` flag:

::

pip install --editable .

Examples
--------

See more examples at the `Quickstart guide`_.

.. _`Quickstart guide`: http://learntextvis.github.io/textkit/quickstart.html


Requirements
------------

- Python >= 2.6 or >= 3.3

Project Links
-------------

- Docs: http://learntextvis.github.io/textkit/
- PyPI: https://pypi.python.org/pypi/textkit
- Issues: https://github.com/learntextvis/textkit/issues

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

textkit-0.0.3.tar.gz (3.2 kB view details)

Uploaded Source

Built Distribution

textkit-0.0.3-py2.py3-none-any.whl (5.3 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file textkit-0.0.3.tar.gz.

File metadata

  • Download URL: textkit-0.0.3.tar.gz
  • Upload date:
  • Size: 3.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for textkit-0.0.3.tar.gz
Algorithm Hash digest
SHA256 290c676d6dc7844efab437ecedcb227b0c17d337dd336c71c7797c077175708f
MD5 3811ed5dd7adf47570cf79af35102715
BLAKE2b-256 47d19a87f0b3cc0cd66e5cdc4bc0ce57ab77ce6c90ab112eeeb60135a36e812d

See more details on using hashes here.

File details

Details for the file textkit-0.0.3-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for textkit-0.0.3-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 f830ab4c141202851cd34db3d6d902fbcefe081babc2a40a84577f1d7ecb6a4b
MD5 8b1d3a672524d0dbafad60d4708965e2
BLAKE2b-256 5326457dd7a46a0114490491b13c26c9032654d3966f02cb53d05c1ca56127af

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page