textkit
=======
Simple text analysis from the command line.
Homepage: `http://learntextvis.github.io/textkit/ <http://learntextvis.github.io/textkit/>`_
About
-----
`textkit` is a set of small, Unix-style tools for working with text as data.
Think of it as basic natural language processing from the command line.
textkit Features
----------------
Here are some of the cool things you can do with textkit.
Convert a document to a set of word tokens and remove all punctuation from the tokens:

.. code-block:: bash

    textkit text2words input.txt | textkit filterpunc -

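
To make the two steps in this pipeline concrete, here is a rough Python sketch of tokenizing text and then dropping punctuation tokens. It is an illustration only: the function names `text_to_words` and `filter_punc`, the regular-expression tokenizer, and the rule "drop tokens made up only of punctuation" are all assumptions, not textkit's actual implementation.

.. code-block:: python

    import re
    import string


    def text_to_words(text):
        """Split text into word tokens and single punctuation tokens (hypothetical helper)."""
        # \w+ matches a run of word characters; [^\w\s] matches one punctuation mark.
        return re.findall(r"\w+|[^\w\s]", text)


    def filter_punc(tokens):
        """Drop tokens that consist entirely of ASCII punctuation (hypothetical helper)."""
        return [t for t in tokens if not all(ch in string.punctuation for ch in t)]


    tokens = text_to_words("Hello, world! Hello again.")
    words = filter_punc(tokens)
    print(words)

Keeping each step as a small function that takes and returns a list of tokens mirrors the way the command-line tools are chained with pipes.
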
Count the most frequently used words in a text:

.. code-block:: bash

    textkit text2words alice.txt | textkit count - | head

Do the same, but with punctuation removed:

.. code-block:: bash

    textkit text2words alice.txt | textkit filterpunc - | textkit count - | head

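
The counting step can be pictured as building a frequency table and listing the most common entries first. The sketch below uses `collections.Counter` from the Python standard library; `count_tokens` is a hypothetical name, and the output format of textkit's own `count` command may differ.

.. code-block:: python

    from collections import Counter


    def count_tokens(tokens, top=10):
        """Return up to `top` (token, count) pairs, most frequent first (hypothetical helper)."""
        return Counter(tokens).most_common(top)


    tokens = ["the", "cat", "saw", "the", "dog", "the", "cat"]
    for token, n in count_tokens(tokens):
        print(token, n)

The default of ten entries is chosen to match what `head` prints when it is placed at the end of the shell pipeline.
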
Installation
------------
::

    $ pip install -U textkit
    $ textkit --help

Dev install
-----------
To test locally, clone the repo:
::

    git clone git@github.com:learntextvis/textkit.git

Create a local virtual environment or `conda` environment.
Here is how I created my local `conda` environment for installing and testing textkit:
::

    conda create --name textkit nltk
    source activate textkit

Then I went into the `textkit` directory to install its requirements:

::

    cd textkit
    pip install -r requirements.txt

Finally, I installed the local version of textkit using the `--editable` flag:
::

    pip install --editable .

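
One way to confirm that the installed package is visible to the active environment is to look up its metadata from Python. This is only a sketch: `installed_version` is a hypothetical helper, and it relies on `importlib.metadata`, which requires Python 3.8 or later (newer than the minimum versions listed under Requirements).

.. code-block:: python

    from importlib.metadata import PackageNotFoundError, version


    def installed_version(package):
        """Return the installed version string of `package`, or None if it is not installed."""
        try:
            return version(package)
        except PackageNotFoundError:
            return None


    # Prints a version string if textkit is installed in this environment, otherwise None.
    print(installed_version("textkit"))
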
Examples
--------
See more examples at the `Quickstart guide`_.
.. _`Quickstart guide`: http://learntextvis.github.io/textkit/quickstart.html
Requirements
------------
- Python >= 2.6 or >= 3.3
Project Links
-------------
- Docs: http://learntextvis.github.io/textkit/
- PyPI: https://pypi.python.org/pypi/textkit
- Issues: https://github.com/learntextvis/textkit/issues