TextFlows core text mining module
Project description
# TextFlows Core Module #
A [TextFlows](https://github.com/xflows/textflows/) package, which contains the core classes for representing an annotated document corpus, as well as text mining widgets (UI components) based on [NLTK](http://www.nltk.org/). The package can also be used with [ClowdFlows](https://github.com/xflows/clowdflows/) 2.0.
[![Documentation Status](https://readthedocs.org/projects/rdm/badge/?version=latest)](http://docs.textflows.org/)
Currently, the project contains several components for text preprocessing: tokenization, stop word removal, lemmatization, part-of-speech tagging, etc.
## Installation, documentation ##
Please find installation instructions, examples and API reference on [Read the Docs](http://docs.textflows.org/).
## Note ##
Please note that this is a research project and that drastic changes can be (and are) made pretty regularly. Changes are documented in the [CHANGELOG](CHANGELOG.md).
Pull requests and issues are welcome.
## Contributors to the tf_core package code ##
Matic Perovšek (@mperice), Matej Martinc (@matejMartinc), Roman Orač (@romanorac)
[Knowldge Technologies Department](http://kt.ijs.si), Jožef Stefan Institute, Ljubljana
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file tf_core_p3-0.0.4.tar.gz
.
File metadata
- Download URL: tf_core_p3-0.0.4.tar.gz
- Upload date:
- Size: 511.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.37.0 CPython/3.6.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0ebf129efd284d0af617c4eb0e754576746985409610951e0f5130578fd83d72 |
|
MD5 | 9ccb9b961684dbf6061a1f6e1955af98 |
|
BLAKE2b-256 | 70ed0735c6ffd593f32b5176843016bb2fb6892590ccd0c2010691c75a576606 |