Skip to main content

ClowdFlows natural language processing module

Project description

# ClowdFlows NLP Module #

A [ClowdFlows](https://github.com/xflows/clowdflows/) package, which contains widgets for natural language processing. The package can also be used with [ClowdFlows](https://github.com/xflows/clowdflows/) 2.0.

[![Documentation Status](https://readthedocs.org/projects/rdm/badge/?version=latest)](http://clowdflows.readthedocs.io/)

Currently, the project contains components for different corpus operations, basic natural language processing operations such as tokenization, stop word removal, lemmatization, part-of-speech tagging, etc. It also has modules for tweet streaming, term extraction and gender classification.

## Installation, documentation ##

Since three pickled models are too big for github, you have to download the following models manually from external links and add them to the cf_nlp/models/reldi_tagger subfolder in order to make Reldi tagger and Reldi lemmatizer work:

Please note that because of package size limits the pypi packgage does not include the models, which needs to be added manually. This can be done by downloading the model folder from github (https://github.com/xflows/cf_nlp/tree/master/nlp/models). The three pickled models mentioned above need to be downloaded manually and added to the folder. You can also download a wheel with all the models inside from:

Please find other installation instructions, examples and API reference on [Read the Docs](http://clowdflows.readthedocs.io/).

## Note ##

Please note that this is a research project and that drastic changes can be (and are) made pretty regularly. Changes are documented in the [CHANGELOG](CHANGELOG.md).

Pull requests and issues are welcome.

## Contributors to the cf_nlp package code ##

Matej Martinc (@matejMartinc)

  • [Knowledge Technologies Department](http://kt.ijs.si), Jožef Stefan Institute, Ljubljana

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cf_nlp-0.1.11.tar.gz (287.6 kB view details)

Uploaded Source

Built Distribution

cf_nlp-0.1.11-py3-none-any.whl (340.2 kB view details)

Uploaded Python 3

File details

Details for the file cf_nlp-0.1.11.tar.gz.

File metadata

  • Download URL: cf_nlp-0.1.11.tar.gz
  • Upload date:
  • Size: 287.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.29.1 CPython/3.5.2

File hashes

Hashes for cf_nlp-0.1.11.tar.gz
Algorithm Hash digest
SHA256 d73189f18215599d48686df7f6bfbe93738e341196f941fcf014f1c9f961fc62
MD5 044cf3c4246edd5a576c6ada6ad9024f
BLAKE2b-256 93ec2f4d8f3f3d8c2ab02b0d062e5d06582f583ad60955c21b8a137ca269d375

See more details on using hashes here.

File details

Details for the file cf_nlp-0.1.11-py3-none-any.whl.

File metadata

  • Download URL: cf_nlp-0.1.11-py3-none-any.whl
  • Upload date:
  • Size: 340.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.29.1 CPython/3.5.2

File hashes

Hashes for cf_nlp-0.1.11-py3-none-any.whl
Algorithm Hash digest
SHA256 969547cce7f6cbc35b04f9699d1633d19786d70bdc8c316c1d57bfbe0650b7a2
MD5 ce6c3361ec5f5054c1d56c45a1dcb1b0
BLAKE2b-256 8bfa266844236d668bf14e109d724a84380cf3f5cf9080ebb04d121cd6a70a9a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page