Statistical NLP
Project description
snlp
Statistical NLP (SNLP): A practical package with statisical natural language processing tools. SNLP is based on statistical and distributional attributes of natural language and hence most of the functionalities are unsupervised.
Features
- Identifying Multiword Expressions (Collocations) in the corpus. Used for terminology and keyphrase extraction. Can lead to improvement in text classification.
- Identifying statistically redundant words for filtering. Usually leads to an improvement in document classification.
Upcoming Features
- Anamoly Detection.
- Identifying non-compositional compouds: Can be used for tasks such as profanity/hate-speech detection, and linguistic analysis of a corpus.
Usage
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
snlp-0.0.1.tar.gz
(11.7 kB
view hashes)
Built Distribution
snlp-0.0.1-py3-none-any.whl
(14.5 kB
view hashes)