A comprehensive package for stopwords in NLP and text mining
Project description
Stopwords Guilan NLP
This is a comprehensive stopwords for natural language processing and text mining.
Installation
Now stopwords_guilannlp is on PyPI!
- Download via PyPI:
$ pip3 install stopwords_guilannlp
- Or clone the repo:
$ git clone --recursive git://github.com/JoyeBright/Sstopwords_guilannlp.git
- Then install it:
$ python3 setup.py install
Usage
tokens = word_tokenize(s)
filtered_tokens = []
stopwords = stopwords_output("Persian", "nar")
for w in tokens:
if w not in stopwords:
filtered_tokens.append(w)
return filtered_tokens
- Note: Package does not support tokenizing process.
Supported Languages
- English
- Persian
Type of Outputs
- Dataframe
- Numpy Array
- Set
- List
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for stopwords_guilannlp-13.2019.3.5.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9b402b08679fbe2d1b7ca38d0330a2d11c9ed24b8aa96c05a250a996fa12c227 |
|
MD5 | 3bfbd92fd1c3282693eb31b28fb082ce |
|
BLAKE2b-256 | 568bbe00cdce5a2b95caf1de54f59dbb5f8f339bb1f125da902186b37ecba422 |
Close
Hashes for stopwords_guilannlp-13.2019.3.5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0d69adfab1316d204376f3df75d712f90577e72625aed6e53734f4e0beaf8d15 |
|
MD5 | d8273e513d042f3e9673b8cbcd60f35d |
|
BLAKE2b-256 | 44bca01c003b59a91187e89d11e73e8bb2834bb9ae6b36fe576a4b617c90bd23 |