Stopwords in NLP 🦜
Project description
Stopwords in NLP
Installation
pip install stopwds
you may want to checkout the version
stopwds version
Usage
stopwords
from stopwds import stopwords
'''
baidu: 百度停用词表
hit: 哈工大停用词表
scu: 四川大学机器智能实验室停用词表
cn: 中文停用词表
'''
for stopword in stopwords('baidu'):
print(stopword)
and you can use like this:
from stopwds import stopwords
import jieba
text = ('医美产业崛起的同时,我国医美行业也形成了一条清晰且完整的产业链,上游医美产品生产企业占据了产业链核心环节。')
cut_sent = [word for word in jieba.cut(text) if word and word not in stopwords()]
Acknowlegements
- stopwords
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
stopwds-0.1.5.tar.gz
(5.2 kB
view details)
Built Distribution
File details
Details for the file stopwds-0.1.5.tar.gz
.
File metadata
- Download URL: stopwds-0.1.5.tar.gz
- Upload date:
- Size: 5.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.56.0 CPython/3.7.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9c12ac69fc2c4b4351e6798828c85e34246fad4986f9c0c51c24dcb0b0f52cad |
|
MD5 | aa48832b2dc94ab1583aef888ad28e58 |
|
BLAKE2b-256 | ba69cf9dc71114587cbc2953d588c03b1c911136b61044d732ded7f3e96a1489 |
File details
Details for the file stopwds-0.1.5-py2.py3-none-any.whl
.
File metadata
- Download URL: stopwds-0.1.5-py2.py3-none-any.whl
- Upload date:
- Size: 19.8 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.56.0 CPython/3.7.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c5d7d4406b2b63a3e37a69edbe0afbe706969e25a0b9a632f4505dfd577de86d |
|
MD5 | a0d99dd65312472cc86b13c1735f901e |
|
BLAKE2b-256 | 90ba1a53c74261ed3c53206a5fb70580999e23c81668ab30b0afb816dc9eb8c1 |