Stopwords in NLP 🦜
Project description
Stopwords in NLP
Installation
pip install stopwds
you may want to checkout the version
stopwds version
Usage
stopwords
from stopwds import stopwords
'''
baidu: 百度停用词表
hit: 哈工大停用词表
scu: 四川大学机器智能实验室停用词表
cn: 中文停用词表
'''
for stopword in stopwords('baidu'):
print(stopword)
and you can use like this:
from stopwds import stopwords
import jieba
text = ('医美产业崛起的同时,我国医美行业也形成了一条清晰且完整的产业链,上游医美产品生产企业占据了产业链核心环节。')
cut_sent = [word for word in jieba.cut(text) if word and word not in stopwords()]
Acknowlegements
- stopwords
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
stopwds-0.1.5.tar.gz
(5.2 kB
view hashes)
Built Distribution
Close
Hashes for stopwds-0.1.5-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c5d7d4406b2b63a3e37a69edbe0afbe706969e25a0b9a632f4505dfd577de86d |
|
MD5 | a0d99dd65312472cc86b13c1735f901e |
|
BLAKE2b-256 | 90ba1a53c74261ed3c53206a5fb70580999e23c81668ab30b0afb816dc9eb8c1 |