stopwords-zh
Project description
🔥stopwords-zh🔥
欢迎提交更新,共建中文停用词库
Install
pip install -U stopwords-zh
Docs
Usages
- source: string, 停用词来源,目前支持
- baidu: 百度停用词表
- hit: 哈工大停用词表
- ict: 中科院计算所停用词表
- scu: 四川大学机器智能实验室停用词库
- cn: 广为流传未知来源的中文停用词表
- marimo: Marimo multi-lingual stopwords collection 内的中文停用词
- iso: Stopwords ISO 内的中文停用词
- all: 上述所有停用词并集
import jieba
from stopwords import stopwords, filter_stopwords
print(filter_stopwords(jieba.cut('欢迎提交更新,共建中文停用词库')))
TODO
- 停用词
- 情感字典
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for stopwords-zh-2023.6.5.13.18.38.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 70920c80e734f2b4449f9fd21e389365438a9b1328f25fefb0555f5b612b9cbc |
|
MD5 | e7ee27095a19a199650334494f0c1d12 |
|
BLAKE2b-256 | 8723c5d6fb809da70a4c93c62352c3d8f6e4dec98f9cd2fdeb47f6a336734cd3 |
Close
Hashes for stopwords_zh-2023.6.5.13.18.38-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ee009979e2cf52095db449569792dbfce16bff5b64fff9c744da646319c8858d |
|
MD5 | 996f8a1c7d5b50639e75a19550f7ed14 |
|
BLAKE2b-256 | d71af5bc0439542b97f5f7705627142283e810da2deb14d74a463c67ed549fd0 |