stopwords-zh
🔥stopwords-zh🔥
Contributions are welcome: help build a shared Chinese stopword library.
Install
pip install -U stopwords-zh
Docs
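Usage
- source: string, the stopword source. Currently supported values:
  - baidu: Baidu stopword list
  - hit: Harbin Institute of Technology (HIT) stopword list
  - ict: stopword list from the Institute of Computing Technology, Chinese Academy of Sciences
  - scu: stopword list from the Machine Intelligence Laboratory, Sichuan University
  - cn: a widely circulated Chinese stopword list of unknown origin
  - marimo: the Chinese stopwords in the Marimo multi-lingual stopwords collection
  - iso: the Chinese stopwords in Stopwords ISO
  - all: the union of all the lists above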
import jieba
from stopwords import stopwords, filter_stopwords
# Segment the sample sentence with jieba, then drop the stopwords from the tokens
print(filter_stopwords(jieba.cut('欢迎提交更新,共建中文停用词库')))
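To make the meaning of the source values concrete, here is a minimal, library-independent sketch of how a single list and the "all" union could behave when filtering tokens. Everything in it is hypothetical: `SAMPLE_SOURCES`, `load_sample_stopwords`, and `filter_tokens` are illustrative names, the two tiny word sets stand in for the real lists, and none of this is the package's actual implementation or API.

```python
# Hypothetical miniature stopword lists standing in for the real sources.
# The lists shipped with the package are much larger.
SAMPLE_SOURCES = {
    "baidu": {"的", "了", "和"},
    "hit": {"的", "是", "在"},
}


def load_sample_stopwords(source="all"):
    """Return the sample set for one source, or the union of all for "all"."""
    if source == "all":
        return set().union(*SAMPLE_SOURCES.values())
    return set(SAMPLE_SOURCES[source])


def filter_tokens(tokens, source="all"):
    """Keep only the tokens that are not in the chosen sample stopword set."""
    stop = load_sample_stopwords(source)
    return [token for token in tokens if token not in stop]


tokens = ["我", "是", "在", "北京", "的", "学生"]
print(filter_tokens(tokens, source="baidu"))  # ['我', '是', '在', '北京', '学生']
print(filter_tokens(tokens, source="all"))    # ['我', '北京', '学生']
```

A single source removes only the words on that list, while "all" removes any word that appears on at least one list, so it filters more aggressively.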
TODO
- Stopwords
- Sentiment dictionary
File details
Details for the file stopwords-zh-2023.6.5.13.18.38.tar.gz.
File metadata
- Download URL: stopwords-zh-2023.6.5.13.18.38.tar.gz
- Upload date:
- Size: 38.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.13
File hashes
Algorithm | Hash digest
---|---
SHA256 | 70920c80e734f2b4449f9fd21e389365438a9b1328f25fefb0555f5b612b9cbc
MD5 | e7ee27095a19a199650334494f0c1d12
BLAKE2b-256 | 8723c5d6fb809da70a4c93c62352c3d8f6e4dec98f9cd2fdeb47f6a336734cd3
File details
Details for the file stopwords_zh-2023.6.5.13.18.38-py3-none-any.whl.
File metadata
- Download URL: stopwords_zh-2023.6.5.13.18.38-py3-none-any.whl
- Upload date:
- Size: 39.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.13
File hashes
Algorithm | Hash digest
---|---
SHA256 | ee009979e2cf52095db449569792dbfce16bff5b64fff9c744da646319c8858d
MD5 | 996f8a1c7d5b50639e75a19550f7ed14
BLAKE2b-256 | d71af5bc0439542b97f5f7705627142283e810da2deb14d74a463c67ed549fd0
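If you download one of the files manually, the published SHA256 digest can be compared against the local copy. The following is a minimal sketch using only the standard library; the helper names `sha256_of_file` and `matches_published_digest` are hypothetical, and the file path in the usage comment assumes the wheel was saved to the current directory.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """Return True if the file's SHA256 equals the published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Example call (assumes the wheel sits in the current directory):
# matches_published_digest(
#     "stopwords_zh-2023.6.5.13.18.38-py3-none-any.whl",
#     "ee009979e2cf52095db449569792dbfce16bff5b64fff9c744da646319c8858d",
# )
```

A mismatch usually means the download is incomplete or is not the file that was published, so it is safer to download it again than to install it.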