stopwords-zh
🔥stopwords-zh🔥
Contributions are welcome: help build a shared Chinese stopword library.
Install
pip install -U stopwords-zh
Docs
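Usage
- source: string, the stopword source. Currently supported values:
  - baidu: Baidu stopword list
  - hit: Harbin Institute of Technology (HIT) stopword list
  - ict: stopword list from the Institute of Computing Technology, Chinese Academy of Sciences
  - scu: stopword list from the Machine Intelligence Laboratory, Sichuan University
  - cn: a widely circulated Chinese stopword list of unknown origin
  - marimo: the Chinese stopwords in the Marimo multi-lingual stopwords collection
  - iso: the Chinese stopwords in Stopwords ISO
  - all: the union of all the lists above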
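To make the `all` option concrete, here is a minimal sketch of what "union of all sources" means. It does not call the package: `TOY_SOURCES` and `combine_sources` are hypothetical names introduced for illustration, and the word sets are toy data, not the real contents of any list.

```python
# Toy stand-ins for the real lists; the entries are illustrative only
# and are NOT the actual contents of any source named above.
TOY_SOURCES = {
    "baidu": {"的", "了", "和"},
    "hit": {"的", "是", "在"},
    "cn": {"了", "我们"},
}


def combine_sources(sources, names=None):
    """Return the union of the selected stopword sets.

    When names is None, every set in `sources` is combined,
    which mirrors the idea behind the `all` option.
    """
    selected = list(sources) if names is None else names
    combined = set()
    for name in selected:
        combined |= sources[name]  # raises KeyError for an unknown name
    return combined


all_words = combine_sources(TOY_SOURCES)
print(sorted(all_words))
```

Because the result is a set, a word that appears in several sources is counted once, so the combined list is never larger than the sum of its parts.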
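import jieba
from stopwords import stopwords, filter_stopwords

# Segment the sentence with jieba, then drop the stopwords.
# The sample text reads: "Contributions are welcome: help build a shared Chinese stopword library".
print(filter_stopwords(jieba.cut('欢迎提交更新,共建中文停用词库')))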
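The filtering step itself can be sketched without either dependency. The snippet below is an assumption-laden illustration, not the package's implementation: `filter_tokens` is a hypothetical helper, the token list is written by hand rather than produced by jieba, and the stopword set is toy data.

```python
def filter_tokens(tokens, stopword_set):
    """Keep the tokens that are not in stopword_set, preserving their order."""
    return [tok for tok in tokens if tok not in stopword_set]


# Hand-written tokens standing in for a segmenter's output
# (assumed for illustration; not generated by jieba here).
tokens = ["欢迎", "提交", "更新", ",", "共建", "中文", "停用词", "库"]

# Toy stopword set, illustrative only.
toy_stopwords = {"欢迎", ","}

kept = filter_tokens(tokens, toy_stopwords)
print(kept)
```

Holding the stopwords in a set keeps each membership check cheap, which matters when the same list is applied to many documents.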
TODO
- Stopwords
- Sentiment dictionary
Hashes for stopwords-zh-2023.4.27.17.54.24.tar.gz
Algorithm | Hash digest
---|---
SHA256 | cb9a920980e07ad84b6137b2012ba33412821f8ab4b6fcf6a970e749815dac02
MD5 | 977dbdca9d96f326c3428d7113af5067
BLAKE2b-256 | 3e26c09fc5c285cd94b8e0975553504079adc0190fc17dd6072f47e084300f83
Hashes for stopwords_zh-2023.4.27.17.54.24-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 5145533ecca63ae019f754a8397e675b8e3ae158e7646fcf89d05bfa84300aad
MD5 | 5cafbed553042de45eb05ac217452087
BLAKE2b-256 | 3e5b62e434bc5d079b905e6bec1124ac1b80a3528bbe71dda8a702cb8ecc9f07