Skip to main content

stopwords-zh

Project description

image image image

🔥stopwords-zh🔥


欢迎提交更新,共建中文停用词库

Install

pip install -U stopwords-zh

Docs

Usages

  • source: string, 停用词来源,目前支持
    • baidu: 百度停用词表
    • hit: 哈工大停用词表
    • ict: 中科院计算所停用词表
    • scu: 四川大学机器智能实验室停用词库
    • cn: 广为流传未知来源的中文停用词表
    • marimo: Marimo multi-lingual stopwords collection 内的中文停用词
    • iso: Stopwords ISO 内的中文停用词
    • all: 上述所有停用词并集
import jieba
from stopwords import stopwords, filter_stopwords

print(filter_stopwords(jieba.cut('欢迎提交更新,共建中文停用词库')))

TODO

  • 停用词
  • 情感字典

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

stopwords-zh-2023.6.5.13.18.38.tar.gz (38.0 kB view details)

Uploaded Source

Built Distribution

stopwords_zh-2023.6.5.13.18.38-py3-none-any.whl (39.2 kB view details)

Uploaded Python 3

File details

Details for the file stopwords-zh-2023.6.5.13.18.38.tar.gz.

File metadata

File hashes

Hashes for stopwords-zh-2023.6.5.13.18.38.tar.gz
Algorithm Hash digest
SHA256 70920c80e734f2b4449f9fd21e389365438a9b1328f25fefb0555f5b612b9cbc
MD5 e7ee27095a19a199650334494f0c1d12
BLAKE2b-256 8723c5d6fb809da70a4c93c62352c3d8f6e4dec98f9cd2fdeb47f6a336734cd3

See more details on using hashes here.

File details

Details for the file stopwords_zh-2023.6.5.13.18.38-py3-none-any.whl.

File metadata

File hashes

Hashes for stopwords_zh-2023.6.5.13.18.38-py3-none-any.whl
Algorithm Hash digest
SHA256 ee009979e2cf52095db449569792dbfce16bff5b64fff9c744da646319c8858d
MD5 996f8a1c7d5b50639e75a19550f7ed14
BLAKE2b-256 d71af5bc0439542b97f5f7705627142283e810da2deb14d74a463c67ed549fd0

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page