chinormfilter

Filters synonym entries written in Lucene format, removing those that merely duplicate Sudachi normalization. Mainly useful when migrating to the Sudachi analyzer.

Usage

$ chinormfilter tests/test.txt -o out.txt

The filtered result is as follows.

レナリドミド,レナリドマイド
リンゴ => 林檎
飲む,呑む
tlc => tlc,全肺気量
リンたんぱく質,リン蛋白質,リンタンパク質

↓ filter

レナリドミド,レナリドマイド
tlc => tlc,全肺気量
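
The filtering rule suggested by the example above can be sketched in Python. This is a minimal illustration, not chinormfilter's actual implementation: it assumes an entry is dropped when every term on the line normalizes to the same form. The names `parse_terms`, `filter_synonyms`, `toy_normalize`, and the `TOY_NORMALIZED` table are hypothetical; the table is a stand-in for Sudachi normalization, chosen only to reproduce the sample result.

```python
from typing import Callable, Iterable, List


def parse_terms(line: str) -> List[str]:
    """Split a Lucene-style synonym line into its individual terms.

    Handles both "a,b,c" and "a => b,c" forms.
    """
    terms: List[str] = []
    for side in line.split("=>"):
        terms.extend(t.strip() for t in side.split(",") if t.strip())
    return terms


def filter_synonyms(
    lines: Iterable[str], normalize: Callable[[str], str]
) -> List[str]:
    """Keep only lines whose terms do NOT all share one normalized form.

    Assumption: a line is redundant when normalization alone already
    maps every term to the same form.
    """
    kept: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        forms = {normalize(term) for term in parse_terms(line)}
        if len(forms) > 1:
            kept.append(line)
    return kept


# Hypothetical stand-in for Sudachi normalization (not real Sudachi output).
TOY_NORMALIZED = {
    "リンゴ": "林檎",
    "呑む": "飲む",
    "リンたんぱく質": "リン蛋白質",
    "リンタンパク質": "リン蛋白質",
}


def toy_normalize(term: str) -> str:
    """Return the toy normalized form, or the term itself if unknown."""
    return TOY_NORMALIZED.get(term, term)


SAMPLE = [
    "レナリドミド,レナリドマイド",
    "リンゴ => 林檎",
    "飲む,呑む",
    "tlc => tlc,全肺気量",
    "リンたんぱく質,リン蛋白質,リンタンパク質",
]

result = filter_synonyms(SAMPLE, toy_normalize)
for entry in result:
    print(entry)
```

Passing the normalizer in as a function keeps the sketch independent of any particular dictionary; a real normalizer backed by Sudachi could be substituted for `toy_normalize` without changing the filter.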

Specify System Dict

$ chinormfilter tests/test.txt -s full -o out.txt

Use Custom Dict

Specify the dictionary via a sudachi.json settings file.

$ chinormfilter tests/test.txt -s sudachi.json -o out.txt

TODO

  • custom dict test
