Skip to main content

Imitate Japanese morphological analysis of Sudachi SplitMode.C and SudachiDict-full with a small onnx model.

Project description

Sudachi C Imitator

This package imitates Japanese morphological analysis of Sudachi SplitMode.C and SudachiDict-full with a small onnx model.
You can add tokenization and part-of-speech estimation to your environment
with only 4MB additoinal disk space if you already have onnxruntime in your environment.

Installation

$ pip install sudachi_c_imitator

Usage Examples

>>> import sudachi_c_imitator
>>> tagger = sudachi_c_imitator.Tagger()
>>> sample_text = '使い方のサンプルです。'
>>> tagger.parse(sample_text)
[('使い方', '名詞'), ('の', '助詞'), ('サンプル', '名詞'), ('です', '助動詞'), ('。', '補助記号')]

Input string length must be 256 or less.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sudachi_c_imitator-0.2.1.tar.gz (3.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sudachi_c_imitator-0.2.1-py3-none-any.whl (3.1 MB view details)

Uploaded Python 3

File details

Details for the file sudachi_c_imitator-0.2.1.tar.gz.

File metadata

  • Download URL: sudachi_c_imitator-0.2.1.tar.gz
  • Upload date:
  • Size: 3.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for sudachi_c_imitator-0.2.1.tar.gz
Algorithm Hash digest
SHA256 5dfd6d536369ef8610c85ff15e4453228bfc5c4fe168fb2fbc4772125ca111b7
MD5 e209987eef89fbb3e84a36c4f177c064
BLAKE2b-256 8eda0f2c4edbe3f09119d5a94e47866aa73daf4d64dfa74251fc81aa5b07c042

See more details on using hashes here.

File details

Details for the file sudachi_c_imitator-0.2.1-py3-none-any.whl.

File metadata

File hashes

Hashes for sudachi_c_imitator-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 f31d93c12cb023f358a81645f2e5de6c550b3ce66f5ad434bc6f4a82182849eb
MD5 24f48532c15f01d810c78f8f59d70538
BLAKE2b-256 fb7e7db002d2e5c8e5c6cf7fce04b5182822e64a6595a482db8da70248f0f03a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page