
Imitates the Japanese morphological analysis of Sudachi SplitMode.C and SudachiDict-full with a small ONNX model.


Sudachi C Imitator

This package imitates the Japanese morphological analysis of Sudachi (SplitMode.C with SudachiDict-full) using a small ONNX model.
It adds tokenization and part-of-speech estimation to your environment
with only 4 MB of additional disk space, provided onnxruntime is already installed.

Installation

$ pip install sudachi_c_imitator

Usage Examples

>>> import sudachi_c_imitator
>>> tagger = sudachi_c_imitator.Tagger()
>>> sample_text = '使い方のサンプルです。'
>>> tagger.parse(sample_text)
[('使い方', '名詞'), ('の', '助詞'), ('サンプル', '名詞'), ('です', '助動詞'), ('。', '補助記号')]
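
The result shown above is a plain list of (surface, part-of-speech) tuples, so it can be post-processed with ordinary Python. The following is a minimal sketch that works on the sample output copied from the example; the variable names are only illustrative and are not part of the package.

```python
# Sample output copied from the usage example above:
# a list of (surface, part-of-speech) tuples.
tokens = [('使い方', '名詞'), ('の', '助詞'), ('サンプル', '名詞'),
          ('です', '助動詞'), ('。', '補助記号')]

# Keep only the surface forms (space-separated "wakati" style output).
surfaces = [surface for surface, _ in tokens]
wakati = ' '.join(surfaces)

# Keep only the tokens tagged as nouns (名詞).
nouns = [surface for surface, pos in tokens if pos == '名詞']

print(wakati)  # 使い方 の サンプル です 。
print(nouns)   # ['使い方', 'サンプル']
```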

Input string length must be 256 or less.
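
For longer text, one possible workaround is to split the input before parsing. The sketch below is not part of the package: `split_text` and `parse_long` are hypothetical helpers, the limit is assumed to be counted in Python characters (`len(text)`), and splitting at sentence punctuation is just one reasonable choice. Results near chunk boundaries may differ from what a single pass over the whole text would give.

```python
import re

MAX_LEN = 256  # limit stated above; assumed here to mean len(text) <= 256


def split_text(text, max_len=MAX_LEN):
    """Split text into chunks of at most max_len characters,
    preferring to cut after sentence-ending punctuation or newlines."""
    # Each match is a run of non-delimiters plus an optional trailing delimiter.
    sentences = re.findall(r'[^。!?!?\n]*[。!?!?\n]?', text)
    chunks, current = [], ''
    for sentence in sentences:
        if not sentence:
            continue
        # A single sentence longer than the limit is cut at fixed offsets.
        while len(sentence) > max_len:
            if current:
                chunks.append(current)
                current = ''
            chunks.append(sentence[:max_len])
            sentence = sentence[max_len:]
        # Start a new chunk when adding this sentence would exceed the limit.
        if len(current) + len(sentence) > max_len:
            chunks.append(current)
            current = ''
        current += sentence
    if current:
        chunks.append(current)
    return chunks


def parse_long(parse, text, max_len=MAX_LEN):
    """Apply a parse callable chunk by chunk and concatenate the token lists."""
    tokens = []
    for chunk in split_text(text, max_len):
        tokens.extend(parse(chunk))
    return tokens
```

With the package installed, this could be called as `parse_long(sudachi_c_imitator.Tagger().parse, long_text)`, using the `Tagger` and `parse` names from the usage example above.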
