Skip to main content

Imitate Japanese morphological analysis of Sudachi SplitMode.B and SudachiDict-full with a small onnx model.

Project description

Sudachi B Imitator

This package imitates Japanese morphological analysis of Sudachi SplitMode.B and SudachiDict-full with a small onnx model.
You can add tokenization and part-of-speech estimation to your environment
with only 4MB additoinal disk space if you already have onnxruntime in your environment.

Installation

$ pip install sudachi_b_imitator

Usage Examples

>>> import sudachi_b_imitator
>>> tagger = sudachi_b_imitator.Tagger()
>>> sample_text = '使い方のサンプルです。'
>>> tagger.parse(sample_text)
[('使い方', '名詞'), ('の', '助詞'), ('サンプル', '名詞'), ('です', '助動詞'), ('。', '補助記号')]

Input string length must be 256 or less.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sudachi_b_imitator-0.2.1.tar.gz (3.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sudachi_b_imitator-0.2.1-py3-none-any.whl (3.1 MB view details)

Uploaded Python 3

File details

Details for the file sudachi_b_imitator-0.2.1.tar.gz.

File metadata

  • Download URL: sudachi_b_imitator-0.2.1.tar.gz
  • Upload date:
  • Size: 3.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for sudachi_b_imitator-0.2.1.tar.gz
Algorithm Hash digest
SHA256 e1a2efff7f50e7ec432d714d41db7de1058e2a64e9d3239f337c1a15a3258dc0
MD5 f10ccdb59d59bca7e1fb9a46a96cf0be
BLAKE2b-256 5acb0345207ae24e6594d1bfbfa0dc6c7514cf722b10ee8ac68cac8067e7976b

See more details on using hashes here.

File details

Details for the file sudachi_b_imitator-0.2.1-py3-none-any.whl.

File metadata

File hashes

Hashes for sudachi_b_imitator-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 4da954edc4150f1a791752de5829ddbbe76e411ff56dd8b62bc910359063e468
MD5 38987a4cd730a00c24c8df0a4c25aa52
BLAKE2b-256 b702550e97f84084bbb790087c193fdbbeb88d0184cb79fabb37c9f204223f67

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page