Skip to main content

Imitate Japanese morphological analysis of Sudachi SplitMode.B and SudachiDict-full with a small onnx model.

Project description

Sudachi B Imitator

This package imitates Japanese morphological analysis of Sudachi SplitMode.B and SudachiDict-full with a small onnx model.
You can add tokenization and part-of-speech estimation to your environment
with only 4MB additoinal disk space if you already have onnxruntime in your environment.

Installation

$ pip install sudachi_b_imitator

Usage Examples

>>> import sudachi_b_imitator
>>> tagger = sudachi_b_imitator.Tagger()
>>> sample_text = '使い方のサンプルです。'
>>> tagger.parse(sample_text)
[('使い方', '名詞'), ('の', '助詞'), ('サンプル', '名詞'), ('です', '助動詞'), ('。', '補助記号')]

Input string length must be 256 or less.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sudachi_b_imitator-0.2.0.tar.gz (3.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sudachi_b_imitator-0.2.0-py3-none-any.whl (3.1 MB view details)

Uploaded Python 3

File details

Details for the file sudachi_b_imitator-0.2.0.tar.gz.

File metadata

  • Download URL: sudachi_b_imitator-0.2.0.tar.gz
  • Upload date:
  • Size: 3.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for sudachi_b_imitator-0.2.0.tar.gz
Algorithm Hash digest
SHA256 71716a2e8132a86c4be09a0e15736470807829c4ab79c239816730d4e0536cd2
MD5 18b720d0bf3717fe5fc0b9452587ee53
BLAKE2b-256 09b64b1aa41b2148e1d95e01353e798b33273cd6d244b179270ddb3d307ba91b

See more details on using hashes here.

File details

Details for the file sudachi_b_imitator-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for sudachi_b_imitator-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ecac2ee3c156e2a88fb7c93ccd5865f9d76dbcd72f4a72f9188ecbdeef702635
MD5 cf5e1ff4781ac504b455fe6769246195
BLAKE2b-256 f5d9d02107b529aefb2a13e326c62a8102dc0db2f03bd3e7281bd62ba8d7aea4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page