Sudachi C Imitator Transformers

This package imitates the Japanese morphological analysis of Sudachi (SplitMode.C with SudachiDict-full) using a small transformers model.
It adds tokenization and part-of-speech estimation to your environment
with only 4 MB of additional disk space, provided transformers is already installed.

Installation

$ pip install sudachi_c_imitator_transformers

Usage Examples

>>> import sudachi_c_imitator_transformers
>>> tagger = sudachi_c_imitator_transformers.Tagger()
>>> sample_text = '使い方のサンプルです。'
>>> tagger.parse(sample_text)
[('使い方', '名詞'), ('の', '助詞'), ('サンプル', '名詞'), ('です', '助動詞'), ('。', '補助記号')]
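
As the output above shows, `parse` returns a list of `(surface, part-of-speech)` tuples. The sketch below shows one way such a list might be post-processed. It uses the sample output copied from above as a literal so that it runs without loading the model; the variable names are illustrative only.

```python
# Sample result copied from the usage example above.
result = [('使い方', '名詞'), ('の', '助詞'), ('サンプル', '名詞'),
          ('です', '助動詞'), ('。', '補助記号')]

# Surface forms only, i.e. a plain tokenization of the input.
surfaces = [surface for surface, _pos in result]

# Keep only tokens tagged as nouns (名詞).
nouns = [surface for surface, pos in result if pos == '名詞']

print(surfaces)
print(nouns)
```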

Input string length must be 256 or less.
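
Longer text therefore has to be split before parsing. The following is a minimal sketch of one possible approach, not part of this package: `split_text` and `parse_long` are hypothetical helpers, and the sketch assumes the limit is measured with Python's `len()`. Chunks are cut after a sentence delimiter where possible; when none is found, the cut falls at the limit and may split a word.

```python
from typing import Callable, List, Tuple

MAX_LEN = 256  # limit stated above; assumed here to be measured with len()
Token = Tuple[str, str]


def split_text(text: str, max_len: int = MAX_LEN,
               delimiters: str = "。！？\n") -> List[str]:
    """Split text into chunks of at most max_len characters,
    preferring to cut just after a sentence delimiter."""
    chunks = []
    start = 0
    while start < len(text):
        end = min(start + max_len, len(text))
        if end < len(text):
            # Find the last delimiter inside the window, if any.
            cut = max(text.rfind(d, start, end) for d in delimiters)
            if cut >= start:
                end = cut + 1
        chunks.append(text[start:end])
        start = end
    return chunks


def parse_long(text: str, parse: Callable[[str], List[Token]]) -> List[Token]:
    """Apply a parse function chunk by chunk and concatenate the results."""
    tokens: List[Token] = []
    for chunk in split_text(text):
        tokens.extend(parse(chunk))
    return tokens


# Hypothetical usage with the tagger from the example above:
#   tagger = sudachi_c_imitator_transformers.Tagger()
#   tokens = parse_long(long_text, tagger.parse)
```

Passing the parse function as an argument keeps the helper independent of the model, so it can be exercised with a stub before the tagger is loaded.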
