Imitate Japanese morphological analysis of mecab and unidic_lite with a small transformers model.

Unidic Lite Imitator

This package imitates the Japanese morphological analysis of MeCab and unidic_lite using a small transformers model.
If transformers is already installed in your environment, you can add tokenization and part-of-speech estimation
with only 2 MB of additional disk space.

Installation

$ pip install unidic_lite_imitator_transformers

Usage Examples

>>> import unidic_lite_imitator_transformers
>>> tagger = unidic_lite_imitator_transformers.Tagger()
>>> sample_text = '使い方のサンプルです。'
>>> tagger.parse(sample_text)
[('使い', '動詞'), ('方', '接尾辞'), ('の', '助詞'), ('サンプル', '名詞'), ('です', '助動詞'), ('。', '補助記号')]
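
The result shown above is a plain list of (surface, part-of-speech) pairs, so ordinary Python is enough for post-processing. The following is a minimal sketch that works on a copy of that sample output, so it runs without the package installed. The helper name `surfaces_with_pos` is hypothetical and not part of the package.

```python
from collections import Counter

# Copied from the usage example: a list of (surface, part-of-speech) pairs.
parsed = [('使い', '動詞'), ('方', '接尾辞'), ('の', '助詞'),
          ('サンプル', '名詞'), ('です', '助動詞'), ('。', '補助記号')]


def surfaces_with_pos(pairs, pos):
    """Return the surface forms whose part-of-speech tag equals `pos`.

    Hypothetical helper for illustration; not provided by the package.
    """
    return [surface for surface, tag in pairs if tag == pos]


# Keep only the nouns (名詞).
nouns = surfaces_with_pos(parsed, '名詞')

# Count how often each part-of-speech tag occurs.
pos_counts = Counter(tag for _, tag in parsed)

# Space-separated tokenization ("wakati" style) built from the surfaces.
wakati = ' '.join(surface for surface, _ in parsed)
```

In practice `parsed` would be the return value of `tagger.parse(...)`, assuming it has the shape shown in the example.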

Input string length must be 192 or less.
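
Longer text therefore has to be split before parsing. The following is a minimal sketch of one way to do that, not something the package provides. It assumes the limit is counted in characters (Python `len`), and the function name `split_for_tagger` is hypothetical. It prefers to cut after sentence-ending punctuation and falls back to a hard cut only when a single sentence exceeds the limit.

```python
import re

MAX_LEN = 192  # limit stated above; assumed to be counted in characters


def split_for_tagger(text, max_len=MAX_LEN):
    """Split `text` into chunks of at most `max_len` characters.

    Hypothetical helper for illustration; not provided by the package.
    """
    # Split after sentence-ending punctuation, keeping it attached.
    sentences = [s for s in re.split(r'(?<=[。！？\n])', text) if s]
    chunks, current = [], ''
    for sentence in sentences:
        # A single over-long sentence is cut at fixed positions.
        while len(sentence) > max_len:
            if current:
                chunks.append(current)
                current = ''
            chunks.append(sentence[:max_len])
            sentence = sentence[max_len:]
        # Start a new chunk when the sentence no longer fits.
        if len(current) + len(sentence) > max_len:
            chunks.append(current)
            current = ''
        current += sentence
    if current:
        chunks.append(current)
    return chunks


long_text = 'あ' * 100 + '。' + 'い' * 100 + '。' + 'う' * 400
chunks = split_for_tagger(long_text)
```

Each chunk can then be passed to `tagger.parse` separately and the resulting lists concatenated. A hard cut may fall in the middle of a word, so tokens at such boundaries may differ from what a single pass would produce.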
