No project description provided
Project description
kuro2sudachi
kuro2sudachi lets you to convert kuromoji user dict to sudachi user dict.
Usage
$ pip install kuro2sudachi
# prepase riwirte.def
# https://github.com/WorksApplications/Sudachi/blob/develop/src/main/resources/rewrite.def
$ ls
rewiite.def
$ kuro2sudachi kuromoji_dict.txt -o sudachi_user_dict.txt
Develop
test kuro2sudachi
$ poetry install
$ poetry run pytest
exec kuro2sudachi command
$ poetry run kuro2sudachi tests/kuromoji_dict_test.txt -o sudachi_user_dict.txt
Supported pos
* 固有名詞 -> 名詞,固有名詞,一般,*,*,*
* 名詞 -> 名詞,普通名詞,一般,*,*,*
* 記号 -> 記号,一般,*,*,*,*
* 形容詞 -> 形容詞,一般,*,*,*,*
* 副詞 -> 副詞,*,*,*,*,*
* 動詞 -> 動詞,一般,*,*,*,*
if you want to ignore unsupported pos error, use --ignore
flag.
TODO
- split mode
- change connection cost
- supports many pos
- supports custom dict converts pos
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
kuro2sudachi-0.2.0.tar.gz
(3.6 kB
view hashes)
Built Distribution
Close
Hashes for kuro2sudachi-0.2.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 65724267458fda7d851156b0b369849dbce41a0aac6b4d30a63169638d9349a9 |
|
MD5 | cb0e6214eb4c96eb4254d440572eb551 |
|
BLAKE2b-256 | 66e65c811c023f000d62e797d7ec56f48a2a483aa97d10a9953ae45b2ed338f2 |