Korean speech/NLP tools
Project description
Korean Speech Tools
This package contains a variety of tools for Korean speech and language processing, including:
- Hangul romanization.
- Jamo conversion.
- Grapheme-to-phoneme conversion (G2P).
It is mostly a collection of previously existing libraries that are unmaintained or include unnecessary dependencies that have been removed here. Refer to the Credits section below for details.
Installation
pip install ko-speech-tools
The core part of this package does not have any external dependencies. For G2P, the mecab-ko package is required, which can be installed with:
pip install ko-speech-tools[g2p]
Usage
Romanization:
>>> from ko_speech_tools import hangul_romanize
>>> hangul_romanize("물엿")
'mul-yeos'
Jamo conversion (documentation):
>>> from ko_speech_tools.jamo import h2j, j2hcj, j2h
>>> h2j('한굴')
'한굴'
>>> j2hcj(h2j('한굴'))
'ㅎㅏㄴㄱㅜㄹ'
>>> j2h('ㅇ', 'ㅕ', 'ㅇ')
'영'
G2P:
>>> from ko_speech_tools import G2p
>>> g2p = G2p()
>>> g2p("것입니다")
'거심니다'
Credits
This package combines and adapts the following packages:
g2pkk: https://github.com/harmlessman/g2pkk (Apache-2.0), a fork of https://github.com/Kyubyong/g2pKhangul_romanize: https://github.com/youknowone/hangul-romanize (BSD-2-Clause)jamo: https://github.com/jdongian/python-jamo (Apache-2.0)
It additionally uses code from https://github.com/keithito/tacotron (MIT) to read CMUdict data.
The respective code can be used under the original license, see the individual files for details. Any new code is made available under Apache-2.0.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ko_speech_tools-0.1.0.tar.gz.
File metadata
- Download URL: ko_speech_tools-0.1.0.tar.gz
- Upload date:
- Size: 987.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.8.22
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
dcc5af8bd9f2a9d2260dcc36617d6de65106be625958302525068e81f94b2b1d
|
|
| MD5 |
e33e72991a1cf445b1daac96714ed404
|
|
| BLAKE2b-256 |
41fa51b14366b2a1bb958fb748be67a58519ac4b5aeea0faa3b31c8431f63b2c
|
File details
Details for the file ko_speech_tools-0.1.0-py3-none-any.whl.
File metadata
- Download URL: ko_speech_tools-0.1.0-py3-none-any.whl
- Upload date:
- Size: 997.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.8.22
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0fa57d6eb1d5a126a2f10df93dd21355298839c1048865ead1b4bb07695e3f33
|
|
| MD5 |
e1977a85daafc3aeec64f3064966c826
|
|
| BLAKE2b-256 |
e859f0cb1c0d5632ee647129e8272c3534554b395be013a134534fdc03e29914
|