
Korean speech/NLP tools


Korean Speech Tools


This package contains a variety of tools for Korean speech and language processing, including:

  • Hangul romanization.
  • Jamo conversion.
  • Grapheme-to-phoneme conversion (G2P).

It is mostly a collection of existing libraries that are either unmaintained or pull in unnecessary dependencies; those dependencies have been removed here. Refer to the Credits section below for details.

Installation

pip install ko-speech-tools

The core of this package has no external dependencies. G2P additionally requires the mecab-ko package, which can be installed with:

pip install "ko-speech-tools[g2p]"
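Because G2P depends on an optional extra, calling code may want to check for it before use. The following is a minimal sketch using only the standard library; the helper name `has_module` is hypothetical, and the import name `"mecab"` is an assumption, since this README does not state which module name the mecab-ko package installs.

```python
from importlib.util import find_spec


def has_module(name: str) -> bool:
    """Return True if a top-level module called `name` can be imported."""
    # find_spec returns None when no such top-level module is installed.
    return find_spec(name) is not None


# "mecab" is an assumed import name for the mecab-ko package; adjust as needed.
if has_module("mecab"):
    print("G2P dependency found")
else:
    print("G2P dependency missing; install the g2p extra")
```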

Usage

Romanization:

>>> from ko_speech_tools import hangul_romanize

>>> hangul_romanize("물엿")
'mul-yeos'

Jamo conversion (documentation):

>>> from ko_speech_tools.jamo import h2j, j2hcj, j2h

>>> h2j('한굴')
'한굴'

>>> j2hcj(h2j('한굴'))
'ㅎㅏㄴㄱㅜㄹ'

>>> j2h('ㅇ', 'ㅕ', 'ㅇ')
'영'
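The conversions above rest on the arithmetic layout of precomposed Hangul syllables in Unicode: each syllable is encoded as 0xAC00 + (lead × 21 + vowel) × 28 + tail. The sketch below illustrates that arithmetic in plain Python. It is not the package's implementation; the names `decompose`, `compose`, and `to_compat_jamo` are hypothetical, and the sketch works with compatibility jamo (the form `j2hcj` returns in the example above) rather than the conjoining jamo produced by `h2j`.

```python
# Illustrative sketch only; not taken from ko_speech_tools.
HANGUL_BASE = 0xAC00  # code point of the first precomposed syllable, '가'
LEADS = "ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ"  # 19 initial consonants
VOWELS = "ㅏㅐㅑㅒㅓㅔㅕㅖㅗㅘㅙㅚㅛㅜㅝㅞㅟㅠㅡㅢㅣ"  # 21 medial vowels
# 27 final consonants, preceded by the empty string for "no final consonant".
TAILS = [""] + list("ㄱㄲㄳㄴㄵㄶㄷㄹㄺㄻㄼㄽㄾㄿㅀㅁㅂㅄㅅㅆㅇㅈㅊㅋㅌㅍㅎ")


def decompose(syllable: str) -> tuple[str, str, str]:
    """Split one precomposed Hangul syllable into (lead, vowel, tail)."""
    index = ord(syllable) - HANGUL_BASE
    if not 0 <= index < len(LEADS) * len(VOWELS) * len(TAILS):
        raise ValueError(f"not a precomposed Hangul syllable: {syllable!r}")
    lead, rest = divmod(index, len(VOWELS) * len(TAILS))
    vowel, tail = divmod(rest, len(TAILS))
    return LEADS[lead], VOWELS[vowel], TAILS[tail]


def compose(lead: str, vowel: str, tail: str = "") -> str:
    """Build a precomposed syllable from compatibility jamo."""
    index = (LEADS.index(lead) * len(VOWELS) + VOWELS.index(vowel)) * len(TAILS)
    return chr(HANGUL_BASE + index + TAILS.index(tail))


def to_compat_jamo(text: str) -> str:
    """Decompose every Hangul syllable in `text`; pass other characters through."""
    out = []
    for ch in text:
        if 0xAC00 <= ord(ch) <= 0xD7A3:
            out.extend(decompose(ch))
        else:
            out.append(ch)
    return "".join(out)


print(to_compat_jamo("한굴"))      # ㅎㅏㄴㄱㅜㄹ
print(compose("ㅇ", "ㅕ", "ㅇ"))  # 영
```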

G2P:

>>> from ko_speech_tools import G2p

>>> g2p = G2p()
>>> g2p("것입니다")
'거심니다'

Credits

This package combines and adapts the following packages:

It additionally uses code from https://github.com/keithito/tacotron (MIT) to read CMUdict data.

The respective code can be used under its original license; see the individual files for details. Any new code is made available under Apache-2.0.
