
Fast multilingual text-to-phoneme converter for South East Asian languages.


🦭 SEA-G2P


Author: Pham Nguyen Ngoc Bao

Installation

pip install sea-g2p

espeak-ng is optional. It is used only as a fallback for words missing from the built-in dictionary, which already covers ~99.9% of words.
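
To see whether the optional fallback is likely to be usable on a machine, one option is to look for the espeak-ng executable on PATH. This is a minimal sketch using only the standard library. It assumes the fallback relies on an `espeak-ng` binary discoverable on PATH, which this README does not confirm, and the helper name `espeak_ng_on_path` is hypothetical.

```python
import shutil


def espeak_ng_on_path() -> bool:
    """Return True if an `espeak-ng` executable is discoverable on PATH.

    Assumption: the sea-g2p fallback needs this binary on PATH.
    The README only states that espeak-ng is required for fallback.
    """
    return shutil.which("espeak-ng") is not None


if espeak_ng_on_path():
    print("espeak-ng found on PATH")
else:
    print("espeak-ng not found on PATH; words outside the dictionary may not be covered")
```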

Usage

Simple Pipeline

from sea_g2p import SEAPipeline

pipeline = SEAPipeline(lang="vi")
result = pipeline.run("Giá SP500 hôm nay là 4.200,5 điểm.")
print(result)
# zˈaːɜ ˈɛɜt̪ pˈe nˈam tʃˈam hˈom nˈaj lˌaː2 bˈoɜn ŋˈi2n hˈaːj tʃˈam fˈəɪ4 nˈam ɗˈiɛ4m.

Individual Modules

from sea_g2p import Normalizer, G2P

normalizer = Normalizer(lang="vi")
g2p = G2P(lang="vi")

text = "Giá SP500 hôm nay là 4.200,5 điểm"

# Step 1: expand numbers and abbreviations into words.
normalized = normalizer.normalize(text)
print(normalized)
# giá ét pê năm trăm hôm nay là bốn nghìn hai trăm phẩy năm điểm.

# Step 2: convert the normalized text to phonemes.
phonemes = g2p.convert(normalized)
print(phonemes)
# zˈaːɜ ˈɛɜt̪ pˈe nˈam tʃˈam hˈom nˈaj lˌaː2 bˈoɜn ŋˈi2n hˈaːj tʃˈam fˈəɪ4 nˈam ɗˈiɛ4m.
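
The normalized output reads "4.200,5" as "bốn nghìn hai trăm phẩy năm", so the dot is treated as a thousands separator and the comma as the decimal mark. The sketch below shows one way such a token could be parsed before being spelled out. It is an illustration only, not the library's implementation, and `parse_vi_number` and `VI_NUMBER` are hypothetical names.

```python
import re
from decimal import Decimal

# Either digits grouped in threes by dots (4.200) or plain digits (4200),
# optionally followed by a comma and a fractional part (,5).
VI_NUMBER = re.compile(r"^(\d{1,3}(?:\.\d{3})*|\d+)(?:,(\d+))?$")


def parse_vi_number(token: str) -> Decimal:
    """Parse a number written with '.' for thousands and ',' for decimals."""
    match = VI_NUMBER.match(token)
    if match is None:
        raise ValueError(f"not a Vietnamese-formatted number: {token!r}")
    integer_part = match.group(1).replace(".", "")  # drop thousands separators
    fraction_part = match.group(2)
    if fraction_part is None:
        return Decimal(integer_part)
    return Decimal(f"{integer_part}.{fraction_part}")


print(parse_vi_number("4.200,5"))  # 4200.5
```

Using `Decimal` rather than `float` keeps the digits exactly as written, which matters when each digit of the fraction is later read aloud.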

Features

  • Fast dictionary-based lookup using SQLite.
  • Vietnamese text normalization (numbers, dates, units).
  • Bilingual support (Vietnamese/English).
  • Batch processing for efficiency.
  • eSpeak-NG fallback for unknown words.
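
The lookup, batching, and fallback features can be pictured together with a small sketch built on Python's standard `sqlite3` module. Everything here is an assumption made for illustration: the `lexicon` table, its columns, and the `lookup_batch` helper are hypothetical and do not describe the package's actual schema or API. The placeholder fallback stands in for eSpeak-NG, and the phoneme strings are taken from the usage example above.

```python
import sqlite3
from typing import Callable, Dict, Iterable


def build_demo_lexicon() -> sqlite3.Connection:
    """Create a tiny in-memory word -> phonemes table (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE lexicon (word TEXT PRIMARY KEY, phonemes TEXT NOT NULL)"
    )
    conn.executemany(
        "INSERT INTO lexicon VALUES (?, ?)",
        [("hôm", "hˈom"), ("nay", "nˈaj"), ("năm", "nˈam")],
    )
    return conn


def lookup_batch(
    conn: sqlite3.Connection,
    words: Iterable[str],
    fallback: Callable[[str], str],
) -> Dict[str, str]:
    """Resolve many words with one query; call fallback for the misses."""
    unique = list(dict.fromkeys(words))  # de-duplicate, keep first-seen order
    if not unique:
        return {}
    placeholders = ",".join("?" * len(unique))
    rows = conn.execute(
        f"SELECT word, phonemes FROM lexicon WHERE word IN ({placeholders})",
        unique,
    ).fetchall()
    found = dict(rows)
    return {w: found[w] if w in found else fallback(w) for w in unique}


conn = build_demo_lexicon()
# Placeholder fallback; a real setup would call a phonemizer here.
result = lookup_batch(conn, ["hôm", "nay", "xyz", "hôm"], lambda w: f"<unk:{w}>")
print(result)
```

Issuing a single `IN (...)` query per batch avoids one round trip per word. For very large batches the word list would need to be chunked, since SQLite limits the number of bound parameters per statement.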

Development

To install for development purposes:

  1. Clone the repository:

    git clone https://github.com/pnnbao97/sea-g2p
    cd sea-g2p
    
  2. Install in editable mode:

    pip install -e .
    
