Skip to main content

文字/漢語發音/注音互轉。相似發聲字/單詞。

Project description

Python Chinese Pronounce

  • 文字轉注音、漢語發音
  • 注音、漢語發音轉文字
  • 尋找相似、相同發聲字/單詞

資料來源:政府開放資料, 超齊百萬字典檔, 教育部國語辭典公眾授權網

Install

From PyPI

pip install py-chinese-pronounce

From Repo

pip install -U git+https://github.com/p208p2002/py-chinese-pronounce.git

Usage

from py_chinese_pronounce import Word2Pronounce,Pronounce2Word

w2p = Word2Pronounce()
p2w = Pronounce2Word()

Word2Pronounce

字元轉注音

w2p.to_chewin("我") # ㄨㄛˇ

字元轉漢語發音

w2p.to_han("我") # wo3

句子轉注音

w2p.sent_to_chewin("你來扮演這個角色")
# ['ㄋㄧˇ', 'ㄌㄞˊ', 'ㄅㄢˋ', 'ㄧㄢˇ', 'ㄓㄜˋ', '˙ㄍㄜ', 'ㄐㄩㄝˊ', 'ㄙㄜˋ']

句子轉漢語發音

w2p.sent_to_han("你來扮演這個角色")
# ['ni3', 'lai2', 'ban4', 'yan3', 'zhe4', 'ge5', 'jue2', 'se4']

感謝 @Evanston0624 實作此功能

發音相似度比較

w2p.char_pronounce_similar(a,b)
w2p.sent_pronounce_similar(sent_a, sent_b)
# 數值區間落在[0-1]

其他轉換

  • Word2Pronounce._word2unicode(self, x)
  • Word2Pronounce._uni2word(self,uni)
  • Word2Pronounce._cns2word(self,cns)
  • Word2Pronounce._uni2cns(self, uni)

CNS: 中文標準交換碼

Pronounce2Word

注音找字元

p2w.chewin2word("ㄨㄛˇ") 
# ['䰀', '婑', '捰', '㦱', '我', '䂺']

漢語發音找字元

p2w.han2word("wo3")
# ['䰀', '婑', '捰', '㦱', '我', '䂺']

字元找同發音

p2w.find_same("我")
# ['䰀', '婑', '捰', '㦱', '䂺']

字元找近似發音

p2w.find_similar("我")
# ['蠖', '臥', '䇶', '䂺', '䪝', '捾', '偓', '握', '捰', '卧', '雘', '㦱', '濣', '䠎', '楃', '沃', '渥', '䁊', '涴', '幄', '龌', '㓇', '矱', '斡', '㠛', '肟', '齷', '仴', '䰀', '婑', '喔', '腛', '䀑']

相似發聲詞

p2w.find_similar_vocab("汽車") # 去除聲調找相似
# ['七尺', '棋車', '棋车', '气车', '氣車', '汽车', '騎車', '骑车']
p2w.find_similar_vocab_level("清晨",level=1) # 發音編輯距離找相似
# ['傾城', '清城', '清澄', '青城', '清淳', '清純', '清纯', '清醇', '清神', '青神', '星塵', ...

相同發聲詞

p2w.find_same_vocab("汽車")
# ['气车', '氣車', '汽车']

其他轉換

  • Pronounce2Word._find_similar_han_pronounces(self,han,level=1)

    尋找相似發音

    • han: 漢語發音
    • level: 編輯距離(越大越寬鬆)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

py_chinese_pronounce-0.1.14.tar.gz (19.2 MB view details)

Uploaded Source

Built Distribution

py_chinese_pronounce-0.1.14-py3-none-any.whl (19.2 MB view details)

Uploaded Python 3

File details

Details for the file py_chinese_pronounce-0.1.14.tar.gz.

File metadata

  • Download URL: py_chinese_pronounce-0.1.14.tar.gz
  • Upload date:
  • Size: 19.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.10.13 Linux/6.5.0-1022-azure

File hashes

Hashes for py_chinese_pronounce-0.1.14.tar.gz
Algorithm Hash digest
SHA256 ff81666e813e7b48aba96638c451765d5449dbe1721ae40148e87b26c31f66ef
MD5 1555d8f453dcb451b38232db900cccc6
BLAKE2b-256 7843aee4eb0f5bd2771068e12c5b6f4417ec30c93fdbcb38db4e6dc80f4b9695

See more details on using hashes here.

File details

Details for the file py_chinese_pronounce-0.1.14-py3-none-any.whl.

File metadata

File hashes

Hashes for py_chinese_pronounce-0.1.14-py3-none-any.whl
Algorithm Hash digest
SHA256 c53e7e42b106e7686879a487349ddc50473c904c45610e889325e84fd09e7b17
MD5 61509f2d489a0f91c6920110d5692830
BLAKE2b-256 cffa55c49602843699d17a0e233f091140749bf62385b67f16e8654426a8cc48

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page