Skip to main content

文字/漢語發音/注音互轉。相似發聲字/單詞。

Project description

Python Chinese Pronounce

  • 文字轉注音、漢語發音
  • 注音、漢語發音轉文字
  • 尋找相似、相同發聲字/單詞

資料來源:政府開放資料, 超齊百萬字典檔, 教育部國語辭典公眾授權網

Install

From PyPI

pip install py-chinese-pronounce

From Repo

pip install -U git+https://github.com/p208p2002/py-chinese-pronounce.git

Usage

from py_chinese_pronounce import Word2Pronounce,Pronounce2Word

w2p = Word2Pronounce()
p2w = Pronounce2Word()

Word2Pronounce

字元轉注音

w2p.to_chewin("我") # ㄨㄛˇ

字元轉漢語發音

w2p.to_han("我") # wo3

句子轉注音

w2p.sent_to_chewin("你來扮演這個角色")
# ['ㄋㄧˇ', 'ㄌㄞˊ', 'ㄅㄢˋ', 'ㄧㄢˇ', 'ㄓㄜˋ', '˙ㄍㄜ', 'ㄐㄩㄝˊ', 'ㄙㄜˋ']

句子轉漢語發音

w2p.sent_to_han("你來扮演這個角色")
# ['ni3', 'lai2', 'ban4', 'yan3', 'zhe4', 'ge5', 'jue2', 'se4']

感謝 @Evanston0624 實作此功能

發音相似度比較

w2p.char_pronounce_similar(a,b)
w2p.sent_pronounce_similar(sent_a, sent_b)
# 數值區間落在[0-1]

其他轉換

  • Word2Pronounce._word2unicode(self, x)
  • Word2Pronounce._uni2word(self,uni)
  • Word2Pronounce._cns2word(self,cns)
  • Word2Pronounce._uni2cns(self, uni)

CNS: 中文標準交換碼

Pronounce2Word

注音找字元

p2w.chewin2word("ㄨㄛˇ") 
# ['䰀', '婑', '捰', '㦱', '我', '䂺']

漢語發音找字元

p2w.han2word("wo3")
# ['䰀', '婑', '捰', '㦱', '我', '䂺']

字元找同發音

p2w.find_same("我")
# ['䰀', '婑', '捰', '㦱', '䂺']

字元找近似發音

p2w.find_similar("我")
# ['蠖', '臥', '䇶', '䂺', '䪝', '捾', '偓', '握', '捰', '卧', '雘', '㦱', '濣', '䠎', '楃', '沃', '渥', '䁊', '涴', '幄', '龌', '㓇', '矱', '斡', '㠛', '肟', '齷', '仴', '䰀', '婑', '喔', '腛', '䀑']

相似發聲詞

p2w.find_similar_vocab("汽車") # 去除聲調找相似
# ['七尺', '棋車', '棋车', '气车', '氣車', '汽车', '騎車', '骑车']
p2w.find_similar_vocab_level("清晨",level=1) # 發音編輯距離找相似
# ['傾城', '清城', '清澄', '青城', '清淳', '清純', '清纯', '清醇', '清神', '青神', '星塵', ...

相同發聲詞

p2w.find_same_vocab("汽車")
# ['气车', '氣車', '汽车']

其他轉換

  • Pronounce2Word._find_similar_han_pronounces(self,han,level=1)

    尋找相似發音

    • han: 漢語發音
    • level: 編輯距離(越大越寬鬆)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

py_chinese_pronounce-0.1.13.tar.gz (12.5 MB view hashes)

Uploaded Source

Built Distribution

py_chinese_pronounce-0.1.13-py3-none-any.whl (12.5 MB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page