Skip to main content

A nlp tool to transform numbers to Chinese characters

Project description

num2chinese

A nlp tool to transform numbers to Chinese

num2chinese uses regular expression to parse alphanumeric literals and transform them into readable Chinese charaters.

Why it matters

  • Chinese's pronuncication has lots of exceptions.
  • For Chinese numbers, a character is uttered dependent of context.
  • Lots of rules are required to handle messy Chinese number pronunciation. Dont' reinvent the wheel!

Examples

  • $120 : 美金一百二十
  • 200塊 : 兩百塊
  • 12121212個蘋果 : 一千兩百一十二萬一千兩百一十二個蘋果
  • 2002002支 : 兩百萬兩千零二支
  • 9487 : 九四八七
  • 080080123 : 零八零零八零一二

Usage

text = '12121212個蘋果''
normalizer = Normalizer()
text_normalized = normalizer.normalize(text)
print(text_normalized)
# result is '一千兩百萬十二萬一千兩百一十二個蘋果'

Installation

pip install num2chinese

Requirements

python>=3.6,<4.0

License

MIT license

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

num2chinese-0.0.2.tar.gz (7.2 kB view details)

Uploaded Source

Built Distribution

num2chinese-0.0.2-py3-none-any.whl (7.6 kB view details)

Uploaded Python 3

File details

Details for the file num2chinese-0.0.2.tar.gz.

File metadata

  • Download URL: num2chinese-0.0.2.tar.gz
  • Upload date:
  • Size: 7.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.6.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.0 CPython/3.8.2

File hashes

Hashes for num2chinese-0.0.2.tar.gz
Algorithm Hash digest
SHA256 229951cfcbd746337404588daa4a45a795fcad3dbc92fc64bd234fc16060f36b
MD5 2742a77e03a049230684df1fbbdc66a6
BLAKE2b-256 d1ac2d5da1a9e801ad6ff91807aaa59676279365ce4de18e382dfa579a455f12

See more details on using hashes here.

File details

Details for the file num2chinese-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: num2chinese-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 7.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.6.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.0 CPython/3.8.2

File hashes

Hashes for num2chinese-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 c11b0aafc8eb95b3ef9c9348379be4725dc4daffee015d97d6d677decd5a099c
MD5 01ef2cd7dc00c5c216603f98e2f7f0d4
BLAKE2b-256 4c93da8f1fc032ddf1d0221bacf07efb8f3dd7ee054e1d453f7d6b77ba2dd72c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page