An NLP tool to transform numbers to Chinese characters
Project description
num2chinese
An NLP tool to transform numbers to Chinese
num2chinese uses regular expressions to parse alphanumeric literals and transform them into readable Chinese characters.
Why it matters
- Chinese pronunciation has many exceptions.
- How a digit in a Chinese number is read depends on its context.
- Handling messy Chinese number pronunciation takes many rules. Don't reinvent the wheel!
Examples
- $120 : 美金一百二十
- 200塊 : 兩百塊
- 12121212個蘋果 : 一千兩百一十二萬一千兩百一十二個蘋果
- 2002002支 : 兩百萬兩千零二支
- 9487 : 九四八七
- 080080123 : 零八零零八零一二三
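The quantity readings above follow a few positional rules: digits are grouped in fours (萬, 億), 2 reads as 兩 before 百 and 千, and a run of inner zeros collapses to a single 零. The sketch below is one way to reproduce the listed examples. It is not the library's implementation, and every name in it (`read_group`, `to_quantity`, `to_digits`) is hypothetical.

```python
DIGITS = "零一二三四五六七八九"
SMALL_UNITS = ["", "十", "百", "千"]   # units inside a four-digit group
BIG_UNITS = ["", "萬", "億", "兆"]      # one unit per four-digit group


def read_group(value, leading):
    """Read 1..9999. `leading` is True for the most significant group."""
    out = []
    pending_zero = False
    for pos, ch in enumerate(f"{value:04d}"):
        d = int(ch)
        unit = SMALL_UNITS[3 - pos]
        if d == 0:
            # An inner zero only matters once something has been read.
            pending_zero = bool(out)
            continue
        if pending_zero:
            out.append("零")
            pending_zero = False
        if d == 2 and unit in ("百", "千"):
            out.append("兩" + unit)
        elif d == 1 and unit == "十" and not out and leading:
            out.append("十")           # 12 reads 十二, but 212 reads 兩百一十二
        else:
            out.append(DIGITS[d] + unit)
    return "".join(out)


def to_quantity(n):
    """Read a non-negative integer as a quantity, e.g. 200 -> 兩百."""
    if n == 0:
        return DIGITS[0]
    groups = []
    while n > 0:
        n, rem = divmod(n, 10000)
        groups.append(rem)             # least significant group first
    if len(groups) > len(BIG_UNITS):
        raise ValueError("number too large for this sketch")
    out = []
    for idx in range(len(groups) - 1, -1, -1):
        value = groups[idx]
        if value == 0:
            continue
        if out and value < 1000:
            out.append("零")           # the thousands place of this group is empty
        if value == 2 and idx > 0:
            text = "兩"                # 兩萬, 兩億
        else:
            text = read_group(value, leading=not out)
        out.append(text + BIG_UNITS[idx])
    return "".join(out)


def to_digits(s):
    """Read a digit string one character at a time, e.g. '9487' -> 九四八七."""
    return "".join(DIGITS[int(c)] for c in s)
```

With these helpers, `to_quantity(2002002)` gives `兩百萬兩千零二` and `to_digits("9487")` gives `九四八七`, matching the examples. Deciding which of the two readings applies to a given literal is a separate problem that this sketch does not address.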
Usage
from num2chinese import Normalizer  # import path is an assumption; the original snippet omits it

text = '12121212個蘋果'
normalizer = Normalizer()
text_normalized = normalizer.normalize(text)
print(text_normalized)
# result is '一千兩百一十二萬一千兩百一十二個蘋果'
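The description says the library uses regular expressions to find numeric literals. As a rough illustration of that idea, the sketch below uses one pattern to pick out each number together with an optional currency sign or measure word, then labels how it would be read. The rule that a bare number is read digit by digit is inferred from the Examples section, and the pattern, the unit list, and the name `classify` are all assumptions rather than the library's actual API.

```python
import re

# Hypothetical pattern: optional "$", a run of digits, optional measure word.
# The measure words listed are only those that appear in the examples.
PATTERN = re.compile(r"(?P<currency>\$)?(?P<number>\d+)(?P<unit>[塊個支])?")


def classify(text):
    """Return (number, kind) pairs for each numeric literal in `text`."""
    results = []
    for match in PATTERN.finditer(text):
        number = match.group("number")
        if match.group("currency"):
            kind = "currency"          # e.g. $120 -> 美金一百二十
        elif match.group("unit") and not number.startswith("0"):
            kind = "quantity"          # e.g. 200塊 -> 兩百塊
        else:
            kind = "digits"            # e.g. 9487 -> 九四八七
        results.append((number, kind))
    return results
```

A real normalizer would pass each match to a reader for its kind and substitute the result back into the text, for example with `re.sub` and a callback.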
Installation
pip install num2chinese
Requirements
python>=3.6,<4.0
License
MIT license
Source Distribution
num2chinese-0.0.2.tar.gz (7.2 kB)
Built Distribution
num2chinese-0.0.2-py3-none-any.whl (7.6 kB)
File details
Details for the file num2chinese-0.0.2.tar.gz.
File metadata
- Download URL: num2chinese-0.0.2.tar.gz
- Upload date:
- Size: 7.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/4.6.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.0 CPython/3.8.2
File hashes
Algorithm | Hash digest
---|---
SHA256 | 229951cfcbd746337404588daa4a45a795fcad3dbc92fc64bd234fc16060f36b
MD5 | 2742a77e03a049230684df1fbbdc66a6
BLAKE2b-256 | d1ac2d5da1a9e801ad6ff91807aaa59676279365ce4de18e382dfa579a455f12
File details
Details for the file num2chinese-0.0.2-py3-none-any.whl.
File metadata
- Download URL: num2chinese-0.0.2-py3-none-any.whl
- Upload date:
- Size: 7.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/4.6.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.0 CPython/3.8.2
File hashes
Algorithm | Hash digest
---|---
SHA256 | c11b0aafc8eb95b3ef9c9348379be4725dc4daffee015d97d6d677decd5a099c
MD5 | 01ef2cd7dc00c5c216603f98e2f7f0d4
BLAKE2b-256 | 4c93da8f1fc032ddf1d0221bacf07efb8f3dd7ee054e1d453f7d6b77ba2dd72c
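The digests listed for each file can be checked after a manual download. The following is a minimal sketch using only the standard library `hashlib` module; the helper names `sha256_of` and `matches` are hypothetical, and the local file path is whatever location the file was saved to.

```python
import hashlib


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read in chunks so large files do not have to fit in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected_hex):
    """Compare a file's SHA256 digest with an expected hex string."""
    return sha256_of(path) == expected_hex.strip().lower()
```

Calling `matches` with the path of a downloaded file and the SHA256 value from the corresponding table returns `True` only when the file is byte-for-byte identical to the published one.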