Skip to main content

Package to convert numbers to words with support of multiple indian languages.

Project description

indic-num2words - Convert Numbers to Words for Indian Languages

The indic-num2words module converts numerical digits like 36 into words like छत्तीस in various Indian languages.

Installation

To install the latest stable release from PyPI:

pip install indic-num2words

Usage

The module provides a single function for converting numbers to words:

from num_to_words import num_to_word

# Examples:
print(num_to_word(36, lang='hi'))  # Output: छत्तीस
print(num_to_word('४५', lang='hi'))  # Output: पैंतालीस
print(num_to_word("35,43,57,730", lang='hi'))  # Output: पैंतीसकरोड़ तैंतालीसलाख सत्तावनहज़ार सातसौ तीस
print(num_to_word(795, lang='kn', separator='-'))  # Output: ಏಳುನೂರ-ತೊಂಬತ್ತೈದು
print(num_to_word(545589, lang='en', separator=', ', combiner='-'))  # Output: five-lakh, forty-five-thousand, five-hundred, eighty-nine

Function Parameters

  • num (int, str): The number to be converted. This can be an integer, string of digits, or a formatted number string.
  • lang (str): The language code for the desired output language. Supported languages are listed below.
  • separator (str, optional): A string to use as a separator between number groups. Default is a space.
  • combiner (str, optional): A string to combine words in complex numbers. Default is a space.

Supported Languages

The module currently supports conversion to the following languages:

  • en (English-India)
  • hi (Hindi)
  • gu (Gujarati)
  • mr (Marathi)
  • bn (Bengali)
  • te (Telugu)
  • ta (Tamil)
  • kn (Kannada)
  • or (Oriya)
  • pa (Punjabi)

Use Cases

  1. Speech Recognition Pre-processing: Convert numbers to words to improve speech recognition accuracy.
  2. Language Modeling Data Pre-processing: Prepare text data for training language models by converting numerical data to words.

What's Next

We are planning to add support for the following languages:

  • Malayalam
  • Urdu
  • Assamese

If you'd like to contribute or request support for another language, check utils/constants.py to see how you can add support. It's designed to be straightforward.

Contributing

We welcome contributions! If you'd like to help out, feel free to submit pull requests or raise issues on our GitHub repository.


Feel free to reach out if you have any questions or need further assistance. Happy coding!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

indic_num2words-1.2.2.tar.gz (16.8 kB view details)

Uploaded Source

Built Distribution

indic_num2words-1.2.2-py3-none-any.whl (16.6 kB view details)

Uploaded Python 3

File details

Details for the file indic_num2words-1.2.2.tar.gz.

File metadata

  • Download URL: indic_num2words-1.2.2.tar.gz
  • Upload date:
  • Size: 16.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for indic_num2words-1.2.2.tar.gz
Algorithm Hash digest
SHA256 f91dfb849f7e7e1441bdcb85ef526d105435a3273636cc7624876d4fed47fc69
MD5 bca2221601ddd264ed3173c12fe0d731
BLAKE2b-256 42a0d6c3cac5cce0d74a3249a84e16fc81280e0aac715fad839647ac6788890b

See more details on using hashes here.

File details

Details for the file indic_num2words-1.2.2-py3-none-any.whl.

File metadata

File hashes

Hashes for indic_num2words-1.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 41c1713388d345b34e548c022aef0bbeb543c14e44cb67423832d76089ef2398
MD5 7bf44d598501319f79241260ce7dad08
BLAKE2b-256 5653645b77f7460233fbedfd97d95ed5fe68da8baa384caf0ce40948f207ea02

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page