Skip to main content

Small library for encoding numbers to words and decoding it back.

Project description

word4num

Small library for encoding numbers to words and decoding it back. What can be encoded - any short code, phone number, decimal degrees geolocation, ip address etc. Please share thoughts, ideas and suggestions and feel free to contact me.

Installation

# install from PyPI
pip install word4num

Idea

Define a unified mapping from number to words and promote single configuration for similar use cases that combination of words will work in the same way. For example: words combination generated in Google Maps will point to the same location in Apple Maps

Usage

Simple use:

import word4num
# initialise converter:
# '1###' - number mask, # - means that it could be any digit
# 'en' - language (English is only currently supported)
# 4 - max length of one word, could be up to 10
converter=word4num.W4NConverter('1###', 'en', 4)
words=converter.encode_number('1234')
print(converter.decode_words(words[0]))

Use with modifiers:

import word4num
# initialise converters:
# '1####' - number mask, # - means that it could be any digit
# 'en' - language (English is only currently supported)
# 5 - max length of one word, could be up to 10
converter=word4num.W4NConverter('1####', 'en', 5)

# in case you don't need a precise number matching and length of words combination is more important
# could be used for decimal degrees geolocation
decreased_precision_converter=word4num.W4NConverter('1####', 'en', 5,lambda x: str(int(int(x)/5)),lambda x: str(int(x)*5) )

# encode same number using both converters
words=converter.encode_number('12344')
decreased_precision_words=decreased_precision_converter.encode_number('12344')

#compare length of words combination
print(len(words[0])>len(decreased_precision_words[0]))

# get decoded numbers
print(converter.decode_words(words[0]))
print(decreased_precision_converter.decode_words(decreased_precision_words[0]))

Other methods and available attributes:

import word4num
# initialise converters:
# '1####' - number mask, # - means that it could be any digit
# 'en' - language (English is only currently supported)
# 5 - max length of one word, could be up to 10
converter=word4num.W4NConverter('1####', 'en', 5)

# configured mask
converter.number_mask

#maximum convertible number for configured converter
converter.max_number

#maximum number of words in combination - useful with converter.word_to_num when you want to combine several results of encoding
converter.words_number

#dictionaries of configured converter
converter.word_to_num
converter.num_to_word

#functions that were passed to the converter
converter.modification_to
converter.modification_from
#length of dictionary for configured converter
converter.get_map_length()

Examples

  1. examples/4WordsGeo.py : Encoding and decoding location in decimal degrees - can be used in new possible substitution of What3Words Their solution has problems in dictionary with ambiguously sound words and being proprietary - https://en.wikipedia.org/wiki/What3words Use of 4 short and simple words could work better - room for discussion. Example: 10.000000, 0.000001 -> ['slit', 'elk', 'high', 'bent']
  2. examples/LocalPhoneWords.py : Encoding local telephone numbers in UK format into words. Example: (027) 4193 2856 -> ['lazy', 'pin', 'desk']
  3. examples/IntPhoneWords.py : Encoding international telephone numbers into words Example: (+44) (0)20 7930 4832 -> ['seaman', 'savory', 'beckon', 'say']

Reasons for 2 and 3 example: Words Carry Meaning: Words have meaning, context, and imagery associated with them, making them easier to remember. For example, words like "sun", "car", and "tree" evoke clear mental images or associations. Numbers Are Abstract: An 11-digit number like "12345678901" lacks inherent meaning and context, making it more difficult to recall without using specific memory techniques such as chunking or mnemonics. Chunking: The brain naturally chunks information for easier recall. While numbers can be chunked (e.g., phone numbers), it's generally easier for the brain to chunk and recall a few meaningful words than a long string of digits.

  1. examples/IPv4toWords.py : Can be implemented as a simple local DNS for temporarily used IP addresses, JFF (e.g. web browser). Ports can be supported using 4 words. Example: 255.255.255.255 -> ['mem', 'lamb', 'oft']
  2. examples/PostcodeWords.py : Encoding UK postcodes into 3 words that are easy to remember Example: SE10 0TY -> ['bulked', 'rip', 'deal']

Words dictionary generation

  1. English words were generated using NLTK using Brown Corpus.
  2. Then stop words, proper names, plural words, abbreviations were filtered out to keep only nouns, verbs, adjectives, and adverbs.
  3. Words frequencies were used to order list of words.
  4. The resulting list was separated to 3,4,5,6,7,8,9,10 letters words to use shortest in some cases.

TBD

  1. Find out if support is needed for longer words
  2. Support other languages
  3. Add checks and exceptions
  4. Add unit tests

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

word4num-0.0.4.tar.gz (242.4 kB view details)

Uploaded Source

Built Distribution

word4num-0.0.4-py3-none-any.whl (256.4 kB view details)

Uploaded Python 3

File details

Details for the file word4num-0.0.4.tar.gz.

File metadata

  • Download URL: word4num-0.0.4.tar.gz
  • Upload date:
  • Size: 242.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.19

File hashes

Hashes for word4num-0.0.4.tar.gz
Algorithm Hash digest
SHA256 83e2d5687cd82f510ed12cd7f91189468712923dd8b211969af2877eb4291f3d
MD5 61d0cbd2a0eb86cc7c0230dd685fbc90
BLAKE2b-256 7f46de6be9b3e997e581c162294c05f447ebe90811096d0c711d9d736292b773

See more details on using hashes here.

File details

Details for the file word4num-0.0.4-py3-none-any.whl.

File metadata

  • Download URL: word4num-0.0.4-py3-none-any.whl
  • Upload date:
  • Size: 256.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.19

File hashes

Hashes for word4num-0.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 4fcac234e0d7a383898bbff55259a60d68bdb403179832d3d1318328547d9ad3
MD5 5391aade20cc07b6538a0f88646282b4
BLAKE2b-256 bcc3e6496b56d885fd2c8c8818b81a1bbe274508f686a49018abe20df2900fb8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page