Skip to main content

Small library for encoding numbers to words and decoding it back.

Project description

word4num

Small library for encoding numbers to words and decoding it back. What can be encoded - any short code, phone number, decimal degrees geolocation, ip address etc. Please share thoughts, ideas and suggestions and feel free to contact me.

Installation

# install from PyPI
pip install word4num

Idea

Define a unified mapping from number to words and promote single configuration for similar use cases that combination of words will work in the same way. For example: words combination generated in Google Maps will point to the same location in Apple Maps

Usage

Simple use:

import word4num
# initialise converter:
# '1###' - number mask, # - means that it could be any digit
# 'en' - language (English is only currently supported)
# 4 - max length of one word, could be up to 10
converter=word4num.W4NConverter('1###', 'en', 4)
words=converter.encode_number('1234')
print(converter.decode_words(words[0]))

Use with modifiers:

import word4num
# initialise converters:
# '1####' - number mask, # - means that it could be any digit
# 'en' - language (English is only currently supported)
# 5 - max length of one word, could be up to 10
converter=word4num.W4NConverter('1####', 'en', 5)

# in case you don't need a precise number matching and length of words combination is more important
# could be used for decimal degrees geolocation
decreased_precision_converter=word4num.W4NConverter('1####', 'en', 5,lambda x: str(int(int(x)/5)),lambda x: str(int(x)*5) )

# encode same number using both converters
words=converter.encode_number('12344')
decreased_precision_words=decreased_precision_converter.encode_number('12344')

#compare length of words combination
print(len(words[0])>len(decreased_precision_words[0]))

# get decoded numbers
print(converter.decode_words(words[0]))
print(decreased_precision_converter.decode_words(decreased_precision_words[0]))

Other methods and available attributes:

import word4num
# initialise converters:
# '1####' - number mask, # - means that it could be any digit
# 'en' - language (English is only currently supported)
# 5 - max length of one word, could be up to 10
converter=word4num.W4NConverter('1####', 'en', 5)

# configured mask
converter.number_mask

#maximum convertible number for configured converter
converter.max_number

#maximum number of words in combination - useful with converter.word_to_num when you want to combine several results of encoding
converter.words_number

#dictionaries of configured converter
converter.word_to_num
converter.num_to_word

#functions that were passed to the converter
converter.modification_to
converter.modification_from
#length of dictionary for configured converter
converter.get_map_length()

Examples

  1. examples/4WordsGeo.py : Encoding and decoding location in decimal degrees - can be used in new possible substitution of What3Words Their solution has problems in dictionary with ambiguously sound words and being proprietary - https://en.wikipedia.org/wiki/What3words Use of 4 short and simple words could work better - room for discussion. Example: 10.000000, 0.000001 -> ['timid', 'made', 'tip', 'barn']
  2. examples/LocalPhoneWords.py : Encoding local telephone numbers in UK format into words. Example: (027) 4193 2856 -> ['mew', 'atom', 'pick']
  3. examples/IntPhoneWords.py : Encoding international telephone numbers into words Example: (+44) (0)20 7930 4832 -> ['prim', 'enjoy', 'feuds', 'got']

Reasons for 2 and 3 example: Words Carry Meaning: Words have meaning, context, and imagery associated with them, making them easier to remember. For example, words like "sun", "car", and "tree" evoke clear mental images or associations. Numbers Are Abstract: An 11-digit number like "12345678901" lacks inherent meaning and context, making it more difficult to recall without using specific memory techniques such as chunking or mnemonics. Chunking: The brain naturally chunks information for easier recall. While numbers can be chunked (e.g., phone numbers), it's generally easier for the brain to chunk and recall a few meaningful words than a long string of digits.

  1. examples/IPv4toWords.py : Can be implemented as a simple local DNS for temporarily used IP addresses, JFF (e.g. web browser). Ports can be supported using 4 words. Example: 255.255.255.255 -> ['sting', 'year', 'soy']
  2. examples/PostcodeWords.py : Encoding UK postcodes into 3 words that are easy to remember Example: SE10 0TY -> ['neuron', 'toy', 'game']

Words dictionary generation

  1. English words were generated using NLTK using Brown Corpus.
  2. Then stop words, proper names, plural words, abbreviations were filtered out to keep only nouns, verbs, adjectives, and adverbs.
  3. Words frequencies were used to order list of words.
  4. The resulting list was separated to 3,4,5,6,7,8,9,10... letters words to use shortest in some cases.

TBD

  1. Support other languages
  2. Add checks and exceptions
  3. Add unit tests

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

word4num-0.0.5.tar.gz (242.4 kB view details)

Uploaded Source

Built Distribution

word4num-0.0.5-py3-none-any.whl (256.4 kB view details)

Uploaded Python 3

File details

Details for the file word4num-0.0.5.tar.gz.

File metadata

  • Download URL: word4num-0.0.5.tar.gz
  • Upload date:
  • Size: 242.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.19

File hashes

Hashes for word4num-0.0.5.tar.gz
Algorithm Hash digest
SHA256 a67349aa1c295806ae4d29cee341137c759fef477641d422e2e5758360233ebd
MD5 a120cafc91d8edb1c2204c661251a4b3
BLAKE2b-256 48937aec2b2ea23e797204480089f35b754847292aa083bfbe180c3950698c01

See more details on using hashes here.

File details

Details for the file word4num-0.0.5-py3-none-any.whl.

File metadata

  • Download URL: word4num-0.0.5-py3-none-any.whl
  • Upload date:
  • Size: 256.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.19

File hashes

Hashes for word4num-0.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 7c4bde41732f4b9ca169dba61cea6f1519a2859ec42f20c7321601b7fb2162a3
MD5 3cf55b718785132bccbed13673ff2d33
BLAKE2b-256 f1cd9e5530f95a5069726caec7abd8f7717c484bb7ed4e8bb7f9710b9ea424bd

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page