
A new way to encode words and calculate their similarity.

Project description



This application is a Python package for encoding words and comparing them by similarity. Each word is encoded as a matrix of 0s and 1s called a "WordFP": the first column marks which letters are present in the word, and the remaining columns, from the second to the last, mark the position of each letter in the word. Similar words are found using four metrics: geometric, arithmetic, Tanimoto, and Tversky. A Jupyter notebook with a usage example is available at examples/how_to_use.ipynb.
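To make the encoding concrete, here is a minimal standalone sketch of the idea described above. It does not use the wordfp package and is not its implementation. The names `encode`, `on_bits`, `tanimoto`, and `tversky`, the lowercase a-z alphabet, the one-row-per-letter layout, and the `max_len` cutoff are all assumptions made for illustration. Only the Tanimoto and Tversky metrics are shown, in their standard set-based forms.

```python
import string

ALPHABET = string.ascii_lowercase  # assumption: lowercase a-z only


def encode(word, max_len=12):
    """Return a 26 x (1 + max_len) matrix of 0/1 values as a list of rows."""
    matrix = [[0] * (1 + max_len) for _ in ALPHABET]
    for pos, letter in enumerate(word.lower()[:max_len]):
        if letter not in ALPHABET:
            continue  # skip characters outside the assumed alphabet
        row = ALPHABET.index(letter)
        matrix[row][0] = 1        # column 0: the letter occurs in the word
        matrix[row][pos + 1] = 1  # column pos + 1: the letter sits at index pos
    return matrix


def on_bits(matrix):
    """Set of (row, column) coordinates whose value is 1."""
    return {(r, c) for r, row in enumerate(matrix) for c, v in enumerate(row) if v}


def tanimoto(a, b):
    """Shared on-bits divided by the union of on-bits."""
    x, y = on_bits(a), on_bits(b)
    union = x | y
    return len(x & y) / len(union) if union else 0.0


def tversky(a, b, alpha=0.5, beta=0.5):
    """Shared on-bits divided by shared plus weighted unshared on-bits."""
    x, y = on_bits(a), on_bits(b)
    common = len(x & y)
    denom = common + alpha * len(x - y) + beta * len(y - x)
    return common / denom if denom else 0.0


print(tanimoto(encode("cat"), encode("car")))  # 0.5
```

In this sketch "cat" and "car" each set six bits and share four of them (the presence and position bits for "c" and "a"), so the Tanimoto score is 4 / 8. With alpha = beta = 1 the Tversky score reduces to the Tanimoto score.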

Another way to use this package is through the WordFP web app. The web app can also be run locally by following the steps below.


Via pip

$ pip install wordfp


Via github

$ git clone
$ cd WordFP
$ python3 install

Install and Run the Web App Locally

The web application is in the "app/" folder. Install dependencies:

$ pip install streamlit wordfp

To run:

$ cd .../app/
$ streamlit run


This project grew out of an idea I had at a random moment while working on my PhD. I hope it can help someone someday in areas such as natural language processing (NLP).


Download files


Source Distribution

WordFP-1.0.2.linux-x86_64.tar.gz (1.8 MB)

Uploaded Source

Built Distribution

WordFP-1.0.2-py3-none-any.whl (1.8 MB)

Uploaded Python 3
