Skip to main content

A new way to encode words and similarity calculate.

Project description

DOI

WordFP

This application consists of a python package made to encode words and compare them through similarity calculation. The words are encoded in a matrix of 0's and 1's called "WordFP", where the first column refers to all the letters present in a word and the second column to the last refers to the position of a certain letter in a word. The search for similar words is calculated based on the metrics: geometric, arithmetic, tanimoto and tversky. A jupyter-notebook with an example of using this package is in the examples/how_to_use.ipynb directory

Another way to use this package is through of web app WordFP. It is possible to run locally too following the steps below.

Install

Via pip

$ pip install wordfp

or

Via github

$ git clone https://github.com/jeffrichardchemistry/WordFP
$ cd WordFP
$ python3 setup.py install

Install and Run WebAPP Locally

The web application is in the "app/app.py" folder. Install dependencies:

$ pip install streamlit wordfp

To run:

$ cd .../app/
$ streamlit run app.py

Considerations

This project was an idea I came up with at a random moment while studying my PhD work, I hope it can help someone someday in areas like natural language processing (NLP).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

WordFP-1.0.2.linux-x86_64.tar.gz (1.8 MB view details)

Uploaded Source

Built Distribution

WordFP-1.0.2-py3-none-any.whl (1.8 MB view details)

Uploaded Python 3

File details

Details for the file WordFP-1.0.2.linux-x86_64.tar.gz.

File metadata

  • Download URL: WordFP-1.0.2.linux-x86_64.tar.gz
  • Upload date:
  • Size: 1.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.8.10

File hashes

Hashes for WordFP-1.0.2.linux-x86_64.tar.gz
Algorithm Hash digest
SHA256 53de5d2475b20190c9cd0ad16ba8a34cd8cdd3d16f6c741ccf5773e0cfab8fbd
MD5 850fc1c334cf62e113f1e9d19dd04a82
BLAKE2b-256 1a39c5e633e3c5ee3591264896830161f7cf33a520c842298df48e5f9cd62913

See more details on using hashes here.

File details

Details for the file WordFP-1.0.2-py3-none-any.whl.

File metadata

  • Download URL: WordFP-1.0.2-py3-none-any.whl
  • Upload date:
  • Size: 1.8 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.8.10

File hashes

Hashes for WordFP-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 851b8dfe92431008d3b51b8b693050107a0d35ca3cca113c1945da7dbfc61a53
MD5 e02c3f7ae07857ea6b1939d3cd62cd6e
BLAKE2b-256 8e0ef449f306b1338577a5a7926b512e7f18dcfaea5db356a4a5d56ead10095a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page