User-friendly library to find similar objects
FindSimilar
See the Full Project Documentation for details.
For a quick introduction to find-similar, first try our DEMO WEBAPP.
Menu
- Mission
- Open Source Project
- Features
- Requirements
- Development Status
- Install
- Quickstart
- Contributing
Mission
The mission of the FindSimilar project is to provide a powerful and versatile open source library that empowers developers to efficiently find similar objects and perform comparisons across a variety of data types. Whether dealing with texts, images, audio, or more, our project aims to simplify the process of identifying similarities and enhancing decision-making.
Open Source Project
This is an open source project under the MIT license. Feel free to use, fork, clone, and contribute.
Features
Find similar texts
- in different languages
- with or without stopwords
- using dictionary (or not)
- using keywords (or not)
Requirements
- nltk, pymorphy3
- See more in Full Documentation
Development Status
- Package already available on PyPi
- See more in Full Documentation
Install
with pip
pip install find-similar
See more in Full Documentation
Quickstart
from find_similar import find_similar
texts = ['one two', 'two three', 'three four']
text_to_compare = 'one four'
find_similar(text_to_compare, texts, count=10)
[TokenText(text="one two", len(tokens)=2, cos=0.5), TokenText(text="three four", len(tokens)=2, cos=0.5), TokenText(text="two three", len(tokens)=2, cos=0)]
- The result is a list of TokenText instances ordered by cos
- cos is the measure of similarity between the texts
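To give a feel for what a cos score like the ones above can mean, here is a minimal pure-Python sketch. It assumes cos is the cosine similarity between the sets of whitespace-separated tokens of two texts. This is an assumption made for illustration, not the library's confirmed implementation: find-similar may tokenize differently (for example with stopwords, dictionaries, or keywords). The helper names `token_cosine` and `rank_by_cosine` are hypothetical and are not part of the find_similar API.

```python
import math


def token_cosine(a: str, b: str) -> float:
    """Cosine similarity between the token sets of two texts.

    Hypothetical helper: tokens are lowercase whitespace-separated words.
    """
    tokens_a = set(a.lower().split())
    tokens_b = set(b.lower().split())
    if not tokens_a or not tokens_b:
        return 0.0
    shared = len(tokens_a & tokens_b)
    # For binary (present/absent) token vectors, the dot product is the
    # number of shared tokens and each norm is sqrt(number of tokens).
    return shared / math.sqrt(len(tokens_a) * len(tokens_b))


def rank_by_cosine(query: str, texts: list, count: int = 10) -> list:
    """Return up to `count` (text, score) pairs, highest score first."""
    scored = [(text, token_cosine(query, text)) for text in texts]
    # sort() is stable, so texts with equal scores keep their input order.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:count]


texts = ['one two', 'two three', 'three four']
ranked = rank_by_cosine('one four', texts)
print(ranked)
# [('one two', 0.5), ('three four', 0.5), ('two three', 0.0)]
```

Under this assumption, 'one four' shares one of two tokens with 'one two' and with 'three four', giving 1 / sqrt(2 * 2) = 0.5, and shares nothing with 'two three', giving 0. These values agree with the sample output above.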
See more examples in Full Documentation
See the demonstration and mini tutorial in the Demo project
Contributing
Contributions are welcome! For an easy start, please check: