Skip to main content

User-friendly library to find similar objects

Project description

FindSimilar

User-friendly library to find similar objects

You can find Full Project Documentation here

First you can try our DEMO WEBAPP for a quick introduction to the find-similar


Workflows

Tests Pylint

PyPi

Version Development Status Python version Wheel

Anaconda

Version Last Updated Platforms

License

License

Support

Documentation Discussions Issues

PyPi Downloads

Day Downloads Week Downloads Month Downloads

Anaconda Downloads

Anaconda

Languages

Languages Top Language

Development

  • Release date Last Commit
  • Issues Closed Issues
  • Pull Requests Closed Pull Requests
  • Discussions

Repository Stats

Stars Contributors Forks


Menu

Mission

The mission of the FindSimilar project is to provide a powerful and versatile open source library that empowers developers to efficiently find similar objects and perform comparisons across a variety of data types. Whether dealing with texts, images, audio, or more, our project aims to simplify the process of identifying similarities and enhancing decision-making.

Open Source Project

This is the open source project with MIT license. Be free to use, fork, clone and contribute.

Features

Find similar texts

  • on different languages
  • with or without stopwords
  • using dictionary (or not)
  • using keywords (or not)

Requirements

Development Status

Install

with pip

pip install find-similar

See more in Full Documentation

Quickstart

from find_similar import find_similar

texts = ['one two', 'two three', 'three four']

text_to_compare = 'one four'
find_similar(text_to_compare, texts, count=10)
[TokenText(text="one two", len(tokens)=2, cos=0.5), TokenText(text="three four", len(tokens)=2, cos=0.5), TokenText(text="two three", len(tokens)=2, cos=0)]
  • The result is the list of TokenText instances ordering by cos
  • cos is the mark of texts similarity

See more examples in Full Documentation

See the demonstration and mini tutorial in the Demo project

Contributing

You are welcome! To easy start please check:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

find-similar-2.1.0.tar.gz (16.6 kB view details)

Uploaded Source

Built Distribution

find_similar-2.1.0-py3-none-any.whl (17.0 kB view details)

Uploaded Python 3

File details

Details for the file find-similar-2.1.0.tar.gz.

File metadata

  • Download URL: find-similar-2.1.0.tar.gz
  • Upload date:
  • Size: 16.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.18

File hashes

Hashes for find-similar-2.1.0.tar.gz
Algorithm Hash digest
SHA256 2ba8cd010c12f654cb45ae0c63ae2e08dacf1b22605e560fa8f46ec3a2bd9d78
MD5 b57cfb9c58636808d93b4d33a317ee42
BLAKE2b-256 c2d1014775791bdadff82bd3073c8eddf2417c9bb69ccd68cb998ffcd22c12ae

See more details on using hashes here.

Provenance

File details

Details for the file find_similar-2.1.0-py3-none-any.whl.

File metadata

  • Download URL: find_similar-2.1.0-py3-none-any.whl
  • Upload date:
  • Size: 17.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.18

File hashes

Hashes for find_similar-2.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 366ac5b3fa01c7842219a9c82872e80fb911b33482b94cecc43f9078a726f0df
MD5 94cf414fab279280fb45146842d3b31f
BLAKE2b-256 1a4b82f46e34e6d8865b83d9b24af363e8412db4ab8ba5d9269cb4d9f42dc943

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page