Similarity metrics for bibliography
Project description
Hunahpu
Colav Similairy
Description
Package with customized colav similarity algorithm.
Installation
Package
pip install hunahpu
Usage
This is a library package, so you can use it in your code as follows:
from hunahpu.ColavSimilarity import ColavSimilarity
paper1 = {}
paper1['title'] = 'My title one'
paper1["journal"] = "Journal one"
paper1["year"] = 2016
paper2 = {}
paper2['title'] = 'My title two'
paper2["journal"] = "Jornal two"
paper2["year"] = 2016
if ColavSimilarity(paper1, paper2):
print("The papers are similar")
else:
print("The papers are not similar")
it also allows several options for tunning such as:
ratio_thold: int
threshold for ratio matric
partial_thold: int
threshold for partial ratio
low_thold: int
low threshold for ratios
use_translation : str
enable translation support
use_parsing: boolean
use parsing to remove unneeded characters
example:
from hunahpu.ColavSimilarity import ColavSimilarity
paper1 = {}
paper1['title'] = 'My title one'
paper1["journal"] = "Journal one"
paper1["year"] = 2016
paper2 = {}
paper2['title'] = 'My title two'
paper2["journal"] = "Jornal two"
paper2["year"] = 2016
if ColavSimilarity(paper1, paper2, ratio_thold=90, partial_thold=92, low_thold=92, use_translation=True, use_parsing=True):
print("The papers are similar")
else:
print("The papers are not similar")
NOTE: translation does not work in all cases, so it is not recommended to use it.
License
BSD-3-Clause License
Links
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Hunahpu-0.0.5a0.tar.gz
(5.4 kB
view details)
Built Distribution
File details
Details for the file Hunahpu-0.0.5a0.tar.gz
.
File metadata
- Download URL: Hunahpu-0.0.5a0.tar.gz
- Upload date:
- Size: 5.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 248ffc4aad06d13bb001f0a55cb125e587d7a39b660badcb52b53107ffdaa73d |
|
MD5 | 232dc6481e8f2644524a817fcf941f35 |
|
BLAKE2b-256 | 9c7e41d6c4d7285604800a07ce0cf1e8d30284aeba84896a2408c6383a9e54a7 |
File details
Details for the file Hunahpu-0.0.5a0-py3-none-any.whl
.
File metadata
- Download URL: Hunahpu-0.0.5a0-py3-none-any.whl
- Upload date:
- Size: 4.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | cd0c2a730e5cea2b3f1b59457f4c32fdb9520af1d76856864ca2d1c092f9e1f3 |
|
MD5 | dfc75aa0b54de1590bad8a4117d2ccfd |
|
BLAKE2b-256 | 769d0bb02c5ab1710ee77d6f0c94237b1ec9e9091e9bbc18f812f8c545aacf15 |