Similarity metrics for bibliography
Hunahpu
Colav Similarity
Description
Package with a customized Colav similarity algorithm for comparing bibliographic records.
Installation
Package
pip install hunahpu
Usage
This is a library package, so you can use it in your code as follows:
from hunahpu.ColavSimilarity import ColavSimilarity
paper1 = {}
paper1['title'] = 'My title one'
paper1["journal"] = "Journal one"
paper1["year"] = 2016
paper2 = {}
paper2['title'] = 'My title two'
paper2["journal"] = "Jornal two"
paper2["year"] = 2016
if ColavSimilarity(paper1, paper2):
    print("The papers are similar")
else:
    print("The papers are not similar")
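Since the call above is used as a yes/no test on two paper dictionaries, one natural use is deduplicating a list of records. The following is a minimal sketch of that pattern, not code from the package: `deduplicate` and `same_title_and_year` are hypothetical names, and the stand-in predicate is only there so the example runs without hunahpu installed.

```python
from typing import Callable, Dict, List

Paper = Dict[str, object]


def deduplicate(papers: List[Paper],
                is_similar: Callable[[Paper, Paper], bool]) -> List[Paper]:
    """Keep the first paper of every group that `is_similar` treats as the same work."""
    unique: List[Paper] = []
    for paper in papers:
        # Compare against the papers kept so far (quadratic number of checks).
        if not any(is_similar(paper, kept) for kept in unique):
            unique.append(paper)
    return unique


def same_title_and_year(p1: Paper, p2: Paper) -> bool:
    """Hypothetical stand-in predicate: exact match on normalized title and year."""
    title1 = str(p1["title"]).strip().lower()
    title2 = str(p2["title"]).strip().lower()
    return title1 == title2 and p1["year"] == p2["year"]


papers = [
    {"title": "My title one", "journal": "Journal one", "year": 2016},
    {"title": "my title one ", "journal": "Journal one", "year": 2016},
    {"title": "My title two", "journal": "Journal two", "year": 2016},
]
unique_papers = deduplicate(papers, same_title_and_year)
print(len(unique_papers))
```

With the package installed, passing `ColavSimilarity` as `is_similar` should slot into the same loop, assuming it returns a truthy value for similar papers as in the usage example.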
ColavSimilarity also accepts several tuning options:
ratio_thold: int
    threshold for the ratio metric
partial_thold: int
    threshold for the partial ratio
low_thold: int
    low threshold for ratios
use_translation: boolean
    enable translation support
use_parsing: boolean
    use parsing to remove unneeded characters
Example:
from hunahpu.ColavSimilarity import ColavSimilarity
paper1 = {}
paper1['title'] = 'My title one'
paper1["journal"] = "Journal one"
paper1["year"] = 2016
paper2 = {}
paper2['title'] = 'My title two'
paper2["journal"] = "Jornal two"
paper2["year"] = 2016
if ColavSimilarity(paper1, paper2, ratio_thold=90, partial_thold=92, low_thold=92, use_translation=True, use_parsing=True):
    print("The papers are similar")
else:
    print("The papers are not similar")
NOTE: translation does not work in all cases, so it is not recommended to use it.
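To give a feel for what thresholds such as ratio_thold and partial_thold control, here is a rough sketch of a "ratio" and a "partial ratio" score on a 0-100 scale, built on the standard library's `difflib`. This is an illustration under stated assumptions, not the package's implementation: the helper names `ratio`, `partial_ratio`, and `titles_match` are hypothetical, and the rule for combining the two scores is a guess made for the example.

```python
from difflib import SequenceMatcher


def ratio(a: str, b: str) -> int:
    """Whole-string similarity on a 0-100 scale."""
    return round(100 * SequenceMatcher(None, a, b).ratio())


def partial_ratio(a: str, b: str) -> int:
    """Best score of the shorter string against any equally long window of the longer one."""
    shorter, longer = (a, b) if len(a) <= len(b) else (b, a)
    if not shorter:
        return 0
    width = len(shorter)
    return max(ratio(shorter, longer[i:i + width])
               for i in range(len(longer) - width + 1))


def titles_match(t1: str, t2: str,
                 ratio_thold: int = 90, partial_thold: int = 92) -> bool:
    """Hypothetical decision rule: accept when either score reaches its threshold."""
    t1, t2 = t1.lower(), t2.lower()
    return ratio(t1, t2) >= ratio_thold or partial_ratio(t1, t2) >= partial_thold


print(ratio("My title one", "My title two"))
print(partial_ratio("title one", "My title one"))
print(titles_match("Deep learning", "Quantum chemistry"))
```

Raising a threshold makes the match stricter, so fewer pairs are reported as similar; lowering it catches more spelling variants at the cost of more false positives.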
License
BSD-3-Clause License