Python implementation of binary similarity and distance measures.
Project description
binsdpy - binary similarity and distance measures
Python implementation of binary similarity and distance measures (see [1]). The bitsets
(immutable ordered set data type) and numpy.ndarray
are suported as feature vectors.
Example
Example based on bitsets
:
from bitsets import bitset
from binsdpy.similarity import jaccard
from binsdpy.distance import euclid
Colors = bitset("Colors", ("red", "blue", "green", "yellow"))
a = Colors.frommembers(["red", "blue"])
b = Colors.frommembers(["red", "yellow"])
jaccard(a, b)
# > 0.3333333333333333
euclid(a, b)
# > 1.4142135623730951
Example based on np.ndarray
:
import numpy as np
from binsdpy.similarity import jaccard
from binsdpy.distance import euclid
a = np.array([1, 1, 0, 0], dtype=bool)
b = np.array([1, 0, 0, 1], dtype=bool)
jaccard(a, b)
# > 0.3333333333333333
euclid(a, b)
# > 1.4142135623730951
Installation
Package is avaliable in alpha version via pip
.
$ pip install binsdpy
Dependencies
binsdpy requires:
- Python (>= 3.6)
- bitset
- numpy
Reference
[1] Choi, S. S., Cha, S. H., & Tappert, C. C. (2010). A survey of binary similarity and distance measures. Journal of systemics, cybernetics and informatics, 8(1), 43-48. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.352.6123&rep=rep1&type=pdf
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
binsdpy-0.1.1.tar.gz
(10.8 kB
view hashes)