Skip to main content

pydist2 is a library that provides a set of methods for calculating distances between observations.

Project description

pydist2

https://img.shields.io/pypi/v/pydist2.svg https://img.shields.io/travis/Harmouch101/pydist2.svg Documentation Status https://img.shields.io/pypi/status/pydist2.svg https://img.shields.io/pypi/wheel/pydist2.svg https://img.shields.io/github/license/Harmouch101/pydist2.svg

pydist2 is a python library that provides a set of methods for calculating distances between observations. There are two main classes:

  • pdist1 which calculates the pairwise distances between observations in one matrix and returns a distance matrix.
  • pdist2 computes the distances between observations in two matrices and also returns a distance matrix.

Usage

pdist1(P, metric = "euclidean", matrix=False)
pdist2(P, Q, metric = "minkowski", exp = 3)

Arguments:

  • two matrices P and Q.
  • metric: The distance function to use.
  • exp: The exponent of the Minkowski distance.

Installation

The pydist2 library is available on Pypi. Thus, you can install the latest available version using pip:

$pip install pydist2

Supported Python versions

pydist2 has been tested with Python 3.7 and 3.8.

For more information, please checkout the documentation which is available at readthedocs.

This program and the accompanying materials are made available under the terms of the MIT License.

Progress & Features

  • [X] Commit the first code’s version.

  • [X] Support the following list of distances.

  • [X] Display the distance in a matrix form(a combination for each pair of points):

    >>> X = np.array([[100, 100],[0, 100],[100, 0], [500, 400], [300, 600]])
    >>> pdist1(X,matrix=True) # by default, metric = 'euclidean'
    array([[100.    , 100.    , 100.    ,   0.    , 100.    ],
           [100.    , 100.    , 100.    , 100.    ,   0.    ],
           [500.    , 100.    , 100.    , 500.    , 400.    ],
           [538.5165, 100.    , 100.    , 300.    , 600.    ],
           [141.4214,   0.    , 100.    , 100.    ,   0.    ],
           [583.0952,   0.    , 100.    , 500.    , 400.    ],
           [583.0952,   0.    , 100.    , 300.    , 600.    ],
           [565.6854, 100.    ,   0.    , 500.    , 400.    ],
           [632.4555, 100.    ,   0.    , 300.    , 600.    ],
           [282.8427, 500.    , 400.    , 300.    , 600.    ]])
    

where the first column represents the distance between each pair of observations. for instance, the euclidean distance between (100. , 100.) and ( 0. , 100.) is 100.

  • [X] Support numpy arrays of the same size only.

Todo list

  • [ ] Re-validate the correctness of the distances equations.
  • [ ] Performance tests & vectorization.
  • [ ] Adding new distances.
  • [ ] Adding a squared form of the distance.
  • [ ] Support tuples and list.
  • [ ] Write more test cases.
  • [ ] Handling Exceptions.
  • [ ] Restructure the docs.

History

0.0.1 (2021-03-04)

  • First release on PyPI.

0.0.5 (2021-03-21)

  • Adding sqeuclidean metric.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pydist2-0.0.5.tar.gz (21.3 kB view hashes)

Uploaded source

Built Distribution

pydist2-0.0.5-py2.py3-none-any.whl (10.0 kB view hashes)

Uploaded py2 py3

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page