Compute statistical robust distances for mixed data.
Project description
robust-mixed-dist
Data scientists address real-world problems using multivariate and heterogeneous datasets, characterized by multiple variables of different natures. Selecting a suitable distance function between units is crucial, as many statistical techniques and machine learning algorithms depend on this concept. Traditional distances, such as Euclidean or Manhattan, are unsuitable for mixed-type data, and although Gower distance was designed to handle this kind of data, it may lead to suboptimal results in the presence of outlying units or underlying correlation structure.
In the paper Grané , Aurea; Scielzo-Ortiz, Fabio. “On generalized Gower distance for mixed-type data: extensive simulation study and new software tools”. SORT-Statistics and Operations Research Transactions, pp. 213-44, doi:10.57645/20.8080.02.28. robust distances for mixed-type data are defined and explored, namely robust generalized Gower and robust related metric scaling. In addition, the new Python package robust-mixed-dist is developed, which enables to
compute these robust proposals as well as classical ones.
The package is located in Python Package Index (PyPI), the standard repository of packages for the Python programming language: https://pypi.org/project/robust_mixed_dist/
-
Package documentation: https://fabioscielzoortiz.github.io/robust-mixed-dist-docu/intro.html
-
Paper link: https://raco.cat/index.php/SORT/article/view/9900373
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file robust_mixed_dist-0.1.9.tar.gz.
File metadata
- Download URL: robust_mixed_dist-0.1.9.tar.gz
- Upload date:
- Size: 13.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
32a5331563397a0689159ab6cd877f52d22c484ad1afcf1bf9ef675bbae168fe
|
|
| MD5 |
edc88dc0d7d886f84ed0808768b42b92
|
|
| BLAKE2b-256 |
16ff0308cad34f48502cad41221f46e3a74b30880594d1e7c4a69a111c241451
|
File details
Details for the file robust_mixed_dist-0.1.9-py3-none-any.whl.
File metadata
- Download URL: robust_mixed_dist-0.1.9-py3-none-any.whl
- Upload date:
- Size: 14.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ed5c59ab0dc48fe618b6dc4dc9c6006186b7b4cf155063e16404ad18a8c2c0b5
|
|
| MD5 |
535c62c90f448a708498e8a2f5acae2b
|
|
| BLAKE2b-256 |
c7bd1911ab75a27aa56605a9b8975e6592d1e3dfa0a7f9ac1704ea9309a54d6c
|