Skip to main content

Compute statistical robust distances for mixed data.

Project description

robust-mixed-dist

Data scientists address real-world problems using multivariate and heterogeneous datasets, characterized by multiple variables of different natures. Selecting a suitable distance function between units is crucial, as many statistical techniques and machine learning algorithms depend on this concept. Traditional distances, such as Euclidean or Manhattan, are unsuitable for mixed-type data, and although Gower distance was designed to handle this kind of data, it may lead to suboptimal results in the presence of outlying units or underlying correlation structure.

In the paper Grané , Aurea; Scielzo-Ortiz, Fabio. “On generalized Gower distance for mixed-type data: extensive simulation study and new software tools”. SORT-Statistics and Operations Research Transactions, pp. 213-44, doi:10.57645/20.8080.02.28. robust distances for mixed-type data are defined and explored, namely robust generalized Gower and robust related metric scaling. In addition, the new Python package robust-mixed-dist is developed, which enables to compute these robust proposals as well as classical ones.

The package is located in Python Package Index (PyPI), the standard repository of packages for the Python programming language: https://pypi.org/project/robust_mixed_dist/

Package documentation can be found here: https://fabioscielzoortiz.github.io/robust-mixed-dist-docu/intro.html

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

robust_mixed_dist-0.1.7.tar.gz (13.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

robust_mixed_dist-0.1.7-py3-none-any.whl (14.8 kB view details)

Uploaded Python 3

File details

Details for the file robust_mixed_dist-0.1.7.tar.gz.

File metadata

  • Download URL: robust_mixed_dist-0.1.7.tar.gz
  • Upload date:
  • Size: 13.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.0

File hashes

Hashes for robust_mixed_dist-0.1.7.tar.gz
Algorithm Hash digest
SHA256 745a9cd13ef730746ff566b53402600373bd3e13836dd739f738bc0c4e557e4e
MD5 7aa1d05594adc2ae7a20fe22f5e564dd
BLAKE2b-256 1b97b19d14c0ef8ebc8050c82061d9e0dc5d53cde7d95b3cc2bd5800816f894d

See more details on using hashes here.

File details

Details for the file robust_mixed_dist-0.1.7-py3-none-any.whl.

File metadata

File hashes

Hashes for robust_mixed_dist-0.1.7-py3-none-any.whl
Algorithm Hash digest
SHA256 7d71063b5ff5f0668d1566efd0fa7029986c6e80ca2cbbe22d20c74f1a2f0a16
MD5 13b91df4b303bf7e560d1324b75044e0
BLAKE2b-256 26b4140c0a173a79ca6ba00f2966647694b98432079c22b454946a6875c4d881

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page