Skip to main content

Compute statistical robust distances for mixed data.

Project description

robust-mixed-dist

Data scientists address real-world problems using multivariate and heterogeneous datasets, characterized by multiple variables of different natures. Selecting a suitable distance function between units is crucial, as many statistical techniques and machine learning algorithms depend on this concept. Traditional distances, such as Euclidean or Manhattan, are unsuitable for mixed-type data, and although Gower distance was designed to handle this kind of data, it may lead to suboptimal results in the presence of outlying units or underlying correlation structure.

In the paper Grané , Aurea; Scielzo-Ortiz, Fabio. “On generalized Gower distance for mixed-type data: extensive simulation study and new software tools”. SORT-Statistics and Operations Research Transactions, pp. 213-44, doi:10.57645/20.8080.02.28. robust distances for mixed-type data are defined and explored, namely robust generalized Gower and robust related metric scaling. In addition, the new Python package robust-mixed-dist is developed, which enables to compute these robust proposals as well as classical ones.

The package is located in Python Package Index (PyPI), the standard repository of packages for the Python programming language: https://pypi.org/project/robust_mixed_dist/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

robust_mixed_dist-0.1.18.tar.gz (21.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

robust_mixed_dist-0.1.18-py3-none-any.whl (22.4 kB view details)

Uploaded Python 3

File details

Details for the file robust_mixed_dist-0.1.18.tar.gz.

File metadata

  • Download URL: robust_mixed_dist-0.1.18.tar.gz
  • Upload date:
  • Size: 21.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.0

File hashes

Hashes for robust_mixed_dist-0.1.18.tar.gz
Algorithm Hash digest
SHA256 d1a0f5ef8322222376b5fa6d75c6ab93d0ab9ff78a1342d32d92eb96f8f7960f
MD5 ed5bab6fa93a8003a7080c72ed7c9182
BLAKE2b-256 13a4e100ff340dc0d926087877ec82806ca0551852d752aa3b1d4f34ef6e3213

See more details on using hashes here.

File details

Details for the file robust_mixed_dist-0.1.18-py3-none-any.whl.

File metadata

File hashes

Hashes for robust_mixed_dist-0.1.18-py3-none-any.whl
Algorithm Hash digest
SHA256 3281f9f3204f7c7b3ec0eddec6b6495f905b0855c688117750101369d97dfce8
MD5 f2c9def4dfd442e3db892ada0dbbeb68
BLAKE2b-256 1bf9030cc75a2ed4e103dafd1c4d0b4f0ee5e36198cacf4d2b626a36ca01bb8b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page