Skip to main content

Compute statistical robust distances for mixed data.

Project description

robust_mixed_dist

Data scientists address real-world problems using multivariate and heterogeneous datasets, characterized by multiple variables of different natures. Selecting a suitable distance function between units is crucial, as many statistical techniques and machine learning algorithms depend on this concept. Traditional distances, such as Euclidean or Manhattan, are unsuitable for mixed-type data, and although Gower distance was designed to handle this kind of data, it may lead to suboptimal results in the presence of outlying units or underlying correlation structure.

In the paper Grané, A., Scielzo-Ortiz, F.: On generalized Gower distance for mixed-type data: extensive simulation study and new software tools. Submitted to SORT (2025) robust distances for mixed-type data are defined and explored, namely robust generalized Gower and robust related metric scaling. In addition, the new Python package robust_mixed_dist is developed, which enables to compute these robust proposals as well as classical ones.

The package is located in Python Package Index (PyPI), the standard repository of packages for the Python programming language: https://pypi.org/project/robust_mixed_dist/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

robust_mixed_dist-0.1.3.tar.gz (13.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

robust_mixed_dist-0.1.3-py3-none-any.whl (14.6 kB view details)

Uploaded Python 3

File details

Details for the file robust_mixed_dist-0.1.3.tar.gz.

File metadata

  • Download URL: robust_mixed_dist-0.1.3.tar.gz
  • Upload date:
  • Size: 13.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.0

File hashes

Hashes for robust_mixed_dist-0.1.3.tar.gz
Algorithm Hash digest
SHA256 f9c95e93981becc036b3faa5dd51d943e8c749edc2b7262eb72476ca716cc16c
MD5 ebfcae50a4da02e54993d24530e4347f
BLAKE2b-256 591c17207fd1b97edf7ad9b205ee076f87a31508befc09cb9f23dff1e85588ad

See more details on using hashes here.

File details

Details for the file robust_mixed_dist-0.1.3-py3-none-any.whl.

File metadata

File hashes

Hashes for robust_mixed_dist-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 c06e6fe69132ae0f6baa5823459449815403a92ea6323fd9a6de086a71951cc0
MD5 3b32665d126def0f68c138dc8a1756cc
BLAKE2b-256 848ca16efb60c5765a9fa36c68f243baa4f4a5807be3b41891ac2715ea92e555

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page