Data visualization for multivariate datasets with a nonlinear dependence structure
Project description
Copulogram
This package is provides a new data visualisation tool to explore multivariate datasets developped with V.Chabridon.
A copulogram is an innovative plot as it decomposes a mutivariate dataset between the effects of the marginals and those of the dependence between features. To do so, it represents the marginals with univariate kernel density estimation plots or histograms (diagonal), and the dependence structure with scatter plots in the ranked space (upper triangle). On the bottom triangle the scatter plots are set in the physical space, gathering the effects of the marginals and the dependencies. Since the dependence structure is theoretically modeled by an underlying copula, this plot is called copulogram, generalizing the well-known ``correlogram'' to nonlinear dependencies. It gives a synthetic and empirical decomposition of the dataset.
Copulogram of a wind-wave dataset
Installation
The following commands install the current version of the copulogram package.
~$ pip install copulogram
Example on iris dataset
Using the famous iris dataset, let us plot copulograms with different settings:
>>> import seaborn as sns
>>> import copulogram as cp
>>> data = sns.load_dataset('iris')
>>> copulogram = cp.Copulogram(data)
>>> copulogram.draw()
>>> copulogram.draw(alpha=0.8, hue='species', kde_on_marginals=False)
>>> copulogram.draw(hue='species', quantile_contour_levels=[0.2, 0.4, 0.6, 0.8])
References
-
Empirical Bernstein copula: Sancetta, A., & Satchell, S. (2004). The Bernstein Copula and Its Applications to Modeling and Approximations of Multivariate Distributions. Econometric Theory, 20(3), 535–562.
-
Nonparametric copula estimation: Nagler, T., Schellhase, C. & Czado, C. (2017). Nonparametric estimation of simplified vine copula models: comparison of methods. Dependence Modeling, 5(1), 99-120.
-
OpenTURNS: Baudin, M., Lebrun, R., Iooss, B., Popelin, A.L. (2017). OpenTURNS: An Industrial Software for Uncertainty Quantification in Simulation.
-
Wind-waves environmental dataset: The data was generated by a numerical model from ANEMOC (Digital Atlas of Ocean and Coastal Sea States, see http://anemoc.cetmef.developpement-durable.gouv.fr/)
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file copulogram-0.0.4.tar.gz
.
File metadata
- Download URL: copulogram-0.0.4.tar.gz
- Upload date:
- Size: 19.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d826a2cf553d42cccb30988f9c1adb54ec3a7368651789178147db29f4e00a6b |
|
MD5 | ad1b09d65dd67a9c8992de8fff3de022 |
|
BLAKE2b-256 | 1d3c30fb0af7d86cd2cf628530c0bda39986f1feab4ccfca63abe0efeb4fae0c |
File details
Details for the file copulogram-0.0.4-py3-none-any.whl
.
File metadata
- Download URL: copulogram-0.0.4-py3-none-any.whl
- Upload date:
- Size: 20.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ed1f2888606ddcf18c7b0738520d0a3809865344711926efad241ec52860027d |
|
MD5 | 701610c55d8c6361a97ab842d9c1efd5 |
|
BLAKE2b-256 | 0cc38c1f91f607813afce8f7adbf900b404e8491b4d361ed81775f961846e579 |