A Python package for kernel methods in Statistics/ML.
Project description
PyRKHSstats
A Python package implementing a variety of statistical/machine learning methods that rely on kernels (e.g. HSIC for independence testing).
Overview
- Independence testing with HSIC (Hilbert-Schmidt Independence Criterion), as introduced in A Kernel Statistical Test of Independence, A. Gretton, K. Fukumizu, C. Hui Teo, L. Song, B. Schölkopf, and A. Smola (NIPS 2007).
- Measurement of conditional independence with HSCIC (Hilbert-Schmidt Conditional Independence Criterion), as introduced in A Measure-Theoretic Approach to Kernel Conditional Mean Embeddings, J. Park and K. Muandet (NeurIPS 2020).
- The Kernel-based Conditional Independence Test (KCIT), as introduced in Kernel-based Conditional Independence Test and Application in Causal Discovery, K. Zhang, J. Peters, D. Janzing, B. Schölkopf (UAI 2011).
- Two-sample testing (also known as homogeneity testing) with the MMD (Maximum Mean Discrepancy), as presented in A Fast, Consistent Kernel Two-Sample Test, A. Gretton, K. Fukumizu, Z. Harchaoui, and B. K. Sriperumbudur (NIPS 2009) and in A Kernel Two-Sample Test, A. Gretton, K. M. Borgwardt, M. J. Rasch, B. Schölkopf, and A. Smola (JMLR, volume 13, 2012).
| Resource | Description |
|---|---|
| HSIC | For independence testing |
| HSCIC | For the measurement of conditional independence |
| KCIT | For conditional independence testing |
| MMD | For two-sample testing |
Implementations available
The following table details the implementation schemes for the different resources available in the package.
| Resource | Implementation Scheme | Numpy based available | PyTorch based available |
|---|---|---|---|
| HSIC | Resampling (permuting the xi's but leaving the yi's unchanged) | Yes | No |
| HSIC | Gamma approximation | Yes | No |
| HSCIC | N/A | Yes | Yes |
| KCIT | Gamma approximation | Yes | No |
| KCIT | Monte Carlo simulation (weighted sum of χ2 random variables) | Yes | No |
| MMD | Gram matrix spectrum | Yes | No |
In development
- Joint independence testing with dHSIC.
- Goodness-of-fit testing.
- Methods for time series models.
- Bayesian statistical kernel methods.
- Regression by independence maximisation.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file PyRKHSstats-2.1.0.tar.gz.
File metadata
- Download URL: PyRKHSstats-2.1.0.tar.gz
- Upload date:
- Size: 28.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.6.0 importlib_metadata/4.8.2 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
11ea1962bb4c0796f798c7e2218b6badaf4b9b980feb7a743628947a45d576c5
|
|
| MD5 |
8272c4a12de6963813861677c64b2099
|
|
| BLAKE2b-256 |
687f01cf21d15ef653abdd9c7bb00fb772467c8618cf19ba38944f833c4f224b
|
File details
Details for the file PyRKHSstats-2.1.0-py3-none-any.whl.
File metadata
- Download URL: PyRKHSstats-2.1.0-py3-none-any.whl
- Upload date:
- Size: 54.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.6.0 importlib_metadata/4.8.2 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b130d5f14f3a98faf098489c59718ec09422612e2b6be05bd1b75a2655cd2677
|
|
| MD5 |
a142749a522e659826f6b38a5705a8bf
|
|
| BLAKE2b-256 |
c4575c7054ed7e3b018f1166b57315bbc7722059b15fb05b03eb168a1ab28ecc
|