Efficient approximate Bayesian machine learning
Project description
xGPR
xGPR is a library for fitting approximate Gaussian process regression models to datasets ranging in size from hundreds to millions of datapoints. It uses an efficient implementation of the random features approximation (aka random Fourier features). It is designed to run on either CPU or GPU (GPU strongly preferred), to model tabular data, sequence & time series data and graph data, and to fit datasets too large to load into memory in a straightforward way.
Unlike exact Gaussian processes, which exhibit O(N^2) scaling and are completely impractical for large datasets, xGPR can scale easily; it is fairly straightforward to fit a few million datapoints on a GPU. Notably, xGPR is able to do this while providing accuracy competitive with deep learning models (unlike variational GP approximations). Unlike other libraries for Gaussian processes, which only provide kernels for fixed-vector data (tabular data), xGPR provides powerful convolution kernels for variable-length time series, sequences and graphs.
What's new in v0.4.5
Starting with version 0.4.5, xGPR is available as a precompiled binary / wheel for 64 bit Linux and as a source distribution for other platforms, so that in most cases, installation should typically be as simple as:
pip install xGPR
See the documentation for important information about installation and requirements.
Documentation
The documentation covers a variety of use cases, including tabular data, sequences and graphs, installation requirements and much more.
Citations
If using xGPR for research intended for publication, please cite either:
Linear-Scaling Kernels for Protein Sequences and Small Molecules Outperform Deep Learning While Providing Uncertainty Quantitation and Improved Interpretability Jonathan Parkinson and Wei Wang Journal of Chemical Information and Modeling 2023 63 (15), 4589-4601 DOI: 10.1021/acs.jcim.3c00601
or the preprint at:
Jonathan Parkinson, & Wei Wang. (2023). Linear Scaling Kernels for Protein Sequences and Small Molecules Outperform Deep Learning while Providing Uncertainty Quantitation and Improved Interpretability https://arxiv.org/abs/2302.03294
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
File details
Details for the file xgpr-0.4.5.tar.gz
.
File metadata
- Download URL: xgpr-0.4.5.tar.gz
- Upload date:
- Size: 2.5 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b5321e88164284567e50e9d22fda39198d8f8846dedcb4a1f3f97962352017d9 |
|
MD5 | be7f2ad730ff61e9ceb3bf8751f53c70 |
|
BLAKE2b-256 | 1be092019646acdc25ae035594b1a088b7af4995f32af8f2bcf11593606851c7 |
File details
Details for the file xgpr-0.4.5-cp312-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
.
File metadata
- Download URL: xgpr-0.4.5-cp312-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 694.4 kB
- Tags: CPython 3.12+, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7909c9c2a4cba0681701ddeef363d2593c2d28765447626432eabe05947397df |
|
MD5 | 806e236a543b78b96db14ecd3276ced8 |
|
BLAKE2b-256 | 9570bea7b24cc7b72311a9451c1b62ca74ddd80e1ab1efdfa95d5a180c1bcc81 |
File details
Details for the file xgpr-0.4.5-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
.
File metadata
- Download URL: xgpr-0.4.5-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 695.9 kB
- Tags: CPython 3.11, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f52f30f015c3520523304235ddf03574f3deb616ef5c7ca305791e25ccf19ebd |
|
MD5 | ba22c9433dc47b16fcd741ad25028f2a |
|
BLAKE2b-256 | 65f9fb8754ab5b75ac1a3a97fb34ec91a072496bdeb9645a42426ccd4cd46185 |
File details
Details for the file xgpr-0.4.5-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
.
File metadata
- Download URL: xgpr-0.4.5-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 696.2 kB
- Tags: CPython 3.10, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1edb5d97a1062c1ebe507c90bb4801c00be6b92081a27bfea4f7e48f95f45d2b |
|
MD5 | 62a3411b971ba370ae75c109689922d9 |
|
BLAKE2b-256 | 8801d1b198dd61b8974b543f61c54253844ac01398db6ca9fdbfee6c2a825c89 |
File details
Details for the file xgpr-0.4.5-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
.
File metadata
- Download URL: xgpr-0.4.5-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Upload date:
- Size: 696.3 kB
- Tags: CPython 3.9, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | da879df3b20cbcdd8792553a07103435d0a12dfa5eba3417d3769fa62adff034 |
|
MD5 | 20471729c2eb55acfb8a3d408d5f2db7 |
|
BLAKE2b-256 | 39364f9441a7c64e218dc919bd629495177c9a6d75c504f5bf42d50edb17d254 |