Skip to main content

Efficient approximate Bayesian machine learning

Project description

xGPR

xGPR is a library for fitting approximate Gaussian process regression models to datasets ranging in size from hundreds to millions of datapoints. It uses an efficient implementation of the random features approximation (aka random Fourier features). It is designed to run on either CPU or GPU (GPU strongly preferred), to model tabular data, sequence & time series data and graph data, and to fit datasets too large to load into memory in a straightforward way.

Unlike exact Gaussian processes, which exhibit O(N^2) scaling and are completely impractical for large datasets, xGPR can scale easily; it is fairly straightforward to fit a few million datapoints on a GPU. Notably, xGPR is able to do this while providing accuracy competitive with deep learning models (unlike variational GP approximations). Unlike other libraries for Gaussian processes, which only provide kernels for fixed-vector data (tabular data), xGPR provides powerful convolution kernels for variable-length time series, sequences and graphs.

What's new in v0.4.5

Starting with version 0.4.5, xGPR is available as a precompiled binary / wheel for 64 bit Linux and as a source distribution for other platforms, so that in most cases, installation should typically be as simple as:

pip install xGPR

See the documentation for important information about installation and requirements.

Documentation

The documentation covers a variety of use cases, including tabular data, sequences and graphs, installation requirements and much more.

Citations

If using xGPR for research intended for publication, please cite either:

Linear-Scaling Kernels for Protein Sequences and Small Molecules Outperform Deep Learning While Providing Uncertainty Quantitation and Improved Interpretability Jonathan Parkinson and Wei Wang Journal of Chemical Information and Modeling 2023 63 (15), 4589-4601 DOI: 10.1021/acs.jcim.3c00601

or the preprint at:

Jonathan Parkinson, & Wei Wang. (2023). Linear Scaling Kernels for Protein Sequences and Small Molecules Outperform Deep Learning while Providing Uncertainty Quantitation and Improved Interpretability https://arxiv.org/abs/2302.03294

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xgpr-0.4.5.tar.gz (2.5 MB view details)

Uploaded Source

Built Distributions

xgpr-0.4.5-cp312-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (694.4 kB view details)

Uploaded CPython 3.12+ manylinux: glibc 2.17+ x86-64

xgpr-0.4.5-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (695.9 kB view details)

Uploaded CPython 3.11 manylinux: glibc 2.17+ x86-64

xgpr-0.4.5-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (696.2 kB view details)

Uploaded CPython 3.10 manylinux: glibc 2.17+ x86-64

xgpr-0.4.5-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (696.3 kB view details)

Uploaded CPython 3.9 manylinux: glibc 2.17+ x86-64

File details

Details for the file xgpr-0.4.5.tar.gz.

File metadata

  • Download URL: xgpr-0.4.5.tar.gz
  • Upload date:
  • Size: 2.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.12

File hashes

Hashes for xgpr-0.4.5.tar.gz
Algorithm Hash digest
SHA256 b5321e88164284567e50e9d22fda39198d8f8846dedcb4a1f3f97962352017d9
MD5 be7f2ad730ff61e9ceb3bf8751f53c70
BLAKE2b-256 1be092019646acdc25ae035594b1a088b7af4995f32af8f2bcf11593606851c7

See more details on using hashes here.

File details

Details for the file xgpr-0.4.5-cp312-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for xgpr-0.4.5-cp312-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 7909c9c2a4cba0681701ddeef363d2593c2d28765447626432eabe05947397df
MD5 806e236a543b78b96db14ecd3276ced8
BLAKE2b-256 9570bea7b24cc7b72311a9451c1b62ca74ddd80e1ab1efdfa95d5a180c1bcc81

See more details on using hashes here.

File details

Details for the file xgpr-0.4.5-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for xgpr-0.4.5-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 f52f30f015c3520523304235ddf03574f3deb616ef5c7ca305791e25ccf19ebd
MD5 ba22c9433dc47b16fcd741ad25028f2a
BLAKE2b-256 65f9fb8754ab5b75ac1a3a97fb34ec91a072496bdeb9645a42426ccd4cd46185

See more details on using hashes here.

File details

Details for the file xgpr-0.4.5-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for xgpr-0.4.5-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 1edb5d97a1062c1ebe507c90bb4801c00be6b92081a27bfea4f7e48f95f45d2b
MD5 62a3411b971ba370ae75c109689922d9
BLAKE2b-256 8801d1b198dd61b8974b543f61c54253844ac01398db6ca9fdbfee6c2a825c89

See more details on using hashes here.

File details

Details for the file xgpr-0.4.5-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for xgpr-0.4.5-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 da879df3b20cbcdd8792553a07103435d0a12dfa5eba3417d3769fa62adff034
MD5 20471729c2eb55acfb8a3d408d5f2db7
BLAKE2b-256 39364f9441a7c64e218dc919bd629495177c9a6d75c504f5bf42d50edb17d254

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page