Estimate discontinuous timeseries from continuous covariates.
Project description
discontinuum
[!WARNING]
Experimental.
Overview
discontinuum
is a middleware for developing Gaussian process (GP) models.
Why might we want a middleware?
GP's are a flexible and elegant approach to modeling dynamical systems
for which we have sparse and uncertain observations.
In this arena, simple GP models, specified in several lines of math,
can often achieve state-of-the-art predictive performance.
However, fitting GP's is numerically intense, $\mathcal{O}(n^3)$ complexity.
They have several optimizations that take advantage of simplifying assumptions,
different algorithms, or GPUs, but each has tradeoffs.
Ideally, we could quickly write mathematical models, then run them on whichever
"engine" is best suited for a particular problem.
Furthermore, most models include a lot of relatively standard utility functions
for plotting, managing metadata, data pre-processing, and other "boiler plate."
discontinum
packages engines and utilities within a single ecosystem,
such that creating a new model is just a matter of writing a little math without
too much boilerplate.
Installation
pip install discontinuum
Models
loadset-gp
loadest-gp
is Gaussian-process model for estimating river constituent time series,
which borrows its namesake from the venerable LOAD ESTimator (LOADEST) software program.
However, LOADEST has several serious limitations
---it's essentially a linear regression---and it has been all but replaced by
the more flexible Weighted Regression on Time Discharge and Season (WRTDS),
which allows the relation between target and covariate to vary through time.
loadest-gp
takes the WRTDS idea and reimplements it as a GP.
Try it out in the loadest-gp demo.
rating-gp
rating-gp
is a Gaussian-process model for estimate river flow from stage time series.
Try it out in the rating-gp demo.
Engines
Currently, the only supported engines are the marginal likelihood implementation in pymc
and gpytorch
.
Latent GP implementations could be added in the future.
In general, the gpytorch
implementation is faster and provides a lot a powerful features,
like GPU support, whereas pymc
is a more complete probabilistic-programming framework,
which can be "friendlier" for certain use cases.
Roadmap
mindmap
root((discontinuum))
data providers
USGS
etc
engines
PyMC
PyTorch
utilities
pre-processing
post-processing
plotting
models
loadest-gp
rating-gp
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file discontinuum-0.5.tar.gz
.
File metadata
- Download URL: discontinuum-0.5.tar.gz
- Upload date:
- Size: 433.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.20
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d98098772b1a1710537ac8dd5d64cbf485531cf2d4cd87c20d6bccf67e31a23f |
|
MD5 | 9648582d268c14290961499f4fbe30cf |
|
BLAKE2b-256 | 17ebe866b88131cb2475936096e8f810a480bffd517c1b5b520464cea891660c |
File details
Details for the file discontinuum-0.5-py3-none-any.whl
.
File metadata
- Download URL: discontinuum-0.5-py3-none-any.whl
- Upload date:
- Size: 40.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.20
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 23f8043bd583fdd7a9b5083be97f181720e12c10f8ceba76d51467ce00b27c5d |
|
MD5 | ba1d9dd6549be3f392d57449d2c45a99 |
|
BLAKE2b-256 | fb73c27cb6aae04f1d831cc9f93267816e1a6280b63c38508151ef72493ee56b |