Implementation of the stochastic root-finding algorithm described by
Project description
This is an implementation of the probabilistic bisection algorithm (PBA) described by Rolf Waeber in his 2013 PhD thesis. PBA is a 1D stochastic root-finding algorithm. This means that it is meant to find the point where a noisy function (i.e. a function that may return different values each time its evaluated at the same point) crosses the x-axis. More precisely, given g(x) = f(x) + ε(x), the goal is to find x such that E[g(x)] = 0, where f(x) is the function we are interested in, ε is a normally distributed noise function with median 0, g(x) is the only way we can observe f(x), and E[g(x)] is the expected value of g(x).
This algorithm works by repeatedly evaluating the noisy function at a single x until the probability that we know the true sign of f(x) exceeds a specified threshold. This information is used to build a Bayesian posterior distribution describing the location of the root. The next point to evaluate is then chosen from this distribution. There are two features that make this algorithm unique:
It provides a true confidence interval in addition to the root itself.
It places very little restraints on the form of the noise function.
However, there are some caveats to be aware of:
In my experience, the algorithm rarely converges. This is due to the fact that the confidence intervals only narrow as new x-coordinates are sampled, but the closer the algorithm gets to the root, the more time it spends sampling each x-coordinate. That said, the algorithm seems to find the root very accurately even when it doesn’t converge.
The algorithm isn’t very numerically stable. The posterior distribution is represented using two arrays: one for the bin edges and one for the bin heights. As the algorithm progresses, the bins near the root get very thin and very tall, which leads to loss-of-precision and overflow issues. My only advice on how to avoid these problems is to limit the number of iterations. It might also help to increase the p_c parameter.
Note that I myself only have a rudimentary understanding of the math behind this algorithm. This implementation is based on scripts I received from the authors, and I tried to test my code as well as possible to convince myself that it’s doing the right thing, but I’m outside my comfort zone here.
Installation
Install from PyPI:
$ pip install pba_waeber2013
Usage
This module provides a single public function:
>>> from pba_waeber2013 import pba_waeber2013 >>> pba_waeber2013( ... f, a, b, * ... tol=None, ... rtol=None, ... alpha=0.05, ... p_c=0.65, ... maxiter=1000, ... check_bounds=False, ... slope=None, ... args=None, ... kwargs=None, ... )
Below are brief descriptions of the parameters:
- f
The stochastic function of interest. This function can take any number of arguments (see args and kwargs), but the first should be the x coordinate to evaluate the function at.
- a b
The lower and upper bounds, respectively, on where the root can occur. There must be exactly one root in this interval.
- tol
How narrow the confidence interval needs to be in order for the algorithm to be converged. Note that if neither tol nor rtol are specified, the algorithm will just run for the maximum number of iterations.
- rtol
Similar to tol, but multiplied by the estimated root.
- alpha
A parameter controlling the width of the confidence intervals. Specifically, the confidence level is given by 1 − alpha. In other words, for 95% confidence intervals, set alpha to 0.05.
- p_c
Repeatedly evaluate the function at each x coordinate until we can reject the null hypothesis that f(x) = 0 with probability p_c. At that point, the sign of f(x) will be taken as whichever sign we observed most often.
- maxiter
The maximum number of function calls to make.
- check_bounds
Evaluate the function at the bounds a and b before starting the bisection. This achieves two things. First, it checks that the bounds have different signs. If this is not the case, then there are either 0 or >1 roots in the interval, and so this algorithm is not applicable. Second, it determines the slope of the function. If the bounds are not checked, the slope parameter must be manually specified.
- slope
+1 if the function is increasing (i.e. negative before the root and positive after) or -1 if the function is decreasing. A slope must be given unless check_bounds is True (in which case it will be calculated internally).
- args
Any extra arguments to pass to the function on each evaluation.
- kwargs
Any extra keyword arguments to pass to the function on each evaluation.
The return value is an object with the following attributes:
- x
The location of the root.
- x_obs
All of the x coordinates where the function was evaluated.
- f_obs
The result of evaluating the function at each of the above x coordinates.
- x_post
The bin edges of the posterior distribution.
- log_p_post
The natural logarithms of the bin heights of the posterior distribution. Logarithms are used to avoid multiplication overflows.
- ci
The confidence interval evaluated independently after a sign is determined for each coordinate. Note that these intervals can grow and shrink over time. See [Waeber2013] §3.3 for more information.
- ci_seq
The sequential confidence interval. Unlike the ci intervals, these are guaranteed to never expand. For that reason, these are the intervals used to check for convergence.
- converged
True if the algorithm terminated because the confidence interval grew narrower than the given tolerance, False if the algorithm terminated because it reached the maximum number of iterations.
References
Waeber R. (2013) “Probabilistic bisection search for stochastic root-finding.”
Frazier PI, Henderson SG, Waeber R (2016) “Probabilistic bisection converges almost as quickly as stochastic approximation”, arXiv:1612.03964
Robbins H and Siegmund D. (1974) “The expected sample size of some tests of power one”, The Annals of Statistics, 2(3), pp. 415–436. doi:10.1214/aos/1176342704.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file pba_waeber2013-0.1.0.tar.gz
.
File metadata
- Download URL: pba_waeber2013-0.1.0.tar.gz
- Upload date:
- Size: 16.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.10.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ea71a2a8dce439417f4d6cfefe556fbcb1f1317f7b48530e4664abe58b431809 |
|
MD5 | b9e2d03d4a56b396ae591b84b5630340 |
|
BLAKE2b-256 | 95209ed46b10b10d347a55b2dd68e6fcc3185738dc695a6f85d2b971f98a4ca1 |
File details
Details for the file pba_waeber2013-0.1.0-py2.py3-none-any.whl
.
File metadata
- Download URL: pba_waeber2013-0.1.0-py2.py3-none-any.whl
- Upload date:
- Size: 8.4 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.10.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 020e94007c722b79171c3d3add3e86faba9e3e7e434b0ffb89984d40852f6388 |
|
MD5 | bdf15ad6d67bf3255c49a0f367041856 |
|
BLAKE2b-256 | 1399ab673df030cd9e57b23797b6ef51e1ad67e58f40d7d84a6b571ecdab4169 |