Skip to main content

Bayesian A/B testing for proportions

Project description

BayesProp Logo

Bayesian A/B Testing for Proportions

PyPI Downloads Documentation Python License: MIT Tests codecov

A Python package for Bayesian hypothesis testing of success-rate differences in any Bernoulli-like experiment, using analytic and approximate inference methods. Input data can be binary (0/1) or real-valued on (0, 1) — continuous scores are automatically binarized at a configurable threshold. Typical applications include comparing treatments, groups, items, model variants, or any two conditions whose outcomes can be expressed as proportions. Please check out our Getting Started guide for installation and quick examples.

Features

  • Effect-size inference for proportions — estimate and test the difference in success rates for both paired and non-paired samples
  • Savage–Dickey Bayes Factor — test a point-null hypothesis ($\delta = 0$) without fitting a separate null model
  • Posterior of the null & ROPE — quantify the posterior mass inside a Region of Practical Equivalence for nuanced decisions beyond simple reject/accept
  • Posterior predictive checks — assess model fit by comparing observed data to data simulated from the posterior
  • Bayes Factor Design Analysis (BFDA) — plan sample sizes to reach a target level of evidence before running the experiment
  • Sequential / streaming design — update the posterior batch-by-batch as data arrive and stop early once the Bayes factor crosses an upper or lower threshold (SequentialNonPairedBayesPropTest, SequentialPairedBayesPropTest)
  • Publication-ready plots — posterior distributions, predictive checks, Savage–Dickey density-ratio plots, BFDA power curves, and sequential BF₁₀ trajectories out of the box

Models

Model Class Method When to use
Non-paired Beta–Bernoulli NonPairedBayesPropTest Conjugate Beta posteriors per arm; P(B>A) by quadrature, Δ summaries by Monte Carlo Independent groups, exact & fast
Paired Logistic (Laplace) PairedBayesPropTest MAP + Laplace approximation Paired scores, large n, fast iteration
Paired Logistic (Pólya–Gamma) PairedBayesPropTestPG Exact Gibbs sampling Paired scores, small n, exact posterior

Quick start

import numpy as np
from bayesprop.resources.bayes_nonpaired import NonPairedBayesPropTest

y_A = np.array([1,1,0,1,1,0,1,1,0,1,1,1,0,1,1,0,1,1,1,0])
y_B = np.array([0,1,0,0,1,1,0,1,0,0,1,0,0,1,0,1,0,1,0,0])

# Fit & summarise
model = NonPairedBayesPropTest(seed=42).fit(y_A, y_B)
print(model.summary)           # NonPairedSummary with mean_delta, ci_95, P(A>B), …

# Hypothesis test
bf = model.savage_dickey_test() # SavageDickeyResult with BF_10, decision, …

# Plots
model.plot_posteriors()
model.plot_savage_dickey()

Package structure

├── pyproject.toml
├── justfile                   # task runner (just <recipe>)
├── .pre-commit-config.yaml    # ruff format + lint hooks
├── data/                      # evaluation datasets
├── docs/                      # documentation source
├── src
│   ├── bayesprop
│   │   ├── config/            # global_config, YAML configs
│   │   ├── resources/
│   │   │   ├── bayes_nonpaired.py      # NonPairedBayesPropTest
│   │   │   ├── bayes_paired_laplace.py # PairedBayesPropTest
│   │   │   ├── bayes_paired_pg.py      # PairedBayesPropTestPG
│   │   │   ├── bfda_utils.py           # BFDA helpers
│   │   │   └── data_schemas.py         # Pydantic models
│   │   ├── services/
│   │   │   └── file.py                 
│   │   └── utils/
│   │       └── utils.py                # simulate, BFDA power curves, plots
│   └── notebooks/
│       ├── bayesian_AB_model_comparison_nonpaired.ipynb
│       ├── bayesian_AB_model_comparison_paired_laplace.ipynb
│       ├── bayesian_AB_model_comparison_paired_gibbs.ipynb
│       ├── sequential_nonpaired_demo.ipynb
│       └── sequential_paired_laplace_demo.ipynb
└── tests/

Installation

pip install BayesProp

Or with uv:

uv pip install BayesProp

For development (from source):

git clone https://github.com/AVoss84/bayesProp.git
cd bayesprop
uv venv --python 3.13
uv sync
source .venv/bin/activate

Dependencies

  • Python ≥ 3.13
  • numpy, scipy, matplotlib, pandas
  • pydantic (v2)
  • polyagamma

References

  • Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A. & Rubin, D. B. (2013). Bayesian Data Analysis (3rd ed.). Chapman & Hall/CRC.
  • Kruschke, J. K. (2018). Rejecting or accepting parameter values in Bayesian estimation. Advances in Methods and Practices in Psychological Science, 1(2), 270–280.
  • Polson, N. G., Scott, J. G. & Windle, J. (2013). Bayesian inference for logistic models using Pólya–Gamma latent variables. JASA, 108(504), 1339–1349.
  • Schönbrodt, F. D. & Wagenmakers, E.-J. (2018). Bayes factor design analysis: Planning for compelling evidence. Psychonomic Bulletin & Review, 25(1), 128–142.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bayesprop-0.1.0.5.tar.gz (65.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bayesprop-0.1.0.5-py3-none-any.whl (54.8 kB view details)

Uploaded Python 3

File details

Details for the file bayesprop-0.1.0.5.tar.gz.

File metadata

  • Download URL: bayesprop-0.1.0.5.tar.gz
  • Upload date:
  • Size: 65.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for bayesprop-0.1.0.5.tar.gz
Algorithm Hash digest
SHA256 80769fd4eb3aea630f24cee11180a9d8eca7d243e873385089dff6ceed56f7a7
MD5 9c9b87412642d1b301d6e694d4fd14fa
BLAKE2b-256 9487f45e00620068531eb7a7cf5b82b1d9b017b5e7e91dd858b6d009c91d9bbd

See more details on using hashes here.

Provenance

The following attestation bundles were made for bayesprop-0.1.0.5.tar.gz:

Publisher: publish_pypi.yml on AVoss84/bayesProp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file bayesprop-0.1.0.5-py3-none-any.whl.

File metadata

  • Download URL: bayesprop-0.1.0.5-py3-none-any.whl
  • Upload date:
  • Size: 54.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for bayesprop-0.1.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 8b772ee02a2f31e030af7f3de0e4a08a30bcd7c31eabada1e545c8101bb4021e
MD5 bd1e13a1ba0fd7f058bd50188a657e45
BLAKE2b-256 38b4552b88004c9c93017ed3b4623c6941974fc742e2c112923917167d19fc09

See more details on using hashes here.

Provenance

The following attestation bundles were made for bayesprop-0.1.0.5-py3-none-any.whl:

Publisher: publish_pypi.yml on AVoss84/bayesProp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page