
A/B Testing Toolkit - Statistical analysis library for A/B tests

Project description

ABTK - A/B Testing Toolkit

Python 3.8+ License: MIT

ABTK is a comprehensive Python library for statistical analysis of A/B tests. It provides a unified interface for parametric and nonparametric hypothesis tests, variance reduction techniques, and multiple comparisons correction.

Key Features

  • 8 Statistical Tests - Parametric (T-Test, Z-Test, CUPED, ANCOVA) and Nonparametric (Bootstrap)
  • Variance Reduction - CUPED, ANCOVA with multiple covariates
  • Multiple Comparisons - Bonferroni, Holm, Benjamini-Hochberg, and more
  • Quantile Analysis - Analyze treatment effects across the distribution
  • Unified Interface - All tests return standardized TestResult objects
  • Automatic Diagnostics - ANCOVA validates statistical assumptions
  • Flexible - Support for relative and absolute effects

Quick Start

Installation

pip install abtk

Basic Example

from core.data_types import SampleData
from tests.parametric import TTest

# Prepare your data
control = SampleData(
    data=[100, 110, 95, 105, 98, 102],
    name="Control"
)

treatment = SampleData(
    data=[105, 115, 100, 110, 103, 108],
    name="Treatment"
)

# Run test
test = TTest(alpha=0.05, test_type="relative")
results = test.compare([control, treatment])

# Get results
result = results[0]
print(f"Effect: {result.effect:.2%}")           # e.g., "Effect: 5.5%"
print(f"P-value: {result.pvalue:.4f}")
print(f"Significant: {result.reject}")          # True/False
print(f"95% CI: [{result.left_bound:.2%}, {result.right_bound:.2%}]")
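Conceptually, a relative-effect t-test compares the lift of the treatment mean over the control mean and tests the difference for significance. A minimal standalone sketch using scipy directly (not the ABTK implementation) on the same data:

```python
# Standalone sketch of what a relative-effect t-test computes,
# using scipy's Welch t-test rather than the ABTK API.
from scipy import stats

control = [100, 110, 95, 105, 98, 102]
treatment = [105, 115, 100, 110, 103, 108]

t_stat, pvalue = stats.ttest_ind(treatment, control, equal_var=False)

mean_c = sum(control) / len(control)
mean_t = sum(treatment) / len(treatment)
relative_effect = (mean_t - mean_c) / mean_c  # lift vs. control mean

print(f"Effect: {relative_effect:.2%}")
print(f"P-value: {pvalue:.4f}")
print(f"Significant at alpha=0.05: {pvalue < 0.05}")
```

With only six observations per group the test is underpowered, so a ~5% lift here is not significant; that is expected, not a bug.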

Using pandas DataFrames

For most analysts, data starts in a pandas DataFrame. Use the helper utilities for easy conversion:

import pandas as pd
from utils.dataframe_helpers import sample_data_from_dataframe
from tests.parametric import TTest

# Your experiment data as DataFrame
df = pd.DataFrame({
    'variant': ['control', 'control', 'treatment', 'treatment'],
    'revenue': [100, 110, 105, 115],
    'baseline_revenue': [90, 100, 92, 102]  # Optional: for CUPED
})

# Convert to SampleData objects automatically
samples = sample_data_from_dataframe(
    df,
    group_col='variant',
    metric_col='revenue',
    covariate_cols='baseline_revenue'  # Optional
)

# Run test
test = TTest(alpha=0.05, test_type="relative")
results = test.compare(samples)

See the DataFrame Usage Examples for more.
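Conceptually, the conversion is a group-by split: one array of metric values (plus optional covariates) per variant. A plain-pandas sketch of that idea, independent of the `sample_data_from_dataframe` helper:

```python
# Plain-pandas sketch of a DataFrame-to-samples split:
# collect the metric (and covariate) values for each variant.
import pandas as pd

df = pd.DataFrame({
    'variant': ['control', 'control', 'treatment', 'treatment'],
    'revenue': [100, 110, 105, 115],
    'baseline_revenue': [90, 100, 92, 102],
})

samples = {
    name: {
        'data': group['revenue'].tolist(),
        'covariates': group['baseline_revenue'].tolist(),
    }
    for name, group in df.groupby('variant')
}

print(samples['control']['data'])    # [100, 110]
print(samples['treatment']['data'])  # [105, 115]
```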

Available Tests

Parametric Tests

| Test | Use Case | Special Features |
| --- | --- | --- |
| TTest | Standard A/B test | Fast, well-understood |
| PairedTTest | Matched pairs A/B test | Removes between-subject variability |
| CupedTTest | A/B test with 1 covariate | Variance reduction |
| ZTest | Proportions (CTR, CVR) | Designed for binary outcomes |
| AncovaTest | A/B test with multiple covariates | Maximum variance reduction, diagnostics |
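For intuition on the proportions case: a two-sample Z-test compares conversion rates using a pooled-variance normal approximation. A sketch using statsmodels directly (not ABTK's `ZTest`):

```python
# Sketch of the statistic behind a proportions Z-test (e.g. CTR),
# using statsmodels rather than ABTK's ZTest.
from statsmodels.stats.proportion import proportions_ztest

clicks = [120, 150]         # successes in control, treatment
impressions = [1000, 1000]  # trials in control, treatment

z_stat, pvalue = proportions_ztest(count=clicks, nobs=impressions)
print(f"z = {z_stat:.3f}, p = {pvalue:.4f}")
```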

Nonparametric Tests

| Test | Use Case | Special Features |
| --- | --- | --- |
| BootstrapTest | Non-normal data, outliers | No distributional assumptions, robust |
| PairedBootstrapTest | Matched pairs, non-normal | Combines pairing + nonparametric |
| PostNormedBootstrapTest | Non-normal with covariate | Bootstrap + variance reduction |
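The core idea behind the bootstrap tests is to resample each group with replacement and read a confidence interval off the empirical distribution of the effect. A minimal percentile-bootstrap sketch with numpy (not ABTK's `BootstrapTest` implementation):

```python
# Minimal percentile-bootstrap sketch of a difference in means.
import numpy as np

rng = np.random.default_rng(0)
control = np.array([100, 110, 95, 105, 98, 102])
treatment = np.array([105, 115, 100, 110, 103, 108])

n_samples = 10_000
diffs = np.empty(n_samples)
for i in range(n_samples):
    # Resample each group with replacement, record the effect estimate
    c = rng.choice(control, size=control.size, replace=True)
    t = rng.choice(treatment, size=treatment.size, replace=True)
    diffs[i] = t.mean() - c.mean()

lo, hi = np.percentile(diffs, [2.5, 97.5])  # 95% CI for the mean difference
print(f"95% CI: [{lo:.2f}, {hi:.2f}]")
```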

Documentation

Example: Variance Reduction with CUPED

from tests.parametric import CupedTTest

# Include historical data for variance reduction
control = SampleData(
    data=[100, 110, 95, 105],           # Current metric
    covariates=[90, 100, 85, 95],       # Historical baseline
    name="Control"
)

treatment = SampleData(
    data=[105, 115, 100, 110],
    covariates=[92, 102, 87, 97],
    name="Treatment"
)

test = CupedTTest(alpha=0.05)
results = test.compare([control, treatment])

# CUPED typically gives narrower CI and lower p-value!
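The adjustment behind CUPED (Deng et al., 2013) is simple enough to sketch by hand: each observation is shifted by a scaled, centered covariate, Y_adj = Y - theta * (X - mean(X)), with theta = cov(X, Y) / var(X). One common choice, shown here, estimates a single theta pooled across both groups; this sketch uses numpy only:

```python
# Sketch of the CUPED adjustment itself (not ABTK's CupedTTest):
# Y_adj = Y - theta * (X - mean(X)), theta = cov(X, Y) / var(X),
# with theta pooled across control + treatment.
import numpy as np

y = np.array([100, 110, 95, 105, 105, 115, 100, 110], dtype=float)  # metric
x = np.array([90, 100, 85, 95, 92, 102, 87, 97], dtype=float)       # covariate

theta = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
y_adj = y - theta * (x - x.mean())

# The adjusted metric keeps the same mean but has lower variance
# whenever the covariate is correlated with the metric.
print(f"theta = {theta:.3f}")
print(f"var before: {np.var(y, ddof=1):.1f}, after: {np.var(y_adj, ddof=1):.1f}")
```

Lower variance in the adjusted metric is what produces the narrower confidence intervals and smaller p-values mentioned above.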

Example: Multiple Comparisons Correction

from tests.parametric import TTest
from utils.corrections import adjust_pvalues

# Test multiple variants
test = TTest(alpha=0.05)
results = test.compare([control, treatment_a, treatment_b, treatment_c])

# Apply Bonferroni correction
adjusted = adjust_pvalues(results, method="bonferroni")

for r in adjusted:
    print(f"{r.name_1} vs {r.name_2}: p={r.pvalue:.4f}, significant={r.reject}")
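To see the mechanics of the correction itself, the same adjustment can be done on raw p-values with statsmodels (this is the general technique, not ABTK's `adjust_pvalues` helper):

```python
# P-value adjustment with statsmodels' multipletests.
from statsmodels.stats.multitest import multipletests

pvalues = [0.010, 0.020, 0.030, 0.500]  # e.g. control vs. variants A-D

# Bonferroni multiplies each p-value by the number of tests (capped at 1)
reject, adjusted, _, _ = multipletests(pvalues, alpha=0.05, method="bonferroni")
for p, p_adj, sig in zip(pvalues, adjusted, reject):
    print(f"raw p={p:.3f} -> adjusted p={p_adj:.3f}, significant={sig}")
```

Swapping `method="holm"` or `method="fdr_bh"` gives the Holm and Benjamini-Hochberg procedures the library also supports.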

Example: Quantile Analysis

from tests.nonparametric import BootstrapTest
from utils.quantile_analysis import QuantileAnalyzer

# Analyze treatment effects at different quantiles
bootstrap = BootstrapTest(alpha=0.05, n_samples=10000)
analyzer = QuantileAnalyzer(
    test=bootstrap,
    quantiles=[0.25, 0.5, 0.75, 0.9, 0.95]
)

results = analyzer.compare([control, treatment])
result = results[0]

# View results as table
print(result.to_dataframe())

# Find where effects are significant
sig_quantiles = result.significant_quantiles()
print(f"Effects significant at: {sig_quantiles}")
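Under the hood, a quantile treatment effect is just the bootstrap idea applied to a quantile instead of a mean: resample both groups and track the difference in, say, the 90th percentile. A numpy-only sketch (not the `QuantileAnalyzer` implementation), on synthetic skewed data:

```python
# Sketch of a single quantile treatment effect via the bootstrap.
import numpy as np

rng = np.random.default_rng(42)
control = rng.lognormal(mean=4.6, sigma=0.5, size=500)  # skewed revenue-like data
treatment = control * 1.10  # a uniform 10% lift, for illustration

q = 0.9
diffs = np.empty(2000)
for i in range(diffs.size):
    c = rng.choice(control, size=control.size, replace=True)
    t = rng.choice(treatment, size=treatment.size, replace=True)
    diffs[i] = np.quantile(t, q) - np.quantile(c, q)

lo, hi = np.percentile(diffs, [2.5, 97.5])
print(f"P90 effect CI: [{lo:.1f}, {hi:.1f}]")
```

Repeating this over a grid of quantiles is essentially what the analyzer's table of results summarizes.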

Test Selection Decision Tree

Do you have proportions (CTR, CVR)?
├─ Yes → ZTest
└─ No (continuous metric)
    └─ Do you have paired data?
        ├─ Yes
        │   ├─ Assume normality? → PairedTTest
        │   └─ No assumptions → PairedBootstrapTest
        └─ No (independent samples)
            └─ Do you have covariates?
                ├─ Yes
                │   ├─ Multiple covariates? → AncovaTest
                │   ├─ One covariate, normal → CupedTTest
                │   └─ One covariate, non-parametric → PostNormedBootstrapTest
                └─ No covariates
                    ├─ Assume normality? → TTest
                    └─ No assumptions → BootstrapTest

See Test Selection Guide for detailed recommendations.
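The decision tree above can also be expressed as a small helper; the function below is hypothetical (not part of the ABTK API) and simply encodes the branches as conditionals:

```python
# The test-selection decision tree as code (hypothetical helper).
def suggest_test(proportions=False, paired=False, normal=True, n_covariates=0):
    """Return the suggested ABTK test name for a given experiment setup."""
    if proportions:
        return "ZTest"
    if paired:
        return "PairedTTest" if normal else "PairedBootstrapTest"
    if n_covariates >= 2:
        return "AncovaTest"
    if n_covariates == 1:
        return "CupedTTest" if normal else "PostNormedBootstrapTest"
    return "TTest" if normal else "BootstrapTest"

print(suggest_test(proportions=True))           # ZTest
print(suggest_test(paired=True, normal=False))  # PairedBootstrapTest
print(suggest_test(n_covariates=2))             # AncovaTest
```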

Requirements

  • Python >= 3.8
  • numpy >= 1.20.0
  • scipy >= 1.7.0
  • pandas >= 1.3.0
  • statsmodels >= 0.13.0

Optional:

  • matplotlib >= 3.5.0 (for visualization)

Installation Options

# Basic installation
pip install abtk

# Development installation (with pytest, black, etc.)
pip install -e ".[dev]"

# With visualization support
pip install -e ".[viz]"

Running Tests

# Run all tests
pytest unit_tests/

# Run with coverage
pytest --cov=. unit_tests/

# Run specific test file
pytest unit_tests/test_ancova.py

Project Structure

abtk/
├── core/                    # Core data structures
│   ├── data_types.py        # SampleData, ProportionData
│   ├── test_result.py       # TestResult
│   └── quantile_test_result.py
├── tests/
│   ├── parametric/          # Parametric tests
│   └── nonparametric/       # Nonparametric tests
├── utils/                   # Utilities
│   ├── corrections.py       # Multiple comparisons
│   ├── quantile_analysis.py # Quantile treatment effects
│   └── visualization.py     # Plotting (optional)
├── unit_tests/              # Unit tests
├── examples/                # Runnable examples
└── docs/                    # Documentation

Contributing

We welcome contributions! See CONTRIBUTING.md for guidelines.

License

MIT License - see LICENSE for details.

Citation

If you use ABTK in your research, please cite:

@software{abtk2024,
  title={ABTK: A/B Testing Toolkit},
  author={Your Name},
  year={2024},
  url={https://github.com/yourusername/abtk}
}

Acknowledgments

ABTK implements methods from:

  • Deng et al. (2013) - CUPED methodology
  • Benjamini & Hochberg (1995) - FDR control
  • And many other contributions to A/B testing literature

Ready to get started? → Check out the Getting Started Guide
