No project description provided

These details have not been verified by PyPI

Project description

cluster-experiments

License

cluster-experiments is a comprehensive Python library for end-to-end A/B testing workflows, from experiment design to statistical analysis.

📖 What is cluster-experiments?

cluster-experiments provides a complete toolkit for designing, running, and analyzing experiments, with particular strength in handling clustered randomization and complex experimental designs. Originally developed to address challenges in switchback experiments and scenarios with network effects where standard randomization isn't feasible, it has evolved into a general-purpose experimentation framework supporting both simple A/B tests and other randomization designs.

Why "cluster"?

The name reflects the library's origins in handling cluster-randomized experiments, where randomization happens at a group level (e.g., stores, cities, time periods) rather than at the individual level. This is critical when:

Spillover/Network Effects: Treatment of one unit affects others (e.g., testing driver incentives in ride-sharing)
Operational Constraints: You can't randomize individuals (e.g., testing restaurant menu changes)
Switchback Designs: Treatment alternates over time periods within the same unit

While the library is aimed at these scenarios, it's equally capable of handling standard A/B tests with individual-level randomization.

Key Features

Experiment Design

Power Analysis & Sample Size Calculation

Simulation-based (Monte Carlo) for any design complexity
Analytical (CLT-based) for standard designs
Minimum Detectable Effect (MDE) estimation

Multiple Experimental Designs

Standard A/B tests with individual randomization
Cluster-randomized experiments
Switchback/crossover experiments
Stratified randomization
Observational studies with Synthetic Control

Statistical Methods

Multiple Analysis Methods

OLS and Clustered OLS regression
GEE (Generalized Estimating Equations)
Mixed Linear Models (MLM)
Delta Method for ratio metrics
Synthetic Control for observational data

Variance Reduction Techniques

CUPED (Controlled-experiment Using Pre-Experiment Data)
CUPAC (Control Using Predictions As Covariates)
Covariate adjustment

Analysis Workflow

Scorecard & Multi-dimensional Analysis

Scorecard Generation: Analyze multiple metrics simultaneously
Multi-dimensional Slicing: Break down results by segments
Multiple Treatment Arms: Compare several treatments at once
Ratio Metrics: Built-in support for conversion rates, averages, etc.
Relative Lift: Analyze effects as percentage changes rather than absolute differences

📦 Installation

pip install cluster-experiments

⚡ Quick Example

Here's how to run an analysis in just a few lines:

import pandas as pd
import numpy as np
from cluster_experiments import AnalysisPlan, Variant

np.random.seed(42)

# 0. Create simple data
N = 1_000
df = pd.DataFrame({
    "variant": np.random.choice(["control", "treatment"], N),
    "orders": np.random.poisson(10, N),
    "visits": np.random.poisson(100, N),
})
df["converted"] = (df["orders"] > 0).astype(int)


# 1. Define your analysis plan
plan = AnalysisPlan.from_metrics_dict({
    "metrics": [
        {"name": "orders", "alias": "revenue", "metric_type": "simple"},
        {"name": "converted", "alias": "conversion", "metric_type": "ratio", "numerator": "converted", "denominator": "visits"}
    ],
    "variants": [
        {"name": "control", "is_control": True},
        {"name": "treatment", "is_control": False}
    ],
    "variant_col": "variant",
    "analysis_type": "ols"
})

# 2. Run analysis on your dataframe
results = plan.analyze(df)
print(results.to_dataframe().head())

Output Example:

  metric_alias control_variant_name treatment_variant_name  control_variant_mean  treatment_variant_mean analysis_type           ate  ate_ci_lower  ate_ci_upper   p_value     std_error     dimension_name dimension_value  alpha
0      revenue              control              treatment              10.08554                9.941061           ols -1.444788e-01 -5.446603e-01  2.557026e-01  0.479186  2.041780e-01  __total_dimension           total   0.05
1   conversion              control              treatment               1.00000                1.000000           ols  1.110223e-16 -1.096504e-16  3.316950e-16  0.324097  1.125902e-16  __total_dimension           total   0.05

Power Analysis

Design your experiment by estimating required sample size and detectable effects. Here's a complete example using analytical (CLT-based) power analysis:

import numpy as np
import pandas as pd
from cluster_experiments import NormalPowerAnalysis

# Create sample historical data
np.random.seed(42)
N = 500

historical_data = pd.DataFrame({
    'user_id': range(N),
    'metric': np.random.normal(100, 20, N),
    'date': pd.to_datetime('2025-10-01') + pd.to_timedelta(np.random.randint(0, 30, N), unit='d')
})

# Initialize analytical power analysis (fast, CLT-based)
power_analysis = NormalPowerAnalysis.from_dict({
    'analysis': 'ols',
    'splitter': 'non_clustered',
    'target_col': 'metric',
    'time_col': 'date'  # Required for mde_time_line
})

# 1. Calculate power for a given effect size
power = power_analysis.power_analysis(historical_data, average_effect=5.0)
print(f"Power for detecting +5 unit effect: {power:.1%}")

# 2. Calculate Minimum Detectable Effect (MDE) for desired power
mde = power_analysis.mde(historical_data, power=0.8)
print(f"Minimum detectable effect at 80% power: {mde:.2f}")

# 3. Power curve: How power changes with effect size
power_curve = power_analysis.power_line(
    historical_data,
    average_effects=[2.0, 4.0, 6.0, 8.0, 10.0]
)
print(power_curve)

# 4. MDE timeline: How MDE changes with experiment length
mde_timeline = power_analysis.mde_time_line(
    historical_data,
    powers=[0.8],
    experiment_length=[7, 14, 21, 30]
)

Output:

Power for detecting +5 unit effect: 72.7%
Minimum detectable effect at 80% power: 5.46
{2.0: 0.18, 4.0: 0.54, 6.0: 0.87, 8.0: 0.98, 10.0: 1.00}

Key methods:

power_analysis(): Calculate power for a given effect
mde(): Calculate minimum detectable effect
power_line(): Generate power curves across effect sizes
mde_time_line(): Calculate MDE for different experiment lengths

For simulation-based power analysis (for complex designs), see the Power Analysis Guide.

📚 Documentation

For detailed guides, API references, and advanced examples, visit our documentation.

Core Concepts

The library is built around three main components:

1. Splitter - Define how to randomize

Choose how to split your data into control and treatment groups:

NonClusteredSplitter: Standard individual-level randomization
ClusteredSplitter: Cluster-level randomization
SwitchbackSplitter: Time-based alternating treatments
StratifiedClusteredSplitter: Balance randomization across strata

2. Analysis - Measure the impact

Select the appropriate statistical method for your design:

OLSAnalysis: Standard regression for A/B tests
ClusteredOLSAnalysis: Clustered standard errors for cluster-randomized designs
TTestClusteredAnalysis: T-tests on cluster-aggregated data
GeeExperimentAnalysis: GEE for correlated observations
SyntheticControlAnalysis: Observational studies with synthetic controls

3. AnalysisPlan - Orchestrate your analysis

Define your complete analysis workflow:

Specify metrics (simple and ratio)
Define variants and dimensions
Configure hypothesis tests
Generate comprehensive scorecards

For power analysis, combine these with:

Perturbator: Simulate treatment effects for power calculations
PowerAnalysis: Estimate statistical power and sample sizes

🛠️ Advanced Features

Variance Reduction (CUPED/CUPAC)

Reduce variance and detect smaller effects by leveraging pre-experiment data. Use historical metrics as covariates to control for pre-existing differences between groups.

Use cases:

Have pre-experiment metrics for your users/clusters
Want to detect smaller treatment effects
Need more sensitive tests with same sample size

See the CUPAC Example for detailed implementation.

Cluster Randomization

Handle experiments where randomization occurs at group level (stores, cities, regions) rather than individual level. Essential for managing spillover effects and operational constraints.

See the Cluster Randomization Guide for details.

Switchback Experiments

Design and analyze time-based crossover experiments where the same units receive both control and treatment at different times.

See the Switchback Example for implementation.

🌟 Support

⭐ Star us on GitHub
📝 Read the documentation
🐛 Report issues on our issue tracker
💬 Join discussions in GitHub Discussions

📚 Citation

If you use cluster-experiments in your research, please cite:

@software{cluster_experiments,
  author = {David Masip and contributors},
  title = {cluster-experiments: A Python library for designing and analyzing experiments},
  url = {https://github.com/david26694/cluster-experiments},
  year = {2022}
}

Project details

These details have not been verified by PyPI

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language

Release history Release notifications | RSS feed

This version

0.30.0

Mar 3, 2026

0.29.0

Dec 22, 2025

0.28.0

Oct 17, 2025

0.27.0

Aug 24, 2025

0.26.0

May 16, 2025

0.25.0

Feb 11, 2025

0.24.0

Jan 15, 2025

0.23.0

Dec 20, 2024

0.22.0

Dec 20, 2024

0.21.0

Dec 17, 2024

0.20.2

Dec 13, 2024

0.20.1

Dec 12, 2024

0.20.0

Nov 7, 2024

0.19.0

Jun 21, 2024

0.18.0

Jun 14, 2024

0.17.0

Jun 14, 2024

0.16.0

Jun 12, 2024

0.15.0

May 27, 2024

0.14.1

May 3, 2024

0.14.0

Mar 4, 2024

0.13.0

Feb 28, 2024

0.12.0

Feb 6, 2024

0.11.0

Jan 12, 2024

0.10.4

Jun 23, 2023

0.10.3

May 31, 2023

0.10.2

May 31, 2023

0.10.1

May 29, 2023

0.10.0

May 26, 2023

0.9.1

May 26, 2023

0.9.0

May 26, 2023

0.8.5

May 17, 2023

0.8.4

May 17, 2023

0.8.3

May 17, 2023

0.8.2

May 12, 2023

0.8.1

May 10, 2023

0.8.0

May 10, 2023

0.7.1

May 9, 2023

0.7.0

May 9, 2023

0.6.5

May 8, 2023

0.6.4

May 8, 2023

0.6.3

Apr 28, 2023

0.6.2

Apr 14, 2023

0.6.1

Apr 12, 2023

0.6.0

Mar 9, 2023

0.5.4

Mar 1, 2023

0.5.3

Feb 10, 2023

0.5.2

Feb 10, 2023

0.5.1

Feb 8, 2023

0.5.0

Dec 30, 2022

0.4.1

Dec 27, 2022

0.4.0

Dec 23, 2022

0.3.5

Dec 23, 2022

0.3.4

Dec 19, 2022

0.3.3

Dec 9, 2022

0.3.2

Nov 15, 2022

0.3.1

Nov 9, 2022

0.3.0

Nov 7, 2022

0.2.8

Nov 2, 2022

0.2.7

Oct 26, 2022

0.2.6

Oct 25, 2022

0.2.5

Oct 24, 2022

0.2.4

Oct 24, 2022

0.2.3

Oct 21, 2022

0.2.2

Oct 11, 2022

0.2.1

Oct 7, 2022

0.2.0

Oct 3, 2022

0.1.7

Oct 2, 2022

0.1.6

Sep 30, 2022

0.1.5

Sep 29, 2022

0.1.4

Sep 26, 2022

0.1.3

Sep 19, 2022

0.1.2

Sep 13, 2022

0.1.1

Sep 5, 2022

0.1.0

Sep 2, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cluster_experiments-0.30.0.tar.gz (93.4 kB view details)

Uploaded Mar 3, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cluster_experiments-0.30.0-py3-none-any.whl (116.1 kB view details)

Uploaded Mar 3, 2026 Python 3

File details

Details for the file cluster_experiments-0.30.0.tar.gz.

File metadata

Download URL: cluster_experiments-0.30.0.tar.gz
Upload date: Mar 3, 2026
Size: 93.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for cluster_experiments-0.30.0.tar.gz
Algorithm	Hash digest
SHA256	`e05df44aa81ae635d71a7339bfaa7ca55906b20b19152b4bee194b71fea8ac76`
MD5	`0cb4912b817217e3b7a4775d2d1da93f`
BLAKE2b-256	`4a4f4c362739d6336f75aef9d6049b1b9a97937090dce63c28dd69ea73240e07`

See more details on using hashes here.

File details

Details for the file cluster_experiments-0.30.0-py3-none-any.whl.

File metadata

Download URL: cluster_experiments-0.30.0-py3-none-any.whl
Upload date: Mar 3, 2026
Size: 116.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for cluster_experiments-0.30.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c8a69310741f536d8ba86bf9034d790e1c3c518a6415a1524b80950bba955c8e`
MD5	`64b6c4fcb9c2d21dd809583802c830f0`
BLAKE2b-256	`29dea53315e22ac64241997ebb59d037b08d6d58de24cb1df83fb9ea866840ab`

See more details on using hashes here.

cluster-experiments 0.30.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

cluster-experiments

📖 What is cluster-experiments?

Why "cluster"?

Key Features

Experiment Design

Statistical Methods

Analysis Workflow

📦 Installation

⚡ Quick Example

Power Analysis

📚 Documentation

Core Concepts

1. Splitter - Define how to randomize

2. Analysis - Measure the impact

3. AnalysisPlan - Orchestrate your analysis

🛠️ Advanced Features

Variance Reduction (CUPED/CUPAC)

Cluster Randomization

Switchback Experiments

🌟 Support

📚 Citation

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes