Toolkit to simulate and analyze AB tests

These details have not been verified by PyPI

Project links

Homepage

Project description

ab-test-toolkit

Install

pip install ab_test_toolkit

imports

from ab_test_toolkit.generator import (
    generate_binary_data,
    generate_continuous_data,
    data_to_contingency,
)
from ab_test_toolkit.power import (
    simulate_power_binary,
    sample_size_binary,
    simulate_power_continuous,
    sample_size_continuous,
)
from ab_test_toolkit.plotting import (
    plot_power,
    plot_distribution,
    plot_betas,
    plot_binary_power,
)

Binary target (e.g. conversion rate experiments)

Sample size:

We can calculate the sample size required with the function “sample_size_binary”. Input needed is:

Conversion rate control: cr0
Conversion rate variant for minimal detectable effect: cr1 (for example, if we have a conversion rate of 1% and want to detect an effect of at least 20% relate, we would set cr0=0.010 and cr1=0.012)
Significance threshold: alpha. Usually set to 0.05, this defines our tolerance for falsely detecting an effect if in reality there is none (alpha=0.05 means that in 5% of the cases we will detect an effect even though the samples for control and variant are drawn from the exact same distribution).
Statistical power. Usually set to 0.8. This means that if the effect is the minimal effect specified above, we have an 80% probability of identifying it at statistically significant (and hence 20% of not idenfitying it).
one_sided: If the test is one-sided (one_sided=True) or if it is two-sided (one_sided=False). As a rule of thumb, if there are very strong reasons to believe that the variant cannot be inferior to the control, we can use a one sided test. In case of doubts, using a two sided test is better.

let us calculate the sample size for the following example:

n_sample = sample_size_binary(
    cr0=0.01,
    cr1=0.012,
    alpha=0.05,
    power=0.8,
    one_sided=True,
)
print(f"Required sample size per variant is {int(n_sample)}.")

Required sample size per variant is 33560.

n_sample_two_sided = sample_size_binary(
    cr0=0.01,
    cr1=0.012,
    alpha=0.05,
    power=0.8,
    one_sided=False,
)
print(
    f"For the two-sided experiment, required sample size per variant is {int(n_sample_two_sided)}."
)

For the two-sided experiment, required sample size per variant is 42606.

Power simulations

What happens if we use a smaller sample size? And how can we understand the sample size?

Let us analyze the statistical power with synthethic data. We can do this with the simulate_power_binary function. We are using some default argument here, see this page for more information.

# simulation = simulate_power_binary()

Note: The simulation object return the total sample size, so we need to split it per variant.

# simulation

Finally, we can plot the results (note: the plot function show the sample size per variant):

# plot_power(
#     simulation,
#     added_lines=[{"sample_size": sample_size_binary(), "label": "Chi2"}],
# )

The problem of peaking

wip

Contunious target (e.g. average)

Here we assume normally distributed data (which usually holds due to the central limit theorem).

Sample size

We can calculate the sample size required with the function “sample_size_continuous”. Input needed is:

mu1: Mean of the control group
mu2: Mean of the variant group assuming minimal detectable effect (e.g. if the mean it 5, and we want to detect an effect as small as 0.05, mu1=5.00 and mu2=5.05)
sigma: Standard deviation (we assume the same for variant and control, should be estimated from historical data)
alpha, power, one_sided: as in the binary case

Let us calculate an example:

n_sample = sample_size_continuous(
    mu1=5.0, mu2=5.05, sigma=1, alpha=0.05, power=0.8, one_sided=True
)
print(f"Required sample size per variant is {int(n_sample)}.")

Let us also do some simulations. These show results for the t-test as well as bayesian testing (only 1-sided).

# simulation = simulate_power_continuous()

# plot_power(
#     simulation,
#     added_lines=[
#         {"sample_size": continuous_sample_size(), "label": "Formula"}
#     ],
# )

Data Generators

We can also use the data generators for example data to analyze or visualuze as if they were experiments.

Distribution without effect:

df_continuous = generate_continuous_data(effect=0)
# plot_distribution(df_continuous)

Distribution with effect:

df_continuous = generate_continuous_data(effect=1)
# plot_distribution(df_continuous)

Visualizations

Plot beta distributions for a contingency table:

df = generate_binary_data()
df_contingency = data_to_contingency(df)
# fig = plot_betas(df_contingency, xmin=0, xmax=0.04)

False positives

# simulation = simulate_power_binary(cr0=0.01, cr1=0.01, one_sided=False)

# plot_power(simulation, is_effect=False)

# simulation = simulate_power_binary(cr0=0.01, cr1=0.01, one_sided=True)
# plot_power(simulation, is_effect=False)

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.0.15

Jun 20, 2023

0.0.14

Jun 19, 2023

0.0.13

Jun 16, 2023

0.0.12

Jun 15, 2023

0.0.11

Jun 14, 2023

0.0.10

Jun 14, 2023

This version

0.0.9

Jun 14, 2023

0.0.8

Jun 14, 2023

0.0.7

Jun 13, 2023

0.0.6

Jun 13, 2023

0.0.5

Jun 10, 2023

0.0.3

Jun 5, 2023

0.0.2

Jun 4, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ab-test-toolkit-0.0.9.tar.gz (15.7 kB view details)

Uploaded Jun 14, 2023 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ab_test_toolkit-0.0.9-py3-none-any.whl (15.5 kB view details)

Uploaded Jun 14, 2023 Python 3

File details

Details for the file ab-test-toolkit-0.0.9.tar.gz.

File metadata

Download URL: ab-test-toolkit-0.0.9.tar.gz
Upload date: Jun 14, 2023
Size: 15.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.8.16

File hashes

Hashes for ab-test-toolkit-0.0.9.tar.gz
Algorithm	Hash digest
SHA256	`ad434fef9a869b1b8a6c66e7718b531d61c5593021a9afc831225232a81fae71`
MD5	`77047f9eda11a296cdca221698a9f8a2`
BLAKE2b-256	`48e32e2065804d84d6c11fcfdfe48ce4015356cacde1d994e4ccafa74369f8e3`

See more details on using hashes here.

File details

Details for the file ab_test_toolkit-0.0.9-py3-none-any.whl.

File metadata

Download URL: ab_test_toolkit-0.0.9-py3-none-any.whl
Upload date: Jun 14, 2023
Size: 15.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.8.16

File hashes

Hashes for ab_test_toolkit-0.0.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6af39d71d5b0861fe7532b5f03f1728003c1a0a4291decfbce0680867acf11c8`
MD5	`a393d02eb5cf4526ae9d6dd0350e09f1`
BLAKE2b-256	`368669a6a55493054f5268568342d1497bcaa1e2a3df49faf1fde7c17c235dbc`

See more details on using hashes here.

ab-test-toolkit 0.0.9

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

ab-test-toolkit

Install

imports

Binary target (e.g. conversion rate experiments)

Sample size:

Power simulations

The problem of peaking

Contunious target (e.g. average)

Sample size

Data Generators

Visualizations

False positives

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes