Command line utilities for statistics, odds, and probabilities

These details have not been verified by PyPI

Project description

pythodds

A command-line utility and Python library for calculating statistics, odds, and probabilities.

Features

Binomial Distribution: Calculate PMF, CDF, and survival functions for binomial distributions
Birthday Problem: Compute collision probabilities for uniform and non-uniform pools, find minimum group sizes, and generate probability tables
Normal Distribution: Compute PDF, CDF, survival probabilities, interval probabilities, and the inverse CDF (percent-point function) for a Gaussian N(μ, σ²) distribution
Expected Value: Compute E[X], Var(X), SD(X), Shannon entropy, and the moment generating function for discrete probability distributions; supports inline input or CSV/JSON files
Poisson Distribution: Compute PMF, CDF, and survival probabilities, find minimum event counts for a target cumulative probability, and generate full probability tables
Streak Probability: Compute the probability of at least one consecutive run of successes and the expected length of the longest streak
Monte Carlo Simulator: Empirically estimate probabilities for binomial, birthday, streak, and Poisson experiments with confidence intervals and analytical comparison
Command-line Interface: Easy-to-use CLI tools (binom, birthday, normal, expected, poisson, streak, and simulate commands)
Pure Python: No external dependencies required for core calculations

Installation

Install from PyPI:

pip install pythodds

Or install from source:

git clone https://github.com/ncarsner/pythodds.git
cd pythodds
pip install -e .

Usage

Command Line

`binom` — Binomial Distribution

# Calculate binomial distribution probabilities
binom -n 10 -k 3 -p 0.4

# Specify a target and minimum probability threshold
binom -n 100 -k 30 -p 0.35 --target 40 --min-prob 0.05

`birthday` — Birthday Problem Collision Probability

Computes the probability that at least two items in a group share the same value when drawn from a pool of equally-likely possibilities. Defaults to a pool size of 365.25 (calendar days).

# P(duplicate birthday) in a group of 23 people
birthday -n 23

# Find the minimum group size to reach 50% collision probability
birthday --target-prob 0.50

# Print a probability table for group sizes 1–40
birthday --range 1 40

# Custom pool size (e.g. 7-digit phone numbers)
birthday -p 10_000_000 -n 1180

# Non-uniform pool via relative weights
birthday --group-size 30 --weights 0.10,0.15,0.20,0.30,0.25

# Output as JSON or CSV
birthday --range 1 60 --format <json|csv>

Options:

Flag	Long form	Description
`-p`	`--pool-size`	Pool size — number of equally-likely outcomes (default: `365.25`)
`-n`	`--group-size`	Compute collision probability for exactly this group size
`-t`	`--target-prob`	Find the minimum group size reaching this probability
`-r`	`--range MIN MAX`	Print a probability table for group sizes MIN through MAX
`-w`	`--weights`	Comma-separated relative frequencies for a non-uniform pool
`-f`	`--format`	Output format: `table` (default), `json`, or `csv`
`-P`	`--precision`	Decimal places for printed probabilities (default: `6`)

`normal` — Normal (Gaussian) Distribution

Computes PDF, CDF, survival probabilities, interval probabilities, and the inverse CDF (percent-point function) for a N(μ, σ²) distribution. Uses only the Python standard library.

# PDF, P(X ≤ 1.96), and P(X ≥ 1.96) for the standard normal
normal -x 1.96 -m 0 -s 1

# Same calculation for a custom distribution
normal -x 75 -m 70 -s 5

# P(−1.96 ≤ X ≤ 1.96)
normal --between -1.96 1.96 -m 0 -s 1

# Find the value x such that P(X ≤ x) = 0.975 (inverse CDF)
normal --quantile 0.975 -m 0 -s 1

Options:

Flag	Long form	Description
`-x`	`--value`	Compute PDF, P(X ≤ x), and P(X ≥ x) for this value
	`--between LOW HIGH`	Compute P(LOW ≤ X ≤ HIGH)
`-q`	`--quantile`	Find x such that P(X ≤ x) = P (inverse CDF)
`-m`	`--mean`	Distribution mean μ (default: `0`)
`-s`	`--std`	Distribution standard deviation σ (default: `1`)
`-P`	`--precision`	Decimal places for printed values (default: `6`)

-x/--value, --between, and -q/--quantile are mutually exclusive; one is required.

`expected` — Expected Value & Discrete Distribution Statistics

Computes E[X], Var(X), SD(X), Shannon entropy, and optionally the moment generating function (MGF) for a discrete probability distribution supplied inline or via a CSV/JSON file.

# E[X] and statistics for a simple discrete distribution
expected --outcomes 0,1,5,10 --probs 0.50,0.25,0.15,0.10

# Non-uniform six-sided die
expected --outcomes 1,2,3,4,5,6 --probs 0.1,0.2,0.3,0.2,0.1,0.1

# Load distribution from a CSV or JSON file
expected --file payouts.csv

# Also compute the MGF at t=0.5
expected --outcomes 0,1 --probs 0.3,0.7 --mgf 0.5

Options:

Flag	Long form	Description
`-o`	`--outcomes`	Comma-separated outcome values
`-f`	`--file`	CSV or JSON file with outcomes and probabilities
`-p`	`--probs`	Comma-separated probabilities (required with `--outcomes`)
	`--mgf T`	Also compute the moment generating function M_X(t) at t=T
`-P`	`--precision`	Decimal places for printed values (default: `6`)

--outcomes and --file are mutually exclusive; one is required. --probs is required when using --outcomes.

`poisson` — Poisson Distribution

Computes PMF, CDF, and survival probabilities for a Poisson(λ) distribution. Models rare, independent events occurring at a known average rate — server errors per hour, calls per minute, defects per batch, and so on.

# P(X=7), P(X≤7), and P(X≥7) for λ=3.0
poisson -l 3.0 -k 7

# Find the minimum k such that P(X ≤ k) >= 0.95
poisson -l 3.0 -t 0.95

# Print a probability table for k = 0 through 15
poisson -l 3.0 -r 0 15

# Also show P(X ≥ 5) and whether it meets a 1% threshold
poisson -l 0.5 -k 2 --target 5 --min-prob 0.01

# Output as JSON or CSV
poisson -l 3.0 -r 0 20 -f json
poisson -l 3.0 -r 0 20 -f csv

Options:

Flag	Long form	Description
`-l`	`--rate`	Average event rate λ (required, must be > 0)
`-k`	`--events`	Compute PMF and CDF for exactly this event count
`-t`	`--target-prob`	Find the minimum k such that P(X ≤ k) ≥ PROB
`-r`	`--range MIN MAX`	Print a probability table for event counts MIN through MAX
	`--target`	With `-k`: also print P(X ≥ T) for this target count
	`--min-prob`	With `--target`: report whether P(X ≥ T) meets this threshold
`-f`	`--format`	Output format: `table` (default), `json`, or `csv`
`-P`	`--precision`	Decimal places for printed probabilities (default: `6`)

`streak` — Streak / Consecutive Run Probability

Computes the exact probability of at least one run of k consecutive successes in n independent Bernoulli trials, and the expected length of the longest run. Uses dynamic programming for exact O(n·k) computation.

# P(at least one run of 5+ heads in 100 fair coin flips)
streak -n 100 -k 5 -p 0.5

# P(at least one hitting streak of 10+ games over a 162-game season at .320)
streak -n 162 -k 10 -p 0.32

# Expected length of the longest win streak in 50 trials at 40% success rate
streak -n 50 -p 0.40 --longest

Options:

Flag	Long form	Description
`-n`	`--trials`	Total number of independent trials (required)
`-p`	`--prob`	Success probability per trial, 0–1 (required)
`-k`	`--streak-length`	Compute P(at least one run of K consecutive successes)
	`--longest`	Compute E[length of longest run of consecutive successes]
`-P`	`--precision`	Decimal places for printed probabilities (default: `6`)

-k/--streak-length and --longest are mutually exclusive; one is required.

`simulate` — Monte Carlo Probability Simulator

Runs repeated random experiments to estimate probabilities empirically, with optional confidence intervals and analytical comparison against binom, birthday, poisson, and streak.

# Estimate P(X >= 5) for Binomial(n=10, p=0.4) over 100,000 trials
simulate --experiment binomial --params n=10 k=5 p=0.4 --trials 100000

# Birthday collision probability for a group of 23 with a 95% confidence interval
simulate --experiment birthday --params pool=365 group=23 --confidence

# Streak probability: P(run of 5+ successes in 100 trials, p=0.5)
simulate --experiment streak --params n=100 k=5 p=0.5 --trials 50000

# Poisson: P(X >= 7) for λ=3.0 with a fixed seed
simulate --experiment poisson --params lam=3.0 k=7 --seed 42

# Auto-size trial count to achieve a target standard error of 0.005
simulate --experiment binomial --params n=20 k=8 p=0.5 --scale 0.005

Options:

Flag	Long form	Description
`-e`	`--experiment`	Experiment type: `binomial`, `birthday`, `streak`, or `poisson` (required)
`-p`	`--params`	Space-separated `KEY=VALUE` experiment parameters (see below)
`-t`	`--trials`	Number of simulation trials (default: 10,000)
	`--scale`	Target standard error; auto-computes `--trials` (overrides `-t`)
`-s`	`--seed`	Random seed for reproducibility
`-c`	`--confidence`	Print 95% Wilson confidence interval
	`--dump`	Output per-trial results as CSV instead of summary
`-f`	`--format`	Summary output format: `table` (default) or `json`
`-P`	`--precision`	Decimal places for printed probabilities (default: `6`)

Required params by experiment:

Experiment	Required params
`binomial`	`n=INT k=INT p=FLOAT`
`birthday`	`pool=INT group=INT`
`streak`	`n=INT k=INT p=FLOAT`
`poisson`	`lam=FLOAT k=INT`

Python Library

Binomial Distribution

from src.utils.binomial_distribution import binomial_pmf, binomial_cdf_le, binomial_cdf_ge

# P(X = 3) for Binomial(n=10, p=0.4)
pmf = binomial_pmf(10, 3, 0.4)

# P(X <= 3) for Binomial(n=10, p=0.4)
cdf = binomial_cdf_le(10, 3, 0.4)

# P(X >= 3) for Binomial(n=10, p=0.4)
survival = binomial_cdf_ge(10, 3, 0.4)

Birthday Problem

from src.utils.birthday_problem import (
    collision_prob_uniform,
    collision_prob_nonuniform,
    min_group_for_prob,
    expected_duplicate_pairs,
)

# P(duplicate) for 23 people in a pool of 365.25
prob = collision_prob_uniform(23, 365.25)

# Minimum group size to reach 50% collision probability
n = min_group_for_prob(0.50, 365.25)

# P(duplicate) with a non-uniform pool
prob_nu = collision_prob_nonuniform(30, [0.10, 0.15, 0.20, 0.30, 0.25])

# Expected number of duplicate pairs
pairs = expected_duplicate_pairs(23, 365.25)

Poisson Distribution

from src.utils.poisson_distribution import (
    poisson_pmf,
    poisson_cdf_le,
    poisson_cdf_ge,
    min_k_for_prob,
)

# P(X = 7) for Poisson(λ=3.0)
pmf = poisson_pmf(7, 3.0)

# P(X ≤ 7) for Poisson(λ=3.0)
cdf = poisson_cdf_le(7, 3.0)

# P(X ≥ 7) for Poisson(λ=3.0)
survival = poisson_cdf_ge(7, 3.0)

# Minimum k such that P(X ≤ k) >= 0.95
k = min_k_for_prob(0.95, 3.0)

Streak Probability

from src.utils.streak_probability import (
    prob_at_least_one_streak,
    expected_longest_streak,
)

# P(at least one run of 5 consecutive heads in 100 fair coin flips)
p = prob_at_least_one_streak(100, 5, 0.5)

# Expected length of the longest run of successes in 162 trials at .300
e = expected_longest_streak(162, 0.300)

Normal Distribution

from src.utils.normal_gaussian import (
    normal_pdf,
    normal_cdf,
    normal_ppf,
    normal_prob_between,
)

# PDF value at x=1.96 for the standard normal
pdf = normal_pdf(1.96, mu=0.0, sigma=1.0)

# P(X ≤ 1.96)
cdf = normal_cdf(1.96, mu=0.0, sigma=1.0)

# P(X ≥ 1.96)
survival = 1.0 - normal_cdf(1.96, mu=0.0, sigma=1.0)

# P(−1.96 ≤ X ≤ 1.96)
prob = normal_prob_between(-1.96, 1.96, mu=0.0, sigma=1.0)

# Find x such that P(X ≤ x) = 0.975 (inverse CDF)
x = normal_ppf(0.975, mu=0.0, sigma=1.0)

Expected Value

from src.utils.expected_value import (
    expected_value,
    variance,
    std_dev,
    entropy,
    mgf,
    load_file,
)

outcomes = [0, 1, 5, 10]
probs    = [0.50, 0.25, 0.15, 0.10]

# E[X]
ev = expected_value(outcomes, probs)

# Var(X) and SD(X)
var = variance(outcomes, probs)
sd  = std_dev(outcomes, probs)

# Shannon entropy (bits)
H = entropy(probs)

# Moment generating function M_X(t) at t=0.5
M = mgf(outcomes, probs, t=0.5)

# Load a distribution from a CSV or JSON file
outcomes, probs = load_file("payouts.csv")

Monte Carlo Simulator

from src.utils.monte_carlo import (
    simulate_binomial,
    simulate_birthday,
    simulate_streak,
    simulate_poisson,
    wilson_ci,
    standard_error,
)

# Simulate P(X >= 5) for Binomial(10, 0.4) over 100,000 trials
results = simulate_binomial(n=10, k=5, p=0.4, trials=100_000, seed=42)
p_hat = sum(results) / len(results)
se = standard_error(p_hat, len(results))
ci = wilson_ci(p_hat, len(results))

Development

Clone the repository and install in editable mode:

git clone https://github.com/ncarsner/pythodds.git
cd pythodds
pip install -e .

License

This project is licensed under the MIT License - see the LICENSE file for details.

Author

Nicholas Carsner

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.18.0

May 11, 2026

0.17.0

May 3, 2026

0.16.0

Apr 19, 2026

0.15.2

Apr 5, 2026

0.15.0

Apr 4, 2026

0.14.0

Mar 29, 2026

0.13.0

Mar 26, 2026

0.12.0

Mar 22, 2026

0.11.0

Mar 21, 2026

0.10.0

Mar 19, 2026

0.9.1

Mar 17, 2026

0.9.0

Mar 17, 2026

0.8.0

Mar 15, 2026

0.7.3

Mar 14, 2026

0.7.1

Mar 14, 2026

This version

0.7.0

Mar 14, 2026

0.6.0

Mar 12, 2026

0.5.2

Mar 11, 2026

0.5.1

Mar 11, 2026

0.5.0

Mar 11, 2026

0.4.1

Mar 9, 2026

0.4.0

Mar 9, 2026

0.3.1

Mar 6, 2026

0.3.0

Mar 6, 2026

0.2.1

Mar 5, 2026

0.2.0

Mar 5, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pythodds-0.7.0.tar.gz (71.4 kB view details)

Uploaded Mar 14, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pythodds-0.7.0-py3-none-any.whl (30.1 kB view details)

Uploaded Mar 14, 2026 Python 3

File details

Details for the file pythodds-0.7.0.tar.gz.

File metadata

Download URL: pythodds-0.7.0.tar.gz
Upload date: Mar 14, 2026
Size: 71.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.5

File hashes

Hashes for pythodds-0.7.0.tar.gz
Algorithm	Hash digest
SHA256	`e097a146f7e2998d9b2a77d0e3acb95e1ca87b52eef65256ae1912dda854a249`
MD5	`d223620f0399c58a95cee5686694503b`
BLAKE2b-256	`8131914bf92254dedce46aaf8ff642a5a6e835f81d593a596f2ddb4813d7ceee`

See more details on using hashes here.

File details

Details for the file pythodds-0.7.0-py3-none-any.whl.

File metadata

Download URL: pythodds-0.7.0-py3-none-any.whl
Upload date: Mar 14, 2026
Size: 30.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.5

File hashes

Hashes for pythodds-0.7.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6a5de03d39cbbb71e3aaad49e423fc1578ab3ce0b5ebde3566a6a66126f58a3a`
MD5	`494cd5581ed246d71f7a726f4b2d0dfe`
BLAKE2b-256	`15a3fcee0c7fad1b87669b6d2e6c3e7bede461d95d22766df948752cefed6fa3`

See more details on using hashes here.

pythodds 0.7.0

Navigation

Verified details

Maintainers

Meta

Unverified details

Meta

Classifiers

Project description

pythodds

Features

Installation

Usage

Command Line

binom — Binomial Distribution

birthday — Birthday Problem Collision Probability

normal — Normal (Gaussian) Distribution

expected — Expected Value & Discrete Distribution Statistics

poisson — Poisson Distribution

streak — Streak / Consecutive Run Probability

simulate — Monte Carlo Probability Simulator

Python Library

Binomial Distribution

Birthday Problem

Poisson Distribution

Streak Probability

Normal Distribution

Expected Value

Monte Carlo Simulator

Development

License

Author

Project details

Verified details

Maintainers

Meta

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`binom` — Binomial Distribution

`birthday` — Birthday Problem Collision Probability

`normal` — Normal (Gaussian) Distribution

`expected` — Expected Value & Discrete Distribution Statistics

`poisson` — Poisson Distribution

`streak` — Streak / Consecutive Run Probability

`simulate` — Monte Carlo Probability Simulator