A library for estimates of causal effects.

These details have been verified by PyPI

Project links

Source

GitHub Statistics

Maintainers

ckirkle

These details have not been verified by PyPI

Development Status
- 4 - Beta
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Information Analysis

Project description

CausalEstimate

CausalEstimate is a Python library designed for causal inference, providing a suite of methods to estimate treatment effects from observational data. It includes doubly robust techniques such as Targeted Maximum Likelihood Estimation (TMLE), alongside propensity score-based methods like inverse probability weighting (IPW) and matching. The library is built for flexibility and ease of use, integrating seamlessly with pandas and supporting bootstrap-based standard error estimation and multiple estimators in one pass.

Features

Causal inference methods: IPW, AIPW, TMLE, Matching, etc.
Supports multiple effect types: ATE, ATT, Risk Ratio, etc.
Bootstrap standard error estimation and confidence intervals
Common-support filtering and matching (greedy, optimal)
Plotting utilities for distribution checks (e.g., propensity score overlap)

Installation

pip install CausalEstimate

Or for local development:

git clone https://github.com/kirilklein/CausalEstimate.git
cd CausalEstimate
pip install -e .

Usage

1) Single Estimator Usage

You can import any estimator class (e.g., IPW, AIPW, TMLE) and call compute_effect(df) directly. Columns (treatment, outcome, propensity score) are passed to the estimator in its constructor.

import numpy as np
import pandas as pd
from CausalEstimate.estimators import IPW

# Simulate data
np.random.seed(42)
n = 1000
ps = np.random.uniform(0, 1, n)          # true propensity for treatment
treatment = np.random.binomial(1, ps)    # actual treatment assignment
outcome = 2 + 0.5 * treatment + np.random.normal(0, 1, n)

df = pd.DataFrame({
    "ps": ps,
    "treatment": treatment,
    "outcome": outcome
})

# Create an IPW Estimator for ATE
ipw_estimator = IPW(
    effect_type="ATE",
    treatment_col="treatment",
    outcome_col="outcome",
    ps_col="ps",
    # optionally stabilized=True if you want stabilized IP weights
)

results = ipw_estimator.compute_effect(df)
print("IPW estimated effect:", results)

In this case, results is simply a dictionary with the effect estimate computed from a single sample run (n_bootstraps=1). When no bootstrapping is applied, the output includes the key "n_bootstraps": 0.

2) Multi Estimator Usage

If you want to run multiple estimators (e.g., IPW, TMLE, AIPW) on the same dataset in one pass—optionally applying bootstrap or common-support filtering—you can use the MultiEstimator.

from CausalEstimate.estimators import IPW, AIPW, TMLE, MultiEstimator

ipw = IPW(
    effect_type="ATE",
    treatment_col="treatment",
    outcome_col="outcome",
    ps_col="ps"
)
aipw = AIPW(
    effect_type="ATE",
    treatment_col="treatment",
    outcome_col="outcome",
    ps_col="ps",
    probas_t1_col="predicted_outcome_treated",
    probas_t0_col="predicted_outcome_control"
)
tmle = TMLE(
    effect_type="ATE",
    treatment_col="treatment",
    outcome_col="outcome",
    ps_col="ps",
    probas_col="predicted_outcome",
    probas_t1_col="predicted_outcome_treated",
    probas_t0_col="predicted_outcome_control"
)

multi_estimator = MultiEstimator([ipw, aipw, tmle])

# Apply bootstrap (n_bootstraps > 1 triggers bootstrapping), common support, etc.
results = multi_estimator.compute_effects(
    df, 
    n_bootstraps=50,  # If n_bootstraps > 1, bootstrapping is applied.
    apply_common_support=True,
    common_support_threshold=0.05,
    return_bootstrap_samples=True  # Optionally return raw bootstrap estimates.
)
print(results)

Here, results is a dictionary with keys corresponding to each estimator's class name (e.g., "IPW", "AIPW", "TMLE"). For estimators that perform bootstrapping (i.e. when n_bootstraps > 1), the output dictionary includes:

"effect": The mean effect across bootstrap samples.
"std_err": The standard deviation of the bootstrap estimates.
"CI95_lower" and "CI95_upper": The 95% confidence interval computed using the percentile method.
"n_bootstraps": The number of bootstrap samples (e.g., 50).
Optionally, if return_bootstrap_samples=True, a "bootstrap_samples" key with the raw bootstrap estimates (e.g., for the overall effect, treated, and untreated effects).

When no bootstrapping is performed (i.e. n_bootstraps is set to 1), "n_bootstraps" is set to 0 and the bootstrap summary keys (like "std_err", "CI95_lower", "CI95_upper") may not be present.

3) Matching

The library supports both optimal and greedy (a.k.a. eager) matching. For example:

import pandas as pd
import numpy as np
from CausalEstimate.matching import match_optimal, match_eager

df = pd.DataFrame({
    "PID": [101, 102, 103, 202, 203, 204],
    "treatment": [1, 1, 1, 0, 0, 0],
    "ps": [0.30, 0.35, 0.90, 0.31, 0.34, 0.85],
})

# Optimal matching (with caliper=0.05, 1 control per treated)
matched_optimal = match_optimal(
    df, n_controls=1, caliper=0.05,
    treatment_col="treatment", ps_col="ps", pid_col="PID"
)
print("Optimal Matching Results:")
print(matched_optimal)

# Eager (greedy) matching
matched_eager = match_eager(
    df, caliper=0.05,
    treatment_col="treatment", ps_col="ps", pid_col="PID"
)
print("Eager Matching Results:")
print(matched_eager)

Both functions return a DataFrame of matched pairs (or sets), typically with columns like [treated_pid, control_pid, distance].

4) Plotting

CausalEstimate provides basic plotting utilities to visualize distributions of propensity scores or predicted outcome probabilities across treatment vs. control.

Example: Propensity Score Distribution

📌 Generated from this notebook

Propensity Score Distribution

import matplotlib.pyplot as plt
from CausalEstimate.vis.plotting import plot_propensity_score_dist, plot_outcome_proba_dist

# Suppose df has columns "ps", "treatment", and "predicted_outcome"
fig, ax = plot_propensity_score_dist(df, ps_col="ps", treatment_col="treatment")
plt.show()

fig, ax = plot_outcome_proba_dist(df, outcome_proba_col="predicted_outcome", treatment_col="treatment")
plt.show()

Development

See CONTRIBUTING.md for details on setting up a dev environment, running tests, and contributing to this project.

License

CausalEstimate is licensed under the MIT License. See LICENSE for more details.

Contact

GitHub: kirilklein
Email: kikl@di.ku.dk

Please open issues or pull requests if you find any bugs or want to propose enhancements.

Citation

If you use CausalEstimate in your research, please cite it using the following BibTeX entry:

@software{causalestimate,
  author = {Kiril Klein, ...},
  title = {CausalEstimate: A Python Library for Causal Inference},
  year = {2024},
  url = {https://github.com/kirilklein/CausalEstimate},
  version = {X.Y.Z},
  note = {GitHub repository}
}

Project details

These details have been verified by PyPI

Project links

Source

GitHub Statistics

Maintainers

ckirkle

These details have not been verified by PyPI

Development Status
- 4 - Beta
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Information Analysis

Release history Release notifications | RSS feed

0.10.2

Nov 1, 2025

0.10.1

Oct 11, 2025

This version

0.10.0

Sep 13, 2025

0.9.1

Sep 4, 2025

0.9.0

Aug 28, 2025

0.8.5

Aug 28, 2025

0.8.4

Jul 23, 2025

0.8.3

Jul 16, 2025

0.8.2

Jul 16, 2025

0.8.1

May 1, 2025

0.8.0

Mar 13, 2025

0.7.1

Mar 12, 2025

0.7.0

Feb 20, 2025

0.6.1

Feb 20, 2025

0.6.0

Jan 13, 2025

0.5.1

Jan 8, 2025

0.5.0

Dec 4, 2024

0.4.2

Oct 20, 2024

0.4.1

Oct 17, 2024

0.4.0

Oct 13, 2024

0.3.1

Oct 11, 2024

0.3.0

Oct 11, 2024

0.2.2

Oct 4, 2024

0.2.1

Oct 4, 2024

0.2.0

Oct 4, 2024

0.1.5

Sep 30, 2024

0.1.4

Sep 30, 2024

0.1.2

Sep 29, 2024

0.1.0

Sep 29, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

causalestimate-0.10.0.tar.gz (112.5 kB view details)

Uploaded Sep 13, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

causalestimate-0.10.0-py3-none-any.whl (45.0 kB view details)

Uploaded Sep 13, 2025 Python 3

File details

Details for the file causalestimate-0.10.0.tar.gz.

File metadata

Download URL: causalestimate-0.10.0.tar.gz
Upload date: Sep 13, 2025
Size: 112.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for causalestimate-0.10.0.tar.gz
Algorithm	Hash digest
SHA256	`d08b4b27a3c42c5ca3227c45ed9d05a3bf6043052f0be0b8dede35b854db3d14`
MD5	`4b66f8fb4e9940dc041427e38624a562`
BLAKE2b-256	`f3cea4dbcc8a07108f30d576848172154e35c4a7bf307e673a0fd1d66968688f`

See more details on using hashes here.

Provenance

The following attestation bundles were made for causalestimate-0.10.0.tar.gz:

Publisher: publish.yml on kirilklein/CausalEstimate

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: causalestimate-0.10.0.tar.gz
- Subject digest: d08b4b27a3c42c5ca3227c45ed9d05a3bf6043052f0be0b8dede35b854db3d14
- Sigstore transparency entry: 511585998
- Sigstore integration time: Sep 13, 2025
Source repository:
- Permalink: kirilklein/CausalEstimate@198bae4a248e470eec522d5144e7048fad46eaeb
- Branch / Tag: refs/tags/v0.10.0
- Owner: https://github.com/kirilklein
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@198bae4a248e470eec522d5144e7048fad46eaeb
- Trigger Event: push

File details

Details for the file causalestimate-0.10.0-py3-none-any.whl.

File metadata

Download URL: causalestimate-0.10.0-py3-none-any.whl
Upload date: Sep 13, 2025
Size: 45.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for causalestimate-0.10.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`930ac5d2535a5c10c39f03a76a1592b8f8cc7217d0c0d4728477dd2752104568`
MD5	`8547d294fbc2c39df3b4cbbb60977806`
BLAKE2b-256	`e68a88952666a5d3024544b336ce61828698f3959499eea322b854994245346b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for causalestimate-0.10.0-py3-none-any.whl:

Publisher: publish.yml on kirilklein/CausalEstimate

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: causalestimate-0.10.0-py3-none-any.whl
- Subject digest: 930ac5d2535a5c10c39f03a76a1592b8f8cc7217d0c0d4728477dd2752104568
- Sigstore transparency entry: 511586002
- Sigstore integration time: Sep 13, 2025
Source repository:
- Permalink: kirilklein/CausalEstimate@198bae4a248e470eec522d5144e7048fad46eaeb
- Branch / Tag: refs/tags/v0.10.0
- Owner: https://github.com/kirilklein
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@198bae4a248e470eec522d5144e7048fad46eaeb
- Trigger Event: push

CausalEstimate 0.10.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

CausalEstimate

Features

Installation

Usage

1) Single Estimator Usage

2) Multi Estimator Usage

3) Matching

4) Plotting

Example: Propensity Score Distribution

Development

License

Contact

Citation

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance