
diff-diff

A Python library for Difference-in-Differences (DiD) causal inference analysis with an sklearn-like API and statsmodels-style outputs.

Installation

pip install diff-diff

Or install from source:

git clone https://github.com/igerber/diff-diff.git
cd diff-diff
pip install -e .

Quick Start

import pandas as pd
from diff_diff import DifferenceInDifferences

# Create sample data
data = pd.DataFrame({
    'outcome': [10, 11, 15, 18, 9, 10, 11, 13],
    'treated': [1, 1, 1, 1, 0, 0, 0, 0],
    'post': [0, 0, 1, 1, 0, 0, 1, 1]
})

# Fit the model
did = DifferenceInDifferences()
results = did.fit(data, outcome='outcome', treatment='treated', time='post')

# View results
print(results)  # DiDResults(ATT=3.5000*, SE=1.2583, p=0.0367)
results.print_summary()

Output:

======================================================================
          Difference-in-Differences Estimation Results
======================================================================

Observations:                        8
Treated units:                       4
Control units:                       4
R-squared:                      0.9123

----------------------------------------------------------------------
Parameter         Estimate     Std. Err.     t-stat      P>|t|
----------------------------------------------------------------------
ATT                 3.5000       1.2583      2.782      0.0367
----------------------------------------------------------------------

95% Confidence Interval: [0.3912, 6.6088]

Signif. codes: '***' 0.001, '**' 0.01, '*' 0.05, '.' 0.1
======================================================================
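For intuition, the ATT in a 2x2 design is just the difference of group-mean differences. A minimal pandas sketch of that arithmetic on hypothetical numbers (a sanity check of the idea, not a call into the library):

```python
import pandas as pd

# Hypothetical 2x2 data: two groups observed before and after treatment
data = pd.DataFrame({
    'outcome': [10, 12, 20, 22, 10, 12, 14, 16],
    'treated': [1, 1, 1, 1, 0, 0, 0, 0],
    'post':    [0, 0, 1, 1, 0, 0, 1, 1],
})

# Mean outcome in each of the four (group, period) cells
cell_means = data.groupby(['treated', 'post'])['outcome'].mean()

# DiD: (treated post - treated pre) - (control post - control pre)
att = ((cell_means.loc[(1, 1)] - cell_means.loc[(1, 0)])
       - (cell_means.loc[(0, 1)] - cell_means.loc[(0, 0)]))
print(att)  # 6.0
```

With covariates or fixed effects the estimate is no longer this simple difference of means, which is where the regression formulation earns its keep.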

Features

  • sklearn-like API: Familiar fit() interface with get_params() and set_params()
  • Pythonic results: Easy access to coefficients, standard errors, and confidence intervals
  • Multiple interfaces: Column names or R-style formulas
  • Robust inference: Heteroskedasticity-robust (HC1) and cluster-robust standard errors
  • Panel data support: Two-way fixed effects estimator for panel designs
  • Multi-period analysis: Event-study style DiD with period-specific treatment effects
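The robust-inference bullet refers to the standard heteroskedasticity-consistent "sandwich" covariance; HC1 is HC0 scaled by n/(n-k). A minimal numpy sketch of that general formula (an illustration of the estimator itself, not of this library's internals):

```python
import numpy as np

rng = np.random.default_rng(0)
n, k = 200, 3

# Simulated design matrix (intercept + 2 regressors) and outcome
X = np.column_stack([np.ones(n), rng.normal(size=(n, k - 1))])
y = X @ np.array([1.0, 2.0, -0.5]) + rng.normal(size=n)

# OLS coefficients and residuals
beta = np.linalg.solve(X.T @ X, X.T @ y)
resid = y - X @ beta

# HC1 sandwich: (X'X)^-1 [X' diag(e^2) X] (X'X)^-1, scaled by n/(n-k)
bread = np.linalg.inv(X.T @ X)
meat = X.T @ (X * resid[:, None] ** 2)
vcov_hc1 = n / (n - k) * bread @ meat @ bread
robust_se = np.sqrt(np.diag(vcov_hc1))
```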

Usage

Basic DiD with Column Names

from diff_diff import DifferenceInDifferences

did = DifferenceInDifferences(robust=True, alpha=0.05)
results = did.fit(
    data,
    outcome='sales',
    treatment='treated',
    time='post_policy'
)

# Access results
print(f"ATT: {results.att:.4f}")
print(f"Standard Error: {results.se:.4f}")
print(f"P-value: {results.p_value:.4f}")
print(f"95% CI: {results.conf_int}")
print(f"Significant: {results.is_significant}")

Using Formula Interface

# R-style formula syntax
results = did.fit(data, formula='outcome ~ treated * post')

# Explicit interaction syntax
results = did.fit(data, formula='outcome ~ treated + post + treated:post')

# With covariates
results = did.fit(data, formula='outcome ~ treated * post + age + income')

Including Covariates

results = did.fit(
    data,
    outcome='outcome',
    treatment='treated',
    time='post',
    covariates=['age', 'income', 'education']
)

Fixed Effects

Use fixed_effects for low-dimensional categorical controls (creates dummy variables):

# State and industry fixed effects
results = did.fit(
    data,
    outcome='sales',
    treatment='treated',
    time='post',
    fixed_effects=['state', 'industry']
)

# Access fixed effect coefficients
state_coefs = {k: v for k, v in results.coefficients.items() if k.startswith('state_')}

Use absorb for high-dimensional fixed effects (more efficient, uses within-transformation):

# Absorb firm-level fixed effects (efficient for many firms)
results = did.fit(
    data,
    outcome='sales',
    treatment='treated',
    time='post',
    absorb=['firm_id']
)

Combine covariates with fixed effects:

results = did.fit(
    data,
    outcome='sales',
    treatment='treated',
    time='post',
    covariates=['size', 'age'],           # Linear controls
    fixed_effects=['industry'],            # Low-dimensional FE (dummies)
    absorb=['firm_id']                     # High-dimensional FE (absorbed)
)

Cluster-Robust Standard Errors

did = DifferenceInDifferences(cluster='state')
results = did.fit(
    data,
    outcome='outcome',
    treatment='treated',
    time='post'
)

Two-Way Fixed Effects (Panel Data)

from diff_diff.estimators import TwoWayFixedEffects

twfe = TwoWayFixedEffects()
results = twfe.fit(
    panel_data,
    outcome='outcome',
    treatment='treated',
    time='year',
    unit='firm_id'
)
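A two-way fixed effects regression is numerically equivalent to OLS on data demeaned by both unit and time. The within transformation behind it can be sketched with pandas (illustrative; the library's implementation may differ):

```python
import pandas as pd

# Hypothetical balanced panel: 2 firms x 3 years
panel = pd.DataFrame({
    'firm_id': [1, 1, 1, 2, 2, 2],
    'year':    [2000, 2001, 2002, 2000, 2001, 2002],
    'outcome': [5.0, 6.0, 9.0, 3.0, 4.0, 5.0],
})

def two_way_demean(s, units, times):
    """y_it - ybar_i - ybar_t + ybar (two-way within transformation)."""
    return (s
            - s.groupby(units).transform('mean')
            - s.groupby(times).transform('mean')
            + s.mean())

panel['outcome_dm'] = two_way_demean(
    panel['outcome'], panel['firm_id'], panel['year']
)
```

Regressing the demeaned outcome on the (similarly demeaned) treatment indicator reproduces the TWFE coefficient without estimating any dummy coefficients.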

Multi-Period DiD (Event Study)

For settings with multiple pre- and post-treatment periods:

from diff_diff import MultiPeriodDiD

# Fit with multiple time periods
did = MultiPeriodDiD()
results = did.fit(
    panel_data,
    outcome='sales',
    treatment='treated',
    time='period',
    post_periods=[3, 4, 5],      # Periods 3-5 are post-treatment
    reference_period=0           # Reference period for comparison
)

# View period-specific treatment effects
for period, effect in results.period_effects.items():
    print(f"Period {period}: {effect.effect:.3f} (SE: {effect.se:.3f})")

# View average treatment effect across post-periods
print(f"Average ATT: {results.avg_att:.3f}")
print(f"Average SE: {results.avg_se:.3f}")

# Full summary with all period effects
results.print_summary()

Output:

================================================================================
            Multi-Period Difference-in-Differences Estimation Results
================================================================================

Observations:                      600
Pre-treatment periods:             3
Post-treatment periods:            3

--------------------------------------------------------------------------------
Average Treatment Effect
--------------------------------------------------------------------------------
Average ATT       5.2000       0.8234      6.315      0.0000
--------------------------------------------------------------------------------
95% Confidence Interval: [3.5862, 6.8138]

Period-Specific Effects:
--------------------------------------------------------------------------------
Period            Effect     Std. Err.     t-stat      P>|t|
--------------------------------------------------------------------------------
3                 4.5000       0.9512      4.731      0.0000***
4                 5.2000       0.8876      5.858      0.0000***
5                 5.9000       0.9123      6.468      0.0000***
--------------------------------------------------------------------------------

Signif. codes: '***' 0.001, '**' 0.01, '*' 0.05, '.' 0.1
================================================================================
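The period-specific effects come from interacting the treatment indicator with period dummies, omitting the reference period. Building that piece of the design matrix by hand is a useful sketch (hypothetical column names; not the library's internal code):

```python
import pandas as pd

# Hypothetical panel with periods 0-2; period 0 serves as the reference
panel = pd.DataFrame({
    'treated': [1, 1, 1, 0, 0, 0],
    'period':  [0, 1, 2, 0, 1, 2],
})

# Period dummies, dropping the reference period
period_dummies = (pd.get_dummies(panel['period'], prefix='period', dtype=int)
                  .drop(columns='period_0'))

# Interactions: treated x each non-reference period
interactions = period_dummies.mul(panel['treated'], axis=0)
interactions.columns = [f'treated:{c}' for c in period_dummies.columns]
```

Each interaction coefficient is then the treated-vs-control gap in that period relative to the gap in the reference period.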

Working with Results

Export Results

# As dictionary
results.to_dict()
# {'att': 3.5, 'se': 1.26, 'p_value': 0.037, ...}

# As DataFrame
df = results.to_dataframe()

Check Significance

if results.is_significant:
    print(f"Effect is significant at {did.alpha} level")

# Get significance stars
print(f"ATT: {results.att}{results.significance_stars}")
# ATT: 3.5000*

Access Full Regression Output

# All coefficients
results.coefficients
# {'const': 9.5, 'treated': 1.0, 'post': 2.5, 'treated:post': 3.5}

# Variance-covariance matrix
results.vcov

# Residuals and fitted values
results.residuals
results.fitted_values

# R-squared
results.r_squared

Checking Assumptions

Parallel Trends

Simple slope-based test:

from diff_diff.utils import check_parallel_trends

trends = check_parallel_trends(
    data,
    outcome='outcome',
    time='period',
    treatment_group='treated'
)

print(f"Treated trend: {trends['treated_trend']:.4f}")
print(f"Control trend: {trends['control_trend']:.4f}")
print(f"Difference p-value: {trends['p_value']:.4f}")
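Conceptually, the slope-based test fits a linear time trend to each group's pre-period means and compares the slopes. A rough numpy sketch of that comparison (illustrative numbers; not the library's implementation):

```python
import numpy as np

# Hypothetical pre-treatment mean outcomes by period for each group
periods = np.array([0, 1, 2, 3])
treated_means = np.array([10.0, 10.5, 11.1, 11.4])
control_means = np.array([9.0, 9.6, 10.0, 10.5])

# Fit a linear trend to each group; polyfit returns (slope, intercept)
treated_slope = np.polyfit(periods, treated_means, 1)[0]
control_slope = np.polyfit(periods, control_means, 1)[0]

# A small gap is consistent with parallel pre-trends
slope_gap = treated_slope - control_slope
```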

Robust distributional test (Wasserstein distance):

from diff_diff.utils import check_parallel_trends_robust

results = check_parallel_trends_robust(
    data,
    outcome='outcome',
    time='period',
    treatment_group='treated',
    unit='firm_id',              # Unit identifier for panel data
    pre_periods=[2018, 2019],    # Pre-treatment periods
    n_permutations=1000          # Permutations for p-value
)

print(f"Wasserstein distance: {results['wasserstein_distance']:.4f}")
print(f"Wasserstein p-value: {results['wasserstein_p_value']:.4f}")
print(f"KS test p-value: {results['ks_p_value']:.4f}")
print(f"Parallel trends plausible: {results['parallel_trends_plausible']}")

The Wasserstein (Earth Mover's) distance compares the full distribution of outcome changes, not just means. This is more robust to:

  • Non-normal distributions
  • Heterogeneous effects across units
  • Outliers
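The general recipe can be sketched directly with scipy: compute the Wasserstein distance between the two groups' pre-period outcome changes, then obtain a permutation p-value by reshuffling group labels. This illustrates the approach, not necessarily the library's exact procedure:

```python
import numpy as np
from scipy.stats import wasserstein_distance

rng = np.random.default_rng(42)

# Hypothetical per-unit outcome changes over the pre-period
treated_changes = rng.normal(0.5, 1.0, size=50)
control_changes = rng.normal(0.5, 1.0, size=50)

# Observed distance between the two distributions of changes
observed = wasserstein_distance(treated_changes, control_changes)

# Permutation test: reshuffle group labels and recompute the distance
pooled = np.concatenate([treated_changes, control_changes])
perm_stats = []
for _ in range(1000):
    rng.shuffle(pooled)
    perm_stats.append(wasserstein_distance(pooled[:50], pooled[50:]))
p_value = np.mean(np.array(perm_stats) >= observed)
```

A large p-value means the observed distributional gap is typical of random group assignments, i.e. consistent with parallel trends.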

Equivalence testing (TOST):

from diff_diff.utils import equivalence_test_trends

results = equivalence_test_trends(
    data,
    outcome='outcome',
    time='period',
    treatment_group='treated',
    unit='firm_id',
    equivalence_margin=0.5       # Define "practically equivalent"
)

print(f"Mean difference: {results['mean_difference']:.4f}")
print(f"TOST p-value: {results['tost_p_value']:.4f}")
print(f"Trends equivalent: {results['equivalent']}")
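TOST flips the usual hypothesis: equivalence is the alternative, and it is concluded only when the trend difference is significantly above -margin and significantly below +margin. A minimal scipy sketch of the two one-sided t-tests (hypothetical data; the library's exact procedure may differ):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(7)

# Hypothetical per-unit pre-trend differences between groups
diffs = rng.normal(0.05, 0.4, size=80)
margin = 0.5

n = diffs.size
mean, se = diffs.mean(), diffs.std(ddof=1) / np.sqrt(n)

# Test 1 -- H0: diff <= -margin  vs  H1: diff > -margin
p_lower = stats.t.sf((mean + margin) / se, df=n - 1)
# Test 2 -- H0: diff >= +margin  vs  H1: diff < +margin
p_upper = stats.t.cdf((mean - margin) / se, df=n - 1)

# TOST p-value is the larger of the two one-sided p-values
tost_p = max(p_lower, p_upper)
equivalent = tost_p < 0.05
```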

API Reference

DifferenceInDifferences

DifferenceInDifferences(
    robust=True,      # Use HC1 robust standard errors
    cluster=None,     # Column for cluster-robust SEs
    alpha=0.05        # Significance level for CIs
)

Methods:

| Method | Description |
| --- | --- |
| fit(data, outcome, treatment, time, ...) | Fit the DiD model |
| summary() | Get formatted summary string |
| print_summary() | Print summary to stdout |
| get_params() | Get estimator parameters (sklearn-compatible) |
| set_params(**params) | Set estimator parameters (sklearn-compatible) |

fit() Parameters:

| Parameter | Type | Description |
| --- | --- | --- |
| data | DataFrame | Input data |
| outcome | str | Outcome variable column name |
| treatment | str | Treatment indicator column (0/1) |
| time | str | Post-treatment indicator column (0/1) |
| formula | str | R-style formula (alternative to column names) |
| covariates | list | Linear control variables |
| fixed_effects | list | Categorical FE columns (creates dummies) |
| absorb | list | High-dimensional FE (within transformation) |

DiDResults

Attributes:

| Attribute | Description |
| --- | --- |
| att | Average treatment effect on the treated |
| se | Standard error of the ATT |
| t_stat | t-statistic |
| p_value | P-value for H0: ATT = 0 |
| conf_int | Tuple of (lower, upper) confidence bounds |
| n_obs | Number of observations |
| n_treated | Number of treated units |
| n_control | Number of control units |
| r_squared | R-squared of the regression |
| coefficients | Dictionary of all coefficients |
| is_significant | Boolean for significance at alpha |
| significance_stars | String of significance stars |

Methods:

| Method | Description |
| --- | --- |
| summary(alpha) | Get formatted summary string |
| print_summary(alpha) | Print summary to stdout |
| to_dict() | Convert to dictionary |
| to_dataframe() | Convert to pandas DataFrame |

MultiPeriodDiD

MultiPeriodDiD(
    robust=True,      # Use HC1 robust standard errors
    cluster=None,     # Column for cluster-robust SEs
    alpha=0.05        # Significance level for CIs
)

fit() Parameters:

| Parameter | Type | Description |
| --- | --- | --- |
| data | DataFrame | Input data |
| outcome | str | Outcome variable column name |
| treatment | str | Treatment indicator column (0/1) |
| time | str | Time period column (multiple values) |
| post_periods | list | List of post-treatment period values |
| covariates | list | Linear control variables |
| fixed_effects | list | Categorical FE columns (creates dummies) |
| absorb | list | High-dimensional FE (within transformation) |
| reference_period | any | Omitted period for time dummies |

MultiPeriodDiDResults

Attributes:

| Attribute | Description |
| --- | --- |
| period_effects | Dict mapping periods to PeriodEffect objects |
| avg_att | Average ATT across post-treatment periods |
| avg_se | Standard error of the average ATT |
| avg_t_stat | t-statistic for the average ATT |
| avg_p_value | P-value for the average ATT |
| avg_conf_int | Confidence interval for the average ATT |
| n_obs | Number of observations |
| pre_periods | List of pre-treatment periods |
| post_periods | List of post-treatment periods |

Methods:

| Method | Description |
| --- | --- |
| get_effect(period) | Get the PeriodEffect for a specific period |
| summary(alpha) | Get formatted summary string |
| print_summary(alpha) | Print summary to stdout |
| to_dict() | Convert to dictionary |
| to_dataframe() | Convert to pandas DataFrame |

PeriodEffect

Attributes:

| Attribute | Description |
| --- | --- |
| period | Time period identifier |
| effect | Treatment effect estimate |
| se | Standard error |
| t_stat | t-statistic |
| p_value | P-value |
| conf_int | Confidence interval |
| is_significant | Boolean for significance at 0.05 |
| significance_stars | String of significance stars |

Requirements

  • Python >= 3.9
  • numpy >= 1.20
  • pandas >= 1.3
  • scipy >= 1.7

Development

# Install with dev dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Format code
black diff_diff tests
ruff check diff_diff tests

License

MIT License
