Difference-in-Differences causal inference with sklearn-like API. Callaway-Sant'Anna, Synthetic DiD, Honest DiD, event studies, parallel trends.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

igerber

These details have not been verified by PyPI

Project links

Project description

diff-diff

A Python library for Difference-in-Differences (DiD) causal inference analysis with an sklearn-like API and statsmodels-style outputs.

Installation

pip install diff-diff

Or install from source:

git clone https://github.com/igerber/diff-diff.git
cd diff-diff
pip install -e .

Quick Start

import pandas as pd
from diff_diff import DifferenceInDifferences  # or: DiD

# Create sample data
data = pd.DataFrame({
    'outcome': [10, 11, 15, 18, 9, 10, 12, 13],
    'treated': [1, 1, 1, 1, 0, 0, 0, 0],
    'post': [0, 0, 1, 1, 0, 0, 1, 1]
})

# Fit the model
did = DifferenceInDifferences()
results = did.fit(data, outcome='outcome', treatment='treated', time='post')

# View results
print(results)  # DiDResults(ATT=3.0000, SE=1.7321, p=0.1583)
results.print_summary()

Output:

======================================================================
             Difference-in-Differences Estimation Results
======================================================================

Observations:                      8
Treated units:                     4
Control units:                     4
R-squared:                    0.9055

----------------------------------------------------------------------
Parameter           Estimate    Std. Err.     t-stat      P>|t|
----------------------------------------------------------------------
ATT                   3.0000       1.7321      1.732     0.1583
----------------------------------------------------------------------

95% Confidence Interval: [-1.8089, 7.8089]

Signif. codes: '***' 0.001, '**' 0.01, '*' 0.05, '.' 0.1
======================================================================

For AI Agents

If you are an AI agent or LLM using this library, call diff_diff.get_llm_guide() for a concise API reference with an 8-step practitioner workflow (based on Baker et al. 2025). The workflow ensures rigorous DiD analysis — not just calling fit(), but testing assumptions, running sensitivity analysis, and checking robustness.

from diff_diff import get_llm_guide

get_llm_guide()                 # concise API reference
get_llm_guide("practitioner")   # 8-step workflow (Baker et al. 2025)
get_llm_guide("full")           # comprehensive documentation

The guides are bundled in the wheel, so they are accessible from a pip install with no network access required.

After estimation, call practitioner_next_steps(results) for context-aware guidance on remaining diagnostic steps.

For Data Scientists

Measuring campaign lift? Evaluating a product launch? diff-diff handles the causal inference so you can focus on the business question.

Which method fits my problem? - Start from your business scenario (campaign in some markets, staggered rollout, survey data) and find the right estimator
Getting started for practitioners - End-to-end walkthrough: marketing campaign -> causal estimate -> stakeholder-ready result
Brand awareness survey tutorial - Full example with complex survey design, brand funnel analysis, and staggered rollouts
Have BRFSS/ACS/CPS individual records? Use aggregate_survey() to roll respondent-level microdata into a geographic-period panel with inverse-variance precision weights. The returned second-stage design uses analytic weights (aweight), so it works directly with DifferenceInDifferences, TwoWayFixedEffects, MultiPeriodDiD, SunAbraham, ContinuousDiD, and EfficientDiD (estimators marked Full in the survey support matrix)

Already know DiD? The academic quickstart and estimator guide cover the full technical details.

Features

sklearn-like API: Familiar fit() interface with get_params() and set_params()
Pythonic results: Easy access to coefficients, standard errors, and confidence intervals
Multiple interfaces: Column names or R-style formulas
Robust inference: Heteroskedasticity-robust (HC1) and cluster-robust standard errors
Wild cluster bootstrap: Valid inference with few clusters (<50) using Rademacher, Webb, or Mammen weights
Panel data support: Two-way fixed effects estimator for panel designs
Multi-period analysis: Event-study style DiD with period-specific treatment effects
Staggered adoption: Callaway-Sant'Anna (2021), Sun-Abraham (2021), Borusyak-Jaravel-Spiess (2024) imputation, Two-Stage DiD (Gardner 2022), Stacked DiD (Wing, Freedman & Hollingsworth 2024), Efficient DiD (Chen, Sant'Anna & Xie 2025), and Wooldridge ETWFE (2021/2023) estimators for heterogeneous treatment timing
Reversible (non-absorbing) treatments: de Chaisemartin-D'Haultfœuille DID_M estimator for treatments that switch on AND off over time (marketing campaigns, seasonal promotions, on/off policy cycles) — the only library option for non-absorbing treatments
Triple Difference (DDD): Ortiz-Villavicencio & Sant'Anna (2025) estimators with proper covariate handling
Synthetic DiD: Combined DiD with synthetic control for improved robustness
Triply Robust Panel (TROP): Factor-adjusted DiD with synthetic weights (Athey et al. 2025)
Event study plots: Publication-ready visualization of treatment effects
Parallel trends testing: Multiple methods including equivalence tests
Goodman-Bacon decomposition: Diagnose TWFE bias by decomposing into 2x2 comparisons
Placebo tests: Comprehensive diagnostics including fake timing, fake group, permutation, and leave-one-out tests
Honest DiD sensitivity analysis: Rambachan-Roth (2023) bounds and breakdown analysis for parallel trends violations
Pre-trends power analysis: Roth (2022) minimum detectable violation (MDV) and power curves for pre-trends tests
Power analysis: MDE, sample size, and power calculations for study design; simulation-based power for any estimator
Data prep utilities: Helper functions for common data preparation tasks
Survey microdata aggregation: aggregate_survey() rolls individual-level survey data (BRFSS, ACS, CPS, NHANES) into geographic-period panels with design-based precision weights for second-stage DiD
Validated against R: Benchmarked against did, synthdid, and fixest packages (see benchmarks)

Estimator Aliases

All estimators have short aliases for convenience:

Alias	Full Name	Method
`DiD`	`DifferenceInDifferences`	Basic 2x2 DiD
`TWFE`	`TwoWayFixedEffects`	Two-way fixed effects
`EventStudy`	`MultiPeriodDiD`	Event study / multi-period
`CS`	`CallawaySantAnna`	Callaway & Sant'Anna (2021)
`SA`	`SunAbraham`	Sun & Abraham (2021)
`BJS`	`ImputationDiD`	Borusyak, Jaravel & Spiess (2024)
`Gardner`	`TwoStageDiD`	Gardner (2022) two-stage
`SDiD`	`SyntheticDiD`	Synthetic DiD
`DDD`	`TripleDifference`	Triple difference
`CDiD`	`ContinuousDiD`	Continuous treatment DiD
`Stacked`	`StackedDiD`	Stacked DiD
`Bacon`	`BaconDecomposition`	Goodman-Bacon decomposition
`EDiD`	`EfficientDiD`	Efficient DiD
`ETWFE`	`WooldridgeDiD`	Wooldridge ETWFE (2021/2023)
`DCDH`	`ChaisemartinDHaultfoeuille`	de Chaisemartin & D'Haultfœuille (2020) — reversible treatments

TROP already uses its short canonical name and needs no alias.

Tutorials

We provide Jupyter notebook tutorials in docs/tutorials/:

Notebook	Description
`01_basic_did.ipynb`	Basic 2x2 DiD, formula interface, covariates, fixed effects, cluster-robust SE, wild bootstrap
`02_staggered_did.ipynb`	Staggered adoption with Callaway-Sant'Anna and Sun-Abraham, group-time effects, aggregation methods, Bacon decomposition
`03_synthetic_did.ipynb`	Synthetic DiD, unit/time weights, inference methods, regularization
`04_parallel_trends.ipynb`	Testing parallel trends, equivalence tests, placebo tests, diagnostics
`05_honest_did.ipynb`	Honest DiD sensitivity analysis, bounds, breakdown values, visualization
`06_power_analysis.ipynb`	Power analysis, MDE, sample size calculations, simulation-based power
`07_pretrends_power.ipynb`	Pre-trends power analysis (Roth 2022), MDV, power curves
`08_triple_diff.ipynb`	Triple Difference (DDD) estimation with proper covariate handling
`09_real_world_examples.ipynb`	Real-world data examples (Card-Krueger, Castle Doctrine, Divorce Laws)
`10_trop.ipynb`	Triply Robust Panel (TROP) estimation with factor model adjustment
`11_imputation_did.ipynb`	Imputation DiD (Borusyak et al. 2024), pre-trend test, efficiency comparison
`12_two_stage_did.ipynb`	Two-Stage DiD (Gardner 2022), GMM sandwich variance, per-observation effects
`13_stacked_did.ipynb`	Stacked DiD (Wing et al. 2024), Q-weights, sub-experiment inspection, trimming, clean control definitions
`15_efficient_did.ipynb`	Efficient DiD (Chen et al. 2025), optimal weighting, PT-All vs PT-Post, efficiency gains, bootstrap inference
`16_survey_did.ipynb`	Survey-aware DiD with complex sampling designs (strata, PSU, FPC, weights), replicate weights, subpopulation analysis, DEFF diagnostics
`17_brand_awareness_survey.ipynb`	Measuring campaign impact on brand awareness with survey data — naive vs. survey-corrected comparison, brand funnel analysis, staggered rollouts, stakeholder communication

Data Preparation

diff-diff provides utility functions to help prepare your data for DiD analysis. These functions handle common data transformation tasks like creating treatment indicators, reshaping panel data, and validating data formats.

Generate Sample Data

Create synthetic data with a known treatment effect for testing and learning:

from diff_diff import generate_did_data, DifferenceInDifferences

# Generate panel data with 100 units, 4 periods, and a treatment effect of 5
data = generate_did_data(
    n_units=100,
    n_periods=4,
    treatment_effect=5.0,
    treatment_fraction=0.5,  # 50% of units are treated
    treatment_period=2,       # Treatment starts at period 2
    seed=42
)

# Verify the estimator recovers the treatment effect
did = DifferenceInDifferences()
results = did.fit(data, outcome='outcome', treatment='treated', time='post')
print(f"Estimated ATT: {results.att:.2f} (true: 5.0)")

Create Treatment Indicators

Convert categorical variables or numeric thresholds to binary treatment indicators:

from diff_diff import make_treatment_indicator

# From categorical variable
df = make_treatment_indicator(
    data,
    column='state',
    treated_values=['CA', 'NY', 'TX']  # These states are treated
)

# From numeric threshold (e.g., firms above median size)
df = make_treatment_indicator(
    data,
    column='firm_size',
    threshold=data['firm_size'].median()
)

# Treat units below threshold
df = make_treatment_indicator(
    data,
    column='income',
    threshold=50000,
    above_threshold=False  # Units with income <= 50000 are treated
)

Create Post-Treatment Indicators

Convert time/date columns to binary post-treatment indicators:

from diff_diff import make_post_indicator

# From specific post-treatment periods
df = make_post_indicator(
    data,
    time_column='year',
    post_periods=[2020, 2021, 2022]
)

# From treatment start date
df = make_post_indicator(
    data,
    time_column='year',
    treatment_start=2020  # All years >= 2020 are post-treatment
)

# Works with datetime columns
df = make_post_indicator(
    data,
    time_column='date',
    treatment_start='2020-01-01'
)

Reshape Wide to Long Format

Convert wide-format data (one row per unit, multiple time columns) to long format:

from diff_diff import wide_to_long

# Wide format: columns like sales_2019, sales_2020, sales_2021
wide_df = pd.DataFrame({
    'firm_id': [1, 2, 3],
    'industry': ['tech', 'retail', 'tech'],
    'sales_2019': [100, 150, 200],
    'sales_2020': [110, 160, 210],
    'sales_2021': [120, 170, 220]
})

# Convert to long format for DiD
long_df = wide_to_long(
    wide_df,
    value_columns=['sales_2019', 'sales_2020', 'sales_2021'],
    id_column='firm_id',
    time_name='year',
    value_name='sales',
    time_values=[2019, 2020, 2021]
)
# Result: 9 rows (3 firms × 3 years), columns: firm_id, year, sales, industry

Balance Panel Data

Ensure all units have observations for all time periods:

from diff_diff import balance_panel

# Keep only units with complete data (drop incomplete units)
balanced = balance_panel(
    data,
    unit_column='firm_id',
    time_column='year',
    method='inner'
)

# Include all unit-period combinations (creates NaN for missing)
balanced = balance_panel(
    data,
    unit_column='firm_id',
    time_column='year',
    method='outer'
)

# Fill missing values
balanced = balance_panel(
    data,
    unit_column='firm_id',
    time_column='year',
    method='fill',
    fill_value=0  # Or None for forward/backward fill
)

Validate Data

Check that your data meets DiD requirements before fitting:

from diff_diff import validate_did_data

# Validate and get informative error messages
result = validate_did_data(
    data,
    outcome='sales',
    treatment='treated',
    time='post',
    unit='firm_id',      # Optional: for panel-specific validation
    raise_on_error=False  # Return dict instead of raising
)

if result['valid']:
    print("Data is ready for DiD analysis!")
    print(f"Summary: {result['summary']}")
else:
    print("Issues found:")
    for error in result['errors']:
        print(f"  - {error}")

for warning in result['warnings']:
    print(f"Warning: {warning}")

Summarize Data by Groups

Get summary statistics for each treatment-time cell:

from diff_diff import summarize_did_data

summary = summarize_did_data(
    data,
    outcome='sales',
    treatment='treated',
    time='post'
)
print(summary)

Output:

                        n      mean       std       min       max
Control - Pre        250  100.5000   15.2340   65.0000  145.0000
Control - Post       250  105.2000   16.1230   68.0000  152.0000
Treated - Pre        250  101.2000   14.8900   67.0000  143.0000
Treated - Post       250  115.8000   17.5600   72.0000  165.0000
DiD Estimate           -    9.9000         -         -         -

Create Event Time for Staggered Designs

For designs where treatment occurs at different times:

from diff_diff import create_event_time

# Add event-time column relative to treatment timing
df = create_event_time(
    data,
    time_column='year',
    treatment_time_column='treatment_year'
)
# Result: event_time = -2, -1, 0, 1, 2 relative to treatment

Aggregate to Cohort Means

Aggregate unit-level data for visualization:

from diff_diff import aggregate_to_cohorts

cohort_data = aggregate_to_cohorts(
    data,
    unit_column='firm_id',
    time_column='year',
    treatment_column='treated',
    outcome='sales'
)
# Result: mean outcome by treatment group and period

Rank Control Units

Select the best control units for DiD or Synthetic DiD analysis by ranking them based on pre-treatment outcome similarity:

from diff_diff import rank_control_units, generate_did_data

# Generate sample data
data = generate_did_data(n_units=50, n_periods=6, seed=42)

# Rank control units by their similarity to treated units
ranking = rank_control_units(
    data,
    unit_column='unit',
    time_column='period',
    outcome_column='outcome',
    treatment_column='treated',
    n_top=10  # Return top 10 controls
)

print(ranking[['unit', 'quality_score', 'pre_trend_rmse']])

Output:

   unit  quality_score  pre_trend_rmse
0    35         1.0000          0.4521
1    42         0.9234          0.5123
2    28         0.8876          0.5892
...

With covariates for matching:

# Add covariate-based matching
ranking = rank_control_units(
    data,
    unit_column='unit',
    time_column='period',
    outcome_column='outcome',
    treatment_column='treated',
    covariates=['size', 'age'],  # Match on these too
    outcome_weight=0.7,          # 70% weight on outcome trends
    covariate_weight=0.3         # 30% weight on covariate similarity
)

Filter data for SyntheticDiD using top controls:

from diff_diff import SyntheticDiD

# Get top control units
top_controls = ranking['unit'].tolist()

# Filter data to treated + top controls
filtered_data = data[
    (data['treated'] == 1) | (data['unit'].isin(top_controls))
]

# Fit SyntheticDiD with selected controls
sdid = SyntheticDiD()
results = sdid.fit(
    filtered_data,
    outcome='outcome',
    treatment='treated',
    unit='unit',
    time='period',
    post_periods=[3, 4, 5]
)

Usage

Basic DiD with Column Names

from diff_diff import DifferenceInDifferences

did = DifferenceInDifferences(robust=True, alpha=0.05)
results = did.fit(
    data,
    outcome='sales',
    treatment='treated',
    time='post_policy'
)

# Access results
print(f"ATT: {results.att:.4f}")
print(f"Standard Error: {results.se:.4f}")
print(f"P-value: {results.p_value:.4f}")
print(f"95% CI: {results.conf_int}")
print(f"Significant: {results.is_significant}")

Using Formula Interface

# R-style formula syntax
results = did.fit(data, formula='outcome ~ treated * post')

# Explicit interaction syntax
results = did.fit(data, formula='outcome ~ treated + post + treated:post')

# With covariates
results = did.fit(data, formula='outcome ~ treated * post + age + income')

Including Covariates

results = did.fit(
    data,
    outcome='outcome',
    treatment='treated',
    time='post',
    covariates=['age', 'income', 'education']
)

Fixed Effects

Use fixed_effects for low-dimensional categorical controls (creates dummy variables):

# State and industry fixed effects
results = did.fit(
    data,
    outcome='sales',
    treatment='treated',
    time='post',
    fixed_effects=['state', 'industry']
)

# Access fixed effect coefficients
state_coefs = {k: v for k, v in results.coefficients.items() if k.startswith('state_')}

Use absorb for high-dimensional fixed effects (more efficient, uses within-transformation):

# Absorb firm-level fixed effects (efficient for many firms)
results = did.fit(
    data,
    outcome='sales',
    treatment='treated',
    time='post',
    absorb=['firm_id']
)

Combine covariates with fixed effects:

results = did.fit(
    data,
    outcome='sales',
    treatment='treated',
    time='post',
    covariates=['size', 'age'],           # Linear controls
    fixed_effects=['industry'],            # Low-dimensional FE (dummies)
    absorb=['firm_id']                     # High-dimensional FE (absorbed)
)

Cluster-Robust Standard Errors

did = DifferenceInDifferences(cluster='state')
results = did.fit(
    data,
    outcome='outcome',
    treatment='treated',
    time='post'
)

Wild Cluster Bootstrap

When you have few clusters (<50), standard cluster-robust SEs are biased. Wild cluster bootstrap provides valid inference even with 5-10 clusters.

# Use wild bootstrap for inference
did = DifferenceInDifferences(
    cluster='state',
    inference='wild_bootstrap',
    n_bootstrap=999,
    bootstrap_weights='rademacher',  # or 'webb' for <10 clusters, 'mammen'
    seed=42
)
results = did.fit(data, outcome='y', treatment='treated', time='post')

# Results include bootstrap-based SE and p-value
print(f"ATT: {results.att:.3f} (SE: {results.se:.3f})")
print(f"P-value: {results.p_value:.4f}")
print(f"95% CI: {results.conf_int}")
print(f"Inference method: {results.inference_method}")
print(f"Number of clusters: {results.n_clusters}")

Weight types:

'rademacher' - Default, ±1 with p=0.5, good for most cases
'webb' - 6-point distribution, recommended for <10 clusters
'mammen' - Two-point distribution, alternative to Rademacher

Works with DifferenceInDifferences and TwoWayFixedEffects estimators.

Two-Way Fixed Effects (Panel Data)

from diff_diff import TwoWayFixedEffects

twfe = TwoWayFixedEffects()
results = twfe.fit(
    panel_data,
    outcome='outcome',
    treatment='treated',
    time='year',
    unit='firm_id'
)

Multi-Period DiD (Event Study)

For settings with multiple pre- and post-treatment periods. Estimates treatment × period interactions for ALL periods (pre and post), enabling parallel trends assessment:

from diff_diff import MultiPeriodDiD

# Fit full event study with pre and post period effects
did = MultiPeriodDiD()
results = did.fit(
    panel_data,
    outcome='sales',
    treatment='treated',
    time='period',
    post_periods=[3, 4, 5],      # Periods 3-5 are post-treatment
    reference_period=2,          # Last pre-period (e=-1 convention)
    unit='unit_id',              # Optional: warns if staggered adoption detected
)

# Pre-period effects test parallel trends (should be ≈ 0)
for period, effect in results.pre_period_effects.items():
    print(f"Pre {period}: {effect.effect:.3f} (SE: {effect.se:.3f})")

# Post-period effects estimate dynamic treatment effects
for period, effect in results.post_period_effects.items():
    print(f"Post {period}: {effect.effect:.3f} (SE: {effect.se:.3f})")

# View average treatment effect across post-periods
print(f"Average ATT: {results.avg_att:.3f}")
print(f"Average SE: {results.avg_se:.3f}")

# Full summary with pre and post period effects
results.print_summary()

Output:

================================================================================
            Multi-Period Difference-in-Differences Estimation Results
================================================================================

Observations:                      600
Pre-treatment periods:             3
Post-treatment periods:            3

--------------------------------------------------------------------------------
Average Treatment Effect
--------------------------------------------------------------------------------
Average ATT       5.2000       0.8234      6.315      0.0000
--------------------------------------------------------------------------------
95% Confidence Interval: [3.5862, 6.8138]

Period-Specific Effects:
--------------------------------------------------------------------------------
Period            Effect     Std. Err.     t-stat      P>|t|
--------------------------------------------------------------------------------
3                 4.5000       0.9512      4.731      0.0000***
4                 5.2000       0.8876      5.858      0.0000***
5                 5.9000       0.9123      6.468      0.0000***
--------------------------------------------------------------------------------

Signif. codes: '***' 0.001, '**' 0.01, '*' 0.05, '.' 0.1
================================================================================

Staggered Difference-in-Differences (Callaway-Sant'Anna)

When treatment is adopted at different times by different units, traditional TWFE estimators can be biased. The Callaway-Sant'Anna estimator provides unbiased estimates with staggered adoption.

from diff_diff import CallawaySantAnna

# Panel data with staggered treatment
# 'first_treat' = period when unit was first treated (0 if never treated)
cs = CallawaySantAnna()
results = cs.fit(
    panel_data,
    outcome='sales',
    unit='firm_id',
    time='year',
    first_treat='first_treat',  # 0 for never-treated, else first treatment year
    aggregate='event_study'      # Compute event study effects
)

# View results
results.print_summary()

# Access group-time effects ATT(g,t)
for (group, time), effect in results.group_time_effects.items():
    print(f"Cohort {group}, Period {time}: {effect['effect']:.3f}")

# Event study effects (averaged by relative time)
for rel_time, effect in results.event_study_effects.items():
    print(f"e={rel_time}: {effect['effect']:.3f} (SE: {effect['se']:.3f})")

# Convert to DataFrame
df = results.to_dataframe(level='event_study')

Output:

=====================================================================================
          Callaway-Sant'Anna Staggered Difference-in-Differences Results
=====================================================================================

Total observations:                     600
Treated units:                           35
Control units:                           15
Treatment cohorts:                        3
Time periods:                             8
Control group:                never_treated

-------------------------------------------------------------------------------------
                  Overall Average Treatment Effect on the Treated
-------------------------------------------------------------------------------------
Parameter         Estimate     Std. Err.     t-stat      P>|t|   Sig.
-------------------------------------------------------------------------------------
ATT                 2.5000       0.3521       7.101     0.0000   ***
-------------------------------------------------------------------------------------

95% Confidence Interval: [1.8099, 3.1901]

-------------------------------------------------------------------------------------
                          Event Study (Dynamic) Effects
-------------------------------------------------------------------------------------
Rel. Period       Estimate     Std. Err.     t-stat      P>|t|   Sig.
-------------------------------------------------------------------------------------
0                   2.1000       0.4521       4.645     0.0000   ***
1                   2.5000       0.4123       6.064     0.0000   ***
2                   2.8000       0.5234       5.349     0.0000   ***
-------------------------------------------------------------------------------------

Signif. codes: '***' 0.001, '**' 0.01, '*' 0.05, '.' 0.1
=====================================================================================

When to use Callaway-Sant'Anna vs TWFE:

Scenario	Use TWFE	Use Callaway-Sant'Anna
All units treated at same time	✓	✓
Staggered adoption, homogeneous effects	✓	✓
Staggered adoption, heterogeneous effects	✗	✓
Need event study with staggered timing	✗	✓
Fewer than ~20 treated units	✓	Depends on design

Parameters:

CallawaySantAnna(
    control_group='never_treated',  # or 'not_yet_treated'
    anticipation=0,                  # Periods before treatment with effects
    estimation_method='dr',          # 'dr', 'ipw', or 'reg'
    alpha=0.05,                      # Significance level
    cluster=None,                    # Column for cluster SEs
    n_bootstrap=0,                   # Bootstrap iterations (0 = analytical SEs)
    bootstrap_weights='rademacher',  # 'rademacher', 'mammen', or 'webb'
    seed=None                        # Random seed
)

Multiplier bootstrap for inference:

With few clusters or when analytical standard errors may be unreliable, use the multiplier bootstrap for valid inference. This implements the approach from Callaway & Sant'Anna (2021).

# Bootstrap inference with 999 iterations
cs = CallawaySantAnna(
    n_bootstrap=999,
    bootstrap_weights='rademacher',  # or 'mammen', 'webb'
    seed=42
)
results = cs.fit(
    data,
    outcome='sales',
    unit='firm_id',
    time='year',
    first_treat='first_treat',
    aggregate='event_study'
)

# Access bootstrap results
print(f"Overall ATT: {results.overall_att:.3f}")
print(f"Bootstrap SE: {results.bootstrap_results.overall_att_se:.3f}")
print(f"Bootstrap 95% CI: {results.bootstrap_results.overall_att_ci}")
print(f"Bootstrap p-value: {results.bootstrap_results.overall_att_p_value:.4f}")

# Event study bootstrap inference
for rel_time, se in results.bootstrap_results.event_study_ses.items():
    ci = results.bootstrap_results.event_study_cis[rel_time]
    print(f"e={rel_time}: SE={se:.3f}, 95% CI=[{ci[0]:.3f}, {ci[1]:.3f}]")

Bootstrap weight types:

'rademacher' - Default, ±1 with p=0.5, good for most cases
'mammen' - Two-point distribution matching first 3 moments
'webb' - Six-point distribution, recommended for very few clusters (<10)

Covariate adjustment for conditional parallel trends:

When parallel trends only holds conditional on covariates, use the covariates parameter:

# Doubly robust estimation with covariates
cs = CallawaySantAnna(estimation_method='dr')  # 'dr', 'ipw', or 'reg'
results = cs.fit(
    data,
    outcome='sales',
    unit='firm_id',
    time='year',
    first_treat='first_treat',
    covariates=['size', 'age', 'industry'],  # Covariates for conditional PT
    aggregate='event_study'
)

Sun-Abraham Interaction-Weighted Estimator

The Sun-Abraham (2021) estimator provides an alternative to Callaway-Sant'Anna using an interaction-weighted (IW) regression approach. Running both estimators serves as a useful robustness check—when they agree, results are more credible.

from diff_diff import SunAbraham

# Basic usage
sa = SunAbraham()
results = sa.fit(
    panel_data,
    outcome='sales',
    unit='firm_id',
    time='year',
    first_treat='first_treat'  # 0 for never-treated, else first treatment year
)

# View results
results.print_summary()

# Event study effects (by relative time to treatment)
for rel_time, effect in results.event_study_effects.items():
    print(f"e={rel_time}: {effect['effect']:.3f} (SE: {effect['se']:.3f})")

# Overall ATT
print(f"Overall ATT: {results.overall_att:.3f} (SE: {results.overall_se:.3f})")

# Cohort weights (how each cohort contributes to each event-time estimate)
for rel_time, weights in results.cohort_weights.items():
    print(f"e={rel_time}: {weights}")

Parameters:

SunAbraham(
    control_group='never_treated',  # or 'not_yet_treated'
    anticipation=0,                  # Periods before treatment with effects
    alpha=0.05,                      # Significance level
    cluster=None,                    # Column for cluster SEs
    n_bootstrap=0,                   # Bootstrap iterations (0 = analytical SEs)
    bootstrap_weights='rademacher',  # 'rademacher', 'mammen', or 'webb'
    seed=None                        # Random seed
)

Bootstrap inference:

# Bootstrap inference with 999 iterations
sa = SunAbraham(
    n_bootstrap=999,
    bootstrap_weights='rademacher',
    seed=42
)
results = sa.fit(
    data,
    outcome='sales',
    unit='firm_id',
    time='year',
    first_treat='first_treat'
)

# Access bootstrap results
print(f"Overall ATT: {results.overall_att:.3f}")
print(f"Bootstrap SE: {results.bootstrap_results.overall_att_se:.3f}")
print(f"Bootstrap 95% CI: {results.bootstrap_results.overall_att_ci}")
print(f"Bootstrap p-value: {results.bootstrap_results.overall_att_p_value:.4f}")

When to use Sun-Abraham vs Callaway-Sant'Anna:

Aspect	Sun-Abraham	Callaway-Sant'Anna
Approach	Interaction-weighted regression	2x2 DiD aggregation
Efficiency	More efficient under homogeneous effects	More robust to heterogeneity
Weighting	Weights by cohort share at each relative time	Weights by sample size
Use case	Robustness check, regression-based inference	Primary staggered DiD estimator

Both estimators should give similar results when:

Treatment effects are relatively homogeneous across cohorts
Parallel trends holds

Running both as robustness check:

from diff_diff import CallawaySantAnna, SunAbraham

# Callaway-Sant'Anna
cs = CallawaySantAnna()
cs_results = cs.fit(data, outcome='y', unit='unit', time='time', first_treat='first_treat')

# Sun-Abraham
sa = SunAbraham()
sa_results = sa.fit(data, outcome='y', unit='unit', time='time', first_treat='first_treat')

# Compare
print(f"Callaway-Sant'Anna ATT: {cs_results.overall_att:.3f}")
print(f"Sun-Abraham ATT: {sa_results.overall_att:.3f}")

# If results differ substantially, investigate heterogeneity

Borusyak-Jaravel-Spiess Imputation Estimator

The Borusyak et al. (2024) imputation estimator is the efficient estimator for staggered DiD under parallel trends, producing ~50% shorter confidence intervals than Callaway-Sant'Anna and 2-3.5x shorter than Sun-Abraham under homogeneous treatment effects.

from diff_diff import ImputationDiD, imputation_did

# Basic usage
est = ImputationDiD()
results = est.fit(data, outcome='outcome', unit='unit',
                  time='period', first_treat='first_treat')
results.print_summary()

# Event study
results = est.fit(data, outcome='outcome', unit='unit',
                  time='period', first_treat='first_treat',
                  aggregate='event_study')

# Pre-trend test (Equation 9)
pt = results.pretrend_test(n_leads=3)
print(f"F-stat: {pt['f_stat']:.3f}, p-value: {pt['p_value']:.4f}")

# Convenience function
results = imputation_did(data, 'outcome', 'unit', 'period', 'first_treat',
                         aggregate='all')

ImputationDiD(
    anticipation=0,         # Number of anticipation periods
    alpha=0.05,             # Significance level
    cluster=None,           # Cluster variable (defaults to unit)
    n_bootstrap=0,          # Bootstrap iterations (0=analytical inference)
    seed=None,              # Random seed
    horizon_max=None,       # Max event-study horizon
    aux_partition="cohort_horizon",  # Variance partition: "cohort_horizon", "cohort", "horizon"
)

When to use Imputation DiD vs Callaway-Sant'Anna:

Aspect	Imputation DiD	Callaway-Sant'Anna
Efficiency	Most efficient under homogeneous effects	Less efficient but more robust to heterogeneity
Control group	Always uses all untreated obs	Choice of never-treated or not-yet-treated
Inference	Conservative variance (Theorem 3)	Multiplier bootstrap
Pre-trends	Built-in F-test (Equation 9)	Separate testing

Two-Stage DiD (Gardner 2022)

Two-Stage DiD addresses TWFE bias in staggered adoption designs by estimating unit and time fixed effects on untreated observations only, then regressing the residualized outcomes on treatment indicators. Point estimates match the Imputation DiD estimator (Borusyak et al. 2024); the key difference is that Two-Stage DiD uses a GMM sandwich variance estimator that accounts for first-stage estimation error, while Imputation DiD uses a conservative variance (Theorem 3).

from diff_diff import TwoStageDiD

# Basic usage
est = TwoStageDiD()
results = est.fit(data, outcome='outcome', unit='unit', time='period', first_treat='first_treat')
results.print_summary()

Event study:

# Event study aggregation with visualization
results = est.fit(data, outcome='outcome', unit='unit', time='period',
                  first_treat='first_treat', aggregate='event_study')
plot_event_study(results)

Parameters:

TwoStageDiD(
    anticipation=0,                   # Periods of anticipation effects
    alpha=0.05,                       # Significance level for CIs
    cluster=None,                     # Column for cluster-robust SEs (defaults to unit)
    n_bootstrap=0,                    # Bootstrap iterations (0 = analytical GMM SEs)
    seed=None,                        # Random seed
    rank_deficient_action='warn',     # 'warn', 'error', or 'silent'
    horizon_max=None,                 # Max event-study horizon
)

When to use Two-Stage DiD vs Imputation DiD:

Aspect	Two-Stage DiD	Imputation DiD
Point estimates	Identical	Identical
Variance	GMM sandwich (accounts for first-stage error)	Conservative (Theorem 3, may overcover)
Intuition	Residualize then regress	Impute counterfactuals then aggregate
Reference impl.	R `did2s` package	R `didimputation` package

Both estimators are the efficient estimator under homogeneous treatment effects, producing shorter confidence intervals than Callaway-Sant'Anna or Sun-Abraham.

Stacked DiD (Wing, Freedman & Hollingsworth 2024)

Stacked DiD addresses TWFE bias in staggered adoption settings by constructing a "clean" comparison dataset for each treatment cohort and stacking them together. Each cohort's sub-experiment compares units treated at that cohort's timing against units that are not yet treated (or never treated) within a symmetric event-study window. This avoids the "bad comparisons" problem in TWFE while retaining a regression-based framework that practitioners familiar with event studies will find intuitive.

from diff_diff import StackedDiD, generate_staggered_data

# Generate sample data
data = generate_staggered_data(n_units=200, n_periods=12,
                                cohort_periods=[4, 6, 8], seed=42)

# Fit stacked DiD with event study
est = StackedDiD(kappa_pre=2, kappa_post=2)
results = est.fit(data, outcome='outcome', unit='unit',
                  time='period', first_treat='first_treat',
                  aggregate='event_study')
results.print_summary()

# Access stacked data for custom analysis
stacked = results.stacked_data

# Convenience function
from diff_diff import stacked_did
results = stacked_did(data, 'outcome', 'unit', 'period', 'first_treat',
                      kappa_pre=2, kappa_post=2, aggregate='event_study')

Parameters:

StackedDiD(
    kappa_pre=1,                          # Pre-treatment event-study periods
    kappa_post=1,                         # Post-treatment event-study periods
    weighting='aggregate',                # 'aggregate', 'population', or 'sample_share'
    clean_control='not_yet_treated',      # 'not_yet_treated', 'strict', or 'never_treated'
    cluster='unit',                       # 'unit' or 'unit_subexp'
    alpha=0.05,                           # Significance level
    anticipation=0,                       # Anticipation periods
    rank_deficient_action='warn',         # 'warn', 'error', or 'silent'
)

Note: Group aggregation (aggregate='group') is not supported because the pooled stacked regression cannot produce cohort-specific effects. Use CallawaySantAnna or ImputationDiD for cohort-level estimates.

When to use Stacked DiD vs Callaway-Sant'Anna:

Aspect	Stacked DiD	Callaway-Sant'Anna
Approach	Stack cohort sub-experiments, run pooled TWFE	2x2 DiD aggregation
Symmetric windows	Enforced via kappa_pre / kappa_post	Not required
Control group	Not-yet-treated (default) or never-treated	Never-treated or not-yet-treated
Covariates	Passed to pooled regression	Doubly robust / IPW
Intuition	Familiar event-study regression	Nonparametric aggregation

Convenience function:

# One-liner estimation
results = stacked_did(
    data,
    outcome='outcome',
    unit='unit',
    time='period',
    first_treat='first_treat',
    kappa_pre=3,
    kappa_post=3,
    aggregate='event_study'
)

Efficient DiD (Chen, Sant'Anna & Xie 2025)

Efficient DiD achieves the semiparametric efficiency bound for ATT estimation in staggered adoption designs along the no-covariate path, producing tighter confidence intervals than standard estimators when the stronger PT-All assumption holds. It optimally weights across all valid comparison groups and baselines via the inverse covariance matrix Omega*. A doubly-robust covariate path is also available: it is consistent if either the outcome regression or the sieve propensity ratio is correctly specified, but the linear OLS outcome regression does not generically attain the efficiency bound unless the conditional mean is linear in the covariates.

from diff_diff import EfficientDiD, generate_staggered_data

# Generate sample data
data = generate_staggered_data(n_units=300, n_periods=10,
                                cohort_periods=[4, 6, 8], seed=42)

# Fit with PT-All (overidentified, tighter SEs)
edid = EfficientDiD(pt_assumption="all")
results = edid.fit(data, outcome='outcome', unit='unit',
                   time='period', first_treat='first_treat',
                   aggregate='all')
results.print_summary()

# PT-Post mode (matches CS for post-treatment effects)
edid_post = EfficientDiD(pt_assumption="post")
results_post = edid_post.fit(data, outcome='outcome', unit='unit',
                              time='period', first_treat='first_treat')

Parameters:

EfficientDiD(
    pt_assumption='all',            # 'all' (overidentified) or 'post' (matches CS post-treatment ATT)
    alpha=0.05,                     # Significance level
    n_bootstrap=0,                  # Bootstrap iterations (0 = analytical only)
    bootstrap_weights='rademacher', # 'rademacher', 'mammen', or 'webb'
    seed=None,                      # Random seed
    anticipation=0,                 # Anticipation periods
)

Note: EfficientDiD supports covariate adjustment via a doubly-robust path (sieve-based propensity score ratios and a linear OLS outcome regression). The DR property gives consistency if either the OR or the PS is correctly specified, but the OLS working model for the outcome regression does not generically attain the semiparametric efficiency bound. The unqualified efficiency-bound claim applies to the no-covariate path only. See the covariates parameter on fit() and docs/methodology/REGISTRY.md.

When to use Efficient DiD vs Callaway-Sant'Anna:

Aspect	Efficient DiD	Callaway-Sant'Anna
Approach	Optimal EIF-based weighting	Separate 2x2 DiD aggregation
PT assumption	PT-All (stronger) or PT-Post	Conditional PT
Efficiency	Achieves semiparametric bound on the no-covariate path; DR covariate path is consistent but does not generically attain the bound under a linear OLS outcome regression	Not efficient
Covariates	Supported (doubly robust, sieve-based PS + linear OLS OR)	Supported (OR, IPW, DR)
When to choose	Maximum efficiency, PT-All credible	Covariates needed, weaker PT

de Chaisemartin-D'Haultfœuille (dCDH) for Reversible Treatments

ChaisemartinDHaultfoeuille (alias DCDH) is the only library estimator that handles non-absorbing (reversible) treatments — treatment can switch on AND off over time. This is the natural fit for marketing campaigns, seasonal promotions, on/off policy cycles.

Ships DID_M (= DID_1 at horizon l = 1), the full multi-horizon event study DID_l for l = 1..L_max via the L_max parameter, residualization-style covariate adjustment (controls), group-specific linear trends (trends_linear), state-set-specific trends (trends_nonparam), heterogeneity testing, non-binary treatment, HonestDiD sensitivity integration on placebos, and survey support via Taylor-series linearization.

from diff_diff import ChaisemartinDHaultfoeuille
from diff_diff.prep import generate_reversible_did_data

# Generate a reversible-treatment panel
data = generate_reversible_did_data(
    n_groups=80, n_periods=6, pattern="single_switch", seed=42,
)

# Fit the estimator
est = ChaisemartinDHaultfoeuille()
results = est.fit(
    data,
    outcome="outcome",
    group="group",
    time="period",
    treatment="treatment",
)
results.print_summary()

# Decomposition
print(f"DID_M (overall):  {results.overall_att:.3f}")
print(f"DID_+ (joiners):  {results.joiners_att:.3f}")
print(f"DID_- (leavers):  {results.leavers_att:.3f}")
print(f"Placebo (DID^pl): {results.placebo_effect:.3f}")

Parameters:

ChaisemartinDHaultfoeuille(
    alpha=0.05,                   # Significance level
    n_bootstrap=0,                # 0 = analytical SE only; >0 = multiplier bootstrap
    bootstrap_weights="rademacher",  # 'rademacher', 'mammen', or 'webb'
    seed=None,                    # Random seed for bootstrap
    placebo=True,                 # Auto-compute single-lag placebo
    twfe_diagnostic=True,         # Auto-compute TWFE decomposition diagnostic
    drop_larger_lower=True,       # Drop multi-switch groups (matches R DIDmultiplegtDYN)
    rank_deficient_action="warn", # Used by TWFE diagnostic OLS
)

What you get back on the results object:

Field	Description
`overall_att`, `overall_se`, `overall_conf_int`	`DID_M` when `L_max=None`; cost-benefit `delta` when `L_max > 1` (delta-method SE from per-horizon SEs)
`joiners_att`, `leavers_att`	Decomposition into the joiners (`DID_+`) and leavers (`DID_-`) views
`placebo_effect`	Single-lag placebo (`DID_M^pl`) point estimate
`per_period_effects`	Per-period decomposition with explicit A11-violation flags
`twfe_weights`, `twfe_fraction_negative`, `twfe_sigma_fe`, `twfe_beta_fe`	Theorem 1 decomposition diagnostic
`n_groups_dropped_crossers`, `n_groups_dropped_singleton_baseline`	Filter counts (multi-switch groups dropped before estimation; singleton-baseline groups excluded from variance)
`n_groups_dropped_never_switching`	Backwards-compatibility metadata. Never-switching groups participate in the variance via stable-control roles; this field is no longer a filter count.

Multi-horizon event study (pass L_max to fit()):

results = est.fit(data, outcome="outcome", group="group",
                  time="period", treatment="treatment", L_max=5)

# Per-horizon effects with analytical SE
for horizon in sorted(results.event_study_effects):
    e = results.event_study_effects[horizon]
    print(f"  l={horizon}: DID_l={e['effect']:.3f} (SE={e['se']:.3f})")

# Cost-benefit delta (becomes overall_att when L_max > 1)
print(f"Cost-benefit delta: {results.cost_benefit_delta['delta']:.3f}")

# Normalized effects: DID^n_l = DID_l / l (for binary treatment)
for horizon in sorted(results.normalized_effects):
    print(f"  DID^n_{horizon} = {results.normalized_effects[horizon]['effect']:.3f}")

# Event study DataFrame (includes placebos as negative horizons)
df = results.to_dataframe("event_study")

# Plot (integrates with plot_event_study)
from diff_diff import plot_event_study
plot_event_study(results)

Standalone TWFE decomposition diagnostic (without fitting the full estimator):

from diff_diff import twowayfeweights

diagnostic = twowayfeweights(
    data, outcome="outcome", group="group", time="period", treatment="treatment",
)
print(f"Plain TWFE coefficient: {diagnostic.beta_fe:.3f}")
print(f"Fraction of negative weights: {diagnostic.fraction_negative:.3f}")
print(f"sigma_fe (sign-flipping threshold): {diagnostic.sigma_fe:.3f}")

Note: Placebo SE is NaN for the single-period DID_M^pl (L_max=None) because the per-period aggregation path has no influence-function derivation; the point estimate is meaningful for visual pre-trends inspection. Multi-horizon dynamic placebos DID^{pl}_l (L_max >= 1) have valid analytical SE via the same cohort-recentered plug-in variance as the positive horizons, with bootstrap SE available when n_bootstrap > 0. See docs/methodology/REGISTRY.md for the full contract.

Note: By default (drop_larger_lower=True), the estimator drops groups whose treatment switches more than once before estimation. This matches R DIDmultiplegtDYN's default and is required for the analytical variance formula to be consistent with the point estimate. Each drop emits an explicit warning.

Note: The estimator requires panels with a balanced baseline (every group observed at the first global period) and no interior period gaps. Late-entry groups (missing the baseline) raise ValueError; interior-gap groups are dropped with a warning; terminally-missing groups (early exit / right-censoring) are retained and contribute from their observed periods only. This is a documented deviation from R DIDmultiplegtDYN, which supports unbalanced panels - see docs/methodology/REGISTRY.md for the rationale, the defensive guards that make terminal missingness safe, and workarounds for unbalanced inputs.

Note: Survey design is supported via Taylor-series linearization on pweight with strata / PSU / FPC. Replicate-weight variance and PSU-level bootstrap for dCDH are a planned extension. The aggregate parameter still raises NotImplementedError.

Triple Difference (DDD)

Triple Difference (DDD) is used when treatment requires satisfying two criteria: belonging to a treated group AND being in an eligible partition. The TripleDifference class implements the methodology from Ortiz-Villavicencio & Sant'Anna (2025), which correctly handles covariate adjustment (unlike naive implementations).

from diff_diff import TripleDifference, triple_difference

# Basic usage
ddd = TripleDifference(estimation_method='dr')  # doubly robust (recommended)
results = ddd.fit(
    data,
    outcome='wages',
    group='policy_state',       # 1=state enacted policy, 0=control state
    partition='female',         # 1=women (affected by policy), 0=men
    time='post'                 # 1=post-policy, 0=pre-policy
)

# View results
results.print_summary()
print(f"ATT: {results.att:.3f} (SE: {results.se:.3f})")

# With covariates (properly incorporated, unlike naive DDD)
results = ddd.fit(
    data,
    outcome='wages',
    group='policy_state',
    partition='female',
    time='post',
    covariates=['age', 'education', 'experience']
)

Estimation methods:

Method	Description	When to use
`"dr"`	Doubly robust	Recommended. Consistent if either outcome or propensity model is correct
`"reg"`	Regression adjustment	Simple outcome regression with full interactions
`"ipw"`	Inverse probability weighting	When propensity score model is well-specified

# Compare estimation methods
for method in ['reg', 'ipw', 'dr']:
    est = TripleDifference(estimation_method=method)
    res = est.fit(data, outcome='y', group='g', partition='p', time='t')
    print(f"{method}: ATT={res.att:.3f} (SE={res.se:.3f})")

Convenience function:

# One-liner estimation
results = triple_difference(
    data,
    outcome='wages',
    group='policy_state',
    partition='female',
    time='post',
    covariates=['age', 'education'],
    estimation_method='dr'
)

Why use DDD instead of DiD?

DDD allows for violations of parallel trends that are:

Group-specific (e.g., economic shocks in treatment states)
Partition-specific (e.g., trends affecting women everywhere)

As long as these biases are additive, DDD differences them out. The key assumption is that the differential trend between eligible and ineligible units would be the same across groups.

Event Study Visualization

Create publication-ready event study plots:

from diff_diff import plot_event_study, MultiPeriodDiD, CallawaySantAnna, SunAbraham

# From MultiPeriodDiD (full event study with pre and post period effects)
did = MultiPeriodDiD()
results = did.fit(data, outcome='y', treatment='treated',
                  time='period', post_periods=[3, 4, 5], reference_period=2)
plot_event_study(results, title="Treatment Effects Over Time")

# From CallawaySantAnna (with event study aggregation)
cs = CallawaySantAnna()
results = cs.fit(data, outcome='y', unit='unit', time='period',
                 first_treat='first_treat', aggregate='event_study')
plot_event_study(results, title="Staggered DiD Event Study (CS)")

# From SunAbraham
sa = SunAbraham()
results = sa.fit(data, outcome='y', unit='unit', time='period',
                 first_treat='first_treat')
plot_event_study(results, title="Staggered DiD Event Study (SA)")

# From a DataFrame
df = pd.DataFrame({
    'period': [-2, -1, 0, 1, 2],
    'effect': [0.1, 0.05, 0.0, 2.5, 2.8],
    'se': [0.3, 0.25, 0.0, 0.4, 0.45]
})
plot_event_study(df, reference_period=0)

# With customization
ax = plot_event_study(
    results,
    title="Dynamic Treatment Effects",
    xlabel="Years Relative to Treatment",
    ylabel="Effect on Sales ($1000s)",
    color="#2563eb",
    marker="o",
    shade_pre=True,           # Shade pre-treatment region
    show_zero_line=True,      # Horizontal line at y=0
    show_reference_line=True, # Vertical line at reference period
    figsize=(10, 6),
    show=False                # Don't call plt.show(), return axes
)

Synthetic Difference-in-Differences

Synthetic DiD combines the strengths of Difference-in-Differences and Synthetic Control methods by re-weighting control units to better match treated units' pre-treatment outcomes.

from diff_diff import SyntheticDiD

# Fit Synthetic DiD model
sdid = SyntheticDiD()
results = sdid.fit(
    panel_data,
    outcome='gdp_growth',
    treatment='treated',
    unit='state',
    time='year',
    post_periods=[2015, 2016, 2017, 2018]
)

# View results
results.print_summary()
print(f"ATT: {results.att:.3f} (SE: {results.se:.3f})")

# Examine unit weights (which control units matter most)
weights_df = results.get_unit_weights_df()
print(weights_df.head(10))

# Examine time weights
time_weights_df = results.get_time_weights_df()
print(time_weights_df)

Output:

===========================================================================
         Synthetic Difference-in-Differences Estimation Results
===========================================================================

Observations:                      500
Treated units:                       1
Control units:                      49
Pre-treatment periods:               6
Post-treatment periods:              4
Regularization (lambda):        0.0000
Pre-treatment fit (RMSE):       0.1234

---------------------------------------------------------------------------
Parameter         Estimate     Std. Err.     t-stat      P>|t|
---------------------------------------------------------------------------
ATT                 2.5000       0.4521      5.530      0.0000
---------------------------------------------------------------------------

95% Confidence Interval: [1.6139, 3.3861]

---------------------------------------------------------------------------
                   Top Unit Weights (Synthetic Control)
---------------------------------------------------------------------------
  Unit state_12: 0.3521
  Unit state_5: 0.2156
  Unit state_23: 0.1834
  Unit state_8: 0.1245
  Unit state_31: 0.0892
  (8 units with weight > 0.001)

Signif. codes: '***' 0.001, '**' 0.01, '*' 0.05, '.' 0.1
===========================================================================

When to Use Synthetic DiD Over Vanilla DiD

Use Synthetic DiD instead of standard DiD when:

Few treated units: When you have only one or a small number of treated units (e.g., a single state passed a policy), standard DiD averages across all controls equally. Synthetic DiD finds the optimal weighted combination of controls.
```
# Example: California passed a policy, want to estimate its effect
# Standard DiD would compare CA to the average of all other states
# Synthetic DiD finds states that together best match CA's pre-treatment trend
```
Parallel trends is questionable: When treated and control groups have different pre-treatment levels or trends, Synthetic DiD can construct a better counterfactual by matching the pre-treatment trajectory.
```
# Example: A tech hub city vs rural areas
# Rural areas may not be a good comparison on average
# Synthetic DiD can weight urban/suburban controls more heavily
```

Heterogeneous control units: When control units are very different from each other, equal weighting (as in standard DiD) is suboptimal.

# Example: Comparing a treated developing country to other countries
# Some control countries may be much more similar economically
# Synthetic DiD upweights the most comparable controls

You want transparency: Synthetic DiD provides explicit unit weights showing which controls contribute most to the comparison.
```
# See exactly which units are driving the counterfactual
print(results.get_unit_weights_df())
```

Key differences from standard DiD:

Aspect	Standard DiD	Synthetic DiD
Control weighting	Equal (1/N)	Optimized to match pre-treatment
Time weighting	Equal across periods	Can emphasize informative periods
N treated required	Can be many	Works with 1 treated unit
Parallel trends	Assumed	Partially relaxed via matching
Interpretability	Simple average	Explicit weights

Parameters:

SyntheticDiD(
    zeta_omega=None,        # Unit weight regularization (None = auto-computed from data)
    zeta_lambda=None,       # Time weight regularization (None = auto-computed from data)
    alpha=0.05,             # Significance level
    variance_method="placebo",  # "placebo" (default, matches R) or "bootstrap"
    n_bootstrap=200,        # Replications for SE estimation
    seed=None               # Random seed for reproducibility
)

Triply Robust Panel (TROP)

TROP (Athey, Imbens, Qu & Viviano 2025) extends Synthetic DiD by adding interactive fixed effects (factor model) adjustment. It's particularly useful when there are unobserved time-varying confounders with a factor structure that could bias standard DiD or SDID estimates.

TROP combines three robustness components:

Nuclear norm regularized factor model: Estimates interactive fixed effects L_it via soft-thresholding
Exponential distance-based unit weights: ω_j = exp(-λ_unit × distance(j,i))
Exponential time decay weights: θ_s = exp(-λ_time × |s-t|)

Tuning parameters are selected via leave-one-out cross-validation (LOOCV).

from diff_diff import TROP, trop

# Fit TROP model with automatic tuning via LOOCV
trop_est = TROP(
    lambda_time_grid=[0.0, 0.5, 1.0, 2.0],  # Time decay grid
    lambda_unit_grid=[0.0, 0.5, 1.0, 2.0],  # Unit distance grid
    lambda_nn_grid=[0.0, 0.1, 1.0],          # Nuclear norm grid
    n_bootstrap=200
)
# Note: TROP infers treatment periods from the treatment indicator column.
# The 'treated' column must be an absorbing state (D=1 for all periods
# during and after treatment starts for each unit).
results = trop_est.fit(
    panel_data,
    outcome='gdp_growth',
    treatment='treated',
    unit='state',
    time='year'
)

# View results
results.print_summary()
print(f"ATT: {results.att:.3f} (SE: {results.se:.3f})")
print(f"Effective rank: {results.effective_rank:.2f}")

# Selected tuning parameters
print(f"λ_time: {results.lambda_time:.2f}")
print(f"λ_unit: {results.lambda_unit:.2f}")
print(f"λ_nn: {results.lambda_nn:.2f}")

# Examine unit effects
unit_effects = results.get_unit_effects_df()
print(unit_effects.head(10))

Output:

===========================================================================
         Triply Robust Panel (TROP) Estimation Results
               Athey, Imbens, Qu & Viviano (2025)
===========================================================================

Observations:                      500
Treated units:                       1
Control units:                      49
Treated observations:                4
Pre-treatment periods:               6
Post-treatment periods:              4

---------------------------------------------------------------------------
             Tuning Parameters (selected via LOOCV)
---------------------------------------------------------------------------
Lambda (time decay):               1.0000
Lambda (unit distance):            0.5000
Lambda (nuclear norm):             0.1000
Effective rank:                      2.35
LOOCV score:                     0.012345
Variance method:                bootstrap
Bootstrap replications:              200

---------------------------------------------------------------------------
Parameter         Estimate     Std. Err.     t-stat      P>|t|
---------------------------------------------------------------------------
ATT                 2.5000       0.3892      6.424      0.0000   ***
---------------------------------------------------------------------------

95% Confidence Interval: [1.7372, 3.2628]

Signif. codes: '***' 0.001, '**' 0.01, '*' 0.05, '.' 0.1
===========================================================================

When to Use TROP Over Synthetic DiD

Use TROP when you suspect factor structure in the data—unobserved confounders that affect outcomes differently across units and time:

Scenario	Use SDID	Use TROP
Simple parallel trends	✓	✓
Unobserved factors (e.g., economic cycles)	May be biased	✓
Strong unit-time interactions	May be biased	✓
Low-dimensional confounding	✓	✓

Example scenarios where TROP excels:

Regional economic shocks that affect states differently based on industry composition
Global trends that impact countries differently based on their economic structure
Common factors in financial data (market risk, interest rates, etc.)

How TROP works:

Factor estimation: Estimates interactive fixed effects L_it using nuclear norm regularization (encourages low-rank structure)
Unit weights: Exponential distance-based weighting ω_j = exp(-λ_unit × d(j,i)) where d(j,i) is the RMSE of outcome differences
Time weights: Exponential decay weighting θ_s = exp(-λ_time × |s-t|) based on proximity to treatment
ATT computation: τ = Y_it - α_i - β_t - L_it for treated observations

# Compare TROP vs SDID under factor confounding
from diff_diff import SyntheticDiD

# Synthetic DiD (may be biased with factors)
sdid = SyntheticDiD()
sdid_results = sdid.fit(data, outcome='y', treatment='treated',
                        unit='unit', time='time', post_periods=[5,6,7])

# TROP (accounts for factors)
# Note: TROP infers treatment periods from the treatment indicator column
# (D=1 for treated observations, D=0 for control)
trop_est = TROP()  # Uses default grids with LOOCV selection
trop_results = trop_est.fit(data, outcome='y', treatment='treated',
                            unit='unit', time='time')

print(f"SDID estimate: {sdid_results.att:.3f}")
print(f"TROP estimate: {trop_results.att:.3f}")
print(f"Effective rank: {trop_results.effective_rank:.2f}")

Tuning parameter grids:

# Custom tuning grids (searched via LOOCV)
trop = TROP(
    lambda_time_grid=[0.0, 0.1, 0.5, 1.0, 2.0, 5.0],  # Time decay
    lambda_unit_grid=[0.0, 0.1, 0.5, 1.0, 2.0, 5.0],  # Unit distance
    lambda_nn_grid=[0.0, 0.01, 0.1, 1.0, 10.0]        # Nuclear norm
)

# Fixed tuning parameters (skip LOOCV search)
trop = TROP(
    lambda_time_grid=[1.0],   # Single value = fixed
    lambda_unit_grid=[1.0],   # Single value = fixed
    lambda_nn_grid=[0.1]      # Single value = fixed
)

Parameters:

TROP(
    method='local',             # Estimation method: 'local' (default) or 'global'
    lambda_time_grid=None,      # Time decay grid (default: [0, 0.1, 0.5, 1, 2, 5])
    lambda_unit_grid=None,      # Unit distance grid (default: [0, 0.1, 0.5, 1, 2, 5])
    lambda_nn_grid=None,        # Nuclear norm grid (default: [0, 0.01, 0.1, 1, 10])
    max_iter=100,               # Max iterations for factor estimation
    tol=1e-6,                   # Convergence tolerance
    alpha=0.05,                 # Significance level
    n_bootstrap=200,            # Bootstrap replications
    seed=None                   # Random seed
)

Estimation methods:

'local' (default): Per-observation model fitting following Algorithm 2 of the paper. Computes observation-specific weights and fits a model for each treated observation, then averages the individual treatment effects. More flexible but computationally intensive.
'global': Global weighted least squares optimization. Fits a single model on control observations with global weights, then computes per-observation treatment effects as residuals. Faster but uses global rather than observation-specific weights.

Convenience function:

# One-liner estimation with default tuning grids
# Note: TROP infers treatment periods from the treatment indicator
results = trop(
    data,
    outcome='y',
    treatment='treated',
    unit='unit',
    time='time',
    n_bootstrap=200
)

Working with Results

Export Results

# As dictionary
results.to_dict()
# {'att': 3.5, 'se': 1.26, 'p_value': 0.037, ...}

# As DataFrame
df = results.to_dataframe()

Check Significance

if results.is_significant:
    print(f"Effect is significant at {did.alpha} level")

# Get significance stars
print(f"ATT: {results.att}{results.significance_stars}")
# ATT: 3.5000*

Access Full Regression Output

# All coefficients
results.coefficients
# {'const': 9.5, 'treated': 1.0, 'post': 2.5, 'treated:post': 3.5}

# Variance-covariance matrix
results.vcov

# Residuals and fitted values
results.residuals
results.fitted_values

# R-squared
results.r_squared

Checking Assumptions

Parallel Trends

Simple slope-based test:

from diff_diff.utils import check_parallel_trends

trends = check_parallel_trends(
    data,
    outcome='outcome',
    time='period',
    treatment_group='treated'
)

print(f"Treated trend: {trends['treated_trend']:.4f}")
print(f"Control trend: {trends['control_trend']:.4f}")
print(f"Difference p-value: {trends['p_value']:.4f}")

Robust distributional test (Wasserstein distance):

from diff_diff.utils import check_parallel_trends_robust

results = check_parallel_trends_robust(
    data,
    outcome='outcome',
    time='period',
    treatment_group='treated',
    unit='firm_id',              # Unit identifier for panel data
    pre_periods=[2018, 2019],    # Pre-treatment periods
    n_permutations=1000          # Permutations for p-value
)

print(f"Wasserstein distance: {results['wasserstein_distance']:.4f}")
print(f"Wasserstein p-value: {results['wasserstein_p_value']:.4f}")
print(f"KS test p-value: {results['ks_p_value']:.4f}")
print(f"Parallel trends plausible: {results['parallel_trends_plausible']}")

The Wasserstein (Earth Mover's) distance compares the full distribution of outcome changes, not just means. This is more robust to:

Non-normal distributions
Heterogeneous effects across units
Outliers

Equivalence testing (TOST):

from diff_diff.utils import equivalence_test_trends

results = equivalence_test_trends(
    data,
    outcome='outcome',
    time='period',
    treatment_group='treated',
    unit='firm_id',
    equivalence_margin=0.5       # Define "practically equivalent"
)

print(f"Mean difference: {results['mean_difference']:.4f}")
print(f"TOST p-value: {results['tost_p_value']:.4f}")
print(f"Trends equivalent: {results['equivalent']}")

Honest DiD Sensitivity Analysis (Rambachan-Roth)

Pre-trends tests have low power and can exacerbate bias. Honest DiD (Rambachan & Roth 2023) provides sensitivity analysis showing how robust your results are to violations of parallel trends.

from diff_diff import HonestDiD, MultiPeriodDiD

# First, fit a full event study (pre + post period effects)
did = MultiPeriodDiD()
event_results = did.fit(
    data,
    outcome='outcome',
    treatment='treated',
    time='period',
    post_periods=[5, 6, 7, 8, 9],
    reference_period=4,          # Last pre-period (e=-1 convention)
)

# Compute honest bounds with relative magnitudes restriction
# M=1 means post-treatment violations can be up to 1x the worst pre-treatment violation
honest = HonestDiD(method='relative_magnitude', M=1.0)
honest_results = honest.fit(event_results)

print(honest_results.summary())
print(f"Original estimate: {honest_results.original_estimate:.4f}")
print(f"Robust 95% CI: [{honest_results.ci_lb:.4f}, {honest_results.ci_ub:.4f}]")
print(f"Effect robust to violations: {honest_results.is_significant}")

Sensitivity analysis over M values:

# How do results change as we allow larger violations?
sensitivity = honest.sensitivity_analysis(
    event_results,
    M_grid=[0, 0.5, 1.0, 1.5, 2.0]
)

print(sensitivity.summary())
print(f"Breakdown value: M = {sensitivity.breakdown_M}")
# Breakdown = smallest M where the robust CI includes zero

Breakdown value:

The breakdown value tells you how robust your conclusion is:

breakdown = honest.breakdown_value(event_results)
if breakdown >= 1.0:
    print("Result holds even if post-treatment violations are as bad as pre-treatment")
else:
    print(f"Result requires violations smaller than {breakdown:.1f}x pre-treatment")

Smoothness restriction (alternative approach):

# Bounds second differences of trend violations
# M=0 means linear extrapolation of pre-trends
honest_smooth = HonestDiD(method='smoothness', M=0.5)
smooth_results = honest_smooth.fit(event_results)

Visualization:

from diff_diff import plot_sensitivity, plot_honest_event_study

# Plot sensitivity analysis
plot_sensitivity(sensitivity, title="Sensitivity to Parallel Trends Violations")

# Event study with honest confidence intervals
plot_honest_event_study(event_results, honest_results)

Pre-Trends Power Analysis (Roth 2022)

A passing pre-trends test doesn't mean parallel trends holds—it may just mean the test has low power. Pre-Trends Power Analysis (Roth 2022) answers: "What violations could my pre-trends test have detected?"

from diff_diff import PreTrendsPower, MultiPeriodDiD

# First, fit a full event study
did = MultiPeriodDiD()
event_results = did.fit(
    data,
    outcome='outcome',
    treatment='treated',
    time='period',
    post_periods=[5, 6, 7, 8, 9],
    reference_period=4,
)

# Analyze pre-trends test power
pt = PreTrendsPower(alpha=0.05, power=0.80)
power_results = pt.fit(event_results)

print(power_results.summary())
print(f"Minimum Detectable Violation (MDV): {power_results.mdv:.4f}")
print(f"Power to detect violations of size MDV: {power_results.power:.1%}")

Key concepts:

Minimum Detectable Violation (MDV): Smallest violation magnitude that would be detected with your target power (e.g., 80%). Passing the pre-trends test does NOT rule out violations up to this size.
Power: Probability of detecting a violation of given size if it exists.
Violation types: Linear trend, constant violation, last-period only, or custom patterns.

Power curve visualization:

from diff_diff import plot_pretrends_power

# Generate power curve across violation magnitudes
curve = pt.power_curve(event_results)

# Plot the power curve
plot_pretrends_power(curve, title="Pre-Trends Test Power Curve")

# Or from the curve object directly
curve.plot()

Different violation patterns:

# Linear trend violations (default) - most common assumption
pt_linear = PreTrendsPower(violation_type='linear')

# Constant violation in all pre-periods
pt_constant = PreTrendsPower(violation_type='constant')

# Violation only in the last pre-period (sharp break)
pt_last = PreTrendsPower(violation_type='last_period')

# Custom violation pattern
custom_weights = np.array([0.1, 0.3, 0.6])  # Increasing violations
pt_custom = PreTrendsPower(violation_type='custom', violation_weights=custom_weights)

Combining with HonestDiD:

Pre-trends power analysis and HonestDiD are complementary:

Pre-trends power tells you what the test could have detected
HonestDiD tells you how robust your results are to violations

from diff_diff import HonestDiD, PreTrendsPower

# If MDV is large relative to your estimated effect, be cautious
pt = PreTrendsPower()
power_results = pt.fit(event_results)
sensitivity = pt.sensitivity_to_honest_did(event_results)
print(sensitivity['interpretation'])

# Use HonestDiD for robust inference
honest = HonestDiD(method='relative_magnitude', M=1.0)
honest_results = honest.fit(event_results)

Placebo Tests

Placebo tests help validate the parallel trends assumption by checking whether effects appear where they shouldn't (before treatment or in untreated groups).

Fake timing test:

from diff_diff import run_placebo_test

# Test: Is there an effect before treatment actually occurred?
# Actual treatment is at period 3 (post_periods=[3, 4, 5])
# We test if a "fake" treatment at period 1 shows an effect
results = run_placebo_test(
    data,
    outcome='outcome',
    treatment='treated',
    time='period',
    test_type='fake_timing',
    fake_treatment_period=1,  # Pretend treatment was in period 1
    post_periods=[3, 4, 5]    # Actual post-treatment periods
)

print(results.summary())
# If parallel trends hold, placebo_effect should be ~0 and not significant
print(f"Placebo effect: {results.placebo_effect:.3f} (p={results.p_value:.3f})")
print(f"Is significant (bad): {results.is_significant}")

Fake group test:

# Test: Is there an effect among never-treated units?
# Get some control unit IDs to use as "fake treated"
control_units = data[data['treated'] == 0]['firm_id'].unique()[:5]

results = run_placebo_test(
    data,
    outcome='outcome',
    treatment='treated',
    time='period',
    unit='firm_id',
    test_type='fake_group',
    fake_treatment_group=list(control_units),  # List of control unit IDs
    post_periods=[3, 4, 5]
)

Permutation test:

# Randomly reassign treatment and compute distribution of effects
# Note: requires binary post indicator (use 'post' column, not 'period')
results = run_placebo_test(
    data,
    outcome='outcome',
    treatment='treated',
    time='post',           # Binary post-treatment indicator
    unit='firm_id',
    test_type='permutation',
    n_permutations=1000,
    seed=42
)

print(f"Original effect: {results.original_effect:.3f}")
print(f"Permutation p-value: {results.p_value:.4f}")
# Low p-value indicates the effect is unlikely to be due to chance

Leave-one-out sensitivity:

# Test sensitivity to individual treated units
# Note: requires binary post indicator (use 'post' column, not 'period')
results = run_placebo_test(
    data,
    outcome='outcome',
    treatment='treated',
    time='post',           # Binary post-treatment indicator
    unit='firm_id',
    test_type='leave_one_out'
)

# Check if any single unit drives the result
print(results.leave_one_out_effects)  # Effect when each unit is dropped

Run all placebo tests:

from diff_diff import run_all_placebo_tests

# Comprehensive diagnostic suite
# Note: This function runs fake_timing tests on pre-treatment periods.
# The permutation and leave_one_out tests require a binary post indicator,
# so they may return errors if the data uses multi-period time column.
all_results = run_all_placebo_tests(
    data,
    outcome='outcome',
    treatment='treated',
    time='period',
    unit='firm_id',
    pre_periods=[0, 1, 2],
    post_periods=[3, 4, 5],
    n_permutations=500,
    seed=42
)

for test_name, result in all_results.items():
    if hasattr(result, 'p_value'):
        print(f"{test_name}: p={result.p_value:.3f}, significant={result.is_significant}")
    elif isinstance(result, dict) and 'error' in result:
        print(f"{test_name}: Error - {result['error']}")

API Reference

DifferenceInDifferences

DifferenceInDifferences(
    robust=True,      # Use HC1 robust standard errors
    cluster=None,     # Column for cluster-robust SEs
    alpha=0.05        # Significance level for CIs
)

Methods:

Method	Description
`fit(data, outcome, treatment, time, ...)`	Fit the DiD model
`summary()`	Get formatted summary string
`print_summary()`	Print summary to stdout
`get_params()`	Get estimator parameters (sklearn-compatible)
`set_params(**params)`	Set estimator parameters (sklearn-compatible)

fit() Parameters:

Parameter	Type	Description
`data`	DataFrame	Input data
`outcome`	str	Outcome variable column name
`treatment`	str	Treatment indicator column (0/1)
`time`	str	Post-treatment indicator column (0/1)
`formula`	str	R-style formula (alternative to column names)
`covariates`	list	Linear control variables
`fixed_effects`	list	Categorical FE columns (creates dummies)
`absorb`	list	High-dimensional FE (within-transformation)

DiDResults

Attributes:

Attribute	Description
`att`	Average Treatment effect on the Treated
`se`	Standard error of ATT
`t_stat`	T-statistic
`p_value`	P-value for H0: ATT = 0
`conf_int`	Tuple of (lower, upper) confidence bounds
`n_obs`	Number of observations
`n_treated`	Number of treated units
`n_control`	Number of control units
`r_squared`	R-squared of regression
`coefficients`	Dictionary of all coefficients
`is_significant`	Boolean for significance at alpha
`significance_stars`	String of significance stars

Methods:

Method	Description
`summary(alpha)`	Get formatted summary string
`print_summary(alpha)`	Print summary to stdout
`to_dict()`	Convert to dictionary
`to_dataframe()`	Convert to pandas DataFrame

MultiPeriodDiD

MultiPeriodDiD(
    robust=True,      # Use HC1 robust standard errors
    cluster=None,     # Column for cluster-robust SEs
    alpha=0.05        # Significance level for CIs
)

fit() Parameters:

Parameter	Type	Description
`data`	DataFrame	Input data
`outcome`	str	Outcome variable column name
`treatment`	str	Treatment indicator column (0/1)
`time`	str	Time period column (multiple values)
`post_periods`	list	List of post-treatment period values
`covariates`	list	Linear control variables
`fixed_effects`	list	Categorical FE columns (creates dummies)
`absorb`	list	High-dimensional FE (within-transformation)
`reference_period`	any	Omitted period (default: last pre-period, e=-1 convention)
`unit`	str	Unit identifier column (for staggered adoption warning)

MultiPeriodDiDResults

Attributes:

Attribute	Description
`period_effects`	Dict mapping periods to PeriodEffect objects (pre and post, excluding reference)
`avg_att`	Average ATT across post-treatment periods only
`avg_se`	Standard error of average ATT
`avg_t_stat`	T-statistic for average ATT
`avg_p_value`	P-value for average ATT
`avg_conf_int`	Confidence interval for average ATT
`n_obs`	Number of observations
`pre_periods`	List of pre-treatment periods
`post_periods`	List of post-treatment periods
`reference_period`	The omitted reference period (coefficient = 0 by construction)
`interaction_indices`	Dict mapping period → column index in VCV (for sub-VCV extraction)
`pre_period_effects`	Property: pre-period effects only (for parallel trends assessment)
`post_period_effects`	Property: post-period effects only

Methods:

Method	Description
`get_effect(period)`	Get PeriodEffect for specific period
`summary(alpha)`	Get formatted summary string
`print_summary(alpha)`	Print summary to stdout
`to_dict()`	Convert to dictionary
`to_dataframe()`	Convert to pandas DataFrame

PeriodEffect

Attributes:

Attribute	Description
`period`	Time period identifier
`effect`	Treatment effect estimate
`se`	Standard error
`t_stat`	T-statistic
`p_value`	P-value
`conf_int`	Confidence interval
`is_significant`	Boolean for significance at 0.05
`significance_stars`	String of significance stars

SyntheticDiD

SyntheticDiD(
    zeta_omega=None,        # Unit weight regularization (None = auto from data)
    zeta_lambda=None,       # Time weight regularization (None = auto from data)
    alpha=0.05,             # Significance level for CIs
    variance_method="placebo",  # "placebo" (R default) or "bootstrap"
    n_bootstrap=200,        # Replications for SE estimation
    seed=None               # Random seed for reproducibility
)

fit() Parameters:

Parameter	Type	Description
`data`	DataFrame	Panel data
`outcome`	str	Outcome variable column name
`treatment`	str	Treatment indicator column (0/1)
`unit`	str	Unit identifier column
`time`	str	Time period column
`post_periods`	list	List of post-treatment period values
`covariates`	list	Covariates to residualize out

SyntheticDiDResults

Attributes:

Attribute	Description
`att`	Average Treatment effect on the Treated
`se`	Standard error (bootstrap or placebo-based)
`t_stat`	T-statistic
`p_value`	P-value
`conf_int`	Confidence interval
`n_obs`	Number of observations
`n_treated`	Number of treated units
`n_control`	Number of control units
`unit_weights`	Dict mapping control unit IDs to weights
`time_weights`	Dict mapping pre-treatment periods to weights
`pre_periods`	List of pre-treatment periods
`post_periods`	List of post-treatment periods
`pre_treatment_fit`	RMSE of synthetic vs treated in pre-period
`placebo_effects`	Array of placebo effect estimates

Methods:

Method	Description
`summary(alpha)`	Get formatted summary string
`print_summary(alpha)`	Print summary to stdout
`to_dict()`	Convert to dictionary
`to_dataframe()`	Convert to pandas DataFrame
`get_unit_weights_df()`	Get unit weights as DataFrame
`get_time_weights_df()`	Get time weights as DataFrame

TROP

TROP(
    lambda_time_grid=None,     # Time decay grid (default: [0, 0.1, 0.5, 1, 2, 5])
    lambda_unit_grid=None,     # Unit distance grid (default: [0, 0.1, 0.5, 1, 2, 5])
    lambda_nn_grid=None,       # Nuclear norm grid (default: [0, 0.01, 0.1, 1, 10])
    max_iter=100,              # Max iterations for factor estimation
    tol=1e-6,                  # Convergence tolerance
    alpha=0.05,                # Significance level for CIs
    n_bootstrap=200,           # Bootstrap replications (minimum 2; TROP requires bootstrap for SEs)
    seed=None                  # Random seed
)

fit() Parameters:

Parameter	Type	Description
`data`	DataFrame	Panel data
`outcome`	str	Outcome variable column name
`treatment`	str	Treatment indicator column (0/1 absorbing state)
`unit`	str	Unit identifier column
`time`	str	Time period column

Note: TROP infers treatment periods from the treatment indicator column. The treatment column should be an absorbing state indicator where D=1 for all periods during and after treatment starts.

TROPResults

Attributes:

Attribute	Description
`att`	Average Treatment effect on the Treated
`se`	Standard error (bootstrap)
`t_stat`	T-statistic
`p_value`	P-value
`conf_int`	Confidence interval
`n_obs`	Number of observations
`n_treated`	Number of treated units
`n_control`	Number of control units
`n_treated_obs`	Number of treated unit-time observations
`unit_effects`	Dict mapping unit IDs to fixed effects
`time_effects`	Dict mapping periods to fixed effects
`treatment_effects`	Dict mapping (unit, time) to individual effects
`lambda_time`	Selected time decay parameter
`lambda_unit`	Selected unit distance parameter
`lambda_nn`	Selected nuclear norm parameter
`factor_matrix`	Low-rank factor matrix L (n_periods x n_units)
`effective_rank`	Effective rank of factor matrix
`loocv_score`	LOOCV score for selected parameters
`n_pre_periods`	Number of pre-treatment periods
`n_post_periods`	Number of post-treatment periods
`bootstrap_distribution`	Bootstrap distribution (if bootstrap)

Methods:

Method	Description
`summary(alpha)`	Get formatted summary string
`print_summary(alpha)`	Print summary to stdout
`to_dict()`	Convert to dictionary
`to_dataframe()`	Convert to pandas DataFrame
`get_unit_effects_df()`	Get unit fixed effects as DataFrame
`get_time_effects_df()`	Get time fixed effects as DataFrame
`get_treatment_effects_df()`	Get individual treatment effects as DataFrame

SunAbraham

SunAbraham(
    control_group='never_treated',  # or 'not_yet_treated'
    anticipation=0,           # Periods of anticipation effects
    alpha=0.05,               # Significance level for CIs
    cluster=None,             # Column for cluster-robust SEs
    n_bootstrap=0,            # Bootstrap iterations (0 = analytical SEs)
    bootstrap_weights='rademacher',  # 'rademacher', 'mammen', or 'webb'
    seed=None                 # Random seed
)

fit() Parameters:

Parameter	Type	Description
`data`	DataFrame	Panel data
`outcome`	str	Outcome variable column name
`unit`	str	Unit identifier column
`time`	str	Time period column
`first_treat`	str	Column with first treatment period (0 for never-treated)
`covariates`	list	Covariate column names

SunAbrahamResults

Attributes:

Attribute	Description
`event_study_effects`	Dict mapping relative time to effect info
`overall_att`	Overall average treatment effect
`overall_se`	Standard error of overall ATT
`overall_t_stat`	T-statistic for overall ATT
`overall_p_value`	P-value for overall ATT
`overall_conf_int`	Confidence interval for overall ATT
`cohort_weights`	Dict mapping relative time to cohort weights
`groups`	List of treatment cohorts
`time_periods`	List of all time periods
`n_obs`	Total number of observations
`n_treated_units`	Number of ever-treated units
`n_control_units`	Number of never-treated units
`is_significant`	Boolean for significance at alpha
`significance_stars`	String of significance stars
`bootstrap_results`	SABootstrapResults (if bootstrap enabled)

Methods:

Method	Description
`summary(alpha)`	Get formatted summary string
`print_summary(alpha)`	Print summary to stdout
`to_dataframe(level)`	Convert to DataFrame ('event_study' or 'cohort')

ImputationDiD

ImputationDiD(
    anticipation=0,                   # Periods of anticipation effects
    alpha=0.05,                       # Significance level for CIs
    cluster=None,                     # Column for cluster-robust SEs
    n_bootstrap=0,                    # Bootstrap iterations (0 = analytical)
    bootstrap_weights='rademacher',   # 'rademacher', 'mammen', or 'webb'
    seed=None,                        # Random seed
    rank_deficient_action='warn',     # 'warn', 'error', or 'silent'
    horizon_max=None,                 # Max event-study horizon
    aux_partition='cohort_horizon',   # Variance partition
)

fit() Parameters:

Parameter	Type	Description
`data`	DataFrame	Panel data
`outcome`	str	Outcome variable column name
`unit`	str	Unit identifier column
`time`	str	Time period column
`first_treat`	str	First treatment period column (0 for never-treated)
`covariates`	list	Covariate column names
`aggregate`	str	Aggregation: None, "event_study", "group", "all"
`balance_e`	int	Balance event study to this many pre-treatment periods

ImputationDiDResults

Attributes:

Attribute	Description
`overall_att`	Overall average treatment effect on the treated
`overall_se`	Standard error (conservative, Theorem 3)
`overall_t_stat`	T-statistic
`overall_p_value`	P-value for H0: ATT = 0
`overall_conf_int`	Confidence interval
`event_study_effects`	Dict of relative time -> effect dict (if `aggregate='event_study'` or `'all'`)
`group_effects`	Dict of cohort -> effect dict (if `aggregate='group'` or `'all'`)
`treatment_effects`	DataFrame of unit-level imputed treatment effects
`n_treated_obs`	Number of treated observations
`n_untreated_obs`	Number of untreated observations

Methods:

Method	Description
`summary(alpha)`	Get formatted summary string
`print_summary(alpha)`	Print summary to stdout
`to_dataframe(level)`	Convert to DataFrame ('observation', 'event_study', 'group')
`pretrend_test(n_leads)`	Run pre-trend F-test (Equation 9)

TwoStageDiD

TwoStageDiD(
    anticipation=0,                   # Periods of anticipation effects
    alpha=0.05,                       # Significance level for CIs
    cluster=None,                     # Column for cluster-robust SEs (defaults to unit)
    n_bootstrap=0,                    # Bootstrap iterations (0 = analytical GMM SEs)
    bootstrap_weights='rademacher',   # 'rademacher', 'mammen', or 'webb'
    seed=None,                        # Random seed
    rank_deficient_action='warn',     # 'warn', 'error', or 'silent'
    horizon_max=None,                 # Max event-study horizon
)

fit() Parameters:

Parameter	Type	Description
`data`	DataFrame	Panel data
`outcome`	str	Outcome variable column name
`unit`	str	Unit identifier column
`time`	str	Time period column
`first_treat`	str	First treatment period column (0 for never-treated)
`covariates`	list	Covariate column names
`aggregate`	str	Aggregation: None, "event_study", "group", "all"
`balance_e`	int	Balance event study to this many pre-treatment periods

TwoStageDiDResults

Attributes:

Attribute	Description
`overall_att`	Overall average treatment effect on the treated
`overall_se`	Standard error (GMM sandwich variance)
`overall_t_stat`	T-statistic
`overall_p_value`	P-value for H0: ATT = 0
`overall_conf_int`	Confidence interval
`event_study_effects`	Dict of relative time -> effect dict (if `aggregate='event_study'` or `'all'`)
`group_effects`	Dict of cohort -> effect dict (if `aggregate='group'` or `'all'`)
`treatment_effects`	DataFrame of unit-level treatment effects
`n_treated_obs`	Number of treated observations
`n_untreated_obs`	Number of untreated observations

Methods:

Method	Description
`summary(alpha)`	Get formatted summary string
`print_summary(alpha)`	Print summary to stdout
`to_dataframe(level)`	Convert to DataFrame ('observation', 'event_study', 'group')

StackedDiD

StackedDiD(
    kappa_pre=1,                          # Pre-treatment event-study periods
    kappa_post=1,                         # Post-treatment event-study periods
    weighting='aggregate',                # 'aggregate', 'population', or 'sample_share'
    clean_control='not_yet_treated',      # 'not_yet_treated', 'strict', or 'never_treated'
    cluster='unit',                       # 'unit' or 'unit_subexp'
    alpha=0.05,                           # Significance level
    anticipation=0,                       # Anticipation periods
    rank_deficient_action='warn',         # 'warn', 'error', or 'silent'
)

fit() Parameters:

Parameter	Type	Description
`data`	DataFrame	Panel data
`outcome`	str	Outcome variable column name
`unit`	str	Unit identifier column
`time`	str	Time period column
`first_treat`	str	First treatment period column (0 for never-treated)
`population`	str, optional	Population column (required if weighting='population')
`aggregate`	str	Aggregation: None, `"simple"`, or `"event_study"`

StackedDiDResults

Attributes:

Attribute	Description
`overall_att`	Overall average treatment effect on the treated
`overall_se`	Standard error
`overall_t_stat`	T-statistic
`overall_p_value`	P-value for H0: ATT = 0
`overall_conf_int`	Confidence interval
`event_study_effects`	Dict of relative time -> effect dict (if `aggregate='event_study'`)
`stacked_data`	The stacked dataset used for estimation
`n_treated_obs`	Number of treated observations
`n_untreated_obs`	Number of untreated (clean control) observations
`n_cohorts`	Number of treatment cohorts
`kappa_pre`	Pre-treatment window used
`kappa_post`	Post-treatment window used

Methods:

Method	Description
`summary(alpha)`	Get formatted summary string
`print_summary(alpha)`	Print summary to stdout
`to_dataframe(level)`	Convert to DataFrame ('event_study')

TripleDifference

TripleDifference(
    estimation_method='dr',   # 'dr' (doubly robust), 'reg', or 'ipw'
    robust=True,              # Use HC1 robust standard errors
    cluster=None,             # Column for cluster-robust SEs
    alpha=0.05,               # Significance level for CIs
    pscore_trim=0.01          # Propensity score trimming threshold
)

fit() Parameters:

Parameter	Type	Description
`data`	DataFrame	Input data
`outcome`	str	Outcome variable column name
`group`	str	Group indicator column (0/1): 1=treated group
`partition`	str	Partition/eligibility indicator column (0/1): 1=eligible
`time`	str	Time indicator column (0/1): 1=post-treatment
`covariates`	list	Covariate column names for adjustment

TripleDifferenceResults

Attributes:

Attribute	Description
`att`	Average Treatment effect on the Treated
`se`	Standard error of ATT
`t_stat`	T-statistic
`p_value`	P-value for H0: ATT = 0
`conf_int`	Tuple of (lower, upper) confidence bounds
`n_obs`	Total number of observations
`n_treated_eligible`	Obs in treated group & eligible partition
`n_treated_ineligible`	Obs in treated group & ineligible partition
`n_control_eligible`	Obs in control group & eligible partition
`n_control_ineligible`	Obs in control group & ineligible partition
`estimation_method`	Method used ('dr', 'reg', or 'ipw')
`group_means`	Dict of cell means for diagnostics
`pscore_stats`	Propensity score statistics (IPW/DR only)
`is_significant`	Boolean for significance at alpha
`significance_stars`	String of significance stars

Methods:

Method	Description
`summary(alpha)`	Get formatted summary string
`print_summary(alpha)`	Print summary to stdout
`to_dict()`	Convert to dictionary
`to_dataframe()`	Convert to pandas DataFrame

HonestDiD

HonestDiD(
    method='relative_magnitude',  # 'relative_magnitude' or 'smoothness'
    M=None,               # Restriction parameter (default: 1.0 for RM, 0.0 for SD)
    alpha=0.05,           # Significance level for CIs
    l_vec=None            # Linear combination vector for target parameter
)

fit() Parameters:

Parameter	Type	Description
`results`	MultiPeriodDiDResults	Results from MultiPeriodDiD.fit()
`M`	float	Restriction parameter (overrides constructor value)

Methods:

Method	Description
`fit(results, M)`	Compute bounds for given event study results
`sensitivity_analysis(results, M_grid)`	Compute bounds over grid of M values
`breakdown_value(results, tol)`	Find smallest M where CI includes zero

HonestDiDResults

Attributes:

Attribute	Description
`original_estimate`	Point estimate under parallel trends
`lb`	Lower bound of identified set
`ub`	Upper bound of identified set
`ci_lb`	Lower bound of robust confidence interval
`ci_ub`	Upper bound of robust confidence interval
`ci_width`	Width of robust CI
`M`	Restriction parameter used
`method`	Restriction method ('relative_magnitude' or 'smoothness')
`alpha`	Significance level
`is_significant`	True if robust CI excludes zero

Methods:

Method	Description
`summary()`	Get formatted summary string
`to_dict()`	Convert to dictionary
`to_dataframe()`	Convert to pandas DataFrame

SensitivityResults

Attributes:

Attribute	Description
`M_grid`	Array of M values analyzed
`results`	List of HonestDiDResults for each M
`breakdown_M`	Smallest M where CI includes zero (None if always significant)

Methods:

Method	Description
`summary()`	Get formatted summary string
`plot(ax)`	Plot sensitivity analysis
`to_dataframe()`	Convert to pandas DataFrame

PreTrendsPower

PreTrendsPower(
    alpha=0.05,           # Significance level for pre-trends test
    power=0.80,           # Target power for MDV calculation
    violation_type='linear',  # 'linear', 'constant', 'last_period', 'custom'
    violation_weights=None    # Custom weights (required if violation_type='custom')
)

fit() Parameters:

Parameter	Type	Description
`results`	MultiPeriodDiDResults	Results from event study
`M`	float	Specific violation magnitude to evaluate

Methods:

Method	Description
`fit(results, M)`	Compute power analysis for given event study
`power_at(results, M)`	Compute power for specific violation magnitude
`power_curve(results, M_grid, n_points)`	Compute power across range of M values
`sensitivity_to_honest_did(results)`	Compare with HonestDiD analysis

PreTrendsPowerResults

Attributes:

Attribute	Description
`power`	Power to detect the specified violation
`mdv`	Minimum detectable violation at target power
`violation_magnitude`	Violation magnitude (M) tested
`violation_type`	Type of violation pattern
`alpha`	Significance level
`target_power`	Target power level
`n_pre_periods`	Number of pre-treatment periods
`test_statistic`	Expected test statistic under violation
`critical_value`	Critical value for pre-trends test
`noncentrality`	Non-centrality parameter
`is_informative`	Heuristic check if test is informative
`power_adequate`	Whether power meets target

Methods:

Method	Description
`summary()`	Get formatted summary string
`print_summary()`	Print summary to stdout
`to_dict()`	Convert to dictionary
`to_dataframe()`	Convert to pandas DataFrame

PreTrendsPowerCurve

Attributes:

Attribute	Description
`M_values`	Array of violation magnitudes
`powers`	Array of power values
`mdv`	Minimum detectable violation
`alpha`	Significance level
`target_power`	Target power level
`violation_type`	Type of violation pattern

Methods:

Method	Description
`plot(ax, show_mdv, show_target)`	Plot power curve
`to_dataframe()`	Convert to DataFrame with M and power columns

Data Preparation Functions

generate_did_data

generate_did_data(
    n_units=100,          # Number of units
    n_periods=4,          # Number of time periods
    treatment_effect=5.0, # True ATT
    treatment_fraction=0.5,  # Fraction treated
    treatment_period=2,   # First post-treatment period
    unit_fe_sd=2.0,       # Unit fixed effect std dev
    time_trend=0.5,       # Linear time trend
    noise_sd=1.0,         # Idiosyncratic noise std dev
    seed=None             # Random seed
)

Returns DataFrame with columns: unit, period, treated, post, outcome, true_effect.

make_treatment_indicator

make_treatment_indicator(
    data,                 # Input DataFrame
    column,               # Column to create treatment from
    treated_values=None,  # Value(s) indicating treatment
    threshold=None,       # Numeric threshold for treatment
    above_threshold=True, # If True, >= threshold is treated
    new_column='treated'  # Output column name
)

make_post_indicator

make_post_indicator(
    data,                  # Input DataFrame
    time_column,           # Time/period column
    post_periods=None,     # Specific post-treatment period(s)
    treatment_start=None,  # First post-treatment period
    new_column='post'      # Output column name
)

wide_to_long

wide_to_long(
    data,                  # Wide-format DataFrame
    value_columns,         # List of time-varying columns
    id_column,             # Unit identifier column
    time_name='period',    # Name for time column
    value_name='value',    # Name for value column
    time_values=None       # Values for time periods
)

balance_panel

balance_panel(
    data,                  # Panel DataFrame
    unit_column,           # Unit identifier column
    time_column,           # Time period column
    method='inner',        # 'inner', 'outer', or 'fill'
    fill_value=None        # Value for filling (if method='fill')
)

validate_did_data

validate_did_data(
    data,                  # DataFrame to validate
    outcome,               # Outcome column name
    treatment,             # Treatment column name
    time,                  # Time/post column name
    unit=None,             # Unit column (for panel validation)
    raise_on_error=True    # Raise ValueError or return dict
)

Returns dict with valid, errors, warnings, and summary keys.

summarize_did_data

summarize_did_data(
    data,                  # Input DataFrame
    outcome,               # Outcome column name
    treatment,             # Treatment column name
    time,                  # Time/post column name
    unit=None              # Unit column (optional)
)

Returns DataFrame with summary statistics by treatment-time cell.

create_event_time

create_event_time(
    data,                  # Panel DataFrame
    time_column,           # Calendar time column
    treatment_time_column, # Column with treatment timing
    new_column='event_time' # Output column name
)

aggregate_to_cohorts

aggregate_to_cohorts(
    data,                  # Unit-level panel data
    unit_column,           # Unit identifier column
    time_column,           # Time period column
    treatment_column,      # Treatment indicator column
    outcome,               # Outcome variable column
    covariates=None        # Additional columns to aggregate
)

rank_control_units

rank_control_units(
    data,                          # Panel data in long format
    unit_column,                   # Unit identifier column
    time_column,                   # Time period column
    outcome_column,                # Outcome variable column
    treatment_column=None,         # Treatment indicator column (0/1)
    treated_units=None,            # Explicit list of treated unit IDs
    pre_periods=None,              # Pre-treatment periods (default: first half)
    covariates=None,               # Covariate columns for matching
    outcome_weight=0.7,            # Weight for outcome trend similarity (0-1)
    covariate_weight=0.3,          # Weight for covariate distance (0-1)
    exclude_units=None,            # Units to exclude from control pool
    require_units=None,            # Units that must appear in output
    n_top=None,                    # Return only top N controls
    suggest_treatment_candidates=False,  # Identify treatment candidates
    n_treatment_candidates=5,      # Number of treatment candidates
    lambda_reg=0.0                 # Regularization for synthetic weights
)

Returns DataFrame with columns: unit, quality_score, outcome_trend_score, covariate_score, synthetic_weight, pre_trend_rmse, is_required.

Requirements

Python 3.9 - 3.14
numpy >= 1.20
pandas >= 1.3
scipy >= 1.7

Development

# Install with dev dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Format code
black diff_diff tests
ruff check diff_diff tests

References

This library implements methods from the following scholarly works:

Difference-in-Differences

Ashenfelter, O., & Card, D. (1985). "Using the Longitudinal Structure of Earnings to Estimate the Effect of Training Programs." The Review of Economics and Statistics, 67(4), 648-660. https://doi.org/10.2307/1924810
Card, D., & Krueger, A. B. (1994). "Minimum Wages and Employment: A Case Study of the Fast-Food Industry in New Jersey and Pennsylvania." The American Economic Review, 84(4), 772-793. https://www.jstor.org/stable/2118030
Angrist, J. D., & Pischke, J.-S. (2009). Mostly Harmless Econometrics: An Empiricist's Companion. Princeton University Press. Chapter 5: Differences-in-Differences.

Two-Way Fixed Effects

Wooldridge, J. M. (2010). Econometric Analysis of Cross Section and Panel Data (2nd ed.). MIT Press.
Imai, K., & Kim, I. S. (2021). "On the Use of Two-Way Fixed Effects Regression Models for Causal Inference with Panel Data." Political Analysis, 29(3), 405-415. https://doi.org/10.1017/pan.2020.33

Robust Standard Errors

White, H. (1980). "A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity." Econometrica, 48(4), 817-838. https://doi.org/10.2307/1912934
MacKinnon, J. G., & White, H. (1985). "Some Heteroskedasticity-Consistent Covariance Matrix Estimators with Improved Finite Sample Properties." Journal of Econometrics, 29(3), 305-325. https://doi.org/10.1016/0304-4076(85)90158-7
Cameron, A. C., Gelbach, J. B., & Miller, D. L. (2011). "Robust Inference With Multiway Clustering." Journal of Business & Economic Statistics, 29(2), 238-249. https://doi.org/10.1198/jbes.2010.07136

Wild Cluster Bootstrap

Cameron, A. C., Gelbach, J. B., & Miller, D. L. (2008). "Bootstrap-Based Improvements for Inference with Clustered Errors." The Review of Economics and Statistics, 90(3), 414-427. https://doi.org/10.1162/rest.90.3.414
Webb, M. D. (2014). "Reworking Wild Bootstrap Based Inference for Clustered Errors." Queen's Economics Department Working Paper No. 1315. https://www.econ.queensu.ca/sites/econ.queensu.ca/files/qed_wp_1315.pdf
MacKinnon, J. G., & Webb, M. D. (2018). "The Wild Bootstrap for Few (Treated) Clusters." The Econometrics Journal, 21(2), 114-135. https://doi.org/10.1111/ectj.12107

Placebo Tests and DiD Diagnostics

Bertrand, M., Duflo, E., & Mullainathan, S. (2004). "How Much Should We Trust Differences-in-Differences Estimates?" The Quarterly Journal of Economics, 119(1), 249-275. https://doi.org/10.1162/003355304772839588

Synthetic Control Method

Abadie, A., & Gardeazabal, J. (2003). "The Economic Costs of Conflict: A Case Study of the Basque Country." The American Economic Review, 93(1), 113-132. https://doi.org/10.1257/000282803321455188
Abadie, A., Diamond, A., & Hainmueller, J. (2010). "Synthetic Control Methods for Comparative Case Studies: Estimating the Effect of California's Tobacco Control Program." Journal of the American Statistical Association, 105(490), 493-505. https://doi.org/10.1198/jasa.2009.ap08746
Abadie, A., Diamond, A., & Hainmueller, J. (2015). "Comparative Politics and the Synthetic Control Method." American Journal of Political Science, 59(2), 495-510. https://doi.org/10.1111/ajps.12116

Synthetic Difference-in-Differences

Arkhangelsky, D., Athey, S., Hirshberg, D. A., Imbens, G. W., & Wager, S. (2021). "Synthetic Difference-in-Differences." American Economic Review, 111(12), 4088-4118. https://doi.org/10.1257/aer.20190159

Triply Robust Panel (TROP)

Athey, S., Imbens, G. W., Qu, Z., & Viviano, D. (2025). "Triply Robust Panel Estimators." Working Paper. https://arxiv.org/abs/2508.21536

This paper introduces the TROP estimator which combines three robustness components:
- Factor model adjustment: Low-rank factor structure via SVD removes unobserved confounders
- Unit weights: Synthetic control style weighting for optimal comparison
- Time weights: SDID style time weighting for informative pre-periods
TROP is particularly useful when there are unobserved time-varying confounders with a factor structure that affect different units differently over time.

Triple Difference (DDD)

Ortiz-Villavicencio, M., & Sant'Anna, P. H. C. (2025). "Better Understanding Triple Differences Estimators." Working Paper. https://arxiv.org/abs/2505.09942

This paper shows that common DDD implementations (taking the difference between two DiDs, or applying three-way fixed effects regressions) are generally invalid when identification requires conditioning on covariates. The TripleDifference class implements their regression adjustment, inverse probability weighting, and doubly robust estimators.
Gruber, J. (1994). "The Incidence of Mandated Maternity Benefits." American Economic Review, 84(3), 622-641. https://www.jstor.org/stable/2118071

Classic paper introducing the Triple Difference design for policy evaluation.
Olden, A., & Møen, J. (2022). "The Triple Difference Estimator." The Econometrics Journal, 25(3), 531-553. https://doi.org/10.1093/ectj/utac010

Parallel Trends and Pre-Trend Testing

Roth, J. (2022). "Pretest with Caution: Event-Study Estimates after Testing for Parallel Trends." American Economic Review: Insights, 4(3), 305-322. https://doi.org/10.1257/aeri.20210236
Lakens, D. (2017). "Equivalence Tests: A Practical Primer for t Tests, Correlations, and Meta-Analyses." Social Psychological and Personality Science, 8(4), 355-362. https://doi.org/10.1177/1948550617697177

Honest DiD / Sensitivity Analysis

The HonestDiD module implements sensitivity analysis methods for relaxing the parallel trends assumption:

Rambachan, A., & Roth, J. (2023). "A More Credible Approach to Parallel Trends." The Review of Economic Studies, 90(5), 2555-2591. https://doi.org/10.1093/restud/rdad018

This paper introduces the "Honest DiD" framework implemented in our HonestDiD class:
- Relative Magnitudes (ΔRM): Bounds post-treatment violations by a multiple of observed pre-treatment violations
- Smoothness (ΔSD): Bounds on second differences of trend violations, allowing for linear extrapolation of pre-trends
- Breakdown Analysis: Finding the smallest violation magnitude that would overturn conclusions
- Robust Confidence Intervals: Valid inference under partial identification
Roth, J., & Sant'Anna, P. H. C. (2023). "When Is Parallel Trends Sensitive to Functional Form?" Econometrica, 91(2), 737-747. https://doi.org/10.3982/ECTA19402

Discusses functional form sensitivity in parallel trends assumptions, relevant to understanding when smoothness restrictions are appropriate.

Multi-Period and Staggered Adoption

Borusyak, K., Jaravel, X., & Spiess, J. (2024). "Revisiting Event-Study Designs: Robust and Efficient Estimation." Review of Economic Studies, 91(6), 3253-3285. https://doi.org/10.1093/restud/rdae007

This paper introduces the imputation estimator implemented in our ImputationDiD class:
- Efficient imputation: OLS on untreated observations → impute counterfactuals → aggregate
- Conservative variance: Theorem 3 clustered variance estimator with auxiliary model
- Pre-trend test: Independent of treatment effect estimation (Proposition 9)
- Efficiency gains: ~50% shorter CIs than Callaway-Sant'Anna under homogeneous effects
Callaway, B., & Sant'Anna, P. H. C. (2021). "Difference-in-Differences with Multiple Time Periods." Journal of Econometrics, 225(2), 200-230. https://doi.org/10.1016/j.jeconom.2020.12.001
Sant'Anna, P. H. C., & Zhao, J. (2020). "Doubly Robust Difference-in-Differences Estimators." Journal of Econometrics, 219(1), 101-122. https://doi.org/10.1016/j.jeconom.2020.06.003
Sun, L., & Abraham, S. (2021). "Estimating Dynamic Treatment Effects in Event Studies with Heterogeneous Treatment Effects." Journal of Econometrics, 225(2), 175-199. https://doi.org/10.1016/j.jeconom.2020.09.006
Gardner, J. (2022). "Two-stage differences in differences." arXiv preprint arXiv:2207.05943. https://arxiv.org/abs/2207.05943
Butts, K., & Gardner, J. (2022). "did2s: Two-Stage Difference-in-Differences." The R Journal, 14(1), 162-173. https://doi.org/10.32614/RJ-2022-048
de Chaisemartin, C., & D'Haultfœuille, X. (2020). "Two-Way Fixed Effects Estimators with Heterogeneous Treatment Effects." American Economic Review, 110(9), 2964-2996. https://doi.org/10.1257/aer.20181169
Goodman-Bacon, A. (2021). "Difference-in-Differences with Variation in Treatment Timing." Journal of Econometrics, 225(2), 254-277. https://doi.org/10.1016/j.jeconom.2021.03.014
Wing, C., Freedman, S. M., & Hollingsworth, A. (2024). "Stacked Difference-in-Differences." NBER Working Paper 32054. https://www.nber.org/papers/w32054

Power Analysis

Bloom, H. S. (1995). "Minimum Detectable Effects: A Simple Way to Report the Statistical Power of Experimental Designs." Evaluation Review, 19(5), 547-556. https://doi.org/10.1177/0193841X9501900504
Burlig, F., Preonas, L., & Woerman, M. (2020). "Panel Data and Experimental Design." Journal of Development Economics, 144, 102458. https://doi.org/10.1016/j.jdeveco.2020.102458

Essential reference for power analysis in panel DiD designs. Discusses how serial correlation (ICC) affects power and provides formulas for panel data settings.
Djimeu, E. W., & Houndolo, D.-G. (2016). "Power Calculation for Causal Inference in Social Science: Sample Size and Minimum Detectable Effect Determination." Journal of Development Effectiveness, 8(4), 508-527. https://doi.org/10.1080/19439342.2016.1244555

General Causal Inference

Imbens, G. W., & Rubin, D. B. (2015). Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction. Cambridge University Press.
Cunningham, S. (2021). Causal Inference: The Mixtape. Yale University Press. https://mixtape.scunning.com/

Citing diff-diff

If you use diff-diff in your research, please cite it:

@software{diff_diff,
  title = {diff-diff: Difference-in-Differences Causal Inference for Python},
  author = {{diff-diff contributors}},
  url = {https://github.com/igerber/diff-diff},
  license = {MIT},
}

See CITATION.cff for the full citation metadata.

License

MIT License

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

igerber

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

3.1.3

Apr 18, 2026

This version

3.1.2

Apr 18, 2026

3.1.1

Apr 16, 2026

3.1.0

Apr 14, 2026

3.0.2

Apr 13, 2026

3.0.1

Apr 7, 2026

3.0.0

Apr 7, 2026

2.9.1

Apr 6, 2026

2.9.0

Apr 5, 2026

2.8.4

Apr 4, 2026

2.8.3

Apr 2, 2026

2.8.2

Apr 2, 2026

2.8.1

Apr 2, 2026

2.8.0

Mar 31, 2026

2.7.6

Mar 29, 2026

2.7.5

Mar 24, 2026

2.7.4

Mar 21, 2026

2.7.3

Mar 20, 2026

2.7.2

Mar 18, 2026

2.7.1

Mar 16, 2026

2.7.0

Mar 15, 2026

2.6.1

Mar 8, 2026

2.6.0

Feb 22, 2026

2.5.0

Feb 19, 2026

2.4.3

Feb 19, 2026

2.4.2

Feb 18, 2026

2.4.1

Feb 17, 2026

2.4.0

Feb 16, 2026

2.3.2

Feb 16, 2026

2.3.1

Feb 15, 2026

2.3.0

Feb 9, 2026

2.2.1

Feb 8, 2026

2.2.0

Jan 27, 2026

2.1.9

Jan 26, 2026

2.1.8

Jan 25, 2026

2.1.7

Jan 25, 2026

2.1.6

Jan 24, 2026

2.1.5

Jan 22, 2026

2.1.4

Jan 21, 2026

2.1.3

Jan 19, 2026

2.1.2

Jan 19, 2026

2.1.1

Jan 19, 2026

2.1.0

Jan 18, 2026

2.0.4

Jan 17, 2026

2.0.3

Jan 17, 2026

2.0.2

Jan 15, 2026

2.0.1

Jan 13, 2026

2.0.0

Jan 12, 2026

1.4.0

Jan 11, 2026

1.3.1

Jan 10, 2026

1.3.0

Jan 9, 2026

1.2.1

Jan 8, 2026

1.2.0

Jan 7, 2026

1.1.0

Jan 5, 2026

1.0.0

Jan 4, 2026

0.6.0

Jan 4, 2026

0.5.0

Jan 3, 2026

0.4.0

Jan 3, 2026

0.3.0

Jan 3, 2026

0.2.0

Jan 2, 2026

0.1.0

Jan 2, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

diff_diff-3.1.2.tar.gz (695.2 kB view details)

Uploaded Apr 18, 2026 Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

diff_diff-3.1.2-cp314-cp314-win_amd64.whl (1.6 MB view details)

Uploaded Apr 18, 2026 CPython 3.14Windows x86-64

diff_diff-3.1.2-cp314-cp314-manylinux_2_28_x86_64.whl (14.1 MB view details)

Uploaded Apr 18, 2026 CPython 3.14manylinux: glibc 2.28+ x86-64

diff_diff-3.1.2-cp314-cp314-manylinux_2_28_aarch64.whl (6.5 MB view details)

Uploaded Apr 18, 2026 CPython 3.14manylinux: glibc 2.28+ ARM64

diff_diff-3.1.2-cp314-cp314-macosx_11_0_arm64.whl (1.3 MB view details)

Uploaded Apr 18, 2026 CPython 3.14macOS 11.0+ ARM64

diff_diff-3.1.2-cp313-cp313-win_amd64.whl (1.6 MB view details)

Uploaded Apr 18, 2026 CPython 3.13Windows x86-64

diff_diff-3.1.2-cp313-cp313-manylinux_2_28_x86_64.whl (14.1 MB view details)

Uploaded Apr 18, 2026 CPython 3.13manylinux: glibc 2.28+ x86-64

diff_diff-3.1.2-cp313-cp313-manylinux_2_28_aarch64.whl (6.5 MB view details)

Uploaded Apr 18, 2026 CPython 3.13manylinux: glibc 2.28+ ARM64

diff_diff-3.1.2-cp313-cp313-macosx_11_0_arm64.whl (1.3 MB view details)

Uploaded Apr 18, 2026 CPython 3.13macOS 11.0+ ARM64

diff_diff-3.1.2-cp312-cp312-win_amd64.whl (1.6 MB view details)

Uploaded Apr 18, 2026 CPython 3.12Windows x86-64

diff_diff-3.1.2-cp312-cp312-manylinux_2_28_x86_64.whl (14.1 MB view details)

Uploaded Apr 18, 2026 CPython 3.12manylinux: glibc 2.28+ x86-64

diff_diff-3.1.2-cp312-cp312-manylinux_2_28_aarch64.whl (6.5 MB view details)

Uploaded Apr 18, 2026 CPython 3.12manylinux: glibc 2.28+ ARM64

diff_diff-3.1.2-cp312-cp312-macosx_11_0_arm64.whl (1.3 MB view details)

Uploaded Apr 18, 2026 CPython 3.12macOS 11.0+ ARM64

diff_diff-3.1.2-cp311-cp311-win_amd64.whl (1.6 MB view details)

Uploaded Apr 18, 2026 CPython 3.11Windows x86-64

diff_diff-3.1.2-cp311-cp311-manylinux_2_28_x86_64.whl (14.1 MB view details)

Uploaded Apr 18, 2026 CPython 3.11manylinux: glibc 2.28+ x86-64

diff_diff-3.1.2-cp311-cp311-manylinux_2_28_aarch64.whl (6.5 MB view details)

Uploaded Apr 18, 2026 CPython 3.11manylinux: glibc 2.28+ ARM64

diff_diff-3.1.2-cp311-cp311-macosx_11_0_arm64.whl (1.3 MB view details)

Uploaded Apr 18, 2026 CPython 3.11macOS 11.0+ ARM64

diff_diff-3.1.2-cp310-cp310-win_amd64.whl (1.6 MB view details)

Uploaded Apr 18, 2026 CPython 3.10Windows x86-64

diff_diff-3.1.2-cp310-cp310-manylinux_2_28_x86_64.whl (14.1 MB view details)

Uploaded Apr 18, 2026 CPython 3.10manylinux: glibc 2.28+ x86-64

diff_diff-3.1.2-cp310-cp310-manylinux_2_28_aarch64.whl (6.5 MB view details)

Uploaded Apr 18, 2026 CPython 3.10manylinux: glibc 2.28+ ARM64

diff_diff-3.1.2-cp310-cp310-macosx_11_0_arm64.whl (1.3 MB view details)

Uploaded Apr 18, 2026 CPython 3.10macOS 11.0+ ARM64

diff_diff-3.1.2-cp39-cp39-win_amd64.whl (1.6 MB view details)

Uploaded Apr 18, 2026 CPython 3.9Windows x86-64

diff_diff-3.1.2-cp39-cp39-manylinux_2_28_x86_64.whl (14.1 MB view details)

Uploaded Apr 18, 2026 CPython 3.9manylinux: glibc 2.28+ x86-64

diff_diff-3.1.2-cp39-cp39-manylinux_2_28_aarch64.whl (6.5 MB view details)

Uploaded Apr 18, 2026 CPython 3.9manylinux: glibc 2.28+ ARM64

diff_diff-3.1.2-cp39-cp39-macosx_11_0_arm64.whl (1.3 MB view details)

Uploaded Apr 18, 2026 CPython 3.9macOS 11.0+ ARM64

File details

Details for the file diff_diff-3.1.2.tar.gz.

File metadata

Download URL: diff_diff-3.1.2.tar.gz
Upload date: Apr 18, 2026
Size: 695.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2.tar.gz
Algorithm	Hash digest
SHA256	`c2c29823b0b62bdd04fca9b144b0bdf714f2e6f59115f711a8412b54ba5c71cd`
MD5	`fd72e01c14f4d20c6b6ed0dd3a65fcc5`
BLAKE2b-256	`0b2e705f8c6e73ffd6a737bb2d2b94358f54f48bb9ffb5a63124f3274c1939bb`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2.tar.gz:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2.tar.gz
- Subject digest: c2c29823b0b62bdd04fca9b144b0bdf714f2e6f59115f711a8412b54ba5c71cd
- Sigstore transparency entry: 1339522769
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp314-cp314-win_amd64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp314-cp314-win_amd64.whl
Upload date: Apr 18, 2026
Size: 1.6 MB
Tags: CPython 3.14, Windows x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp314-cp314-win_amd64.whl
Algorithm	Hash digest
SHA256	`bddff4ebcfd6b5c38513ecd4aa5a888cb89800c1db4d15f3c75e792eebca0e5a`
MD5	`89d7f39a7758de3dea3e069f33d8f3ca`
BLAKE2b-256	`961bcef9a15bf2a49dcaa5e1535edcd122d5f460ab89ae04740905eda50a4077`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp314-cp314-win_amd64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp314-cp314-win_amd64.whl
- Subject digest: bddff4ebcfd6b5c38513ecd4aa5a888cb89800c1db4d15f3c75e792eebca0e5a
- Sigstore transparency entry: 1339523753
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp314-cp314-manylinux_2_28_x86_64.whl
Upload date: Apr 18, 2026
Size: 14.1 MB
Tags: CPython 3.14, manylinux: glibc 2.28+ x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm	Hash digest
SHA256	`09b7a11b86e6bc32b4de23d129aca56c5d8ae33929241ef5bb15fb2faa18bf69`
MD5	`5fd78af9e063f3a8d0f539a8de734cf2`
BLAKE2b-256	`5c9a29e98faca4e5c42dce94ccdd01e1ad813eb9b1122229e386eebeac33afc2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp314-cp314-manylinux_2_28_x86_64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp314-cp314-manylinux_2_28_x86_64.whl
- Subject digest: 09b7a11b86e6bc32b4de23d129aca56c5d8ae33929241ef5bb15fb2faa18bf69
- Sigstore transparency entry: 1339523954
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp314-cp314-manylinux_2_28_aarch64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp314-cp314-manylinux_2_28_aarch64.whl
Upload date: Apr 18, 2026
Size: 6.5 MB
Tags: CPython 3.14, manylinux: glibc 2.28+ ARM64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp314-cp314-manylinux_2_28_aarch64.whl
Algorithm	Hash digest
SHA256	`755d137fc84c0576ad82155ff24e1d7e326e06ebb7d4a3e664b6e1c5403416c0`
MD5	`c5f5a10655832f85691bce6f9d230435`
BLAKE2b-256	`519d4e4f86aed6196c6dd19560c0da99e10a4c6eb2a9b7c0b989dcc27d7ac56b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp314-cp314-manylinux_2_28_aarch64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp314-cp314-manylinux_2_28_aarch64.whl
- Subject digest: 755d137fc84c0576ad82155ff24e1d7e326e06ebb7d4a3e664b6e1c5403416c0
- Sigstore transparency entry: 1339523688
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp314-cp314-macosx_11_0_arm64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp314-cp314-macosx_11_0_arm64.whl
Upload date: Apr 18, 2026
Size: 1.3 MB
Tags: CPython 3.14, macOS 11.0+ ARM64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp314-cp314-macosx_11_0_arm64.whl
Algorithm	Hash digest
SHA256	`6d922a1d08c84cc1a6c8f5d70cd5c18d817e5a263c39a94bfb6628661109adbe`
MD5	`04d91f0b409c30438d1473eee524a0c1`
BLAKE2b-256	`e82f6ffb9d9a05d42e3771075ff3848707ff9147fba1b48b0463a449dbbc3d36`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp314-cp314-macosx_11_0_arm64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp314-cp314-macosx_11_0_arm64.whl
- Subject digest: 6d922a1d08c84cc1a6c8f5d70cd5c18d817e5a263c39a94bfb6628661109adbe
- Sigstore transparency entry: 1339523041
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp313-cp313-win_amd64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp313-cp313-win_amd64.whl
Upload date: Apr 18, 2026
Size: 1.6 MB
Tags: CPython 3.13, Windows x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp313-cp313-win_amd64.whl
Algorithm	Hash digest
SHA256	`f9cf1fba40c34f15364ceb90a081b1adc71b6a55df1799f6f9afd0f9debe7379`
MD5	`c1cfebb5b22a4ec6ed844ce3f7de3fd3`
BLAKE2b-256	`237f40897aeddfcbfffa6bc174f416c740e17acfab2e3665a9ba74c0b659373c`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp313-cp313-win_amd64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp313-cp313-win_amd64.whl
- Subject digest: f9cf1fba40c34f15364ceb90a081b1adc71b6a55df1799f6f9afd0f9debe7379
- Sigstore transparency entry: 1339523564
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp313-cp313-manylinux_2_28_x86_64.whl
Upload date: Apr 18, 2026
Size: 14.1 MB
Tags: CPython 3.13, manylinux: glibc 2.28+ x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm	Hash digest
SHA256	`cb39aec1ddd218c365fd35d44a455104e8162e9f397e2c9a5827ff24f0a065ae`
MD5	`64c42db582d877b7c15f3caf204df501`
BLAKE2b-256	`5ef376d12744f6f1668c365987cf3ef5c8fe0adf3e6019a18a35397001029819`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp313-cp313-manylinux_2_28_x86_64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp313-cp313-manylinux_2_28_x86_64.whl
- Subject digest: cb39aec1ddd218c365fd35d44a455104e8162e9f397e2c9a5827ff24f0a065ae
- Sigstore transparency entry: 1339522855
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp313-cp313-manylinux_2_28_aarch64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp313-cp313-manylinux_2_28_aarch64.whl
Upload date: Apr 18, 2026
Size: 6.5 MB
Tags: CPython 3.13, manylinux: glibc 2.28+ ARM64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm	Hash digest
SHA256	`8ed1074cbc4f554c8f578657c9fff3767d4b6fe39cdfa490540e0d870e346826`
MD5	`01ace74783a3bc33844e0be31c0b6560`
BLAKE2b-256	`2ef3bd824da507c35605a5d0380f672679dcfc9798698011edb66ffac436afcb`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp313-cp313-manylinux_2_28_aarch64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp313-cp313-manylinux_2_28_aarch64.whl
- Subject digest: 8ed1074cbc4f554c8f578657c9fff3767d4b6fe39cdfa490540e0d870e346826
- Sigstore transparency entry: 1339523995
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp313-cp313-macosx_11_0_arm64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp313-cp313-macosx_11_0_arm64.whl
Upload date: Apr 18, 2026
Size: 1.3 MB
Tags: CPython 3.13, macOS 11.0+ ARM64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp313-cp313-macosx_11_0_arm64.whl
Algorithm	Hash digest
SHA256	`d44e6451f95cefb347255a6821b18db1ec3ec740db9d1f6838a84b0e579c5434`
MD5	`01d7bcd82daae71ac71159f77010fa93`
BLAKE2b-256	`3bd38762ef5222952647f6a04fc7b3cff8927cf318e6303f9e5541a3d4415404`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp313-cp313-macosx_11_0_arm64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp313-cp313-macosx_11_0_arm64.whl
- Subject digest: d44e6451f95cefb347255a6821b18db1ec3ec740db9d1f6838a84b0e579c5434
- Sigstore transparency entry: 1339523339
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp312-cp312-win_amd64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp312-cp312-win_amd64.whl
Upload date: Apr 18, 2026
Size: 1.6 MB
Tags: CPython 3.12, Windows x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp312-cp312-win_amd64.whl
Algorithm	Hash digest
SHA256	`e68b846809c303e8efd3652a9ab2ca0410f4de91657e3d849145804eedc400ef`
MD5	`155a0389e2d04e57655b3e0bd1b9a170`
BLAKE2b-256	`56dd871da31fd40155b1e6a04da215001460744b7455eb442ce37b67a7b609a2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp312-cp312-win_amd64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp312-cp312-win_amd64.whl
- Subject digest: e68b846809c303e8efd3652a9ab2ca0410f4de91657e3d849145804eedc400ef
- Sigstore transparency entry: 1339524245
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp312-cp312-manylinux_2_28_x86_64.whl
Upload date: Apr 18, 2026
Size: 14.1 MB
Tags: CPython 3.12, manylinux: glibc 2.28+ x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm	Hash digest
SHA256	`d6a3be00d457962c53181fec20c472749a95c6448ea0af1ba8bac6615c32665c`
MD5	`9f8a9ddbab0f21c4337d2ef16fcfab16`
BLAKE2b-256	`0cf3399b086449648a9226f3d2fcb45fd484b867e63ed53b4dbf71a541538eb2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp312-cp312-manylinux_2_28_x86_64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp312-cp312-manylinux_2_28_x86_64.whl
- Subject digest: d6a3be00d457962c53181fec20c472749a95c6448ea0af1ba8bac6615c32665c
- Sigstore transparency entry: 1339524379
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp312-cp312-manylinux_2_28_aarch64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp312-cp312-manylinux_2_28_aarch64.whl
Upload date: Apr 18, 2026
Size: 6.5 MB
Tags: CPython 3.12, manylinux: glibc 2.28+ ARM64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm	Hash digest
SHA256	`90717349996079378a5a87ecac42ac23e3164065ab0b5a3a937266bce9dabeca`
MD5	`5404d3f28d21715acd32b68672f988e3`
BLAKE2b-256	`947f5eef77498f1c83a0a6955015fd6df628dc52e44fbe2c7ba7d475e8d7b5e3`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp312-cp312-manylinux_2_28_aarch64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp312-cp312-manylinux_2_28_aarch64.whl
- Subject digest: 90717349996079378a5a87ecac42ac23e3164065ab0b5a3a937266bce9dabeca
- Sigstore transparency entry: 1339524437
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp312-cp312-macosx_11_0_arm64.whl
Upload date: Apr 18, 2026
Size: 1.3 MB
Tags: CPython 3.12, macOS 11.0+ ARM64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp312-cp312-macosx_11_0_arm64.whl
Algorithm	Hash digest
SHA256	`57c54d9fc6f80133c0419149d6745635cb61f5cba4c7f1eab94add81dc88bfd5`
MD5	`9fe3b109a3031800ff4b390b660c7811`
BLAKE2b-256	`4bd36cbd3bfcad6f5a31148c2efefe857beccb3f628d5e677c26658902ce18fa`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp312-cp312-macosx_11_0_arm64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp312-cp312-macosx_11_0_arm64.whl
- Subject digest: 57c54d9fc6f80133c0419149d6745635cb61f5cba4c7f1eab94add81dc88bfd5
- Sigstore transparency entry: 1339523123
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp311-cp311-win_amd64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp311-cp311-win_amd64.whl
Upload date: Apr 18, 2026
Size: 1.6 MB
Tags: CPython 3.11, Windows x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp311-cp311-win_amd64.whl
Algorithm	Hash digest
SHA256	`33dada793fb2025a58360a7f7fb0b2bb4ac5a40ffb755da447b858fdbfa6bde8`
MD5	`3dc5b894a4ae9abd665be4256bb7b6db`
BLAKE2b-256	`1e59d462cc539f5920ed4385dc34e0974e1a4762392724573d9d84a95cf5cb98`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp311-cp311-win_amd64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp311-cp311-win_amd64.whl
- Subject digest: 33dada793fb2025a58360a7f7fb0b2bb4ac5a40ffb755da447b858fdbfa6bde8
- Sigstore transparency entry: 1339525143
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp311-cp311-manylinux_2_28_x86_64.whl
Upload date: Apr 18, 2026
Size: 14.1 MB
Tags: CPython 3.11, manylinux: glibc 2.28+ x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm	Hash digest
SHA256	`e54569c9c1da41d5795ccde6d747912075176a76dd5c41da4bd8fea2910a3115`
MD5	`4da3b4e3b06f114f1bfca869b8144cb3`
BLAKE2b-256	`11001c1a209efbd3b04ea276686e90298ab474e4ec1d03dc8efda973af1dea2d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp311-cp311-manylinux_2_28_x86_64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp311-cp311-manylinux_2_28_x86_64.whl
- Subject digest: e54569c9c1da41d5795ccde6d747912075176a76dd5c41da4bd8fea2910a3115
- Sigstore transparency entry: 1339524289
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp311-cp311-manylinux_2_28_aarch64.whl
Upload date: Apr 18, 2026
Size: 6.5 MB
Tags: CPython 3.11, manylinux: glibc 2.28+ ARM64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm	Hash digest
SHA256	`df82da05fce8ed3e89a84bf0c3b82a1ae8572be9ad582512dbb80b33db0399f4`
MD5	`b58b1a8dca51406b3c78fcffa45edc35`
BLAKE2b-256	`434347a6217f9eb2d246561bda60ade71e9556802aeacb72811aced21f5907d0`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp311-cp311-manylinux_2_28_aarch64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp311-cp311-manylinux_2_28_aarch64.whl
- Subject digest: df82da05fce8ed3e89a84bf0c3b82a1ae8572be9ad582512dbb80b33db0399f4
- Sigstore transparency entry: 1339524497
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp311-cp311-macosx_11_0_arm64.whl
Upload date: Apr 18, 2026
Size: 1.3 MB
Tags: CPython 3.11, macOS 11.0+ ARM64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp311-cp311-macosx_11_0_arm64.whl
Algorithm	Hash digest
SHA256	`8f70610dcbfa3f1d654d18eb9143d7bd98d191037eed7c00ef1e96a052c8cacf`
MD5	`0dcfd3c1fc4f2b67b7e127926f0be7dc`
BLAKE2b-256	`0875a917769e5cccf0000ab1c7ef99aec84ed68a76aa79b7985e01965c75720d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp311-cp311-macosx_11_0_arm64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp311-cp311-macosx_11_0_arm64.whl
- Subject digest: 8f70610dcbfa3f1d654d18eb9143d7bd98d191037eed7c00ef1e96a052c8cacf
- Sigstore transparency entry: 1339522946
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp310-cp310-win_amd64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp310-cp310-win_amd64.whl
Upload date: Apr 18, 2026
Size: 1.6 MB
Tags: CPython 3.10, Windows x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp310-cp310-win_amd64.whl
Algorithm	Hash digest
SHA256	`3f8451d49f1013b6c0077b99164dec4c23321c9dec6b07aa46c46f2855149380`
MD5	`5d1d7e1095676b1f4f22f93abcf047d5`
BLAKE2b-256	`1955e0da6d78d9905318ab9f4756dd1f34aed8e43f12c6e4878f1c13e8490f3c`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp310-cp310-win_amd64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp310-cp310-win_amd64.whl
- Subject digest: 3f8451d49f1013b6c0077b99164dec4c23321c9dec6b07aa46c46f2855149380
- Sigstore transparency entry: 1339524827
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp310-cp310-manylinux_2_28_x86_64.whl
Upload date: Apr 18, 2026
Size: 14.1 MB
Tags: CPython 3.10, manylinux: glibc 2.28+ x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm	Hash digest
SHA256	`40cf96f49ac096f20e4142bcec5475129827aff75c749ddb8a158e154a918178`
MD5	`25c8d944b8836ddeaea8c611eff8b777`
BLAKE2b-256	`4f9639c857e3b45ae7f8f45b5d1833d208dc1d8f3a4baf54e9c0a7fbc9f974b1`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp310-cp310-manylinux_2_28_x86_64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp310-cp310-manylinux_2_28_x86_64.whl
- Subject digest: 40cf96f49ac096f20e4142bcec5475129827aff75c749ddb8a158e154a918178
- Sigstore transparency entry: 1339524332
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp310-cp310-manylinux_2_28_aarch64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp310-cp310-manylinux_2_28_aarch64.whl
Upload date: Apr 18, 2026
Size: 6.5 MB
Tags: CPython 3.10, manylinux: glibc 2.28+ ARM64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm	Hash digest
SHA256	`322bcba941914ee4b691eb764f8f77f48c9a28f585422d401c47fe7774e4519f`
MD5	`540d9e565427b705aa4564fdd1ab5bcd`
BLAKE2b-256	`66daa4ef75214766da578889d5ef90fe9b2e1c78737a09d79ed4dde82044c82f`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp310-cp310-manylinux_2_28_aarch64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp310-cp310-manylinux_2_28_aarch64.whl
- Subject digest: 322bcba941914ee4b691eb764f8f77f48c9a28f585422d401c47fe7774e4519f
- Sigstore transparency entry: 1339524657
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp310-cp310-macosx_11_0_arm64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp310-cp310-macosx_11_0_arm64.whl
Upload date: Apr 18, 2026
Size: 1.3 MB
Tags: CPython 3.10, macOS 11.0+ ARM64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp310-cp310-macosx_11_0_arm64.whl
Algorithm	Hash digest
SHA256	`f61b91de69d0d623b78bcb7213bb35acaaacba59e9b747fce670d2b0c0e6f144`
MD5	`0e3c869113e7ec23841eeac038ca6fa6`
BLAKE2b-256	`66ea1c2a42e607b95c732110463085a4261336e54ac56a6ac17951619fb6ee5b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp310-cp310-macosx_11_0_arm64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp310-cp310-macosx_11_0_arm64.whl
- Subject digest: f61b91de69d0d623b78bcb7213bb35acaaacba59e9b747fce670d2b0c0e6f144
- Sigstore transparency entry: 1339524060
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp39-cp39-win_amd64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp39-cp39-win_amd64.whl
Upload date: Apr 18, 2026
Size: 1.6 MB
Tags: CPython 3.9, Windows x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp39-cp39-win_amd64.whl
Algorithm	Hash digest
SHA256	`5cc88cf54c1894fbab6f0e5b6fe3812fb6bbe610ea35fbf09428e670aec5f56a`
MD5	`a255328ac84e81b27fc06132abeb44e4`
BLAKE2b-256	`cf4cff5c83d484c236f148a52dad9f64c54aa546fde6e5d20493f52a53730a4e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp39-cp39-win_amd64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp39-cp39-win_amd64.whl
- Subject digest: 5cc88cf54c1894fbab6f0e5b6fe3812fb6bbe610ea35fbf09428e670aec5f56a
- Sigstore transparency entry: 1339523639
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp39-cp39-manylinux_2_28_x86_64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp39-cp39-manylinux_2_28_x86_64.whl
Upload date: Apr 18, 2026
Size: 14.1 MB
Tags: CPython 3.9, manylinux: glibc 2.28+ x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp39-cp39-manylinux_2_28_x86_64.whl
Algorithm	Hash digest
SHA256	`182765b7747b482465817e5fe9144ab4acc2b83813ccb06fadeb04b5d94a9a6c`
MD5	`2edcbf803d8c62b88539fdd45a04e14c`
BLAKE2b-256	`33bf97b1519ac970cf5832573977e8914c746dac35ec8419264a35a145afc70f`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp39-cp39-manylinux_2_28_x86_64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp39-cp39-manylinux_2_28_x86_64.whl
- Subject digest: 182765b7747b482465817e5fe9144ab4acc2b83813ccb06fadeb04b5d94a9a6c
- Sigstore transparency entry: 1339525056
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp39-cp39-manylinux_2_28_aarch64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp39-cp39-manylinux_2_28_aarch64.whl
Upload date: Apr 18, 2026
Size: 6.5 MB
Tags: CPython 3.9, manylinux: glibc 2.28+ ARM64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp39-cp39-manylinux_2_28_aarch64.whl
Algorithm	Hash digest
SHA256	`6f96ad7ba219abfc77bd4a114678a9babd71fe96ef88128a47a5fa9dec39429c`
MD5	`72221d9fbe0383bff75e7032bfdc1c6b`
BLAKE2b-256	`2eb6d4e0975b0c14fd0e2d954cfb20595767b1ac95e33d1898c593ca3ea245af`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp39-cp39-manylinux_2_28_aarch64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp39-cp39-manylinux_2_28_aarch64.whl
- Subject digest: 6f96ad7ba219abfc77bd4a114678a9babd71fe96ef88128a47a5fa9dec39429c
- Sigstore transparency entry: 1339524719
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

File details

Details for the file diff_diff-3.1.2-cp39-cp39-macosx_11_0_arm64.whl.

File metadata

Download URL: diff_diff-3.1.2-cp39-cp39-macosx_11_0_arm64.whl
Upload date: Apr 18, 2026
Size: 1.3 MB
Tags: CPython 3.9, macOS 11.0+ ARM64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for diff_diff-3.1.2-cp39-cp39-macosx_11_0_arm64.whl
Algorithm	Hash digest
SHA256	`b8155772d6e3b5a9f5d4dba26233d847a79d4586c47e17c76a52509d791befc8`
MD5	`e3c0833e830078aeca16140602a9ed08`
BLAKE2b-256	`0dff0dae9490873fc26f2b8351a55c34ec45a6030b5cc28a7f5cce9706d4ac18`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diff_diff-3.1.2-cp39-cp39-macosx_11_0_arm64.whl:

Publisher: publish.yml on igerber/diff-diff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diff_diff-3.1.2-cp39-cp39-macosx_11_0_arm64.whl
- Subject digest: b8155772d6e3b5a9f5d4dba26233d847a79d4586c47e17c76a52509d791befc8
- Sigstore transparency entry: 1339524945
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: igerber/diff-diff@ba790b0a05db83f6789ef63d839792a79d328102
- Branch / Tag: refs/tags/v3.1.2
- Owner: https://github.com/igerber
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ba790b0a05db83f6789ef63d839792a79d328102
- Trigger Event: release

diff-diff 3.1.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

diff-diff

Installation

Quick Start

For AI Agents

For Data Scientists

Features

Estimator Aliases

Tutorials

Data Preparation

Generate Sample Data

Create Treatment Indicators

Create Post-Treatment Indicators

Reshape Wide to Long Format

Balance Panel Data

Validate Data

Summarize Data by Groups

Create Event Time for Staggered Designs

Aggregate to Cohort Means

Rank Control Units

Usage

Basic DiD with Column Names

Using Formula Interface

Including Covariates

Fixed Effects

Cluster-Robust Standard Errors

Wild Cluster Bootstrap

Two-Way Fixed Effects (Panel Data)

Multi-Period DiD (Event Study)

Staggered Difference-in-Differences (Callaway-Sant'Anna)

Sun-Abraham Interaction-Weighted Estimator

Borusyak-Jaravel-Spiess Imputation Estimator

Two-Stage DiD (Gardner 2022)

Stacked DiD (Wing, Freedman & Hollingsworth 2024)

Efficient DiD (Chen, Sant'Anna & Xie 2025)

de Chaisemartin-D'Haultfœuille (dCDH) for Reversible Treatments

Triple Difference (DDD)

Event Study Visualization

Synthetic Difference-in-Differences

When to Use Synthetic DiD Over Vanilla DiD

Triply Robust Panel (TROP)

When to Use TROP Over Synthetic DiD

Working with Results

Export Results

Check Significance

Access Full Regression Output

Checking Assumptions

Parallel Trends

Honest DiD Sensitivity Analysis (Rambachan-Roth)

Pre-Trends Power Analysis (Roth 2022)

Placebo Tests

API Reference

DifferenceInDifferences

DiDResults

MultiPeriodDiD

MultiPeriodDiDResults

PeriodEffect

SyntheticDiD

SyntheticDiDResults

TROP

TROPResults

SunAbraham

SunAbrahamResults

ImputationDiD

ImputationDiDResults

TwoStageDiD

TwoStageDiDResults

StackedDiD

StackedDiDResults

TripleDifference