Fast counterfactual estimators for panel data — Python reimplementation of R fect

These details have not been verified by PyPI

Project description

pyfector

Alpha (v0.1.x) -- This package is in early development. Results have been validated against R fect on synthetic data, but edge cases may remain. APIs are not stable and may change without notice. Please verify critical results independently. Bug reports, feature requests, and contributions are welcome via GitHub Issues.

Fast counterfactual estimators for panel data in Python.

A high-performance reimplementation of the R fect package (Liu, Wang & Xu, 2024, AJPS), featuring GPU acceleration, parallel computing, and Polars data handling.

Installation

pip install pyfector

For GPU support:

pip install pyfector[gpu]

Quick Start

import pyfector

result = pyfector.fect(
    data=df,                    # Polars or Pandas DataFrame
    Y="outcome",                # outcome column
    D="treatment",              # binary treatment indicator
    index=("unit_id", "year"),  # (unit, time) columns
    X=["gdp", "population"],   # covariates (optional)
    method="ife",               # "fe", "ife", "mc", "cfe"
    r=(0, 5),                   # cross-validate over 0..5 factors
    se=True,                    # bootstrap standard errors
    nboots=500,                 # bootstrap replications
    device="cpu",               # "cpu" or "gpu"
    n_jobs=4,                   # parallel workers
    seed=42,                    # full reproducibility
)

# Results
print(result.summary())

# Plotting (gap plot, status heatmap, etc.)
result.plot(kind="gap")
result.plot(kind="status")

# Diagnostic tests
diag = result.diagnose(f_threshold=0.5, tost_threshold=0.36, loo=True)
print(diag.summary())

Methods

Method	Description	Key Parameter	When to Use
`"fe"`	Fixed effects counterfactual	`force`	Parallel trends holds
`"ife"`	Interactive fixed effects	`r` (number of factors)	Unobserved time-varying confounders
`"mc"`	Matrix completion	`lam` (nuclear norm penalty)	Alternative to IFE
`"cfe"`	Complex fixed effects	`Z`, `Q`	Unit/time-varying interactions

API Reference

`pyfector.fect()`

Main estimation function. Returns a FectResult object.

pyfector.fect(
    data,                           # DataFrame (Polars or Pandas)
    Y: str,                         # outcome column name
    D: str,                         # treatment column name (0/1)
    index: tuple[str, str],         # (unit_id, time) column names
    X: list[str] = None,            # covariate column names
    W: str = None,                  # weight column name
    method: str = "ife",            # "fe", "ife", "mc", "cfe", "both"
    force: str = "two-way",         # "none", "unit", "time", "two-way"
    r: int | tuple = 0,             # factors; tuple (min, max) for CV
    lam: float = None,              # lambda for MC; None = auto CV
    nlambda: int = 10,              # lambda grid size for MC CV
    CV: bool = True,                # cross-validate r or lambda
    k: int = 10,                    # CV folds
    cv_prop: float = 0.1,           # fraction of control obs to mask
    criterion: str = "mspe",        # CV criterion: "mspe", "gmspe", "mad"
    se: bool = False,               # compute standard errors
    vartype: str = "bootstrap",     # "bootstrap" or "jackknife"
    nboots: int = 200,              # bootstrap replications
    alpha: float = 0.05,            # significance level
    tol: float = 1e-7,              # convergence tolerance
    max_iter: int = 5000,           # max EM iterations
    min_T0: int = 1,               # min pre-treatment periods per unit
    normalize: bool = False,        # normalize outcome by SD
    device: str = "cpu",            # "cpu" or "gpu"
    n_jobs: int = 1,                # parallel workers
    seed: int = None,               # random seed
)

`FectResult` object

Attribute	Type	Description
`att_avg`	`float`	Overall average treatment effect on the treated
`att_avg_unit`	`float`	Unit-averaged ATT
`att_on`	`ndarray`	Dynamic ATT by relative time to treatment
`time_on`	`ndarray`	Relative time indices
`count_on`	`ndarray`	Observation counts per relative time
`beta`	`ndarray`	Covariate coefficients
`Y_ct`	`ndarray`	T x N counterfactual outcome matrix
`eff`	`ndarray`	T x N treatment effect matrix
`factors`	`ndarray`	T x r estimated factors (IFE only)
`loadings`	`ndarray`	N x r estimated loadings (IFE only)
`sigma2`	`float`	Error variance estimate
`r_cv`	`int`	CV-selected number of factors
`lambda_cv`	`float`	CV-selected lambda
`inference`	`InferenceResult`	Bootstrap/jackknife results (if `se=True`)

Methods:

result.summary() — formatted summary table
result.plot(kind="gap") — dynamic effects plot
result.plot(kind="status") — treatment status heatmap
result.plot(kind="factors") — latent factors
result.plot(kind="counterfactual", units=[1, 5]) — actual vs counterfactual
result.diagnose(...) — run diagnostic tests

`InferenceResult` (when `se=True`)

Attribute	Description
`att_avg_se`	Standard error of overall ATT
`att_avg_ci`	95% confidence interval (tuple)
`att_avg_pval`	p-value for H0: ATT=0
`att_on_se`	Per-period standard errors
`att_on_ci_lower`	Per-period CI lower bounds
`att_on_ci_upper`	Per-period CI upper bounds
`att_on_pval`	Per-period p-values
`att_avg_boot`	Bootstrap distribution of overall ATT
`att_on_boot`	Bootstrap distribution of dynamic ATTs

Diagnostic Tests

diag = result.diagnose(
    f_threshold=0.5,         # equivalence F-test threshold
    tost_threshold=0.36,     # TOST equivalence bound
    placebo_period=(-5, -1), # placebo test window
    loo=True,                # leave-one-period-out
)
print(diag.summary())

Test	What it Tests	Key Output
Pre-trend F-test	Joint significance of pre-treatment ATTs	`f_stat`, `f_pval`
Equivalence F-test	Pre-trends within equivalence bounds	`equiv_f_pval`
TOST	Per-period equivalence	`tost_pvals`
Placebo test	No effect in specified pre-period window	`placebo_att`, `placebo_pval`
Carryover test	No lingering effect after treatment ends	`carryover_att`, `carryover_pval`
Leave-one-out	Sensitivity of ATT to dropping periods	`loo_max_change`

Validation Against R fect

pyfector is validated against the original R fect package on identical simulated data. The table below shows exact comparison results (N=200, T=50):

Point Estimates

Scenario	pyfector ATT	R fect ATT	Difference	True ATT
FE (no factors)	4.995583	4.995640	-0.000057	5.0
FE + covariates	4.975683	4.975809	-0.000126	5.0
IFE r=2	3.010223	3.013046	-0.002822	3.0
IFE r=2 + covariates	2.993155	2.996099	-0.002944	3.0
MC lambda=0.01	3.176671	3.176721	-0.000050	3.0

Point estimates agree to 4-6 decimal places for FE and MC. IFE differences (~0.003) are due to different SVD implementations converging to slightly different factor rotations — both are equally valid and equally close to the true ATT.

Standard Errors (500 bootstrap replications)

Scenario	pyfector SE	R fect SE	Ratio
FE (500 boots)	0.020011	0.021291	0.94
IFE r=2 (500 boots)	0.017128	0.018382	0.93

SEs agree within 7% despite using different random draws and different SVD implementations. Per-period SE ratios range from 0.7 to 1.25.

Covariate Coefficients

Covariate	pyfector	R fect	Difference
X1	0.817041	0.817042	-0.000001
X2	1.173748	1.173747	+0.000000
X3	1.857722	1.857723	-0.000001

Coefficients agree to 6 decimal places.

Performance

The key bottleneck in R fect is full SVD inside the EM loop: O(NT min(N,T)) per iteration. pyfector uses randomized truncated SVD for large matrices: O(NTr) where r << min(N,T).

Scenario	pyfector	R fect	Speedup
FE, N=200 T=50	0.01s	0.05s	5x
IFE, N=200 T=50	0.08s	0.06s	0.7x
FE + 500 boots, N=200 T=50	0.54s	5.52s	10x
IFE + 500 boots, N=200 T=50	3.61s	22.83s	6x
IFE, N=1000 T=50	0.17s	0.17s	1x

The main speedup comes from bootstrap/CV parallelization (n_jobs) and will increase further with GPU acceleration on large panels.

Design Principles

Reproducible: every random operation is seeded; same seed = identical results
Idempotent: calling fect() twice with the same inputs produces the same output
GPU-optional: toggle device="gpu" for CuPy acceleration (no code changes needed)
Parallel: CV folds and bootstrap replications run in parallel via n_jobs
Polars-first: native Polars support; Pandas also accepted

References

Liu, L., Wang, Y., & Xu, Y. (2024). A Practical Guide to Counterfactual Estimators for Causal Inference with Time-Series Cross-Sectional Data. American Journal of Political Science, 68(1), 160-176.
Xu, Y. (2017). Generalized Synthetic Control Method. Political Analysis, 25(1), 57-76.
Athey, S., et al. (2021). Matrix Completion Methods for Causal Panel Data Models. JASA, 116(536), 1716-1730.
Bai, J. (2009). Panel Data Models with Interactive Fixed Effects. Econometrica, 77(4), 1229-1279.

License

MIT

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.2.0

May 7, 2026

0.1.6

May 4, 2026

0.1.5

May 1, 2026

0.1.4

May 1, 2026

0.1.3

Apr 9, 2026

0.1.2

Apr 9, 2026

0.1.1

Apr 9, 2026

This version

0.1.0

Mar 29, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyfector-0.1.0.tar.gz (1.1 MB view details)

Uploaded Mar 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pyfector-0.1.0-py3-none-any.whl (36.8 kB view details)

Uploaded Mar 29, 2026 Python 3

File details

Details for the file pyfector-0.1.0.tar.gz.

File metadata

Download URL: pyfector-0.1.0.tar.gz
Upload date: Mar 29, 2026
Size: 1.1 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pyfector-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`5ee74362aec12f8ee1c7ce744f30818226fdaa04ddfa6f4abca02a7c17b967b8`
MD5	`4eeeece0414a904dd9de879f28b4fc34`
BLAKE2b-256	`cc54399cd89da42eaf6ad74a1df518cef3f521a43c7c39b3a46ace6749e8d80a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pyfector-0.1.0.tar.gz:

Publisher: publish.yml on AlanHuang99/pyfector

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pyfector-0.1.0.tar.gz
- Subject digest: 5ee74362aec12f8ee1c7ce744f30818226fdaa04ddfa6f4abca02a7c17b967b8
- Sigstore transparency entry: 1195582680
- Sigstore integration time: Mar 29, 2026
Source repository:
- Permalink: AlanHuang99/pyfector@1e5ee35532620fab4a4f03963a136b11901624f1
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/AlanHuang99
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@1e5ee35532620fab4a4f03963a136b11901624f1
- Trigger Event: release

File details

Details for the file pyfector-0.1.0-py3-none-any.whl.

File metadata

Download URL: pyfector-0.1.0-py3-none-any.whl
Upload date: Mar 29, 2026
Size: 36.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pyfector-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0d7915d241ba78742940da35c091615551ef0aceeb91d88f0e2c86382aa72ae5`
MD5	`3c5b2058515f476ed4149f5f83e397b3`
BLAKE2b-256	`5a199123e1235e69d1760645aa4be3c56ffc3c824a1e8e05654a8e0d32ab986b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pyfector-0.1.0-py3-none-any.whl:

Publisher: publish.yml on AlanHuang99/pyfector

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pyfector-0.1.0-py3-none-any.whl
- Subject digest: 0d7915d241ba78742940da35c091615551ef0aceeb91d88f0e2c86382aa72ae5
- Sigstore transparency entry: 1195582754
- Sigstore integration time: Mar 29, 2026
Source repository:
- Permalink: AlanHuang99/pyfector@1e5ee35532620fab4a4f03963a136b11901624f1
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/AlanHuang99
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@1e5ee35532620fab4a4f03963a136b11901624f1
- Trigger Event: release

pyfector 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

pyfector

Installation

Quick Start

Methods

API Reference

pyfector.fect()

FectResult object

InferenceResult (when se=True)

Diagnostic Tests

Validation Against R fect

Point Estimates

Standard Errors (500 bootstrap replications)

Covariate Coefficients

Performance

Design Principles

References

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

`pyfector.fect()`

`FectResult` object

`InferenceResult` (when `se=True`)