Relationship-aware counterfactual fairness testing

relfair

Relationship-aware counterfactual fairness testing.

Naive fairness tests flip one protected attribute (e.g. sex: Male → Female) and ask whether the prediction changes. They miss the proxies: a model that has learned relationship = Husband ⇒ Male will read the flipped row as off-distribution and silently absorb the bias. relfair propagates the intervention through the causal graph (Husband → Wife, occupation, household role) so that counterfactuals stay on the data manifold. Result: roughly 2.7–4.6× more discrimination detected across Adult, ACS, and German Credit (see the table under "Why this exists").

The library also ships an NYC Local Law 144 audit engine: selection rates, impact ratios, the four-fifths rule, intersectional sex × race cross-tabs with bootstrap CIs, and DCWP-compliant PDF/JSON reports.

Status: alpha (0.1.0). The counterfactual engine and LL 144 metrics are stable and covered by 62 tests. The public API may still change before 1.0.

Why this exists

Dataset	Naive flip rate	Rel-aware flip rate	Detection lift
Adult — Husband rows (n=3,682)	7.0%	24.3%	+17.2 pp
ACS Income CA (n=5,241)	7.5%	34.5%	+27.0 pp (4.6×)
German Credit (n=144)	4.9%	13.2%	+8.3 pp

Reproduce: cd experiments/<dataset> && python run.py.
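
To make the table's columns concrete: a flip rate is the share of rows whose prediction changes under the counterfactual, and the detection lift is the gap between the two rates, in percentage points and as a ratio. The sketch below is plain Python written for illustration. flip_rate_pct and detection_lift are hypothetical helpers, not relfair's flip_rate, whose signature is not shown here.

```python
def flip_rate_pct(original_preds, counterfactual_preds):
    """Percentage of rows whose prediction differs after the intervention."""
    if len(original_preds) != len(counterfactual_preds):
        raise ValueError("prediction lists must have the same length")
    if not original_preds:
        raise ValueError("need at least one row")
    flips = sum(o != c for o, c in zip(original_preds, counterfactual_preds))
    return 100.0 * flips / len(original_preds)


def detection_lift(naive_pct, relaware_pct):
    """Return (lift in percentage points, ratio of rel-aware to naive rate)."""
    return relaware_pct - naive_pct, relaware_pct / naive_pct


# ACS Income CA row from the table above: 7.5% naive, 34.5% relationship-aware.
lift_pp, ratio = detection_lift(7.5, 34.5)
print(f"+{lift_pp:.1f} pp ({ratio:.1f}x)")  # +27.0 pp (4.6x)
```

Applied to the other rows, the same arithmetic gives about 3.5× for Adult and about 2.7× for German Credit. The Adult row's rounded rates differ by 17.3 pp rather than the listed 17.2 pp; a gap of that size is what one would expect if the lift was computed from unrounded rates, though the source does not say so.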

A separate finding: detect_constraint_violations() has 100% recall for hard-rule incoherences. A row with sex=Female, relationship=Husband never occurs in the training data, so it is off-manifold by definition. IsolationForest, by contrast, flags ~7.5% of rows at a uniform rate whether or not a violation is present, so it cannot localise attribute-pair contradictions. Use the constraint check for hard rules; reserve the IsolationForest filter for soft distributional drift.
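
The idea behind a hard-rule check can be shown with a small stand-alone example: a row either matches a forbidden attribute combination or it does not, so recall on such contradictions is exact by construction. This is a sketch under stated assumptions, not the implementation of detect_constraint_violations. FORBIDDEN and find_violations are hypothetical names, and the Male/Wife pair is added only for symmetry.

```python
# Hypothetical forbidden attribute combinations (illustrative only).
FORBIDDEN = [
    {"sex": "Female", "relationship": "Husband"},
    {"sex": "Male", "relationship": "Wife"},
]


def find_violations(rows, forbidden=FORBIDDEN):
    """Return indices of rows that match any forbidden combination exactly."""
    hits = []
    for i, row in enumerate(rows):
        if any(all(row.get(k) == v for k, v in combo.items()) for combo in forbidden):
            hits.append(i)
    return hits


rows = [
    {"sex": "Female", "relationship": "Husband"},    # naive flip: contradiction
    {"sex": "Female", "relationship": "Wife"},       # coherent
    {"sex": "Female", "relationship": "Own-child"},  # coherent
]
print(find_violations(rows))  # [0]
```

An anomaly detector scores whole rows against a learned density, so it has no notion of a specific attribute pair being contradictory; a deterministic lookup like this one does.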

Install

pip install relfair                       # core
pip install "relfair[cli,report]"         # + CLI + PDF reports
pip install "relfair[experiments]"        # + benchmark deps
pip install "relfair[all]"                # everything

PDF generation uses WeasyPrint, which needs the GTK3 runtime on Windows. The CLI's --html flag produces a report without native dependencies. See the WeasyPrint installation documentation.

Quick start — counterfactual engine

from relfair.graph import DependencyGraph
from relfair.mechanisms import LearnedMechanisms
from relfair.counterfactual import batch_counterfactuals, detect_constraint_violations

G = DependencyGraph.from_edges([
    ("sex", "relationship"),
    ("sex", "occupation"),
    ("age", "occupation"),
])

# Conditional transition: flip Husband → Wife only when current value is Husband.
G.add_hard_rule("relationship", when={"sex": "Female"}, value="Wife",      from_val="Husband")
G.add_hard_rule("relationship", when={"sex": "Female"}, value="Own-child", from_val="Own-child")

mechs = LearnedMechanisms(G)
mechs.fit(training_df, cat_cols=["sex", "relationship", "occupation"])

naive    = batch_counterfactuals(test_df, "sex", "Female", G, mechs, naive=True,  seed=42)
relaware = batch_counterfactuals(test_df, "sex", "Female", G, mechs, naive=False, seed=42)

violated = detect_constraint_violations(naive, G)   # definitionally off-manifold

from_val is load-bearing. Without it, a rule "when sex=Female → relationship=Wife" overwrites every relationship value with Wife, whatever it was before. Always add passthrough rules (e.g. Own-child → Own-child) for values that should be preserved.
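
Why from_val matters can be seen in a toy rule applier. This is a minimal sketch, assuming a rule fires when its "when" condition holds and, if from_val is given, only when the target column currently holds that value. apply_rules and the dict-based rule format are hypothetical and are not how DependencyGraph stores or evaluates rules.

```python
def apply_rules(row, rules):
    """Apply each rule in order to a copy of row and return the copy."""
    out = dict(row)
    for rule in rules:
        cond_ok = all(out.get(k) == v for k, v in rule["when"].items())
        from_val = rule.get("from_val")  # None means "fire for any current value"
        from_ok = from_val is None or out.get(rule["target"]) == from_val
        if cond_ok and from_ok:
            out[rule["target"]] = rule["value"]
    return out


unconditional = [{"target": "relationship", "when": {"sex": "Female"}, "value": "Wife"}]
conditional = [{"target": "relationship", "when": {"sex": "Female"},
                "value": "Wife", "from_val": "Husband"}]

child = {"sex": "Female", "relationship": "Own-child"}
husband = {"sex": "Female", "relationship": "Husband"}

print(apply_rules(child, unconditional)["relationship"])  # Wife (overwritten)
print(apply_rules(child, conditional)["relationship"])    # Own-child (preserved)
print(apply_rules(husband, conditional)["relationship"])  # Wife (intended flip)
```

In this toy model a passthrough rule is a no-op; in relfair the README asks for one explicitly, so add it as documented rather than relying on the behaviour sketched here.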

Quick start — LL 144 audit CLI

relfair audit predictions.csv \
  --outcome hired \
  --sex   sex_column \
  --race  race_column \
  --pdf   report.pdf \
  --json  report.json \
  --employer    "Acme Corp" \
  --aedt        "Resume Ranker v4.2" \
  --auditor     "Archit Rathod" \
  --cover-period "Jan 1 2025 – Dec 31 2025"

Produces: a console table (rates, ratios, four-fifths flags, bootstrap CIs), a DCWP-style PDF, and machine-readable JSON with all metrics, CIs, p-values, and flags.
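
For readers new to the metrics in that table: a group's selection rate is its share of positive outcomes, its impact ratio is that rate divided by the highest group rate, and the four-fifths rule flags ratios below 0.8. The sketch below works on plain dicts and is illustrative only. impact_ratios is a hypothetical helper, not part of relfair's API, and the data is made up.

```python
from collections import Counter


def impact_ratios(records, group_key, outcome_key):
    """Per-group selection rate, impact ratio, and four-fifths flag."""
    totals, selected = Counter(), Counter()
    for rec in records:
        totals[rec[group_key]] += 1
        selected[rec[group_key]] += int(bool(rec[outcome_key]))
    rates = {g: selected[g] / totals[g] for g in totals}
    best = max(rates.values())
    if best == 0:
        raise ValueError("no group has any selections")
    return {
        g: {"rate": r, "ratio": r / best, "flag": r / best < 0.8}
        for g, r in rates.items()
    }


# Toy data: group A selects 5 of 10, group B selects 3 of 10.
records = (
    [{"sex": "A", "hired": 1}] * 5 + [{"sex": "A", "hired": 0}] * 5
    + [{"sex": "B", "hired": 1}] * 3 + [{"sex": "B", "hired": 0}] * 7
)
for group, stat in sorted(impact_ratios(records, "sex", "hired").items()):
    print(f"{group}: rate={stat['rate']:.2f} ratio={stat['ratio']:.2f} flag={stat['flag']}")
# A: rate=0.50 ratio=1.00 flag=False
# B: rate=0.30 ratio=0.60 flag=True
```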

Quick start — metrics API

from relfair.metrics import compute_ll144_metrics
from relfair.report   import write_json, write_pdf

result = compute_ll144_metrics(
    df,
    outcome_col="hired",
    outcome_type="binary",            # or "score"
    sex_col="sex",
    race_col="race",
    n_boot=2000,
    bootstrap_seed=42,
    meta={"employer": "Acme Corp", "auditor": "Archit Rathod"},
)

for group in result.by_race:
    print(f"{group.group}: ratio={group.ratio:.3f}  flag={group.four_fifths_flag}")

write_json(result, "report.json")
write_pdf(result,  "report.pdf")      # WeasyPrint + GTK3
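
The n_boot and bootstrap_seed arguments suggest a resampling-based confidence interval. One common way to build such an interval is the percentile bootstrap, sketched below for a ratio of two selection rates. This is an assumption about the general technique, not a description of bootstrap_impact_ratio_cis; bootstrap_ratio_ci and its parameters are hypothetical.

```python
import random


def bootstrap_ratio_ci(group_a, group_b, n_boot=2000, seed=42, alpha=0.05):
    """Percentile bootstrap CI for rate(group_a) / rate(group_b).

    group_a and group_b are lists of 0/1 outcomes; each group is
    resampled with replacement independently.
    """
    rng = random.Random(seed)  # private RNG; global random state is untouched
    ratios = []
    for _ in range(n_boot):
        a = rng.choices(group_a, k=len(group_a))
        b = rng.choices(group_b, k=len(group_b))
        rate_b = sum(b) / len(b)
        if rate_b == 0:
            continue  # ratio undefined for this resample; skip it
        ratios.append((sum(a) / len(a)) / rate_b)
    if not ratios:
        raise ValueError("reference group had no selections in any resample")
    ratios.sort()
    lo = ratios[int((alpha / 2) * (len(ratios) - 1))]
    hi = ratios[int((1 - alpha / 2) * (len(ratios) - 1))]
    return lo, hi


# Toy data: 30/100 selected in group A, 50/100 in reference group B.
group_a = [1] * 30 + [0] * 70
group_b = [1] * 50 + [0] * 50
lo, hi = bootstrap_ratio_ci(group_a, group_b)
print(f"impact ratio 0.60, 95% CI [{lo:.2f}, {hi:.2f}]")
```

Fixing the seed makes the interval reproducible run to run; a wide interval that straddles 0.8 is a signal that the four-fifths flag rests on too few observations.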

Module map

relfair/
  graph.py           DependencyGraph — DAG, topological ops, hard rules (incl. from_val)
  mechanisms.py      LearnedMechanisms — ColumnTransformer + HistGBM per non-root node
  counterfactual.py  batch_counterfactuals, detect_constraint_violations, flip_rate
  manifold.py        ManifoldFilter — IsolationForest for soft distributional violations
  discovery.py       propose_graph — PC/GES/NOTEARS structure discovery (draft only)
  metrics/
    ll144.py         compute_ll144_metrics → LL144Result (GroupStat, IntersectionalStat)
    stats.py         bootstrap_impact_ratio_cis, fisher_exact_pvalue, two_proportion_ztest
  report/
    pdf.py           render_html, render_pdf, write_pdf
    json_export.py   to_dict, to_json, write_json
    templates/
      ll144.html.j2  DCWP-style Jinja2 template
  cli.py             Click CLI — `relfair audit ...`

experiments/
  adult/             Adult/Census — Husband/Wife constraint, +17.2 pp
  folktables/        ACS Income CA — occupation proxy, +27.0 pp (4.6×)
  german_credit/     personal_status hard rule, +8.3 pp

tests/               62 tests (pytest, ~32 s)
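
The module map lists a two-proportion z-test among the statistics helpers. For orientation, the textbook pooled version of that test fits in a few lines of standard-library Python. The sketch below is that textbook form; pooled_ztest is a hypothetical name, and relfair's two_proportion_ztest may differ in signature and details.

```python
import math


def pooled_ztest(x1, n1, x2, n2):
    """Two-sided pooled z-test for H0: p1 == p2. Returns (z, p_value)."""
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    if se == 0:
        raise ValueError("standard error is zero; outcomes are all 0 or all 1")
    z = (p1 - p2) / se
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail area
    return z, p_value


# 50 of 100 selected in one group versus 30 of 100 in the other.
z, p = pooled_ztest(50, 100, 30, 100)
print(f"z={z:.3f} p={p:.4f}")
```

The normal approximation is unreliable for small cells, which is presumably why an exact test (fisher_exact_pvalue) sits alongside it in the same module.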

Limitations and assumptions

Group-level, not individual-level. relfair measures aggregate disparities under intervention. For per-row counterfactual claims you want dowhy.gcm or similar — a future experiment, not core.
Causal graph is an input, not a discovery. discovery.propose_graph() exists, but its output is a draft for human review. Hard rules and edges should always be validated by a domain expert before audit results are trusted.
Hard rules require care. Conditional transitions (from_val) only fire when the current value matches. Forgetting to add passthrough rules for values that should be preserved silently rewrites them. See graph.py docstrings.
Race categories are categorical labels you pass in. No BISG / probabilistic imputation; that's a different threat model.
Two protected attributes in the LL 144 path. Sex and race only, per the regulation. Generalising to additional axes is straightforward but not yet implemented.

Running tests

pip install -e ".[dev]"

python -m pytest tests/                          # full suite, ~32 s
python -m pytest tests/test_metrics.py -v        # single file
python -m pytest tests/ -k "test_from_val_rule"  # single test by name

CI runs the same suite on Python 3.10, 3.11, and 3.12.

Design constraints

No framework dependencies in the core. No FastAPI, no SQLAlchemy, no Neo4j driver — relfair is importable as a standalone research artifact.
Reproducibility. Every generation and sampling function accepts an explicit seed. No global RNG mutation.
Both code paths always present. batch_counterfactuals(..., naive=True/False) — the headline exhibit is naive vs. relationship-aware side-by-side.
cat_cols for float-coded categoricals. ACS encodes categories as floats (OCCP, MAR, SEX). Pass cat_cols=[...] to LearnedMechanisms.fit() so they're treated correctly.
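
The reproducibility constraint above (explicit seed, no global RNG mutation) corresponds to a simple pattern: build a private generator from the seed inside the function instead of calling random.seed(). A minimal sketch with the standard library follows; sample_rows is a hypothetical function used only to show the pattern.

```python
import random


def sample_rows(rows, k, seed):
    """Sample k rows without replacement using a private, seeded RNG."""
    rng = random.Random(seed)  # local generator; random.seed() is never called
    return rng.sample(rows, k)


state_before = random.getstate()
first = sample_rows(list(range(100)), 5, seed=42)
second = sample_rows(list(range(100)), 5, seed=42)

print(first == second)                     # True: same seed, same sample
print(random.getstate() == state_before)   # True: global state untouched
```

The same idea carries over to NumPy by passing a numpy.random.Generator (or a seed used to create one) rather than calling numpy.random.seed.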

Citation

Paper in preparation. Target venue: FAccT 2027 / AIES 2027.

@misc{rathod2026relfair,
  title  = {Relationship-Aware Counterfactual Fairness Testing},
  author = {Archit Rathod},
  year   = {2026},
  note   = {Preprint in preparation.}
}

Contributing

See CONTRIBUTING.md. Bug reports with minimal repros, new benchmark datasets, and documentation fixes are especially welcome.

License

Apache 2.0 — see LICENSE.

Project details

Release history

0.1.0 (this version), released May 30, 2026

Download files

Source Distribution

relfair-0.1.0.tar.gz (39.1 kB), uploaded May 30, 2026

Built Distribution

relfair-0.1.0-py3-none-any.whl (33.1 kB), uploaded May 30, 2026, Python 3

File details

Details for the file relfair-0.1.0.tar.gz.

File metadata

Download URL: relfair-0.1.0.tar.gz
Upload date: May 30, 2026
Size: 39.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for relfair-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`d8e0ddef46e7f0fe227b016dff9d876b6f7c8ef60c57059e9fcda935876267ee`
MD5	`c2140f823cdea43a9b302aeb550faead`
BLAKE2b-256	`20e57fce864739c570b5dbc3e95ea7331f2237c570eef8591870baa862d438a3`

Provenance

The following attestation bundles were made for relfair-0.1.0.tar.gz:

Publisher: release.yml on Archit1706/relfair

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: relfair-0.1.0.tar.gz
- Subject digest: d8e0ddef46e7f0fe227b016dff9d876b6f7c8ef60c57059e9fcda935876267ee
- Sigstore transparency entry: 1673460646
- Sigstore integration time: May 30, 2026
Source repository:
- Permalink: Archit1706/relfair@334f6af55e41b279a626d54e6cb83ddc145f68c6
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/Archit1706
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@334f6af55e41b279a626d54e6cb83ddc145f68c6
- Trigger Event: push

File details

Details for the file relfair-0.1.0-py3-none-any.whl.

File metadata

Download URL: relfair-0.1.0-py3-none-any.whl
Upload date: May 30, 2026
Size: 33.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for relfair-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5d1212f3d6ee3f51e64a490993d0e9c91160d624c2f515ecdd6c72ed7cd4d7f4`
MD5	`f1179342416a06d0f441cf66e9125429`
BLAKE2b-256	`c7d6fbefb6d122f38d7ac2ac261c3fd812c6a7576da34e809286793e1fcad93c`

Provenance

The following attestation bundles were made for relfair-0.1.0-py3-none-any.whl:

Publisher: release.yml on Archit1706/relfair

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: relfair-0.1.0-py3-none-any.whl
- Subject digest: 5d1212f3d6ee3f51e64a490993d0e9c91160d624c2f515ecdd6c72ed7cd4d7f4
- Sigstore transparency entry: 1673460670
- Sigstore integration time: May 30, 2026
Source repository:
- Permalink: Archit1706/relfair@334f6af55e41b279a626d54e6cb83ddc145f68c6
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/Archit1706
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@334f6af55e41b279a626d54e6cb83ddc145f68c6
- Trigger Event: push
