Conformal Anomaly Detection
Project description
Conformal Anomaly Detection
Thresholds for anomaly detection are often arbitrary and lack theoretical guarantees. nonconform wraps anomaly detectors (from PyOD, scikit-learn, or custom implementations) and transforms their raw anomaly scores into conformal p-values. Under the assumptions of the selected method, these p-values support controlled false discovery rate (FDR) workflows with explicit, assumption-dependent guarantees.
Note: The methods in nonconform assume that training and test data are exchangeable. The package is therefore not suited for spatial or temporal autocorrelation unless such dependencies are explicitly handled in preprocessing or model design.
Guarantee scope: nonconform calibrates detector scores; it does not make an unsuitable detector or mismatched calibration set valid. Standard conformal claims require exchangeability. Weighted workflows require plausible covariate shift, support overlap, and reliable weights. FDR claims require valid p-values and the relevant multiple-testing assumptions.
Feature Overview
| Need | nonconform Functionality | Start Here |
|---|---|---|
| Principled anomaly decisions | ConformalDetector.select(...) combines conformal p-values with FDR-controlled selection |
FDR Control |
| Flexible calibration strategies | Split, CrossValidation, and JackknifeBootstrap for different data/compute tradeoffs |
Conformalization Strategies |
| Covariate-shift aware workflows | Weighted conformal prediction with density-ratio estimators and weighted FDR control (requires sufficient calibration/test support overlap) | Weighted Conformal |
| Rich p-value estimation | Empirical, probabilistic KDE, and conditional calibration estimators | Common Workflows |
| Sequential monitoring | Exchangeability martingales (PowerMartingale, SimpleMixtureMartingale, SimpleJumperMartingale) |
Exchangeability Martingales |
| Custom detector integration | Support for protocol-compliant detectors (with strict-inductive caveats for blocked PyOD models) | Detector Compatibility |
Citation
If you use nonconform in academic work, reports, or other published material, please cite the accompanying paper:
@misc{hennhoefer2026,
title={Conformal Anomaly Detection in Python: Moving Beyond Heuristic Thresholds with 'nonconform'},
author={Oliver Hennhöfer and Maximilian Kirsch and Christine Preisach},
year={2026},
eprint={2605.13642},
archivePrefix={arXiv},
primaryClass={stat.ML},
url={https://arxiv.org/abs/2605.13642},
}
Getting Started
Installation via PyPI:
pip install nonconform
Note: The example below uses an external dataset API. Install with
pip install oddballorpip install "nonconform[data]".
Classical Conformal Workflow
Example: Isolation Forest on the Shuttle benchmark. This trains a base detector, calibrates conformal scores, then applies FDR-controlled selection through select(...). Raw p-values remain available via detector.last_result.p_values.
from pyod.models.iforest import IForest
from nonconform import ConformalDetector, Split
from nonconform.metrics import false_discovery_rate, statistical_power
from oddball import Dataset, load
x_train, x_test, y_test = load(Dataset.SHUTTLE, setup=True, seed=42)
detector = ConformalDetector(
detector=IForest(),
strategy=Split(n_calib=1_000),
seed=42,
)
detector.fit(x_train)
decisions = detector.select(x_test, alpha=0.2)
print(f"Empirical FDR: {false_discovery_rate(y_test, decisions)}")
print(f"Statistical Power: {statistical_power(y_test, decisions)}")
Output:
Empirical FDR: 0.18
Statistical Power: 0.99
Advanced Methods
nonconform includes advanced workflows:
- Weighted Conformal Prediction (
weight_estimator=...): reweights calibration evidence for covariate shift settings where test and calibration distributions differ, assuming enough support overlap between calibration and test features. - Exchangeability Martingales (
nonconform.martingales): sequential evidence monitoring over conformal p-value streams.
Weighted Conformal Setup:
from pyod.models.iforest import IForest
from nonconform import ConformalDetector, Split, logistic_weight_estimator
detector = ConformalDetector(
detector=IForest(),
strategy=Split(n_calib=1_000),
weight_estimator=logistic_weight_estimator(),
seed=42,
)
Note: In weighted mode,
ConformalDetector.select(...)dispatches weighted FDR control automatically.
Martingale Setup for Sequential Monitoring:
from nonconform.martingales import AlarmConfig, PowerMartingale
alpha = 0.01
martingale = PowerMartingale(
epsilon=0.5,
alarm_config=AlarmConfig(
ville_threshold=1 / alpha,
restarted_ville_threshold=1 / alpha,
),
)
state = martingale.update(p_t)
states = martingale.update_many(p_values_chunk)
Note:
update(...)already validates and normalizes numeric scalar p-values, so an explicitfloat(...)cast is optional. Useville_thresholdorrestarted_ville_thresholdwhen you need an anytime false-alarm bound for a monitored stream. CUSUM and Shiryaev-Roberts thresholds are change-evidence triggers for diagnosing possible stream changes; they need separate calibration and do not replace cross-hypothesis FDR control. See Exchangeability Martingales for threshold interpretation details.
Beyond Static Data
While primarily designed for static (single-batch) workflows, optional onlinefdr integration supports streaming FDR procedures.
Custom Detectors
Any detector implementing the AnomalyDetector protocol can be integrated with nonconform:
from typing import Self
import numpy as np
class MyDetector:
def fit(self, X, y=None) -> Self: ...
def decision_function(self, X) -> np.ndarray: ... # higher = more anomalous
def get_params(self, deep=True) -> dict: ...
def set_params(self, **params) -> Self: ...
For custom detectors, either set score_polarity explicitly ("higher_is_anomalous" in most cases), or omit it to use the default score-polarity policy. Use score_polarity="auto" only when you want strict detector-family validation.
For strict inductive conformal/FDR pipelines, avoid batch-adaptive PyOD
detectors with non-frozen score maps (for example ECOD and COPOD, which are
blocked at runtime).
See Detector Compatibility for details and examples.
Optional Dependencies
For additional features, you might need optional dependencies:
pip install nonconform[pyod]- Includes PyOD anomaly detection librarypip install nonconform[data]- Includes oddball for loading benchmark datasetspip install nonconform[fdr]- Includes advanced FDR control methods (online-fdr)pip install nonconform[probabilistic]- Includes KDEpy and Optuna for probabilistic approximationpip install nonconform[all]- Includes all optional dependencies
Please refer to the pyproject.toml for details.
Contact
Bug reporting: https://github.com/OliverHennhoefer/nonconform/issues
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file nonconform-1.0.1.tar.gz.
File metadata
- Download URL: nonconform-1.0.1.tar.gz
- Upload date:
- Size: 55.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.9 {"installer":{"name":"uv","version":"0.9.9"},"python":null,"implementation":{"name":null,"version":null},"distro":null,"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7df54d0f6dc897f92ff576bd197893122193b55a2654634362129909876d10d1
|
|
| MD5 |
55ed5298f4ee273a29d60be0b3d6feda
|
|
| BLAKE2b-256 |
e0e3c2527998ad3526739a6cddbf7abeb05b5ef81abace4ee9988eacab871059
|
File details
Details for the file nonconform-1.0.1-py3-none-any.whl.
File metadata
- Download URL: nonconform-1.0.1-py3-none-any.whl
- Upload date:
- Size: 61.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.9 {"installer":{"name":"uv","version":"0.9.9"},"python":null,"implementation":{"name":null,"version":null},"distro":null,"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
691f5e315bf149e007636dba8421ced44978fe0b6735e54418c8a8437d9a58c1
|
|
| MD5 |
08dfb32cf0e32e0948ba9d87a713c082
|
|
| BLAKE2b-256 |
94f4b4cf9edb1f2b1f3970b9b11943dcf5933c28487c0e08544b88a436b76d77
|