Skip to main content

Lightweight drift and anomaly monitoring for production ML models.

Project description

canary-ml

Drop-in drift and anomaly monitoring for production ML models.

PyPI Python License: MIT Tests

One line wraps your model. Every .predict() call logs drift metrics, detects anomalies, and fires an alert when something shifts. Monitoring runs in a background thread — your inference latency is unaffected. No infrastructure required.

Project page · Guide & manual · Live demo


Install

pip install "canary-ml[keras]"

Keras/TensorFlow support included. For a minimal install without Keras:

pip install canary-ml

Requires Python 3.9+. Core dependencies: numpy, scipy, scikit-learn, rich.


Quickstart

from canary_ml import ModelMonitor

monitor = ModelMonitor(
    model=your_model,           # any sklearn-compatible model
    reference_data=X_train,     # baseline distribution
    alert_threshold=0.2,        # PSI threshold for alerts
    log_path="./canary_logs",
    verbose=True,
)

# Drop-in replacement — monitoring runs in the background
predictions = monitor.predict(X_new)

# Inspect the latest report
report = monitor.get_report()
print(report.summary())
# DriftReport | psi=0.41 | features_drifted=3/8 | anomaly_rate=3.2% | ALERT

# Launch the live dashboard
monitor.serve_dashboard(port=8501)
# → http://localhost:8501

What it monitors

  • PSI — global distribution shift. < 0.1 stable · 0.1–0.2 moderate · > 0.2 alert. Requires ≥ 200 samples per batch; use drift_detected (KS-based) for smaller batches.
  • KS test — per-feature Kolmogorov-Smirnov (continuous features, p < 0.05 = drift). Sample-size–corrected.
  • Chi² test — per-feature chi-squared (categorical features, ≤ 20 unique values).
  • Anomaly detection — ensemble of Isolation Forest + z-score (|z| > 3).
  • Confidence estimate — label-free accuracy proxy from predicted probabilities. Accurate when probabilities are well-calibrated; overestimates if the model is overconfident.

Alert callback

def my_alert(report):
    send_slack(f"Drift alert: PSI={report.psi_score:.2f}")

monitor = ModelMonitor(..., on_alert=my_alert)

Dashboard

monitor.serve_dashboard(port=8501)

Stdlib HTTP server, no extra dependencies. Auto-refreshes every 5 seconds. Can also run standalone:

python -m canary_ml.server ./canary_logs 8501

API reference

ModelMonitor

ModelMonitor(
    model,                      # sklearn-compatible model with .predict()
    reference_data,             # np.ndarray or pd.DataFrame, shape (n, features)
    alert_threshold=0.2,        # PSI threshold for drift alert
    performance_threshold=0.05, # accuracy drop (pp) below reference that fires a perf alert
    anomaly_contamination=0.05, # expected fraction of anomalies; alert fires at 3×
    categorical_threshold=20,   # max unique values for a feature to be treated as categorical
    store_samples=True,         # set False to skip storing raw feature rows (PII-sensitive envs)
    log_path="./canary_logs",
    verbose=False,
    on_alert=None,              # callable(DriftReport) fired on alert
)
Method Returns Description
.predict(X) same as model Runs model; monitoring queued in background thread
.get_report() DriftReport | None Latest monitoring report
.serve_dashboard(port=8501) Starts dashboard server in background thread

DriftReport

Attribute Type Description
psi_score float Global PSI vs reference
drift_detected bool True if any feature's KS/chi² p < 0.05 (soft warning)
ks_results dict Per-feature {statistic, p_value, drifted}
features_drifted int Count of features with p < 0.05 (computed property)
anomaly_rate float Fraction of samples flagged as anomalies
alert_triggered bool True if PSI > threshold, anomaly rate is high, or performance drops
alert_reasons list[str] Which conditions fired: "drift", "anomaly", "performance"
estimated_accuracy float | None Confidence estimate; None if no predict_proba
reference_accuracy float | None Confidence estimate on reference data
performance_delta float | None estimated_accuracy − reference_accuracy
performance_alert bool True if delta < −performance_threshold
timestamp str ISO 8601

DriftReport is not directly JSON-serialisable. Use report.to_dict() for logging or json.dumps(report.to_dict()). Dict-style access (report["psi_score"]) is also supported.


Testing

pip install -e ".[dev]"
pytest                        # 44 tests
pytest --cov=canary_ml

License

MIT © Aitor Bazo

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

canary_ml-1.2.1.tar.gz (31.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

canary_ml-1.2.1-py3-none-any.whl (26.9 kB view details)

Uploaded Python 3

File details

Details for the file canary_ml-1.2.1.tar.gz.

File metadata

  • Download URL: canary_ml-1.2.1.tar.gz
  • Upload date:
  • Size: 31.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for canary_ml-1.2.1.tar.gz
Algorithm Hash digest
SHA256 72b1897986c8bd8735b8440ea44f5abd58227de5aca2c1015e134f682edc44a4
MD5 ebd11e189fddaf890c74887d63a64547
BLAKE2b-256 ada0c3e02048bac45a98cf1fae6874f858fbdb51e14e530a0adea2d812b3f3a9

See more details on using hashes here.

File details

Details for the file canary_ml-1.2.1-py3-none-any.whl.

File metadata

  • Download URL: canary_ml-1.2.1-py3-none-any.whl
  • Upload date:
  • Size: 26.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for canary_ml-1.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 7f22e064f7441dfc06a5da380ee6a27831487ba8a97f7e36038d07874b262820
MD5 7cb917e4e884ddbc36b90c4249b78dd6
BLAKE2b-256 2130626fb053366bf12e7c6361bb14f6afca7fe9a948af7ee169a32778ccdc6f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page