
DriftShield

Real-time behavioural drift detection for agentic AI systems

Your LangChain agent just called the same API 47 times. Your CrewAI crew burned £200 in tokens overnight. Your research agent started writing marketing copy instead of financial summaries.

You didn't find out until morning.

DriftShield catches this in real time. It wraps your existing agent, watches what it does, and pings you on Slack or Discord the moment something goes sideways. No dashboard. No cloud. No account to create. Just a Python library that runs alongside your agent.


What it actually does

DriftShield monitors three things:

Loop detection: Is your agent calling the same tool over and over? Or stuck in a cycle like search → format → search → format? DriftShield spots the pattern and alerts you before it eats your budget.
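Under the hood, the repeat-count half of this check can be sketched in a few lines. This is a simplified illustration, not DriftShield's actual implementation; detecting cycles like search → format → search → format needs a little more bookkeeping:

```python
from collections import Counter

def detect_action_loop(recent_actions, max_repeats=4):
    """Flag when one tool dominates the recent action window.

    `recent_actions` is the last N tool names in call order; if any
    single tool was called more than `max_repeats` times, report it.
    """
    if not recent_actions:
        return None
    tool, count = Counter(recent_actions).most_common(1)[0]
    if count > max_repeats:
        return {"tool_name": tool, "repeat_count": count}
    return None
```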

Goal drift: Is your agent still doing what you asked it to? DriftShield uses local embeddings (runs on your CPU, no API calls) to measure how far the agent's output has drifted from its original objective.
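Conceptually, this check is a cosine-similarity comparison between the embedding of the original goal and the embedding of the agent's latest output. A toy sketch with plain Python vectors (in DriftShield the vectors would come from a local sentence-transformers model; the function names here are illustrative):

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def goal_drifted(goal_vec, output_vec, threshold=0.5):
    """Report drift when the output embedding has moved too far from the goal."""
    return cosine_similarity(goal_vec, output_vec) < threshold
```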

Resource spikes: Is this run burning way more tokens or taking way longer than usual? DriftShield learns what "normal" looks like for your agent, then flags when things go abnormal.
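The spike check itself is simple statistics. A hedged sketch, assuming the baseline exposes a mean and standard deviation for each metric:

```python
def is_spike(value, baseline_mean, baseline_std, spike_multiplier=2.5):
    """Flag a run whose metric sits more than `spike_multiplier`
    standard deviations above the calibrated baseline."""
    if baseline_std == 0:
        # Degenerate baseline (every calibration run identical):
        # treat any increase as a spike.
        return value > baseline_mean
    return (value - baseline_mean) / baseline_std > spike_multiplier
```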

Everything stays on your machine. Traces go to a local SQLite file. Embeddings run on your CPU. The only thing that leaves your machine is the alert you choose to send to Slack/Discord.
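To make the storage model concrete, here is a rough sketch of what a local SQLite trace store could look like. The schema and function names are illustrative, not DriftShield's actual layout:

```python
import sqlite3
import time

def open_store(path="driftshield.db"):
    """Open (or create) a local SQLite trace store."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS traces (
               run_id TEXT, agent_id TEXT, tool_name TEXT,
               tokens INTEGER, output TEXT, ts REAL)"""
    )
    return conn

def record_trace(conn, run_id, agent_id, tool_name, tokens, output):
    """Append one tool-call trace; everything stays in the local file."""
    conn.execute(
        "INSERT INTO traces VALUES (?, ?, ?, ?, ?, ?)",
        (run_id, agent_id, tool_name, tokens, output, time.time()),
    )
    conn.commit()
```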


Get started

pip install driftshield

LangChain

from driftshield import DriftMonitor

monitor = DriftMonitor(
    agent_id="logistics-v2",
    alert_webhook="https://hooks.slack.com/...",
)

agent = monitor.wrap(existing_agent)
result = agent.invoke({"input": "optimise route for order #4821"})
# DriftShield is now watching. That's it.

CrewAI

from driftshield.crewai import DriftCrew

crew = DriftCrew(
    crew=existing_crew,
    agent_id="research-team-v1",
    alert_webhook="https://discord.com/api/webhooks/...",
)

result = crew.kickoff()

Works with any LLM

OpenAI, Anthropic, Groq, Ollama, local models: it doesn't matter. DriftShield only sees the traces (tool calls, token counts, outputs), not the model internals. Swap providers whenever you want.


How calibration works

For the first 30 runs (configurable), DriftShield quietly observes your agent and builds a baseline: average tokens per run, typical tool sequences, normal execution time. No alerts fire during this phase.

After that, it knows what "normal" looks like and starts flagging deviations. You can inspect the baseline anytime:

driftshield baseline my-agent

Tip: If 30 runs feels like a lot, you can lower calibration_runs or use a preset template. DriftShield still catches obvious problems (like 50 identical tool calls) even without a baseline, using absolute safety limits.
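For intuition, a baseline of this shape can be built from the calibration runs with basic statistics. The metric keys below are illustrative, not DriftShield's real schema:

```python
import statistics

def build_baseline(runs):
    """Summarise calibration runs into per-metric mean/std pairs
    that the spike detector can compare new runs against.

    `runs` is a list of per-run metric dicts, e.g.
    {"tokens": 1200, "duration_s": 14.2}.
    """
    tokens = [r["tokens"] for r in runs]
    durations = [r["duration_s"] for r in runs]
    return {
        "tokens_mean": statistics.mean(tokens),
        "tokens_std": statistics.pstdev(tokens),
        "duration_mean": statistics.mean(durations),
        "duration_std": statistics.pstdev(durations),
    }
```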


What an alert looks like

When drift hits your Slack/Discord, you get:

{
  "agent_id": "logistics-v2",
  "detector": "action_loop",
  "severity": "HIGH",
  "message": "Action loop: search_inventory called 6x in 45s",
  "suggested_action": "Check search_inventory input/output for stale data or error loops",
  "context": {
    "tool_name": "search_inventory",
    "repeat_count": 6,
    "recent_actions": ["search_inventory", "search_inventory", "search_inventory", "..."]
  }
}

Not just "something's wrong" — it tells you what happened, which detector caught it, and what to check first.
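If you'd rather consume these events yourself, turning one into a webhook message takes only a few lines. A sketch using the standard library (DriftShield itself ships httpx for this; the helper names here are illustrative):

```python
import json
import urllib.request

def format_alert(event):
    """Render a drift event dict (fields as in the sample payload above)
    as a one-line human-readable message."""
    return (
        f"[{event['severity']}] {event['agent_id']} "
        f"({event['detector']}): {event['message']} "
        f"| Next step: {event['suggested_action']}"
    )

def send_alert(webhook_url, event):
    """POST the formatted message as a Slack-style {"text": ...} payload."""
    body = json.dumps({"text": format_alert(event)}).encode()
    req = urllib.request.Request(
        webhook_url, data=body, headers={"Content-Type": "application/json"}
    )
    return urllib.request.urlopen(req)
```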


CLI

# What went wrong in the last 24 hours?
driftshield alerts --last 24h

# Show me exactly what my agent did on its last run
driftshield traces logistics-v2 --run latest

# What does "normal" look like for this agent?
driftshield baseline logistics-v2

# List recent runs
driftshield runs logistics-v2

Configuration

Everything's tuneable. Defaults are sensible, but you can adjust:

monitor = DriftMonitor(
    agent_id="my-agent",
    alert_webhook="https://hooks.slack.com/...",
    goal_description="Summarise financial reports",
    calibration_runs=30,         # runs before baseline kicks in
    loop_window=20,              # how many recent actions to check
    loop_max_repeats=4,          # repeated calls before flagging
    similarity_threshold=0.5,    # goal drift sensitivity (lower = stricter)
    spike_multiplier=2.5,        # how many std devs = a spike
    min_alert_severity="MED",    # ignore LOW severity events
    alert_cooldown=60.0,         # don't spam the same alert
)

Custom reactions

DriftShield alerts you by default, but you can also react programmatically:

def handle_drift(event):
    if event.severity.value == "CRITICAL":
        agent.stop()  # kill the run
        page_oncall()  # wake someone up

monitor.on_drift(handle_drift)

What this isn't

I want to be upfront about scope. DriftShield is v0.1, built by one person.

  • Not a full observability platform. No web dashboard, no hosted backend, no team features. If you need that, look at LangSmith, Langfuse, or Arize.
  • Not a guardrail system. It detects drift after the fact and alerts you. It doesn't block actions before they happen (that's on the roadmap).
  • Not production-hardened yet. It works, it's tested, but it hasn't been battle-tested by thousands of users. Expect rough edges.

What it IS: the smallest, simplest tool that does one thing well — tells you when your agent is going off the rails, fast, with zero setup overhead.


Roadmap

  • v0.2 — Auto-correction hooks (retry, context trim, kill run). Preset baseline templates so you get value from run 1.
  • v0.3 — Better multi-agent support. Predictive drift (catch it before it happens).
  • v1.0 — Dashboard, team features, historical analytics. But only if people actually want it.

Built with

  • Python 3.10+
  • SQLite (zero config)
  • sentence-transformers (local CPU embeddings)
  • scikit-learn (basic stats)
  • httpx (webhooks)
  • click + rich (CLI)

Contributing

This is early. If you're running agents in production and hit a case DriftShield missed (or flagged incorrectly), please open an issue. Your real-world edge cases are the most valuable thing you can give this project right now.

git clone https://github.com/YOUR_USERNAME/driftshield.git
cd driftshield
python -m venv .venv
source .venv/bin/activate  # or .venv\Scripts\activate on Windows
pip install -e ".[dev]"
python -m pytest tests/ -v

Why I built this

I kept reading the same story: dev builds agent, agent works great in testing, agent goes haywire in production at 2am, dev wakes up to a hefty API bill and a Slack full of confused users. The big observability platforms exist but they're heavy on dashboards, accounts, pricing tiers, cloud dependencies. Most solo devs and small teams just want to know when their agent is broken. That's it.

So I built the smallest thing that solves that problem.

If you try it and it helps (or doesn't), I genuinely want to hear about it.


License

MIT - do whatever you want with it.
