Scan your agentic codebase for unguarded tool calls with real-world side effects

These details have not been verified by PyPI

Project links

Project description

Python 3.10+ License: Apache-2.0 Dependencies: 0

agent-canary

Your agent can send emails, delete database rows, and charge credit cards — do you know which of those calls have no validation around them?

agent-canary scans your Python codebase and maps every function that can change the real world. For each one, it tells you what safety checks exist — and what's missing.

$ agent-canary ./skyvern/
382 tool calls · 307 with no checks · 66 partial · 9 confirmed

⚠ terminate                 .../script_skyvern_page.py:868
  actions:
    shutil.rmtree(temp_dir)
    os.kill(pid, signal.SIGTERM)
  checks:  none

⚠ _analyze_gmail_messages   .../composio_gmail_connector.py:228
  actions:
    session.execute(insert(messages))
  checks:  none
    → no auth check · no rate limit · no idempotency key

~ session_create             .../browser.py:126
  actions:
    db.add(browser_session)
    db.commit()
  checks:
    if not current_user: raise HTTPException(403)
    → no rate limit

✓ _get_or_create_browser_state  .../script_skyvern_page.py:81  [idempotency: full]

The problem

Your agent calls stripe.Refund.create, session.commit, requests.post — and nothing stops it from doing so twice, without auth, or with unbounded parameters. 1,075 GitHub issues across LangGraph, CrewAI, AutoGen, and OpenAI Agents SDK document tool calls executing multiple times without idempotency. 307 of 382 tool calls in Skyvern have no checks at all.

What it does

Scans your Python source with AST analysis. No network calls. No config. No dependencies. Finds every function that triggers a real-world action (DB write, payment, email, API call, LLM invocation, file delete) and checks whether protections exist (auth, rate limit, validation, idempotency, retry bounds).

Quickstart

pip install agent-canary
agent-canary ./my_agent/

Output in < 2 seconds. Zero dependencies. Try it on the included demo:

agent-canary examples/demo_agent/

What gets flagged

Tool call	Why it's flagged
`stripe.Refund.create(amount=amount)`	No bounds on `amount`, no rate limit, no idempotency key
`session.commit()` in an agent tool	No auth check before the write
`openai.chat.completions.create()` in a retry loop	No `max_retries` or `stop_after_attempt` — unbounded LLM spend

CI integration

- run: pip install agent-canary
- run: agent-canary . --fail-on-unchecked --output-registry toolcalls.yaml

Exit code 1 if any new tool call has no checks. Existing unguarded calls are visible but don't block CI until you address them.

The registry

agent-canary . --format registry > toolcalls.yaml

Generates a YAML inventory of every tool call, its checks, and what's missing. Commit it to your repo. Diff it on PRs. Each entry can be signed off with # checked:ok — creating an auditable record of who reviewed what.

This is what no other tool produces: a versionable, diffable artifact that tracks your agent's entire impact surface over time. See examples/toolcalls.skyvern.yaml for a real excerpt.

Benchmarked on real projects

Project	Stars	Tool calls	Unguarded	Time
Skyvern	20.9k	382	307 (80%)	~2s
SurfSense	13.3k	319	169 (53%)	1.4s
FinRobot	6.5k	27	18 (67%)	<1s

How to resolve findings

Action	How
Fix	Add validation in code. The next scan picks it up.
Acknowledge	Add `# checked:ok` as a comment on the function.
Protected elsewhere	Add `# checked:ok — protected by [middleware/gateway/etc]`

What it detects

Tool calls: session.commit, db.add, stripe.Refund.create, requests.post, send_mail, openai.chat.completions.create, exec(), s3.put_object, os.remove, shutil.rmtree

Checks: Depends() / Security() (FastAPI), @login_required, @rate_limit, get_or_create, Field(le=, ge=), max_retries=, confirm / approve in function body

Limitations

Import aliases — import requests as req then req.post() is not detected
Cross-function analysis — if the check is in the caller and the effect is in the callee, not detected
Python only — TypeScript planned

Why this exists

We analyzed 3,047 GitHub issues across LangGraph, CrewAI, AutoGen, OpenAI Agents SDK, Claude Code, and Vercel AI SDK. 737 directly document tool calls executing without checks — duplicate executions, missing rate limits, loops without bounds, payments without validation.

The most common pattern (1,075 issues): a tool call that executes multiple times when it should execute once. The cause: no idempotency, no rate limit, no circuit breaker in the code around the tool.

agent-canary doesn't fix these problems. It makes them visible.

See methodology for data sources and classification criteria.

Configuration

For dynamic tools (MCP servers, plugins), create an agent-canary.yml:

tools:
  - name: search_web
    effects: [http_write]
  - name: send_slack
    effects: [messaging]

License

Apache-2.0

Built by the team behind Diplomat — runtime governance for AI agents.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.0

Mar 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agent_canary-0.1.0.tar.gz (62.3 kB view details)

Uploaded Mar 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agent_canary-0.1.0-py3-none-any.whl (38.8 kB view details)

Uploaded Mar 23, 2026 Python 3

File details

Details for the file agent_canary-0.1.0.tar.gz.

File metadata

Download URL: agent_canary-0.1.0.tar.gz
Upload date: Mar 23, 2026
Size: 62.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for agent_canary-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`a3fc253c20dce95da8265493a8f648836e5d4068b77d42a0659ec6c1a2542257`
MD5	`7ea125713f3d2517e59341aedf140d5a`
BLAKE2b-256	`1e06c7bcd365f6b4a4c311afef2445e8785b2094bee40762ce5f455b73654907`

See more details on using hashes here.

File details

Details for the file agent_canary-0.1.0-py3-none-any.whl.

File metadata

Download URL: agent_canary-0.1.0-py3-none-any.whl
Upload date: Mar 23, 2026
Size: 38.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for agent_canary-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5744c41a8ac2d21114e3e0e1dbac8b475057141c1ebb498051707f338ba63ffb`
MD5	`573da3a4ce98d8e1c7c364a79b433d3e`
BLAKE2b-256	`c13fb87871e670d93e50e0b660d04b895a2893c3c69df21e3b3d6e8071d9e208`

See more details on using hashes here.

agent-canary 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

agent-canary

The problem

What it does

Quickstart

What gets flagged

CI integration

The registry

Benchmarked on real projects

How to resolve findings

What it detects

Limitations

Why this exists

Configuration

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes