Skip to main content

Crash reporting for AI agents. Catch failures before your users do.

Project description

agent-sentry banner

PyPI Downloads Python License Tests

Crash reporting for AI agents. Catch failures before your users do.


Your AI agent works great in demos. In production, it fails silently. Tool calls timeout. Context windows overflow. Hallucinations slip through. And you have no idea until a user complains.

agent-sentry catches every failure, classifies why it happened, and alerts you in real time.

Quick Start

pip install ai-agent-sentry

Note: The PyPI package is ai-agent-sentry, but you import it as agent_sentry.

from agent_sentry import watch

@watch
def my_agent(query):
    result = call_llm(query)
    return result

# Every call is now tracked. Every failure is caught.
my_agent("summarize this document")

Open the dashboard:

agent-sentry dashboard

Three lines of code. Full visibility into every failure.

Why?

You deployed an agent. It runs 10,000 tool calls a day. Some fail. You don't know which ones, you don't know why, and you don't know how often.

Traditional monitoring sees HTTP 200s and thinks everything is fine. But your agent just hallucinated a fake API endpoint, retried it 5 times, burned $2 in tokens, and returned garbage to the user. Nobody noticed.

agent-sentry is Sentry for AI agents.

Features

@watch decorator One line. Wraps any agent function.
Root cause classification Timeout, hallucination, context overflow, rate limit, auth error, silent failure, wrong tool. Automatic.
Live dashboard Streamlit UI with failure rates, root causes, reliability scores, and cost tracking.
Cost tracking Token usage and estimated spend per call.
Reliability score Track agent health over time, like a credit score.
Alerts Slack, webhooks, email, or custom callbacks.
CLI Terminal reports without opening a browser.
Zero config One decorator. Done.

Integrations

# OpenAI
from agent_sentry.integrations.openai import SentryOpenAIWrapper
client = SentryOpenAIWrapper(openai_client)
response = client.chat_completions_create(model="gpt-4", messages=[...])

# Anthropic
from agent_sentry.integrations.anthropic import SentryAnthropicWrapper
client = SentryAnthropicWrapper(anthropic_client)
response = client.messages_create(model="claude-sonnet-4-20250514", messages=[...])

# LangChain
from agent_sentry.integrations.langchain import AgentSentryCallbackHandler
llm = ChatOpenAI(callbacks=[AgentSentryCallbackHandler()])

# Any function
@watch(event_type="tool_call", tags=["search"])
def search_web(query):
    return requests.get(f"https://api.search.com?q={query}").json()

Root Cause Categories

Category What It Catches
timeout Deadline exceeded, slow requests
hallucination References to non-existent tools, APIs, or data
context_overflow Token limit exceeded
malformed_args Type errors, validation failures
rate_limit 429s, quota exceeded
auth_error Invalid keys, 401/403
silent_failure No error, but empty/null results
network_error Connection refused, DNS, SSL
wrong_tool Agent picked the wrong tool

Performance

Metric Without With agent-sentry
Time to detect failure ~4.2 hours 0.3 seconds
Failures identified ~15% 100%
Root cause classification Manual Automatic
Overhead per call -- <2ms
Storage per 10K events -- ~5MB (SQLite)

Alerts

from agent_sentry import configure

# Slack
configure(slack_webhook="https://hooks.slack.com/services/YOUR/WEBHOOK/URL")

# Webhooks
configure(webhook_url="https://your-server.com/agent-alerts")

# Custom
from agent_sentry import CallbackAlert
configure(alert_channels=[CallbackAlert(lambda e: print(f"ALERT: {e['error_message']}"))])

CLI

agent-sentry dashboard          # Launch Streamlit dashboard
agent-sentry report             # Terminal report (last 24h)
agent-sentry report --hours 168 # Last 7 days
agent-sentry clear              # Clear all events

Architecture

@watch decorator
    |
    v
EventCapture (intercepts calls, measures latency, catches errors)
    |
    +--> RootCauseClassifier (timeout? hallucination? rate limit?)
    +--> EventStore (SQLite, WAL mode, thread-safe)
    +--> AlertManager (Slack, webhooks, email, callbacks)
    +--> Dashboard (Streamlit, reads from EventStore)

All data stays local. SQLite with WAL mode. Thread-safe. No external services required.

Installation

pip install ai-agent-sentry                                # Core
pip install ai-agent-sentry[dashboard]                     # + Streamlit dashboard
pip install ai-agent-sentry[openai,anthropic,langchain]    # + Framework integrations
pip install ai-agent-sentry[all]                           # Everything

Contributing

See CONTRIBUTING.md. Issues and PRs welcome.

Roadmap

See ROADMAP.md.

License

MIT. See LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ai_agent_sentry-0.1.1.tar.gz (36.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ai_agent_sentry-0.1.1-py3-none-any.whl (24.2 kB view details)

Uploaded Python 3

File details

Details for the file ai_agent_sentry-0.1.1.tar.gz.

File metadata

  • Download URL: ai_agent_sentry-0.1.1.tar.gz
  • Upload date:
  • Size: 36.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for ai_agent_sentry-0.1.1.tar.gz
Algorithm Hash digest
SHA256 8b5c827d8bfb0ad2324ac266c17c495a0cf2baee83537258e0db22f4b6356011
MD5 e6129e77ebeee7fc0fa2c817ca680c48
BLAKE2b-256 3adffff8b568217924abe207bf70a86c62cf664ec42a572012bc541b71af7bf0

See more details on using hashes here.

File details

Details for the file ai_agent_sentry-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for ai_agent_sentry-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b3030cf93984041868dc7d288be1da621c0029a2b4b19618ff97b11126732716
MD5 78575b4e07ed5bbe85bc3b7f5a062217
BLAKE2b-256 8e03e2fd4fb0d380d8307669a1149aa517a6fdaabd4402a686dae0ffc0df709d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page