Core detection, scoring, and healing engine for Pisama agent forensics

pisama-core

Detection, scoring, and healing engine for AI agent systems. It detects failure modes such as infinite loops, hallucinations, cost overruns, and coordination breakdowns in LLM agents. Everything runs offline; no API keys are required.

Part of the Pisama platform for multi-agent failure detection.

Install

pip install pisama-core

Quick Start

import asyncio
from pisama_core import Trace, SpanKind, DetectionOrchestrator

# Build a trace from your agent's execution
trace = Trace()
for _ in range(8):  # simulate the same tool being called 8 times in a row
    trace.create_span(name="Read", kind=SpanKind.TOOL)

# Run all built-in detectors
orchestrator = DetectionOrchestrator()
result = asyncio.run(orchestrator.analyze(trace))

for detection in result.detections:
    print(f"[{detection.detector_name}] {detection.summary}")
    print(f"  Severity: {detection.severity}/100")
    print(f"  Fix: {detection.fix_recommendation.instruction}")

Output:

[loop] Tool 'Read' repeated 8x consecutively
  Severity: 45/100
  Fix: Stop the current loop. Try a different approach or ask the user for guidance.

No API key. No network calls. Runs completely locally.

Built-in Detectors

| Detector | What it catches |
|---|---|
| Loop | Consecutive repetitions, cyclic patterns (A->B->A->B), low tool diversity |
| Repetition | Similar actions with slight variations, tool dominance |
| Cost | Token budget overruns, excessive LLM/tool calls |
| Hallucination | Failed file operations, error rate spikes |
| Coordination | Message storms, agent imbalance, handoff loops |
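
To make the loop row more concrete, here is a minimal standard-library sketch of the three signals it names: consecutive repetitions, cyclic patterns, and low tool diversity. This is not pisama-core's implementation. The function names, the `max_period` and `min_repeats` parameters, and the diversity ratio are all assumptions chosen for illustration.

```python
from typing import List, Optional, Tuple


def longest_consecutive_run(tools: List[str]) -> Tuple[Optional[str], int]:
    """Return (tool, length) for the longest back-to-back run of one tool."""
    best_tool, best_len = None, 0
    run_tool, run_len = None, 0
    for tool in tools:
        if tool == run_tool:
            run_len += 1
        else:
            run_tool, run_len = tool, 1
        if run_len > best_len:
            best_tool, best_len = run_tool, run_len
    return best_tool, best_len


def trailing_cycle(
    tools: List[str], max_period: int = 4, min_repeats: int = 2
) -> Optional[List[str]]:
    """Return the shortest multi-tool pattern that repeats at the end of the sequence."""
    for period in range(2, max_period + 1):
        needed = period * min_repeats
        if len(tools) < needed:
            break  # longer periods need even more calls
        tail = tools[-needed:]
        pattern = tail[:period]
        # Skip single-tool patterns: those are plain consecutive repetition.
        if len(set(pattern)) > 1 and all(
            tail[i] == pattern[i % period] for i in range(needed)
        ):
            return pattern
    return None


def tool_diversity(tools: List[str]) -> float:
    """Ratio of distinct tools to total calls; values near 0 mean low diversity."""
    return len(set(tools)) / len(tools) if tools else 1.0
```

For the Quick Start trace (eight `Read` calls), `longest_consecutive_run` reports a run of 8 and `tool_diversity` is 0.125, while `trailing_cycle` only fires on alternating sequences such as `Read, Edit, Read, Edit`.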

All detectors support both batch analysis (full trace) and real-time hooks (per-span).
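
The difference between batch analysis and a per-span hook is mostly about where state lives. The sketch below shows one way a real-time check could keep a running counter and flag a problem as soon as a threshold is crossed, rather than waiting for the full trace. `StreamingRunCounter`, `on_span`, and the default threshold of 5 are hypothetical names and values, not part of the pisama-core API.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class StreamingRunCounter:
    """Hypothetical per-span hook: flags a tool once it repeats `threshold` times in a row."""

    threshold: int = 5
    _last_tool: Optional[str] = None
    _run_length: int = 0

    def on_span(self, tool_name: str) -> Optional[str]:
        """Update state with one new span; return a message only when the threshold is hit."""
        if tool_name == self._last_tool:
            self._run_length += 1
        else:
            self._last_tool, self._run_length = tool_name, 1
        # Use == rather than >= so the alert fires once per run, not on every later span.
        if self._run_length == self.threshold:
            return f"Tool '{tool_name}' repeated {self._run_length}x consecutively"
        return None


counter = StreamingRunCounter(threshold=5)
alerts = [msg for msg in (counter.on_span("Read") for _ in range(8)) if msg]
print(alerts)  # ["Tool 'Read' repeated 5x consecutively"]
```

A batch detector would compute the same run length from the finished trace; the streaming form trades that hindsight for the ability to interrupt an agent mid-run.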

Use Individual Detectors

import asyncio
from pisama_core import Trace, SpanKind
from pisama_core.detection.detectors.loop import LoopDetector
from pisama_core.detection.detectors.cost import CostDetector

trace = Trace()
# ... add spans representing your agent's execution

loop = LoopDetector()
cost = CostDetector()

loop_result = asyncio.run(loop.detect(trace))
cost_result = asyncio.run(cost.detect(trace))

if loop_result.detected:
    print(f"Loop detected: {loop_result.summary}")
if cost_result.detected:
    print(f"Cost issue detected: {cost_result.summary}")

Write Your Own Detector

from pisama_core import BaseDetector, DetectionResult, Trace
from pisama_core.detection.result import FixType

class MyDetector(BaseDetector):
    name = "my_detector"
    description = "Detects my custom failure pattern"
    version = "0.1.0"

    async def detect(self, trace: Trace) -> DetectionResult:
        # Your detection logic here
        tool_names = trace.get_tool_sequence()
        if len(set(tool_names)) == 1 and len(tool_names) > 5:
            return DetectionResult.issue_found(
                detector_name=self.name,
                severity=50,
                summary="Agent is stuck using a single tool",
                fix_type=FixType.SWITCH_STRATEGY,
                fix_instruction="Try a different approach",
            )
        return DetectionResult.no_issue(self.name)

Register it so the orchestrator picks it up:

from pisama_core import registry
registry.register(MyDetector())

Core Concepts

  • Trace -- A complete agent execution session containing multiple spans
  • Span -- A single unit of work (tool call, LLM inference, agent turn) with kind, timing, and optional I/O data
  • DetectionResult -- Detector output: issue found (yes/no), severity (0-100), evidence, fix recommendation
  • DetectorRegistry -- Plugin system for registering detectors (built-ins auto-register on import)
  • DetectionOrchestrator -- Runs all registered detectors and aggregates results
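
The following standard-library sketch shows how these five concepts could fit together. It is a toy model, not the library's code: every `Sketch*` class, the `analyze` function, and the choice to run detectors concurrently with `asyncio.gather` are assumptions made for illustration.

```python
import asyncio
from dataclasses import dataclass, field
from typing import Dict, List, Protocol


@dataclass
class SketchSpan:
    name: str
    kind: str  # e.g. "tool", "llm", "agent"


@dataclass
class SketchTrace:
    spans: List[SketchSpan] = field(default_factory=list)


@dataclass
class SketchResult:
    detector_name: str
    detected: bool
    severity: int = 0  # 0-100
    summary: str = ""


class SketchDetector(Protocol):
    name: str

    async def detect(self, trace: SketchTrace) -> SketchResult: ...


class SketchRegistry:
    """Plugin store: maps detector names to detector instances."""

    def __init__(self) -> None:
        self._detectors: Dict[str, SketchDetector] = {}

    def register(self, detector: SketchDetector) -> None:
        self._detectors[detector.name] = detector

    def all(self) -> List[SketchDetector]:
        return list(self._detectors.values())


class SingleToolDetector:
    name = "single_tool"

    async def detect(self, trace: SketchTrace) -> SketchResult:
        tools = [s.name for s in trace.spans if s.kind == "tool"]
        if len(tools) > 5 and len(set(tools)) == 1:
            return SketchResult(self.name, True, 50, "Agent is stuck using a single tool")
        return SketchResult(self.name, False)


async def analyze(registry: SketchRegistry, trace: SketchTrace) -> List[SketchResult]:
    """Run every registered detector and keep only the positive results."""
    results = await asyncio.gather(*(d.detect(trace) for d in registry.all()))
    return [r for r in results if r.detected]


registry = SketchRegistry()
registry.register(SingleToolDetector())
trace = SketchTrace([SketchSpan("Read", "tool") for _ in range(8)])
detections = asyncio.run(analyze(registry, trace))
print([(d.detector_name, d.severity) for d in detections])  # [('single_tool', 50)]
```

Keeping the registry separate from the orchestration step is what lets a custom detector participate without changes to the code that runs the analysis.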

Platform Support

Traces are framework-agnostic. Set platform in the trace metadata to enable platform-aware threshold tuning:

from pisama_core import Trace, TraceMetadata, Platform

trace = Trace(metadata=TraceMetadata(platform=Platform.LANGGRAPH))

Works with Claude Agent SDK, LangGraph, AutoGen, CrewAI, n8n, Dify, and custom agents.

Pisama Platform

For production monitoring with 25+ calibrated detectors, ML-based detection, LLM-as-judge verification, and a dashboard, see pisama.ai.

License

MIT
