Behavioral monitoring and directive control for AI agents

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

tr00x

These details have not been verified by PyPI

Project description

SOMA

SOMA -- The Nervous System for AI Agents

Proprioceptive behavioral monitoring -- agents that feel themselves.

SOMA is a real-time behavioral monitoring system that gives AI agents awareness of their own state. It intercepts every tool call, computes behavioral pressure from 11 vital signals, detects failure patterns, enforces safety reflexes, and injects self-awareness directly into the agent's environment. Think of it as a nervous system: sensing, reacting, learning, remembering -- so agents can self-correct before problems escalate.

pip install soma-ai

Dashboard

SOMA includes a real-time web dashboard (FastAPI + SSE, port 7777) for monitoring agent behavior, analyzing sessions, and configuring settings.

Overview -- live pressure gauges, behavioral insights, and findings

More screenshots

Deep Dive -- per-agent pressure timeline, vitals breakdown, and baseline report

Analytics -- cross-session trends, tool usage distribution, mirror effectiveness

Logs -- filterable action log with tool names, pressure, errors, and timing Logs

Sessions -- session history with cross-session trend comparison

Settings -- mode selection, thresholds, weights, budget, and policy configuration

Who is this for

AI engineers building agent systems who want observability beyond logs
Teams deploying agents in production who need safety rails without babysitting
Researchers studying agent reliability, failure patterns, and self-correction
Anyone using Claude Code who wants their agent to stop retrying the same failing command

SOMA works today as a Claude Code hook system. The core is platform-agnostic -- SDK adapters exist for LangChain, CrewAI, AutoGen, and any Anthropic API client. Hook adapters also exist for Cursor and Windsurf.

The problem

AI agents are blind to their own behavior. They retry the same failing command five times. They edit files they never read. Their error rate climbs for ten actions straight and they don't notice.

This isn't anecdotal:

41-86% failure rate across agent benchmarks (MAST, Berkeley NeurIPS 2025)
Reliability lags capability by 2-3x (Kapoor et al., Princeton 2026)
Agents degrade predictably -- error cascades, retry loops, scope drift -- but have no signal to self-correct

Every existing tool monitors agents externally for humans. Dashboards and alerts for the operator. The agent itself never sees the data.

SOMA's key insight: LLMs ignore instructions but cannot ignore environmental data. Embed behavioral telemetry into tool responses and the agent processes it like any other fact about the world.

What SOMA does

Behavioral Sensing

11 real-time vital signals computed per action:

Signal	What it measures
Uncertainty	Epistemic vs aleatoric -- does the agent know what it doesn't know?
Drift	Behavioral deviation from established baseline (phase-aware)
Error rate	Rolling error frequency with pattern weighting
Entropy	Action distribution disorder -- are tool choices becoming random?
Goal coherence	Is the agent still working toward its objective?
Token usage	Consumption rate and acceleration
Cost	Spend rate per action and cumulative
Context burn rate	How fast is the context window being consumed?
Half-life	Estimated actions until the agent becomes ineffective
Calibration	How well do confidence signals predict actual outcomes?
Verbal-behavioral divergence	Does the agent say one thing and do another?

All signals feed into Pressure -- a unified 0-to-1 metric aggregated via sigmoid z-scores with configurable blending (mean + max). EMA baselines with cold-start blending prevent false positives in early sessions.

Pattern Detection

Real-time detection of known failure modes:

Blind edits -- writing to files never read (the #1 agent anti-pattern)
Retry storms -- same failing command repeated 3+ times
Bash failure cascades -- error rate > 40% with dedup detection
Thrashing -- editing the same file 3+ times without progress
Research stall -- reading without acting, context burning without output
Quality grading -- A/B/C/D/F based on syntax errors, lint issues, bash failures

Reflex System

Hard safety blocks that fire before the tool executes:

Destructive operation blocks -- rm -rf, git push --force, DROP TABLE
Commit gate -- blocks git commit when quality grade is D or F
Circuit breaker -- halts cascading failures after threshold
Retry dedup -- prevents identical failing commands from re-executing
Blind write prevention -- warns on file edits without prior read

Reflexes operate independently from pressure. They fire at any pressure level when the specific pattern is detected.

Mirror

Proprioceptive feedback via environment augmentation -- the agent sees its own state as facts in tool responses.

Mode	Cost	When	What the agent sees
PATTERN	$0	Known behavioral pattern	`pattern: same bash cmd repeated 5x`
STATS	$0	Elevated pressure, no pattern	`errors: 3/8 \| error_rate: 0.41`
SEMANTIC	~$0.001	High pressure + drift	LLM-generated behavioral observation

Mirror learns from outcomes. After each injection, it watches the next 3 actions. If pressure drops >=10%, the pattern helped and gets cached. Ineffective patterns are pruned after 5 failures.

Multi-Agent Intelligence

PressureGraph -- directed graph modeling inter-agent dependencies with trust-weighted edges and damping-based pressure propagation
Subagent monitoring -- tracks spawned child agents, their tool usage, error rates, and token consumption
Cascade risk detection -- when a subagent's error rate crosses threshold, risk propagates to parent pressure
Coordination SNR -- signal-to-noise ratio per agent in multi-agent workflows

Cross-Session Memory

SOMA remembers across sessions:

Behavioral fingerprinting -- tool distribution, error baselines, read/write ratios tracked via Jensen-Shannon divergence. Detects when an agent's behavior shifts from its historical norm
Session history -- append-only JSONL log of every session (pressure trajectories, mode transitions, tool distributions, phase sequences)
Cross-session predictor -- matches current trajectory against historical patterns (cosine similarity) to predict escalations before they happen
Learned pattern database -- effective Mirror patterns cached and reused; ineffective ones pruned

Budget and Resource Tracking

Token and cost tracking with configurable limits per dimension
Burn rate estimation -- tokens/action and cost/action with trend detection
Half-life estimation -- predicts when the agent will become ineffective based on context consumption
Budget exhaustion blocking -- stops API calls when budget is spent
Handoff suggestions -- recommends human takeover when half-life is critical

Predictions and Forecasting

Linear trend extrapolation from recent pressure window
Pattern-based boosts -- error streaks (+0.15), blind writes (+0.10), thrashing (+0.08), retry storms (+0.12) added to predicted pressure
Cross-session trajectory matching -- blends 60% current trend + 40% historical pattern match
Confidence scoring via R-squared on trend fit
Escalation warnings -- predicts threshold crossings N actions ahead

Quick start

pip install soma-ai

Configure Claude Code hooks automatically:

soma setup-claude

Or add hooks manually to ~/.claude/settings.json:

{
  "hooks": {
    "PreToolUse": [{ "type": "command", "command": "soma-hook" }],
    "PostToolUse": [{ "type": "command", "command": "soma-hook" }],
    "Stop": [{ "type": "command", "command": "soma-hook" }]
  }
}

SOMA is silent when the agent is healthy. When behavioral pressure rises above 15%, session context appears in tool responses.

For semantic mode (optional): export GEMINI_API_KEY=... (free tier).

See docs/QUICKSTART.md for the full setup guide.

Programmatic API

import soma

# Quick start -- creates engine with defaults
engine = soma.quickstart()

# Wrap any Anthropic client -- all calls monitored transparently
client = soma.wrap(anthropic.Anthropic())

# Universal proxy for any framework (LangChain, CrewAI, custom)
proxy = soma.SOMAProxy(engine, "my-agent")
safe_tool = proxy.wrap_tool(my_function)
child = proxy.spawn_subagent("child-agent")

# Replay and analyze past sessions
soma.replay_session("~/.soma/sessions/recording.json")

Architecture

Tool Call ─────────────────────────────────────────────> Tool Execution
     │                                                         │
     v                                                         v
+- SOMA Engine ────────────────────────────────────────────────────+
│                                                                   │
│  PRE-TOOL (before execution)          POST-TOOL (after execution) │
│  +──────────────+                     +──────────────────+        │
│  │   Skeleton   │ hard blocks,        │  Sensor Layer    │        │
│  │   (Reflexes) │ retry dedup,        │  11 vitals ->    │        │
│  │              │ blind write warn    │  EMA baselines ->│        │
│  +──────────────+                     │  pressure (0->1) │        │
│                                       +────────┬─────────+        │
│                                                │                  │
│                                       +────────v─────────+        │
│                                       │ Pattern Detection │       │
│                                       │ retry, thrash,    │       │
│                                       │ blind edit, stall │       │
│                                       +────────┬─────────+        │
│                                                │                  │
│                                       +────────v─────────+        │
│                                       │     Mirror        │       │
│                                       │ PATTERN -> STATS  │       │
│                                       │ -> SEMANTIC       │       │
│                                       │ (-> tool response)│       │
│                                       +────────┬─────────+        │
│                                                │                  │
│  +──────────────+                     +────────v─────────+        │
│  │ PressureGraph│<────────────────────│   Multi-Agent    │        │
│  │ (propagation)│ trust-weighted      │   Coordination   │        │
│  +──────────────+ edges               +──────────────────+        │
│                                                                   │
│  +────────────────────────────────────────────────────────+       │
│  │ Memory: fingerprints | sessions | patterns | predictor │       │
│  +────────────────────────────────────────────────────────+       │
+───────────────────────────────────────────────────────────────────+

Escalation: OBSERVE (silent) -> GUIDE (suggestions) -> WARN (insistent) -> BLOCK (destructive ops only)
Delivery:   stdout -> tool response (agent sees)  |  stderr -> system diagnostics (operator sees)

Integrations

Platform	Method	Status
Claude Code	Hook system (pre/post tool use, stop, notification)	Production
Anthropic API	`soma.wrap(client)` -- transparent proxy	Production
Any framework	`soma.SOMAProxy` -- universal tool wrapper	Production
LangChain	SDK adapter with callback handler	Adapter ready
CrewAI	SDK adapter with tool decorator	Adapter ready
AutoGen	SDK adapter with agent wrapper	Adapter ready
Cursor	Hook adapter (`CursorAdapter`)	Adapter ready
Windsurf	Hook adapter (`WindsurfAdapter`)	Adapter ready
OpenTelemetry	Metrics export (gauges, counters)	Optional extra
Webhooks	Fire-and-forget HTTP on WARN/BLOCK events	Built-in

Research foundation

SOMA addresses gaps identified in recent agent reliability research:

Paper	Finding	SOMA response
Kapoor et al. (Princeton 2026)	Reliability lags capability 2-3x	Real-time behavioral feedback
MAST (Berkeley NeurIPS 2025)	41-86% failure, error cascades	Pattern detection + pressure signal
METR (2025)	Silent failures, no self-correction	Proprioceptive session context
Anthropic (2025)	Tool errors propagate unchecked	Pre/post tool use interception

All prior work measures behavior post-hoc for human review. SOMA provides real-time proprioceptive feedback to the agent itself.

See docs/RESEARCH.md for the full research mapping.

v0.6.0 highlights

Mirror -- proprioceptive feedback via environment augmentation. Three escalation modes (PATTERN -> STATS -> SEMANTIC), self-learning from outcomes, zero-cost for pattern/stats modes. Web dashboard -- real-time FastAPI + SSE dashboard on port 7777 with 6 tabs (Overview, Deep Dive, Analytics, Logs, Sessions, Settings). See CHANGELOG.md for full history.

Stats

90 modules | 74 test files | 19k lines | Python 3.11+ | MIT license

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

tr00x

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.7.0

Apr 15, 2026

This version

0.6.3

Apr 14, 2026

0.6.2

Apr 12, 2026

0.6.1

Apr 12, 2026

0.6.0

Apr 2, 2026

0.5.0

Mar 31, 2026

0.4.12

Mar 30, 2026

0.3.5

Mar 29, 2026

0.3.4

Mar 29, 2026

0.3.3

Mar 29, 2026

0.3.2

Mar 29, 2026

0.3.1

Mar 29, 2026

0.3.0

Mar 29, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

soma_ai-0.6.3.tar.gz (1.2 MB view details)

Uploaded Apr 14, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

soma_ai-0.6.3-py3-none-any.whl (260.2 kB view details)

Uploaded Apr 14, 2026 Python 3

File details

Details for the file soma_ai-0.6.3.tar.gz.

File metadata

Download URL: soma_ai-0.6.3.tar.gz
Upload date: Apr 14, 2026
Size: 1.2 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for soma_ai-0.6.3.tar.gz
Algorithm	Hash digest
SHA256	`904ab1af7c787d420b16c8a2c33d8e9dd83c37620d619ecd0d2d65f4d20a1686`
MD5	`55970e155e5156174d15ea5120fbedd8`
BLAKE2b-256	`705cf127ac6c3bb590f4dc4f9ac08f1cf5d8a5aba310ba1ca1d37593de99c6f4`

See more details on using hashes here.

Provenance

The following attestation bundles were made for soma_ai-0.6.3.tar.gz:

Publisher: publish.yml on tr00x/SOMA-Core

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: soma_ai-0.6.3.tar.gz
- Subject digest: 904ab1af7c787d420b16c8a2c33d8e9dd83c37620d619ecd0d2d65f4d20a1686
- Sigstore transparency entry: 1297022497
- Sigstore integration time: Apr 14, 2026
Source repository:
- Permalink: tr00x/SOMA-Core@f0cb5e12af67b270f7283c07a0fb241cf7e11d89
- Branch / Tag: refs/tags/v0.6.3
- Owner: https://github.com/tr00x
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@f0cb5e12af67b270f7283c07a0fb241cf7e11d89
- Trigger Event: release

File details

Details for the file soma_ai-0.6.3-py3-none-any.whl.

File metadata

Download URL: soma_ai-0.6.3-py3-none-any.whl
Upload date: Apr 14, 2026
Size: 260.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for soma_ai-0.6.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`61a7880c0c15558666cd00fa760e43d42ebcd26b663e5ad3ab65d938758451a4`
MD5	`43baf92b5c9379a4e5a586bcbc351ccc`
BLAKE2b-256	`3934744e3f150456c60a13e3c7a82c8d76854b9060c23bbd51277ac8211fe475`

See more details on using hashes here.

Provenance

The following attestation bundles were made for soma_ai-0.6.3-py3-none-any.whl:

Publisher: publish.yml on tr00x/SOMA-Core

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: soma_ai-0.6.3-py3-none-any.whl
- Subject digest: 61a7880c0c15558666cd00fa760e43d42ebcd26b663e5ad3ab65d938758451a4
- Sigstore transparency entry: 1297022581
- Sigstore integration time: Apr 14, 2026
Source repository:
- Permalink: tr00x/SOMA-Core@f0cb5e12af67b270f7283c07a0fb241cf7e11d89
- Branch / Tag: refs/tags/v0.6.3
- Owner: https://github.com/tr00x
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@f0cb5e12af67b270f7283c07a0fb241cf7e11d89
- Trigger Event: release

soma-ai 0.6.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Meta

Classifiers

Project description

SOMA -- The Nervous System for AI Agents

Dashboard

Who is this for

The problem

What SOMA does

Behavioral Sensing

Pattern Detection

Reflex System

Mirror

Multi-Agent Intelligence

Cross-Session Memory

Budget and Resource Tracking

Predictions and Forecasting

Quick start

Programmatic API

Architecture

Integrations

Research foundation

v0.6.0 highlights

Stats

Links

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance