Skip to main content

The Active Reliability Layer for AI Agents

Project description

Steer Logo

Steer SDK

Active Reliability Layer for AI Agents.

Steer is an open-source Python library that intercepts agent failures (hallucinations, bad JSON, PII leaks) and allows you to inject fixes via a local dashboard without changing your code.

PyPI version

The Problem

When an agent fails in production (e.g., outputs bad JSON), logging the error isn't enough. You usually have to:

  1. Dig through logs to find the prompt.
  2. Edit your prompt template manually.
  3. Redeploy the application.

The Solution

Steer wraps your agent function. When it detects a failure, it blocks the output and logs it to a local dashboard. You click "Teach" to provide a correction (e.g., "Use Strict JSON"), and Steer injects that rule into the agent's context for future runs.

Visual Workflow:

Steer Dashboard

Installation

pip install steer-sdk

Quickstart

Generate the example scripts to see the workflow in action:

steer init
# Generates 01_structure_guard.py, 02_safety_guard.py, etc.

steer ui
# Starts the local dashboard at http://localhost:8000

Run a demo (Split-screen recommended):

  1. Run python 01_structure_guard.py. It will fail (Blocked).
  2. Go to http://localhost:8000. Click Teach. Select "Strict JSON".
  3. Run python 01_structure_guard.py again. It will succeed.

Usage

Steer uses a decorator pattern to wrap your existing functions.

from steer import capture
from steer.verifiers import JsonVerifier

# 1. Define Verifiers
json_check = JsonVerifier(name="Strict JSON")

# 2. Decorate your Agent Function
@capture(verifiers=[json_check])
def my_agent(user_input, steer_rules=""):
    
    # 3. Pass 'steer_rules' to your system prompt.
    # Steer populates this argument automatically based on your teaching.
    system_prompt = f"You are a helpful assistant.\n{steer_rules}"
    
    # ... Your LLM call ...
    return llm.call(system_prompt, user_input)

🧬 Data Engine: From Guardrails to Fine-Tuning

Steer does not just catch errors; it creates the dataset needed to fix them permanently.

Every time a rule is applied or an agent succeeds, Steer logs the interaction. You can export these logs into a standard fine-tuning format (JSONL) compatible with OpenAI and other providers.

Export Training Data

Run this command to convert your local logs into a dataset:

steer export

Output: steer_fine_tune.jsonl

Format:

{"messages": [{"role": "system", "content": "..."}, {"role": "user", "content": "..."}, {"role": "assistant", "content": "{\"valid\": \"json\"}"}]}

The Fine-Tuning Workflow

  1. Capture: Run your agent with Steer. Fix issues in the Dashboard.
  2. Export: Run steer export to generate the dataset.
  3. Train: Upload steer_fine_tune.jsonl to OpenAI/Anthropic to fine-tune a model.
  4. Remove: Once the model is trained, you can often remove the strict guardrails, reducing latency.

Configuration

The Quickstart demos use a Mock LLM and require no API keys.

To use advanced LLM-based verifiers in production, set your environment variables:

export GEMINI_API_KEY=...
# OR
export OPENAI_API_KEY=...

What is the "Confident Idiot" Problem?

The Confident Idiot is a failure mode where an LLM generates a factually incorrect or structurally broken response with high probability (confidence).

Unlike traditional software that crashes when it fails, LLMs fail silently and plausibly.

  • Example: User asks "Weather in Springfield". The agent confidently guesses "Springfield, IL" (ignoring the fact that there are 33 other Springfields in the USA).
  • The Fix: Steer prevents this by enforcing Reality Locks (deterministic checks) that run after generation but before the user sees the response.

Read the viral discussion on Hacker News.

Star History

Star History Chart

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

steer_sdk-0.2.6.tar.gz (456.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

steer_sdk-0.2.6-py3-none-any.whl (474.2 kB view details)

Uploaded Python 3

File details

Details for the file steer_sdk-0.2.6.tar.gz.

File metadata

  • Download URL: steer_sdk-0.2.6.tar.gz
  • Upload date:
  • Size: 456.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.2.1 CPython/3.13.9 Darwin/24.6.0

File hashes

Hashes for steer_sdk-0.2.6.tar.gz
Algorithm Hash digest
SHA256 968ed2d0b978be331ddd0f8bb0364e65222e2c8d165f3524f6ddd7db0ccd0881
MD5 c7f61230d1a5ee3ca1bce7fa06f1d388
BLAKE2b-256 254e027ac5a7b0933599daec461da45b7dda7059a5dbf3b859bffabfe33ff133

See more details on using hashes here.

File details

Details for the file steer_sdk-0.2.6-py3-none-any.whl.

File metadata

  • Download URL: steer_sdk-0.2.6-py3-none-any.whl
  • Upload date:
  • Size: 474.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.2.1 CPython/3.13.9 Darwin/24.6.0

File hashes

Hashes for steer_sdk-0.2.6-py3-none-any.whl
Algorithm Hash digest
SHA256 179a3195451110ddfe25a34466597219f6db791c23bbd8f2dc14f72bc858aa5c
MD5 46cb9b19440beaa80af225a2957c3a0d
BLAKE2b-256 079eab9dcaab264f13d945795fd341e3fa00f0dac1a3be1cf5f78c33efa5d107

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page