
Arc Runtime

Python 3.8+ · MIT License · OpenTelemetry

Arc Runtime is a lightweight Python interceptor that prevents AI agent failures in real-time by applying learned fixes before requests reach the LLM. It's the client-side component of the Arc AI reliability system, designed to work with Arc Core.

Key Features

  • Zero-config interception - Just import and it works
  • Ultra-low latency - 0.011ms P99 overhead, 99.78% below the 5ms requirement
  • Thread-safe - Works seamlessly with async and multi-threaded applications
  • Pattern matching - Real-time detection and fixing of known failure patterns
  • Multi-agent support - Track complex agent pipelines with context handoffs
  • MCP interception - Monitor Model Context Protocol communications
  • LangGraph integration - Automatic tracking for LangGraph workflows
  • OpenTelemetry support - Full agent telemetry capture (reasoning traces, tool calls, tokens)
  • Graceful degradation - Never breaks your application if Arc Core is unreachable
  • Local metrics - Prometheus endpoint at http://localhost:9090/metrics

How It Works

Arc Runtime intercepts outgoing LLM API calls and:

  1. Matches requests against known failure patterns (<1ms)
  2. Applies fixes before the request reaches the LLM
  3. Streams telemetry to Arc Core for continuous learning
  4. Exposes metrics for monitoring
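The steps above can be sketched in a few lines of Python. Every name here (`protect`, `telemetry`, the simplified exact-match check) is illustrative only, not Arc Runtime's real internals:

```python
import queue

# Telemetry would be streamed asynchronously over gRPC in practice;
# a queue stands in for that here.
telemetry = queue.Queue()

def protect(call, patterns):
    """Wrap an LLM call: apply known fixes pre-flight, then record telemetry."""
    def wrapper(**request):
        for pattern, fix in patterns:
            # Simplified exact-match check (the real matcher also supports
            # comparison operators like {">": 0.9})
            if all(request.get(k) == v for k, v in pattern.items()):
                request = {**request, **fix}  # fix applied before the LLM sees it
        response = call(**request)            # forward to the underlying SDK
        telemetry.put({"request": request, "response": response})
        return response
    return wrapper

fake_llm = lambda **req: {"echo": req}
client = protect(fake_llm, [({"temperature": 1.5}, {"temperature": 0.7})])
result = client(model="gpt-4", temperature=1.5)
print(result["echo"]["temperature"])  # 0.7
```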

System Architecture

Arc Runtime is the client-side component that sits in your application environment:

graph TB
    subgraph "Your Application Environment"
        App[Your AI Application]
        Arc[Arc Runtime]
        SDK[OpenAI/Anthropic SDK]
        Cache[(Local Cache)]
        
        App --> Arc
        Arc --> Cache
        Cache --> Arc
        Arc --> SDK
    end
    
    SDK --> API[LLM API]
    API --> SDK
    SDK --> Arc
    Arc --> App
    
    subgraph "Arc Core Service"
        Collector[gRPC Collector]
        Detector[Failure Detector]
        Registry[Pattern Registry]
        
        Collector --> Detector
        Detector --> Registry
    end
    
    Arc -.-> Collector
    Registry -.-> Cache
    
    style Arc fill:#4CAF50,stroke:#2E7D32,stroke-width:2px
    style App fill:#2196F3,stroke:#1565C0,stroke-width:2px
    style Cache fill:#FFB74D,stroke:#F57C00,stroke-width:2px
    style Collector fill:#E1BEE7,stroke:#9C27B0,stroke-width:2px
    style Registry fill:#FFCDD2,stroke:#D32F2F,stroke-width:2px

Request Flow:

  1. Your AI Application makes an API call
  2. Arc Runtime intercepts the request
  3. Checks local cache for matching failure patterns
  4. Applies fixes if patterns match
  5. Forwards the (potentially modified) request to the LLM SDK
  6. SDK sends request to LLM API
  7. Response flows back through Arc Runtime to your application
  8. Arc Runtime asynchronously streams telemetry to Arc Core

Key Integration Points:

  • Telemetry Streaming: Arc Runtime streams all request/response data to Arc Core via gRPC
  • Pattern Updates: Arc Core pushes new failure patterns and fixes to Runtime instances
  • Metrics Export: Local Prometheus endpoint for monitoring Arc Runtime performance

Installation

pip install arc-runtime

For development:

git clone https://github.com/arc-computer/runtime.git
cd runtime
pip install -e .

Quick Start

Zero Configuration

import openai
from runtime import Arc

# Initialize Arc - this automatically patches OpenAI
Arc()

# Use OpenAI as normal - Arc protects your calls
client = openai.OpenAI()  # Uses API key from environment
response = client.chat.completions.create(
    model="gpt-4.1",
    messages=[{"role": "user", "content": "Write a poem about Python"}],
    temperature=0.95  # Arc automatically fixes this to 0.7
)

With Telemetry Endpoint

from runtime import Arc

# Connect to your Arc Core instance
arc = Arc(endpoint="grpc://arc.computer:50051")

# All subsequent OpenAI calls are protected and telemetry is streamed

Configuration

Arc Runtime can be configured via environment variables or constructor arguments:

from runtime import Arc

# Explicit configuration
arc = Arc(
    endpoint="grpc://arc.computer:50051",
    api_key="arc_key_xxx",
    log_level="DEBUG"
)

Environment variables:

  • ARC_DISABLE=1 - Disable Arc Runtime completely
  • ARC_ENDPOINT - gRPC endpoint for telemetry streaming to Arc Core (default: grpc://localhost:50051)
  • ARC_API_KEY - API key for Arc Core
  • ARC_LOG_LEVEL - Logging level (default: INFO)
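For example, the same configuration expressed as environment variables (the endpoint and key below are placeholder values):

```shell
export ARC_ENDPOINT="grpc://arc.computer:50051"
export ARC_API_KEY="arc_key_xxx"    # placeholder key
export ARC_LOG_LEVEL="DEBUG"
python app.py                       # Arc() reads these at initialization
```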

Metrics

Arc Runtime exposes Prometheus metrics at http://localhost:9090/metrics:

  • arc_requests_intercepted_total - Total requests intercepted
  • arc_fixes_applied_total - Total fixes applied
  • arc_pattern_matches_total - Total pattern matches
  • arc_interception_latency_ms - Interception overhead histogram

Custom Patterns

Register custom patterns and fixes:

arc = Arc()

# Register a pattern
arc.register_pattern(
    pattern={"model": "gpt-4", "temperature": {">": 0.9}},
    fix={"temperature": 0.7}
)
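The `{">": 0.9}` form implies comparison operators in pattern values. A minimal matcher along those lines might look like this; the operator table and function names are assumptions for illustration, not Arc's actual matching code:

```python
import operator

# Hypothetical operator table; only ">" appears in the docs above,
# the rest are assumptions for illustration.
OPS = {">": operator.gt, "<": operator.lt, ">=": operator.ge, "<=": operator.le}

def field_matches(expected, actual) -> bool:
    """Match a literal value, or an operator dict like {">": 0.9}."""
    if isinstance(expected, dict):
        return all(op in OPS and actual is not None and OPS[op](actual, bound)
                   for op, bound in expected.items())
    return expected == actual

def pattern_matches(pattern: dict, request: dict) -> bool:
    """True only if every field in the pattern matches the request."""
    return all(field_matches(v, request.get(k)) for k, v in pattern.items())

req = {"model": "gpt-4", "temperature": 0.95}
print(pattern_matches({"model": "gpt-4", "temperature": {">": 0.9}}, req))  # True
```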

Multi-Agent Pipelines

Track complex multi-agent workflows with automatic context propagation:

from runtime import Arc
import openai

arc = Arc()
client = openai.OpenAI()

# Track a loan underwriting pipeline
with arc.create_multiagent_context(application_id="LOAN-2024-001") as ctx:
    # Loan officer agent
    response1 = client.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": "Analyze loan application"}],
        extra_headers={"X-Agent-Name": "loan_officer"}
    )
    
    # Credit analyst agent
    response2 = client.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": "Review credit history"}],
        extra_headers={"X-Agent-Name": "credit_analyst"}
    )
    
    # Track context handoffs between agents
    ctx.add_context_handoff(
        from_agent="loan_officer",
        to_agent="credit_analyst",
        context={"loan_amount": 250000, "initial_assessment": "positive"}
    )
    
    # Get pipeline summary
    summary = ctx.get_pipeline_summary()
    print(f"Agents executed: {summary['agents_executed']}")
    print(f"Total latency: {summary['total_latency_ms']}ms")

LangGraph Integration

Automatically track LangGraph workflows:

from runtime import ArcStateGraph

# Use ArcStateGraph instead of StateGraph
workflow = ArcStateGraph()

# Nodes are automatically tracked
workflow.add_node("process_application", process_application_fn)
workflow.add_node("verify_documents", verify_documents_fn)

# Compile and run - Arc tracks everything
app = workflow.compile()
result = app.invoke({"application_id": "APP-123"})

Manual Wrapping

If auto-patching fails, you can explicitly wrap clients:

import openai
from runtime import Arc

arc = Arc()
client = openai.OpenAI()
protected_client = arc.wrap(client)

Default Pattern Fixes

Arc Runtime ships with a built-in pattern for preventing high-temperature hallucinations:

  • Pattern: GPT-4.1 with temperature > 0.9
  • Fix: Set temperature to 0.7
  • Rationale: Reduces hallucination risk while maintaining creativity

Testing

# Set your OpenAI API key
export OPENAI_API_KEY="sk-..."

# Run real API tests
python tests/test_real_api.py

Components

  • Interceptors: Provider-specific hooks (OpenAI and MCP today; Anthropic planned)
  • Pattern Registry: Thread-safe pattern storage and matching
  • Multi-Agent Context: Pipeline execution tracking with context handoffs
  • MCP Interceptor: Model Context Protocol monitoring
  • LangGraph Integration: Automatic workflow tracking
  • Telemetry Client: OpenTelemetry-compatible async streaming with agent tracing
  • Metrics Server: Prometheus-compatible metrics endpoint

Performance

Verified performance characteristics:

  • P99 Interception Overhead: 0.011ms (requirement: <5ms)
  • Pattern Matching: <1ms for dictionary lookup
  • Memory Footprint: <50MB base
  • Thread Safety: Full concurrent request support
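A rough way to sanity-check sub-millisecond wrapper overhead on your own hardware; this times a trivial pass-through wrapper, not Arc Runtime itself:

```python
import time
import statistics

def wrapped(call):
    def wrapper(**kw):
        return call(**kw)  # stand-in for the interception path
    return wrapper

noop = lambda **kw: None
protected = wrapped(noop)

# Time many calls and report the 99th-percentile overhead in ms
samples = []
for _ in range(10_000):
    t0 = time.perf_counter()
    protected(x=1)
    samples.append((time.perf_counter() - t0) * 1000)

p99 = statistics.quantiles(samples, n=100)[98]  # 99th-percentile cut point
print(f"P99 wrapper overhead: {p99:.4f} ms")
```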

Troubleshooting

Arc Runtime is not intercepting calls

  1. Ensure Arc is imported before the LLM library:

    from runtime import Arc  # Import Arc first
    Arc()
    import openai  # Then import OpenAI
    
  2. Check if Arc is disabled:

    echo $ARC_DISABLE  # Should be empty or "0"
    
  3. Enable debug logging:

    export ARC_LOG_LEVEL=DEBUG
    

Telemetry not streaming

  1. Check endpoint connectivity:

    telnet your-arc-endpoint 50051
    
  2. Verify gRPC is installed:

    pip install grpcio
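If telnet is unavailable, a small Python check does the same job (the host and port below are placeholders for your Arc Core endpoint):

```python
import socket

def check_port(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False

if __name__ == "__main__":
    # Replace with your Arc Core endpoint host and port
    print(check_port("localhost", 50051))
```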
    

License

MIT License - see LICENSE for details.



Download files

Download the file for your platform.

Source Distribution

arc_runtime-0.1.4.tar.gz (52.9 kB)


Built Distribution


arc_runtime-0.1.4-py3-none-any.whl (37.0 kB)


File details

Details for the file arc_runtime-0.1.4.tar.gz.

File metadata

  • Download URL: arc_runtime-0.1.4.tar.gz
  • Size: 52.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for arc_runtime-0.1.4.tar.gz
Algorithm Hash digest
SHA256 4d7f4cda08a43fb6eae1f88bfc71d0e4b2e19a0aedce096f816daf01d0b9003e
MD5 437438438ddca1bc799a087db9b4d7ac
BLAKE2b-256 2e044e4de1797da6885b10a32d28fc7679e3fe71dd90b0aadfe882da6355d053


File details

Details for the file arc_runtime-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: arc_runtime-0.1.4-py3-none-any.whl
  • Size: 37.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for arc_runtime-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 de1195f1dc6aeb38b64f638d17aef5958ed05a4873b3ead385a52a362f6a7261
MD5 8f236928235a0fffc90ddd63d9bd1d06
BLAKE2b-256 58437726e921d32fa19ad90064e41d656baac5145e137b7271b52557e0d43773

