Production observability and FinOps for Forge — OpenTelemetry tracing, cost tracking, and real-time metrics.

These details have not been verified by PyPI

Project links

Project description

forge-observe

Observability and FinOps for Forge.

OpenTelemetry tracing, event-driven cost tracking, real-time metrics, console output, and a FastAPI backend for monitoring multi-agent systems in production.

Why `forge-observe`

Agent systems are hard to operate when you cannot answer simple questions fast:

Which agent is spending the money?
Which model is driving cost?
Where are the failures happening?
Are we seeing real telemetry or just framework-level guesses?
Can we expose live run data to dashboards and internal tooling?

forge-observe is the package in Forge that turns runtime signals into something you can inspect, aggregate, export, and act on. It sits on top of forge-core and treats events as the source of truth for costs, spans, and operational visibility.

What ships in this package

ForgeLLMInterceptor for canonical LLM call instrumentation
ForgeTracer for OpenTelemetry wiring and event-to-span mirroring
MetricsCollector for aggregated session, model, and agent metrics
DefaultCostModel for pricing-aware FinOps calculations
Rich console output helpers for CLI-facing run presentation
FastAPI REST + SSE backend for recent runs, metrics, evolution entries, and memory endpoints
SDK instrumentation helpers for direct Anthropic, OpenAI, and Google GenAI usage
Label sanitization to control metric-cardinality explosions

Installation

Inside the Forge workspace:

uv sync

Standalone:

pip install forge-observe

Requirements:

Python 3.11+
forge-core==0.1.0
opentelemetry-api
opentelemetry-sdk
opentelemetry-exporter-otlp
fastapi
uvicorn
rich

Quickstart

The most important runtime object here is the interceptor. It converts raw model-call telemetry into populated RunEvent objects with costs and tracing metadata attached.

from forge_core.events import EventBus
from forge_observe.interceptor import ForgeLLMInterceptor

bus = EventBus()
interceptor = ForgeLLMInterceptor(bus=bus)

In a full Forge stack, MetaOrchestrator owns one interceptor and hands it to adapters so all frameworks feed the same event stream.

The core model

forge-observe is built around one idea:

raw LLM/tool activity -> RunEvent -> metrics / traces / API / console

Instead of each adapter inventing its own accounting path, this package centralizes observability around Forge's shared event model.

That gives you:

one place to compute cost
one place to publish telemetry
one place to wire tracing
one place to expose aggregated state

`ForgeLLMInterceptor`

ForgeLLMInterceptor is the canonical implementation of the LLMCallInterceptor protocol from forge-core.

Responsibilities:

open and close OpenTelemetry spans around model calls
compute cost using DefaultCostModel
publish LLM_CALL and ERROR events to the EventBus
preserve per-call metadata such as model, agent, tool, latency, and token counts
protect degraded providers through a circuit breaker

This is the linchpin that keeps FinOps honest. If instrumentation bypasses the interceptor, costs stop being trustworthy.

Breaker behavior

The interceptor can wire an LLM circuit breaker automatically. When repeated failures accumulate, new starts are short-circuited and an ERROR event is emitted instead of letting dead dependencies flood the runtime with timeouts.

Environment flag:

FORGE_LLM_CIRCUIT_BREAKER=0
- disables the default breaker wiring

`ForgeTracer`

ForgeTracer configures OpenTelemetry export and can subscribe directly to the Forge event bus.

It serves two roles:

ensure a tracer provider exists and exporters are configured
mirror bus events into spans so traces are visible even when the interceptor is not the only event source

Convenience helper:

from forge_observe.tracer import attach_tracer_to_orchestrator

tracer, handle = attach_tracer_to_orchestrator(orchestrator)

After attachment, runs executed by the orchestrator automatically produce OTel spans.

`MetricsCollector`

MetricsCollector aggregates run outcomes into session-level operational metrics.

Tracked dimensions include:

total runs
successful vs failed runs
total cost
total input and output tokens
total duration
per-model cost
per-agent LLM calls, cost, latency, tool calls, and errors

It also includes a cost-savings estimator that compares current spending against cheaper model substitutions.

Example:

from forge_observe.metrics import MetricsCollector

collector = MetricsCollector()
collector.record_run(result)

print(collector.session.total_cost)
print(collector.summary_dict())

Pricing and FinOps

DefaultCostModel provides pricing-aware cost calculation for a registry of known models.

Notable behavior:

exact and prefix model matching
custom pricing overrides
warning-once behavior for unknown model names
deterministic zero-cost fallback for unknown models instead of silent guessing

This matters because downstream evaluation and optimization logic depends on cost accuracy.

Rich console output

The console exporter is designed for high-signal CLI output and demos.

It includes helpers such as:

print_forge_banner()
print_run_result()
print_evolution_proposal()
live_progress()

These render:

run status
cost breakdown
topology tree
mutations and proposals
progress feedback for long operations

API and live backend

forge-observe ships a FastAPI application that exposes recent runtime state over REST and SSE.

Key endpoints:

GET /api/v1/health
GET /api/v1/runs
GET /api/v1/runs/{run_id}
GET /api/v1/runs/{run_id}/events
POST /api/v1/runs
GET /api/v1/evolution
POST /api/v1/evolution
GET /api/v1/evolution/stats
GET /api/v1/metrics
GET /api/v1/stream
GET /api/v1/memory/query
GET /api/v1/memory/graph

The API keeps an in-memory store for development and lightweight deployments, and it can be fronted by a standalone dashboard or internal integration.

SSE support

The /api/v1/stream endpoint exposes a live event stream over Server-Sent Events with:

bounded per-client queues
subscriber-cap enforcement
heartbeat pings
cleanup on disconnect

This makes it suitable for lightweight real-time dashboards without needing a separate event broker.

CORS

Browser clients on different origins can be enabled through:

FORGE_API_ALLOWED_ORIGINS

Example:

FORGE_API_ALLOWED_ORIGINS="http://localhost:5173,https://example.com"

Dashboard backend

DashboardBackend is a convenience wrapper that runs the API server in-process and can attach itself to a MetaOrchestrator.

What it does:

starts a background uvicorn server
exposes the server URL and docs URL
patches orchestrator run() calls so completed runs are posted into the API backend

This is useful for local demos and simple operator setups.

Direct SDK instrumentation

Not every user runs through a framework adapter. Some flows call SDKs directly.

forge_observe.llm_clients.instrument_llm_clients(...) patches imported SDK entrypoints for the duration of a context manager so direct model calls still emit Forge telemetry.

Supported SDK families in the current implementation:

Anthropic
OpenAI
Google Generative AI

Example:

from forge_observe.llm_clients import instrument_llm_clients

with instrument_llm_clients(interceptor, agent_id="writer"):
    ...

Design intent:

only patch SDKs already imported by the user
revert patches cleanly on exit
preserve telemetry for sync and async call paths
avoid breaking user execution if instrumentation fails

Label sanitization

Metrics backends are vulnerable to high-cardinality labels. LabelSanitizer exists to keep observability from turning into a cardinality bomb.

It enforces:

label allowlists
per-key unique-value caps
deterministic hashing for over-cap values
normalization of empty and missing values

This is especially important for fields like agent_id, where unbounded variation can destroy Prometheus-style backends.

Package boundaries

forge-observe focuses on observability and runtime reporting.

Related packages:

forge-core
- run model, event bus, protocols, orchestrator, guards
forge-memory
- memory storage and retrieval
forge-adapters
- framework integrations
forge-cli
- CLI entrypoints and user-facing commands

That separation keeps forge-observe reusable in monitoring-oriented deployments.

Public API at a glance

Top-level exports from forge_observe:

DefaultCostModel
ForgeTracer
MetricsCollector

Primary modules:

forge_observe.interceptor
forge_observe.tracer
forge_observe.metrics
forge_observe.cost_model
forge_observe.llm_clients
forge_observe.labels
forge_observe.dashboard_backend
forge_observe.exporters.api
forge_observe.exporters.console
forge_observe.exporters.otlp

Recommended usage patterns

Attach tracing to an orchestrator

from forge_observe.tracer import attach_tracer_to_orchestrator

tracer, subscription = attach_tracer_to_orchestrator(
    orchestrator,
    enable_console=True,
    otlp_endpoint=None,
)

Aggregate metrics from completed runs

from forge_observe.metrics import MetricsCollector

collector = MetricsCollector()
collector.record_run(result)
summary = collector.summary_dict()

Start the dashboard backend locally

from forge_observe.dashboard_backend import DashboardBackend

backend = DashboardBackend(host="127.0.0.1", port=8787)
backend.start_background()
backend.attach_orchestrator(orchestrator)

Testing

From the repository root:

pytest packages/forge-observe/tests -q

The test suite covers:

REST and SSE API behavior
memory API integration points
event fan-out for live subscribers
metrics aggregation from real Forge test harness runs
interceptor circuit-breaker behavior
label-sanitization guarantees
SDK monkey-patch instrumentation and restoration

License

Apache-2.0

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2.0

May 15, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

forge_os_observe-0.2.0.tar.gz (45.8 kB view details)

Uploaded May 19, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

forge_os_observe-0.2.0-py3-none-any.whl (39.7 kB view details)

Uploaded May 15, 2026 Python 3

File details

Details for the file forge_os_observe-0.2.0.tar.gz.

File metadata

Download URL: forge_os_observe-0.2.0.tar.gz
Upload date: May 19, 2026
Size: 45.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for forge_os_observe-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`87ad5b0a1b6abe56013b6a467f194bb095fd680af2da9383179879863cb6f75d`
MD5	`2f0f2d2f7c73f3d7855579e5e77dc925`
BLAKE2b-256	`9c48019d9b2c6520a3fba27c2f2e5c71d8c320e10c714c11a71675cb76675192`

See more details on using hashes here.

File details

Details for the file forge_os_observe-0.2.0-py3-none-any.whl.

File metadata

Download URL: forge_os_observe-0.2.0-py3-none-any.whl
Upload date: May 15, 2026
Size: 39.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for forge_os_observe-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`26367ded16c7078d709e1f197dd1511654ae2b5fdb21bc44934c217cd5f2bd85`
MD5	`267e70059c0fe9b9152ded37227f5ddb`
BLAKE2b-256	`e6da3f1beee4a55aa5e960f302cc075dc1dd3756d6f40600a9cda93cedd6fa34`

See more details on using hashes here.

forge-os-observe 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

forge-observe

Why forge-observe

What ships in this package

Installation

Quickstart

The core model

ForgeLLMInterceptor

Breaker behavior

ForgeTracer

MetricsCollector

Pricing and FinOps

Rich console output

API and live backend

SSE support

CORS

Dashboard backend

Direct SDK instrumentation

Label sanitization

Package boundaries

Public API at a glance

Recommended usage patterns

Attach tracing to an orchestrator

Aggregate metrics from completed runs

Start the dashboard backend locally

Testing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Why `forge-observe`

`ForgeLLMInterceptor`

`ForgeTracer`

`MetricsCollector`