LLM-oriented observability SDK built on OpenTelemetry with cost/usage tracking

Project description

yuutrace

LLM-oriented observability SDK built on OpenTelemetry. Provides structured tracing for LLM agent workloads with first-class cost and token usage tracking.

What's in the box

Deliverable	Registry	Description
`yuutrace`	PyPI	Python SDK for instrumentation + CLI (`ytrace server` / `ytrace ui`)
`@yuutrace/ui`	npm	React component library for trace visualization

your-agent (Python)
  │  import yuutrace
  │
  ▼
ytrace server ──OTLP/HTTP JSON──▶ SQLite
  │
  ▼
ytrace ui ──REST API──▶ Browser (@yuutrace/ui)

Installation

# Python SDK (includes CLI tools)
pip install yuutrace

# React components (for embedding in your own dashboard)
npm install @yuutrace/ui

Quick Start

1. Start the Trace Collector

ytrace server --db ./traces.db --port 4318

2. Initialize Tracing

import yuutrace as ytrace

ytrace.init(service_name="my-agent")

If you already configure OpenTelemetry elsewhere, yuutrace reuses the existing TracerProvider and init() becomes a no-op.

3. Instrument Your Agent

Below is a minimal but complete example covering the core workflow: conversation → LLM generation → tool execution.

import yuutrace as ytrace
from uuid import uuid4

ytrace.init(service_name="my-agent")

async def agent_turn(user_msg: str):
    with ytrace.conversation(
        id=uuid4(),            # UUID – unique conversation identifier
        agent="my-agent",      # str  – agent name
        model="gpt-4o",        # str  – primary model
        tags={"env": "prod"},  # dict[str, str] | None – filtering tags
    ) as chat:

        # Record context
        chat.system(persona="You are helpful.", tools=tool_specs)
        chat.user(user_msg)

        # ── LLM generation ──────────────────────────────────────
        with chat.llm_gen() as gen:
            response = await llm.call(messages)

            # Log response items for UI inspection
            gen.log(response.choices[0].message.content)

            # Record token usage (keyword args)
            ytrace.record_llm_usage(
                provider="openai",
                model="gpt-4o",
                input_tokens=150,
                output_tokens=42,
                cache_read_tokens=80,
            )

            # Record cost
            ytrace.record_cost(
                category="llm",        # "llm" | "tool"
                currency="USD",        # "USD"
                amount=0.0023,
                llm_provider="openai",
                llm_model="gpt-4o",
            )

        # ── Tool execution ──────────────────────────────────────
        with chat.tools() as t:
            results = await t.gather([
                {
                    "tool_call_id": "call_1",   # str  – unique call ID
                    "tool": search_fn,           # Callable – sync or async
                    "params": {"q": "BTC"},      # dict – keyword args
                },
            ])
            # results: list[ToolResult]
            # ToolResult.tool_call_id: str
            # ToolResult.output: Any
            # ToolResult.error: str | None

4. View Traces

ytrace ui --db ./traces.db --port 8080
# Open http://localhost:8080

`gen.log()` — Logging LLM Response Items

gen.log(items) attaches the LLM response to the current llm_gen span so you can inspect it in the web UI.

Signature

LlmGenContext.log(items: list[Any]) -> None

UI-recognised item types

The web UI renders two item shapes. Items with other type values are stored but not rendered.

`type`	Required fields	UI rendering
`"text"`	`text: str`	Text block with pre-wrap whitespace
`"tool_calls"`	`tool_calls: [{"function": str, "arguments": Any}, ...]`	Tool call list with function name and arguments

Serialization

Each element in the list is auto-serialized to JSON before storage:

Input type	Serialization method
`dict`, `list`, `str`, `int`, `float`, `bool`, `None`	Pass-through
`msgspec.Struct`	`msgspec.to_builtins()`
Pydantic `BaseModel`	`.model_dump()`
`dataclass`	`vars()` (private attrs stripped)
Other objects	`str()` fallback

Best Practice

# Pattern 1: Log text response (most common)
with chat.llm_gen() as gen:
    response = await client.chat.completions.create(...)
    message = response.choices[0].message
    gen.log([
        {"type": "text", "text": message.content},
    ])

# Pattern 2: Log text + tool-call decisions
with chat.llm_gen() as gen:
    response = await client.chat.completions.create(...)
    message = response.choices[0].message
    gen.log([
        {"type": "text", "text": message.content or ""},
        {"type": "tool_calls", "tool_calls": [
            {"function": tc.function.name,
             "arguments": tc.function.arguments}
            for tc in (message.tool_calls or [])
        ]},
    ])

# Pattern 3: Log msgspec / Pydantic models directly
# (stored as JSON, but only rendered if the serialized dict
#  matches one of the two shapes above)
with chat.llm_gen() as gen:
    gen.log([my_msgspec_struct, my_pydantic_model])

When to call

Call gen.log() once per llm_gen() block, after you have the LLM response. Calling it multiple times overwrites the previous value (it sets a span attribute, not an event).

Key Concepts

Span Hierarchy

Every instrumented conversation produces a tree of OpenTelemetry spans:

conversation (root)
  ├── llm_gen          # one LLM request
  ├── tools            # a batch of tool calls
  │     ├── tool:search
  │     └── tool:calc
  ├── llm_gen
  └── ...

The root conversation span carries metadata (conversation.id, agent, model, tags). Child spans are created automatically by the context managers.

Delta Semantics

All cost and usage data is recorded as increments (deltas). A single span can emit multiple cost/usage events. Aggregation happens at query time, not write time. This keeps the write path simple and concurrent-safe.

Event Types

Event Name	Purpose	Key Attributes
`yuu.cost`	Cost increment	`category`, `currency`, `amount`, `llm.model`, `tool.name`
`yuu.llm.usage`	Token usage	`provider`, `model`, `input_tokens`, `output_tokens`, `cache_read_tokens`
`yuu.tool.usage`	Tool usage (optional)	`name`, `unit`, `quantity`

Business code never writes these event names or attribute keys directly — the SDK wraps them in type-safe functions.

Fast Fail

current_span() raises NoActiveSpanError if called outside a span context. No implicit span creation, no silent data loss.

Python SDK API Reference

Initialization

ytrace.init(
    *,
    endpoint: str = "http://localhost:4318/v1/traces",
    service_name: str = "yuutrace",
    service_version: str | None = None,
    timeout_seconds: float = 10.0,
) -> None

No-op if OpenTelemetry is already configured. Registers atexit shutdown hook.

Context Managers

`conversation()`

ytrace.conversation(
    *,
    id: UUID,                            # unique conversation ID
    agent: str,                          # agent name
    model: str,                          # primary LLM model
    tags: dict[str, str] | None = None,  # filtering/grouping tags
) -> Iterator[ConversationContext]

Root span. All recording functions must be called inside this (or a child) context.

`ConversationContext`

Method	Signature	Description
`system`	`(persona: str, tools: list[Any] \| None = None) -> None`	Record system prompt and tool specs
`user`	`(content: str) -> None`	Record user message
`llm_gen`	`() -> Iterator[LlmGenContext]`	Open child span for an LLM call
`tools`	`() -> Iterator[ToolsContext]`	Open child span for a tool batch

`LlmGenContext`

Method	Signature	Description
`log`	`(items: list[Any]) -> None`	Attach LLM response items (auto-serialized to JSON)

`ToolsContext`

Method	Signature	Description
`gather`	`(calls: list[dict[str, Any]]) -> list[ToolResult]`	Execute tools concurrently

Each call dict: {"tool_call_id": str, "tool": Callable, "params": dict, "name": str (optional)}.

`ToolResult`

class ToolResult(msgspec.Struct, frozen=True):
    tool_call_id: str
    output: Any
    error: str | None = None

Recording Functions

`record_llm_usage()`

Accepts either a pre-built struct or keyword arguments:

# Keyword args (most common)
ytrace.record_llm_usage(
    provider: str,                       # e.g. "openai", "anthropic"
    model: str,                          # e.g. "gpt-4o", "claude-sonnet-4-20250514"
    request_id: str | None = None,
    input_tokens: int = 0,
    output_tokens: int = 0,
    cache_read_tokens: int = 0,
    cache_write_tokens: int = 0,
    total_tokens: int | None = None,     # auto-computed if None
)

# Or pass a struct
ytrace.record_llm_usage(LlmUsageDelta(...))

`record_cost()` / `record_cost_delta()`

ytrace.record_cost(
    category: str,       # "llm" | "tool"
    currency: str,       # "USD"
    amount: float,       # incremental cost
    # LLM-specific (when category="llm")
    llm_provider: str | None = None,
    llm_model: str | None = None,
    llm_request_id: str | None = None,
    # Tool-specific (when category="tool")
    tool_name: str | None = None,
    tool_call_id: str | None = None,
    # General
    source: str | None = None,
    pricing_id: str | None = None,
)

# Or pass a struct
ytrace.record_cost_delta(CostDelta(...))

`record_tool_usage()`

ytrace.record_tool_usage(
    ToolUsageDelta(
        name="get_weather",     # tool name
        unit="api_calls",       # unit of measurement
        quantity=1.0,           # amount
        call_id="call_1",       # optional correlation ID
    )
)

Types

All types are frozen msgspec.Struct instances (immutable, fast serialization).

Type	Required Fields	Optional Fields
`CostDelta`	`category`, `currency`, `amount`	`source`, `pricing_id`, `llm_provider`, `llm_model`, `llm_request_id`, `tool_name`, `tool_call_id`
`LlmUsageDelta`	`provider`, `model`	`request_id`, `input_tokens`, `output_tokens`, `cache_read_tokens`, `cache_write_tokens`, `total_tokens`
`ToolUsageDelta`	`name`, `unit`, `quantity`	`call_id`

Enums:

CostCategory — "llm" | "tool"
Currency — "USD"

Low-level

Function	Signature	Description
`current_span()`	`-> Span`	Return active OTEL span; raises `NoActiveSpanError` if none
`add_event()`	`(name: str, attributes: dict) -> None`	Add event to current span (prefer typed wrappers above)

Errors

Error	When
`TracingNotInitializedError`	`conversation()` called before `init()` or external OTEL setup
`NoActiveSpanError`	Recording function called outside any span context

CLI Reference

`ytrace server`

Receives OTLP/HTTP traces (JSON or Protobuf) and stores them to SQLite.

ytrace server --db ./traces.db --port 4318 --host 127.0.0.1

Option	Default	Description
`--db`	`./traces.db`	SQLite database file path
`--port`	`4318`	HTTP server port
`--host`	`127.0.0.1`	Bind address

`ytrace ui`

Serves the trace visualization web UI with REST API.

ytrace ui --db ./traces.db --port 8080 --host 127.0.0.1

Option	Default	Description
`--db`	`./traces.db`	SQLite database file path
`--port`	`8080`	HTTP server port
`--host`	`127.0.0.1`	Bind address

REST API endpoints:

Method	Path	Description
GET	`/api/health`	Health check
GET	`/api/conversations`	List conversations (`?limit=50&offset=0&agent=...`)
GET	`/api/conversations/{id}`	Single conversation with all spans and events
GET	`/api/spans/{id}`	Single span detail

React Component Library

@yuutrace/ui exports pure presentation components. Data is injected via props — no built-in data fetching, no framework lock-in.

import {
  ConversationList,
  ConversationFlow,
  CostSummary,
  UsageSummary,
  SpanTimeline,
  parseConversation,
} from "@yuutrace/ui";

function MyDashboard({ conversation }) {
  const { costs, usages } = parseConversation(conversation.spans);

  return (
    <>
      <SpanTimeline spans={conversation.spans} />
      <ConversationFlow spans={conversation.spans} />
      <CostSummary costs={costs} />
      <UsageSummary usages={usages} />
    </>
  );
}

Components

Component	Props	Description
`ConversationList`	`conversations`, `selectedId?`, `onSelect?`	Searchable conversation list
`ConversationFlow`	`spans`	Waterfall of LLM/tool cards
`LlmCard`	`span`, `usage?`, `cost?`	LLM call detail card
`ToolCard`	`span`, `usage?`, `cost?`	Tool call detail card
`CostSummary`	`costs`	Cost breakdown by category/model
`UsageSummary`	`usages`	Token usage by model
`SpanTimeline`	`spans`	Horizontal Gantt chart

Utilities

parseConversation(spans) — extract typed cost/usage events from raw spans
extractCostEvents(span) — cost events from a single span
extractLlmUsageEvents(span) — LLM usage from a single span
extractToolUsageEvents(span) — tool usage from a single span

Examples

See examples/ for complete working examples:

weather_agent.py — Multi-turn agent with LLM calls, tool execution, cost tracking, and error handling

# Terminal 1: Start collector
ytrace server --db ./traces.db --port 4318

# Terminal 2: Run example
python examples/weather_agent.py

# Terminal 3: Start UI
ytrace ui --db ./traces.db --port 8080
# Open http://localhost:8080

Development

Prerequisites

Python >= 3.12
Node.js >= 20
uv (Python package manager)

Setup

# Python
uv sync

# React UI
cd ui && npm install

Build the UI

# Build standalone app + copy to _static/ for ytrace ui
bash scripts/build_ui.sh

# Or build separately:
cd ui
npm run build:app    # standalone page → dist/app/
npm run build:lib    # npm library → dist/lib/

Project Structure

yuutrace/
├── src/yuutrace/
│   ├── __init__.py          # public API
│   ├── types.py             # CostDelta, LlmUsageDelta, ToolUsageDelta
│   ├── context.py           # conversation(), llm_gen(), tools()
│   ├── cost.py              # record_cost(), record_cost_delta()
│   ├── usage.py             # record_llm_usage(), record_tool_usage()
│   ├── span.py              # current_span(), add_event()
│   ├── otel.py              # OTEL attribute keys + serialization
│   └── cli/
│       ├── main.py          # ytrace CLI entry point
│       ├── server.py        # OTLP collector (Starlette)
│       ├── ui.py            # REST API + static serving (Starlette)
│       ├── db.py            # SQLite persistence
│       └── _static/         # pre-built UI assets
├── ui/                      # @yuutrace/ui React package
│   ├── src/
│   │   ├── components/      # ConversationList, LlmCard, etc.
│   │   ├── hooks/           # useTraceData (standalone only)
│   │   ├── pages/           # TracePage
│   │   ├── utils/           # parse.ts
│   │   ├── types.ts
│   │   └── index.ts         # library exports
│   ├── vite.config.ts       # app build
│   └── vite.config.lib.ts   # library build
├── examples/                # Example applications
│   ├── weather_agent.py     # Multi-turn agent example
│   └── README.md            # Example documentation
├── scripts/
│   └── build_ui.sh
└── pyproject.toml

License

MIT

Project details

Release history Release notifications | RSS feed

0.3.0

Apr 24, 2026

0.2.0

Mar 31, 2026

0.1.11

Feb 17, 2026

0.1.10

Feb 17, 2026

This version

0.1.9

Feb 15, 2026

0.1.8

Feb 12, 2026

0.1.6

Feb 11, 2026

0.1.5

Feb 10, 2026

0.1.4

Feb 9, 2026

0.1.3

Feb 9, 2026

0.1.0

Feb 8, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

yuutrace-0.1.9.tar.gz (88.3 kB view details)

Uploaded Feb 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

yuutrace-0.1.9-py3-none-any.whl (94.5 kB view details)

Uploaded Feb 15, 2026 Python 3

File details

Details for the file yuutrace-0.1.9.tar.gz.

File metadata

Download URL: yuutrace-0.1.9.tar.gz
Upload date: Feb 15, 2026
Size: 88.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for yuutrace-0.1.9.tar.gz
Algorithm	Hash digest
SHA256	`7d83f812235e51df75e97b7b562a671ba944a1c93c437106d610ff5b92d9b4e2`
MD5	`2912f6a2fbd7fcd646c97f2f4697ef1b`
BLAKE2b-256	`640d5abdb8f250271d8ca6ac4e23e9c86849de2145dbe9f8071f115abb5c48df`

See more details on using hashes here.

Provenance

The following attestation bundles were made for yuutrace-0.1.9.tar.gz:

Publisher: publish-pypi.yml on yuulabs/yuutrace

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: yuutrace-0.1.9.tar.gz
- Subject digest: 7d83f812235e51df75e97b7b562a671ba944a1c93c437106d610ff5b92d9b4e2
- Sigstore transparency entry: 953538456
- Sigstore integration time: Feb 15, 2026
Source repository:
- Permalink: yuulabs/yuutrace@c1a3b7a83300f34c30e11684d1ed231db05235a8
- Branch / Tag: refs/tags/v0.1.9
- Owner: https://github.com/yuulabs
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@c1a3b7a83300f34c30e11684d1ed231db05235a8
- Trigger Event: push

File details

Details for the file yuutrace-0.1.9-py3-none-any.whl.

File metadata

Download URL: yuutrace-0.1.9-py3-none-any.whl
Upload date: Feb 15, 2026
Size: 94.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for yuutrace-0.1.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e8b8428b143516816285cdd2acab40d0fbe3709c94aed2895b4bbff7f7700639`
MD5	`b5b90718f50375183fc626a190f3ae01`
BLAKE2b-256	`117e10330ae9c384fd0ba5d5f38789a284921bdc5655843412ddf41131753a03`

See more details on using hashes here.

Provenance

The following attestation bundles were made for yuutrace-0.1.9-py3-none-any.whl:

Publisher: publish-pypi.yml on yuulabs/yuutrace

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: yuutrace-0.1.9-py3-none-any.whl
- Subject digest: e8b8428b143516816285cdd2acab40d0fbe3709c94aed2895b4bbff7f7700639
- Sigstore transparency entry: 953538457
- Sigstore integration time: Feb 15, 2026
Source repository:
- Permalink: yuulabs/yuutrace@c1a3b7a83300f34c30e11684d1ed231db05235a8
- Branch / Tag: refs/tags/v0.1.9
- Owner: https://github.com/yuulabs
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@c1a3b7a83300f34c30e11684d1ed231db05235a8
- Trigger Event: push

yuutrace 0.1.9

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

yuutrace

What's in the box

Installation

Quick Start

1. Start the Trace Collector

2. Initialize Tracing

3. Instrument Your Agent

4. View Traces

gen.log() — Logging LLM Response Items

Signature

UI-recognised item types

Serialization

Best Practice

When to call

Key Concepts

Span Hierarchy

Delta Semantics

Event Types

Fast Fail

Python SDK API Reference

Initialization

Context Managers

conversation()

ConversationContext

LlmGenContext

ToolsContext

ToolResult

Recording Functions

record_llm_usage()

record_cost() / record_cost_delta()

record_tool_usage()

Types

Low-level

Errors

CLI Reference

ytrace server

ytrace ui

React Component Library

Components

Utilities

Examples

Development

Prerequisites

Setup

Build the UI

Project Structure

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

`gen.log()` — Logging LLM Response Items

`conversation()`

`ConversationContext`

`LlmGenContext`

`ToolsContext`

`ToolResult`

`record_llm_usage()`

`record_cost()` / `record_cost_delta()`

`record_tool_usage()`

`ytrace server`

`ytrace ui`