o11y-reflect

See why your AI coding agents fail, stall, or burn budget — local-first telemetry for Claude, Copilot, Gemini, and Cursor

These details have not been verified by PyPI

Project links

Project description

░▒▓███████▓▒░░▒▓████████▓▒░▒▓████████▓▒░▒▓█▓▒░      ░▒▓████████▓▒░▒▓██████▓▒░▒▓████████▓▒░
░▒▓█▓▒░░▒▓█▓▒░▒▓█▓▒░      ░▒▓█▓▒░      ░▒▓█▓▒░      ░▒▓█▓▒░     ░▒▓█▓▒░░▒▓█▓▒░ ░▒▓█▓▒░
░▒▓███████▓▒░░▒▓██████▓▒░ ░▒▓██████▓▒░ ░▒▓█▓▒░      ░▒▓██████▓▒░░▒▓█▓▒░        ░▒▓█▓▒░
░▒▓█▓▒░░▒▓█▓▒░▒▓█▓▒░      ░▒▓█▓▒░      ░▒▓█▓▒░      ░▒▓█▓▒░     ░▒▓█▓▒░░▒▓█▓▒░ ░▒▓█▓▒░
░▒▓█▓▒░░▒▓█▓▒░▒▓████████▓▒░▒▓█▓▒░      ░▒▓████████▓▒░▒▓████████▓▒░▒▓██████▓▒░  ░▒▓█▓▒░

Your AI agents are doing things you can't see. reflect shows you.

Local-first telemetry for Claude Code, GitHub Copilot, Gemini CLI, and Cursor — token spend, tool failure rates, latency, and what's actually burning your budget. No cloud. No account. Runs on your machine.

$ reflect --demo

─────────── AI Usage Dashboard  All time  (2026-03-16 → 2026-03-23) ────────────

╭────────────────────────────────── Insights ──────────────────────────────────╮
│ ✓ Good prompt-to-action ratio — 4.2 tool calls per prompt, showing           │
│   effective task delegation.                                                 │
│ ✓ Effective subagent delegation — 1 Task subagent, keeping main context      │
│   focused.                                                                   │
│ ⚠ 7 tool failures (20.6% of tool calls). Path and schema validation up       │
│   front can reduce iteration cost.                                           │
│ ⚠ Top session consumed 42% of all tokens — context blowout pattern.          │
│ → Use a fixed prompt contract: Goal, Context, Constraints, Output, Done-when │
│ → Pin relevant files in the first prompt to reduce exploratory tool churn.   │
╰──────────────────────────────────────────────────────────────────────────────╯

╭── Quality Score ──╮ ╭─── Sessions ────╮ ╭── Active Days ──╮
│       75.0%       │ │        8        │ │        8        │
╰───────────────────╯ ╰─────────────────╯ ╰─────────────────╯
╭───── Prompts ─────╮ ╭── Tool/Prompt ──╮ ╭─── Failure % ───╮
│         8         │ │      4.2:1      │ │      20.6%      │
╰───────────────────╯ ╰─────────────────╯ ╰─────────────────╯

╭────────────────────────────── Agent Comparison ──────────────────────────────╮
│                                                        Top    In    Out  Fail │
│   Agent     Sess  Events  Quality      Top Model       Tool  Tok    Tok     % │
│  ──────────────────────────────────────────────────────────────────────────  │
│   claude       4      46  ████░ High   sonnet-4-5      Read  275K  44.5K  16% │
│   copilot      2      20  ████░ High   gpt-4o          Read   33K   6.3K  12% │
│   cursor       1      11  █░░░░ Low    —               Write  95K   8.0K  60% │
│   gemini       1       8  ████░ High   gemini-2.0-fla… Read   12K   2.5K   0% │
╰──────────────────────────────────────────────────────────────────────────────╯

╭───────────────────────────── Sessions (8 total) ─────────────────────────────╮
│   Session                    Agent     Started (UTC)      Score   In Tok      │
│  ──────────────────────────────────────────────────────────────────────────  │
│   implement the entire da…   claude    2026-03-16 20:10      60   180.0K      │
│   migrate the users table…   cursor    2026-03-20 17:25      20    95.0K      │
│   investigate the memory …   claude    2026-03-22 14:55      80    45.0K      │
│   refactor the auth modul…   claude    2026-03-23 10:10      90    28.0K      │
│   add cursor-based pagina…   copilot   2026-03-21 10:40      80    18.0K      │
│   fix the token expiry bu…   copilot   2026-03-17 09:40      90    15.0K      │
│   review PR #142 for secu…   gemini    2026-03-18 16:03      90    12.0K      │
╰──────────────────────────────────────────────────────────────────────────────╯

─────────────────────────────── reflect.o11y.dev ───────────────────────────────

Run this yourself: pipx install o11y-reflect && reflect --demo

Requirements

Python 3.11+
pipx (recommended) or pip

Quickstart

pipx install o11y-reflect
reflect setup
# use your AI tool normally for a bit, then:
reflect

reflect setup modifies config files for the agents with implemented telemetry wiring today (Claude Code, GitHub Copilot, Gemini CLI, and Codex native OTel) and starts writing spans to ~/.reflect/state/. reflect then reads those spans and renders an interactive terminal dashboard.

No telemetry yet? Try the demo:

reflect --demo

What people actually find

Running reflect for the first time is usually surprising:

One session consumed 30–40% of your total tokens (almost always a context blowout, not useful work)
Your tool failure rate is higher than you thought — Bash failures often go unnoticed because the agent silently retries
Cache hit rate varies dramatically by agent; switching prompt style can cut costs 30–50%
If you use multiple agents, one is almost always measurably more efficient than the others for the same class of task

How it works

reflect takes care of instrumentation and session data collection for the integrations that are implemented today. AI coding agents expose telemetry in two ways, and reflect setup uses whichever the verified integration supports:

Hooks (Claude Code today) — scripts that fire at key lifecycle moments (session start, tool call, prompt, stop). reflect setup installs a small opentelemetry-hooks instrumentation layer into the agent's config file where that path is verified.
Native OpenTelemetry (Claude Code, GitHub Copilot, Gemini CLI, OpenAI Codex CLI) — the agent has built-in OTLP export that just needs to be pointed at the local collector. reflect setup writes the relevant settings for each:
- Claude Code: env block in ~/.claude/settings.json (metrics + logs only, not traces)
- GitHub Copilot VS Code: github.copilot.chat.otel.* keys in VS Code settings.json
- GitHub Copilot CLI: COPILOT_OTEL_ENABLED / COPILOT_OTEL_OTLP_ENDPOINT env vars
- Gemini CLI: telemetry.* keys in ~/.gemini/settings.json (e.g. telemetry.enabled, telemetry.otlpEndpoint)
- OpenAI Codex CLI: [otel] section in ~/.codex/config.toml (interactive mode only)

Either way, every tool call, token usage event, and session boundary is recorded as an OTLP span and written locally to ~/.reflect/state/.

When you run reflect, it:

Reads spans from ~/.reflect/state/ (or falls back to each agent's native session logs if hooks aren't available)
Normalizes them into a single cross-agent data model — so a Claude tool call and a Copilot tool call look the same
Aggregates per-session and cross-session metrics: token totals, tool failure rates, latency percentiles, subagent delegation patterns
Renders the results as a terminal dashboard, markdown report, or JSON artifact for a hosted web view

Nothing leaves your machine. There's no cloud backend, no account, no API key.

What you get

Token economy — input, output, cache hits, largest-session concentration
Tool efficiency — failure rates, latency percentiles (p50/p90/p95/p99), tool-to-prompt ratio
Agent comparison — side-by-side across Claude, Copilot, Gemini, Cursor
Model breakdown — which models you're actually using and how much
MCP server tracking — observed usage counts and completion gaps from recorded MCP events
Subagent patterns — delegation frequency and types
Activity heatmaps — by hour and day of week
Actionable recommendations — based on your actual usage patterns

Output modes

reflect                        # interactive terminal dashboard (default)
reflect --no-terminal          # markdown report
reflect --dashboard-artifact out.json  # JSON artifact for dashboards
reflect --publish              # open local dashboard in browser
reflect --demo                 # instant demo with sample data

Health check

reflect doctor
reflect update

reflect doctor checks that your installation is healthy, shows which integrations are implemented vs still planned, and reports whether hooks are wired correctly, the installed package matches the latest release, and skill files are up to date. reflect update --apply upgrades the pipx package when a newer release is available.

Agent instrumentation landscape

reflect's mission is to make every AI coding agent observable with zero manual instrumentation. Today, though, only a subset of integrations have verified telemetry collection. reflect setup detects agent homes for guidance, but it only starts collection where wiring and parsing are implemented.

Agent	Instrumentation	What you get	Confidence
Claude Code	Native OTel + hooks	Metrics, logs, tool calls, sessions	High
GitHub Copilot VS Code	Native OTel	Traces, metrics, logs	High
GitHub Copilot CLI	Native OTel + hooks	Traces, metrics, logs	High
Gemini CLI	Native OTel + hooks	Traces, metrics, logs	High
OpenAI Codex CLI	Native OTel (interactive)	Traces (interactive mode only)	Medium
Cursor	Session/log adapters	Tool calls, sessions, rough token estimates when exact usage is missing (`len(text) / 4`)	Medium
Windsurf, Trae, Cline, Roo Code, Goose, OpenHands, Amp, Continue, iFlow, Pi, OpenClaw	Not implemented yet	Detection, config snapshots, and skill distribution only	Planned

Why Cursor is only medium confidence: local Cursor transcripts do not contain exact per-session usage, so reflect falls back to a rough len(text) / 4 estimate when provider-side token usage is unavailable.

Instrumentation paths:

Native OTel — agent has built-in OTLP export; reflect configures it to point at the local collector
Hooks — opentelemetry-hooks intercepts agent lifecycle events (session start, tool calls, stop)
Session/log adapters — reflect reads the agent's local session files directly when spans aren't available

When hook spans and OTLP traces are absent, reflect falls back to rich local session stores:

Cursor: ~/.cursor/projects/**/agent-transcripts/**/*.jsonl
Copilot: ~/.copilot/session-state/*/events.jsonl
Claude Code: ~/.claude/projects/**/*.jsonl
Gemini: ~/.gemini/tmp/**/chats/session-*.json

Advanced usage

Direct OTLP traces

If you already have OTLP JSON traces from a collector, skip setup:

reflect --otlp-traces path/to/otel-traces.json

A sibling otel-logs.json file is used automatically for enrichment when present.

Hosted dashboard

Write a JSON artifact for GitHub Pages or a local server:

reflect --dashboard-artifact docs/reports/latest.json --publish

For a safe public example, this repo also ships a curated GitHub Pages demo:

https://reflect.o11y.dev/showcase.html

All options

reflect [OPTIONS] [COMMAND]

Options:
  --sessions-dir PATH          Session metadata JSON directory
  --spans-dir PATH             Local span JSONL directory
  --otlp-traces PATH           OTLP JSON traces file
  --output PATH                Markdown report output path
  --terminal / --no-terminal   Terminal dashboard (default) or markdown report
  --dashboard-artifact PATH    Write dashboard JSON artifact
  --publish                    Open dashboard in browser
  --demo                       Run with bundled sample data
  --help                       Show help

Commands:
  setup    Install hooks, wire agents, configure telemetry
  doctor   Check installation health and agent status
  update   Check release drift and optional package upgrade

Data flow

reflect setup
    ├── installs opentelemetry-hooks
    ├── edits each agent's settings file to enable telemetry
    │       via hooks        Claude Code  → ~/.claude/settings.json
    │                        Codex CLI    → ~/.codex/config.toml
    │       via native otel  Claude Code  → ~/.claude/settings.json  (env block, metrics+logs)
    │                        Copilot VS Code → VS Code settings.json (otel.* keys)
    │                        Copilot CLI  → VS Code settings.json  (env block)
    │                        Gemini CLI   → ~/.gemini/settings.json  (telemetry.* keys)
    │                        Codex CLI    → ~/.codex/config.toml    ([otel] section)
    ├── distributes skill packages
    └── enables local span export to ~/.reflect/state/

Your AI tool → hooks -or- native OTLP → ~/.reflect/state/

reflect → reads traces → terminal dashboard / report / hosted view

Skill package

reflect ships with a portable skill for Claude Code. After reflect setup, the /reflect skill is available in your Claude Code session for in-session telemetry analysis.

Analysis schema

See docs/ai-observability-schema.md for the canonical cross-tool analysis schema.

reflect setup automatically installs opentelemetry-hooks, the instrumentation layer that captures spans from your AI agents.

Two optional extras if you need them:

opentelemetry-skill — observability knowledge for AI assistants
gateway — OTLP gateway for team/shared telemetry

License

Apache-2.0

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.8.3

May 26, 2026

0.8.2

May 26, 2026

0.8.1

May 26, 2026

0.8.0

May 26, 2026

0.7.2

May 4, 2026

0.7.1

Apr 28, 2026

0.7.0

Apr 26, 2026

0.6.1

Apr 20, 2026

0.6.0

Apr 20, 2026

0.5.0

Apr 18, 2026

0.4.2

Apr 16, 2026

0.4.1

Apr 15, 2026

0.4.0

Apr 15, 2026

0.3.0

Apr 14, 2026

0.2.0

Apr 14, 2026

This version

0.1.1

Apr 13, 2026

0.1.0

Apr 11, 2026

0.0.1

Apr 6, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

o11y_reflect-0.1.1.tar.gz (147.6 kB view details)

Uploaded Apr 13, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

o11y_reflect-0.1.1-py3-none-any.whl (132.5 kB view details)

Uploaded Apr 13, 2026 Python 3

File details

Details for the file o11y_reflect-0.1.1.tar.gz.

File metadata

Download URL: o11y_reflect-0.1.1.tar.gz
Upload date: Apr 13, 2026
Size: 147.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for o11y_reflect-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`6ff6d55c97e851f6fdc62aa5b574cc8d7756a71820f2e1c76410409a2e159b25`
MD5	`bc4ae9fed39ddc333004304bb246079d`
BLAKE2b-256	`2864427fd4df87d5bb8ecdeb06367fd5b44393e97d0e8e3ce87d727e35e75060`

See more details on using hashes here.

File details

Details for the file o11y_reflect-0.1.1-py3-none-any.whl.

File metadata

Download URL: o11y_reflect-0.1.1-py3-none-any.whl
Upload date: Apr 13, 2026
Size: 132.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for o11y_reflect-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ad1e31134e8b34140820b18af47c80e2fbce15468613161636d31418f1dc20d1`
MD5	`8bdc877a0b440956265100b15ae6de95`
BLAKE2b-256	`2990c9b3c2f0a6956a9e1c14f4db01af293b0e344bc275153d608fc3e406df9e`

See more details on using hashes here.

o11y-reflect 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Requirements

Quickstart

What people actually find

How it works

What you get

Output modes

Health check

Agent instrumentation landscape

Advanced usage

Direct OTLP traces

Hosted dashboard

All options

Data flow

Skill package

Analysis schema

Related

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes