Auto-instrumentation and visibility for AI agents — OpenAI Agents SDK, Claude SDK, and more.

These details have not been verified by PyPI

Project links

Project description

mimir-observe

Auto-instrumentation and visibility for AI agents. Two lines of code, zero config.

pip install mimir-observe

Quick start

1. Add instrumentation (2 lines)

Pick the one that matches your stack:

import mimir

# Raw OpenAI client (chat.completions.create)
mimir.instrument_openai()

# Raw Anthropic client (messages.create)
mimir.instrument_anthropic()

# OpenAI Agents SDK (Runner.run / Runner.run_streamed)
mimir.instrument_openai_agents()

# Claude Agent SDK (query)
mimir.instrument_claude()

Add these lines at the top of your entry point, before any API calls. That's it. Your existing code stays exactly the same.

2. Start the dashboard

In a separate terminal:

python -m mimir.cli dashboard

Open http://localhost:9847 to see your runs.

3. There is no step 3

Every API call and agent run is now captured automatically. The dashboard shows:

Agent list with run counts, models, and tools
Run timeline with every tool call (args + results), reasoning block, and token usage
Run diffing -- side-by-side comparison of any two runs
Divergence detection -- flags agents whose reruns follow different tool patterns

What gets captured

Data	How
Tool calls	Name, arguments, result, duration
Reasoning	Model output text between tool calls
Token usage	Input/output tokens per call
Cost	If set via `run.set_cost()`
Run duration	Wall clock time
Run status	Success or error
Input/output	Prompt and final result

Which instrument function do I use?

Your code uses	Function
`from openai import OpenAI`	`mimir.instrument_openai()`
`from anthropic import Anthropic`	`mimir.instrument_anthropic()`
`from agents import Runner`	`mimir.instrument_openai_agents()`
`from claude_code_sdk import query`	`mimir.instrument_claude()`

You can call multiple if your project uses more than one SDK.

Multi-turn agentic loops

If your agent calls the API multiple times in a loop, wrap the loop in a mimir.task().run() so all calls are grouped as one run:

import mimir
mimir.instrument_openai()  # or instrument_anthropic()

from openai import OpenAI
client = OpenAI()

t = mimir.task(name="My Agent", config="research assistant", tools=["search"], model="gpt-4o")
with t.run(input={"prompt": user_input}) as run:
    # Every API call inside this block becomes a step in the same run
    response = client.chat.completions.create(model="gpt-4o", messages=[...])
    response = client.chat.completions.create(model="gpt-4o", messages=[...])
    response = client.chat.completions.create(model="gpt-4o", messages=[...])
    run.set_output("final result")

Without the wrapper, each API call creates its own run — fine for single calls, wrong for loops.

How it works

Mimir monkey-patches the SDK at the class level when you call instrument_*(). Every subsequent API call is intercepted, telemetry is extracted from the request/response, and it's sent to the local dashboard via fire-and-forget HTTP. Your agent code is never blocked or slowed down.

Zero external dependencies (stdlib only)
All data stays local (~/.mimir/)
Dashboard down? Agent runs normally, no errors
Uninstrument anytime: mimir.uninstrument_openai(), etc.

Manual instrumentation

For custom setups where auto-instrumentation doesn't fit:

import mimir

t = mimir.task(
    name="My Agent",
    config="what it does",
    tools=["search", "write"],
    model="gpt-4o",
)

with t.run(input={"prompt": "user input"}) as run:
    run.tool("search", {"q": "test"}, "3 results", duration_ms=150)
    run.reasoning("Found relevant results, writing report...")
    run.tool("write", {"file": "report.md"}, "ok", duration_ms=50)
    run.set_usage(1500, 800)
    run.set_output("Report written")

Onboarding with Claude Code

If you use Claude Code, paste this prompt to have it instrument your project automatically:

Install and set up Mimir agent observability in this project.

Step 1: pip install mimir-observe (if not already installed). Import as `import mimir`.

Step 2: Find the entry point(s) and determine which SDK is used:
  - `from openai import OpenAI` → add `mimir.instrument_openai()`
  - `from anthropic import Anthropic` → add `mimir.instrument_anthropic()`
  - `from agents import Runner` → add `mimir.instrument_openai_agents()`
  - `from claude_code_sdk import query` → add `mimir.instrument_claude()`

Add the 2 lines (import + instrument call) at the top of each entry point,
BEFORE any API calls. No other code changes needed.

Step 3: If the code has multi-turn agentic loops (calling the API multiple times
in a while/for loop), wrap the loop in a mimir task so all turns become steps
in one run instead of separate runs:

    t = mimir.task(name="Agent Name", config="what it does", model="model-name")
    with t.run(input={"prompt": user_input}) as run:
        # ... the existing loop goes here, unchanged ...
        run.set_output(final_result)

Single API calls outside a loop do NOT need this wrapper — they auto-create runs.

Step 4: Start the dashboard: python -m mimir.cli dashboard

CLI

python -m mimir.cli dashboard              # start on :9847
python -m mimir.cli dashboard --port 8080  # custom port
python -m mimir.cli version                # print version

Requirements

Python 3.10+
No external dependencies

The SDKs you want to instrument (openai, anthropic, openai-agents, etc.) must be installed separately.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.0.3

Apr 29, 2026

1.0.2

Apr 29, 2026

1.0.1

Apr 29, 2026

1.0.0

Apr 28, 2026

0.9.6

Apr 17, 2026

0.9.5

Apr 17, 2026

0.9.4

Apr 17, 2026

0.9.3

Apr 16, 2026

0.9.2

Apr 16, 2026

0.9.1

Apr 16, 2026

0.9.0

Apr 16, 2026

0.8.0

Apr 16, 2026

0.7.2

Apr 16, 2026

0.7.1

Apr 16, 2026

0.7.0

Apr 16, 2026

0.6.1

Apr 15, 2026

0.6.0

Apr 15, 2026

0.5.1

Apr 15, 2026

0.5.0

Apr 14, 2026

0.4.2

Apr 14, 2026

0.4.1

Apr 14, 2026

0.4.0

Apr 14, 2026

0.3.2

Apr 14, 2026

0.3.1

Apr 13, 2026

0.3.0

Apr 13, 2026

This version

0.2.2

Apr 13, 2026

0.2.1

Apr 13, 2026

0.2.0

Apr 13, 2026

0.1.0

Apr 13, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mimir_observe-0.2.2.tar.gz (30.7 kB view details)

Uploaded Apr 13, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mimir_observe-0.2.2-py3-none-any.whl (26.8 kB view details)

Uploaded Apr 13, 2026 Python 3

File details

Details for the file mimir_observe-0.2.2.tar.gz.

File metadata

Download URL: mimir_observe-0.2.2.tar.gz
Upload date: Apr 13, 2026
Size: 30.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for mimir_observe-0.2.2.tar.gz
Algorithm	Hash digest
SHA256	`5468bb3cdc08abb6cc83958bea68b6c0bc073e5eae40c055dcb549812b2332f0`
MD5	`55d8e9690a2341aeb3090305b7bef8fe`
BLAKE2b-256	`51beb495c19e953020a4071ded80180ac7ddfdcc970b1eb50d778c7dd8a00ba0`

See more details on using hashes here.

File details

Details for the file mimir_observe-0.2.2-py3-none-any.whl.

File metadata

Download URL: mimir_observe-0.2.2-py3-none-any.whl
Upload date: Apr 13, 2026
Size: 26.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for mimir_observe-0.2.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4ee935115485c325cdbd373716f18b97a26d9370e2df9bde6a8a4383e37f5e28`
MD5	`96cdbf1282e0807a4ae397a631e17c51`
BLAKE2b-256	`3b0ff3b8c9352b8a0dd261ca604ca83c8d8efd5868c8c1774048ddc37987ac6c`

See more details on using hashes here.

mimir-observe 0.2.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

mimir-observe

Quick start

1. Add instrumentation (2 lines)

2. Start the dashboard

3. There is no step 3

What gets captured

Which instrument function do I use?

Multi-turn agentic loops

How it works

Manual instrumentation

Onboarding with Claude Code

CLI

Requirements

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes