shadowdance

Zero-code-change observability for Python

The first observability tool for LLM-powered robots.

The Problem

When your robot does something unexpected, you have no idea why.

Was it the vision model? The planner? A bad command? The network? Without traces there is no way to tell. Most robot SDKs ship with little or no built-in logging or tracing, so you're flying blind.

The Solution

One line of code. Wrap your robot client and see everything:

from unitree_sdk2py.go2.sport.sport_client import SportClient
from shadowdance import ShadowDance

# Your existing robot code
client = SportClient()
client.Init()

# ONE LINE - wrap with ShadowDance
client = ShadowDance(client)  # <- THAT'S IT. Everything below is traced.

# All robot commands now traced with inputs, outputs, timing
client.StandUp()
client.Move(0.3, 0, 0)
client.Damp()

No refactoring. No code changes. Just wrap and go.
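
For readers wondering how a single wrapping line can trace every call, the sketch below shows the general proxy technique using only the standard library. It is an illustration under stated assumptions, not ShadowDance's actual implementation: `TracingProxy`, `FakeRobot`, and the `events` list are hypothetical names introduced here.

```python
import time


class TracingProxy:
    """Forward attribute access to a wrapped object and record each method call."""

    def __init__(self, target, events):
        self._target = target
        self._events = events  # plain list standing in for a tracing backend

    def __getattr__(self, name):
        # Only invoked for attributes that are not found on the proxy itself.
        attr = getattr(self._target, name)
        if not callable(attr):
            return attr

        def traced(*args, **kwargs):
            start = time.perf_counter()
            event = {"name": name, "args": args, "kwargs": kwargs}
            try:
                event["output"] = attr(*args, **kwargs)
                return event["output"]
            except Exception as exc:
                event["error"] = repr(exc)
                raise
            finally:
                # Runs on success and on failure, so every call is recorded.
                event["duration_ms"] = (time.perf_counter() - start) * 1000
                self._events.append(event)

        return traced


class FakeRobot:
    """Hypothetical stand-in for a robot SDK client."""

    def StandUp(self):
        return 0

    def Move(self, vx, vy, vyaw):
        return 0


events = []
robot = TracingProxy(FakeRobot(), events)
robot.StandUp()
robot.Move(0.3, 0, 0)
print([event["name"] for event in events])
```

Because the proxy intercepts attribute lookup rather than subclassing the client, the calling code keeps using the same method names, which is what makes a wrap-and-go interface possible.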

End-to-End ML Pipeline Observability

Modern LLM-powered robots span multiple systems:

┌─────────────────────────────────────────┐
│  Cloud LLM (OpenAI, Anthropic)          │
│  "pick up the white box" → commands     │
├─────────────────────────────────────────┤
│  Your Agent Code                        │
│  Vision → Planning → Execution          │
├─────────────────────────────────────────┤
│  Robot (Unitree Go2, H1, etc.)          │
│  Move, StandUp, Damp, gripper control   │
└─────────────────────────────────────────┘

ShadowDance traces the entire pipeline:

from unitree_sdk2py.go2.sport.sport_client import SportClient
from shadowdance import ShadowDance
from openai import OpenAI

# Wrap your LLM (ONE LINE)
llm = OpenAI()
llm = ShadowDance(llm, run_type="llm")

# Wrap your robot (ONE LINE)
robot = SportClient()
robot = ShadowDance(robot, run_type="tool")

# Now you see the FULL chain in your dashboard:
# LLM prompt → generated commands → robot execution → timing → errors

Choose Your Observability Platform

LangSmith (default):

export PLATFORM=langsmith
export LANGCHAIN_API_KEY=...

Langfuse:

export PLATFORM=langfuse
export LANGFUSE_PUBLIC_KEY=...
export LANGFUSE_SECRET_KEY=...

Weave (Weights & Biases):

export PLATFORM=weave
export WANDB_API_KEY=...
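
A missing credential is easy to overlook, so a small pre-flight check can be useful before running robot code. The sketch below is not part of ShadowDance: `REQUIRED_ENV` and `check_platform_env` are hypothetical helpers, and the variable names are simply the ones from the export lines above.

```python
import os

# Credential variables per PLATFORM value, copied from the export lines above.
REQUIRED_ENV = {
    "langsmith": ["LANGCHAIN_API_KEY"],
    "langfuse": ["LANGFUSE_PUBLIC_KEY", "LANGFUSE_SECRET_KEY"],
    "weave": ["WANDB_API_KEY"],
}


def check_platform_env(environ=None):
    """Return (platform, missing_vars) for the given environment mapping."""
    environ = os.environ if environ is None else environ
    # LangSmith is described as the default platform.
    platform = environ.get("PLATFORM", "langsmith").lower()
    if platform not in REQUIRED_ENV:
        raise ValueError(
            f"Unknown PLATFORM {platform!r}; expected one of {sorted(REQUIRED_ENV)}"
        )
    missing = [var for var in REQUIRED_ENV[platform] if not environ.get(var)]
    return platform, missing


platform, missing = check_platform_env(
    {"PLATFORM": "langfuse", "LANGFUSE_PUBLIC_KEY": "pk-test"}
)
print(platform, missing)
```

Calling `check_platform_env()` with no argument reads the real process environment.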

What You Get

Full Trace Visibility

Run: robot_session
  ├── StandUp()                          8ms  ✓
  ├── Move(vx=0.3, vy=0, vyaw=0)        12ms  ✓
  ├── Move(vx=0, vy=0.3, vyaw=0)        11ms  ✓
  └── Damp()                             9ms  ✓

Nested Task Organization

from unitree_sdk2py.go2.sport.sport_client import SportClient
from shadowdance import ShadowDance, task

@task("pick_up_box")
def pick_up_box():
    robot = ShadowDance(SportClient())
    robot.StandUp()
    robot.Move(0.3, 0, 0)
    robot.Damp()

In your dashboard:

pick_up_box (chain)
├── StandUp (tool)
├── Move (tool)
└── Damp (tool)

LLM + Robot Correlation

See how LLM decisions affect robot behavior:

code_as_policies_task (chain)
├── vision_analysis (llm)
│   └── "white_box at [0.0, 0.1, 0.72]"
├── code_generation (llm)
│   └── "robot.move_to(0.0, 0.1, 0.72)"
└── code_execution (tool)
    ├── move_to (tool)  ✓
    └── close_gripper (tool)  ✓

Datasets & Regression Testing

from unitree_sdk2py.go2.sport.sport_client import SportClient
from shadowdance import ShadowDance

# Log all robot commands to a dataset
robot = ShadowDance(
    SportClient(),
    run_type="tool",
    log_to_dataset="robot-tasks"
)

# Every command logged for evaluation
robot.StandUp()      # ✓ Logged
robot.Move(0.3, 0, 0)  # ✓ Logged

In your dashboard:

1. Go to Datasets & Experiments
2. Find robot-tasks with all executions
3. Compare robot versions
4. Run regression tests
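
To make the regression idea concrete, the sketch below compares a recorded baseline command log with a new run and reports where they diverge. It runs locally on plain Python data and does not use any platform's dataset API; `diff_command_logs` and the `(name, args)` tuple format are assumptions made for illustration.

```python
def diff_command_logs(baseline, candidate):
    """Compare two command logs position by position and list the differences.

    Each log is a list of (command_name, args) tuples. The result is a list of
    (index, expected, actual) tuples; None marks a command missing on one side.
    """
    differences = []
    for index in range(max(len(baseline), len(candidate))):
        expected = baseline[index] if index < len(baseline) else None
        actual = candidate[index] if index < len(candidate) else None
        if expected != actual:
            differences.append((index, expected, actual))
    return differences


baseline = [("StandUp", ()), ("Move", (0.3, 0, 0)), ("Damp", ())]
candidate = [("StandUp", ()), ("Move", (0.5, 0, 0)), ("Damp", ())]

for index, expected, actual in diff_command_logs(baseline, candidate):
    print(f"step {index}: expected {expected}, got {actual}")
```

An empty result means the new robot version issued exactly the same commands as the baseline.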

Installation

pip install shadowdance

Then install your chosen platform:

# For LangSmith (default)
pip install langsmith

# For Langfuse
pip install langfuse

# For Weave
pip install wandb

Quick Start

# Set your platform
export PLATFORM=langsmith
export LANGCHAIN_API_KEY=your-key

# Run your robot code
python your_robot_script.py

View traces at:

LangSmith: smith.langchain.com
Langfuse: Your Langfuse dashboard
Weave: Your Weave project in W&B

API

`ShadowDance(client, run_type="tool", log_to_dataset=None)`

Wraps any client object with observability tracing.

Args:

client: The client object to wrap (Unitree SDK, OpenAI, etc.)
run_type: Label used to categorize and filter runs ("tool", "llm", "chain", etc.; see Run Types)
log_to_dataset: Optional dataset name for evaluation

Example:

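from unitree_sdk2py.go2.sport.sport_client import SportClient
from openai import OpenAI
from shadowdance import ShadowDance

# Robot
robot = ShadowDance(SportClient(), run_type="tool")

# LLM
llm = ShadowDance(OpenAI(), run_type="llm")

# Agent (MyAgent stands in for your own agent class)
agent = ShadowDance(MyAgent(), run_type="chain")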

`@task(name, run_type="chain")`

Decorator to create parent runs for nested tracing.

Example:

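from unitree_sdk2py.go2.sport.sport_client import SportClient
from shadowdance import ShadowDance, task

@task("pick_up_box")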
def pick_up_box():
    robot = ShadowDance(SportClient())
    robot.StandUp()  # Nested under "pick_up_box"

`task_context(name, run_type="chain")`

Context manager for creating parent runs.

Example:

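from unitree_sdk2py.go2.sport.sport_client import SportClient
# Assumes task_context is exported at the package top level, like task
from shadowdance import ShadowDance, task_context

with task_context("move_to_kitchen"):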
    robot = ShadowDance(SportClient())
    robot.Move(0.5, 0, 0)

Run Types

Run Type	Use Case	Example
`"llm"`	LLM/VLM API calls	OpenAI, Anthropic, vision models
`"tool"`	Robot commands, API calls	Move, StandUp, gripper control
`"chain"`	Orchestration logic	Agents, multi-step workflows
`"retriever"`	Document retrieval	RAG systems, vector stores
`"embedding"`	Embedding generation	Text embeddings

Why ShadowDance?

Before ShadowDance:

Robot SDKs have zero observability
No way to debug why the robot did X instead of Y
Can't correlate LLM decisions with robot actions
No regression testing for robot behavior
Flying blind in production

After ShadowDance:

Every robot command traced with timing and results
Full LLM → robot pipeline visibility
Organized traces by task
Datasets for evaluation and regression
Debug production issues from your dashboard

One line of code. That's all it takes to go from blind to full visibility.

File Structure

./shadowdance/                # Main package
├── __init__.py               # ShadowDance wrapper + factory
└── adapters/
    ├── __init__.py           # Base interface + TraceEvent
    ├── langsmith.py          # LangSmith adapter
    ├── langfuse.py           # Langfuse adapter
    ├── weave.py              # Weave adapter (W&B)
    ├── example.py            # Template for custom adapters
    └── README.md             # Adapter documentation
./test_shadowdance.py         # Unit tests
./examples/                   # Example code
./pyproject.toml              # Package configuration
./requirements.txt            # Dependencies
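
The layout above mentions a base interface, a TraceEvent, and a template for custom adapters. The sketch below shows what such an adapter could look like. Every field and method name here is hypothetical (the real definitions live in `adapters/__init__.py` and `adapters/example.py`, and may differ), so treat it as a shape to adapt rather than the package's API.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass, field
from typing import Any, Optional


@dataclass
class TraceEvent:
    """Hypothetical event record; the package's real TraceEvent may differ."""

    name: str
    run_type: str = "tool"
    inputs: dict = field(default_factory=dict)
    output: Any = None
    error: Optional[str] = None
    duration_ms: float = 0.0


class Adapter(ABC):
    """Hypothetical base interface: one method that receives finished events."""

    @abstractmethod
    def emit(self, event: TraceEvent) -> None:
        """Send one event to the backend."""


class InMemoryAdapter(Adapter):
    """Keeps events in a list, which is handy in unit tests."""

    def __init__(self) -> None:
        self.events = []

    def emit(self, event: TraceEvent) -> None:
        self.events.append(event)


adapter = InMemoryAdapter()
adapter.emit(
    TraceEvent(name="Move", inputs={"vx": 0.3, "vy": 0, "vyaw": 0}, duration_ms=12.0)
)
print(adapter.events[0].name, adapter.events[0].run_type)
```

Keeping the interface to a single method means a new backend only has to translate one event type into its own format.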

Testing

python test_shadowdance.py

License

MIT

Release history

0.5.1 (this version): Mar 16, 2026
0.5.0: Mar 16, 2026
0.4.0: Mar 16, 2026
0.3.0: Mar 15, 2026
0.2.0: Mar 15, 2026
0.1.0: Mar 15, 2026

Download files

Source Distribution

shadowdance-0.5.1.tar.gz (1.5 MB), uploaded Mar 16, 2026

Built Distribution

shadowdance-0.5.1-py3-none-any.whl (7.9 kB, Python 3), uploaded Mar 16, 2026

File details

Details for the file shadowdance-0.5.1.tar.gz.

File metadata

Download URL: shadowdance-0.5.1.tar.gz
Upload date: Mar 16, 2026
Size: 1.5 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for shadowdance-0.5.1.tar.gz
Algorithm	Hash digest
SHA256	`2d624da74dd0b80283599e57f3866cb30b58552aa6966f3cf0104cd4636cc114`
MD5	`2c79cb623c20349930246526da431820`
BLAKE2b-256	`fb5ee5d3ee2b2e664755cd67b73177047a81f6de07675224040b246bb3ae58ff`

File details

Details for the file shadowdance-0.5.1-py3-none-any.whl.

File metadata

Download URL: shadowdance-0.5.1-py3-none-any.whl
Upload date: Mar 16, 2026
Size: 7.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for shadowdance-0.5.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5a6bbc39e1633ee587a1138496ddb40bb5d5f067ab113f233338bee5f84826f8`
MD5	`e3ca713e031842860013a4a508dda687`
BLAKE2b-256	`c6b4e398cb98d778c4970aede401581b7b6a1b24c268a12fbc2d8c5d2cd53af8`
