fastharness

Expose AI agents as A2A-compliant FastAPI services with pluggable runtime backends

Project description

FastHarness

Turn AI agents into production-ready A2A services — with pluggable runtime backends.

FastHarness exposes agents through Google's A2A (Agent-to-Agent) protocol. Define agents with decorators, pick a runtime backend (Claude, OpenHands, or Pydantic DeepAgents), and FastHarness handles protocol compliance, message conversion, task lifecycle, and multi-turn conversations.

from fastharness import FastHarness, Skill

harness = FastHarness(name="my-agent")

harness.agent(
    name="assistant",
    description="A helpful assistant",
    skills=[Skill(id="help", name="Help", description="Answer questions")],
    system_prompt="You are helpful.",
    tools=["Read", "Grep"],
)

app = harness.app  # Ready to deploy

Why FastHarness?

Building AI agents is easy. Making them interoperable is hard:

Without FastHarness	With FastHarness
Implement A2A protocol manually	Automatic A2A compliance
Handle message format conversion	Built-in message conversion
Manage task lifecycle and state	Managed task execution
Build conversation history tracking	Multi-turn conversations out of the box
Create JSON-RPC endpoints	FastAPI endpoints ready
Write agent card generation	Auto-generated agent cards
Lock into one agent framework	Pluggable runtime backends

What FastHarness Adds

On top of agent SDKs + A2A SDK:

Multi-turn conversations — Runtime sessions maintain conversation history across A2A requests
Pluggable runtimes — Swap between Claude, OpenHands, and Pydantic DeepAgents backends
Cost tracking — Built-in telemetry callbacks for monitoring API usage
Step logging — Debug middleware for tool calls and intermediate steps
Zero-config protocol bridge — Decorator API handles all A2A protocol machinery

Installation

# Core (Claude Agent SDK backend)
uv add fastharness

# With OpenHands backend (requires Python 3.12+)
uv add fastharness[openhands]

# With Pydantic DeepAgents backend
uv add fastharness[deepagents]

# Both optional backends
uv add fastharness[openhands,deepagents]

Environment Setup

FastHarness automatically loads environment variables from a .env file in your project root at import time. Create one for API keys:

# .env
ANTHROPIC_API_KEY=sk-ant-...

The Claude runtime uses the Claude Agent SDK subprocess (which handles auth via the claude CLI). The DeepAgents and OpenHands runtimes use API keys directly — set ANTHROPIC_API_KEY in .env or your shell environment.

Quick Start

1. Define your agent:

from fastharness import FastHarness, Skill

harness = FastHarness(name="my-agent")

harness.agent(
    name="assistant",
    description="A helpful assistant",
    skills=[Skill(id="help", name="Help", description="Answer questions")],
    system_prompt="You are helpful.",
    tools=["Read", "Grep"],
)

app = harness.app

2. Run it:

uvicorn mymodule:app --port 8000

3. Talk to it (Python):

import asyncio
from fastharness import FastHarnessClient

async def main():
    async with FastHarnessClient("http://localhost:8000") as client:
        reply = await client.send("Hello!")
        print(reply)

        # Multi-turn — same context_id maintains conversation
        reply = await client.send("My name is Alice", context_id="conv-1")
        reply = await client.send("What's my name?", context_id="conv-1")
        print(reply)  # "Alice"

        # Stream tokens as they arrive
        async for chunk in client.stream("Write a haiku"):
            print(chunk, end="", flush=True)

asyncio.run(main())

Or with curl

# Agent card
curl http://localhost:8000/.well-known/agent-card.json

# Send a message
curl -X POST http://localhost:8000/ \
  -H "Content-Type: application/json" \
  -d '{
    "jsonrpc": "2.0",
    "method": "message/send",
    "params": {
      "message": {
        "role": "user",
        "parts": [{"kind": "text", "text": "Hello!"}],
        "messageId": "msg-1"
      }
    },
    "id": 1
  }'

# Stream (SSE)
curl -X POST http://localhost:8000/ \
  -H "Content-Type: application/json" \
  -H "Accept: text/event-stream" \
  -d '{
    "jsonrpc": "2.0",
    "method": "message/sendStream",
    "params": {
      "message": {
        "role": "user",
        "parts": [{"kind": "text", "text": "Hello!"}],
        "messageId": "msg-1"
      }
    },
    "id": 1
  }'

Runtime Backends

FastHarness uses an AgentRuntime / AgentRuntimeFactory protocol system that decouples agent execution from any specific SDK. You can swap backends without changing your agent definitions.

Claude (default)

Uses the Claude Agent SDK via subprocess. No API key needed if the claude CLI is configured.

from fastharness import FastHarness

harness = FastHarness(name="my-agent")  # ClaudeRuntimeFactory is the default

OpenHands

Uses the OpenHands Software Agent SDK for agents with terminal, file editing, and workspace capabilities.

uv add fastharness[openhands]

from fastharness import FastHarness
from fastharness.runtime.openhands import OpenHandsRuntimeFactory

harness = FastHarness(
    name="dev-agent",
    runtime_factory=OpenHandsRuntimeFactory(workspace="/path/to/project"),
)

Pydantic DeepAgents

Uses Pydantic DeepAgents for agents built on pydantic-ai with planning, subagents, and structured output.

uv add fastharness[deepagents]

from fastharness import FastHarness
from fastharness.runtime.deepagents import DeepAgentsRuntimeFactory

harness = FastHarness(
    name="research-agent",
    runtime_factory=DeepAgentsRuntimeFactory(),
)

Note: DeepAgents requires ANTHROPIC_API_KEY (or the appropriate provider key) in your environment or .env file.

Custom Runtimes

Implement the AgentRuntime and AgentRuntimeFactory protocols to add your own backend:

from fastharness.runtime.base import AgentRuntime, AgentRuntimeFactory

class MyRuntime:
    async def run(self, prompt: str) -> Any:
        """Execute prompt, return result."""
        ...

    async def stream(self, prompt: str) -> AsyncIterator[Event]:
        """Execute prompt, yield events."""
        ...

    async def aclose(self) -> None:
        """Cleanup resources."""
        ...

class MyRuntimeFactory:
    async def get_or_create(self, session_key: str, config: AgentConfig) -> MyRuntime:
        ...
    async def remove(self, session_key: str) -> None:
        ...
    async def start_cleanup_task(self) -> None:
        ...
    async def shutdown(self) -> None:
        ...

harness = FastHarness(name="my-agent", runtime_factory=MyRuntimeFactory())

Multi-Turn Conversations

FastHarness maintains conversation history automatically. Just use the same contextId on the message — this is the standard A2A conversation identifier:

# Message 1: "My name is Alice"
# → Response: "Nice to meet you, Alice!"

# Message 2: "What's my name?" (same contextId)
# → Response: "Your name is Alice!"

How it works:

All messages with the same contextId share history via the runtime session pool
Sessions are reused for 15 minutes by default (configurable via ttl_minutes)
No manual history management needed

Full example with curl

# First message
curl -X POST http://localhost:8000/ -H "Content-Type: application/json" -d '{
  "jsonrpc": "2.0",
  "method": "message/send",
  "params": {
    "message": {
      "role": "user",
      "contextId": "conv-123",
      "parts": [{"kind": "text", "text": "My name is Alice"}],
      "messageId": "msg-1"
    }
  },
  "id": 1
}'

# Follow-up (agent remembers "Alice")
curl -X POST http://localhost:8000/ -H "Content-Type: application/json" -d '{
  "jsonrpc": "2.0",
  "method": "message/send",
  "params": {
    "message": {
      "role": "user",
      "contextId": "conv-123",
      "parts": [{"kind": "text", "text": "What is my name?"}],
      "messageId": "msg-2"
    }
  },
  "id": 2
}'

Usage Patterns

Custom Agent Loop

Take full control over execution while FastHarness handles protocol machinery:

@harness.agentloop(
    name="researcher",
    description="Deep research assistant",
    skills=[Skill(id="research", name="Research", description="Conduct research")],
)
async def researcher(prompt, ctx, client):
    """Custom loop with iterative refinement."""
    result = await client.run(prompt)

    # Keep researching until confident
    while "need more information" in result.lower():
        result = await client.run("Continue researching, go deeper")

    return result

Mount on Existing FastAPI App

from fastapi import FastAPI
from contextlib import asynccontextmanager

@asynccontextmanager
async def lifespan(app):
    async with harness.lifespan_context():
        yield

app = FastAPI(lifespan=lifespan)
app.mount("/agents", harness.app)

Advanced Features

Cost Tracking

Monitor API costs with configurable thresholds:

from fastharness import CostTracker

tracker = CostTracker(warn_threshold_usd=1.0)

@harness.agentloop(...)
async def agent(prompt, ctx, client):
    client.telemetry_callbacks.append(tracker)
    result = await client.run(prompt)
    print(f"Cost: ${tracker.total_cost_usd:.4f}")
    return result

Step Logging

Debug with detailed step-by-step logging:

from fastharness import ConsoleStepLogger

client = HarnessClient(step_logger=ConsoleStepLogger())
result = await client.run(prompt)

Output:

[step_logger] Tool call
    turn: 0
    tool_name: Read
    tool_id: call_123
[step_logger] Assistant message
    turn: 0
    text_preview: Found the bug...
[step_logger] Turn complete
    turn: 0
    cost_usd: 0.01

MCP Server Integration

Connect external services via Model Context Protocol:

harness.agent(
    name="assistant",
    description="Multi-tool assistant",
    skills=[...],
    mcp_servers={
        "filesystem": {
            "command": "node",
            "args": ["mcp-server-stdio-filesystem"],
        },
    },
    tools=["mcp__filesystem__read", "mcp__filesystem__write"],
)

CLAUDE.md Support

Agents automatically load project context from CLAUDE.md:

harness.agent(
    name="reviewer",
    description="Code reviewer",
    skills=[...],
    # setting_sources=["project"] is default - loads CLAUDE.md
)

# To disable:
harness.agent(
    name="readonly",
    description="Read-only assistant",
    skills=[...],
    setting_sources=[],  # Don't load settings
)

Configuration

HarnessClient Options

Option	Default	Description
`system_prompt`	`None`	System prompt for the agent
`tools`	`[]`	Allowed tools (`["Read", "Grep", "Glob"]`)
`model`	`claude-sonnet-4-20250514`	Model identifier
`max_turns`	`None`	Max conversation turns
`mcp_servers`	`{}`	MCP server configs
`setting_sources`	`["project"]`	Load CLAUDE.md automatically
`output_format`	`None`	JSON schema for structured output
`runtime`	`None`	Injected `AgentRuntime` (set by factory)

Override per-call:

result = await client.run(prompt, model="claude-opus-4-20250514", max_turns=5)

A2A Endpoints

Endpoint	Description
`/.well-known/agent-card.json`	Agent metadata and capabilities
`/`	JSON-RPC endpoint (`message/send`, `message/sendStream`, `tasks/get`, `tasks/cancel`)
`/docs`	Interactive API documentation

Architecture

FastHarness (app.py)
    │
    ├── Decorators: .agent() and .agentloop()
    │   └── Register agents with skills, tools, system prompts
    │
    ├── A2AFastAPIApplication (native A2A SDK)
    │   └── Exposes A2A endpoints: /.well-known/agent-card.json, JSON-RPC /
    │
    ├── ClaudeAgentExecutor (worker/claude_executor.py)
    │   └── Executes tasks using HarnessClient → AgentRuntime
    │
    └── AgentRuntimeFactory (runtime/base.py)  ← Protocol
        ├── ClaudeRuntimeFactory    → Claude Agent SDK subprocess
        ├── OpenHandsRuntimeFactory → OpenHands SDK Conversation
        └── DeepAgentsRuntimeFactory → Pydantic DeepAgents

Flow: A2A request → DefaultRequestHandler → ClaudeAgentExecutor → HarnessClient → AgentRuntime → SDK → A2A response

Examples

Complete working examples in examples/:

simple_agent.py - Basic agent with multi-turn support
fastapi_integration.py - Mounting on existing FastAPI app
advanced_features.py - Cost tracking, logging, MCP servers

Run examples:

uv run uvicorn examples.simple_agent:app --port 8000

LiteLLM Integration

Route through LiteLLM for alternative providers:

ANTHROPIC_BASE_URL=http://localhost:4000
ANTHROPIC_API_KEY=your-litellm-key
ANTHROPIC_MODEL=sonnet-4

License

MIT © Prassanna Ravishankar

Built for the AI agent community

Project details

Release history Release notifications | RSS feed

This version

1.1.0

Mar 16, 2026

0.3.1

Feb 13, 2026

0.3.0

Feb 12, 2026

0.2.1

Jan 29, 2026

0.2.0

Jan 29, 2026

0.1.1

Jan 24, 2026

0.1.0

Jan 24, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fastharness-1.1.0.tar.gz (27.4 kB view details)

Uploaded Mar 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

fastharness-1.1.0-py3-none-any.whl (38.0 kB view details)

Uploaded Mar 16, 2026 Python 3

File details

Details for the file fastharness-1.1.0.tar.gz.

File metadata

Download URL: fastharness-1.1.0.tar.gz
Upload date: Mar 16, 2026
Size: 27.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.10.10 {"installer":{"name":"uv","version":"0.10.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for fastharness-1.1.0.tar.gz
Algorithm	Hash digest
SHA256	`8fc84d23fd53131ccd49c6ef2e979c78691cfa46dafc5a07819feb36be427f26`
MD5	`d6e3aeb2d067a102989660c90f351f1a`
BLAKE2b-256	`82771dfbdfffb739edb37728fc0904e76f13a93f7cd1482c45678ec317cb7be4`

See more details on using hashes here.

File details

Details for the file fastharness-1.1.0-py3-none-any.whl.

File metadata

Download URL: fastharness-1.1.0-py3-none-any.whl
Upload date: Mar 16, 2026
Size: 38.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.10.10 {"installer":{"name":"uv","version":"0.10.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for fastharness-1.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8a3ab698e3a46160d4ba005075ee24d0b3f14c837147780d87816bd3ac4351e0`
MD5	`4acbd5483689f1d4fc9b6108eb345b0f`
BLAKE2b-256	`f8bd1e210135c04046ad89aadb4830b6cc4c1ea74c4c5910f361f27e65f20137`

See more details on using hashes here.

fastharness 1.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

FastHarness

Why FastHarness?

What FastHarness Adds

Installation

Environment Setup

Quick Start

Runtime Backends

Claude (default)

OpenHands

Pydantic DeepAgents

Custom Runtimes

Multi-Turn Conversations

Usage Patterns

Custom Agent Loop

Mount on Existing FastAPI App

Advanced Features

Configuration

HarnessClient Options

A2A Endpoints

Architecture

Examples

LiteLLM Integration

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes