Skip to main content

fast_a2a_app — Drop-in A2A server and chat UI for any AI agent

Project description

fast_a2a_app

Drop-in A2A server and chat UI for any FastAPI application running AI agents — installable from PyPI.

pip install fast_a2a_app

Why fast_a2a_app

The Agent2Agent (A2A) protocol is HTTP for AI agents — a shared contract that lets any agent talk to any client (chat UI, orchestrator, another agent) across companies and frameworks. Turning a Python coroutine into a spec-compliant A2A server is a lot of plumbing: JSON-RPC routes, SSE streaming, task lifecycle, cross-instance cancel, agent-card discovery, multi-turn history. fast_a2a_app does it for you, mounted cleanly into the FastAPI app you already run.

  • 🔌 Mount, don't replace. Starlette app you mount at any path prefix. Auth, middleware, CORS, observability — all yours, unchanged.
  • 🧱 Framework-agnostic. No dependency on Pydantic AI, LangChain, or any agent runtime. Wrap any async (str) -> str (or async generator) and you're done.
  • 💬 Batteries-included chat UI. Self-contained browser interface — no build step, no npm. Markdown, tables, maps, clickable suggestions, file uploads, image previews, fullscreen viewer.
  • 🧩 Typed-artifact widgets you can extend. Drop a <TAG>.py + <TAG>.js pair to add a new chat widget; built-ins ship TABLE, PROMPT_SUGGESTIONS, and MAP (Leaflet).
  • 📡 Real protocol, not a mock. Streaming SSE, multi-turn history, cross-instance cancel, reload recovery, agent-card discovery — built on a2a-sdk 1.0.x.

60-second quickstart

One file, three lines of glue — and you get a fully spec-compliant streaming A2A server with a built-in chat UI on top of an Azure OpenAI chat-completions call:

# main.py
import os
from collections.abc import AsyncIterable

from fastapi import FastAPI
from a2a.types import AgentCapabilities, AgentCard, AgentInterface
from fast_a2a_app import a2a_ui, build_a2a_app, build_stream_invoke

from azure.identity.aio import AzureCliCredential, get_bearer_token_provider
from openai import AsyncOpenAI


# Azure OpenAI client — bearer token from `az login` (no API key needed).
client = AsyncOpenAI(
    base_url=f"{os.environ['AZURE_AI_BASE_URL'].rstrip('/')}/openai/v1",
    api_key=get_bearer_token_provider(AzureCliCredential(), "https://ai.azure.com/.default"),
)

# Your agent: any async generator yielding text chunks.
async def stream_chat(prompt: str) -> AsyncIterable[str]:
    stream = await client.chat.completions.create(
        model=os.environ.get("AZURE_AI_DEPLOYMENT_NAME", "gpt-4o"),
        messages=[{"role": "user", "content": prompt}],
        stream=True,
    )
    async for chunk in stream:
        if chunk.choices and (text := chunk.choices[0].delta.content):
            yield text

# A2A agent card — public metadata served at /a2a/.well-known/agent-card.json
agent_card = AgentCard(
    name="Chat",
    description="Streaming chat agent",
    version="1.0.0",
    supported_interfaces=[
        AgentInterface(url="http://localhost:8000/a2a/", protocol_binding="JSONRPC")
    ],
    capabilities=AgentCapabilities(streaming=True),
    default_input_modes=["text"],
    default_output_modes=["text"],
)

# Mount the A2A protocol server and the chat UI into your FastAPI app.
app = FastAPI()
app.mount(
    "/a2a",
    build_a2a_app(agent_card=agent_card, stream_invoke=build_stream_invoke(stream_chat)),
)
app.mount("/", a2a_ui)
pip install fast_a2a_app openai azure-identity
az login                                         # AzureCliCredential
export AZURE_AI_BASE_URL=https://<your-resource>.openai.azure.com
export AZURE_AI_DEPLOYMENT_NAME=gpt-4o
uvicorn main:app --reload

No Docker needed for local development — the default in-process MemoryTaskStore keeps task state in RAM. For multi-process / cross-instance deployments, pass task_store=RedisTaskStore.from_url(REDIS_URL) (or a MongoTaskStore / PostgresTaskStore) to build_a2a_app.

Open http://localhost:8000/ — you're chatting.


Built-in A2A-UI

A self-contained, zero-build browser chat — drop it in via app.mount("/", a2a_ui) and you have a working interface for trying, demoing, and sharing your agent. Streams tokens as they arrive, preserves multi-turn history across page reloads, and renders text, data, file, image, table, prompt-suggestion, and map artifacts inline. The renderer for each typed artifact lives in its own <TAG>.js file under fast_a2a_app/ui/renderers/, so adding a new chat widget is a one-file change — see How-to → UI rendering conventions.

A2A chat UI

Built-in debug view

A Debug tab in the chat UI surfaces full task state, JSON-RPC request/response payloads, and the streaming wire log — useful while iterating on tools, prompts, or multi-part artifacts.

Debug view


Examples

Example What it shows API key
Echo Agent Minimal integration — pure Python, no LLM No
Echo Multipart Streaming multi-part responses (text + JSON data + file download) No
Joke Agent Raw chat completions, no agent framework Azure OpenAI
Image Creator Multi-tool agent: image generation, web search, fullscreen viewer, prompt suggestions, in-agent slash commands Azure OpenAI
Holiday Planner Pydantic-ai agent with tools, live progress, interactive map of suggested destinations, artifact-aware quick-reply pills (click a destination to advance) Azure OpenAI

Every example follows the same two-file layout: agent.py owns the agent_card, invoke / stream_invoke, and any tools or prompts; main.py is a thin FastAPI composition root that imports them and calls build_a2a_app. Keeping the AgentCard next to the agent functions means the agent's metadata, skills, and behaviour live together.

The first three examples (Echo Agent, Echo Multipart, Joke Agent) run on the in-process MemoryTaskStore and need no external service. The remaining examples opt into Redis when REDIS_URL is set in the environment (and fall back to memory otherwise).

Running an example with Poetry

The examples don't ship their own requirements.txt — they share the parent project's dependency set, which already includes everything they need (openai, azure-identity, pydantic-ai, folium, …). One install at the repo root is enough for all examples.

# From the repo root — one-time
poetry install                                  # installs fast_a2a_app + all example deps

# Optional shared config (Azure-backed examples only)
cp examples/.env.example examples/.env          # then edit AZURE_AI_BASE_URL / AZURE_AI_DEPLOYMENT_NAME

# Run any example from its own directory
cd examples/joke-agent
poetry run uvicorn main:app --reload

The Azure-backed examples (joke-agent, image-creator, holiday-planner) additionally need az login for the AzureCliCredential bearer token. See each example's README for its specific extras (Redis, IMAGE_SIZE, etc.).


Scaffold a new agent with SKILL.md

The repo ships a SKILL.md at the project root, loosely following the Agent Skills format — drop it into any skills-aware coding agent (Claude Code, Cursor, etc.) and ask it to scaffold a fresh fast_a2a_app project for you. The skill encodes the agent.py contract (AgentCard, invoke / stream_invoke), the recommended pydantic-ai shape (Agent[AgentDeps] + @agent.tool + report_progress), the main.py boilerplate, and the hard rules / pitfalls / verification checklist.

Typical workflow for a brand-new repo:

mkdir my-agent && cd my-agent
git init
mkdir -p .claude/skills/FAST_A2A_APP
curl -L https://raw.githubusercontent.com/rembli/fast_a2a_app/main/SKILL.md \
  -o .claude/skills/FAST_A2A_APP/SKILL.md

Then prompt your agent: "Use the fast-a2a-app skill to scaffold a pydantic-ai-based holiday-planning agent with a recommend_destinations tool." The skill takes care of pyproject.toml deps, the two-file layout, the AgentCard fields, history wiring, and the boot test.


Documentation

  • Design choices — what the library is, what it isn't, and the trade-offs behind those decisions
  • Architecture — module layout, storage, conversation history injection, the streaming pipeline
  • API reference — every public symbol with parameters and examples
  • How-to guides — prompt management, multi-part artifacts, image uploads, progress reporting, custom storage backends

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fast_a2a_app-0.6.9.tar.gz (250.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

fast_a2a_app-0.6.9-py3-none-any.whl (219.7 kB view details)

Uploaded Python 3

File details

Details for the file fast_a2a_app-0.6.9.tar.gz.

File metadata

  • Download URL: fast_a2a_app-0.6.9.tar.gz
  • Upload date:
  • Size: 250.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: Hatch/1.16.5 cpython/3.12.4 HTTPX/0.28.1

File hashes

Hashes for fast_a2a_app-0.6.9.tar.gz
Algorithm Hash digest
SHA256 5cdf2a2020dc60e304e52f5eccf67ddc2ac8ef13225131e50a15e28805621a4c
MD5 48ac4a84c453dcf6202276f3a6a307e7
BLAKE2b-256 d830117067143358089e530c56d71ce1d0145f712f1095dadc5d7b55490fc1fb

See more details on using hashes here.

File details

Details for the file fast_a2a_app-0.6.9-py3-none-any.whl.

File metadata

  • Download URL: fast_a2a_app-0.6.9-py3-none-any.whl
  • Upload date:
  • Size: 219.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: Hatch/1.16.5 cpython/3.12.4 HTTPX/0.28.1

File hashes

Hashes for fast_a2a_app-0.6.9-py3-none-any.whl
Algorithm Hash digest
SHA256 0b9e943a1d317c5a6d804edd13170008d49e66cd0351c9a9a649842748de9c45
MD5 38124003a8ce7a75a57ea66581182205
BLAKE2b-256 2e666e587db0180bbd634f8817066d6e354754bb8baab8c9200cda4d571b9cf6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page