A hierarchical multi-agent Deep Agent harness built on Pydantic AI

These details have not been verified by PyPI

Project description

PydanTask: Deep Agentic Harness for Pydantic AI

Pydantask Logo

PydanTask is a Harness for building deep, autonomous agents that don't just respond—they reason, decompose, and execute.

It builds on top of Pydantic AI and adds:

Long‑horizon planning and hierarchical task management
A reusable orchestration loop (DeepAgent) with supervisor (dynamic planner) → worker/researcher/producer → critic
Shared runtime state across agents (RuntimeState)
Extensible capabilities via CapabilityDescription and custom tools/agents

The goal is to give you a solid “agentic backbone” you can adapt, without having to reinvent multi‑step planning and control logic yourself.

For the most up‑to‑date implementation details and API docs, see the hosted documentation: PydanTask Documentation.

High-Level Architecture

The core orchestrator is DeepAgent:

from pydantask.agents.agent import DeepAgent

DeepAgent coordinates several built‑in agents:

Supervisor – plans and chooses which tasks to run next, based on statuses and dependencies.
Researcher – performs web/external research for tasks that need new information.
Producer – synthesizes intermediate results into a final answer or artifact.
Critic – evaluates task outputs and drives deterministic retry/fail transitions.

They all operate over a shared RuntimeState:

from pydantask.models import RuntimeState, TaskItem, TaskResult, Plan

Key concepts:

Plan (Plan):
- reasoning_steps: planner’s internal notes
- tasks: list of TaskItem instances
TaskItem: one sub‑task in the plan, with:
- task_id, overall_objective, sub_task_objective
- capability (which sub‑agent to use, e.g. "research_agent")
- sub_task_dependencies (other task IDs that must complete first)
- status (TaskStatus: PENDING, READY, RUNNING, NEEDS_REVIEW, COMPLETED, FAILED, ERRORED, RERUN)
- result (TaskResult) and task_feedback (TaskQAResult)
RuntimeState:
- plan: Dict[int, TaskItem]
- objective: str
- agent_registry: Dict[str, CapabilityDescription]
- document_store, knowledge_store, runtime_steps, etc.

The control loop in DeepAgent.run():

Supervisor incrementally builds a task DAG for the objective (via tools like add_task).
RuntimeState is initialized with the capability registry.
In each cycle:
- Supervisor decides which tasks to execute next based on plan progress, task dependencies, and self reflection.
- Ready tasks, so long as dependencies are satisfied, are executed by the appropriate capability (sub‑agent).
- Critic reviews each result and produces a "QA" report for the supervisor to review if the task failed.
Loop stops when the Supervisor sets all_tasks_completed = True or max_steps is reached.

For more detail, see docs/agents.md.

Installation & Setup

PydanTask assumes you already have Pydantic AI and an OpenAI‑compatible model configured. You’ll also need a Tavily API key for the built‑in research agent.

1. Install dependencies

From your project root:

pip install pydantask

(or however you manage your environment; if you use Poetry, adjust accordingly.)

2. Environment variables

Set the following environment variables (e.g. in your shell or a .env file):

OPENAI_API_KEY – for the underlying OpenAIChatModel (or whatever your Pydantic AI provider expects).
TAVILY_API_KEY – used by the research_agent (via tavily_search_tool). If this key is not set, defaults to DuckDuckGo search.

Quickstart: Running a DeepAgent

Minimal example that creates a DeepAgent and runs it on a single objective:

import asyncio

from pydantask.agents.agent import DeepAgent

async def main() -> None:
    agent = DeepAgent(
        prompt="Write an overview of ghost lights folklore and summarize scientific explanations.",
        model="gpt-4.1-mini",  # or any compatible OpenAIChatModel name
        max_steps=10,
    )

    run_result = await agent.run()
    runtime_state = run_result.runtime_state

    # Inspect the final plan and results
    for task_id, task in sorted(runtime_state.plan.items()):
        print(f"Task {task_id} [{task.status}]: {task.sub_task_objective}")
        if task.result is not None:
            print("  Summary:", task.result.summary)
            # The main long-form output for the task is stored in-memory:
            print("  Detailed output:", (task.result.detailed_output or "<empty>"))
            print()

if __name__ == "__main__":
    asyncio.run(main())

What this does:

Constructs a DeepAgent with default Supervisor, Researcher, Producer, and Critic.
Supervisor (dynamic DAG architect) breaks down the objective into TaskItems using built-in capabilities.
Supervisor picks tasks to run in each loop iteration.
Researcher and Producer execute those tasks and return structured TaskResults.
Critic evaluates each task result and marks tasks as COMPLETED, retryable (READY/RERUN), or FAILED based on configured retry limits.
When done, you get a RuntimeState with the full plan and results.

Note: this harness currently treats task artifacts as in-memory outputs (e.g. TaskResult.detailed_output). File persistence is intentionally out-of-scope for now.

Customizing Capabilities

You can add custom sub‑agents or tools via CapabilityDescription and the sub_agents argument.

Example: custom agent capability

from pydantic_ai import Agent
from pydantask.agents.agent import DeepAgent
from pydantask.models import CapabilityDescription, RuntimeState, TaskResult

my_special_agent = Agent(
    model=...,  # e.g. the same OpenAIChatModel
    name="_my_special_agent",
    system_prompt="You are a specialized agent for security analysis.",
    deps_type=RuntimeState,
    output_type=TaskResult,
    tools=[...],  # any tools it needs
)

custom_capability = CapabilityDescription(
    name="security_agent",  # used in TaskItem.capability
    description="Performs security-focused analysis and risk assessment.",
    tool_func=my_special_agent,
)

agent = DeepAgent(
    prompt="Assess the security posture of this web application.",
    sub_agents=[custom_capability],
)

# Now the Planner can choose `security_agent` as a capability in the plan.

Example: simple function capability

from pydantic_ai import RunContext
from pydantask.agents.agent import DeepAgent
from pydantask.models import CapabilityDescription, RuntimeState

async def my_utility_tool(ctx: RunContext[RuntimeState], payload: str) -> str:
    # do something simple with ctx.deps and payload
    return f"processed: {payload}"

utility_capability = CapabilityDescription(
    name="my_utility_tool",
    description="Utility capability that performs a simple transformation.",
    tool_func=my_utility_tool,
)

agent = DeepAgent(prompt="Some goal...", sub_agents=[utility_capability])

For more customization details, see:

docs/customization.md
docs/tools.md
docs/agents.md

Running Unit Tests

Tests live under the test/ directory and are written to be compatible with both pytest and the standard library unittest.

Recommended: pytest

From the repository root:

pip install pytest
pytest

Using unittest directly

If you prefer unittest, you can still run the suite with:

python -m unittest discover -s test -p "test_*.py"

Make sure required environment variables (e.g. TAVILY_API_KEY, OPENAI_API_KEY) are set, or that tests patch them appropriately (as in test/test_agent.py).

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.0a2 pre-release

Apr 29, 2026

0.1.0a1 pre-release

Apr 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pydantask-0.1.0a2.tar.gz (55.3 kB view details)

Uploaded Apr 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pydantask-0.1.0a2-py3-none-any.whl (54.3 kB view details)

Uploaded Apr 29, 2026 Python 3

File details

Details for the file pydantask-0.1.0a2.tar.gz.

File metadata

Download URL: pydantask-0.1.0a2.tar.gz
Upload date: Apr 29, 2026
Size: 55.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pydantask-0.1.0a2.tar.gz
Algorithm	Hash digest
SHA256	`46676be3b01159b7fb77f1325fff4ce9286cfa2446c296240b22c9f296a14013`
MD5	`e47f2ac8a749e794fcd0ea12a565fb82`
BLAKE2b-256	`76971565fb0a3f7d3b1b0e6fae7bdfed4119fdc4763251d7ee9ca122be6ecd91`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pydantask-0.1.0a2.tar.gz:

Publisher: pydantask.yml on GeorgeDittmar/pydantask

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pydantask-0.1.0a2.tar.gz
- Subject digest: 46676be3b01159b7fb77f1325fff4ce9286cfa2446c296240b22c9f296a14013
- Sigstore transparency entry: 1399112564
- Sigstore integration time: Apr 29, 2026
Source repository:
- Permalink: GeorgeDittmar/pydantask@a6738c2858eac987711508602450b0bdb91c2a30
- Branch / Tag: refs/tags/v0.1.0a3
- Owner: https://github.com/GeorgeDittmar
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pydantask.yml@a6738c2858eac987711508602450b0bdb91c2a30
- Trigger Event: release

File details

Details for the file pydantask-0.1.0a2-py3-none-any.whl.

File metadata

Download URL: pydantask-0.1.0a2-py3-none-any.whl
Upload date: Apr 29, 2026
Size: 54.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pydantask-0.1.0a2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6c1e289245f53890ee2b8307ccfd7cff78c417ffd413974e2c7aea5332984155`
MD5	`65f1105ba518d798a95b5b20ac7512ed`
BLAKE2b-256	`417a40334cf5f55ea4b7c52f82bbeae593535e22f075ef1969414f7070b7ffff`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pydantask-0.1.0a2-py3-none-any.whl:

Publisher: pydantask.yml on GeorgeDittmar/pydantask

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pydantask-0.1.0a2-py3-none-any.whl
- Subject digest: 6c1e289245f53890ee2b8307ccfd7cff78c417ffd413974e2c7aea5332984155
- Sigstore transparency entry: 1399112576
- Sigstore integration time: Apr 29, 2026
Source repository:
- Permalink: GeorgeDittmar/pydantask@a6738c2858eac987711508602450b0bdb91c2a30
- Branch / Tag: refs/tags/v0.1.0a3
- Owner: https://github.com/GeorgeDittmar
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pydantask.yml@a6738c2858eac987711508602450b0bdb91c2a30
- Trigger Event: release

pydantask 0.1.0a2

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

PydanTask: Deep Agentic Harness for Pydantic AI

High-Level Architecture

Installation & Setup

1. Install dependencies

2. Environment variables

Quickstart: Running a DeepAgent

Customizing Capabilities

Example: custom agent capability

Example: simple function capability

Running Unit Tests

Recommended: pytest

Using unittest directly

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance