Deep Agent framework built on Pydantic-ai with planning, filesystem, and subagent capabilities

These details have not been verified by PyPI

Project description

pydantic-deep

Pydantic AI Deep Agents Framework

Build Claude Code-Style AI Agents — In 10 Lines of Python

🔄 Unlimited Context via summarization • 🤖 Subagent Delegation sync & async • 🧩 Modular use only what you need • 🎯 Fully Type-Safe

See It In Action

Get Started in 60 Seconds

pip install pydantic-deep

from pydantic_ai_backends import StateBackend
from pydantic_deep import create_deep_agent, create_default_deps

agent = create_deep_agent()
deps = create_default_deps(StateBackend())

result = await agent.run("Create a todo list for building a REST API", deps=deps)

That's it. Or use the CLI directly from your terminal:

pip install pydantic-deep[cli]

# Run a task
pydantic-deep run "Create a REST API with FastAPI"

# Interactive chat
pydantic-deep chat

# Run in Docker sandbox
pydantic-deep run "Build a web scraper" --sandbox --runtime python-web

Your agent can now:

✅ Plan tasks — break down complex work into steps
✅ Read & write files — navigate and modify codebases
✅ Delegate to subagents — spawn specialists for specific tasks
✅ Load skills — use domain-specific instructions
✅ Manage context — handle unlimited conversation length
✅ Checkpoint & rewind — save conversation state, rewind, or fork sessions
✅ Agent teams — shared TODO lists and peer-to-peer messaging
✅ Persistent memory — remember facts across sessions via MEMORY.md
✅ Lifecycle hooks — Claude Code-style hooks for audit, safety gates, and custom logic
✅ Output styles — built-in or custom response formatting
✅ Cost tracking — token/USD budgets with automatic enforcement
✅ Middleware — composable before/after hooks with permission handling
✅ Context files — auto-inject DEEP.md, AGENTS.md, CLAUDE.md into system prompt
✅ Context manager — hybrid summarization + sliding window middleware
✅ Custom tool descriptions — override any tool's built-in description via descriptions parameter
✅ Custom commands — user-defined slash commands from .md files (built-in, user, and project scopes)

Same Architecture as the Best

pydantic-deep implements the deep agent architecture — the same patterns powering:

	Product	What They Built
🤖	Claude Code	Anthropic's AI coding assistant
🦾	Manus AI	Autonomous task execution
👨‍💻	Devin	AI software engineer

Now you can build the same thing.

Inspired by: This framework is also inspired by LangChain's Deep Agents research on autonomous agent architectures.

DeepResearch — Full-Featured Reference App

Planner subagent asks clarifying questions

Plan Mode — planner asks clarifying questions before research

Parallel Subagents — 5 agents researching simultaneously

Features

Core Toolsets

🧠 Planning — pydantic-ai-todo

Task tracking with read_todos / write_todos. Subtasks & dependencies with cycle detection. PostgreSQL storage. Event system for webhooks.

📁 Filesystem — pydantic-ai-backend

Full access: ls, read_file, write_file, edit_file, glob, grep, execute. Docker sandbox for isolation. Permission system (allow/deny/ask). Session manager for multi-user apps.

🤖 Subagents — subagents-pydantic-ai

Delegate with task in sync or async mode. Background task management. Dynamic agent creation at runtime. Soft/hard cancellation.

💬 Summarization — summarization-pydantic-ai

Two modes: LLM-based intelligent summaries or zero-cost sliding window. Trigger on tokens, messages, or context fraction. Custom prompts.

🛡️ Middleware — pydantic-ai-middleware

7 lifecycle hooks: before_run, after_run, before_model_request, before_tool_call, after_tool_call, on_tool_error, on_error. Composable chains. Permission handling.

Advanced Features

💾 Checkpointing — Save conversation state at intervals. Rewind to any checkpoint, or fork into a new session from a past state. In-memory and file-based stores.

👥 Agent Teams — Shared TODO lists with claiming and dependency tracking. Peer-to-peer message bus between team members. Spawn, assign, and dissolve teams via tools.

🪝 Hooks — Claude Code-style lifecycle hooks. Run shell commands or scripts on events like before_tool_call or after_run. Use for audit logging, safety gates, or custom side effects.

🧠 Persistent Memory — Agents read/write a MEMORY.md file that persists across sessions. Automatic injection into system prompt. Tools: read_memory, write_memory.

📄 Context Files — Auto-discover and inject DEEP.md, AGENTS.md, CLAUDE.md, and SOUL.md from the working directory into the system prompt.

🎨 Output Styles — Built-in styles (concise, detailed, markdown, etc.) or load custom styles from files. Control response formatting via output_style.

📋 Plan Mode — Dedicated plan mode subagent for structured planning before execution. Separate toolset with plan-specific instructions.

💰 Cost Tracking — Track token usage and USD costs per run. Set budgets with automatic enforcement. Callbacks for real-time cost updates.

📦 Eviction Processor — Automatically evict large tool outputs to files when they exceed token limits. Keeps conversation lean while preserving data access.

🔧 Patch Tool Calls — On session resume, patch stale tool call results so the model sees clean history without re-executing tools.

🏷️ Custom Tool Descriptions — All toolset factories accept a descriptions parameter to override any tool's built-in description. Useful for tuning tool usage behavior without forking toolsets.

⚡ Custom Commands — User-triggered slash commands from .md files. Built-in commands include /commit, /pr, /review, /test, /fix, /explain. Three-scope discovery: built-in, user (~/.pydantic-deep/commands/), and project (.pydantic-deep/commands/).

Built-in Capabilities

🎯 Skills — Load domain instructions from markdown files with YAML frontmatter.

📊 Structured Output — Type-safe responses with Pydantic models via output_type.

👤 Human-in-the-Loop — Built-in confirmation workflows for sensitive operations.

⚡ Streaming — Full streaming support for real-time responses.

🖼️ Image Support — Pass images to the agent for multi-modal analysis.

Use Cases

What You Want to Build	Key Components
AI Coding Assistant	Planning + Filesystem + Skills + Memory + Hooks
Data Analysis Agent	File Uploads + Structured Output + Teams
Document Processor	Filesystem + Summarization + Eviction
Research Agent	Subagents + Planning + Checkpointing — see DeepResearch
Project Scaffolder	Planning + Filesystem + Output Styles
Test Generator	Filesystem + Docker Sandbox + Cost Tracking
Multi-Agent Workflow	Teams + Subagents + Middleware
Audited Enterprise Agent	Hooks + Middleware + Cost Tracking

Reference app: DeepResearch is a full-featured research agent built with pydantic-deep. It includes a web UI, MCP-powered web search, Excalidraw diagrams, code execution in Docker, and more. See apps/deepresearch/README.md for setup instructions.

CLI — Your Terminal AI Assistant

The pydantic-deep CLI gives you a Claude Code-style experience powered by the full framework.

pip install pydantic-deep[cli]

Commands

# Non-interactive (benchmark mode) — stdout is clean, diagnostics go to stderr
pydantic-deep run "Fix the failing tests in src/" --model openai:gpt-4.1

# Interactive chat with streaming and tool visibility
pydantic-deep chat

# Docker sandbox for isolated execution
pydantic-deep run "Set up a Django project" --sandbox --runtime python-web

# Manage skills
pydantic-deep skills list                     # List built-in + user skills
pydantic-deep skills info code-review         # Show skill details
pydantic-deep skills create my-skill          # Scaffold a new skill

# Manage conversation threads
pydantic-deep threads list                    # List saved sessions
pydantic-deep threads delete abc12345         # Delete a session

# Configuration
pydantic-deep config show                     # Show current config
pydantic-deep config set model anthropic:claude-sonnet-4-20250514

Configuration

Config file: ~/.pydantic-deep/config.toml

model = "openai:gpt-4.1"
include_skills = true
include_plan = true
include_memory = true
shell_allow_list = ["python", "pip", "npm", "make"]

CLI arguments override config file values.

Built-in Skills

Skill	Description
`skill-creator`	Create new reusable skills from conversation context
`code-review`	Systematic code review for bugs, security, style, and performance
`test-writer`	Generate comprehensive test suites for existing code
`refactor`	Refactor code to improve structure and maintainability
`git-workflow`	Git operations: commits, branches, PRs, and conflict resolution

Skills are SKILL.md files — create your own with pydantic-deep skills create my-skill.

Sandbox Runtimes

Runtime	Description
`python-minimal`	Python 3.12 (default)
`python-datascience`	Python + numpy, pandas, matplotlib
`python-web`	Python + FastAPI, Django, Flask
`node-minimal`	Node.js 20
`node-react`	Node.js + React, Next.js

pydantic-deep run "Analyze this CSV" --sandbox --runtime python-datascience

What's Included

The CLI wraps the full pydantic-deep framework with all features enabled by default:

Planning (TodoToolset) + delegation (SubAgentToolset)
Filesystem access (ConsoleToolset) + shell execution
Skills, memory, checkpoints, context files
Custom commands — /commit, /pr, /review, /test, /fix, /explain + user/project commands
Loop detection middleware + context management
Git/directory context injection into system prompt
Cost tracking with real-time display
Rich terminal UI — colored unified diffs for file approvals, context progress bar with threshold colors, tool call timing with success/error states, seamless streaming from spinner to Markdown

Modular — Use What You Need

Every component works standalone:

Component	Package	Use It For
Backends	pydantic-ai-backend	File storage, Docker sandbox
Planning	pydantic-ai-todo	Task tracking
Subagents	subagents-pydantic-ai	Task delegation
Summarization	summarization-pydantic-ai	Context management
Middleware	pydantic-ai-middleware	Lifecycle hooks, permissions

Full-stack template? fastapi-fullstack — Production-ready with FastAPI + Next.js

Go Deeper

Structured Output

from pydantic import BaseModel

class CodeReview(BaseModel):
    summary: str
    issues: list[str]
    score: int

agent = create_deep_agent(output_type=CodeReview)
result = await agent.run("Review the auth module", deps=deps)
print(result.output.score)  # Type-safe!

File Uploads

from pydantic_deep import run_with_files

with open("data.csv", "rb") as f:
    result = await run_with_files(
        agent,
        "Analyze this data and find trends",
        deps,
        files=[("data.csv", f.read())],
    )

Context Management

from pydantic_deep import create_summarization_processor

processor = create_summarization_processor(
    trigger=("tokens", 100000),
    keep=("messages", 20),
)
agent = create_deep_agent(history_processors=[processor])

Custom Subagents

agent = create_deep_agent(
    subagents=[
        {
            "name": "code-reviewer",
            "description": "Reviews code for quality issues",
            "instructions": "You are a senior code reviewer...",
            "preferred_mode": "sync",
        },
    ],
)

Checkpointing & Rewind

from pydantic_deep import create_deep_agent

agent = create_deep_agent(
    include_checkpoints=True,
    checkpoint_frequency="every_tool",  # "every_tool", "every_turn", or "manual_only"
    max_checkpoints=20,
)
# Agent gets tools: save_checkpoint, list_checkpoints, rewind_to

Agent Teams

agent = create_deep_agent(include_teams=True)
# Agent gets tools: spawn_team, assign_task, check_teammates,
#                   message_teammate, dissolve_team

Hooks (Claude Code-Style)

from pydantic_deep import Hook, HookEvent

agent = create_deep_agent(
    hooks=[
        Hook(
            event=HookEvent.PRE_TOOL_USE,
            command="echo 'Tool called: $TOOL_NAME' >> /tmp/audit.log",
        ),
    ],
)

Persistent Memory

agent = create_deep_agent(include_memory=True, memory_dir="./agent-data")
# Agent reads MEMORY.md on start, can write_memory to persist facts

Cost Tracking

agent = create_deep_agent(
    cost_tracking=True,
    cost_budget_usd=5.0,  # Stop if costs exceed $5
    on_cost_update=lambda info: print(f"Cost: ${info.total_usd:.4f}"),
)

Middleware

from pydantic_ai_middleware import before_tool_call, ToolDecision

@before_tool_call
async def audit(ctx, tool_name, tool_args):
    print(f"Calling {tool_name}")
    return ToolDecision.ALLOW

agent = create_deep_agent(middleware=[audit])

Output Styles

agent = create_deep_agent(output_style="concise")  # Built-in: concise, explanatory, formal, conversational

# Or load custom styles from a directory
agent = create_deep_agent(output_style="my-style", styles_dir="./styles")

Context Files

agent = create_deep_agent(
    context_files=["DEEP.md", "AGENTS.md"],  # Explicit files
    context_discovery=True,  # Auto-discover DEEP.md, CLAUDE.md, SOUL.md in working dir
)

Plan Mode

agent = create_deep_agent(include_plan=True, plans_dir="./plans")
# Agent gets a dedicated plan mode subagent for structured planning

Context Manager (Hybrid)

## Combines token tracking + auto-compression in a single middleware
agent = create_deep_agent(
    context_manager=True,
    context_manager_max_tokens=100000,
)

Skills

Create ~/.pydantic-deep/skills/review/SKILL.md:

---
name: code-review
description: Review Python code for quality
---

# Code Review Skill

Check for:
- [ ] Security issues
- [ ] Type hints
- [ ] Error handling

agent = create_deep_agent(
    skill_directories=[{"path": "~/.pydantic-deep/skills", "recursive": True}],
)

Custom Tool Descriptions

from pydantic_deep import create_deep_agent

agent = create_deep_agent(
    descriptions={
        "write_file": "Create or overwrite a file. Always confirm with the user first.",
        "execute": "Run a shell command in the working directory.",
    },
)

All toolset factories (CheckpointToolset, AgentMemoryToolset, create_team_toolset(), create_plan_toolset(), SkillsToolset, create_web_toolset()) also accept a descriptions parameter individually.

Project Structure

pydantic_deep/          # Core library — agent factory, toolsets, processors, types
cli/                    # CLI frontend — terminal UI, commands, interactive chat
apps/                   # Applications built on pydantic-deep
  swebench_agent/       #   SWE-bench evaluation runner
  harbor_agent/         #   Harbor benchmark adapter
  deepresearch/         #   Full-featured research agent with web UI
tests/                  # Unit tests (100% coverage required)
docs/                   # MkDocs documentation source

Architecture

                              pydantic-deep
┌─────────────────────────────────────────────────────────────────────┐
│                                                                     │
│   ┌──────────┐ ┌──────────┐ ┌──────────┐ ┌──────────┐ ┌─────────┐  │
│   │ Planning │ │Filesystem│ │ Subagents│ │  Skills  │ │  Teams  │  │
│   └────┬─────┘ └────┬─────┘ └────┬─────┘ └────┬─────┘ └────┬────┘  │
│        │            │            │            │            │        │
│        └────────────┴─────┬──────┴────────────┴────────────┘        │
│                           │                                         │
│                           ▼                                         │
│  Summarization ──► ┌──────────────────┐ ◄── Middleware              │
│  Checkpointing ──► │    Deep Agent    │ ◄── Hooks                   │
│  Cost Tracking ──► │   (pydantic-ai)  │ ◄── Memory                  │
│                    └────────┬─────────┘                              │
│                             │                                       │
│           ┌─────────────────┼─────────────────┐                     │
│           ▼                 ▼                 ▼                     │
│    ┌────────────┐    ┌────────────┐    ┌────────────┐               │
│    │   State    │    │   Local    │    │   Docker   │               │
│    │  Backend   │    │  Backend   │    │  Sandbox   │               │
│    └────────────┘    └────────────┘    └────────────┘               │
│                                                                     │
└─────────────────────────────────────────────────────────────────────┘

Related Projects

pydantic-ai - The foundation: Agent framework by Pydantic
pydantic-ai-backend - File storage and sandbox backends
pydantic-ai-todo - Task planning toolset
subagents-pydantic-ai - Multi-agent orchestration
summarization-pydantic-ai - Context management
pydantic-ai-middleware - Middleware system with lifecycle hooks
DeepResearch - Full-featured research agent built with pydantic-deep (included in this repo)
SWE-bench Agent - SWE-bench evaluation runner (included in this repo)
Harbor Agent - Harbor benchmark adapter (included in this repo)
fastapi-fullstack - Full-stack AI app template
deepagents - Deep Agent implementation by LangChain (inspiration)

Contributing

git clone https://github.com/vstorm-co/pydantic-deepagents.git
cd pydantic-deepagents
make install
make test  # 100% coverage required
make all   # lint + typecheck + test

See CONTRIBUTING.md for full guidelines.

Star History

License

MIT — see LICENSE

Need help implementing this in your company?

We're Vstorm — an Applied Agentic AI Engineering Consultancy
with 30+ production AI agent implementations.

Made with ❤️ by Vstorm

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.3.18

May 5, 2026

0.3.17

Apr 22, 2026

0.3.16

Apr 22, 2026

0.3.15

Apr 17, 2026

0.3.14

Apr 16, 2026

0.3.13

Apr 13, 2026

0.3.12

Apr 13, 2026

0.3.11

Apr 13, 2026

0.3.10

Apr 13, 2026

0.3.9

Apr 12, 2026

0.3.8

Apr 12, 2026

0.3.7

Apr 11, 2026

0.3.6

Apr 11, 2026

0.3.5

Apr 11, 2026

0.3.4

Apr 9, 2026

0.3.3

Apr 2, 2026

0.3.2

Mar 31, 2026

0.3.1

Mar 31, 2026

0.3.0

Mar 30, 2026

0.2.21

Mar 19, 2026

0.2.20

Mar 12, 2026

0.2.19

Mar 6, 2026

This version

0.2.18

Feb 27, 2026

0.2.17

Feb 17, 2026

0.2.16

Feb 12, 2026

0.2.15

Feb 7, 2026

0.2.14

Jan 23, 2026

0.2.13

Jan 17, 2026

0.2.12

Jan 16, 2026

0.2.11

Jan 15, 2026

0.2.9

Dec 28, 2025

0.2.7

Dec 23, 2025

0.2.6

Dec 10, 2025

0.2.2

Dec 8, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pydantic_deep-0.2.18.tar.gz (10.2 MB view details)

Uploaded Feb 27, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pydantic_deep-0.2.18-py3-none-any.whl (2.1 MB view details)

Uploaded Feb 27, 2026 Python 3

File details

Details for the file pydantic_deep-0.2.18.tar.gz.

File metadata

Download URL: pydantic_deep-0.2.18.tar.gz
Upload date: Feb 27, 2026
Size: 10.2 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pydantic_deep-0.2.18.tar.gz
Algorithm	Hash digest
SHA256	`1bb7d97d7dc5d5bcb7009bcde79bc41f1e9a3aba3bf4179fe0d2b0a2050b438b`
MD5	`035c805bed663824d0a78a4696619f13`
BLAKE2b-256	`7ab6d0169a6070edc680a6312c4d009ceb84accc5451c1a436565ca58c714a9d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pydantic_deep-0.2.18.tar.gz:

Publisher: publish.yml on vstorm-co/pydantic-deepagents

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pydantic_deep-0.2.18.tar.gz
- Subject digest: 1bb7d97d7dc5d5bcb7009bcde79bc41f1e9a3aba3bf4179fe0d2b0a2050b438b
- Sigstore transparency entry: 1004358483
- Sigstore integration time: Feb 27, 2026
Source repository:
- Permalink: vstorm-co/pydantic-deepagents@850fb95b45ff20cf7b7daf85679bd4efe5a25dc0
- Branch / Tag: refs/tags/0.2.18
- Owner: https://github.com/vstorm-co
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@850fb95b45ff20cf7b7daf85679bd4efe5a25dc0
- Trigger Event: release

File details

Details for the file pydantic_deep-0.2.18-py3-none-any.whl.

File metadata

Download URL: pydantic_deep-0.2.18-py3-none-any.whl
Upload date: Feb 27, 2026
Size: 2.1 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pydantic_deep-0.2.18-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e97218d19e423cf2de5614926ca8d90bd86fa361373ef8cd534c16afc7a1ea45`
MD5	`a64c38996081f721bb0d73e26952d0d6`
BLAKE2b-256	`57fcd48bc16f6cbac69d7f2f32126080f2d9e3ba908646cea718a2c85dd50397`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pydantic_deep-0.2.18-py3-none-any.whl:

Publisher: publish.yml on vstorm-co/pydantic-deepagents

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pydantic_deep-0.2.18-py3-none-any.whl
- Subject digest: e97218d19e423cf2de5614926ca8d90bd86fa361373ef8cd534c16afc7a1ea45
- Sigstore transparency entry: 1004358484
- Sigstore integration time: Feb 27, 2026
Source repository:
- Permalink: vstorm-co/pydantic-deepagents@850fb95b45ff20cf7b7daf85679bd4efe5a25dc0
- Branch / Tag: refs/tags/0.2.18
- Owner: https://github.com/vstorm-co
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@850fb95b45ff20cf7b7daf85679bd4efe5a25dc0
- Trigger Event: release

pydantic-deep 0.2.18

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Pydantic AI Deep Agents Framework

See It In Action

Get Started in 60 Seconds

Same Architecture as the Best

DeepResearch — Full-Featured Reference App

Features

Core Toolsets

Advanced Features

Built-in Capabilities

Use Cases

CLI — Your Terminal AI Assistant

Commands

Configuration

Built-in Skills

Sandbox Runtimes

What's Included

Modular — Use What You Need

Go Deeper

Structured Output

File Uploads

Context Management

Custom Subagents

Checkpointing & Rewind

Agent Teams

Hooks (Claude Code-Style)

Persistent Memory

Cost Tracking

Middleware

Output Styles

Context Files

Plan Mode

Context Manager (Hybrid)

Skills

Custom Tool Descriptions

Project Structure

Architecture

Related Projects

Contributing

Star History

License

Need help implementing this in your company?

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance