Code execution mode for UTCP - enables executing Python code chains with tool access.

🐍 Python Code-Mode Library: Tool calling via code execution

Transform your AI agents from clunky tool callers into efficient code executors — in Python.

Why This Changes Everything

LLMs excel at writing code but struggle with tool calls. Instead of exposing hundreds of tools directly, give them ONE tool that executes Python code with access to your entire toolkit.

Apple, Cloudflare, and Anthropic have all reported that Code Mode is more efficient than traditional tool calling, where function definitions are dumped into the context and the model emits JSON for each call.

Benchmarks

An independent Python benchmark study supports the performance claims, reporting $9,536/year in cost savings at 1,000 scenarios/day:

Scenario Complexity	Traditional	Code Mode	Improvement
Simple (2-3 tools)	3 iterations	1 execution	67% faster
Medium (4-7 tools)	8 iterations	1 execution	75% faster
Complex (8+ tools)	16 iterations	1 execution	88% faster

Why Code Mode Dominates:

Batching Advantage - Single code block replaces multiple API calls
Cognitive Efficiency - LLMs excel at code generation vs. tool orchestration
Computational Efficiency - No context re-processing between operations
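
To make the "no context re-processing" point concrete, here is a back-of-the-envelope model. It is a sketch, not data from the benchmark: it assumes every traditional iteration re-sends the base context plus all earlier tool results, and the token counts (`BASE`, `RESULT`) are made-up illustrative values.

```python
def traditional_input_tokens(iterations: int, base_tokens: int, result_tokens: int) -> int:
    """Input tokens when each iteration re-sends the base context plus all prior tool results."""
    return sum(base_tokens + i * result_tokens for i in range(iterations))


def code_mode_input_tokens(base_tokens: int) -> int:
    """Input tokens when one code block does all the work: the context is sent once."""
    return base_tokens


# Illustrative values only (assumptions, not benchmark measurements).
BASE, RESULT = 2_000, 500

for n in (3, 8, 16):
    print(n, traditional_input_tokens(n, BASE, RESULT), code_mode_input_tokens(BASE))
# 3 7500 2000
# 8 30000 2000
# 16 92000 2000
```

Under these assumptions the traditional cost grows roughly quadratically with the number of iterations, while the single-execution cost stays flat.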

Getting Started

Get Started in 3 Lines

from utcp_code_mode import CodeModeUtcpClient

client = await CodeModeUtcpClient.create()                    # 1. Initialize
await client.register_manual({'name': 'github', ...})         # 2. Add tools  
result = await client.call_tool_chain("# Python code here")   # 3. Execute code

That's it. Your AI agent can now execute complex workflows in a single request instead of dozens.

What You Get

Progressive Tool Discovery

# Agent discovers tools dynamically, loads only what it needs
tools = await client.search_tools('github pull request')
# Instead of 500 tool definitions → 3 relevant tools
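
The idea behind progressive discovery can be sketched with a plain keyword match. This is not how `search_tools` is implemented (the source does not say); `ToolInfo`, `search_tools_sketch`, and the catalog entries are hypothetical names used only for illustration.

```python
from dataclasses import dataclass


@dataclass
class ToolInfo:
    name: str
    description: str


def search_tools_sketch(tools: list[ToolInfo], query: str, limit: int = 10) -> list[ToolInfo]:
    """Rank tools by how many query words appear in their name or description."""
    words = query.lower().split()
    scored = []
    for tool in tools:
        haystack = f"{tool.name} {tool.description}".lower()
        score = sum(1 for word in words if word in haystack)
        if score:
            scored.append((score, tool))
    # sort() is stable, so tools with equal scores keep their catalog order
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [tool for _, tool in scored[:limit]]


catalog = [
    ToolInfo("github.get_pull_request", "Fetch a pull request from GitHub"),
    ToolInfo("slack.post_message", "Post a message to a Slack channel"),
    ToolInfo("github.list_repository_issues", "List issues in a GitHub repository"),
]
matches = search_tools_sketch(catalog, "github pull request")
print([tool.name for tool in matches])
# ['github.get_pull_request', 'github.list_repository_issues']
```

Only the matching definitions would then be shown to the model, which is what keeps the context small.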

Natural Code Execution

result = await client.call_tool_chain('''
# Chain multiple operations in one request
pr = await github.get_pull_request(owner='microsoft', repo='vscode', pull_number=1234)
comments = await github.get_pull_request_comments(owner='microsoft', repo='vscode', pull_number=1234)
reviews = await github.get_pull_request_reviews(owner='microsoft', repo='vscode', pull_number=1234)

# Process data efficiently in-sandbox
return {
    "title": pr["title"],
    "commentCount": len(comments),
    "approvals": len([r for r in reviews if r["state"] == "APPROVED"])
}
''')
# One execution replaces three separate tool-call round trips
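
The snippets passed to `call_tool_chain` use top-level `await` and `return`, which plain Python does not allow at module level. One way an executor could support this is to wrap the snippet in an async function before running it. The sketch below shows that idea with the standard library only; `run_chain_sketch` and `FakeGithub` are hypothetical stand-ins, and the real library may work differently.

```python
import asyncio
import textwrap


class FakeGithub:
    """Stand-in tool namespace; a real client would forward calls to registered tools."""

    async def get_pull_request(self, owner: str, repo: str, pull_number: int) -> dict:
        return {"title": f"{owner}/{repo}#{pull_number}"}


async def run_chain_sketch(code: str, namespaces: dict) -> object:
    """Wrap the snippet in an async function so top-level await/return become legal."""
    wrapped = "async def _chain_main():\n" + textwrap.indent(code, "    ")
    scope = dict(namespaces)   # tool namespaces become globals of the snippet
    exec(wrapped, scope)       # defines _chain_main inside `scope`
    return await scope["_chain_main"]()


snippet = """
pr = await github.get_pull_request(owner='microsoft', repo='vscode', pull_number=1234)
return {"title": pr["title"]}
"""
result = asyncio.run(run_chain_sketch(snippet, {"github": FakeGithub()}))
print(result)  # {'title': 'microsoft/vscode#1234'}
```

Note that this sketch runs the snippet in the host process with no isolation; it only illustrates the wrapping step.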

Auto-Generated Python TypedDict Interfaces

class GithubGetPullRequestInput(TypedDict):
    """Repository owner"""
    owner: str
    """Repository name"""
    repo: str
    """Pull request number"""
    pull_number: int
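
How such an interface could be derived from a tool's JSON Schema is sketched below. This is an illustration under assumptions, not the library's generator: `typed_dict_source` and `JSON_TO_PY` are hypothetical names, and only flat schemas with primitive types are handled.

```python
from typing import Any

# Minimal JSON Schema -> Python type mapping; anything else falls back to Any.
JSON_TO_PY = {"string": "str", "integer": "int", "number": "float", "boolean": "bool"}


def typed_dict_source(class_name: str, schema: dict[str, Any]) -> str:
    """Render TypedDict source text from a flat JSON Schema object."""
    lines = [f"class {class_name}(TypedDict):"]
    for field, spec in schema.get("properties", {}).items():
        if "description" in spec:
            lines.append(f'    """{spec["description"]}"""')
        lines.append(f"    {field}: {JSON_TO_PY.get(spec.get('type'), 'Any')}")
    return "\n".join(lines)


schema = {
    "properties": {
        "owner": {"type": "string", "description": "Repository owner"},
        "repo": {"type": "string", "description": "Repository name"},
        "pull_number": {"type": "integer", "description": "Pull request number"},
    }
}
source = typed_dict_source("GithubGetPullRequestInput", schema)
print(source)
```

The generated text still needs `from typing import TypedDict` (and `Any`, if used) at the top of the file it is written to.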

Enterprise-Ready

Secure Process Sandboxing – Subprocess isolation prevents unauthorized access
Timeout Protection – Configurable execution limits prevent runaway code
Complete Observability – Full console output capture and error handling
Zero External Dependencies – Tools only accessible through registered UTCP/MCP servers
Runtime Introspection – Dynamic interface discovery for adaptive workflows

Universal Protocol Support

Works with any tool ecosystem:

Protocol	Description	Usage
MCP	Model Context Protocol servers	`call_template_type: 'mcp'`
HTTP	REST APIs with auto-discovery	`call_template_type: 'http'`
File	Local JSON/YAML configurations	`call_template_type: 'file'`
CLI	Command-line tool execution	`call_template_type: 'cli'`

Installation

pip install code-mode

Direct Python Usage

1. MCP Server Integration

Connect to any Model Context Protocol server:

import os

from utcp_code_mode import CodeModeUtcpClient

client = await CodeModeUtcpClient.create()

# Connect to GitHub MCP server
await client.register_manual({
    'name': 'github',
    'call_template_type': 'mcp',
    'config': {
        'mcpServers': {
            'github': {
                'command': 'docker',
                'args': ['run', '-i', '--rm', '-e', 'GITHUB_PERSONAL_ACCESS_TOKEN', 'mcp/github'],
                'env': {'GITHUB_PERSONAL_ACCESS_TOKEN': os.environ.get('GITHUB_TOKEN')}
            }
        }
    }
})

2. Execute Multi-Step Workflows

Replace several tool-call round trips with a single code execution:

result = await client.call_tool_chain('''
# Traditional: 4 separate API round trips → Code Mode: 1 execution
pr = await github.get_pull_request(owner='microsoft', repo='vscode', pull_number=1234)
comments = await github.get_pull_request_comments(owner='microsoft', repo='vscode', pull_number=1234)
reviews = await github.get_pull_request_reviews(owner='microsoft', repo='vscode', pull_number=1234)
files = await github.get_pull_request_files(owner='microsoft', repo='vscode', pull_number=1234)

# Process data in-sandbox (no token overhead)
summary = {
    "title": pr["title"],
    "state": pr["state"],
    "author": pr["user"]["login"],
    "stats": {
        "comments": len(comments),
        "reviews": len(reviews),
        "filesChanged": len(files),
        "approvals": len([r for r in reviews if r["state"] == "APPROVED"])
    },
    "topDiscussion": [
        {
            "author": c["user"]["login"],
            "preview": c["body"][:100] + "..."
        } for c in comments[:3]
    ]
}

print(f'PR "{pr["title"]}" analysis complete')
return summary
''')

print('Analysis Result:', result['result'])
# console output: 'PR "Fix memory leak in hooks" analysis complete'

Advanced Features

Multi-Protocol Tool Chains

Mix and match different tool ecosystems in a single execution:

# Register multiple tool sources
await client.register_manual({'name': 'github', 'call_template_type': 'mcp', ...})
await client.register_manual({'name': 'slack', 'call_template_type': 'http', ...})
await client.register_manual({'name': 'db', 'call_template_type': 'file', 'file_path': './db-tools.json'})

result = await client.call_tool_chain('''
# Fetch PR data from GitHub (MCP)
pr = await github.get_pull_request(owner='company', repo='api', pull_number=42)

# Query deployment status from database (File)
deployment = await db.get_deployment_status(pr_id=pr["id"])

# Send notification to Slack (HTTP)
await slack.post_message(
    channel='#releases',
    text=f'PR #42 "{pr["title"]}" deployed to {deployment["environment"]}'
)

return {"pr": pr["title"], "environment": deployment["environment"]}
''')

Runtime Interface Introspection

Executed code can discover the available interfaces at runtime and adapt to them:

result = await client.call_tool_chain('''
# Discover available tools at runtime
print('Available interfaces:', __interfaces)

# Get specific tool interface for validation
pr_interface = __get_tool_interface('github.get_pull_request')
print('PR tool expects:', pr_interface)

# Use interface info for dynamic workflows
has_slack_tools = 'namespace slack' in __interfaces
if has_slack_tools:
    await slack.post_message(channel='#dev', text='Analysis complete')

return {"toolsAvailable": has_slack_tools}
''')
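
Conceptually, the introspection helpers behave like lookups over a registry of interface definitions. The sketch below mimics that with a plain dictionary; `INTERFACES`, `get_tool_interface`, and `has_namespace` are hypothetical stand-ins, and the actual format of `__interfaces` is not specified in this document.

```python
# Hypothetical registry: tool name -> interface source text.
INTERFACES = {
    "github.get_pull_request": "class GithubGetPullRequestInput(TypedDict): ...",
    "slack.post_message": "class SlackPostMessageInput(TypedDict): ...",
}


def get_tool_interface(tool_name: str):
    """Return the interface text for one tool, or None if it is not registered."""
    return INTERFACES.get(tool_name)


def has_namespace(namespace: str) -> bool:
    """True if at least one registered tool lives under `namespace`."""
    return any(name.split(".", 1)[0] == namespace for name in INTERFACES)


print(get_tool_interface("github.get_pull_request"))
print(has_namespace("slack"), has_namespace("jira"))  # True False
```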

Context-Efficient Data Processing

Process large datasets without bloating the model's context:

result = await client.call_tool_chain('''
# datetime is on the sandbox's allowed-module list
from datetime import datetime, timezone

# Fetch large dataset
all_issues = await github.list_repository_issues(owner='facebook', repo='react')
print(f'Fetched {len(all_issues)} total issues')

# Timezone-aware "now", so it can be subtracted from the parsed UTC timestamps
now = datetime.now(timezone.utc)

# Process efficiently in-sandbox
critical_bugs = [
    {
        "number": issue["number"],
        "title": issue["title"],
        "author": issue["user"]["login"],
        "daysOld": (now - datetime.fromisoformat(issue["created_at"].replace('Z', '+00:00'))).days
    }
    for issue in all_issues
    if any(l["name"] == "bug" for l in issue["labels"])
    and any(l["name"] == "high priority" for l in issue["labels"])
]

critical_bugs.sort(key=lambda x: x["daysOld"], reverse=True)

# Avoid an IndexError when no issue matches both labels
oldest = f', oldest is {critical_bugs[0]["daysOld"]} days old' if critical_bugs else ''

# Only return processed summary (not 10,000 raw issues)
return {
    "totalIssues": len(all_issues),
    "criticalBugs": critical_bugs[:10],  # Top 10 oldest critical bugs
    "summary": f'Found {len(critical_bugs)} critical bugs{oldest}'
}
''')

Error Handling & Observability

Built-in error handling with complete execution transparency:

result = await client.call_tool_chain('''
try:
    print('Starting multi-step workflow...')
    
    data = await external_api.fetch_data(id='user-123')
    print('Data fetched successfully')
    
    processed = await data_processor.transform(data)
    print(f'Processing completed with {len(processed.get("warnings", []))} warnings')
    
    return processed
except Exception as error:
    print(f'Workflow failed: {error}')
    raise  # Re-raise with the original traceback so outer error handling sees it
''', timeout=30)  # 30-second timeout

# Complete observability
print('Result:', result['result'])
print('Execution logs:', result['logs'])
# ['Starting multi-step workflow...', 'Data fetched successfully', 'Processing completed with 2 warnings']
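
The `result`/`logs` shape can be reproduced with the standard library by redirecting standard output while the code runs. This is a minimal sketch of the capture idea only; `run_with_logs` and `workflow` are hypothetical names, and the library's own capture mechanism may differ.

```python
import contextlib
import io


def run_with_logs(func, *args):
    """Call `func`, capture everything it prints, and return a result/logs dict."""
    buffer = io.StringIO()
    with contextlib.redirect_stdout(buffer):
        result = func(*args)
    return {"result": result, "logs": buffer.getvalue().splitlines()}


def workflow(user_id):
    print("Starting multi-step workflow...")
    print("Data fetched successfully")
    return {"id": user_id, "warnings": []}


outcome = run_with_logs(workflow, "user-123")
print(outcome["logs"])
# ['Starting multi-step workflow...', 'Data fetched successfully']
```

In this sketch an exception inside `func` propagates before the dict is built, so a fuller version would also need to preserve the logs on failure.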

Custom Timeouts

Configure execution limits for different workload types:

# Quick operations (5 seconds)
quick_result = await client.call_tool_chain('return await ping.check()', timeout=5)

# Heavy data processing (2 minutes)
heavy_result = await client.call_tool_chain('''
big_data = await database.export_full_dataset()
return await analytics.process_dataset(big_data)
''', timeout=120)
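
For intuition about how a timeout with forcible termination can be enforced, the sketch below runs code in a separate Python process using only the standard library. It is an assumption-laden illustration, not the library's executor; `run_isolated` is a hypothetical helper.

```python
import subprocess
import sys


def run_isolated(code: str, timeout: float) -> dict:
    """Run `code` in a fresh Python subprocess; kill it if it exceeds `timeout` seconds."""
    try:
        proc = subprocess.run(
            [sys.executable, "-c", code],
            capture_output=True,
            text=True,
            timeout=timeout,  # on expiry, run() kills the child and raises TimeoutExpired
        )
    except subprocess.TimeoutExpired:
        return {"ok": False, "error": f"timed out after {timeout}s", "logs": []}
    return {
        "ok": proc.returncode == 0,
        "error": proc.stderr.strip(),
        "logs": proc.stdout.splitlines(),
    }


fast = run_isolated("print('pong')", timeout=5)
slow = run_isolated("import time; time.sleep(10)", timeout=0.5)
print(fast["logs"], slow["ok"])  # ['pong'] False
```

Because the work happens in a child process, a runaway snippet can be terminated without affecting the caller.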

AI Agent Integration

Plug-and-play with any AI framework. The built-in prompt template handles all the complexity:

from utcp_code_mode import CodeModeUtcpClient
from openai import OpenAI

system_prompt = f"""
You are an AI assistant with access to tools via UTCP CodeMode.
{CodeModeUtcpClient.AGENT_PROMPT_TEMPLATE}
Additional instructions...
"""

# Works with any AI library
client = OpenAI()
response = client.chat.completions.create(
    model='gpt-4',
    messages=[
        {'role': 'system', 'content': system_prompt},
        {'role': 'user', 'content': 'Analyze the latest PR in microsoft/vscode'}
    ]
)

The template provides comprehensive guidance on:

Tool discovery workflow (search_tools → __interfaces → call_tool_chain)
Hierarchical access patterns (manual.tool() syntax)
Interface introspection (__get_tool_interface())
Error handling and best practices
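
Once the model replies, the generated code has to be pulled out of the response before it can be passed to `call_tool_chain`. The sketch below assumes the model wraps its code in a fenced block, which depends on your prompt and model; `extract_python` is a hypothetical helper, not part of the library.

```python
import re

FENCE = "`" * 3  # a triple backtick, built this way so the example embeds cleanly
CODE_BLOCK = re.compile(FENCE + r"(?:python)?\s*\n(.*?)" + FENCE, re.DOTALL)


def extract_python(reply: str):
    """Return the first fenced code block in an LLM reply, or None if there is none."""
    match = CODE_BLOCK.search(reply)
    return match.group(1).strip() if match else None


reply = (
    "Here is the plan.\n"
    f"{FENCE}python\n"
    "pr = await github.get_pull_request(owner='microsoft', repo='vscode', pull_number=1234)\n"
    "return pr['title']\n"
    f"{FENCE}\n"
    "Done."
)
code = extract_python(reply)
print(code)
```

The extracted string is what you would hand to `call_tool_chain`; a reply without a code block yields `None`, which the caller should handle.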

API Reference

Core Methods

`call_tool_chain(code: str, timeout: int = 30) -> Dict[str, Any]`

Execute Python code with full tool access and observability.

Returns: {"result": Any, "logs": List[str]}, the execution result plus the captured console output
Default timeout: 30 seconds

`get_all_tools_python_interfaces() -> str`

Generate complete Python TypedDict interfaces for IDE integration.

Returns: String containing all interface definitions with proper typing

`search_tools(query: str, limit: int = 10)` (from UtcpClient)

Discover tools using natural language queries.

Returns: List of relevant tools with descriptions and interfaces

Static Methods

`CodeModeUtcpClient.create(root_dir=None, config=None) -> CodeModeUtcpClient`

Create a new client instance with optional configuration.

`CodeModeUtcpClient.AGENT_PROMPT_TEMPLATE`

Production-ready prompt template for AI agents.

Security & Performance

Secure by Design

Process sandboxing – Isolated execution in separate processes with real termination
No filesystem access – Tools only through registered servers
Timeout protection – Configurable execution limits with forcible termination
Zero network access – No external dependencies or API keys exposed
Restricted imports – Only safe modules allowed (json, math, asyncio, datetime, time, re, typing, collections, itertools, functools, operator, uuid)
Safe builtins – Dangerous functions like exec, eval, open are blocked
No system access – Modules like os, sys, subprocess not available
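
The import and builtin restrictions can be approximated in plain Python by handing `exec` a custom `__builtins__` mapping. The sketch below uses the module and builtin names listed above; `guarded_import` and `restricted_globals` are hypothetical helpers, and this technique is easy to bypass, so treat it as a guard rail for cooperative code rather than a security boundary.

```python
import builtins

ALLOWED_MODULES = {"json", "math", "asyncio", "datetime", "time", "re", "typing",
                   "collections", "itertools", "functools", "operator", "uuid"}
BLOCKED_BUILTINS = {"exec", "eval", "open"}


def guarded_import(name, globals=None, locals=None, fromlist=(), level=0):
    """Allow an import only if its top-level package is on the allowlist."""
    if name.split(".")[0] not in ALLOWED_MODULES:
        raise ImportError(f"import of {name!r} is not allowed")
    return builtins.__import__(name, globals, locals, fromlist, level)


def restricted_globals() -> dict:
    """Build exec() globals whose builtins omit blocked names and use the guarded import."""
    safe = {k: v for k, v in vars(builtins).items() if k not in BLOCKED_BUILTINS}
    safe["__import__"] = guarded_import
    return {"__builtins__": safe}


scope = restricted_globals()
exec("import math\nroot = math.sqrt(16)", scope)
print(scope["root"])  # 4.0

try:
    exec("import os", scope)
except ImportError as err:
    print("blocked:", err)
```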

Performance Optimized

Minimal memory footprint – Process isolation is efficient with copy-on-write
Efficient tool caching – TypedDict interfaces cached automatically
Streaming console output – Real-time log capture without buffering
Identifier sanitization – Handles invalid Python identifiers gracefully
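
Identifier sanitization matters because tool names such as `get-pull-request` are not valid Python attribute names. A minimal sketch of one possible rule set follows; `sanitize_identifier` is a hypothetical helper, and the library's actual rules may differ.

```python
import keyword
import re


def sanitize_identifier(name: str) -> str:
    """Turn an arbitrary tool name into a valid, non-keyword Python identifier."""
    # Replace every character outside [A-Za-z0-9_] with an underscore
    cleaned = re.sub(r"\W", "_", name, flags=re.ASCII)
    if not cleaned or cleaned[0].isdigit():
        cleaned = "_" + cleaned  # identifiers cannot be empty or start with a digit
    if keyword.iskeyword(cleaned):
        cleaned += "_"           # avoid clashing with reserved words
    return cleaned


for raw in ("get-pull-request", "2fa.verify", "class"):
    print(raw, "->", sanitize_identifier(raw))
# get-pull-request -> get_pull_request
# 2fa.verify -> _2fa_verify
# class -> class_
```

Different raw names can collapse to the same identifier under this rule (for example `a-b` and `a.b`), so a real implementation would also need a collision strategy.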

Cooperative Sandbox Model

This security model is designed for cooperative LLM-generated code (not adversarial scenarios). It's perfect for:

AI agents with tool-based workflows
Development environments with controlled tool access
Educational settings for safe code experimentation
Internal automation with defined interfaces

Not suitable for: Production multi-tenant environments or untrusted user code.

Development Experience

IDE Integration

Generate Python definitions for full IntelliSense support:

# Generate tool interfaces  
interfaces = await client.get_all_tools_python_interfaces()
with open('generated_tools.py', 'w') as f:
    f.write(interfaces)

# Import in your code for type hints
from generated_tools import *

Debug & Monitor

Built-in observability for production deployments:

result = await client.call_tool_chain(user_code)

# Ship logs to your monitoring system
for log in result['logs']:
    if '[ERROR]' in log:
        monitoring.error(log)
    if '[WARN]' in log:
        monitoring.warn(log)
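
If you do not have a dedicated monitoring client, the same routing can be done with the standard `logging` module. This sketch assumes log lines carry the `[ERROR]` and `[WARN]` markers used in the example above; `level_for`, `forward_logs`, and the logger name are hypothetical.

```python
import logging

logger = logging.getLogger("code_mode.sketch")  # hypothetical logger name


def level_for(line: str) -> int:
    """Map a captured log line to a logging level based on its marker."""
    if "[ERROR]" in line:
        return logging.ERROR
    if "[WARN]" in line:
        return logging.WARNING
    return logging.INFO


def forward_logs(logs: list[str]) -> dict[str, int]:
    """Forward each line to `logger` and return a count per level name."""
    counts: dict[str, int] = {}
    for line in logs:
        level = level_for(line)
        logger.log(level, "%s", line)
        level_name = logging.getLevelName(level)
        counts[level_name] = counts.get(level_name, 0) + 1
    return counts


sample_logs = ["Starting workflow", "[WARN] retrying request", "[ERROR] request failed"]
print(forward_logs(sample_logs))
# {'INFO': 1, 'WARNING': 1, 'ERROR': 1}
```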

Benchmark Methodology

The comprehensive Python study tested 16 realistic scenarios across:

Financial workflows (invoicing, expense tracking)
DevOps operations (deployments, monitoring)
Data processing (analysis, reporting)
Business automation (CRM, notifications)

Models tested: Claude Haiku, Gemini Flash
Pricing basis: $0.25/1M input, $1.25/1M output tokens
Scale: 1,000 scenarios/day = $9,536/year savings with Code Mode
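
To put the headline figure in per-scenario terms, the arithmetic below divides the reported annual savings by the yearly scenario count. It assumes a 365-day year, which the study summary here does not state.

```python
ANNUAL_SAVINGS_USD = 9_536   # reported savings per year
SCENARIOS_PER_DAY = 1_000    # reported scale
DAYS_PER_YEAR = 365          # assumption: the study's day count is not given here

scenarios_per_year = SCENARIOS_PER_DAY * DAYS_PER_YEAR
savings_per_scenario = ANNUAL_SAVINGS_USD / scenarios_per_year

print(f"{scenarios_per_year:,} scenarios/year")            # 365,000 scenarios/year
print(f"${savings_per_scenario:.4f} saved per scenario")   # $0.0261 saved per scenario
```

That is roughly 2.6 cents saved per scenario under the stated assumption.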

Learn More

Cloudflare Research – Original code mode whitepaper
Anthropic Study – MCP code execution benefits
Python Benchmark Study – Comprehensive performance analysis
UTCP Specification – Universal Tool Calling Protocol
Report Issues – Bug reports and feature requests

License

MPL-2.0 – Open source with commercial-friendly terms.

Release history

1.1.0 – May 3, 2026
0.0.3 – Nov 23, 2025 (this version)
0.0.2 – Nov 23, 2025
0.0.1 – Nov 23, 2025

Download files

Source Distribution

code_mode-0.0.3.tar.gz (19.6 kB)

Uploaded Nov 23, 2025 Source

Built Distribution

code_mode-0.0.3-py3-none-any.whl (16.2 kB)

Uploaded Nov 23, 2025 Python 3

File details

Details for the file code_mode-0.0.3.tar.gz.

File metadata

Download URL: code_mode-0.0.3.tar.gz
Upload date: Nov 23, 2025
Size: 19.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for code_mode-0.0.3.tar.gz
Algorithm	Hash digest
SHA256	`72a6b1b4224e903265349aaab0a8d1ec91ac3aa554bfa1f0720e64dfb08141a8`
MD5	`6390b79fec4e4a2ae3d4f4f987960905`
BLAKE2b-256	`5348a0bbcd65ed6d3600e994cdb8a14ffeb4e283da59fc6ede90914bc627ef6c`

File details

Details for the file code_mode-0.0.3-py3-none-any.whl.

File metadata

Download URL: code_mode-0.0.3-py3-none-any.whl
Upload date: Nov 23, 2025
Size: 16.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for code_mode-0.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`92f97106112cf8c34be8469ebbf62478b3107e13a0288b664e222270d7892a3b`
MD5	`8c41e73c66a667e57724dfaf95bebd3c`
BLAKE2b-256	`5d500e7ea2c0d6b7aa5286368a849ba35efa5ccc497b6419e4f0f9a2f30eb8b5`
