Lightweight Python AI agent with OpenAI tool calling, MCP support, and parallel subagents

These details have been verified by PyPI

Project links

Repository

GitHub Statistics

Maintainers

chldu2000

Project description

air-agent

中文文档

A lightweight Python AI Agent library with OpenAI as the default provider, plus custom LLM provider support. It includes tool-calling loops, built-in file/shell tools, MCP server connections, skills, parallel subagents, tracing, and streaming output. Designed to be imported directly by other Python projects.

Installation

uv add air-agent

Or in development mode:

git clone https://github.com/chldu2000/air-agent.git
cd air-agent
uv sync --group dev

Quick Start

1. Set an API Key

For the default OpenAI provider, either set OPENAI_API_KEY or pass api_key in AgentConfig.

export OPENAI_API_KEY=sk-...

2. Run a Basic Conversation

import asyncio
from air_agent import Agent, AgentConfig


async def main():
    agent = Agent(AgentConfig(model="gpt-4o"))
    response = await agent.run("Explain quantum computing in one sentence")
    print(response.content)


asyncio.run(main())

3. Register a Local Tool

Built-in tools are registered automatically. You can also add local Python functions as tools.

import asyncio
from air_agent import Agent, AgentConfig


async def main():
    agent = Agent(AgentConfig(model="gpt-4o"))

    @agent.tool(name="add", description="Calculate the sum of two numbers")
    async def add(a: int, b: int) -> int:
        return a + b

    response = await agent.run("What is 3 plus 5?")
    print(response.content)


asyncio.run(main())

Parameter types are inferred from the function signature and converted to the JSON Schema required by OpenAI tool calling.

4. Stream Output

import asyncio
from air_agent import Agent, AgentConfig


async def main():
    agent = Agent(AgentConfig(model="gpt-4o"))

    async for event in await agent.run("Write a short poem about programming", stream=True):
        if event.type == "text":
            print(event.content, end="", flush=True)
        elif event.type == "tool_call":
            print(f"\n[Calling tool: {event.name}]")
        elif event.type == "tool_result":
            print(f"\n[Tool result: {event.content}]")
        elif event.type == "done":
            print(f"\nDone, token usage: {event.usage}")


asyncio.run(main())

5. Keep Conversation Context

Pass the same conversation_id across turns. air-agent keeps the recent conversation history for that id.

import asyncio
from air_agent import Agent, AgentConfig


async def main():
    agent = Agent(AgentConfig(model="gpt-4o"))

    first = await agent.run("My project is named air-agent.", conversation_id="session-1")
    second = await agent.run("What is my project named?", conversation_id="session-1")

    print(first.content)
    print(second.content)


asyncio.run(main())

6. Observe Runs with Tracing

Tracing is opt-in. When enabled, the agent emits structured RunEvent records for LLM calls, tool calls, retries, skill routing, errors, and completion.

from air_agent import Agent, AgentConfig

events = []

agent = Agent(AgentConfig(
    model="gpt-4o",
    enable_tracing=True,
    log_events=True,
    event_handlers=[events.append],
))

response = await agent.run("What files are in this project?")

for event in events:
    print(event.to_dict())

tool_duration_ms = sum(
    event.duration_ms or 0
    for event in events
    if event.type == "tool_end"
)
failed_tools = [
    event
    for event in events
    if event.type == "tool_error"
]

print(f"Tool time: {tool_duration_ms:.1f}ms")
for event in failed_tools:
    print(f"Failed tool: {event.name} ({event.error_kind})")

Useful event types include llm_start, llm_end, tool_start, tool_end, tool_error, retry, and done. Tool errors include an error_kind such as invalid_arguments, tool_not_found, timeout, permission_denied, or tool_error.

Skills tracing adds:

skill_route_start with metadata.candidate_names, metadata.candidate_count, and metadata.router
skill_route_end with the router raw output in content, metadata.matched_names, metadata.unrecognized_names, and duration_ms
skill_route_error with the failure message in content, metadata.error_type, metadata.fallback="no_skills", and duration_ms
skill_injected with the injected skill name, metadata.path, and metadata.content_length

skill_route_end.content contains the complete model-generated router output. Tracing logs may therefore include sensitive prompt or routing data; enable logging, storage, access, and retention controls accordingly.

Load Configuration from JSON

{
  "model": "gpt-4o",
  "system_prompt": "You are a coding assistant",
  "mcp_servers": [
    {"command": "npx", "args": ["-y", "@anthropic/mcp-server-filesystem", "/tmp"]},
    {"url": "http://localhost:8080/sse"}
  ]
}

config = AgentConfig.from_json("agent-config.json")
agent = Agent(config)

The mcp_servers field auto-detects the transport type based on command (stdio) or url (StreamableHTTP).

Load Configuration from Environment Variables

export AIR_MODEL=gpt-4o
export AIR_SYSTEM_PROMPT="You are an assistant"
export AIR_MAX_ITERATIONS=30
export AIR_MCP_SERVERS='[{"command":"npx","args":["server"]}]'

config = AgentConfig.from_env()          # default AIR_ prefix
config = AgentConfig.from_env(prefix="MYAPP_")  # custom prefix
agent = Agent(config)

Supported environment variables:

Variable	Type	Description
`AIR_MODEL`	str	Model name
`AIR_API_KEY`	str	API key (takes precedence over `OPENAI_API_KEY`)
`AIR_BASE_URL`	str	Custom API endpoint
`AIR_PROVIDER`	str	Provider name (`openai`; unset also uses OpenAI)
`AIR_SYSTEM_PROMPT`	str	System prompt
`AIR_MAX_ITERATIONS`	int	Max tool-calling rounds
`AIR_TOOL_TIMEOUT`	float	Tool call timeout in seconds
`AIR_MCP_SERVERS`	JSON	MCP server list
`AIR_DEFAULT_HEADERS`	JSON	Custom request headers
`AIR_SKILLS_DIR`	str	Skills directory path
`AIR_BUILTIN_TOOLS`	JSON	Built-in tools config
`AIR_ENABLE_TRACING`	bool	Enable structured event dispatch
`AIR_LOG_EVENTS`	bool	Log structured events as JSON
`AIR_MAX_TOOL_RETRIES`	int	Retries for retryable tool errors

Custom LLM Providers

OpenAI remains the default provider. For OpenAI-compatible APIs, keep using model, api_key, base_url, and default_headers:

from air_agent import Agent, AgentConfig

agent = Agent(AgentConfig(
    model="gpt-4o",
    api_key="sk-xxx",
    base_url="https://api.example.com/v1",
    default_headers={"X-API-Key": "custom-header"},
))

For other backends, pass an object that implements LLMProvider. Provider methods return the neutral LLMResponse and LLMStreamChunk types, so you can adapt any backend without OpenAI-specific payloads.

from typing import Any, AsyncIterator

from air_agent import Agent, AgentConfig, BuiltinToolsConfig, LLMResponse, LLMStreamChunk


class EchoProvider:
    supports_tools = False
    supports_streaming = True

    async def complete(
        self,
        *,
        model: str,
        messages: list[dict[str, Any]],
        tools: list[dict[str, Any]] | None = None,
        **options: Any,
    ) -> LLMResponse:
        last_message = messages[-1]["content"]
        return LLMResponse(content=f"echo: {last_message}")

    async def stream(
        self,
        *,
        model: str,
        messages: list[dict[str, Any]],
        tools: list[dict[str, Any]] | None = None,
        **options: Any,
    ) -> AsyncIterator[LLMStreamChunk]:
        yield LLMStreamChunk(content_delta="echo: ")
        yield LLMStreamChunk(content_delta=str(messages[-1]["content"]))


agent = Agent(AgentConfig(
    model="echo",
    provider=EchoProvider(),
    builtin_tools=BuiltinToolsConfig(enabled=False),
))

If supports_tools = False, runs with registered or enabled tools fail clearly instead of silently ignoring them. Built-in tools are enabled by default, so disable them as shown above or implement tool support in your provider.

Skills

Load skill instructions from a directory of skill folders. Each skill is a directory (kebab-case named) containing a SKILL.md file with YAML frontmatter for metadata and Markdown body for instructions.

Directory structure:

skills/
├── brainstorming/
│   └── SKILL.md              # Required: metadata + instructions
├── data-analysis/
│   ├── SKILL.md
│   ├── scripts/              # Optional: executable scripts
│   │   └── process_data.py
│   └── references/           # Optional: templates, schemas
│       └── data_schema.json

SKILL.md format (skills/brainstorming/SKILL.md):

---
name: brainstorming
description: Use when starting creative work or exploring ideas
---

# Brainstorming

Ask questions one at a time to refine the idea.

Usage:

from air_agent import Agent, AgentConfig

config = AgentConfig(
    model="gpt-4o",
    skills_dir="./skills",  # directory containing skill subdirectories
)
agent = Agent(config)
response = await agent.run("I want to brainstorm a new feature")

Skills work via progressive prompt injection:

All skill metadata (name + description) is always included in the system prompt
When a user query matches relevant skills, the full skill content is injected into the conversation context
Skill matching uses an LLM-based router by default; you can provide a custom SkillRouter implementation

Custom router:

from air_agent import SkillRouter

class KeywordRouter(SkillRouter):
    async def match(self, user_input: str, skills: list) -> list:
        return [s for s in skills if s.name in user_input.lower()]

Built-in Tools

Agent comes with a minimal built-in toolset for file system operations and shell commands. These are enabled by default and registered automatically.

Tool	Description
`read_file`	Read file contents with offset/limit support
`write_file`	Write content to a file, auto-create directories
`list_directory`	List directory entries with type and size info
`find_files`	Find files matching a glob pattern
`grep`	Search file contents with regex
`run_shell`	Execute shell commands

Default usage (no configuration needed):

from air_agent import Agent, AgentConfig

agent = Agent(AgentConfig(model="gpt-4o", api_key="sk-xxx"))
# read_file, write_file, list_directory, find_files, grep, run_shell are all available

Configuration:

from air_agent import BuiltinToolsConfig

# Disable built-in tools entirely
config = AgentConfig(model="gpt-4o", builtin_tools=BuiltinToolsConfig(enabled=False))

# Select specific tools only
config = AgentConfig(model="gpt-4o",
    builtin_tools=BuiltinToolsConfig(tools=["read_file", "grep"]))

# Custom sandbox and limits
config = AgentConfig(model="gpt-4o",
    builtin_tools=BuiltinToolsConfig(
        allowed_directories=["/project"],
        max_read_size=500_000,
        max_grep_results=50,
        default_timeout=60.0,
    ))

Security features:

Path sandbox — file tools only access paths within allowed_directories (defaults to cwd)
Command blocklist — dangerous commands (rm -rf /, sudo, mkfs, etc.) are blocked
Result limits — configurable caps on find/grep/list results and shell output
Truncation notices — when results are truncated, the agent is informed so it can refine queries

BuiltinToolsConfig fields:

Field	Type	Default	Description
`enabled`	bool	`True`	Master switch
`tools`	list	`None`	Tool selection (`None` = all)
`allowed_directories`	list	`[]`	Sandbox dirs (empty = cwd)
`max_read_size`	int	`1000000`	Max file read size in bytes
`default_timeout`	float	`30.0`	Shell command timeout
`blocked_commands`	list	[...]	Blocked command patterns
`max_find_results`	int	`200`	Find results cap
`max_grep_results`	int	`100`	Grep matches cap
`max_list_entries`	int	`500`	Directory listing cap
`max_output_bytes`	int	`50000`	Shell output truncation

Connect to MCP Servers

from air_agent import MCPServerStdio, MCPServerSSE

agent = Agent(AgentConfig(
    model="gpt-4o",
    mcp_servers=[
        MCPServerStdio(command="npx", args=["-y", "@anthropic/mcp-server-filesystem", "/tmp"]),
        MCPServerSSE(url="http://localhost:8080/mcp"),
    ],
))

async with agent:  # auto connect/disconnect MCP servers
    response = await agent.run("List files under /tmp")

Supports both stdio and StreamableHTTP MCP transports. MCPServerSSE is the compatibility name for URL-based MCP servers. Once connected, tools exposed by the server are automatically registered in the agent's tool list.

Parallel Subagents

from air_agent import SubagentConfig

results = await agent.delegate(
    tasks=[
        "Analyze the code structure in src/",
        "Check test coverage in tests/",
        "Generate a CHANGELOG",
    ],
    config=SubagentConfig(max_parallel=3, timeout=60),
)

for r in results:
    print(f"[{r.status}] {r.content[:100]}")

Each task runs as an isolated prompt through the same agent, with concurrency limited by SubagentConfig.max_parallel.

Configuration

AgentConfig(
    model="gpt-4o",              # Model name
    api_key="sk-xxx",            # Or set OPENAI_API_KEY env variable
    base_url=None,               # Custom API endpoint
    provider=None,                # None/"openai" or an LLMProvider object
    default_headers=None,         # Custom provider request headers
    system_prompt="You are an assistant",  # System prompt
    max_iterations=20,           # Max tool-calling rounds
    tool_timeout=30.0,           # Single tool call timeout (seconds)
    mcp_servers=[],              # MCP server list
    skills_dir=None,             # Skills directory path
    builtin_tools=None,          # BuiltinToolsConfig or None for defaults
    enable_tracing=False,         # Emit structured RunEvent records
    log_events=False,             # Log RunEvent records as JSON
    max_tool_retries=0,           # Retries for retryable tool errors
)

Project Structure

src/air_agent/
├── __init__.py          # Public API exports
├── agent.py             # Core Agent (ReAct loop + streaming)
├── config.py            # Configuration dataclass
├── providers/
│   ├── types.py         # LLMProvider protocol + neutral response types
│   └── openai.py        # Default OpenAI provider adapter
├── tracing.py           # RunEvent dispatcher and structured event logging
├── types.py             # Response, StreamEvent, SubagentResult
├── tools/
│   ├── base.py          # Tool dataclass
│   ├── registry.py      # Tool registry
│   └── builtin/
│       ├── config.py    # BuiltinToolsConfig
│       ├── _permissions.py  # Path sandbox + command blocklist
│       ├── file_tools.py    # read, write, list, find, grep
│       └── shell_tools.py   # run_shell
├── mcp/
│   ├── client.py        # MCP client (stdio + streamable_http)
│   └── tool_adapter.py  # MCP tool → OpenAI format adapter
├── skills/
│   ├── skill.py         # Skill dataclass + SKILL.md parser
│   ├── manager.py       # SkillManager (directory scanning)
│   └── router.py        # SkillRouter ABC + LLMSkillRouter
└── subagent.py          # Parallel subagent manager

Dependencies

openai — LLM calls and tool calling
mcp — MCP protocol client
pydantic — Data validation

Development

uv sync --group dev
uv run pytest tests/ -v

License

MIT

Project details

These details have been verified by PyPI

Project links

Repository

GitHub Statistics

Maintainers

chldu2000

Release history Release notifications | RSS feed

0.5.0

Jun 11, 2026

This version

0.4.0

Jun 10, 2026

0.3.1

Jun 6, 2026

0.3.0

Jun 3, 2026

0.2.0

May 27, 2026

0.1.3

May 18, 2026

0.1.2

May 6, 2026

0.1.1

Apr 30, 2026

0.1.0

Apr 30, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

air_agent-0.4.0.tar.gz (104.9 kB view details)

Uploaded Jun 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

air_agent-0.4.0-py3-none-any.whl (31.6 kB view details)

Uploaded Jun 10, 2026 Python 3

File details

Details for the file air_agent-0.4.0.tar.gz.

File metadata

Download URL: air_agent-0.4.0.tar.gz
Upload date: Jun 10, 2026
Size: 104.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for air_agent-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`4f85e8bf9449d62c06b029869bb057284ed104c8bf4e90fb56637215aec247e4`
MD5	`0f48e8dd705e2d1aa545646cf4eee549`
BLAKE2b-256	`2692a8d9b61d0b7ecb5e4a5ca4c4ec756fec4ca60ebf9a241373910c2e10283e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for air_agent-0.4.0.tar.gz:

Publisher: python-publish.yml on chldu2000/air-agent

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: air_agent-0.4.0.tar.gz
- Subject digest: 4f85e8bf9449d62c06b029869bb057284ed104c8bf4e90fb56637215aec247e4
- Sigstore transparency entry: 1779176660
- Sigstore integration time: Jun 10, 2026
Source repository:
- Permalink: chldu2000/air-agent@046a77886ba5db62627847a71443eeb4c09271c9
- Branch / Tag: refs/tags/v0.4.0
- Owner: https://github.com/chldu2000
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@046a77886ba5db62627847a71443eeb4c09271c9
- Trigger Event: release

File details

Details for the file air_agent-0.4.0-py3-none-any.whl.

File metadata

Download URL: air_agent-0.4.0-py3-none-any.whl
Upload date: Jun 10, 2026
Size: 31.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for air_agent-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1fd2b9471805ad556ae3fbbe6042f18752bff1f62c7d9a94874ff6ffd40420d9`
MD5	`667a9906966e1eda990cb02b55525364`
BLAKE2b-256	`697bd41cffbd027f787337c473d96deef28bbe589ca193b41f75574c5332d603`

See more details on using hashes here.

Provenance

The following attestation bundles were made for air_agent-0.4.0-py3-none-any.whl:

Publisher: python-publish.yml on chldu2000/air-agent

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: air_agent-0.4.0-py3-none-any.whl
- Subject digest: 1fd2b9471805ad556ae3fbbe6042f18752bff1f62c7d9a94874ff6ffd40420d9
- Sigstore transparency entry: 1779176835
- Sigstore integration time: Jun 10, 2026
Source repository:
- Permalink: chldu2000/air-agent@046a77886ba5db62627847a71443eeb4c09271c9
- Branch / Tag: refs/tags/v0.4.0
- Owner: https://github.com/chldu2000
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@046a77886ba5db62627847a71443eeb4c09271c9
- Trigger Event: release

air-agent 0.4.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Project description

air-agent

Installation

Quick Start

1. Set an API Key

2. Run a Basic Conversation

3. Register a Local Tool

4. Stream Output

5. Keep Conversation Context

6. Observe Runs with Tracing

Load Configuration from JSON

Load Configuration from Environment Variables

Custom LLM Providers

Skills

Built-in Tools

Connect to MCP Servers

Parallel Subagents

Configuration

Project Structure

Dependencies

Development

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance