Skip to main content

A comprehensive Python framework for building and serving conversational AI agents with FastAPI

Project description

Agent Framework Library

A comprehensive Python framework for building and serving conversational AI agents with FastAPI. Create production-ready AI agents in minutes with automatic session management, streaming responses, file storage, and easy MCP integration.

Key Features:

  • 🚀 Quick Setup - Create agents in 10-15 minutes
  • 🔌 Easy MCP Integration - Connect to external tools effortlessly
  • 🛠️ Off-the-Shelf Tools - Pre-built tools for files, PDFs, charts, and more
  • 🎯 Skills System - On-demand capability loading for token optimization
  • 🔄 Multi-Provider Support - OpenAI, Anthropic, Gemini
  • 🎯 Smart Model Routing - Auto mode selects the best model per query complexity
  • 💾 Session Management - Automatic conversation persistence
  • 📁 File Storage - Local, S3, MinIO support

Installation

# Install with LlamaIndex support (recommended)
uv add agent-framework-lib[llamaindex]

# Install with MCP support
uv add agent-framework-lib[llamaindex,mcp]

# Install with all features
uv add agent-framework-lib[all]

# Or with pip
pip install agent-framework-lib[llamaindex]

Available extras: llamaindex, mcp, mongodb, s3, minio, multimod

Optional: System Dependencies

The framework automatically detects and configures system libraries. Manual installation is only needed if you encounter issues:

For PDF Generation (WeasyPrint):

# macOS
brew install pango gdk-pixbuf libffi cairo

# Ubuntu/Debian
sudo apt-get install libpango-1.0-0 libpangoft2-1.0-0 libgdk-pixbuf2.0-0 libffi-dev libcairo2

# Fedora/RHEL
sudo dnf install pango gdk-pixbuf2 libffi-devel cairo

For Chart/Mermaid Image Generation (Playwright):

# Install Playwright and browser
uv add playwright
playwright install chromium

# Or with pip
pip install playwright
playwright install chromium

For MCP Python Server (Deno):

# macOS/Linux
curl -fsSL https://deno.land/install.sh | sh

# Windows (PowerShell)
irm https://deno.land/install.ps1 | iex

Post-Installation Script (Recommended)

The framework includes a CLI script that automatically installs all optional dependencies (Playwright browsers and Deno runtime):

# Run after installing the package
agent-framework-post-install

This script:

  • ✅ Installs Playwright Chromium browser (for charts, mermaid diagrams, tables)
  • ✅ Installs Deno runtime (for MCP servers like mcp-run-python)
  • ✅ Works on Windows, macOS, and Linux
  • ✅ Detects if dependencies are already installed (fast path)

Note: The framework also attempts lazy auto-installation when tools are first used, but running the post-install script ensures everything is ready upfront.

The framework handles library path configuration automatically on startup.

🤖 Framework Helper Agent

The framework includes a built-in AI assistant that helps you create agents! Access it at /helper when running any agent server.

Features:

  • 🧠 Deep knowledge of framework documentation, examples, and source code
  • 🔍 Search tools for docs and examples
  • 💡 Code generation assistance
  • 📚 Indexed knowledge base (30+ files)
  • 🗄️ Persistent knowledge graph (FalkorDB) - survives server restarts

Access: http://localhost:8000/helper

The helper agent indexes:

  • All documentation (docs/*.md)
  • All examples (examples/*.py)
  • Core framework source (tools, storage, memory, session management)

Re-indexing: If you update documentation or examples, trigger a re-index:

curl -X POST http://localhost:8000/helper/reindex

Model Configuration:

By default, the helper agent uses Claude (if ANTHROPIC_API_KEY is set) or GPT-5 (if OPENAI_API_KEY is set). You can override this with:

# Force a specific model (useful if your Anthropic key has reached its limit)
HELPER_AGENT_MODEL=gpt-5

Example questions:

  • "How do I create an agent with memory?"
  • "Show me how to use PDF tools"
  • "What's the difference between Memori and Graphiti?"
  • "How do I configure S3 storage?"
  • "Search the web for LlamaIndex best practices"

🐳 Docker Development Environment

For local development, use Docker Compose to run all external services (Elasticsearch, MongoDB, PostgreSQL, FalkorDB, MinIO):

# Start all services
docker-compose --profile all up -d

# Copy environment template
cp .env.docker .env
# Edit .env to add your LLM API keys

# Stop services
docker-compose down

Use profiles to start only what you need:

docker-compose --profile storage up -d  # Elasticsearch, MongoDB, MinIO
docker-compose --profile memory up -d   # PostgreSQL, FalkorDB

Full documentation: See Docker Setup Guide for service details, ports, credentials, and troubleshooting.

🚀 Getting Started

Create Your First Agent

Here's a complete, working agent with LlamaIndex:

from typing import List
from agent_framework import LlamaIndexAgent, create_basic_agent_server

class MyAgent(LlamaIndexAgent):
    def __init__(self):
        super().__init__(
            agent_id="my_calculator_agent",
            name="Calculator Agent",
            description="A helpful calculator assistant that can perform basic math operations."
        )
    
    def get_agent_prompt(self) -> str:
        """Define your agent's behavior and personality."""
        return "You are a helpful calculator assistant."
  
    def get_agent_tools(self) -> List[callable]:
        """Define the tools your agent can use.
        
        Tools are automatically converted to LlamaIndex FunctionTool instances.
        The function name becomes the tool name, and the docstring becomes the description.
        """
        def add(a: float, b: float) -> float:
            """Add two numbers together."""
            return a + b
        
        def multiply(a: float, b: float) -> float:
            """Multiply two numbers together."""
            return a * b
        
        # Just return the functions - automatic conversion to FunctionTool
        return [add, multiply]

# Start server - includes streaming, session management, web UI
create_basic_agent_server(MyAgent, port=8000)

Required Methods:

  • __init__() - Call super().__init__(agent_id, name, description) with required identity info
  • get_agent_prompt() - Return system prompt string
  • get_agent_tools() - Return list of tools (can be empty)

Optional Methods (have default implementations):

  • create_fresh_context() - Create new LlamaIndex Context (default provided)
  • serialize_context(ctx) - Serialize context for persistence (default provided)
  • deserialize_context(state) - Deserialize context from state (default provided)
  • initialize_agent() - Customize agent creation (default: FunctionAgent)
  • configure_session() - Add session setup logic

That's it! The framework provides default implementations for context management (state persistence), so you only need to implement the three core methods above.

Run it:

# Set your API key
export OPENAI_API_KEY=sk-your-key-here

# Run the agent
python my_agent.py

# Open http://localhost:8000/ui

⚙️ Configure Your Agent

Environment Setup

Create a .env file:

# Required: At least one API key
OPENAI_API_KEY=sk-your-openai-key
ANTHROPIC_API_KEY=sk-ant-your-anthropic-key
GEMINI_API_KEY=your-gemini-key

# Model Configuration
DEFAULT_MODEL=gpt-5-mini

# Multi-Model Routing (Auto Mode)
DEFAULT_MODEL_MODE=auto                    # "auto" or specific model name
AUTO_CLASSIFIER_MODEL=gpt-4o-mini          # Model for complexity classification
PREFERRED_LIGHT_MODELS=gpt-4o-mini,claude-haiku-4-5-20251001
PREFERRED_STANDARD_MODELS=gpt-5-mini,claude-sonnet-4-5-20250929
PREFERRED_ADVANCED_MODELS=gpt-5,claude-opus-4-1-20250805

# Session Storage (optional)
SESSION_STORAGE_TYPE=memory  # or "mongodb" or "elasticsearch"
MONGODB_CONNECTION_STRING=mongodb://localhost:27017
MONGODB_DATABASE_NAME=agent_sessions

# File Storage (optional)
LOCAL_STORAGE_PATH=./file_storage
AWS_S3_BUCKET=my-bucket
S3_AS_DEFAULT=false

Remote Configuration (Elasticsearch-Managed Agents)

For production deployments, you can configure agents to be managed entirely via Elasticsearch, allowing ops teams to modify prompts and models at runtime without code deployments.

Enable remote configuration:

from agent_framework import LlamaIndexAgent

class OpsMangedAgent(LlamaIndexAgent):
    def __init__(self):
        super().__init__(
            agent_id="ops_managed_agent",
            name="Ops Managed Agent",
            description="An agent configured via Elasticsearch."
        )
    
    @classmethod
    def get_use_remote_config(cls) -> bool:
        """Enable Elasticsearch-only configuration."""
        return True
    
    def get_agent_prompt(self) -> str:
        # Fallback prompt if ES config not available
        return "You are a helpful assistant."
    
    def get_agent_tools(self) -> list:
        return []

Behavior:

use_remote_config Server Startup Session Init
False (default) Pushes hardcoded config to ES if different Merges ES config with hardcoded
True Skips pushing to ES Reads ES config only (no merge)

When to use:

  • use_remote_config=False (default): Code-managed agents where developers control the config
  • use_remote_config=True: Ops-managed agents where configuration is modified via ES/Kibana

Fallback: If use_remote_config=True but no ES config exists, the system falls back to hardcoded config and pushes it to ES with a warning.

🎯 Multi-Model Selection

The framework includes intelligent model routing that automatically selects the best model based on query complexity.

Auto Mode (Default)

When DEFAULT_MODEL_MODE=auto, the system analyzes each query and routes it to the appropriate tier:

Tier Icon Use Case Example Models
Light 💨 Simple queries, greetings, basic info gpt-4o-mini, claude-haiku
Standard ⚖️ Typical questions, explanations gpt-5-mini, claude-sonnet
Advanced 🧠 Complex analysis, creative tasks gpt-5, claude-opus

Benefits:

  • 💰 Cost optimization - Use cheaper models for simple queries
  • Speed - Faster responses for trivial messages
  • 🎯 Quality - Powerful models for complex tasks

Manual Model Selection

Users can also select a specific model from the UI dropdown:

  • Models grouped by tier with availability indicators (✓/✗)
  • Preference persisted in localStorage
  • Real-time routing indicator shows selected model

Configuration

# Default mode when no user preference
DEFAULT_MODEL_MODE=auto

# Model used for complexity classification (should be fast and cheap)
AUTO_CLASSIFIER_MODEL=gpt-4o-mini

# Preferred models per tier (comma-separated, in order of preference)
PREFERRED_LIGHT_MODELS=gpt-4o-mini,claude-haiku-4-5-20251001,gemini-2.5-flash-lite
PREFERRED_STANDARD_MODELS=gpt-5-mini,claude-sonnet-4-5-20250929,gemini-2.5-flash
PREFERRED_ADVANCED_MODELS=gpt-5,claude-opus-4-1-20250805,gemini-2.5-pro

API Endpoint

# Get available models
curl http://localhost:8000/api/models

# Response
{
  "models_by_tier": {
    "light": [{"id": "gpt-4o-mini", "provider": "openai", "available": true}, ...],
    "standard": [...],
    "advanced": [...]
  },
  "default_mode": "auto",
  "classifier_model": "gpt-4o-mini"
}

Backward Compatibility

Agents with hardcoded models continue to work without changes:

class MyAgent(LlamaIndexAgent):
    def __init__(self):
        super().__init__(...)
        self._default_model = "gpt-5"  # This model will always be used

LlamaIndex Agent Configuration

Control model behavior in your agent:

class MyAgent(LlamaIndexAgent):
    def __init__(self):
        super().__init__(
            agent_id="my_agent",
            name="My Agent",
            description="A helpful assistant."
        )
        # Default model config (can be overridden per session)
        self.default_temperature = 0.7
        self.default_model = "gpt-5-mini"

Runtime Configuration:

Users can override settings per session via the API or web UI:

  • Model selection (gpt-5, claude-4.5-sonnet, gemini-pro)
  • Temperature (0.0 - 1.0)
  • Max tokens
  • System prompt override

🛠️ Off-the-Shelf Tools

The framework provides ready-to-use tools for common tasks. Import from agent_framework.tools:

File Management Tools

from agent_framework.tools import (
    CreateFileTool,      # Create text files
    ListFilesTool,       # List stored files
    ReadFileTool,        # Read file contents
    GetFilePathTool      # Get file system path
)

PDF Generation Tools

from agent_framework.tools import (
    CreatePDFFromMarkdownTool,  # Generate PDF from markdown
    CreatePDFFromHTMLTool,      # Generate PDF from HTML
    CreatePDFWithImagesTool     # Generate PDF with embedded images
)

Chart & Visualization Tools

from agent_framework.tools import (
    ChartToImageTool,    # Convert Chart.js config to PNG
    MermaidToImageTool,  # Convert Mermaid diagram to PNG
    TableToImageTool     # Convert table data to PNG
)

Using Off-the-Shelf Tools

from agent_framework import LlamaIndexAgent
from agent_framework.storage.file_system_management import FileStorageFactory
from agent_framework.tools import CreateFileTool, ListFilesTool, CreatePDFFromMarkdownTool

class MyAgent(LlamaIndexAgent):
    def __init__(self):
        super().__init__(
            agent_id="my_agent",
            name="File Agent",
            description="An assistant with file storage and PDF generation capabilities."
        )
        self.file_storage = None
        
        # Initialize tools
        self.tools = [
            CreateFileTool(),
            ListFilesTool(),
            CreatePDFFromMarkdownTool()
        ]
    
    async def _ensure_file_storage(self):
        if self.file_storage is None:
            self.file_storage = await FileStorageFactory.create_storage_manager()
    
    async def configure_session(self, session_configuration):
        user_id = session_configuration.get('user_id', 'default_user')
        session_id = session_configuration.get('session_id')
        
        await self._ensure_file_storage()
        
        # Inject dependencies into tools
        for tool in self.tools:
            tool.set_context(
                file_storage=self.file_storage,
                user_id=user_id,
                session_id=session_id
            )
        
        await super().configure_session(session_configuration)
    
    def get_agent_tools(self):
        return [tool.get_tool_function() for tool in self.tools]

Key Pattern:

  1. Instantiate tools in __init__()
  2. Initialize file storage in configure_session()
  3. Inject context with tool.set_context()
  4. Return tool functions in get_agent_tools()

🔧 Create Custom Tools

Custom tools extend your agent's capabilities. The tool name and docstring are crucial - they tell the agent when and how to use the tool.

Basic Custom Tool

def get_weather(city: str) -> str:
    """Get the current weather for a specific city.
    
    Args:
        city: The name of the city to get weather for
        
    Returns:
        A description of the current weather
    """
    # Your implementation here
    return f"The weather in {city} is sunny, 22°C"

# Add to your agent
class MyAgent(LlamaIndexAgent):
    def get_agent_tools(self):
        # Just return the function - automatic conversion to FunctionTool
        # Function name = tool name, docstring = tool description
        return [get_weather]

Important:

  • Function name should be explicit and descriptive (e.g., get_weather, not weather)
  • Docstring is added as the tool description - the agent uses this to understand when to call the tool
  • Type hints help the agent understand parameters
  • Args/Returns documentation provides additional context

Custom Tool with Dependencies

For tools that need file storage or other dependencies:

from agent_framework.tools.base_tool import AgentTool

class MyCustomTool(AgentTool):
    """Base class handles dependency injection."""
    
    def execute(self, param1: str, param2: int) -> str:
        """Process data and store results.
        
        Args:
            param1: Description of first parameter
            param2: Description of second parameter
            
        Returns:
            Result description
        """
        # Access injected dependencies
        user_id = self.user_id
        session_id = self.session_id
        file_storage = self.file_storage
        
        # Your logic here
        result = f"Processed {param1} with {param2}"
        
        # Store file if needed
        file_id = await file_storage.store_file(
            user_id=user_id,
            session_id=session_id,
            filename="result.txt",
            content=result.encode()
        )
        
        return f"Result stored with ID: {file_id}"

# Use in your agent
class MyAgent(LlamaIndexAgent):
    def __init__(self):
        super().__init__(
            agent_id="my_agent",
            name="My Agent",
            description="A helpful assistant with custom tools."
        )
        self.custom_tool = MyCustomTool()
    
    async def configure_session(self, session_configuration):
        # Inject dependencies
        self.custom_tool.set_context(
            file_storage=self.file_storage,
            user_id=session_configuration.get('user_id'),
            session_id=session_configuration.get('session_id')
        )
        await super().configure_session(session_configuration)
    
    def get_agent_tools(self):
        return [self.custom_tool.get_tool_function()]

Tool Naming Best Practices

# ✅ GOOD - Explicit and clear
def calculate_mortgage_payment(principal: float, rate: float, years: int) -> float:
    """Calculate monthly mortgage payment."""
    pass

def send_email_notification(recipient: str, subject: str, body: str) -> bool:
    """Send an email notification to a recipient."""
    pass

# ❌ BAD - Too vague
def calculate(x: float, y: float) -> float:
    """Do calculation."""
    pass

def send(data: str) -> bool:
    """Send something."""
    pass

🔌 Adding MCP Servers

MCP (Model Context Protocol) allows your agent to connect to external tools and services.

Basic MCP Setup

from llama_index.tools.mcp import BasicMCPClient, McpToolSpec

class MyAgent(LlamaIndexAgent):
    def __init__(self):
        super().__init__(
            agent_id="my_agent",
            name="MCP Agent",
            description="An assistant with access to external tools via MCP servers."
        )
        self.mcp_tools = []
        self._mcp_initialized = False
    
    async def _initialize_mcp_tools(self):
        """Load tools from MCP servers."""
        if self._mcp_initialized:
            return
        
        # Configure your MCP server
        mcp_configs = [
            {
                "command": "uvx",
                "args": ["mcp-server-filesystem"],
                "env": {"FILESYSTEM_ROOT": "/path/to/workspace"}
            }
        ]
        
        for config in mcp_configs:
            client = BasicMCPClient(
                config["command"],
                args=config["args"],
                env=config.get("env", {})
            )
            
            # Load tools from the MCP server
            mcp_tool_spec = McpToolSpec(client=client)
            tools = await mcp_tool_spec.to_tool_list_async()
            self.mcp_tools.extend(tools)
        
        self._mcp_initialized = True
    
    async def initialize_agent(self, model_name, system_prompt, tools, **kwargs):
        # Load MCP tools before initializing agent
        await self._initialize_mcp_tools()
        
        # Combine with other tools
        all_tools = self.get_agent_tools()
        await super().initialize_agent(model_name, system_prompt, all_tools, **kwargs)
    
    def get_agent_tools(self):
        # Return built-in tools + MCP tools
        return self.mcp_tools

Multiple MCP Servers

def _get_mcp_configs(self):
    """Configure multiple MCP servers."""
    return [
        {
            "name": "filesystem",
            "command": "uvx",
            "args": ["mcp-server-filesystem"],
            "env": {"FILESYSTEM_ROOT": "/workspace"}
        },
        {
            "name": "github",
            "command": "uvx",
            "args": ["mcp-server-github"],
            "env": {
                "GITHUB_TOKEN": os.getenv("GITHUB_TOKEN")
            }
        },
        {
            "name": "python",
            "command": "uvx",
            "args": ["mcp-run-python", "stdio"]
        }
    ]

Popular MCP Servers

# Filesystem operations
uvx mcp-server-filesystem

# GitHub integration
uvx mcp-server-github

# Python code execution
uvx mcp-run-python

# Database access
uvx mcp-neo4j-cypher
uvx mcp-server-postgres

Installation:

# Install with MCP support
uv add agent-framework-lib[llamaindex,mcp]

# Or add MCP to existing installation
uv add agent-framework-lib[mcp]

# MCP servers are run via uvx (no separate install needed)

Using Deno-based MCP servers:

If you need to use Deno-based MCP servers (like TypeScript MCP servers), the framework provides a helper function to ensure Deno works correctly even if it's not in your PATH:

from agent_framework import get_deno_command

# Configure a Deno-based MCP server
mcp_config = {
    "command": get_deno_command(),  # Automatically uses correct Deno path
    "args": ["run", "-N", "jsr:@pydantic/mcp-run-python", "stdio"]
}

This helper function:

  • ✅ Automatically finds Deno even if not in system PATH
  • ✅ Works seamlessly after agent-framework-post-install
  • ✅ Returns absolute path to Deno binary when needed

🧠 Memory Module

Add long-term semantic memory to your agents, enabling them to remember information across conversations and provide personalized responses.

Quick Start

from agent_framework import LlamaIndexAgent
from agent_framework.memory import MemoryConfig

class MyMemoryAgent(LlamaIndexAgent):
    def __init__(self):
        super().__init__(
            agent_id="memory_agent",
            name="Memory Agent",
            description="An agent with long-term memory."
        )
    
    def get_agent_prompt(self) -> str:
        return "You are a helpful assistant that remembers user preferences."
    
    def get_agent_tools(self) -> list:
        return []
    
    def get_memory_config(self):
        """Enable memory - just override this method!"""
        return MemoryConfig.memori_simple(
            database_url="sqlite:///memory.db"
        )

Memory Providers

Provider Backend Best For
Memori SQLite, PostgreSQL, MySQL Fast queries, simple setup
Graphiti FalkorDB, Neo4j Complex relationships, temporal queries
Hybrid Both Best of both worlds

Configuration Options

# Memori with SQLite (simplest)
MemoryConfig.memori_simple(database_url="sqlite:///memory.db")

# Graphiti with FalkorDB
MemoryConfig.graphiti_simple(use_falkordb=True)

# Hybrid mode (both providers)
MemoryConfig.hybrid(
    memori_database_url="sqlite:///memory.db",
    graphiti_use_falkordb=True
)

Memory Modes

  • Passive Injection: Relevant memories automatically injected into prompts
  • Active Tools: Agent can explicitly recall_memory(), store_memory(), forget_memory()

Installation

# All memory support
uv add agent-framework-lib[memory]

# Or individual providers
uv add agent-framework-lib[memori]
uv add agent-framework-lib[graphiti]

More info: See Memory Installation Guide and Creating Agents Guide

🎯 Skills System

The Skills System provides modular, on-demand capability loading that reduces token consumption by ~80%. Instead of loading all instructions into every system prompt, skills deliver detailed instructions only when needed.

How It Works

BEFORE: System Prompt = Base (~500) + Rich Content (~3000) = ~3500 tokens/message
AFTER:  System Prompt = Base (~500) + Skills Discovery (~200) = ~700 tokens/message
        + On-demand skill loading (~500 tokens, one-time per skill)

Quick Start

Skills are automatically available in all agents via BaseAgent. No need to explicitly inherit from SkillsMixin:

from agent_framework import LlamaIndexAgent
from agent_framework import LlamaIndexAgent

class MySkillsAgent(LlamaIndexAgent):
    def __init__(self):
        super().__init__(
            agent_id="skills_agent",
            name="Skills Agent",
            description="An agent with on-demand capabilities."
        )
        # Built-in skills are automatically registered by BaseAgent.__init__
    
    def get_agent_prompt(self) -> str:
        # Skills discovery prompt is automatically appended by BaseAgent
        return "You are a helpful assistant."
    
    def get_agent_tools(self) -> list:
        # Skill tools are auto-loaded - no need to add them manually!
        return []  # Only return custom tools specific to your agent

Built-in Skills

Category Skills
Visualization chart, mermaid, table
Document file, pdf, pdf_with_images, file_access
Web web_search
Multimodal multimodal
UI form, optionsblock, image_display

Agent Workflow

  1. Agent receives user request: "Create a bar chart"
  2. Agent calls list_skills() → sees available skills
  3. Agent calls load_skill("chart") → gets Chart.js instructions
  4. Agent uses save_chart_as_image() tool with loaded knowledge
  5. Optionally calls unload_skill("chart") when done

More info: See Creating Agents Guide and skills_demo_agent.py

📝 Rich Content Capabilities (Automatic)

All agents automatically support rich content generation including:

  • 📊 Mermaid diagrams (version 10.x syntax)
  • 📈 Chart.js charts (bar, line, pie, doughnut, polarArea, radar, scatter, bubble)
  • 📋 Interactive forms (formDefinition JSON)
  • 🔘 Clickable option buttons (optionsblock)
  • 📑 Formatted tables (tabledata)

This is automatic! The framework injects rich content instructions into all agent system prompts by default. You don't need to add anything to your get_agent_prompt().

Disabling Rich Content

If you need to disable automatic rich content injection for a specific agent or session:

Via Session Configuration (UI or API):

# When initializing a session
session_config = {
    "user_id": "user123",
    "session_id": "session456",
    "enable_rich_content": False  # Disable rich content
}

Via Web UI: Uncheck the "Enable rich content capabilities" checkbox when creating a session.

Format Examples

Chart:

```chart
{
  "type": "chartjs",
  "chartConfig": {
    "type": "bar",
    "data": {
      "labels": ["Mon", "Tue", "Wed"],
      "datasets": [{
        "label": "Sales",
        "data": [120, 150, 100]
      }]
    }
  }
}
```

Options Block:

```optionsblock
{
  "question": "What would you like to do?",
  "options": [
    {"text": "Continue", "value": "continue"},
    {"text": "Cancel", "value": "cancel"}
  ]
}
```

Table:

```tabledata
{
  "caption": "Sales Data",
  "headers": ["Month", "Revenue"],
  "rows": [["Jan", "$1000"], ["Feb", "$1200"]]
}
```

🎯 All Together: Complete Multi-Skills Agent

Here's a complete example combining all features - MCP, off-the-shelf tools, custom tools, and format support:

import os
from typing import List, Any, Dict
from agent_framework import LlamaIndexAgent, create_basic_agent_server
from agent_framework.storage.file_system_management import FileStorageFactory
from agent_framework.tools import (
    CreateFileTool, ListFilesTool, ReadFileTool,
    CreatePDFFromMarkdownTool, CreatePDFFromHTMLTool,
    ChartToImageTool, MermaidToImageTool, CreatePDFWithImagesTool, TableToImageTool
)
from llama_index.tools.mcp import BasicMCPClient, McpToolSpec

class MultiSkillsAgent(LlamaIndexAgent):
    def __init__(self):
        super().__init__(
            agent_id="multi_skills_agent_v1",
            name="Multi-Skills Agent",
            description="A versatile assistant with file storage, PDF generation, charts, and MCP capabilities."
        )
        self.file_storage = None
        self.mcp_tools = []
        self._mcp_initialized = False
        
        # Off-the-shelf tools
        self.file_tools = [
            CreateFileTool(),
            ListFilesTool(),
            ReadFileTool(),
            CreatePDFFromMarkdownTool(),
            CreatePDFFromHTMLTool(),
            ChartToImageTool(),
            MermaidToImageTool(),
            TableToImageTool(),
            CreatePDFWithImagesTool()
        ]
    
    async def _ensure_file_storage(self):
        if self.file_storage is None:
            self.file_storage = await FileStorageFactory.create_storage_manager()
    
    async def configure_session(self, session_configuration: Dict[str, Any]):
        user_id = session_configuration.get('user_id', 'default_user')
        session_id = session_configuration.get('session_id')
        
        await self._ensure_file_storage()
        
        # Inject context into file tools
        for tool in self.file_tools:
            tool.set_context(
                file_storage=self.file_storage,
                user_id=user_id,
                session_id=session_id
            )
        
        await super().configure_session(session_configuration)
    
    async def _initialize_mcp_tools(self):
        if self._mcp_initialized:
            return
        
        try:
            from llama_index.tools.mcp import BasicMCPClient, McpToolSpec
        except ImportError:
            return
        
        # Configure MCP servers
        mcp_configs = [
            {
                "command": "uvx",
                "args": ["mcp-run-python", "stdio"]
            }
        ]
        
        for config in mcp_configs:
            try:
                client = BasicMCPClient(config["command"], args=config["args"])
                mcp_tool_spec = McpToolSpec(client=client)
                tools = await mcp_tool_spec.to_tool_list_async()
                self.mcp_tools.extend(tools)
            except Exception as e:
                print(f"MCP initialization failed: {e}")
        
        self._mcp_initialized = True
    
    def get_agent_prompt(self) -> str:
        return """You are a helpful assistant with multiple capabilities:
        
        - Execute Python code via MCP
        - Create, read, and list files
        - Generate PDF documents from markdown or HTML
        - Create charts, mermaid diagrams, and tables
        - Present forms and option blocks to users
        
        You can generate markdown, mermaid diagrams, charts, code blocks, forms and optionsblocks.
        ALWAYS include option blocks when asking the user to select an option!
        
        ... See the format section above
        """
    
    def get_agent_tools(self) -> List[callable]:
        # Combine all tools
        all_tools = []
        all_tools.extend([tool.get_tool_function() for tool in self.file_tools])
        all_tools.extend(self.mcp_tools)
        return all_tools
    
    async def initialize_agent(self, model_name, system_prompt, tools, **kwargs):
        await self._initialize_mcp_tools()
        all_tools = self.get_agent_tools()
        await super().initialize_agent(model_name, system_prompt, all_tools, **kwargs)

# Start the server
if __name__ == "__main__":
    create_basic_agent_server(MultiSkillsAgent, port=8000)

Run it:

export OPENAI_API_KEY=sk-your-key
python multi_skills_agent.py
# Open http://localhost:8000/ui

Full example: See examples/agent_example_multi_skills.py for the complete implementation with full format support prompt.

🌐 Web Interface

The framework includes a built-in web UI for testing and interacting with your agent.

Access: http://localhost:8000/ui

Features:

  • 💬 Real-time message streaming
  • 🎨 Rich format rendering (charts, tables, mermaid diagrams)
  • 📁 File upload and management
  • ⚙️ Model and parameter configuration
  • 💾 Session management
  • 📊 Conversation history
  • 🎯 Interactive option blocks and forms

Quick Test:

# Start your agent
python my_agent.py

# Open in browser
open http://localhost:8000/ui

The UI automatically detects and renders:

  • Chart.js visualizations from chart blocks
  • Mermaid diagrams from mermaid blocks
  • Tables from tabledata blocks
  • Interactive forms from formDefinition JSON
  • Clickable options from optionsblock

API Documentation: http://localhost:8000/docs (Swagger UI)

📚 Additional Resources

Documentation

Examples

API Endpoints

Core:

  • POST /message - Send message to agent
  • POST /init - Initialize session
  • POST /end - End session
  • GET /sessions - List sessions

Files:

  • POST /files/upload - Upload file
  • GET /files/{file_id}/download - Download file
  • GET /files - List files

Full API docs: http://localhost:8000/docs

Authentication

# API Key Authentication
REQUIRE_AUTH=true
API_KEYS=sk-key-1,sk-key-2
curl -H "Authorization: Bearer sk-key-1" \
  http://localhost:8000/message \
  -H "Content-Type: application/json" \
  -d '{"query": "Hello!"}'

Quick Links:

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agent_framework_lib-0.5.8.post17.tar.gz (904.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

agent_framework_lib-0.5.8.post17-py3-none-any.whl (815.0 kB view details)

Uploaded Python 3

File details

Details for the file agent_framework_lib-0.5.8.post17.tar.gz.

File metadata

File hashes

Hashes for agent_framework_lib-0.5.8.post17.tar.gz
Algorithm Hash digest
SHA256 312bdbef6dece152c0cc613b140d9df830e5894240ff7de243959a5caf28026d
MD5 002d929ecd4cdbcfad3b7a830e71cd07
BLAKE2b-256 6c8ad1723a93828892d0ee8510b3f2a0731a7aa45c009181e867b1f70f780230

See more details on using hashes here.

File details

Details for the file agent_framework_lib-0.5.8.post17-py3-none-any.whl.

File metadata

File hashes

Hashes for agent_framework_lib-0.5.8.post17-py3-none-any.whl
Algorithm Hash digest
SHA256 737c2af7e4e2475912902c7e7de2fa57da8abce5e4681178c3b0a61a898b2bc4
MD5 b707ea4a6a0489a8f686a24d23c99ffc
BLAKE2b-256 a0b793491a744605a8f2d0e77a11167d6fd556f72b13ab317444638224f7e60d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page