Web Agents that Learn Tools - Automatic tool discovery from websites

These details have not been verified by PyPI

Project links

Project description

WALT: Web Agents that Learn Tools

Web Agents that Learn Tools - Automatic tool discovery from websites for LLM agents

WALT enables LLM agents to automatically discover and learn reusable tools from any website. Point WALT at a website, and it will explore, understand, and generate ready-to-use tool definitions.

WALT Overview

🚀 Quick Start

Installation

# Install uv (faster than pip)
curl -LsSf https://astral.sh/uv/install.sh | sh

# Install WALT (ideally inside a venv)
uv venv && source .venv/bin/activate
uv pip install walt
playwright install chromium

# Set up configuration
walt init  # Creates .env file for API keys

Basic Usage

# Run agent with tools
walt agent "find and return the URL of the cheapest blue kayak" \
  --tools walt-tools/classifieds/ \
  --start-url http://localhost:9980

# Discover new tools from any website
walt discover --url https://example.com

# Or generate a specific tool (faster!)
walt generate --url https://zillow.com --goal "Search for homes with filters"

# List available tools
walt list walt-tools/shopping/

# Start an MCP server
walt serve walt-tools/classifieds/ --port 8000

# Record a demonstration
walt record https://example.com --name my_tool

🐍 Python SDK

WALT can be used programmatically for tool discovery and agent execution:

# Tool discovery
from walt.tools.discovery import propose, generate
import asyncio

async def discover_tools():
    class Args:
        base_url = "https://example.com"
        output_dir = "my-tools"
        llm = "gpt-5-mini"
        planner_llm = "gpt-5-mini"
        auth_file = None  # Optional: path to Playwright storage_state.json
        max_processes = 16
        
    args = Args()
    
    # Phase 1: Discover candidates
    tools = await propose.discover_candidates(args)
    
    # Phase 2: Generate tools
    await generate.generate_tools(args, tools)

asyncio.run(discover_tools())

# Agent with tools
from walt.browser_use.custom.agent_zoo import VWA_Agent
from walt.browser_use.custom.browser import VWABrowser, BrowserConfig
from walt.browser_use import Controller
from walt.tools.discovery.register import register_tools_from_directory
from langchain_openai import ChatOpenAI

async def run_agent():
    # Setup browser and controller
    browser = VWABrowser(BrowserConfig(headless=False))
    controller = Controller()
    
    # Load tools
    register_tools_from_directory(
        controller=controller,
        tool_dir="walt-tools/classifieds/",
        llm=ChatOpenAI(model="gpt-5-mini")
    )
    
    # Create and run agent
    agent = VWA_Agent(
        task="Find the cheapest blue kayak",
        llm=ChatOpenAI(model="gpt-5-mini"),
        browser=browser,
        controller=controller,
        max_actions_per_step=30
    )
    
    await agent.run()
    await browser.close()

asyncio.run(run_agent())

📖 CLI Commands

`walt agent <task>`

Run an agent to complete a task, optionally using tools.

walt agent "find cheap apartments" --tools walt-tools/classifieds/ --start-url https://www.zillow.com
walt agent "book a flight to NYC" --llm gemini-2.5-flash --max-steps 100 --start-url https://www.google.com/flights
walt agent "search for blue kayaks" --save-gif kayak_search.gif  # Record as GIF

Key options: --tools, --llm, --headless, --max-steps, --start-url, --save-gif

Recording: Use --save-gif <path> to save the agent's browser interactions as an animated GIF with step-by-step actions overlaid.

`walt discover --url <url>`

Discover and generate tools by exploring a website.

walt discover --url https://example.com
walt discover --url http://localhost:9980 --output walt-tools/mysite
walt discover --url https://example.com --auth-file .auth/state.json
walt discover --url https://example.com --llm gpt-4o --max-processes 8

Key options: --url, --output, --llm, --auth-file, --max-processes, --force-regenerate

Note: To reproduce results on research benchmarks, see BENCHMARKS.md.

`walt generate --url <url> --goal <goal>`

Generate a specific tool without exploration (when you know what you want).

walt generate --url https://airbnb.com --goal "Search for homes available in a location for provided dates and guest details"
walt generate --url https://zillow.com --goal "View property details" -o walt-tools/zillow/
walt generate --url https://example.com --goal "Book appointment" --auth-file .auth/state.json

Key options: --url, --goal, --output, --llm, --auth-file

Use case: When you already know what tool you need and don't want to wait for exploratory discovery.

`walt record <url>`

Record a human demonstration and convert it to a tool.

walt record https://example.com --name search_products

`walt serve <tool_dir>`

Start an MCP server with your tools.

walt serve walt-tools/shopping/ --port 8000

`walt list [tool_dir]`

List discovered tools.

walt list                           # All tools
walt list walt-tools/classifieds/   # Specific directory
walt list --detailed                # Detailed table view

The examples/ directory contains detailed examples of how to use WALT, including:

01_simple_discovery.py - Simple tool discovery
02_agent_with_tools.py - Using an agent with discovered tools
03_advanced_tool_use.py - Advanced tool usage patterns

📦 Tool Format

WALT tools are JSON files with a simple structure:

{
  "name": "search_products",
  "description": "Search for products on the site",
  "inputs": {
    "query": {
      "type": "string",
      "description": "Search query",
      "required": true
    }
  },
  "steps": [
    {
      "type": "navigation",
      "url": "https://example.com"
    },
    {
      "type": "input",
      "cssSelector": "#search-box",
      "text": "{query}"
    },
    {
      "type": "click",
      "cssSelector": "#search-button"
    },
    {
      "type": "extract_page_content",
      "goal": "Extract search results"
    }
  ]
}

Step types:

Deterministic: navigation, click, input, select_change, key_press, scroll
Agentic: extract_page_content, wait_for_page_load

See walt-tools/ for 50 pre-discovered examples.

🛠️ Development

Install from Source

git clone https://github.com/salesforceairesearch/walt.git
cd walt
uv venv && source .venv/bin/activate
uv pip install -e ".[dev]"
playwright install chromium

Project Structure

walt/
├── src/walt/
│   ├── browser_use/         # Browser automation
│   ├── tools/               # Tool system (discovery, execution, demonstration)
│   ├── benchmarks/          # WebArena/VisualWebArena evaluation
│   ├── cli.py               # CLI entry point
│   └── config.py            # Configuration system
├── experiment_configs/
│   └── ...                  # Experiment & benchmark configs
├── walt-tools/              # Pre-discovered tools
└── examples/                # Example scripts

Configuration

Use experiment configs to define reproducible evaluation runs:

# experiment_configs/my_experiment.yaml
name: "My Experiment"
llm:
  agent_model: gpt-5
agent:
  max_steps: 100
output:
  dir: outputs/my-experiment

Run it: python src/walt/benchmarks/vwa/aeval.py --config experiment_configs/my_experiment.yaml

Reproducing Paper Results

Interested in reproducing results from our paper? See BENCHMARKS.md for:

WebArena and VisualWebArena setup
Running evaluations with experiment configs
Tool discovery for benchmarks
Detailed configuration options

🤝 Citation

If you use WALT in your research, please cite:

@article{walt2025,
  title={WALT: Web Agents that Learn Tools},
  author={Viraj Prabhu, Yutong Dai, Matthew Fernandez, Jing Gu, Krithika Ramakrishnan, Yanqi Luo, Silvio Savarese, Caiming Xiong, Junnan Li, Zeyuan Chen, Ran Xu},
  journal={arXiv preprint arXiv:2510.01524},
  year={2025}
}

📄 License

MIT - See LICENSE

🙏 Acknowledgments

We are grateful to the browser-use team for the following projects upon which WALT is built:

browser-use
workflow-use

We are also grateful to the WebArena and VisualWebArena teams for the benchmark datasets.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.0

Oct 22, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sfr_walt-0.1.0.tar.gz (294.4 kB view details)

Uploaded Oct 22, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sfr_walt-0.1.0-py3-none-any.whl (356.1 kB view details)

Uploaded Oct 22, 2025 Python 3

File details

Details for the file sfr_walt-0.1.0.tar.gz.

File metadata

Download URL: sfr_walt-0.1.0.tar.gz
Upload date: Oct 22, 2025
Size: 294.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for sfr_walt-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`9941617064ac73d229a11ee51eb7660c95b72a86302667e3b36d7221a159dacb`
MD5	`806d9c416de4916ed1eab424eea94375`
BLAKE2b-256	`fe7e5d1c8eed1b5f380e8cee8004cbfc7b7067dd16bd31a6846b71403c2e03fa`

See more details on using hashes here.

File details

Details for the file sfr_walt-0.1.0-py3-none-any.whl.

File metadata

Download URL: sfr_walt-0.1.0-py3-none-any.whl
Upload date: Oct 22, 2025
Size: 356.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for sfr_walt-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a29a76ac6935195fb0eab76f2bbb68eea1bec636f75394dca357d2ad642b807e`
MD5	`996e69caaaeab301f29a5560d3469270`
BLAKE2b-256	`51a57b396c28be0ca3513f58cc6190d3f991c3fd5842a2bb09367801c8006314`

See more details on using hashes here.

sfr-walt 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

WALT: Web Agents that Learn Tools

🚀 Quick Start

Installation

Basic Usage

🐍 Python SDK

📖 CLI Commands

walt agent <task>

walt discover --url <url>

walt generate --url <url> --goal <goal>

walt record <url>

walt serve <tool_dir>

walt list [tool_dir]

📦 Tool Format

🛠️ Development

Install from Source

Project Structure

Configuration

Reproducing Paper Results

🤝 Citation

📄 License

🙏 Acknowledgments

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`walt agent <task>`

`walt discover --url <url>`

`walt generate --url <url> --goal <goal>`

`walt record <url>`

`walt serve <tool_dir>`

`walt list [tool_dir]`