Official Python SDK for the Yutori API
Project description
Yutori Python SDK & CLI
The official Python library and CLI for the Yutori API.
Yutori provides APIs for building web agents that autonomously execute tasks on the web. The SDK offers both synchronous and asynchronous clients with full type annotations, plus a CLI for authentication and managing resources from the terminal.
Documentation
Installation
pip install yutori
Getting Started
Authentication
The easiest way to authenticate is to run this in your terminal:
Terminal:
yutori auth login
This opens your browser to log in with your Yutori account and saves an API key to ~/.yutori/config.json. The SDK and CLI automatically use this saved key.
Alternatively, you can set the YUTORI_API_KEY environment variable, or pass the key directly:
from yutori import YutoriClient
# Uses saved credentials from `yutori auth login`, or YUTORI_API_KEY env var
client = YutoriClient()
# Or pass explicitly
client = YutoriClient(api_key="yt-...")
API key resolution order: explicit parameter > YUTORI_API_KEY env var > ~/.yutori/config.json.
API Overview
The Yutori API provides four main capabilities:
| API | Description | SDK Namespace |
|---|---|---|
| n1 | Pixels-to-actions LLM for browser control | client.chat |
| Browsing | One-time browser automation tasks | client.browsing |
| Research | Deep web research using 100+ tools | client.research |
| Scouting | Continuous web monitoring on a schedule | client.scouts |
n1 API
The n1 API is a pixels-to-actions LLM that processes screenshots and predicts browser actions (click, type, scroll, etc.). It follows the OpenAI Chat Completions interface:
response = client.chat.completions.create(
model="n1-latest",
messages=[
{
"role": "user",
"content": [
{
"type": "text",
"text": "Describe the screenshot and search for Yutori."
},
{
"type": "image_url",
"image_url": {
"url": "https://upload.wikimedia.org/wikipedia/commons/thumb/5/53/Google_homepage_%28as_of_January_2024%29.jpg/1280px-Google_homepage_%28as_of_January_2024%29.jpg"
}
}
]
}
]
)
# Get the thoughts
message = response.choices[0].message
print(message.content)
# Get the tool calls, such as browser interaction actions
if message.tool_calls:
for tool_call in message.tool_calls:
print(f"Action: {tool_call.function.name}")
print(f"Arguments: {tool_call.function.arguments}")
Live n1 model IDs and parameter docs are maintained at https://docs.yutori.com/reference/n1. The SDK forwards standard OpenAI chat-completions parameters through **kwargs, including tools, tool_choice, and response_format.
For Playwright users, the SDK provides a screenshot helper that captures and encodes images optimized for n1:
from yutori.n1 import aplaywright_screenshot_to_data_url
image_url = await aplaywright_screenshot_to_data_url(page)
messages = [
{
"role": "user",
"content": [
{"type": "text", "text": "Describe the screenshot and search for Yutori."},
{"type": "image_url", "image_url": {"url": image_url}},
],
}
]
Install the optional image dependency with pip install "yutori[n1]" if you want to use these screenshot helpers.
n1 tool calls use a normalized 1000x1000 coordinate space. The SDK provides public helpers so agent loops do not need to
re-implement that math:
from yutori.n1 import denormalize_coordinates
coords = [500, 250]
x, y = denormalize_coordinates(coords, width=1280, height=800)
For screenshot-heavy agent loops, the SDK also provides opt-in trimming helpers under yutori.n1:
from yutori.n1 import estimate_messages_size_bytes, trimmed_messages_to_fit
if estimate_messages_size_bytes(messages) > 9_500_000:
messages, size_bytes, removed = trimmed_messages_to_fit(
messages,
max_bytes=9_500_000,
keep_recent=6,
)
response = await client.chat.completions.create(
model="n1-latest",
messages=messages,
)
This keeps the raw OpenAI-compatible client.chat.completions.create(...) call unchanged, while giving Yutori users a safer
message-preparation helper for large screenshot histories. In long-lived loops, assign the trimmed copy back to your owned
history before the next step so old screenshots do not keep accumulating in memory. The size pre-check is there to avoid
deep-copying the full history on every step when trimming is not needed.
If you don't want to manage your own browser infrastructure, use the Browsing API which calls n1 on a cloud browser.
Browsing API
Run one-time browser automation tasks. An AI agent can operate either Yutori's cloud browser or Yutori Local on the desktop to complete your task.
# Create a browsing task
task = client.browsing.create(
task="Give me a list of all employees (names and titles) of Yutori.",
start_url="https://yutori.com",
)
# Poll for completion
import time
while True:
result = client.browsing.get(task["task_id"])
if result["status"] in ("succeeded", "failed"):
break
time.sleep(5)
print(result)
For tasks that involve logging in on a cloud browser, use require_auth to pick an auth-optimized provider:
task = client.browsing.create(
task="Log in and export the latest invoice.",
start_url="https://example.com/login",
require_auth=True,
)
To use Yutori Local with the user's existing logged-in desktop sessions instead of the cloud:
task = client.browsing.create(
task="Export the latest invoice from my dashboard.",
start_url="https://example.com/dashboard",
browser="local",
)
Failed browsing tasks may include a rejection_reason field to explain why the task was rejected.
Structured Output with Webhooks
You can define the output structure using a JSON schema dict or a Pydantic BaseModel class (Pydantic is optional):
from pydantic import BaseModel # optional dependency
class Employee(BaseModel):
name: str
title: str
task = client.browsing.create(
task="Give me a list of all employees (names and titles) of Yutori.",
start_url="https://yutori.com",
max_steps=75,
webhook_url="https://example.com/webhook",
output_schema=Employee, # auto-converted to JSON schema
)
Using a JSON schema dict instead
task = client.browsing.create(
task="Give me a list of all employees (names and titles) of Yutori.",
start_url="https://yutori.com",
max_steps=75,
webhook_url="https://example.com/webhook",
output_schema={
"type": "array",
"items": {
"type": "object",
"properties": {
"name": {"type": "string"},
"title": {"type": "string"}
}
}
}
)
Research API
Perform deep web research using 100+ MCP tools including search engines, APIs, and data sources.
task = client.research.create(
query="What are the latest developments in quantum computing from the past week?",
user_timezone="America/Los_Angeles",
)
# Poll for results
import time
while True:
result = client.research.get(task["task_id"])
if result["status"] in ("succeeded", "failed"):
break
time.sleep(5)
print(result)
If the research task needs access to a logged-in browser session, use Yutori Local:
task = client.research.create(
query="Review the latest updates in our vendor dashboard and summarize them.",
browser="local",
)
Failed research tasks may include a rejection_reason field to explain why the task was rejected.
Structured Output
from pydantic import BaseModel # optional dependency
class Finding(BaseModel):
title: str
summary: str
source_url: str
task = client.research.create(
query="What are the latest developments in quantum computing?",
user_timezone="America/Los_Angeles",
webhook_url="https://example.com/webhook",
output_schema=Finding, # auto-converted to JSON schema
)
Using a JSON schema dict instead
task = client.research.create(
query="What are the latest developments in quantum computing?",
user_timezone="America/Los_Angeles",
webhook_url="https://example.com/webhook",
output_schema={
"type": "array",
"items": {
"type": "object",
"properties": {
"title": {"type": "string"},
"summary": {"type": "string"},
"source_url": {"type": "string"}
}
}
}
)
Scouting API
Scouts run on a configurable schedule to monitor the web and send notifications when relevant updates occur.
from yutori import YutoriClient
client = YutoriClient(api_key="yt-...")
# Create a scout that monitors for updates
scout = client.scouts.create(
query="Tell me about the latest news, product updates, and announcements about Yutori AI",
)
print(f"Created scout: {scout['id']}")
# List all active scouts
scouts = client.scouts.list(status="active")
# Get a specific scout
scout = client.scouts.get("scout_abc123")
# Pause a scout
client.scouts.update("scout_abc123", status="paused")
# Resume a scout
client.scouts.update("scout_abc123", status="active")
# Archive a scout
client.scouts.update("scout_abc123", status="done")
# Get scout updates
updates = client.scouts.get_updates("scout_abc123", limit=20)
# Delete a scout
client.scouts.delete("scout_abc123")
Structured Output with Webhooks
from pydantic import BaseModel # optional dependency
class NewsItem(BaseModel):
headline: str
summary: str
source_url: str
scout = client.scouts.create(
query="Tell me about the latest news, product updates, and announcements about Yutori AI",
output_interval=86400, # Daily
user_timezone="America/Los_Angeles",
skip_email=True,
webhook_url="https://example.com/webhook",
output_schema=NewsItem, # auto-converted to JSON schema
)
Scout responses may also include rejection_reason when a run or configuration is rejected.
Using a JSON schema dict instead
scout = client.scouts.create(
query="Tell me about the latest news, product updates, and announcements about Yutori AI",
output_interval=86400, # Daily
user_timezone="America/Los_Angeles",
skip_email=True,
webhook_url="https://example.com/webhook",
output_schema={
"type": "array",
"items": {
"type": "object",
"properties": {
"headline": {"type": "string"},
"summary": {"type": "string"},
"source_url": {"type": "string"}
}
}
}
)
Async Usage
The SDK provides an async client with the same interface:
import asyncio
from yutori import AsyncYutoriClient
async def main():
async with AsyncYutoriClient(api_key="yt-...") as client:
# All methods are async
usage = await client.get_usage()
print(usage)
scouts = await client.scouts.list()
print(scouts)
scout = await client.scouts.create(
query="Monitor https://example.com for updates",
output_interval=3600,
)
print(scout)
asyncio.run(main())
Error Handling
The SDK raises typed exceptions for API errors:
from yutori import YutoriClient, APIError, AuthenticationError
try:
client = YutoriClient(api_key="invalid-key")
client.get_usage()
except AuthenticationError as e:
print(f"Invalid API key: {e}")
except APIError as e:
print(f"API error (status {e.status_code}): {e.message}")
Exception Types
| Exception | Status Code | Description |
|---|---|---|
AuthenticationError |
401, 403 | Invalid or missing API key |
APIError |
4xx, 5xx | General API error with status code |
Configuration
from yutori import YutoriClient
client = YutoriClient(
api_key="yt-...", # Or: yutori auth login / YUTORI_API_KEY
base_url="https://api.yutori.com/v1", # Default
timeout=30.0, # Request timeout in seconds
)
CLI
The CLI provides commands for authentication and managing Yutori resources from the terminal.
# Version
yutori --version
# Authentication
yutori auth login # Log in via browser
yutori auth status # Show current auth status
yutori auth logout # Remove saved credentials
# Scouts
yutori scouts list # List your scouts
yutori scouts get SCOUT_ID # Get scout details
yutori scouts create -q "monitor for news" # Create a scout
yutori scouts create -q "monitor for news" -i daily -tz America/New_York
yutori scouts delete SCOUT_ID # Delete a scout
# Browsing
yutori browse run "extract all prices" https://example.com/products
yutori browse run "log in and continue" https://example.com/login --require-auth
yutori browse run "export dashboard data" https://example.com/dashboard --browser local
yutori browse run "fill out the form" https://example.com --agent n1 --max-steps 50
yutori browse get TASK_ID
# Research
yutori research run "latest developments in quantum computing" --browser local
yutori research run "local events this weekend" -tz America/Los_Angeles --location "San Francisco, CA, US"
yutori research get TASK_ID
# Usage
yutori usage # Show API usage statistics
Run yutori --help or yutori <command> --help for full option details.
Requirements
Examples
See examples/ for complete working examples, including a browser automation agent using the n1 API.
Contributing
See CONTRIBUTING.md for development setup and guidelines.
License
This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file yutori-0.5.0.tar.gz.
File metadata
- Download URL: yutori-0.5.0.tar.gz
- Upload date:
- Size: 77.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9ab39145922ed9f4c3268b379b809d928c22f39fbe3fca187e651265e0e11ace
|
|
| MD5 |
32e9e204eadc4dc8bb156744398ed4fd
|
|
| BLAKE2b-256 |
5ba6f625e6540ba6e12cb467e953d2591031f3d54651cae9b88f5fc01d5b16da
|
File details
Details for the file yutori-0.5.0-py3-none-any.whl.
File metadata
- Download URL: yutori-0.5.0-py3-none-any.whl
- Upload date:
- Size: 67.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c8ad6c8f466540d5092295832966f350bcad63783f0546c308f3958f378204d7
|
|
| MD5 |
893dc0264e104ef56d58d5733748fc7f
|
|
| BLAKE2b-256 |
606aed6fc6b9d2622b1610a38027babbfe0a44b956aa10e508492d3d6faa2f59
|