Superfast logprob-native agent runtime

These details have not been verified by PyPI

Project description

swiftagents

Superfast, logprob-native, async-first agent runtime.

Why swiftagents

Logprob-native routing and uncertainty
Tool-agnostic, model-agnostic (strict about logprobs)
Async-first with bounded speculation (max 2 tools)
Cost-aware, cacheable, and observable
Optional judge pipeline

Install

pip install swiftagents

Local development:

pip install -e .[dev]

Quickstart

import asyncio

from swiftagents.core import AgentRuntime, AgentConfig, MockModelClient, ToolRegistry, ToolSpec


def web_search(query: str) -> dict:
    # Put any code here: Pinecone, DB, APIs, etc.
    return {"snippet": "Example result"}


async def main():
    client = MockModelClient()
    client.queue_text("TOOL=WEB")
    client.queue_text("Answer using web evidence")

    spec = ToolSpec(
        name="WEB",
        description="Search the web",
        input_schema={"type": "object", "properties": {"query": {"type": "string"}}, "required": ["query"]},
        example_calls=[],
        cost_hint="medium",
        latency_hint_ms=200,
        side_effects=False,
        cacheable=True,
        cancellable=True,
    )

    tools = ToolRegistry()
    tools.register_function(web_search, spec)

    runtime = AgentRuntime(client=client, tools=tools, config=AgentConfig())
    result = await runtime.run("Find the latest overview")
    print(result.answer)


asyncio.run(main())

Core concepts

Model clients (logprobs required)

swiftagents requires token-level logprobs for routing. If a backend cannot provide them, it hard-errors.

Supported clients:

OpenAIChatCompletionsClient
VLLMOpenAICompatibleClient
MockModelClient (tests and examples)

Tools

Tools are async callables with a ToolSpec and return ToolResult.

from swiftagents.core import ToolSpec, ToolResult

class MyTool:
    def __init__(self):
        self.spec = ToolSpec(
            name="RAG",
            description="Retrieve from docs",
            input_schema={"type": "object", "properties": {"query": {"type": "string"}}, "required": ["query"]},
            example_calls=[],
            cost_hint="medium",
            latency_hint_ms=200,
            side_effects=False,
            cacheable=True,
            cancellable=True,
        )

    async def __call__(self, **kwargs):
        return ToolResult(ok=True, data={"docs": []}, error=None, metadata={})

Register functions directly (no tool classes)

Use any code inside a function (sync or async) and register it with a ToolSpec.

from swiftagents.core import ToolRegistry, ToolSpec, ToolResult

def pinecone_search(query: str) -> dict:
    # Put any code here (Pinecone, DB, API, etc.)
    # return raw data or ToolResult
    return {"matches": [{"id": "doc1", "score": 0.92}]}

spec = ToolSpec(
    name="PINECONE",
    description="Vector search over Pinecone",
    input_schema={"type": "object", "properties": {"query": {"type": "string"}}, "required": ["query"]},
    example_calls=[],
    cost_hint="medium",
    latency_hint_ms=200,
    side_effects=False,
    cacheable=True,
    cancellable=True,
)

registry = ToolRegistry()
registry.register_function(pinecone_search, spec)

If you prefer decorators:

from swiftagents.core import ToolRegistry, ToolSpec, tool

spec = ToolSpec(
    name="PINECONE",
    description="Vector search over Pinecone",
    input_schema={"type": "object", "properties": {"query": {"type": "string"}}, "required": ["query"]},
    example_calls=[],
    cost_hint="medium",
    latency_hint_ms=200,
    side_effects=False,
    cacheable=True,
    cancellable=True,
)

@tool(spec)
async def pinecone_tool(query: str):
    return {"matches": [{"id": "doc1", "score": 0.92}]}

registry = ToolRegistry()
registry.register(pinecone_tool)

Routing (logprob-gated)

Routing prompts the same LLM to output TOOL=<LABEL> where LABEL is NONE or a shortlist tool name. Confidence is computed using logprobs, entropy, and margin. Low confidence triggers bounded speculation.

Multi-tool routing modes

AgentConfig.multi_tool_mode controls how the runtime selects multiple tools:

single: default single-label routing (bounded speculation when uncertain).
multi_label: pick multiple tools from one router call using logprob thresholds.
multi_intent: lightweight heuristic splitting, then route each segment.
decompose: logprob-gated split decision + LLM decomposition into sub-questions, then route each.

All multi-tool modes merge tool evidence and produce one final answer.

Judge

Judge behaves like a tool. It can be disabled, run a cheap LLM, and optionally escalate to a stronger LLM. It can also run deterministic stage0 checks.

Caching and observability

Tool and model decision caches with TTL
Structured trace events
Token usage metrics and wasted work ratio

Examples

python -m swiftagents.examples.tool_selection
python -m swiftagents.examples.speculative_execution_demo
python -m swiftagents.examples.function_tool_demo

Benchmarks

python -m swiftagents.benchmarks.run_benchmark

Tests

pytest

Design notes

The router is logprob-native; labels should be compact and stable (prefer short uppercase names).
Speculation is bounded to two tools and never speculative for side-effecting tools unless explicitly allowed.
All runtime stages are async-first.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.5

Mar 27, 2026

0.1.4

Mar 27, 2026

0.1.3

Feb 18, 2026

0.1.2

Feb 18, 2026

0.1.1

Feb 18, 2026

0.1.0

Jan 18, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swiftagents-0.1.5.tar.gz (28.1 kB view details)

Uploaded Mar 27, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

swiftagents-0.1.5-py3-none-any.whl (33.5 kB view details)

Uploaded Mar 27, 2026 Python 3

File details

Details for the file swiftagents-0.1.5.tar.gz.

File metadata

Download URL: swiftagents-0.1.5.tar.gz
Upload date: Mar 27, 2026
Size: 28.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.20

File hashes

Hashes for swiftagents-0.1.5.tar.gz
Algorithm	Hash digest
SHA256	`797b2f4038dede0df2f4b8c388b1262234d461e7596ead75d793af7ef88c0718`
MD5	`bdc5bf5d1e9c7580a6719be0319eaac5`
BLAKE2b-256	`79dfd90f0d27d50c3696a2e2df3593a54b8477ceadba8ad0aba69855f414b2fc`

See more details on using hashes here.

File details

Details for the file swiftagents-0.1.5-py3-none-any.whl.

File metadata

Download URL: swiftagents-0.1.5-py3-none-any.whl
Upload date: Mar 27, 2026
Size: 33.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.20

File hashes

Hashes for swiftagents-0.1.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a5b5b9e6bc9770ced491d0b4627797ee9b8f909212c12e9c64d3c73ae3955d7c`
MD5	`da5f4caaa1591e048ca47cccf58fd3b1`
BLAKE2b-256	`ba058aa472b562f55e4cdbbef49f55091728b4b386bc9a74e04230136069a22d`

See more details on using hashes here.

swiftagents 0.1.5

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

swiftagents

Why swiftagents

Install

Quickstart

Core concepts

Model clients (logprobs required)

Tools

Register functions directly (no tool classes)

Routing (logprob-gated)

Multi-tool routing modes

Judge

Caching and observability

Examples

Benchmarks

Tests

Design notes

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes