superserve

Run agentic workloads on Ray

These details have not been verified by PyPI

Project links

Project description

Scalable runtime for Agents, MCP Servers, and coding sandboxes, orchestrated with Ray.

Features

Distributed Runtime - Resource-aware tool execution on Ray clusters with automatic scaling
Framework Agnostic - Works with LangChain, Pydantic AI, Agno, or pure Python
Secure Sandboxing - gVisor-sandboxed environments for AI-generated code execution
Simple CLI - Initialize projects, create agents, and serve with single commands
Production Ready - Built on Ray Serve for reliable, scalable deployments

Quick Start

# Install
pip install superserve

# Create a new project
superserve init my_project
cd my_project

# Create your first agent
superserve create-agent my_agent --framework pydantic

# Run locally
superserve up

Your agent is now available at http://localhost:8000/agents/my_agent/

Installation

pip install superserve

With optional sandbox support (for code execution):

pip install superserve[sandbox]

Requirements: Python 3.12+

Usage

CLI Commands

Initialize a Project

superserve init my_project

Creates a project structure with configuration files and an agents/ directory.

Create an Agent

superserve create-agent my-agent
superserve create-agent my-agent --framework langchain
superserve create-agent my-agent --framework pydantic

Supported frameworks: python (default), langchain, pydantic

Run Agents

superserve up                             # Run all agents on port 8000
superserve up --port 9000                 # Custom port
superserve up --agents agent1,agent2      # Run specific agents

Creating Your First Agent

After running superserve create-agent my_agent --framework pydantic, edit agents/my_agent/agent.py:

import superserve
from pydantic_ai import Agent

@superserve.tool(num_cpus=1)
def search(query: str) -> str:
    """Search for information."""
    return f"Results for: {query}"

# Create Pydantic AI agent with Ray-distributed tools
def make_agent():
    return Agent(
        "openai:gpt-4o-mini",
        system_prompt="You are a helpful assistant.",
        tools=[search],
    )

# Serve the agent
superserve.serve(make_agent, name="my_agent", num_cpus=1, memory="2GB")

Run with:

superserve up

Test your agent:

curl -X POST http://localhost:8000/agents/my_agent/ \
  -H "Content-Type: application/json" \
  -d '{"query": "Hello!"}'

API Reference

`@superserve.tool` Decorator

Creates a Ray-distributed async tool from a function. Works as both a decorator and wrapper for framework tools.

import superserve

# As decorator (uses defaults: num_cpus=1, num_gpus=0)
@superserve.tool
def search(query: str) -> str:
    """Search for information."""
    return f"Results for: {query}"

# As decorator with explicit resources
@superserve.tool(num_cpus=2, memory="4GB")
def expensive_task(data: str) -> str:
    return process(data)

# As wrapper for framework tools
from langchain_community.tools import DuckDuckGoSearchRun
lc_search = superserve.tool(DuckDuckGoSearchRun())

Parameter	Type	Default	Description
`num_cpus`	int/float	1	CPU cores per invocation
`num_gpus`	int/float	0	GPUs per invocation
`memory`	str	None	Memory requirement (e.g., "1GB")

`superserve.serve()`

Serve an agent via HTTP with Ray Serve. Auto-detects framework (Pydantic AI, LangChain, custom).

import superserve
from pydantic_ai import Agent

def make_agent():
    return Agent("openai:gpt-4", tools=[search])

superserve.serve(make_agent, name="myagent", num_cpus=1, memory="2GB")

Parameter	Type	Default	Description
`agent`	Any	required	Agent class, function, or instance
`name`	str	None	Agent name (inferred from directory if not set)
`port`	int	8000	HTTP port
`num_cpus`	int	1	CPU cores per replica
`num_gpus`	int	0	GPUs per replica
`memory`	str	"2GB"	Memory allocation
`replicas`	int	1	Number of replicas

`superserve.Agent` Base Class

For custom agents without a framework:

from superserve import Agent

class MyAgent(Agent):
    tools = [search, analyze]

    async def run(self, query: str) -> str:
        result = await self.call_tool("search", query=query)
        return f"Found: {result}"

superserve.serve(MyAgent, name="myagent")

`execute_tools`

Execute multiple tools in parallel on Ray:

import superserve
from superserve import execute_tools

@superserve.tool(num_cpus=1)
def tool_1(x: str) -> str:
    """Process input with tool 1."""
    return process_1(x)

@superserve.tool(num_cpus=1)
def tool_2(x: str) -> dict:
    """Process input with tool 2."""
    return process_2(x)

# Execute both tools in parallel on Ray
results = execute_tools([
    (tool_1, {"x": "input_1"}),
    (tool_2, {"x": "input_2"})
], parallel=True)

Examples

See the examples/ directory for complete implementations:

Token-Efficient Agent - Autonomous code execution in sandboxed environments
Finance Agent - Multi-step financial analysis with external APIs

Telemetry

Superserve collects anonymous usage data to help improve the CLI:

Commands run (init, create-agent, up)
Framework choices (python, langchain, pydantic)

No PII data, project information, or code is collected by telemetry without your explicit approval.

To opt out:

superserve analytics off

Contributing

Contributions are welcome! See CONTRIBUTING.md for guidelines.

License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

If you find this project helpful, please consider giving it a star!

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.7.3

May 28, 2026

0.7.2

May 28, 2026

0.7.1

May 5, 2026

0.7.0

Apr 29, 2026

0.6.0

Apr 23, 2026

0.5.0

Apr 16, 2026

0.4.2

Apr 16, 2026

0.4.1

Apr 16, 2026

0.1.5

Feb 24, 2026

0.1.4

Feb 24, 2026

0.1.3

Feb 17, 2026

This version

0.1.2

Jan 31, 2026

0.1.1

Jan 31, 2026

0.1.0

Jan 31, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

superserve-0.1.2.tar.gz (83.7 kB view details)

Uploaded Jan 31, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

superserve-0.1.2-py3-none-any.whl (88.0 kB view details)

Uploaded Jan 31, 2026 Python 3

File details

Details for the file superserve-0.1.2.tar.gz.

File metadata

Download URL: superserve-0.1.2.tar.gz
Upload date: Jan 31, 2026
Size: 83.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.9

File hashes

Hashes for superserve-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`a1a60a9f8f759103920b894c5c7c77f1206a0579a5a91b21da917a83522b1cd6`
MD5	`fb65cfd6b95ab26c8993703a77954bfd`
BLAKE2b-256	`d8fcaceb15a6d59929e20250b1bf158bf52935e37fa31b8c4fded6388d3b9348`

See more details on using hashes here.

File details

Details for the file superserve-0.1.2-py3-none-any.whl.

File metadata

Download URL: superserve-0.1.2-py3-none-any.whl
Upload date: Jan 31, 2026
Size: 88.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.9

File hashes

Hashes for superserve-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`21c1ae170fc2ce7bd5a0b91807226812b34f122ba04a98f825b9c07dec0ab8c6`
MD5	`ec385e72a563db5aae74ff7dc1be4e5d`
BLAKE2b-256	`05fe49609a364b80ea3358e9c398bf464b327acbc7a78cf02d0187e1f2644cdd`

See more details on using hashes here.

superserve 0.1.2

Navigation

Verified details

Owner

Unverified details

Project links

Meta

Project description

Features

Quick Start

Installation

Usage

CLI Commands

Initialize a Project

Create an Agent

Run Agents

Creating Your First Agent

API Reference

@superserve.tool Decorator

superserve.serve()

superserve.Agent Base Class

execute_tools

Examples

Telemetry

Contributing

License

Project details

Verified details

Owner

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`@superserve.tool` Decorator

`superserve.serve()`

`superserve.Agent` Base Class

`execute_tools`