Skip to main content

Turn AI agents into microservices.

Project description

ServeAI

Turn AI agents into microservices. Deploy, compose, and scale.

One decorator to make any AI agent a production-ready microservice with REST API, health checks, and auto-generated docs.

PyPI version License: MIT Python 3.10+


Why ServeAI?

Building AI agents is easy. Deploying them as reliable, scalable services is hard. ServeAI bridges that gap:

  • One decorator@agent_service turns any function into a FastAPI microservice
  • Auto-generated API — REST endpoints, OpenAPI docs, health checks out of the box
  • Pipeline composition — Chain agents with >> operator: researcher >> writer >> reviewer
  • Service discovery — Agents register themselves and find each other
  • Message bus — Async communication between agents (in-memory or Redis)
  • Typed contracts — Pydantic-based input/output validation between services
  • Built-in dashboard — Web UI to monitor your agent mesh
  • Scale independently — Each agent runs as its own process, scale what you need

Installation

pip install serveai

# With Redis support for production message bus
pip install serveai[redis]

Quick Start

1. Create an Agent Service

from serveai import agent_service

@agent_service(name="researcher", port=8001)
async def research(topic: str) -> dict:
    """Research agent — runs as its own microservice."""
    # Your LLM logic here
    return {"findings": f"Key insights about {topic}...", "sources": 3}

# Start the service
research.serve()
# → API running at http://localhost:8001
# → Docs at http://localhost:8001/docs

2. Call It from Anywhere

from serveai import AgentClient

client = AgentClient("http://localhost:8001")
result = await client.run(topic="quantum computing")
print(result.output)
# → {"findings": "Key insights about quantum computing...", "sources": 3}

3. Chain Agents into Pipelines

from serveai import agent_service

@agent_service(name="researcher", port=8001)
async def researcher(topic: str) -> dict:
    return {"findings": f"Research on {topic}..."}

@agent_service(name="writer", port=8002)
async def writer(findings: str) -> dict:
    return {"article": f"Blog post based on: {findings}"}

@agent_service(name="reviewer", port=8003)
async def reviewer(article: str) -> str:
    return f"Reviewed and approved: {article[:100]}..."

# Compose with >> operator
pipeline = researcher >> writer >> reviewer
result = await pipeline.run(topic="AI agents")
print(result.output)

4. Service Discovery

from serveai import ServiceRegistry

registry = ServiceRegistry()

# Services register themselves
registry.register("researcher", "http://localhost:8001", port=8001, tags=["research"])
registry.register("writer", "http://localhost:8002", port=8002, tags=["content"])

# Discover by name
service = registry.get("researcher")
print(service.url)  # → http://localhost:8001

# Discover by tag
content_services = registry.find_by_tag("content")

5. Message Bus

from serveai.bus import InMemoryBus, Message

bus = InMemoryBus()

# Subscribe to events
async def on_research_done(msg: Message):
    print(f"Research complete: {msg.payload}")

await bus.subscribe("research-done", on_research_done)

# Publish events
await bus.publish("research-done", Message.create(
    source="researcher",
    target="writer",
    payload={"findings": "..."},
))

6. Dashboard

from serveai.dashboard import build_dashboard_app
from serveai import ServiceRegistry
import uvicorn

registry = ServiceRegistry()
registry.register("researcher", "http://localhost:8001", port=8001)
registry.register("writer", "http://localhost:8002", port=8002)

app = build_dashboard_app(registry, port=9000)
uvicorn.run(app, port=9000)
# → Dashboard at http://localhost:9000

7. Typed Contracts

from serveai import agent_service, InputSchema, OutputSchema, Contract

class ResearchInput(InputSchema):
    topic: str
    max_sources: int = 5

class ResearchOutput(OutputSchema):
    findings: str
    sources: list[str]

contract = Contract.from_types(
    name="research",
    input_type=ResearchInput,
    output_type=ResearchOutput,
)

# Contract validates data at runtime
contract.validate_input({"topic": "AI"})  # OK
contract.validate_input({})  # Raises ValueError: Missing required field 'topic'

Architecture

serveai/
├── service.py          # @agent_service decorator & FastAPI service
├── client.py           # HTTP client for calling services
├── contracts.py        # Typed input/output contracts
├── dashboard.py        # Web UI for monitoring
├── router/
│   └── registry.py     # Service discovery & registration
├── bus/
│   ├── base.py         # Abstract message bus
│   ├── memory.py       # In-memory bus (dev/testing)
│   └── redis_bus.py    # Redis bus (production)
└── pipeline/
    └── builder.py      # Pipeline composition with >> operator

Roadmap

  • gRPC support alongside REST
  • Docker/Kubernetes deployment helpers
  • Circuit breaker pattern for resilience
  • Agent authentication and API keys
  • Webhook triggers and cron scheduling
  • Prometheus metrics export
  • CLI tool for managing agent services

Contributing

git clone https://github.com/abhishekjain/serveai.git
cd serveai
pip install -e ".[dev]"
pytest

License

MIT License — see LICENSE for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

agentbharat-0.1.0-py3-none-any.whl (19.2 kB view details)

Uploaded Python 3

File details

Details for the file agentbharat-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: agentbharat-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 19.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.6

File hashes

Hashes for agentbharat-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 3a4f93ec39614e9195fec4e69c6a2f1c36a6b543299f6c1664e4e5ccfa4be4b6
MD5 380eb652bc4b4fabd1bacd5a79a43ba3
BLAKE2b-256 275afb4ff0c853bd35c55770718494e4622cac9f664a2af232e38298293dce56

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page