Python SDK for building AI agents with multi-LLM support, streaming, and production-ready infrastructure
Project description
Spaik SDK
Python SDK for building AI agents with multi-LLM support, streaming, and production infrastructure.
Spaik SDK is an open-source project developed by engineers at Siili Solutions Oyj. This is not an official Siili product.
Installation
pip install spaik-sdk
Quick Start
from spaik_sdk.agent.base_agent import BaseAgent
class MyAgent(BaseAgent):
pass
agent = MyAgent(system_prompt="You are a helpful assistant.")
print(agent.get_response_text("Hello!"))
Features
- Multi-LLM Support: OpenAI, Anthropic, Google, Azure AI Foundry, DeepSeek, Mistral, Meta Llama, Cohere, xAI (Grok), Moonshot (Kimi), Ollama
- Unified API: Same interface across all providers
- Streaming: Real-time response streaming via SSE
- Tools: Function calling with LangChain integration
- Subagents: Isolated nested agent execution with
spawn() - Tracing: Configurable trace persistence with pluggable sinks
- Structured Output: Pydantic model responses
- Server: FastAPI with thread persistence, auth, file uploads
- Audio: Text-to-speech and speech-to-text
- Cost Tracking: Token usage and cost estimation
Agent API
Basic Response Methods
from spaik_sdk.agent.base_agent import BaseAgent
from spaik_sdk.models.model_registry import ModelRegistry
agent = MyAgent(
system_prompt="You are helpful.",
llm_model=ModelRegistry.CLAUDE_4_SONNET
)
# Sync - text only
text = agent.get_response_text("Hello")
# Sync - full message with blocks
message = agent.get_response("Hello")
print(message.get_text_content())
# Async
message = await agent.get_response_async("Hello")
Streaming
# Token stream
async for chunk in agent.get_response_stream("Write a story"):
print(chunk, end="", flush=True)
# Event stream (for SSE)
async for event in agent.get_event_stream("Write a story"):
if event.get_event_type() == "StreamingUpdated":
print(event.content, end="")
Structured Output
from pydantic import BaseModel
class Recipe(BaseModel):
name: str
ingredients: list[str]
steps: list[str]
recipe = agent.get_structured_response("Give me a pasta recipe", Recipe)
print(recipe.name)
Interactive CLI
agent.run_cli() # Starts interactive chat in terminal
LangGraph Interop
Expose the configured LangChain model and a ready-to-use LangGraph ReAct agent when you need to compose Spaik agents into custom LangChain or LangGraph workflows.
agent = MyAgent(system_prompt="You are helpful.")
llm = agent.get_langchain_model()
react_agent = agent.get_react_agent()
Tools
from spaik_sdk.tools.tool_provider import ToolProvider, BaseTool, tool
class WeatherTools(ToolProvider):
def get_tools(self) -> list[BaseTool]:
@tool
def get_weather(city: str) -> str:
"""Get current weather for a city."""
return f"Sunny, 22°C in {city}"
@tool
def get_forecast(city: str, days: int = 3) -> str:
"""Get weather forecast."""
return f"{days}-day forecast for {city}: Sunny"
return [get_weather, get_forecast]
class WeatherAgent(BaseAgent):
def get_tool_providers(self) -> list[ToolProvider]:
return [WeatherTools()]
agent = WeatherAgent(system_prompt="You provide weather info.")
print(agent.get_response_text("What's the weather in Tokyo?"))
For unusually long tool loops, set LLMConfig.max_agent_steps above the default 100.
Built-in Tool Providers
from spaik_sdk.tools.impl.search_tool_provider import SearchToolProvider
from spaik_sdk.tools.impl.mcp_tool_provider import MCPToolProvider
class MyAgent(BaseAgent):
def get_tool_providers(self):
return [
SearchToolProvider(), # Web search (Tavily)
MCPToolProvider(server), # MCP server tools
]
Controlling Tool-Call Replay in History
Each ToolProvider controls how its tool-use blocks are rendered back into model history when a thread is replayed. By default, the replay contains the tool name, call id, arguments, response, and error state.
Use persist_tool_block_history=False when a provider should replay only a name marker instead of full call details. This is useful for tools whose output is large, sensitive, or not useful to show the model again.
class SearchTools(ToolProvider):
def __init__(self) -> None:
super().__init__(persist_tool_block_history=False)
def get_tools(self) -> list[BaseTool]:
...
Override render_tool_block_for_history() when you need provider-specific history text.
from spaik_sdk.thread.models import MessageBlock
class SearchTools(ToolProvider):
def render_tool_block_for_history(self, block: MessageBlock) -> str:
return f'<search tool="{block.tool_name}" />'
Tool-use blocks persist the owning provider id on the thread, so after a thread reload the SDK rebinds the correct ToolProvider and applies the same history-rendering policy. Override get_provider_id() if a provider needs a stable id across package or class renames.
Subagents
To call one agent from inside another agent's tool, use spawn() instead of get_response(). This prevents LangChain's callback context from leaking into the subagent, which would otherwise cause the subagent's internal tool calls to appear in the parent thread.
class ResearchTools(ToolProvider):
def get_tools(self) -> list[BaseTool]:
@tool
def research(topic: str) -> str:
"""Delegate a research task to a specialist subagent."""
sub = ResearchAgent(system_prompt="You are a research specialist.")
return sub.spawn(topic).get_text_content()
return [research]
For cases where you need to isolate an arbitrary coroutine rather than a full agent call, use the static BaseAgent.run_isolated(coro) helper directly.
Tracing
Every agent run produces an AgentTrace. Configure trace persistence with the trace_sink constructor argument or the TRACE_SINK_MODE environment variable.
from spaik_sdk.tracing.local_trace_sink import LocalTraceSink
from spaik_sdk.tracing.noop_trace_sink import NoOpTraceSink
# Disable trace persistence
agent = MyAgent(system_prompt="...", trace_sink=NoOpTraceSink())
# Write traces to a custom directory
agent = MyAgent(system_prompt="...", trace_sink=LocalTraceSink(traces_dir="./custom-traces"))
TRACE_SINK_MODE=local forces local file traces and TRACE_SINK_MODE=noop disables trace persistence. Implement TraceSink for remote or database-backed trace storage.
Models
from spaik_sdk.models.model_registry import ModelRegistry
# Anthropic
ModelRegistry.CLAUDE_4_SONNET
ModelRegistry.CLAUDE_4_OPUS
ModelRegistry.CLAUDE_4_5_SONNET
ModelRegistry.CLAUDE_4_5_OPUS
ModelRegistry.CLAUDE_4_6_SONNET
ModelRegistry.CLAUDE_4_6_OPUS
ModelRegistry.CLAUDE_4_7_OPUS
# OpenAI
ModelRegistry.GPT_4_1
ModelRegistry.GPT_4O
ModelRegistry.O4_MINI
ModelRegistry.GPT_5_4
ModelRegistry.GPT_5_4_NANO
ModelRegistry.GPT_5_5
# Google
ModelRegistry.GEMINI_2_5_FLASH
ModelRegistry.GEMINI_2_5_PRO
ModelRegistry.GEMINI_3_1_PRO
# Azure AI Foundry and other provider families
ModelRegistry.DEEPSEEK_V3_2
ModelRegistry.MISTRAL_LARGE_3
ModelRegistry.LLAMA_4_MAVERICK
ModelRegistry.COHERE_COMMAND_A
ModelRegistry.GROK_4
ModelRegistry.KIMI_K2_THINKING
# Aliases
ModelRegistry.from_name("sonnet") # CLAUDE_4_6_SONNET
ModelRegistry.from_name("gpt 4.1") # GPT_4_1
ModelRegistry.from_name("opus") # CLAUDE_4_7_OPUS
# Custom model
from spaik_sdk.models.llm_model import LLMModel
from spaik_sdk.models.llm_families import LLMFamilies
custom = LLMModel(
family=LLMFamilies.OPENAI,
name="gpt-4-custom",
reasoning=False
)
ModelRegistry.register_custom(custom)
FastAPI Server
from contextlib import asynccontextmanager
from fastapi import FastAPI
from fastapi.middleware.cors import CORSMiddleware
from spaik_sdk.agent.base_agent import BaseAgent
from spaik_sdk.server.api.routers.api_builder import ApiBuilder
class MyAgent(BaseAgent):
pass
@asynccontextmanager
async def lifespan(app: FastAPI):
agent = MyAgent(system_prompt="You are helpful.")
api_builder = ApiBuilder.local(agent=agent)
app.include_router(api_builder.build_thread_router())
app.include_router(api_builder.build_file_router())
app.include_router(api_builder.build_audio_router())
yield
app = FastAPI(lifespan=lifespan)
app.add_middleware(
CORSMiddleware,
allow_origins=["*"],
allow_methods=["*"],
allow_headers=["*"],
)
API Endpoints
Thread management:
POST /threads- Create threadGET /threads- List threadsGET /threads/{id}- Get thread with messagesPOST /threads/{id}/messages/stream- Send message (SSE)DELETE /threads/{id}- Delete threadPOST /threads/{id}/cancel- Cancel generation
Files:
POST /files- Upload fileGET /files/{id}- Download file
Audio:
POST /audio/speech- Text to speechPOST /audio/transcribe- Speech to text
Production Setup
from spaik_sdk.server.storage.impl.local_file_thread_repository import LocalFileThreadRepository
from spaik_sdk.server.authorization.base_authorizer import BaseAuthorizer
# Custom repository and auth
api_builder = ApiBuilder.stateful(
repository=LocalFileThreadRepository(base_path="./data"),
authorizer=MyAuthorizer(),
agent=agent,
)
Long-running streams are checkpointed incrementally. The built-in response generator calls update_thread after each ToolResponseReceivedEvent and MessageFullyAddedEvent, so completed tool results and messages survive crashes or restarts during a run.
Orchestration
Code-first workflow orchestration without graph DSLs:
from spaik_sdk.orchestration import BaseOrchestrator, OrchestratorEvent
from dataclasses import dataclass
from typing import AsyncIterator
@dataclass
class State:
items: list[str]
@dataclass
class Result:
count: int
class MyOrchestrator(BaseOrchestrator[State, Result]):
async def run(self) -> AsyncIterator[OrchestratorEvent[Result]]:
state = State(items=[])
# Run step with automatic status events
async for event in self.step("fetch", "Fetching data", self.fetch, state):
yield event
if event.result:
state = event.result
# Progress updates
for i, item in enumerate(state.items):
yield self.progress("process", i + 1, len(state.items))
await self.process(item)
yield self.ok(Result(count=len(state.items)))
async def fetch(self, state: State) -> State:
return State(items=["a", "b", "c"])
async def process(self, item: str):
pass
# Run
orchestrator = MyOrchestrator()
result = orchestrator.run_sync()
Configuration
Environment variables:
# Direct mode: at least one LLM provider API key required
ANTHROPIC_API_KEY=sk-ant-...
OPENAI_API_KEY=sk-...
GOOGLE_API_KEY=...
# Optional
AZURE_API_KEY=...
AZURE_ENDPOINT=https://your-resource.openai.azure.com/
DEFAULT_MODEL=claude-sonnet-4-20250514
Azure Authentication
Azure uses API key authentication from env vars by default:
MODEL_PROVIDER=azure
AZURE_API_KEY=...
AZURE_API_VERSION=2025-04-01-preview
AZURE_ENDPOINT=https://your-resource.openai.azure.com/
AZURE_GPT_4O_DEPLOYMENT=your-deployment-name
For Microsoft Entra ID or custom auth, pass an AzureProvider to LLMConfig:
pip install azure-identity
from azure.identity import DefaultAzureCredential, get_bearer_token_provider
from spaik_sdk.models.llm_config import LLMConfig
from spaik_sdk.models.model_registry import ModelRegistry
from spaik_sdk.models.providers.azure_provider import AzureProvider
token_provider = get_bearer_token_provider(
DefaultAzureCredential(),
"https://cognitiveservices.azure.com/.default",
)
llm_config = LLMConfig(
model=ModelRegistry.GPT_4O,
provider=AzureProvider(azure_ad_token_provider=token_provider),
)
Proxy Mode
Set LLM_AUTH_MODE=proxy to route every provider through a single proxy endpoint such as LiteLLM or an internal gateway. Provider API keys are not required in this mode.
| Env Variable | Description |
|---|---|
LLM_AUTH_MODE |
direct (default) or proxy |
LLM_PROXY_BASE_URL |
Proxy endpoint URL |
LLM_PROXY_API_KEY |
Auth key sent to the proxy |
LLM_PROXY_HEADERS |
Extra headers, comma-separated Key:Value pairs |
Development
# Setup
uv sync
# Tests
make test # All
make test-unit # Unit only
make test-integration # Integration only
make test-unit-single PATTERN=name # Single test
# Quality
make lint # Check linting
make lint-fix # Fix linting
make typecheck # Type check
Message Structure
Messages contain blocks of different types:
from spaik_sdk.thread.models import MessageBlockType
# Block types
MessageBlockType.PLAIN # Regular text
MessageBlockType.REASONING # Chain of thought
MessageBlockType.TOOL_USE # Tool call
MessageBlockType.ERROR # Error message
License
MIT - Copyright (c) 2026 Siili Solutions Oyj
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file spaik_sdk-0.10.0.tar.gz.
File metadata
- Download URL: spaik_sdk-0.10.0.tar.gz
- Upload date:
- Size: 465.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.11.9 {"installer":{"name":"uv","version":"0.11.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3e641eaebaaf26620ef356567d2315f025d1c94b9ba412c3f816faa9f8e53238
|
|
| MD5 |
a2c43e71e26182158ab70b2bc0f49007
|
|
| BLAKE2b-256 |
eef0ec08fb94346495b294d45095b8543cdd3b07a5bc47dc935c195270e561cf
|
File details
Details for the file spaik_sdk-0.10.0-py3-none-any.whl.
File metadata
- Download URL: spaik_sdk-0.10.0-py3-none-any.whl
- Upload date:
- Size: 145.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.11.9 {"installer":{"name":"uv","version":"0.11.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8a753ecb2de9b4189ae1f9a499da123540bb45fd52335feb864ff6c07273fd37
|
|
| MD5 |
ac7005475b9e43493c07efb9c3ec185c
|
|
| BLAKE2b-256 |
346327951c6a09c566ef132efc36012ec9937aa3e5be1defd431a9c505fe9ae9
|