Agentic Framework for Enterprise-Wide Execution with multi-LLM provider support, observability, and error tracking
Project description
by ShyftLabs
The agent runtime for builders who ship.
Build, run, and deploy reliable AI agents at enterprise scale — multi-LLM routing, persistent memory, MCP-native tools, durable workflows, and full observability, out of the box.
📖 Documentation · ⚡ Quick start · ⚙️ Configuration · 🧩 Components · 🧪 Examples · 🤝 Contributing
Continuum is a production-grade Python framework for building, orchestrating, and shipping autonomous AI agents at enterprise scale. It unifies a clean, typed agent core with cost-aware multi-model inference, stateful long- and short-term memory, open standards-based tool calling, durable execution, and end-to-end observability — all behind one small, composable, type-safe API.
✨ Features
- 🤖 Agentic core & orchestration — a strongly-typed agent primitive with full lifecycle hooks, schema-validated structured outputs, and nine composable multi-agent patterns (sequential, parallel, loop, routing, planning, reflection, debate, scatter, supervised).
- 🔀 Smart Inference — cost-aware inference routing that classifies every request by complexity and dispatches it to the cheapest capable model, with seamless cross-provider failover and zero lock-in.
- 🧠 Stateful memory — persistent semantic long-term recall plus low-latency working memory, with multi-tenant isolation scopes and built-in PII redaction for privacy-by-default agents.
- 🔌 Open tool calling — plug into any standards-based tool ecosystem (Model Context Protocol) across multiple transports, with fine-grained capability scoping, context capture/injection, and rich generative-UI artifacts.
- 🔁 Durable execution — long-running, crash- and restart-safe agent workflows with human-in-the-loop approval gates and exactly-once guarantees.
- 🔭 Full observability — first-class distributed tracing, token/latency/error telemetry, and one-line function instrumentation for complete run transparency.
- 🌐 Model-agnostic — target frontier or open-weight models through a single model string; swap providers without touching agent code.
- 🤝 Multi-agent handoffs — context-preserving agent-to-agent delegation with history summarization, cycle detection, and depth control.
- 📡 Real-time streaming — token-, tool-, handoff-, and memory-level events streamed the moment they happen.
- ✅ Built-in evaluation — turn live production traces into golden datasets and regression-test agent quality with standard LLM-evaluation metrics.
🚀 Quick start
Requirements: Python 3.13+ and Docker (for Redis · Milvus/Qdrant · Langfuse).
git clone https://github.com/shyftlabs/continuum.git
cd continuum
python3.13 -m venv .venv && source .venv/bin/activate
pip install -e .
cp .env.template .env # add your provider key(s) — see Configuration below
docker compose up -d # Redis · Milvus/Qdrant · Langfuse
Your first agent:
import asyncio
from continuum.agent import BaseAgent, AgentRunner
async def main():
agent = BaseAgent(
name="hello-agent",
instructions="You are a friendly assistant.",
model="gpt-4o-mini",
)
runner = AgentRunner()
response = await runner.run(agent, "Hi!")
print(response.content)
asyncio.run(main())
AgentRunner.run() returns an AgentResponse with content, structured_output, usage, tool_calls, run_artifacts, latency_ms, and the full handoff chain. See the docs for streaming, tools/MCP, memory, handoffs, and workflows.
⚙️ Configuring Continuum
Continuum is configured through environment variables (copy .env.template → .env). Set keys only for the providers and components you use — everything else has sensible defaults. The most common settings:
LLM providers & routing
| Variable | Description | Example |
|---|---|---|
OPENAI_API_KEY / ANTHROPIC_API_KEY / GEMINI_API_KEY |
Provider API keys — set the one(s) you use | sk-… |
DEFAULT_LLM_MODEL |
Default model (provider/model, or bare name for OpenAI) |
gemini/gemini-2.5-flash |
FALLBACK_LLM_MODEL |
Model used if the default fails | gpt-4o-mini |
LLM_ENABLE_FALLBACK |
Automatically fall back on provider errors | true |
SMART_LAYER_ENABLED |
Enable cost-aware tier routing (Smart Inference) | true |
Memory (long-term) & embeddings
| Variable | Description | Example |
|---|---|---|
MEMORY_ENABLED |
Enable mem0-backed long-term memory | true |
VECTOR_STORE_PROVIDER |
Vector store backend | qdrant / milvus |
EMBEDDER_PROVIDER / EMBEDDER_MODEL |
Embedding provider & model | openai / text-embedding-3-small |
MEMORY_ISOLATION |
Scope of memory isolation | user / agent / run / shared |
Sessions (short-term)
| Variable | Description | Example |
|---|---|---|
SESSION_ENABLED |
Enable Redis-backed conversation sessions | true |
SESSION_REDIS_HOST / SESSION_REDIS_PORT |
Redis connection | localhost / 6380 |
SESSION_TTL_SECONDS |
Session lifetime | 172800 |
Observability (Langfuse)
| Variable | Description | Example |
|---|---|---|
LANGFUSE_ENABLED |
Enable tracing | true |
LANGFUSE_PUBLIC_KEY / LANGFUSE_SECRET_KEY |
Langfuse credentials | pk-… / sk-… |
LANGFUSE_HOST |
Langfuse endpoint | http://localhost:3000 |
Temporal (optional, durable workflows)
| Variable | Description | Example |
|---|---|---|
TEMPORAL_ENABLED |
Enable durable workflow orchestration | false |
TEMPORAL_HOST |
Temporal frontend | localhost:7233 |
Optional extras:
pip install -e ".[temporal]"for Temporal,".[eval]"for evaluation,".[embeddings]"for local embeddings. See.env.templatefor the complete, annotated reference.
🧩 Components
| Component | What it does |
|---|---|
| Agents | BaseAgent + AgentRunner — config, hooks, structured outputs, ReAct |
| Workflows | Nine multi-agent patterns for chaining, branching, looping, and self-improvement |
| Smart Inference | Request classifier + cost-aware model routing with fallback |
| Memory | mem0 + Qdrant/Milvus (long-term) · Redis (sessions) · multi-tenant scopes |
| Tools / MCP | MCP servers over Stdio/SSE/StreamableHTTP, tool filtering, widget artifacts |
| Temporal | Durable, restart-safe workflows with human-in-the-loop gates |
| Observability | Langfuse traces, metrics, @observe decorators |
| Evaluation | Golden datasets + DeepEval / RAGAS metrics |
📚 Documentation
Full documentation lives at docs.continuum.shyftlabs.io — guides for building & running agents, Smart Inference, memory, tools/MCP, workflows, handoffs, streaming, evaluation, and the research behind it.
Markdown sources are also in docs/ if you prefer reading on GitHub — e.g. agent.md, memory.md, tools.md, and the integration GUIDE.md.
🧪 Examples
Runnable demos live under playground/:
gateway-local-shop— an MCP server + agent + chat UI for a pet-shop assistant (end-to-end: server → agent → UI).gateway-multi-agent-shop— a multi-agent workflow variant with routing and handoffs.frontend/— the demo web UIs (assortment,commerce-chat).
🤝 Contributing
Contributions are welcome! Please read CONTRIBUTING.md for the branch model, Conventional Commits, DCO sign-off, and local setup. By participating you agree to our Code of Conduct.
- 🐛 Bugs & features — use the issue templates
- 💬 Questions & ideas — GitHub Discussions
- 🔒 Security — report privately via
SECURITY.md, never a public issue
📄 License
Licensed under the Apache License, Version 2.0. Copyright © 2025–2026 ShyftLabs Inc.
For commercial / enterprise inquiries — SLAs, indemnification, hosted offerings, custom features — contact continuum@shyftlabs.io.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file shyftlabs_continuum-0.2.2.tar.gz.
File metadata
- Download URL: shyftlabs_continuum-0.2.2.tar.gz
- Upload date:
- Size: 341.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3f30414274f060bf44995bd4a156f088a3d53197160cf43a7be71411aeb8d46d
|
|
| MD5 |
23f1c37a8b2ef11dfce25e6fad6436e3
|
|
| BLAKE2b-256 |
731a2433da56e8918bfaaaf886e9c0a07d3db54fa41573107d73a2ed1a2db642
|
File details
Details for the file shyftlabs_continuum-0.2.2-py3-none-any.whl.
File metadata
- Download URL: shyftlabs_continuum-0.2.2-py3-none-any.whl
- Upload date:
- Size: 420.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e2cc85ed574df849f68cefc0073fdc87570d0fd0d0fd13b82cb3bf63a6a4c620
|
|
| MD5 |
b59ef823bf1863c19616f681843d2731
|
|
| BLAKE2b-256 |
46dc26ec7a46de0af402a7cd8f523bfe2ecf1e4205c4bceb83c2a033b07bac04
|