Fan-Out/Fan-In agentic orchestration framework with multi-provider LLM support and MCP tool integration.
Project description
📖 Looking for the full docs, feature showcase, and use cases? Visit the Cortex landing page →
Cortex Agent FrameworkOne YAML file. One method call. A production AI agent.Stop rebuilding the same agent plumbing. Define it once, deploy anywhere, let it learn. |
Overview • Architecture • Features • Getting Started • Use Cases • Config • CLI • Deployment • FAQ
The problem
Every AI team builds the same stack: task decomposition, parallel tool execution, streaming, retries, session management, quality scoring, multi-provider routing, deployment. Most teams rebuild it two or three times before shipping.
Cortex is that stack. Pre-built. Battle-tested. Driven by config, not code.
3 commands. A running agent.
pip install cortex-agent-framework
cortex setup # visual wizard at localhost:7799
cortex publish ui # chat UI at localhost:8090
Or invoke the agent directly from the terminal:
cortex run "Research the latest vector DB benchmarks"
cortex chat # interactive REPL
You now have a working agent with a professional web interface, file upload support, streaming responses, and persistent chat history. No frontend to build. No backend to wire. No infrastructure to manage.
Define your agent in YAML. Run it in Python.
agent:
name: ResearchAgent
description: Searches the web and writes reports
llm_access:
default:
provider: anthropic
model: claude-sonnet-4-5
api_key_env_var: ANTHROPIC_API_KEY
task_types:
- name: web_research
capability_hint: web_search
output_format: md
- name: write_report
capability_hint: document_generation
depends_on: [web_research]
from cortex.framework import CortexFramework
framework = CortexFramework("cortex.yaml")
await framework.initialize()
result = await framework.run_session(
user_id="user_1",
request="Research the latest vector DB benchmarks and write a report",
)
print(result.response)
Fan-out, tool calls, dependency resolution, synthesis, validation — all handled.
Or define it in Python — code-first agents
Prefer code to config? Build the entire agent with CortexBuilder and wire in
code nodes — plain Python functions as graph nodes, LangGraph-style. No
YAML file, no decomposition LLM call: the graph runs exactly as you wrote it.
from cortex import CortexBuilder, CortexFramework
agent = CortexBuilder("ResearchAgent", "Searches the web and writes reports")
agent.llm("anthropic", model="claude-sonnet-4-5", api_key_env="ANTHROPIC_API_KEY")
agent.tool_server("brave", url="http://localhost:9000/sse")
@agent.node()
async def web_research(ctx):
return await ctx.call_tool("brave", "search", query=ctx.request)
@agent.node(depends_on=["web_research"])
async def write_report(ctx):
return await ctx.llm(f"Write a report from:\n{ctx.deps['web_research']}")
framework = CortexFramework(config=agent.build())
await framework.initialize()
result = await framework.run_session("user_1", "Research vector DB benchmarks")
print(result.response) # synthesised answer
print(result.node_outputs["write_report"]) # raw output of one node
Registering a code node flips the agent to static execution — the DAG you
declared is the plan. Each node receives a TaskContext with ctx.request,
ctx.deps (upstream node outputs), ctx.llm(), and ctx.call_tool(). You
still get the wave engine, validation gate, retries, streaming, and session
persistence — Cortex just skips the planner. Mix .task() (LLM-routed) and
.node() (Python) freely. See Getting Started.
Why teams choose Cortex
Skip months of framework engineering
The orchestration, parallelism, tool integration, streaming, retries, validation, session persistence, and deployment pipeline — it's all in the box. Your Python code stays a thin wrapper. The agent's behavior lives in cortex.yaml, versioned, diffable, reviewable.
Multi-agent composition for free
Any Cortex agent becomes an MCP server in one command. Other agents consume it as a tool. That's the entire inter-agent protocol — standard MCP, nothing custom.
Orchestrator → Research Agent (MCP :8081) → brave-search, wikipedia
→ Code Review Agent (MCP :8082) → github, filesystem
→ Writing Agent (MCP :8083) → document-gen
Each agent scales, deploys, and configures independently. Compose them by adding YAML lines.
Built-in quality gates
Every response passes through a Validation Agent that scores intent match, completeness, and coherence. You set a floor (default: 0.75); responses below it are flagged and remediated. Bad intermediate outputs are retried with feedback before the pipeline moves on.
Catch regressions before users do, not after.
Your agent gets smarter over time
The Learning Engine observes task patterns across sessions. When patterns recur, it stages a delta proposal — a concrete config change you review with cortex delta review and apply in one command. The agent doesn't silently drift; it surfaces what it learned and asks for approval.
What's in the box
| 8 cloud LLM providers + local | Anthropic, OpenAI, Gemini, Grok, Mistral, DeepSeek, AWS Bedrock, Azure AI — plus local Ollama / LM Studio / vLLM with a Gemma 4 quickstart |
| Code-first agents | Build the whole agent in Python with CortexBuilder; @node decorator turns plain functions into graph nodes — static DAG, no decomposition LLM call |
| Fan-out / fan-in | LLM-generated DAG with parallel execution; independent tasks run simultaneously |
| Intent Gate | Cheap heuristic + LLM cascade routes chat-shaped turns directly to a streaming reply; only task-shaped turns decompose |
interaction_mode |
One agent, two contracts: interactive for chat / CLI, rpc for MCP / automation — never blocks on clarifications |
| MCP-native tools | First-class SSE, stdio, and streamable-HTTP MCP tool servers |
| Multi-agent mesh | Publish any agent as an MCP server — compose specialist agents into an orchestrator |
| Ant Colony | Orchestrator self-spawns specialist Cortex agents as MCP servers at runtime; supervised, health-checked, auto-restarted |
| Identity & delegation | First-class Principal model — human, system, or agent-to-agent calls with full delegation chains in audit logs |
| Quality validation | Every response scored and gated; per-task validation inside the execution loop |
| Autonomic learning | Signal-gated end-of-session evolution — complexity + validation thresholds trigger staging; distinct-principal confirmation gates promotion |
| Blueprints | Reusable workflow knowledge loaded into context; refined automatically when a session clears the validation threshold, seeded as drafts the moment a new task pattern is staged |
| Streaming | Typed event classes (StatusEvent, ResultEvent, ClarificationEvent) for any UI |
| Smart synthesis | Keyword-grep excerpts + concurrent per-file LLM summaries; large results written to disk as a file ResultEvent |
| Native app control | Launch and drive desktop applications via AppleScript (macOS), PowerShell + UIA (Windows), or a screenshot vision loop — every mutating action gated by HITL |
| Built-in browser automation | Playwright MCP starts internally — chromium / firefox / webkit with persistent session state so logins survive runs. No tool-server wiring required |
| Polyglot code sandbox | # LANGUAGE: header dispatches to Python, Node, TypeScript, Deno, shell, Ruby, Go, Rust, C, Java, or Kotlin — with per-ecosystem package install (npm, gem, go get) |
| Per-task LLM routing | Route decomposition to a fast model, synthesis to flagship |
| Auto-tuned LLM concurrency | max_parallel_llm_calls derives from your provider+model (1 for local Ollama, 8 for Haiku, 4 for Opus) and self-tunes at runtime via AdaptiveLLMGate — no wizard knob to guess |
| Session persistence | Memory / SQLite / Redis with WAL replay and resumable sessions |
| Built-in chat UI | Web frontend with file uploads, streaming, and conversation history |
| 4 deploy targets | publish docker, publish package, publish mcp, publish ui |
| Visual setup wizard | Configure everything from a browser — cortex setup |
| Full CLI | cortex run, cortex chat, cortex sessions, cortex stats, cortex providers, cortex storage, cortex blueprints, cortex mcps — manage every aspect of the framework without writing Python |
| Security | Input sanitisation, credential scrubbing, sandboxed code execution, MCP output guard |
| Observability | OpenTelemetry, audit logs, anomaly detection, token budgets |
How Cortex compares
| Capability | Cortex | Typical agent frameworks |
|---|---|---|
| Configuration | Single cortex.yaml or a Python CortexBuilder |
Scattered code, env vars, multiple config files |
| Task orchestration | LLM-generated DAG or a hand-authored static DAG of code nodes | Sequential chain or hand-coded state machine |
| Tool protocol | Native MCP (SSE, stdio, streamable-HTTP) | Custom tool wrappers per integration |
| Multi-agent | Any agent becomes an MCP tool in one command | Bespoke inter-agent protocols |
| Quality gates | Built-in validation with scoring + remediation | Manual testing or nothing |
| Learning | Delta proposals + blueprints with human review | Prompt tweaking by hand |
| LLM providers | 8 built-in, swap via config | Usually 1-2, hard-coded |
| Deployment | 4 targets, one command each | Write your own Dockerfile |
Who is Cortex for?
| You are... | Cortex gives you... |
|---|---|
| Startup founder shipping an AI product | A production agent runtime in an afternoon — skip 3-6 months of plumbing |
| Platform team at a larger company | A governed agent runtime with audit trails, quality gates, and per-user isolation |
| Enterprise architect | Multi-agent meshes with independent scaling and compliance-friendly history encryption |
| Solo developer | Prototype to production with one YAML file — or skip YAML entirely and build the agent in Python |
| LangGraph / framework refugee | Code-first agents with @node functions and a static DAG — keep Cortex's wave engine, validation, and streaming |
| Researcher | Swap providers, models, and tools from config — run experiments without touching code |
| MLOps engineer | Validation scores, session replay, token accounting, and OpenTelemetry out of the box |
What Cortex is not
- Not a low-code builder. It's a Python library — drive it with
cortex.yamlor theCortexBuilderAPI. Either way, config replaces boilerplate, not your code. - Not an LLM gateway. Bring your own API key.
- Not a vector database. It calls MCP tools that do RAG — it doesn't implement retrieval itself.
- Not a web framework. Cortex runs inside FastAPI/Django/Flask/Click.
Documentation
| Document | Read this if you want to... |
|---|---|
| Overview | ...understand what Cortex is, who it's for, and why it exists |
| Architecture | ...see the internals: primary agent, task graph, MCP agents, validation |
| Features | ...scan the full feature matrix |
| Getting Started | ...build your first agent with working code |
| Use Cases | ...see real-world scenarios and reference architectures |
| Configuration | ...look up every cortex.yaml field |
| CLI Reference | ...look up every cortex subcommand |
| Deployment | ...ship to production |
| FAQ | ...find answers to common gotchas |
| Contributing | ...report bugs or submit PRs |
Community & support
- Issues: file bugs and feature requests on GitHub Issues
- Discussions: ask questions on GitHub Discussions
- Security: report vulnerabilities privately, not in public issues
License
MIT — see LICENSE. Use it commercially, fork it, ship it.
Define once. Deploy anywhere. Let it learn.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file cortex_agent_framework-1.5.0.tar.gz.
File metadata
- Download URL: cortex_agent_framework-1.5.0.tar.gz
- Upload date:
- Size: 712.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e798e01855ea2037d3f209c3b8ebd3d02666c0784707864b51335550e8aeae18
|
|
| MD5 |
755f6daa89fc1c7c13b6baafb4257f70
|
|
| BLAKE2b-256 |
7e5f1a5b45f80c4d3961e92d199fbab11e3357620dac194ba4a9badcd94744ea
|
Provenance
The following attestation bundles were made for cortex_agent_framework-1.5.0.tar.gz:
Publisher:
publish.yml on kritird/Cortex-Agent-Framework
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
cortex_agent_framework-1.5.0.tar.gz -
Subject digest:
e798e01855ea2037d3f209c3b8ebd3d02666c0784707864b51335550e8aeae18 - Sigstore transparency entry: 1588423197
- Sigstore integration time:
-
Permalink:
kritird/Cortex-Agent-Framework@7673f340b552448d21ad52b15386baa077a2fd3d -
Branch / Tag:
refs/tags/v1.5.0 - Owner: https://github.com/kritird
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@7673f340b552448d21ad52b15386baa077a2fd3d -
Trigger Event:
release
-
Statement type:
File details
Details for the file cortex_agent_framework-1.5.0-py3-none-any.whl.
File metadata
- Download URL: cortex_agent_framework-1.5.0-py3-none-any.whl
- Upload date:
- Size: 748.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
34a20f4ddec1e380e0f697038582603078599abca33ce0c9a4c902d5d32dcbc3
|
|
| MD5 |
d8365f414bb384b971329bafafe951d9
|
|
| BLAKE2b-256 |
32412e63ae1fdab5856b2a71f12c85e46ca4516cdfa59a2207a8536c651e3eb2
|
Provenance
The following attestation bundles were made for cortex_agent_framework-1.5.0-py3-none-any.whl:
Publisher:
publish.yml on kritird/Cortex-Agent-Framework
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
cortex_agent_framework-1.5.0-py3-none-any.whl -
Subject digest:
34a20f4ddec1e380e0f697038582603078599abca33ce0c9a4c902d5d32dcbc3 - Sigstore transparency entry: 1588423327
- Sigstore integration time:
-
Permalink:
kritird/Cortex-Agent-Framework@7673f340b552448d21ad52b15386baa077a2fd3d -
Branch / Tag:
refs/tags/v1.5.0 - Owner: https://github.com/kritird
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@7673f340b552448d21ad52b15386baa077a2fd3d -
Trigger Event:
release
-
Statement type: