Skip to main content

Fan-Out/Fan-In agentic orchestration framework with multi-provider LLM support and MCP tool integration.

Project description

Full docs & landing page

📖 Looking for the full docs, feature showcase, and use cases? Visit the Cortex landing page →


Cortex Agent Framework

Cortex Agent Framework

One YAML file. One method call. A production AI agent.
Stop rebuilding the same agent plumbing. Define it once, deploy anywhere, let it learn.

PyPI version License: MIT Python 3.11+ 8 LLM providers + local MCP native

OverviewArchitectureFeaturesGetting StartedUse CasesConfigCLIDeploymentFAQ


The problem

Every AI team builds the same stack: task decomposition, parallel tool execution, streaming, retries, session management, quality scoring, multi-provider routing, deployment. Most teams rebuild it two or three times before shipping.

Cortex is that stack. Pre-built. Battle-tested. Driven by config, not code.


3 commands. A running agent.

pip install cortex-agent-framework
cortex setup            # visual wizard at localhost:7799
cortex publish ui       # chat UI at localhost:8090

Or invoke the agent directly from the terminal:

cortex run "Research the latest vector DB benchmarks"
cortex chat                                            # interactive REPL

You now have a working agent with a professional web interface, file upload support, streaming responses, and persistent chat history. No frontend to build. No backend to wire. No infrastructure to manage.


Define your agent in YAML. Run it in Python.

agent:
  name: ResearchAgent
  description: Searches the web and writes reports

llm_access:
  default:
    provider: anthropic
    model: claude-sonnet-4-5
    api_key_env_var: ANTHROPIC_API_KEY

task_types:
  - name: web_research
    capability_hint: web_search
    output_format: md
  - name: write_report
    capability_hint: document_generation
    depends_on: [web_research]
from cortex.framework import CortexFramework

framework = CortexFramework("cortex.yaml")
await framework.initialize()

result = await framework.run_session(
    user_id="user_1",
    request="Research the latest vector DB benchmarks and write a report",
)
print(result.response)

Fan-out, tool calls, dependency resolution, synthesis, validation — all handled.


Or define it in Python — code-first agents

Prefer code to config? Build the entire agent with CortexBuilder and wire in code nodes — plain Python functions as graph nodes, LangGraph-style. No YAML file, no decomposition LLM call: the graph runs exactly as you wrote it.

from cortex import CortexBuilder, CortexFramework

agent = CortexBuilder("ResearchAgent", "Searches the web and writes reports")
agent.llm("anthropic", model="claude-sonnet-4-5", api_key_env="ANTHROPIC_API_KEY")
agent.tool_server("brave", url="http://localhost:9000/sse")

@agent.node()
async def web_research(ctx):
    return await ctx.call_tool("brave", "search", query=ctx.request)

@agent.node(depends_on=["web_research"])
async def write_report(ctx):
    return await ctx.llm(f"Write a report from:\n{ctx.deps['web_research']}")

framework = CortexFramework(config=agent.build())
await framework.initialize()

result = await framework.run_session("user_1", "Research vector DB benchmarks")
print(result.response)                       # synthesised answer
print(result.node_outputs["write_report"])   # raw output of one node

Registering a code node flips the agent to static execution — the DAG you declared is the plan. Each node receives a TaskContext with ctx.request, ctx.deps (upstream node outputs), ctx.llm(), and ctx.call_tool(). You still get the wave engine, validation gate, retries, streaming, and session persistence — Cortex just skips the planner. Mix .task() (LLM-routed) and .node() (Python) freely. See Getting Started.


Why teams choose Cortex

Skip months of framework engineering

The orchestration, parallelism, tool integration, streaming, retries, validation, session persistence, and deployment pipeline — it's all in the box. Your Python code stays a thin wrapper. The agent's behavior lives in cortex.yaml, versioned, diffable, reviewable.

Multi-agent composition for free

Any Cortex agent becomes an MCP server in one command. Other agents consume it as a tool. That's the entire inter-agent protocol — standard MCP, nothing custom.

Orchestrator → Research Agent (MCP :8081) → brave-search, wikipedia
             → Code Review Agent (MCP :8082) → github, filesystem
             → Writing Agent (MCP :8083) → document-gen

Each agent scales, deploys, and configures independently. Compose them by adding YAML lines.

Built-in quality gates

Every response passes through a Validation Agent that scores intent match, completeness, and coherence. You set a floor (default: 0.75); responses below it are flagged and remediated. Bad intermediate outputs are retried with feedback before the pipeline moves on.

Catch regressions before users do, not after.

Your agent gets smarter over time

The Learning Engine observes task patterns across sessions. When patterns recur, it stages a delta proposal — a concrete config change you review with cortex delta review and apply in one command. The agent doesn't silently drift; it surfaces what it learned and asks for approval.


What's in the box

8 cloud LLM providers + local Anthropic, OpenAI, Gemini, Grok, Mistral, DeepSeek, AWS Bedrock, Azure AI — plus local Ollama / LM Studio / vLLM with a Gemma 4 quickstart
Code-first agents Build the whole agent in Python with CortexBuilder; @node decorator turns plain functions into graph nodes — static DAG, no decomposition LLM call
Fan-out / fan-in LLM-generated DAG with parallel execution; independent tasks run simultaneously
Intent Gate Cheap heuristic + LLM cascade routes chat-shaped turns directly to a streaming reply; only task-shaped turns decompose
interaction_mode One agent, two contracts: interactive for chat / CLI, rpc for MCP / automation — never blocks on clarifications
MCP-native tools First-class SSE, stdio, and streamable-HTTP MCP tool servers
Multi-agent mesh Publish any agent as an MCP server — compose specialist agents into an orchestrator
Ant Colony Orchestrator self-spawns specialist Cortex agents as MCP servers at runtime; supervised, health-checked, auto-restarted
Identity & delegation First-class Principal model — human, system, or agent-to-agent calls with full delegation chains in audit logs
Quality validation Every response scored and gated; per-task validation inside the execution loop
Autonomic learning Signal-gated end-of-session evolution — complexity + validation thresholds trigger staging; distinct-principal confirmation gates promotion
Blueprints Reusable workflow knowledge loaded into context; refined automatically when a session clears the validation threshold, seeded as drafts the moment a new task pattern is staged
Streaming Typed event classes (StatusEvent, ResultEvent, ClarificationEvent) for any UI
Smart synthesis Keyword-grep excerpts + concurrent per-file LLM summaries; large results written to disk as a file ResultEvent
Native app control Launch and drive desktop applications via AppleScript (macOS), PowerShell + UIA (Windows), or a screenshot vision loop — every mutating action gated by HITL
Built-in browser automation Playwright MCP starts internally — chromium / firefox / webkit with persistent session state so logins survive runs. No tool-server wiring required
Polyglot code sandbox # LANGUAGE: header dispatches to Python, Node, TypeScript, Deno, shell, Ruby, Go, Rust, C, Java, or Kotlin — with per-ecosystem package install (npm, gem, go get)
Per-task LLM routing Route decomposition to a fast model, synthesis to flagship
Auto-tuned LLM concurrency max_parallel_llm_calls derives from your provider+model (1 for local Ollama, 8 for Haiku, 4 for Opus) and self-tunes at runtime via AdaptiveLLMGate — no wizard knob to guess
Session persistence Memory / SQLite / Redis with WAL replay and resumable sessions
Built-in chat UI Web frontend with file uploads, streaming, and conversation history
4 deploy targets publish docker, publish package, publish mcp, publish ui
Visual setup wizard Configure everything from a browser — cortex setup
Full CLI cortex run, cortex chat, cortex sessions, cortex stats, cortex providers, cortex storage, cortex blueprints, cortex mcps — manage every aspect of the framework without writing Python
Security Input sanitisation, credential scrubbing, sandboxed code execution, MCP output guard
Observability OpenTelemetry, audit logs, anomaly detection, token budgets

How Cortex compares

Capability Cortex Typical agent frameworks
Configuration Single cortex.yaml or a Python CortexBuilder Scattered code, env vars, multiple config files
Task orchestration LLM-generated DAG or a hand-authored static DAG of code nodes Sequential chain or hand-coded state machine
Tool protocol Native MCP (SSE, stdio, streamable-HTTP) Custom tool wrappers per integration
Multi-agent Any agent becomes an MCP tool in one command Bespoke inter-agent protocols
Quality gates Built-in validation with scoring + remediation Manual testing or nothing
Learning Delta proposals + blueprints with human review Prompt tweaking by hand
LLM providers 8 built-in, swap via config Usually 1-2, hard-coded
Deployment 4 targets, one command each Write your own Dockerfile

Who is Cortex for?

You are... Cortex gives you...
Startup founder shipping an AI product A production agent runtime in an afternoon — skip 3-6 months of plumbing
Platform team at a larger company A governed agent runtime with audit trails, quality gates, and per-user isolation
Enterprise architect Multi-agent meshes with independent scaling and compliance-friendly history encryption
Solo developer Prototype to production with one YAML file — or skip YAML entirely and build the agent in Python
LangGraph / framework refugee Code-first agents with @node functions and a static DAG — keep Cortex's wave engine, validation, and streaming
Researcher Swap providers, models, and tools from config — run experiments without touching code
MLOps engineer Validation scores, session replay, token accounting, and OpenTelemetry out of the box

What Cortex is not

  • Not a low-code builder. It's a Python library — drive it with cortex.yaml or the CortexBuilder API. Either way, config replaces boilerplate, not your code.
  • Not an LLM gateway. Bring your own API key.
  • Not a vector database. It calls MCP tools that do RAG — it doesn't implement retrieval itself.
  • Not a web framework. Cortex runs inside FastAPI/Django/Flask/Click.

Documentation

Document Read this if you want to...
Overview ...understand what Cortex is, who it's for, and why it exists
Architecture ...see the internals: primary agent, task graph, MCP agents, validation
Features ...scan the full feature matrix
Getting Started ...build your first agent with working code
Use Cases ...see real-world scenarios and reference architectures
Configuration ...look up every cortex.yaml field
CLI Reference ...look up every cortex subcommand
Deployment ...ship to production
FAQ ...find answers to common gotchas
Contributing ...report bugs or submit PRs

Community & support

  • Issues: file bugs and feature requests on GitHub Issues
  • Discussions: ask questions on GitHub Discussions
  • Security: report vulnerabilities privately, not in public issues

License

MIT — see LICENSE. Use it commercially, fork it, ship it.

Define once. Deploy anywhere. Let it learn.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cortex_agent_framework-1.5.0.tar.gz (712.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cortex_agent_framework-1.5.0-py3-none-any.whl (748.3 kB view details)

Uploaded Python 3

File details

Details for the file cortex_agent_framework-1.5.0.tar.gz.

File metadata

  • Download URL: cortex_agent_framework-1.5.0.tar.gz
  • Upload date:
  • Size: 712.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for cortex_agent_framework-1.5.0.tar.gz
Algorithm Hash digest
SHA256 e798e01855ea2037d3f209c3b8ebd3d02666c0784707864b51335550e8aeae18
MD5 755f6daa89fc1c7c13b6baafb4257f70
BLAKE2b-256 7e5f1a5b45f80c4d3961e92d199fbab11e3357620dac194ba4a9badcd94744ea

See more details on using hashes here.

Provenance

The following attestation bundles were made for cortex_agent_framework-1.5.0.tar.gz:

Publisher: publish.yml on kritird/Cortex-Agent-Framework

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file cortex_agent_framework-1.5.0-py3-none-any.whl.

File metadata

File hashes

Hashes for cortex_agent_framework-1.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 34a20f4ddec1e380e0f697038582603078599abca33ce0c9a4c902d5d32dcbc3
MD5 d8365f414bb384b971329bafafe951d9
BLAKE2b-256 32412e63ae1fdab5856b2a71f12c85e46ca4516cdfa59a2207a8536c651e3eb2

See more details on using hashes here.

Provenance

The following attestation bundles were made for cortex_agent_framework-1.5.0-py3-none-any.whl:

Publisher: publish.yml on kritird/Cortex-Agent-Framework

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page