Harness-engineered agent pipeline library with 21-stage dual-abstraction architecture, built on the Anthropic API

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

CocoRoF

These details have not been verified by PyPI

Project description

geny-executor

A harness-engineered agent pipeline library — 21 stages, 5 LLM providers, MCP-native, fully introspectable.

geny-executor implements a 21-stage pipeline with dual-abstraction architecture (stage slots × strategy slots). Inspired by Claude Code's agent loop and Anthropic's harness design principles. No LangChain. No LangGraph. Just an explicit, modular pipeline where every step is observable, mutatable, and swappable.

한국어 README · Architecture · Providers · Error codes · Claude Code CLI host

Why geny-executor?

Problem	geny-executor's answer
Frameworks hide too much behind abstractions	Every one of the 21 stages is explicit, inspectable, and individually swappable.
Hard to customize one part without rewriting everything	Dual abstraction: swap a whole stage or swap a strategy inside a stage. Manifest-driven so config = artifact.
Vendor lock-in across LLM providers	One contract, five providers wired in (`anthropic` / `openai` / `google` / `vllm` / `claude_code_cli`). Switch by editing one config field.
Agent loops are opaque black boxes	Event-bus + stable structured error codes (`exec.cli.auth_failed`, …) — every failure groups cleanly in your logs / Sentry / i18n layer.
MCP integration is a side concern	First-class. Host-attached MCP servers + per-session MCP wraps for CLI backends (e.g. Claude Code CLI) ship out of the box.
Cost tracking is an afterthought	Built into Stage 7 (Token). Per-call cost, per-session ledger, budget guards.

Architecture at a glance

The 21-stage pipeline

Phase A — Setup (once per turn)
  1: Input  →  2: Context  →  3: System  →  4: Guard  →  5: Cache

Phase B — Generate + Dispatch (loop)
  6: API  →  7: Token  →  8: Think  →  9: Parse
  → 10: Tool  →  11: ToolReview  →  12: Agent  →  13: TaskRegistry
  → 14: Evaluate  →  15: HITL  →  16: Loop

Phase C — Surface (once)
  17: Emit  →  18: Memory  →  19: Summarize  →  20: Persist  →  21: Yield

The full stage list with strategy options lives in docs/architecture.md.

Dual abstraction — two levels of swap

┌─ Level 1: Stage Abstraction ─────────────────────────┐
│   Swap an entire stage module in/out of the pipeline. │
│                                                       │
│  ┌─ Level 2: Strategy Abstraction ─────────────────┐  │
│  │   Swap internal logic within a stage.            │  │
│  │                                                  │  │
│  │   ContextStage can use:                          │  │
│  │     → SimpleLoad     (default)                   │  │
│  │     → ProgressiveDisclosure                      │  │
│  │     → VectorSearch                               │  │
│  │     → YourCustomStrategy                         │  │
│  └──────────────────────────────────────────────────┘  │
└────────────────────────────────────────────────────────┘

Stage Abstraction — replace a whole stage (e.g. drop a custom APIStage for a private provider).
Strategy Abstraction — change behaviour inside a stage (e.g. switch context loading from SimpleLoad to VectorSearch) without touching the surrounding pipeline.

Installation

pip install geny-executor

Optional extras:

pip install geny-executor[memory]   # numpy for vector retrieval
pip install geny-executor[all]      # everything
pip install geny-executor[dev]      # dev/test tooling

Requirements: Python 3.11+. At least one provider's credentials (Anthropic API key, OpenAI API key, …) or a local CLI binary (claude for claude_code_cli).

Quick start

Minimal pipeline

import asyncio
from geny_executor import PipelinePresets

async def main():
    pipeline = PipelinePresets.minimal(api_key="sk-ant-...")
    result = await pipeline.run("What is the capital of France?")
    print(result.text)

asyncio.run(main())

Chat pipeline (history + system prompt + optional tools)

from geny_executor import PipelinePresets

pipeline = PipelinePresets.chat(
    api_key="sk-ant-...",
    system_prompt="You are a helpful coding assistant.",
)

result = await pipeline.run("Explain Python decorators")
print(result.text)
print(f"Cost: ${result.total_cost_usd:.4f}")

Full agent (all 21 stages — tools, evaluation, memory, loop control)

from geny_executor import PipelinePresets
from geny_executor.tools import ToolRegistry, Tool, ToolResult, ToolContext

class SearchTool(Tool):
    @property
    def name(self) -> str: return "search"
    @property
    def description(self) -> str: return "Search the web for information"
    @property
    def input_schema(self) -> dict:
        return {
            "type": "object",
            "properties": {"query": {"type": "string"}},
            "required": ["query"],
        }
    async def execute(self, input, context):
        return ToolResult(content=f"Results for: {input['query']}")

registry = ToolRegistry()
registry.register(SearchTool())

pipeline = PipelinePresets.agent(
    api_key="sk-ant-...",
    system_prompt="You are a research assistant. Use tools to find answers.",
    tools=registry,
    max_turns=20,
)

result = await pipeline.run("Find the latest Python release version")

Custom pipeline with builder

from geny_executor import PipelineBuilder

pipeline = (
    PipelineBuilder("my-agent", api_key="sk-ant-...")
    .with_model(model="claude-sonnet-4-6", max_tokens=4096)
    .with_system(prompt="You are a concise assistant.")
    .with_context()
    .with_guard(cost_budget_usd=1.0, max_iterations=30)
    .with_cache(strategy="aggressive")
    .with_tools(registry=my_registry)
    .with_think(enabled=True, budget_tokens=10000)
    .with_evaluate()
    .with_loop(max_turns=30)
    .with_memory()
    .build()
)

result = await pipeline.run("Complex multi-step task here")

Manifest-driven pipeline (recommended for hosts)

from geny_executor import Pipeline, CredentialBundle, ProviderCredentials, EnvironmentManifest

manifest = EnvironmentManifest.load("./envs/my_env.json")
credentials = CredentialBundle(by_provider={
    "anthropic": ProviderCredentials(api_key="sk-ant-..."),
})
pipeline = await Pipeline.from_manifest_async(manifest, credentials=credentials)
result = await pipeline.run("Hello!")

See docs/manifest.md for the full schema.

Five LLM providers, one contract

Provider	Notes
`anthropic`	Claude family. Full streaming, native `tool_use`, thinking blocks.
`openai`	GPT-4.1 / o-series. Streaming, tools, JSON-schema structured output.
`google`	Gemini 3.x / 2.5. Streaming, tools, thinking blocks.
`vllm`	Any model on a local vLLM endpoint. OpenAI-compatible. Tools opt-in via `configure_capabilities()`.
`claude_code_cli`	Subprocess-driven Claude Code CLI. Hosts attach a per-session MCP bridge to surface their own tool registry to the spawned CLI's LLM. See `docs/claude_code_cli.md`.

A session picks its provider via stages[6].config["provider"] in the manifest. Credentials flow through a single CredentialBundle channel — see docs/providers.md.

Error codes (2.1.0+)

Every executor exception carries a stable exec.<component>.<reason> code:

from geny_executor import APIError, ExecutorErrorCode, ErrorCategory

try:
    result = await pipeline.run("...")
except APIError as e:
    if e.code is ExecutorErrorCode.EXEC_CLI_AUTH_FAILED:
        print("Please re-login to Claude Code CLI.")
    elif e.category.is_recoverable:
        print(f"Recoverable failure ({e.code.value}); retrying.")

Structured event payloads also carry the code:

{
  "type": "pipeline.error",
  "data": {
    "error": "Claude Code CLI is not authenticated …",
    "code": "exec.cli.auth_failed",
    "exception_type": "geny_executor.core.errors.APIError"
  }
}

Codes are stable across releases — see docs/error_codes.md for the full table, recoverability, and how to add a new code.

Sessions

Persistent state across multiple interactions:

from geny_executor import PipelinePresets
from geny_executor.session import SessionManager

manager = SessionManager()
pipeline = PipelinePresets.chat(api_key="sk-ant-...")
session = manager.create(pipeline)

await session.run("My name is Alice")
result = await session.run("What's my name?")

for info in manager.list_sessions():
    print(f"{info.session_id}: {info.message_count} msgs, ${info.total_cost_usd:.4f}")

Event system + observability

@pipeline.on("stage.enter")
async def _(event):
    print(f"→ {event.stage}")

@pipeline.on("pipeline.error")
async def _(event):
    print(f"❌ {event.data['code']}: {event.data['error']}")

@pipeline.on("*")
async def _(event):
    pass   # firehose

Streaming:

async for event in pipeline.run_stream("Solve step by step"):
    if event.type == "stage.enter":
        print(f"Stage: {event.stage}")
    elif event.type == "pipeline.complete":
        print(f"Final: {event.data['result'].text}")

Tools + MCP

from geny_executor.tools import Tool, ToolResult, ToolContext, ToolRegistry

class Calculator(Tool):
    @property
    def name(self): return "calculator"
    @property
    def description(self): return "Perform arithmetic."
    @property
    def input_schema(self):
        return {"type": "object", "properties": {"expression": {"type": "string"}}, "required": ["expression"]}
    async def execute(self, input, context):
        return ToolResult(content=str(eval(input["expression"])))   # use a safe evaluator!

registry = ToolRegistry()
registry.register(Calculator())

Connect a host-attached MCP server:

from geny_executor.tools.mcp import MCPManager

mcp = MCPManager()
await mcp.connect("filesystem", command="npx", args=["-y", "@anthropic/mcp-filesystem"])
for tool in mcp.list_tools():
    registry.register(tool)

For the CLI-side MCP wrap (your tool registry exposed into a spawned Claude Code CLI's LLM), see docs/claude_code_cli.md.

Pipeline presets

Preset	Active stages	Use case
`PipelinePresets.minimal()`	Input → API → Parse → Yield	Quick Q&A, smoke tests
`PipelinePresets.chat()`	+ Context, System, Guard, Cache, Token, Tool, Loop, Memory	Conversational chatbot
`PipelinePresets.agent()`	All 21 stages active	Autonomous agent with tools, eval, memory, summarisation, persistence
`PipelinePresets.evaluator()`	Input → System → API → Parse → Evaluate → Yield	Generator/Evaluator quality pass
`PipelinePresets.geny_vtuber()`	All 21 stages + VTuber/TTS emitters	Reference reproduction of the Geny VTuber harness

Custom stages + strategies

from geny_executor.core.stage import Strategy

class MyContextStrategy(Strategy):
    name = "my_context"
    description = "Custom context loading with RAG"

    def configure(self, config: dict) -> None:
        self.top_k = config.get("top_k", 5)

    async def load(self, state):
        ...   # your RAG retrieval

from geny_executor.core.stage import Stage
from geny_executor.core.state import PipelineState

class LoggingStage(Stage[dict, dict]):
    name = "logging"
    order = 7      # after API, before Think
    category = "execution"

    async def execute(self, input, state: PipelineState):
        print(f"[{state.iteration}] API response received")
        return input

pipeline.register_stage(LoggingStage())

Project structure

geny-executor/
├── src/geny_executor/
│   ├── __init__.py          # Public API surface
│   ├── py.typed             # PEP 561 type marker
│   ├── core/                # Pipeline engine, errors, manifest, mutation, snapshot
│   ├── stages/              # 21 pipeline stages (s01–s21)
│   ├── llm_client/          # 5 providers + ClientRegistry + CredentialBundle + CLI runtime
│   ├── tools/               # Tool ABC, registry, router, MCP integration
│   ├── hooks/               # PRE/POST tool-use lifecycle hooks
│   ├── memory/              # Memory v2 retrieval, vault map, vector store
│   ├── skills/              # SkillProvider + skill loading
│   ├── subagents/           # Stage 12 sub-agent orchestration
│   ├── permission/          # Per-tool ACL evaluated by RegistryRouter
│   ├── channels/            # Output channel adapters (text, callback, TTS, …)
│   ├── cron/                # Scheduled trigger support
│   ├── events/              # EventBus pub/sub
│   ├── history/             # Conversation history primitives
│   ├── telemetry/           # Event / metric exporters
│   └── session/             # Session manager + freshness checks
├── docs/                    # Architecture, providers, manifest, error codes, MCP, hooks
├── tests/                   # 3100+ unit, conformance, contract, integration tests
├── pyproject.toml           # Package configuration (Hatch)
└── LICENSE                  # MIT

Development

git clone https://github.com/CocoRoF/geny-executor.git
cd geny-executor

pip install -e ".[dev]"

pytest                                                       # full suite (~30s, 3100+ tests)
pytest tests/contract/test_error_codes_stability.py          # error code stability check
pytest --cov=geny_executor --cov-report=term-missing         # coverage

ruff check src/ tests/
ruff format src/ tests/

Versioning

Version	Highlights
2.1.0	`ExecutorErrorCode` taxonomy + structured `pipeline.error` / `stage.error` / `api.retry` payloads. `docs/error_codes.md`.
2.0.6	Removed `copilot_cli` provider (text-only, can't host tool round-trip). Upstreamed Geny's claude_code_cli compat patches (`--verbose` injection, `--bare` strip, drop auto-`--tools ""`, `tool_use` strip from finalize).
2.0.5	`APIRequest.mcp_config` per-request override + auto-emit `--strict-mcp-config`. Foundational support for the host MCP wrap.
2.0.0	Provider abstraction (`ClientRegistry`, `CredentialBundle`). Manifest single source of truth for Stage 6 provider.
1.x	Original 16-stage pipeline; Anthropic-only.

See CHANGELOG for the full history.

License

MIT — see LICENSE.

Related projects

Anthropic SDK
OpenAI SDK
Google GenAI SDK
vLLM
Claude Code CLI — geny-executor hosts it via claude_code_cli provider
MCP — Model Context Protocol; both host-attached servers and per-session CLI wraps are first-class
Geny — Multi-agent platform built on geny-executor

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

CocoRoF

These details have not been verified by PyPI

Release history Release notifications | RSS feed

2.3.0

Jun 10, 2026

2.2.1

Jun 10, 2026

2.2.0

Jun 10, 2026

2.1.4

Jun 9, 2026

This version

2.1.3

Jun 4, 2026

2.1.2

Jun 4, 2026

2.1.1

Jun 4, 2026

2.1.0

May 21, 2026

2.0.6

May 20, 2026

2.0.5

May 19, 2026

2.0.4

May 19, 2026

2.0.3

May 19, 2026

2.0.2

May 19, 2026

2.0.1

May 18, 2026

2.0.0

May 17, 2026

1.21.0

May 5, 2026

1.20.0

May 5, 2026

1.19.0

May 5, 2026

1.18.0

May 5, 2026

1.17.2

May 5, 2026

1.17.1

May 4, 2026

1.17.0

May 4, 2026

1.16.0

May 4, 2026

1.15.0

May 3, 2026

1.13.0

May 3, 2026

1.12.0

May 1, 2026

1.11.0

May 1, 2026

1.10.0

May 1, 2026

1.9.0

May 1, 2026

1.8.0

Apr 30, 2026

1.3.3

Apr 29, 2026

1.3.1

Apr 28, 2026

1.3.0

Apr 26, 2026

1.0.0

Apr 25, 2026

0.46.0

Apr 25, 2026

0.45.0

Apr 25, 2026

0.44.0

Apr 25, 2026

0.43.0

Apr 25, 2026

0.42.0

Apr 25, 2026

0.41.0

Apr 24, 2026

0.40.0

Apr 24, 2026

0.39.0

Apr 24, 2026

0.38.0

Apr 24, 2026

0.37.0

Apr 24, 2026

0.36.1

Apr 24, 2026

0.36.0

Apr 24, 2026

0.35.0

Apr 24, 2026

0.34.0

Apr 24, 2026

0.33.0

Apr 24, 2026

0.32.3

Apr 24, 2026

0.31.2

Apr 24, 2026

0.31.1

Apr 24, 2026

0.31.0

Apr 24, 2026

0.30.0

Apr 22, 2026

0.29.0

Apr 21, 2026

0.28.0

Apr 20, 2026

0.27.0

Apr 20, 2026

0.26.2

Apr 20, 2026

0.26.1

Apr 20, 2026

0.26.0

Apr 20, 2026

0.25.0

Apr 20, 2026

0.24.0

Apr 20, 2026

0.23.0

Apr 20, 2026

0.22.1

Apr 20, 2026

0.22.0

Apr 20, 2026

0.20.1

Apr 20, 2026

0.20.0

Apr 19, 2026

0.13.5

Apr 18, 2026

0.13.4

Apr 18, 2026

0.13.3

Apr 17, 2026

0.13.2

Apr 17, 2026

0.13.1

Apr 17, 2026

0.13.0

Apr 17, 2026

0.12.0

Apr 17, 2026

0.11.0

Apr 17, 2026

0.9.0

Apr 16, 2026

0.8.3

Apr 14, 2026

0.8.1

Apr 14, 2026

0.8.0

Apr 14, 2026

0.7.3

Apr 14, 2026

0.7.1

Apr 14, 2026

0.5.0

Apr 13, 2026

0.4.0

Apr 10, 2026

0.3.0

Apr 10, 2026

0.2.3

Apr 10, 2026

0.2.2

Apr 9, 2026

0.2.1

Apr 9, 2026

0.2.0

Apr 9, 2026

0.1.0

Apr 9, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

geny_executor-2.1.3.tar.gz (827.9 kB view details)

Uploaded Jun 4, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

geny_executor-2.1.3-py3-none-any.whl (746.3 kB view details)

Uploaded Jun 4, 2026 Python 3

File details

Details for the file geny_executor-2.1.3.tar.gz.

File metadata

Download URL: geny_executor-2.1.3.tar.gz
Upload date: Jun 4, 2026
Size: 827.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for geny_executor-2.1.3.tar.gz
Algorithm	Hash digest
SHA256	`0777e9856a7283d225931673f8ae08dea83a02321c92452987ff323c5be8231f`
MD5	`8d2290364324bae67c7561e01595aa6f`
BLAKE2b-256	`89f5cf9450d89ecb268b928de7a34418778219a0467a6eb768b83cefedc1994f`

See more details on using hashes here.

Provenance

The following attestation bundles were made for geny_executor-2.1.3.tar.gz:

Publisher: publish.yml on CocoRoF/geny-executor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: geny_executor-2.1.3.tar.gz
- Subject digest: 0777e9856a7283d225931673f8ae08dea83a02321c92452987ff323c5be8231f
- Sigstore transparency entry: 1715407606
- Sigstore integration time: Jun 4, 2026
Source repository:
- Permalink: CocoRoF/geny-executor@963b2600b30f95469cd502a664918904d05aca59
- Branch / Tag: refs/heads/main
- Owner: https://github.com/CocoRoF
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@963b2600b30f95469cd502a664918904d05aca59
- Trigger Event: workflow_dispatch

File details

Details for the file geny_executor-2.1.3-py3-none-any.whl.

File metadata

Download URL: geny_executor-2.1.3-py3-none-any.whl
Upload date: Jun 4, 2026
Size: 746.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for geny_executor-2.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8a6c7584cf184a2fffc2afa05c2fcadbaa05b9485f180d8a07676a84646681ad`
MD5	`68353ec2550a8bab3b46428c17d43338`
BLAKE2b-256	`cbcd9fc80dd56a3028e91342d7b9f94c29de1e61916fd5679403f4f83af6e7a2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for geny_executor-2.1.3-py3-none-any.whl:

Publisher: publish.yml on CocoRoF/geny-executor

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: geny_executor-2.1.3-py3-none-any.whl
- Subject digest: 8a6c7584cf184a2fffc2afa05c2fcadbaa05b9485f180d8a07676a84646681ad
- Sigstore transparency entry: 1715407654
- Sigstore integration time: Jun 4, 2026
Source repository:
- Permalink: CocoRoF/geny-executor@963b2600b30f95469cd502a664918904d05aca59
- Branch / Tag: refs/heads/main
- Owner: https://github.com/CocoRoF
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@963b2600b30f95469cd502a664918904d05aca59
- Trigger Event: workflow_dispatch

geny-executor 2.1.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

geny-executor

Why geny-executor?

Architecture at a glance

The 21-stage pipeline

Dual abstraction — two levels of swap

Installation

Quick start

Minimal pipeline

Chat pipeline (history + system prompt + optional tools)

Full agent (all 21 stages — tools, evaluation, memory, loop control)

Custom pipeline with builder

Manifest-driven pipeline (recommended for hosts)

Five LLM providers, one contract

Error codes (2.1.0+)

Sessions

Event system + observability

Tools + MCP

Pipeline presets

Custom stages + strategies

Project structure

Development

Versioning

License

Related projects

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance