Declarative YAML-based framework for defining, managing, and orchestrating AI coding agent instances

These details have not been verified by PyPI

Project links

Project description

scitex-agent-container

Declarative YAML-based AI agent lifecycle management with tmux/screen, SSH remote deploy, health checks, and auto-accept.

Full Documentation · pip install scitex-agent-container

Problem and Solution

#	Problem	Solution
1	Fragile per-agent scripts — launching Claude Code means hand-rolling shell scripts for tmux, env vars, MCP configs, and auto-accept prompts, with no restart policy or health monitoring	Declarative YAML manifest — one file fully specifies runtime, model, MCP servers, env, health checks, and remote host; `sac agent start` brings the agent up in tmux/screen with auto-accept and a watchdog
2	No fleet story — scaling from one agent to many across machines duplicates the same fragile scripts, with no SSH deploy, no presence, and no inter-agent comms	Remote deploy + state inspection — `sac` copies src files, installs the venv over SSH, and keeps a live view of every pane's state so the fleet behaves as one unit

Problem

Managing AI coding agents (Claude Code) in production requires manual script-writing, environment setup, and process monitoring for each agent instance. Scaling from one agent to a fleet across multiple machines means duplicating fragile shell scripts with no health checks, restart policies, remote deployment, or inter-agent communication.

Solution

scitex-agent-container provides declarative YAML definitions that fully specify an agent -- runtime, model, MCP servers, environment, health checks, remote host -- started with a single command:

YAML manifest + src_CLAUDE.md + src_mcp.json
          |
          v
scitex-agent-container start
          |
          v
tmux/screen session + auto-accept TUI prompts
                     + remote SSH deploy
                     + health monitor
                     + restart policy

Installation

Requires Python >= 3.10.

pip install scitex-agent-container

Architecture

scitex_agent_container/
├── _api/                ← Python API: spawn / inspect / health-check agents
├── _cli/                ← `scitex-agent-container ...` Click commands
│   └── cli_pkg/         ← grouped subcommand modules (account / build / install / lifecycle)
├── _config/             ← layered config (priority: --flag → yaml → env → default)
├── _docker/             ← Dockerfile + container build helpers
├── _slurm/              ← single-agent SLURM dispatch
├── _ssh/                ← remote-deploy entry-points
└── _mcp/                ← MCP server bridge for agent introspection

The CLI is the canonical entry point; the Python API is what the MCP server exposes. The Docker + SLURM + SSH backends share the same agents.yaml schema so a workflow that runs locally also runs unchanged on a SLURM cluster.

Demo

flowchart LR
    A["scitex-agent-container<br/>start --agent foo"] --> B{backend?}
    B -- "local" --> C[docker run]
    B -- "slurm" --> D[sbatch]
    B -- "ssh" --> E[ssh remote &amp;&amp; nohup]
    C & D & E --> F[(agent process)]
    F --> G["MCP server<br/>scitex-agent-container mcp start"]
    G --> H["agent.list / agent.health<br/>(agent introspection tools)"]

End-to-end: a single start command spawns an agent on whichever backend the config selects, the MCP server exposes its lifecycle tools, and downstream health / list queries surface the live state.

Part of SciTeX

scitex-agent-container is part of SciTeX. Install via the umbrella with pip install scitex[agent-container] to use as scitex.agent_container (Python) or scitex agent-container ... (CLI).

Templates

config/templates/ ships six minimal pattern templates — copy and adapt:

Template	Pattern	When to use
`local.yaml`	claude-code on local host	Default; shares operator's env (skills, MCP, venv)
`docker.yaml`	claude-code in Docker	Local isolation; `mount_host_claude` opt-in
`apptainer.yaml`	claude-code in Apptainer/Singularity	HPC compute nodes / locked-down hosts
`ssh.yaml`	claude-code via SSH on remote host	Cross-machine fleet member
`ssh-slurm.yaml`	SLURM-submitted job (with auto-resubmit)	Long-running compute on shared cluster
`mcp.yaml`	claude-code with MCP server wiring	Agent that needs MCP tool access

Concrete real-world configs live in config/examples/ (e.g. newbie-docker.yaml, researcher-opus.yaml). Both directories are validated by tests/test_templates_v3_valid.py — every shipped YAML must round-trip through load_config, and the SLURM template must additionally render a valid sbatch script.

To instantiate (dir-as-SSoT — agent name is derived from the parent directory):

mkdir -p ~/.scitex/orochi/agents/my-agent
cp config/templates/local.yaml ~/.scitex/orochi/agents/my-agent/my-agent.yaml
scitex-agent-container start my-agent

1 Interfaces

CLI

sac agent start <agent-yaml>      # launch declared agent in tmux/screen with auto-accept + watchdog
sac agent stop <agent>            # graceful stop
sac agent status                  # live state of every pane
sac deploy <host>           # SSH-deploy fleet to remote host
sac --help-recursive        # full subcommand tree

Quickstart (v2 config)

Create agent definition directory:

my-agent/
  my-agent.yaml     # Agent config
  src_CLAUDE.md      # -> deployed to {workdir}/CLAUDE.md
  src_mcp.json       # -> deployed to {workdir}/.mcp.json
  src_env            # -> deployed to {workdir}/.env  (mode 0600)

The src_* family is a generic file-deploy pipeline: a sibling file named src_X next to the YAML is materialized into the workspace at agent start, with ${VAR} and ${metadata.name} interpolation. src_env is the dotenv variant — sourceable by anything the agent spawns (cron jobs, ssh-launched commands, fresh shells), not just the multiplexer session. See _skills/scitex-agent-container/06_env-injection-ports.md for the four distinct env-injection ports and when to use each.

Write a YAML manifest:

apiVersion: scitex-agent-container/v2
kind: Agent
metadata:
  name: my-agent
  labels:
    role: worker
    machine: local
spec:
  runtime: claude-code
  model: sonnet
  multiplexer: tmux       # tmux (default) or screen

  claude:
    flags:
      - --dangerously-skip-permissions
    # session: continue-or-new (default) | continue | new
    # continue-or-new: pass --continue iff a prior session exists for the
    #   workdir, else launch fresh. Preserves /compact history across
    #   rolling restarts without risking a hard failure.
    # continue: always pass --continue (fails if no prior session)
    # new:      never pass --continue
    session: continue-or-new

  skills:
    required:
      - scitex

  health:
    enabled: true
    interval: 60
    method: multiplexer-alive

  restart:
    policy: on-failure
    max_retries: 3

v2 auto-derives from metadata.name: workdir, session name, env vars (CLAUDE_AGENT_ID, CLAUDE_AGENT_ROLE, etc.), and pre-start hooks. Sibling src_CLAUDE.md and src_mcp.json files are deployed to the workspace with ${metadata.name} and ${ENV_VAR} interpolation.

Start and monitor:

scitex-agent-container start my-agent.yaml
scitex-agent-container inspect my-agent         # Live state detection
scitex-agent-container show-status my-agent
scitex-agent-container show-logs my-agent -n 100
scitex-agent-container attach my-agent          # Ctrl-B D to detach (tmux)

Remote SSH Deployment

Deploy agents to remote machines:

spec:
  remote:
    host: mba              # SSH hostname
    user: ywatanabe
    timeout: 180

scitex-agent-container start remote-agent.yaml   # SSHs to remote, launches there
scitex-agent-container stop remote-agent.yaml     # Accepts name or YAML path
scitex-agent-container inspect my-remote-agent    # Live state from remote

SLURM (single-agent)

Submit an agent as an sbatch job that holds the allocation, runs claude in tmux on the compute node, and auto-resubmits before walltime via a SIGUSR1 trap:

spec:
  runtime: slurm
  slurm:
    partition: cascade
    cpus_per_task: 4
    mem: "16G"
    time_limit: "7-00:00:00"
    auto_resubmit: true
    hooks:
      pre_agent: ~/path/to/module-load.sh    # `module load Python/3.11.3` etc.

sac agent start head-spartan/head-spartan.yaml   # submits sbatch on the local SLURM submission host
sac agent attach head-spartan                    # srun --pty + tmux attach on the compute node
sac agent stop head-spartan                      # scancel + clear state

SLURM (multi-tenant — many agents on one allocation)

Requires pip install scitex-agent-container[slurm] (pulls scitex-hpc>=0.6.1).

Book a reservation once, then launch many agents into the same allocation. Cuts queue wait from minutes per launch to one ssh round-trip per launch:

# Once: book a node for the day
scitex-hpc reservations book dev-pool \
    --host spartan --partition cascade \
    --cpus 8 --mem 32G --time 7-0 \
    --tmux-server sac --persistent

# All day: launch agents into it
sac agent start dev-helper.yaml         # tmux session in dev-pool's allocation
sac agent start doc-builder.yaml        # second tmux session, same allocation
sac agent start test-runner.yaml        # third, same allocation

sac agent attach dev-helper             # interactive on compute node

# When done with the day's pool:
scitex-hpc reservations release dev-pool

Tenant agent YAML — note the new runtime kind and the slurm.reservation field:

spec:
  runtime: slurm-tenant
  slurm:
    reservation: dev-pool         # name of the existing scitex-hpc lease
  claude:
    flags: [--dangerously-skip-permissions]

The reservation's hold body bootstraps a long-lived tmux server as PID 1 of the sbatch script (via --tmux-server sac), so tenant tmux sessions survive past their setup commands. Without it, srun --overlap step cgroups would terminate them within seconds.

Compatible with HPC policies banning persistent daemons — every operation is bastion-initiated SSH, no crontab @reboot, no autossh, no tunnel. SLURM's documented SIGUSR1 signal handles walltime auto-resubmit.

MCP Servers (src_mcp.json)

MCP config lives alongside the YAML as src_mcp.json -- visible, editable, version-controlled:

{
  "mcpServers": {
    "scitex-orochi": {
      "type": "stdio",
      "command": "bun",
      "args": ["run", "~/proj/scitex-orochi/ts/mcp_channel.ts"],
      "env": {
        "SCITEX_OROCHI_URL": "wss://scitex-orochi.com",
        "SCITEX_OROCHI_AGENT": "${metadata.name}",
        "SCITEX_OROCHI_TOKEN": "${SCITEX_OROCHI_TOKEN}"
      }
    }
  }
}

~ in args is expanded at deploy time. ${metadata.name} interpolates from YAML. ${ENV_VAR} resolves from the environment.

Auto-Accept TUI Prompts

Claude Code shows confirmation prompts for dangerous flags. The auto-accept system handles them automatically using modular prompt handlers (runtimes/prompts.py):

# Each handler: detect prompt text -> send number key + Enter
PromptHandler(name="bypass-permissions",
              detect=lambda c: "2. Yes, I accept" in c,
              keys=["2", "Enter"])

Handlers are order-agnostic, use numbered option text for reliability, and work with both tmux and screen. New prompts are added by appending to PROMPT_HANDLERS.

Diagnostics logged to ~/.scitex/agent-container/logs/{name}/auto-accept.log.

CLI Commands

# Lifecycle (accepts name or YAML path)
scitex-agent-container start <config.yaml>
scitex-agent-container stop <name|yaml>
scitex-agent-container restart <name|yaml>

# Inspection
scitex-agent-container inspect <name> [--json]   # Live pane state detection
scitex-agent-container show-status [name] [--json]   # Rich status dict (see below)
scitex-agent-container list [--json] [--capability X] [--machine Y]
scitex-agent-container show-logs <name> [-n LINES]
scitex-agent-container check-health <name> [--json]
scitex-agent-container attach <name>

# Hook event ingestor (wired from Claude Code hooks, see below)
scitex-agent-container ingest-hook-event <pretool|posttool|prompt|stop|other>

# Pane actions (see "Pane Actions" below)
scitex-agent-container actions run <nonce-probe|compact> <agent> [--json]
scitex-agent-container actions query [--agent X] [--action Y] [--since 2h]
scitex-agent-container actions stats [--agent X] [--since 7d]
scitex-agent-container actions purge [--days N]

# A2A protocol — standalone agent endpoint, no fleet deps
# (echo handler by default; --handler claude_cli runs `claude --print`)
scitex-agent-container a2a serve <agent.yaml>... [--port 8888] [--handler echo|claude_cli|exec]

# Configuration
scitex-agent-container validate <config.yaml>
scitex-agent-container check <config.yaml>

# Maintenance
scitex-agent-container clean-registry

Rich Status (`status <name> --json`)

status <name> --json returns a non-agentic snapshot of the agent suitable for dashboards or fleet monitors. The payload merges the base registry entry with fields from agent_meta.collect_rich() and event_log.summarize():

Field	Description
`pane_text`	Recent tmux `capture-pane` output, secrets redacted
`pane_state`	Classified: `running` / `idle_prompt` / `y_n_prompt` / `auth_error` / `compose_pending_unsent` / `limit_reached` / `unknown`
`stuck_prompt_text`	Last line when `pane_state` indicates a blocking prompt
`claude_md`	Workspace `CLAUDE.md` contents (truncated)
`mcp_json`	Workspace `.mcp.json` with token-like values redacted
`recent_tools`, `recent_prompts`	Last N tool uses / user prompts from the hook ring-buffer
`agent_calls`, `background_tasks`	Subagent launches and `Bash run_in_background=true` starts
`tool_counts`	`{tool_name: count}` over the window
`last_tool_at`, `last_tool_name`	ISO timestamp and name of the newest `pretool` event (any tool) -- functional heartbeat, distinguishes "process alive" from "LLM actually producing tool calls"
`last_mcp_tool_at`, `last_mcp_tool_name`	Same, restricted to tools whose name starts with `mcp__` -- MCP sidecar health probe
`last_action_at`, `last_action_name`	ISO timestamp and name of the most recent `PaneAction` attempt. `last_action_name` (renamed from `last_action`) avoids a column collision with orochi's hub schema.
`last_action_outcome`, `last_action_elapsed_s`	Outcome (`success`, `precondition_fail`, `send_error`, `completion_timeout`, `skipped_by_policy`) and wall-clock duration of that attempt
`action_counts`	`{action_name: count}` rollup from `action_store.summarize()`
`p95_elapsed_s_by_action`	`{action_name: p95_seconds}` per-action latency headline
`context_pct`, `current_tool`, `current_task`, `last_user_msg`, `model_transcript`	Derived from the active Claude Code transcript JSONL
`quota_5h_used_pct`, `quota_7d_used_pct`, `quota_*_reset_at`	Claude usage (best-effort, cached)
`metrics`	Host-level CPU / memory / load / disk (psutil)

Every field is best-effort: failures leave the default value ("", 0, []) rather than raising.

scitex-agent-container show-status my-agent --json | jq '.pane_state, .recent_tools[-3:]'

Claude Code Hook Integration

hook-event is the non-agentic counterpart to the status command: Claude Code invokes it on every tool call / prompt / stop, and the handler appends a compact JSON record to a per-agent ring-buffer at $XDG_DATA_HOME/.scitex/agent-container/events/<agent>.jsonl (capped at 500 lines). status --json reads that buffer to populate recent_tools, recent_prompts, agent_calls, background_tasks, and tool_counts.

Wire it in the agent workspace's .claude/settings.local.json:

{
  "hooks": {
    "PreToolUse":       [{"matcher": "", "hooks": [
      {"type": "command", "command": "scitex-agent-container ingest-hook-event pretool"}
    ]}],
    "PostToolUse":      [{"matcher": "", "hooks": [
      {"type": "command", "command": "scitex-agent-container ingest-hook-event posttool"}
    ]}],
    "UserPromptSubmit": [{"matcher": "", "hooks": [
      {"type": "command", "command": "scitex-agent-container ingest-hook-event prompt"}
    ]}],
    "Stop":             [{"matcher": "", "hooks": [
      {"type": "command", "command": "scitex-agent-container ingest-hook-event stop"}
    ]}]
  }
}

Agent name resolution order: --agent <name> flag > SCITEX_OROCHI_AGENT env var > CLAUDE_AGENT_ID env var > basename of the current working directory. The handler swallows all errors so a broken log can never block a tool call.

Pane Actions

A typed, logged vocabulary for pane-mediated agent actions. Each action is a PaneAction subclass implementing four methods (snapshot / precheck / send / is_complete); the run_action engine classifies every attempt as success, precondition_fail, send_error, completion_timeout, or skipped_by_policy, and writes it to a host-wide SQLite log at ~/.scitex/agent-container/actions.db (agent is a column, not a path). Two concrete actions ship today:

NonceProbeAction -- sends Repeat <nonce> and confirms the model echoes it back (true functional liveness, not just "process alive").
CompactAction -- sends /compact and confirms by watching context_pct drop by at least --min-drop-pct (default 20).

# Run an attempt (non-zero exit on any non-SUCCESS / non-SKIPPED).
scitex-agent-container actions run nonce-probe <agent>
scitex-agent-container actions run compact <agent> \
    --min-drop-pct 30 --timeout 60 --json

# Query / aggregate / purge the attempt log.
scitex-agent-container actions query \
    --agent <agent> --action compact --since 2h --limit 20
scitex-agent-container actions stats --agent <agent> --since 7d
scitex-agent-container actions purge --days 14

The latest attempt is folded into status --json via agent_meta.collect_rich() as last_action_at / last_action_name / last_action_outcome / last_action_elapsed_s, with rollups action_counts and p95_elapsed_s_by_action.

Reliable send_keys into a running pane needs an inter-key delay and a settle window before Enter. Both are configurable via env vars (read once at import time by runtimes/tmux.py and runtimes/screen.py):

Env var	Default	Meaning
`SAC_KEY_DELAY_S`	`0.1`	Delay between individual keys
`SAC_SUBMIT_SETTLE_S`	`0.3`	Settle after text, before `Enter`
`SAC_ACTION_RETENTION_DAYS`	`30`	Default `actions purge --days` horizon

A send_text_and_submit(session, text) helper wraps the "type then submit" pattern used by every action's send.

Zero Coupling to Downstream Orchestrators

scitex-agent-container is a generic library. It knows nothing about scitex-orochi, the hub, or any particular dashboard. status --json emits a self-describing dict; downstream consumers (e.g. orochi's heartbeat-push command) wrap it -- calling status --json, reshaping the payload, and POSTing to whatever endpoint they own. Keeping the two sides decoupled lets you swap the orchestrator, the transport, or the schema without touching this package.

YAML Spec Reference

Section	Key Fields	Description
`apiVersion`	`scitex-agent-container/v2`, `cld-agent/v1`	Config format version
`metadata`	`name`, `labels`	Agent identity and labels
`spec.runtime`	`claude-code`, `slurm`, `slurm-tenant`	Agent runtime selector
`spec.model`	`sonnet`, `opus[1m]`	Model selection
`spec.multiplexer`	`tmux` (default), `screen`	Terminal multiplexer
`spec.remote`	`host`, `user`, `timeout`	SSH remote deployment
`spec.claude`	`flags[]`, `session`, `auto_accept`	Claude Code options. `session` values: `continue-or-new` (default, try `--continue` with graceful fallback), `continue` (strict resume), `new` (always fresh). Top-level `spec.session:` also accepted and takes precedence.
`spec.health`	`enabled`, `interval`, `method`	Health monitoring
`spec.restart`	`policy`, `max_retries`, `backoff`	Auto-restart
`spec.skills`	`required[]`, `available[]`	Skill injection
`spec.env`	key-value pairs	Environment variables
`spec.venv`	path	Python virtualenv to activate
`spec.hooks`	`pre_start`, `post_start`, `pre_stop`, `post_stop`	Lifecycle hooks
`spec.container`	`runtime`, `image`, `volumes`	Docker/Apptainer

Four Freedoms for Research

The freedom to run your research anywhere -- your machine, your terms.

The freedom to study how every step works -- from raw data to final manuscript.

The freedom to redistribute your workflows, not just your papers.

The freedom to modify any module and share improvements with the community.

AGPL-3.0 — because we believe research infrastructure deserves the same freedoms as the software it runs on.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.21.9

Jun 1, 2026

0.21.7

Jun 1, 2026

0.21.5

May 31, 2026

0.21.4

May 31, 2026

0.21.3

May 28, 2026

0.21.2

May 28, 2026

0.21.1

May 26, 2026

0.21.0

May 25, 2026

0.20.0

May 24, 2026

0.19.0

May 24, 2026

0.18.0

May 21, 2026

0.17.3

May 20, 2026

0.17.2

May 20, 2026

0.17.0

May 17, 2026

0.16.0

May 15, 2026

0.15.0

May 13, 2026

This version

0.14.0

May 9, 2026

0.13.0

May 3, 2026

0.12.0

May 2, 2026

0.10.7

Apr 28, 2026

0.10.6

Apr 28, 2026

0.10.5

Apr 28, 2026

0.10.4

Apr 28, 2026

0.10.3

Apr 28, 2026

0.10.2

Apr 28, 2026

0.10.1

Apr 28, 2026

0.10.0

Apr 28, 2026

0.9.1

Apr 27, 2026

0.9.0

Apr 27, 2026

0.7.1

Apr 11, 2026

0.7.0

Apr 11, 2026

0.5.0

Apr 8, 2026

0.4.2

Apr 8, 2026

0.4.1

Apr 7, 2026

0.4.0

Apr 7, 2026

0.3.3

Apr 7, 2026

0.3.2

Apr 7, 2026

0.3.1

Apr 7, 2026

0.3.0

Apr 7, 2026

0.2.0

Apr 7, 2026

0.1.0

Apr 7, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scitex_agent_container-0.14.0.tar.gz (8.7 MB view details)

Uploaded May 9, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

scitex_agent_container-0.14.0-py3-none-any.whl (8.2 MB view details)

Uploaded May 9, 2026 Python 3

File details

Details for the file scitex_agent_container-0.14.0.tar.gz.

File metadata

Download URL: scitex_agent_container-0.14.0.tar.gz
Upload date: May 9, 2026
Size: 8.7 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.0rc1

File hashes

Hashes for scitex_agent_container-0.14.0.tar.gz
Algorithm	Hash digest
SHA256	`617a680331359a726d3bfe3f257c77947edc231c780d5714bcecbc2cac722b38`
MD5	`e4117cddb1dd457c27e6de7dc7f6c651`
BLAKE2b-256	`36e3492e96130bbccd028c2abfc9b08cb2d2163f77f942fc7b6ad3247351d666`

See more details on using hashes here.

File details

Details for the file scitex_agent_container-0.14.0-py3-none-any.whl.

File metadata

Download URL: scitex_agent_container-0.14.0-py3-none-any.whl
Upload date: May 9, 2026
Size: 8.2 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.0rc1

File hashes

Hashes for scitex_agent_container-0.14.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c237b501c90a215903d8dc986bdd38d9bf83577d2384f5dc464c4abbe4fe16ef`
MD5	`5b4d3a1be546ece3284c1af1e6fcd73c`
BLAKE2b-256	`832936eb2534078122e335dacbdf25a2987ab611fb33360eb045fbd64d265c43`

See more details on using hashes here.

scitex-agent-container 0.14.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

scitex-agent-container

Problem and Solution

Problem

Solution

Installation

Architecture

Demo

Part of SciTeX

Templates

1 Interfaces

Quickstart (v2 config)

Remote SSH Deployment

SLURM (single-agent)

SLURM (multi-tenant — many agents on one allocation)

MCP Servers (src_mcp.json)

Auto-Accept TUI Prompts

CLI Commands

Rich Status (status <name> --json)

Claude Code Hook Integration

Pane Actions

Zero Coupling to Downstream Orchestrators

YAML Spec Reference

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Rich Status (`status <name> --json`)