Self-hosted SRE investigation copilot with YAML tools, SSH execution, SSE streaming, and secret redaction.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

benjamin-j

These details have not been verified by PyPI

Project description

ops-copilot

Self-hosted SRE investigation copilot for production systems.

ops-copilot lets an LLM call tools defined in YAML, execute safe remote commands over SSH, redact secrets from outputs, and stream investigation events through LangGraph or an optional FastAPI SSE server.

It is built for maintainers who want reviewed operational tools, not arbitrary shell access.

30-second demo

From a checkout, run a local incident replay, review a toolpack, and exercise the three-package demo without SSH credentials or API keys:

uv sync --dev
uv run ops-copilot replay examples/incidents/disk-full.yaml
uv run ops-copilot review examples/toolpacks/systemd.yaml
uv run python examples/orchestrated_demo.py

Expected signal:

incident=disk-full
toolpack_review status=ok
Starting Coordinated Local Investigation

The orchestrated demo uses ops-copilot, ollama-orchestra, and langchain-content-normalizer together when the three repos are checked out side-by-side.

Who this is for

SREs and platform engineers running self-hosted infrastructure.
Open source maintainers operating docs, bots, CI runners, demos, or package services.
Teams that want reviewed operational tools instead of free-form shell access.
Developers building incident-investigation UIs around LangGraph or LangChain.

Maintenance workflows

This repository is maintained with CI, build checks, smoke tests, release workflows, Dependabot, issue templates, PR checklists, a security model, and PyPI releases.

Typical maintainer tasks include reviewing YAML tools, triaging operational edge cases, adding tests for sanitizer and command-rendering behavior, and preparing safe releases.

What makes it safe

Tools are reviewed YAML or Python objects, not free-form shell generated by a model.
String parameters are constrained by allowed_values, pattern, or a conservative default validator.
Built-in command policy blocks obvious destructive commands unless the tool explicitly opts in.
Tool outputs are redacted before they are returned to the model, audit log, or UI.
dry_run: true lets maintainers review rendered commands without executing them.
JSONL audit logs and incident reports preserve redacted evidence for post-incident review.

Architecture

User question -> InvestigationGraph -> LLM -> YAML tools -> SSH host
                                      <- redacted tool output <- command result

The package is intentionally generic. You can start with shell tools from YAML, then inject custom Python RemoteTool classes for richer workflows.

Install

uv add ops-copilot

Optional extras:

uv add 'ops-copilot[server]'
uv add 'ops-copilot[openai]'
uv add 'ops-copilot[ollama]'

YAML tools

tools:
  - name: disk_usage
    type: shell
    description: Show filesystem usage.
    command: df -h

  - name: journalctl_service
    type: shell
    description: Show recent logs for a systemd service.
    command: journalctl -u {service} --since '{since}' --no-pager
    parameters:
      service:
        type: string
      since:
        type: string
        required: false
        default: "30 minutes ago"

Minimal usage

from ops_copilot import InvestigationGraph, SSHClient, ToolRegistry

ssh = SSHClient(host="server.example.com", user="deploy", key_path="~/.ssh/id_ed25519")
tools = ToolRegistry(ssh, config_path="tools.yaml").load()

graph = InvestigationGraph(
    llm=your_langchain_chat_model,
    tools=tools,
    system_prompt="You are an SRE copilot. Investigate safely and report evidence.",
)

async for event in graph.stream("The API is slow. What should I check?"):
    print(event)

Streaming events

InvestigationGraph.stream() yields dictionaries with these event names:

Event	Meaning
`token`	streamed model text
`tool_start`	tool call started with input and step id
`tool_end`	tool call finished with redacted output
`error`	graph or stream error
`done`	investigation complete

Optional FastAPI server

The ops_copilot.server.create_app() helper exposes:

POST /investigate
POST /investigate/stream

If OPS_COPILOT_API_KEY is set, clients must send X-API-Key.

CLI

ops-copilot review examples/toolpacks/systemd.yaml
ops-copilot replay examples/incidents/disk-full.yaml

The CLI is intentionally small: it exposes the maintainer workflows that should run before a toolpack or incident fixture is trusted.

Security notes

This project executes commands on servers you control. Treat tools.yaml as privileged code.

Recommendations:

Use SSH key auth with least-privilege users.
Review every command template before exposing it to an LLM.
Avoid destructive commands in YAML.
Keep parameterized commands narrow.
Store no secrets in YAML or prompts.
Rely on built-in redaction as a safety net, not as your only control.

Built-in redaction covers env-style secret lines, Bearer tokens, OpenAI-style keys, JWTs, long hex runs, and inline image data URLs.

Shell tools also apply a conservative command policy. Obvious destructive commands such as rm, dd, mkfs, shutdown, docker rm, docker prune, and systemctl restart are blocked unless the YAML tool explicitly opts in with policy.allow_destructive: true. Use dry_run: true to review rendered commands without executing them.

Audit logs

Use JsonlAuditLog to append redacted investigation events for incident review:

from ops_copilot import InvestigationGraph, JsonlAuditLog

graph = InvestigationGraph(
    llm=your_langchain_chat_model,
    tools=tools,
    system_prompt="Investigate safely and cite evidence.",
    audit_log=JsonlAuditLog("audit/investigation.jsonl"),
)

Documentation and examples

docs/security-model.md documents threat boundaries and deployment controls.
docs/threat-model.md documents STRIDE-style risks and mitigations.
docs/why-ops-copilot.md explains the project scope and ecosystem need.
docs/demo.md shows a local demo that runs without real SSH credentials.
docs/codex-maintenance.md documents safe Codex-style maintenance workflows.
docs/writing-tools.md explains YAML and custom Python tools.
docs/server.md covers the optional FastAPI/SSE integration.
docs/maintenance-workflows.md describes maintainer workflows and review checklists.
docs/toolpacks.md documents reviewed example toolpacks.
docs/incident-fixtures.md documents fake incidents for demos and regression tests.
.github/workflows/ecosystem-smoke.yml verifies the three-package demo across sibling repos.
examples/local_demo.py runs without a real SSH host using fake outputs.
examples/replay_incident.py replays fake incident fixtures for demos.
examples/custom_tool.py shows how to inject a custom RemoteTool class.

Roadmap

SQLite-backed investigation sessions.
More reviewed toolpacks for common self-hosted services.
More incident fixture coverage for regression tests.
Coverage and type-checking gates in CI.

Development

uv sync --dev
uv run ruff check .
uv run pytest
uv run python scripts/smoke.py
uv build

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

benjamin-j

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.10

Jun 3, 2026

0.1.9

Jun 2, 2026

0.1.8

Jun 2, 2026

This version

0.1.7

Jun 2, 2026

0.1.6

Jun 1, 2026

0.1.5

Jun 1, 2026

0.1.4

Jun 1, 2026

0.1.3

Jun 1, 2026

0.1.2

Jun 1, 2026

0.1.0

Jun 1, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ops_copilot-0.1.7.tar.gz (218.9 kB view details)

Uploaded Jun 2, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ops_copilot-0.1.7-py3-none-any.whl (22.8 kB view details)

Uploaded Jun 2, 2026 Python 3

File details

Details for the file ops_copilot-0.1.7.tar.gz.

File metadata

Download URL: ops_copilot-0.1.7.tar.gz
Upload date: Jun 2, 2026
Size: 218.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for ops_copilot-0.1.7.tar.gz
Algorithm	Hash digest
SHA256	`30e36b2298e680a55bf797475ffbe2d4a3812c0571b50a1b0d44c9a89a0618f4`
MD5	`4c95f35e7f37c0553a7bc5461a210ae3`
BLAKE2b-256	`e20f2166f9c9ed3f3b4f3d209e760dc17c1c478062d8a0558c7f208ffd2155a1`

See more details on using hashes here.

Provenance

The following attestation bundles were made for ops_copilot-0.1.7.tar.gz:

Publisher: publish.yml on BenjaminJornet/ops-copilot

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: ops_copilot-0.1.7.tar.gz
- Subject digest: 30e36b2298e680a55bf797475ffbe2d4a3812c0571b50a1b0d44c9a89a0618f4
- Sigstore transparency entry: 1698878544
- Sigstore integration time: Jun 2, 2026
Source repository:
- Permalink: BenjaminJornet/ops-copilot@01cc6f8a135380aed461c8e26245069ba7821abd
- Branch / Tag: refs/tags/v0.1.7
- Owner: https://github.com/BenjaminJornet
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@01cc6f8a135380aed461c8e26245069ba7821abd
- Trigger Event: release

File details

Details for the file ops_copilot-0.1.7-py3-none-any.whl.

File metadata

Download URL: ops_copilot-0.1.7-py3-none-any.whl
Upload date: Jun 2, 2026
Size: 22.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for ops_copilot-0.1.7-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3c7d3369aec57426ea70265910f2b0ee852f03ea9768bb4ad1efdca61b649554`
MD5	`fe6c3be6a620c9669551eabf11cccc80`
BLAKE2b-256	`876262f8f3bc8348c59e894c8c216dbe5b348d686aee8c3ec5c9edcb3edb7e65`

See more details on using hashes here.

Provenance

The following attestation bundles were made for ops_copilot-0.1.7-py3-none-any.whl:

Publisher: publish.yml on BenjaminJornet/ops-copilot

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: ops_copilot-0.1.7-py3-none-any.whl
- Subject digest: 3c7d3369aec57426ea70265910f2b0ee852f03ea9768bb4ad1efdca61b649554
- Sigstore transparency entry: 1698878863
- Sigstore integration time: Jun 2, 2026
Source repository:
- Permalink: BenjaminJornet/ops-copilot@01cc6f8a135380aed461c8e26245069ba7821abd
- Branch / Tag: refs/tags/v0.1.7
- Owner: https://github.com/BenjaminJornet
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@01cc6f8a135380aed461c8e26245069ba7821abd
- Trigger Event: release

ops-copilot 0.1.7

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

ops-copilot

30-second demo

Who this is for

Maintenance workflows

What makes it safe

Architecture

Install

YAML tools

Minimal usage

Streaming events

Optional FastAPI server

CLI

Security notes

Audit logs

Documentation and examples

Roadmap

Development

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance