Production-grade Agent Operations (AgentOps) Platform
Project description
🕹️ AgentOps Cockpit
"Infrastructure gives you the pipes. We give you the Intelligence."
The developer distribution for building, optimizing, and securing AI agents on Google Cloud.
📽️ The Mission
Most AI agent templates stop at a single Python file and an API key. The AgentOps Cockpit is for developers moving into production. It provides framework-agnostic governance, safety, and cost guardrails for the entire agentic ecosystem.
- Governance-as-Code: Audit your agent against Google Well-Architected best practices with the Evidence Bridge—real-time citations for architectural integrity.
- SME Persona Audits: Parallelized review of your codebase by automated Principal SMEs across FinOps, SecOps, Architecture, and Quality.
- Agentic Trinity: Dedicated layers for the Engine (Logic), Face (UX), and Cockpit (Ops).
- A2A Connectivity: Implements the Agent-to-Agent Transmission Standard for secure swarm orchestration.
- MCP Native: Registration as a Model Context Protocol server for 1P/2P/3P tool consumption.
🏗️ The Agentic Trinity
We divide the complexity of production agents into three focused pillars:
graph TD
subgraph Trinity [The Agentic Trinity 2.0]
E(The Engine: Reasoning)
F(The Face: Interface)
C(The Cockpit: Operations)
S{Sovereignty & Compliance}
end
E <--> C
F <--> C
E <--> F
E -.-> S
F -.-> S
C -.-> S
style Trinity fill:#f8fafc,stroke:#334155,stroke-width:2px
style S fill:#0ea5e9,color:#fff,stroke:#0284c7
- ⚙️ The Engine: The reasoning core. Built with ADK, FastAPI, and Vertex AI.
- 🎭 The Face: The user experience. Adaptive UI surfaces and GenUI standards via the A2UI spec.
- 🕹️ The Cockpit: The operational brain. Cost control, semantic caching, shadow routing, and adversarial audits.
🕹️ v1.6.6: The "Watchtower Standard" Release (LATEST)
Evolving into a full Lifecycle Management Platform for AI Agents. See the v1.6.6 Release Notes. The ultimate end-to-end management platform for professional AI agents. The Cockpit has been refactored into a Sovereign Hub Hierarchy for simplified operations.
- 🛰️ Fleet Hub (
fleet): Stateful registry and runtime oversight. Monitor health withfleet status, scale withmothball/resume, and iterative withtunnel. - 🛡️ Audit Hub (
audit): Principal SME Principal board. Run master reviews withaudit report, security scans withaudit security, and context/token visualization withaudit context. - 🧪 Reliability Hub (
test): Regression and smoke validation. Run unit tests withtest unit, persona journeys withtest smoke, and adversarial user stress-testing withtest simulate. - 🚀 Deployment Hub (
deploy): The multi-cloud factory. End-to-end pipelines viadeploy sovereignand GCP/AWS/Azure migration viadeploy migrate. - 🔧 Evolution Hub (
fix): Autonomous code synthesis. Apply targeted audit fixes withfix issueor trigger thefix evolve"PR Closer". - 🕵️ Sentinel Journey: Reasoning-based runtime oversight. Identifies behavioral anomalies, suspicious intent, and tool misuse via
fleet anomaly. - 🚨 Proactive Enforcement: High-fidelity "Kill Switch". The Cockpit automatically mothballs agents that exhibit critical risks during runtime audits.
- 🏗️ Scaffolding Hub (
create): Trinity Project initialization. Bootstrap unified Engine/Face/Cockpit projects viacreate trinity.
🚀 Key Innovation: The "Intelligence" Layer
🛡️ Red Team Auditor (Adversarial SRE)
Don't wait for your users to find prompt injections. Use the built-in Adversarial Evaluator to launch self-attacks against your agent, testing for PII leaks, instruction overrides, and multilingual jailbreaks.
🧠 Hive Mind (Semantic Caching)
Reduce LLM costs by up to 40%. The Hive Mind checks for semantically similar queries in 10ms, serving cached answers for common questions without calling the LLM.
🏛️ Arch Review & Autonomous Evolution
Every agent in the cockpit is graded against a framework-aware checklist. The Cockpit intelligently detects your stack and runs a tailored Architecture Review. v1.3 introduces Autonomous Evolution—the ability to synthesize code fixes directly from audit findings.
🕹️ MCP Connectivity Hub (Model Context Protocol)
Stop building one-off tool integrations. The Cockpit provides a unified hub for MCP Servers. Connect to 1P/2P/3P tools via the standardized Model Context Protocol for secure, audited tool execution. Start the server with make mcp-serve.
🗄️ Situational Database Audits
The Cockpit now performs platform-specific performance and security audits for AlloyDB, Pinecone, BigQuery, and Cloud SQL.
🛡️ Advanced Governance & Discovery (v1.3.5)
Modern agents don't just live in agent.py. The Cockpit uses a centralized Discovery Engine to intelligently map your project:
.gitignoreCompliance: Zero-noise scanning that respects your project's ignore rules.- Multi-Target Logic: Define
targets: []incockpit.yamlto audit distributed agents in a single pass. - Template Isolation: Automatically ignores raw template placeholders (e.g., Jinja/Cookiecutter) to focus on the active implementation.
- Artifact Store: All data (SARIF, Evidence, HTML) is now sovereignly stored in the
.cockpit/directory.
⌨️ Master Command Registry
The Cockpit is available as a first-class CLI and a comprehensive Makefile-based operational toolkit.
| Registry | Description |
|---|---|
| 🕹️ Makefile Commands | Standard local development and orchestration shortcuts. |
| 🚀 UVX Master Guide | Portable, zero-install commands for CI/CD and automation. |
🧑💼 Principal SME Persona Approvals
The Cockpit now features a Multi-Persona Governance Board. Every audit result is framed through the lens of a Principal Engineer in that domain:
🚀 Production Readiness Auditor
The Cockpit serves as the final gate before production deployment. make deploy-prod triggers a deep benchmark of the entire ecosystem:
- v1.6.6 Deep System Audit: Benchmarks models (Gemini 2.0 Pro/Flash) and logic.
- Stress Testing: Load testing endpoints to ensure concurrency safety.
- Red Team Verification: Adversarial security scans for prompt injection and PII.
- Resiliency Check: Verifies
@retrylogic andtimeoutguards are active.
🛡️ Privacy & Telemetry
The AgentOps Cockpit follows a Privacy-First, Sovereign Standard.
By default, the CLI sends anonymous operational metrics (e.g., event names, OS type, success rates) to the Global Pulse hub to help us understand fleet health and prioritize improvements. We do not collect names, emails, code snippets, secrets, or folder paths.
🌑 How to Opt-Out
If you prefer 100% isolation, you can disable telemetry by setting the following environment variable in your shell:
export AGENTOPS_TELEMETRY_ENABLED=false
Alternatively, you can set it in your local cockpit.yaml:
telemetry:
enabled: false
🤝 Ecosystem & Attribution
The AgentOps Cockpit is designed to leverage and secure the best-of-breed tools in the Google Cloud ecosystem. We explicitly acknowledge and leverage the excellent work from:
- GoogleCloudPlatform/agent-starter-pack: We leverage this as a core reference for the Agent Development Kit (ADK) patterns and Vertex AI Agent Engine integration.
- A2UI Protocol: Standardized Generative UI handshake for building adaptive, agentic user interfaces.
- A2A Standard: Agent-to-Agent Transmission Protocol for secure swarm intelligence and inter-agent communication.
- Model Context Protocol (MCP): Our unified tool execution standard, enabling portable and secure 1P/2P/3P integrations.
- LangChain & LangGraph: Foundational libraries for stateful, multi-agent reasoning loops and graph-based orchestration.
- CrewAI: Multi-agent framework used as a reference for collaborative task execution and role-playing agents.
- Firebase: Provider for enterprise-grade hosting and global distribution of the Face layer.
- Google Cloud Run & GKE: High-scale orchestration platforms for the Engine and cluster-wide agent fleets.
- Vertex AI SDK: The backbone for frontier reasoning (Gemini 3) and enterprise-grade model governance.
- Tenacity: The gold-standard library for the exponential backoff and resiliency patterns we enforce.
- Rich: Modern visualization engine that powers the high-fidelity Cockpit CLI experience.
Reference: Google Cloud Architecture Center - Agentic AI Overview
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file agentops_cockpit-1.6.6.tar.gz.
File metadata
- Download URL: agentops_cockpit-1.6.6.tar.gz
- Upload date:
- Size: 18.6 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.16 {"installer":{"name":"uv","version":"0.9.16","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
866f8d25c64f14231194670c463c85ffe9924d167cf5491b2d689468f550121c
|
|
| MD5 |
1636e0ef123f6046d66196cb8dffbd2d
|
|
| BLAKE2b-256 |
99230e245af28242a9a58253cc4ac4f5499111b4e2fb7b3f4cc65fc4d6d2ab46
|
File details
Details for the file agentops_cockpit-1.6.6-py3-none-any.whl.
File metadata
- Download URL: agentops_cockpit-1.6.6-py3-none-any.whl
- Upload date:
- Size: 232.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.16 {"installer":{"name":"uv","version":"0.9.16","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5376a39e99c7d6edf5de264771db130856199dadaee69ad4f750902511e565ee
|
|
| MD5 |
34b2e518bb5d63d9b9438beca8e84d7d
|
|
| BLAKE2b-256 |
72078f5217a51c5ab48001ae03115178e5cb0b1233eedf1e0de3f5c26b83d1b7
|