Skip to main content

Production-grade Agent Operations (AgentOps) Platform

Project description

🕹️ AgentOps Cockpit

AgentOps Cockpit Trinity

"Infrastructure gives you the pipes. We give you the Intelligence."

The developer distribution for building, optimizing, and securing AI agents on Google Cloud.


📽️ The Mission

Most AI agent templates stop at a single Python file and an API key. The AgentOps Cockpit is for developers moving into production. It provides framework-agnostic governance, safety, and cost guardrails for the entire agentic ecosystem.

  • Governance-as-Code: Audit your agent against Google Well-Architected best practices with the Evidence Bridge—real-time citations for architectural integrity.
  • SME Persona Audits: Parallelized review of your codebase by automated Principal SMEs across FinOps, SecOps, Architecture, and Quality.
  • Agentic Trinity: Dedicated layers for the Engine (Logic), Face (UX), and Cockpit (Ops).
  • A2A Connectivity: Implements the Agent-to-Agent Transmission Standard for secure swarm orchestration.
  • MCP Native: Registration as a Model Context Protocol server for 1P/2P/3P tool consumption.

🏗️ The Agentic Trinity

We divide the complexity of production agents into three focused pillars:

graph TD
   subgraph Trinity [The Agentic Trinity 2.0]
       E(The Engine: Reasoning)
       F(The Face: Interface)
       C(The Cockpit: Operations)
       S{Sovereignty & Compliance}
   end
   E <--> C
   F <--> C
   E <--> F
   E -.-> S
   F -.-> S
   C -.-> S
   style Trinity fill:#f8fafc,stroke:#334155,stroke-width:2px
   style S fill:#0ea5e9,color:#fff,stroke:#0284c7
  • ⚙️ The Engine: The reasoning core. Built with ADK, FastAPI, and Vertex AI.
  • 🎭 The Face: The user experience. Adaptive UI surfaces and GenUI standards via the A2UI spec.
  • 🕹️ The Cockpit: The operational brain. Cost control, semantic caching, shadow routing, and adversarial audits.
Ecosystem Integrations

🕹️ v1.4.4: The "Sovereign Evolution" Release (NEW)

Evolving into a full Lifecycle Management Platform for AI Agents. See the v1.4.4 Release Notes.

  • 🛡️ Validation Automation: Integrated /validate workflow to enforce 100% SME approval and regression safety.

🚢 v1.4.7: The "Fleet Sovereign" Release (LATEST)

The ultimate end-to-end factory for professional AI agents.

  • 🚢 Sovereign Fleet Pipeline (10X): Unified orchestrator (uv run agentops-cockpit sovereign) that Audits, Hardens, Hydrates, Deploys, and Registers fleets of 1 to 50+ agents in a single command.
  • 🌊 Multi-Cloud Sovereign Factory: Full end-to-end support for AWS App Runner and Azure Container Apps, including cloud-specific hydration assets (Dockerfile.aws, aws-sam.json, azure-deploy.json).
  • 🛫 Phase 0: Pre-flight Handshake: Identity and toolchain verification gate that ensures IAM principals and CLIs are active before expensive fleet operations launch.
  • 🌉 Cross-Cloud A2A Bridge: Seamlessly register your AWS/Azure agents as native Vertex AI tools via the A2A Proxy Registration logic.
  • ☸️ Industrial GKE Autopilot: Native Kubernetes support for high-scale agent fleets, including LoadBalancer exposure and resource-aware scaling.
  • 📡 Gemini Enterprise Tool-use: Seamlessly register your cross-cloud agents as native Vertex AI tools via the Agent Engine and A2A Bridge.
  • 💧 ADK-Native Transition: Automatically upgrades generic agents to the high-fidelity Agent Development Kit (ADK) standard.
  • 🧗 Autonomous Evolution (10X): The "PR Closer" mode. Surgically fixes detected gaps and creates a hardened deployment branch automatically.
  • 🕵️ Shadow Mode (10X): Differential reasoning analysis to detect drift, latency, and cost delta between agent versions.
  • 💰 SME Consensus 2.0: Unified approval engine requiring a unanimous "Sovereign Standard" from all 11 Principal SMEs.

🚀 Key Innovation: The "Intelligence" Layer

🛡️ Red Team Auditor (Adversarial SRE)

Don't wait for your users to find prompt injections. Use the built-in Adversarial Evaluator to launch self-attacks against your agent, testing for PII leaks, instruction overrides, and multilingual jailbreaks.

🧠 Hive Mind (Semantic Caching)

Reduce LLM costs by up to 40%. The Hive Mind checks for semantically similar queries in 10ms, serving cached answers for common questions without calling the LLM.

🏛️ Arch Review & Autonomous Evolution

Every agent in the cockpit is graded against a framework-aware checklist. The Cockpit intelligently detects your stack and runs a tailored Architecture Review. v1.3 introduces Autonomous Evolution—the ability to synthesize code fixes directly from audit findings.

🕹️ MCP Connectivity Hub (Model Context Protocol)

Stop building one-off tool integrations. The Cockpit provides a unified hub for MCP Servers. Connect to 1P/2P/3P tools via the standardized Model Context Protocol for secure, audited tool execution. Start the server with make mcp-serve.

🗄️ Situational Database Audits

The Cockpit now performs platform-specific performance and security audits for AlloyDB, Pinecone, BigQuery, and Cloud SQL.


🛡️ Advanced Governance & Discovery (v1.3.5)

Modern agents don't just live in agent.py. The Cockpit uses a centralized Discovery Engine to intelligently map your project:

  • .gitignore Compliance: Zero-noise scanning that respects your project's ignore rules.
  • Multi-Target Logic: Define targets: [] in cockpit.yaml to audit distributed agents in a single pass.
  • Template Isolation: Automatically ignores raw template placeholders (e.g., Jinja/Cookiecutter) to focus on the active implementation.
  • Artifact Store: All data (SARIF, Evidence, HTML) is now sovereignly stored in the .cockpit/ directory.

⌨️ Master Command Registry

The Cockpit is available as a first-class CLI and a comprehensive Makefile-based operational toolkit.

Registry Description
🕹️ Makefile Commands Standard local development and orchestration shortcuts.
🚀 UVX Master Guide Portable, zero-install commands for CI/CD and automation.

🧑‍💼 Principal SME Persona Approvals

The Cockpit now features a Multi-Persona Governance Board. Every audit result is framed through the lens of a Principal Engineer in that domain:


🚀 Production Readiness Auditor

The Cockpit serves as the final gate before production deployment. make deploy-prod triggers a deep benchmark of the entire ecosystem:

  1. v1.4.1 Deep System Audit: Benchmarks models (Gemini 3 Pro/Flash) and logic.
  2. Stress Testing: Load testing endpoints to ensure concurrency safety.
  3. Red Team Verification: Adversarial security scans for prompt injection and PII.
  4. Resiliency Check: Verifies @retry logic and timeout guards are active.


🛡️ Privacy & Telemetry

The AgentOps Cockpit follows a Privacy-First, Sovereign Standard.

By default, the CLI sends anonymous operational metrics (e.g., event names, OS type, success rates) to the Global Pulse hub to help us understand fleet health and prioritize improvements. We do not collect names, emails, code snippets, secrets, or folder paths.

🌑 How to Opt-Out

If you prefer 100% isolation, you can disable telemetry by setting the following environment variable in your shell:

export AGENTOPS_TELEMETRY_ENABLED=false

Alternatively, you can set it in your local cockpit.yaml:

telemetry:
  enabled: false

🤝 Ecosystem & Attribution

The AgentOps Cockpit is designed to leverage and secure the best-of-breed tools in the Google Cloud ecosystem. We explicitly acknowledge and leverage the excellent work from:

  • GoogleCloudPlatform/agent-starter-pack: We leverage this as a core reference for the Agent Development Kit (ADK) patterns and Vertex AI Agent Engine integration.
  • A2UI Protocol: Standardized Generative UI handshake for building adaptive, agentic user interfaces.
  • A2A Standard: Agent-to-Agent Transmission Protocol for secure swarm intelligence and inter-agent communication.
  • Model Context Protocol (MCP): Our unified tool execution standard, enabling portable and secure 1P/2P/3P integrations.
  • LangChain & LangGraph: Foundational libraries for stateful, multi-agent reasoning loops and graph-based orchestration.
  • CrewAI: Multi-agent framework used as a reference for collaborative task execution and role-playing agents.
  • Firebase: Provider for enterprise-grade hosting and global distribution of the Face layer.
  • Google Cloud Run & GKE: High-scale orchestration platforms for the Engine and cluster-wide agent fleets.
  • Vertex AI SDK: The backbone for frontier reasoning (Gemini 3) and enterprise-grade model governance.
  • Tenacity: The gold-standard library for the exponential backoff and resiliency patterns we enforce.
  • Rich: Modern visualization engine that powers the high-fidelity Cockpit CLI experience.

Reference: Google Cloud Architecture Center - Agentic AI Overview

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentops_cockpit-1.4.7.tar.gz (18.6 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

agentops_cockpit-1.4.7-py3-none-any.whl (221.5 kB view details)

Uploaded Python 3

File details

Details for the file agentops_cockpit-1.4.7.tar.gz.

File metadata

  • Download URL: agentops_cockpit-1.4.7.tar.gz
  • Upload date:
  • Size: 18.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for agentops_cockpit-1.4.7.tar.gz
Algorithm Hash digest
SHA256 b45a8e7f70c86bd1603a5658667a46d6049d993c5fd057ae6a8a5bd6a005a773
MD5 97f4bfb9713733779494800ca4937953
BLAKE2b-256 32e8c3de8ae477f1d764f1247f2388bf14f6a86ef743c9002d064184b8c681b7

See more details on using hashes here.

File details

Details for the file agentops_cockpit-1.4.7-py3-none-any.whl.

File metadata

File hashes

Hashes for agentops_cockpit-1.4.7-py3-none-any.whl
Algorithm Hash digest
SHA256 fa4a998c973323c6adfc35261462f68898a3dd815c7e362e451721e194fa9b17
MD5 269282e67794c076d4815dceca58ec60
BLAKE2b-256 d2104c793449698629e110fe1cdcf0564e4011611be044ec0d73a7173106d77e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page