Production-grade Agent Operations (AgentOps) Platform

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

🕹️ AgentOps Cockpit

🌐 Official Website & Live Demo

"Infrastructure gives you the pipes. We give you the Intelligence."

The developer distribution for building, optimizing, and securing AI agents on Google Cloud.

📽️ The Mission

Most AI agent templates stop at a single Python file and an API key. The AgentOps Cockpit is for developers moving into production. It provides framework-agnostic governance, safety, and cost guardrails for the entire agentic ecosystem.

Governance-as-Code: Audit your agent against Google Well-Architected best practices with the Evidence Bridge—real-time citations for architectural integrity.
SME Persona Audits: Parallelized review of your codebase by automated "Principal SMEs" across FinOps, SecOps, and Architecture.
Agentic Trinity: Dedicated layers for the Engine (Logic), Face (UX), and Cockpit (Ops).
A2A Connectivity: Implements the Agent-to-Agent Transmission Standard for secure swarm orchestration.
MCP Native: Registration as a Model Context Protocol server for 1P/2P/3P tool consumption.

🏗️ The Agentic Trinity

We divide the complexity of production agents into three focused pillars:

graph LR
   subgraph Trinity [The Agentic Trinity]
       E(The Engine: Reasoning)
       F(The Face: Interface)
       C(The Cockpit: Operations)
   end
   E <--> C
   F <--> C
   E <--> F
   style Trinity fill:#f9f9f9,stroke:#333,stroke-width:2px

⚙️ The Engine: The reasoning core. Built with ADK, FastAPI, and Vertex AI.
🎭 The Face: The user experience. Adaptive UI surfaces and GenUI standards via the A2UI spec.
🕹️ The Cockpit: The operational brain. Cost control, semantic caching, shadow routing, and adversarial audits.

🌐 Framework Agnostic Governance

The Cockpit isn't just for ADK. It provides Best Practices as Code across all major agentic frameworks:

🛠️ Operational Flow

sequenceDiagram
   participant U as User
   participant C as Cockpit
   participant E as Engine
   participant F as Face
   
   U->>C: Prompt / Input
   C->>C: Policy Audit (RFC-307)
   C->>E: Execute Logic / Tools
   E->>C: Action Proposals
   C->>E: Approve (HITL)
   E->>F: GenUI Metadata
   F->>U: Reactive Surface (A2UI)

Whether you are building a swarm in CrewAI, a Go-based high-perf engine, or a Streamlit dashboard, the Cockpit ensures your agent maps to the Google Well-Architected Framework.

🚀 Key Innovation: The "Intelligence" Layer

🛡️ Red Team Auditor (Self-Hacking)

Don't wait for your users to find prompt injections. Use the built-in Adversarial Evaluator to launch self-attacks against your agent, testing for PII leaks, instruction overrides, and safety filter bypasses.

🧠 Hive Mind (Semantic Caching)

Reduce LLM costs by up to 40%. The Hive Mind checks for semantically similar queries in 10ms, serving cached answers for common questions without calling the LLM.

🏛️ Arch Review & Framework Detection

Every agent in the cockpit is graded against a framework-aware checklist. The Cockpit intelligently detects your stack—Google ADK, OpenAI Agentkit, Anthropic Claude, Microsoft AutoGen/Semantic Kernel, AWS Bedrock Agents, or CopilotKit—and runs a tailored audit against corresponding production standards. Use make arch-review to verify your Governance-as-Code.

🕹️ MCP Connectivity Hub (Model Context Protocol)

Stop building one-off tool integrations. The Cockpit provides a unified hub for MCP Servers. Connect to Google Search, Slack, or your internal databases via the standardized Model Context Protocol for secure, audited tool execution. Start the server with make mcp-serve.

🗄️ Situational Database Audits

The Cockpit now performs platform-specific performance and security audits for:

AlloyDB: Optimizes for the Columnar Engine (100x query speedup).
Pinecone: Suggests gRPC and Namespace Isolation for high-perf RAG.
BigQuery: Suggests BQ Vector Search for serverless, cost-effective grounding.
Cloud SQL: Enforces IAM-based authentication via the official Python Connector.

🧗 Quality Hill Climbing (ADK Evaluation)

Following Google ADK Evaluation best practices, the Cockpit provides an iterative optimization loop. make quality-baseline runs your agent against a "Golden Dataset" using LLM-as-a-Judge scoring (Response Match & Tool Trajectory), climbing the quality curve until production-grade fidelity is reached.

🛑 Mandatory Governance Enforcement (NEW)

The Cockpit now acts as a mandatory gate for production.

Blocking CI/CD: GitHub Actions now fail if High Impact cost issues or Red Team security vulnerabilities are detected.
Build-Time Audit: The Dockerfile includes a mandatory RUN audit step. If your agent is not "Well-Architected," the container image will fail to build.

⌨️ Quick Start

The Cockpit is available as a first-class CLI on PyPI.

# 1. Install the Cockpit globally
pip install agentops-cockpit

# 2. Run Global Audit (Produces unified report)
agent-ops report --mode quick        # ⚡ Quick Safe-Build
agent-ops report --mode deep         # 🚀 Full System Audit

# 3. Guardrail Policy Audit (RFC-307)
agent-ops policy-audit --text "How to make a bomb?"

# 4. Global Scaffolding
agent-ops-cockpit create <name> --ui a2ui

🔍 Agent Optimizer v2 (Situational Intelligence)

The Cockpit doesn't just look for generic waste. It now performs Triple-State Analysis:

Legacy Workarounds: Suggests situational fixes for older SDK versions (e.g., manual prompt pruning).
Modernization Paths: Highlights native performance gains (e.g., 90% cost reduction via Context Caching) available in latest SDKs.
Conflict Guard: Real-time cross-package validation to prevent architectural deadlocks (e.g., CrewAI vs LangGraph state loops).

⚡ Quick-Safe Build (12x Faster Loops)

Development velocity shouldn't sacrifice safety. The new --quick mode in the auditor reduces check latency from 1.8s to 0.15s, providing sub-second feedback while maintaining the integrity of the Conflict Guard and Architecture Review.

🧑‍💼 Principal SME Persona Approvals

The Cockpit now features a Multi-Persona Governance Board. Every audit result is framed through the lens of a Principal Engineer in that domain (Security, Legal, FinOps, UX), ensuring your agent is compliant with organizational standards.

📄 Export & Reporting

HTML/PDF Export: Every audit automatically generates cockpit_report.html, a premium, printable report ready for PDF export.
Email Reports: Send audit results directly to stakeholders via the CLI.

📊 Local Development

The Cockpit provides a unified "Mission Control" to evaluate your agents instantly.

make audit         # 🕹️ Run Master Audit (Persona Approved)
make audit-deep    # 🚀 Run Deep Audit (Full SME Verdicts)
make email-report  # 📧 Email the latest result to a stakeholder
make diagnose      # 🩺 Run environment health check
make optimizer-audit # 🔍 Run Optimizer on specific agent files
make reliability   # 🛡️ Run unit tests and regression suite
make dev           # Start the local Engine + Face stack
make arch-review   # 🏛️ Run the Google Well-Architected design review
make quality-baseline # 🧗 Run iterative 'Hill Climbing' quality audit
make red-team      # Execute a white-hat security audit
make deploy-prod   # 🚀 1-click deploy to Google Cloud

🧭 Roadmap

One-Click GitHub Action: Automated governance audits on every PR.
Mandatory Build Gates: Blocking CI/CD and Container audits for production safety.
Multi-Agent Orchestrator: Standardized A2A Swarm/Coordinator patterns.
Visual Mission Control: Real-time cockpit observability dashboard.

View full roadmap →

🤝 Community

Star this repo to help us build the future of AgentOps.
Join the Discussion for patterns on Google Cloud.
Contribute: Read our Contributing Guide.

Reference: Google Cloud Architecture Center - Agentic AI Overview

Project details

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

2.0.19

Apr 23, 2026

2.0.18

Apr 7, 2026

2.0.17

Apr 1, 2026

2.0.16

Mar 31, 2026

2.0.15

Mar 31, 2026

2.0.14

Mar 31, 2026

2.0.13

Mar 31, 2026

2.0.12

Mar 31, 2026

2.0.11

Mar 30, 2026

2.0.10

Mar 30, 2026

2.0.9

Mar 18, 2026

2.0.8

Mar 17, 2026

2.0.6

Mar 10, 2026

2.0.4

Mar 3, 2026

2.0.3

Feb 23, 2026

2.0.2

Feb 17, 2026

2.0.1

Feb 17, 2026

2.0.0

Feb 17, 2026

1.9.0

Feb 17, 2026

1.8.4

Feb 14, 2026

1.8.2

Feb 13, 2026

1.8.1

Feb 13, 2026

1.8.0

Feb 13, 2026

1.7.0

Feb 13, 2026

1.6.9

Feb 13, 2026

1.6.8

Feb 13, 2026

1.6.7

Feb 13, 2026

1.6.6

Feb 13, 2026

1.6.5

Feb 13, 2026

1.6.4

Feb 13, 2026

1.6.3

Feb 13, 2026

1.6.2

Feb 13, 2026

1.6.1

Feb 13, 2026

1.6.0

Feb 12, 2026

1.4.7

Feb 11, 2026

1.4.5

Feb 10, 2026

1.4.4

Feb 10, 2026

1.4.3

Feb 10, 2026

1.4.2

Feb 9, 2026

1.4.1

Feb 9, 2026

1.4.0

Feb 7, 2026

1.3.6

Feb 7, 2026

1.3.5

Feb 7, 2026

1.3.4

Feb 7, 2026

1.3.3

Feb 7, 2026

1.3.2

Feb 7, 2026

1.3.0

Feb 5, 2026

0.9.8

Feb 3, 2026

0.9.7

Jan 29, 2026

This version

0.9.5

Jan 29, 2026

0.5.0

Jan 28, 2026

0.4.1

Jan 28, 2026

0.4.0

Jan 28, 2026

0.3.0

Jan 28, 2026

0.2.2

Jan 28, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentops_cockpit-0.9.5.tar.gz (5.6 MB view details)

Uploaded Jan 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agentops_cockpit-0.9.5-py3-none-any.whl (76.6 kB view details)

Uploaded Jan 29, 2026 Python 3

File details

Details for the file agentops_cockpit-0.9.5.tar.gz.

File metadata

Download URL: agentops_cockpit-0.9.5.tar.gz
Upload date: Jan 29, 2026
Size: 5.6 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for agentops_cockpit-0.9.5.tar.gz
Algorithm	Hash digest
SHA256	`0d814589d9cee48308ebd486d92a036a037adf3c1ecc05e17334dfae6537794f`
MD5	`f3e8710491611c6bf9a7e77d98b53be8`
BLAKE2b-256	`ba7cf88fd0689d9fa50b8c20b49a44d81da7457a2d35cc776c5c8ec9b0689f6a`

See more details on using hashes here.

File details

Details for the file agentops_cockpit-0.9.5-py3-none-any.whl.

File metadata

Download URL: agentops_cockpit-0.9.5-py3-none-any.whl
Upload date: Jan 29, 2026
Size: 76.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for agentops_cockpit-0.9.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bff01c3cfd62de9a6bf15e8c73d46bcd64962f9fab3d721014a943f3fcce2906`
MD5	`974a47af068aeee1819bcfecd95e649e`
BLAKE2b-256	`3ad7e41875e817d56860c5d6bced33aa77021f389c8919a9faebe7c6401acd55`

See more details on using hashes here.

agentops-cockpit 0.9.5

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

🕹️ AgentOps Cockpit

"Infrastructure gives you the pipes. We give you the Intelligence."

📽️ The Mission

🏗️ The Agentic Trinity

🌐 Framework Agnostic Governance

🛠️ Operational Flow

🚀 Key Innovation: The "Intelligence" Layer

🛡️ Red Team Auditor (Self-Hacking)

🧠 Hive Mind (Semantic Caching)

🏛️ Arch Review & Framework Detection

🕹️ MCP Connectivity Hub (Model Context Protocol)

🗄️ Situational Database Audits

🧗 Quality Hill Climbing (ADK Evaluation)

🛑 Mandatory Governance Enforcement (NEW)

⌨️ Quick Start

🔍 Agent Optimizer v2 (Situational Intelligence)

⚡ Quick-Safe Build (12x Faster Loops)

🧑‍💼 Principal SME Persona Approvals

📄 Export & Reporting

📊 Local Development

🧭 Roadmap

🤝 Community

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes