Production-grade Agent Operations (AgentOps) Platform
Project description
🕹️ AgentOps Cockpit
"Infrastructure gives you the pipes. We give you the Intelligence."
The developer distribution for building, optimizing, and securing AI agents on Google Cloud.
📽️ The Mission
Most AI agent templates stop at a single Python file and an API key. The AgentOps Cockpit is for developers moving into production. While optimized for ADK, it provides framework-agnostic governance, safety, and cost guardrails for the entire agentic ecosystem—from CrewAI to LangGraph. Based on the Google Well-Architected Framework for Agents.
🏗️ The Agentic Trinity
We divide the complexity of production agents into three focused pillars:
- ⚙️ The Engine: The reasoning core. Built with ADK, FastAPI, and Vertex AI.
- 🎭 The Face: The user experience. Adaptive UI surfaces and GenUI standards via the A2UI spec.
- 🕹️ The Cockpit: The operational brain. Cost control, semantic caching, shadow routing, and adversarial audits.
🌐 Framework Agnostic Governance
The Cockpit isn't just for ADK. It provides Best Practices as Code across all major agentic frameworks:
Whether you are building a swarm in CrewAI, a Go-based high-perf engine, or a Streamlit dashboard, the Cockpit ensures your agent maps to the Google Well-Architected Framework.
🚀 Key Innovation: The "Intelligence" Layer
🛡️ Red Team Auditor (Self-Hacking)
Don't wait for your users to find prompt injections. Use the built-in Adversarial Evaluator to launch self-attacks against your agent, testing for PII leaks, instruction overrides, and safety filter bypasses.
🧠 Hive Mind (Semantic Caching)
Reduce LLM costs by up to 40%. The Hive Mind checks for semantically similar queries in 10ms, serving cached answers for common questions without calling the LLM.
🏛️ Arch Review & Framework Detection
Every agent in the cockpit is graded against a framework-aware checklist. The Cockpit intelligently detects your stack—Google ADK, OpenAI Agentkit, Anthropic Claude, Microsoft AutoGen/Semantic Kernel, AWS Bedrock Agents, or CopilotKit—and runs a tailored audit against corresponding production standards. Use make arch-review to verify your Governance-as-Code.
🧗 Quality Hill Climbing (ADK Evaluation)
Following Google ADK Evaluation best practices, the Cockpit provides an iterative optimization loop. make quality-baseline runs your agent against a "Golden Dataset" using LLM-as-a-Judge scoring (Response Match & Tool Trajectory), climbing the quality curve until production-grade fidelity is reached.
⌨️ Quick Start
The Cockpit is available as a first-class CLI on PyPI.
# 1. Install the Cockpit globally
pip install agentops-cockpit
# 2. Audit your existing agent design
agent-ops arch-review
# 3. Stress test your endpoint
agent-ops load-test --requests 100 --concurrency 10
# 4. Scaffold a new Well-Architected app
agent-ops create my-agent --ui a2ui
You can also use uvx for one-off commands without installation:
uvx agentops-cockpit arch-review
📊 Local Development
The Cockpit provides a unified "Mission Control" to evaluate your agents instantly.
make audit-all # 🕹️ Run ALL audits and generate a Final Report
make reliability # 🛡️ Run unit tests and regression suite
make dev # Start the local Engine + Face stack
make arch-review # 🏛️ Run the Google Well-Architected design review
make quality-baseline # 🧗 Run iterative 'Hill Climbing' quality audit
make audit # 🔍 Run the Interactive Agent Optimizer
make red-team # Execute a white-hat security audit
make deploy-prod # 🚀 1-click deploy to Google Cloud
🧭 Roadmap
- One-Click GitHub Action: Automated audits on every PR.
- Multi-Agent Orchestrator: Support for Swarm/Coordinator patterns.
- Visual Mission Control: Real-time observability dashboard.
🤝 Community
- Star this repo to help us build the future of AgentOps.
- Join the Discussion for patterns on Google Cloud.
- Contribute: Read our Contributing Guide.
Reference: Google Cloud Architecture Center - Agentic AI Overview
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file agentops_cockpit-0.3.0.tar.gz.
File metadata
- Download URL: agentops_cockpit-0.3.0.tar.gz
- Upload date:
- Size: 2.4 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.16 {"installer":{"name":"uv","version":"0.9.16","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f5917915b6c5056547905e97f023d62a10e491ccc6c6ca53baa3feb81d052a6b
|
|
| MD5 |
5361e69d48e0d41e9cb87a96f7c8c87c
|
|
| BLAKE2b-256 |
a473471ae696a5bca4f2d019131c7326e9ca58ccb9c59b62f6a8ac25012d32d0
|
File details
Details for the file agentops_cockpit-0.3.0-py3-none-any.whl.
File metadata
- Download URL: agentops_cockpit-0.3.0-py3-none-any.whl
- Upload date:
- Size: 40.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.16 {"installer":{"name":"uv","version":"0.9.16","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f2afdaf564bbf3dd78d0b3507e9bc596718e65c19457a6526a94562d1002e9d6
|
|
| MD5 |
080017158b0cf767965c724d1b1a42e6
|
|
| BLAKE2b-256 |
1e351e9def1ab0d17f2ba14ebe7eda170bb0e947702c1d293d548b0eca874977
|