Skip to main content

SDK for analyzing and evaluating agent traces stored in BigQuery.

Project description

BigQuery Agent Analytics SDK

License Python CI

An open-source Python SDK for analyzing, evaluating, and curating agent traces stored in BigQuery. Built on top of the BigQuery Agent Analytics, it provides a consumption-layer toolkit for agent observability, analysis, evaluation, and advanced capabilities like context graph at scale.

Overview

The BigQuery Agent Analytics SDK connects your AI agent telemetry in BigQuery to a rich set of evaluation, observability, and analytics capabilities. It is designed for ML engineers, data scientists, and platform teams who run agents in production and need to understand agent behavior, measure quality, and detect regressions — all through BigQuery SQL or Python.

Key Features

Observability

  • Trace reconstruction and DAG visualization
  • Per-event-type BigQuery views
  • Observability dashboards (SQL and BigFrames)

Evaluation

  • Code-based metrics (latency, turn count, error rate, token efficiency, cost)
  • LLM-as-Judge scoring (correctness, hallucination, sentiment)
  • Trajectory matching (exact, in-order, any-order)
  • Multi-trial evaluation with pass@k / pass^k
  • Grader composition (weighted, binary, majority strategies)
  • Eval suite lifecycle management with graduation and saturation detection
  • Static quality validation (ambiguous tasks, class imbalance, suspicious thresholds)

AI/ML Integration

  • BigQuery AI.GENERATE, AI.EMBED, AI.CLASSIFY
  • Anomaly detection and latency forecasting
  • Categorical (Hatteras-style) evaluation via BigFrames

Advanced Analytics

  • Context Graph — property graph linking traces to business entities with GQL traversal
  • YAML-driven ontology extraction and materialization
  • Long-horizon cross-session memory
  • Multi-stage agent insights pipeline
  • Drift detection for golden vs production question distributions

CLI (bq-agent-sdk)

  • 12+ commands for diagnostics, evaluation, and CI/CD integration

Deployment Surfaces

  • Remote Function (BigQuery SQL via Cloud Run)
  • Python UDF scoring kernels
  • Streaming evaluation (Cloud Scheduler + Cloud Run)
  • Continuous query templates

Usage Telemetry

  • Every job the SDK submits is labeled (sdk, sdk_version, sdk_surface, sdk_feature, and sdk_ai_function where relevant) so operators can attribute spend, latency, and adoption directly from INFORMATION_SCHEMA.JOBS_BY_PROJECT. No extra telemetry pipeline is required. See docs/sdk_usage_tracking.md for the label schema and ready-to-run tracking queries.

Prerequisites

Installation

pip install bigquery-agent-analytics

With optional LLM judge support:

pip install bigquery-agent-analytics[llm]

With BigFrames support:

pip install bigquery-agent-analytics[bigframes]

Quick Start

from bigquery_agent_analytics import Client

client = Client(project_id="my-project", dataset_id="analytics")
trace = client.get_trace("trace-abc-123")
trace.render()

See SDK.md for the full API walkthrough with code examples for every feature.

Documentation

Resource Description
SDK Feature Reference Complete API walkthrough with working code examples
Design Documents Architecture decisions and design rationale
Examples Notebooks, SQL scripts, and demos
Deployment Guides Four deployment surfaces for Google Cloud

Architecture

src/bigquery_agent_analytics/
│
├── Core
│   ├── client.py                  # High-level SDK client
│   ├── trace.py                   # Trace reconstruction & visualization
│   ├── views.py                   # Per-event-type BigQuery view management
│   ├── event_semantics.py         # Canonical event type helpers & predicates
│   ├── serialization.py           # Uniform serialization layer
│   └── formatter.py               # Output formatting (json/text/table)
│
├── Evaluation
│   ├── evaluators.py              # CodeEvaluator + LLMAsJudge
│   ├── trace_evaluator.py         # Trajectory matching & replay
│   ├── multi_trial.py             # Multi-trial runner + pass@k
│   ├── grader_pipeline.py         # Grader composition pipeline
│   ├── eval_suite.py              # Eval suite lifecycle management
│   └── eval_validator.py          # Static validation checks
│
├── AI/ML
│   ├── ai_ml_integration.py       # BigQuery AI/ML capabilities
│   ├── bigframes_evaluator.py     # BigFrames DataFrame evaluator
│   ├── categorical_evaluator.py   # Hatteras categorical evaluation
│   └── categorical_views.py       # Categorical metric views
│
├── Analytics
│   ├── insights.py                # Multi-stage insights pipeline
│   ├── feedback.py                # Drift detection & question distribution
│   ├── context_graph.py           # Property Graph: BizNode extraction, GQL
│   └── memory_service.py          # Long-horizon agent memory
│
├── Ontology
│   ├── ontology_models.py         # Pydantic models for ontology schema
│   ├── ontology_schema_compiler.py# YAML → compiled schema
│   ├── ontology_graph.py          # Ontology graph construction
│   ├── ontology_materializer.py   # Graph materialization to BigQuery
│   ├── ontology_property_graph.py # Property graph operations
│   └── ontology_orchestrator.py   # End-to-end ontology pipeline
│
└── CLI & Deploy
    ├── cli.py                     # CLI entry point (bq-agent-sdk)
    ├── udf_kernels.py             # Python UDF scoring kernels
    └── udf_sql_templates.py       # UDF SQL generation

Related Projects

Development

# Install with dev dependencies
pip install -e ".[dev]"

# Run tests
pytest tests/ -v

# Format code
pyink --config pyproject.toml src/ tests/
isort src/ tests/

Contributing

See CONTRIBUTING.md for guidelines.

License

Apache License 2.0 — see LICENSE for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bigquery_agent_analytics-0.2.0.tar.gz (1.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bigquery_agent_analytics-0.2.0-py3-none-any.whl (255.2 kB view details)

Uploaded Python 3

File details

Details for the file bigquery_agent_analytics-0.2.0.tar.gz.

File metadata

  • Download URL: bigquery_agent_analytics-0.2.0.tar.gz
  • Upload date:
  • Size: 1.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for bigquery_agent_analytics-0.2.0.tar.gz
Algorithm Hash digest
SHA256 c11e7d92fd87467512d80a05ecfc2cf2cd4542203d3a7fa17a9a39202ccab51b
MD5 bd4a2d1d2e464c33d470bb5e10be0bbd
BLAKE2b-256 5a41d5b938560f6f4dce644f1b430f24666e7757227b6d4e54a3920114db5dee

See more details on using hashes here.

Provenance

The following attestation bundles were made for bigquery_agent_analytics-0.2.0.tar.gz:

Publisher: release.yml on GoogleCloudPlatform/BigQuery-Agent-Analytics-SDK

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file bigquery_agent_analytics-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for bigquery_agent_analytics-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c0b2491d8047410980405e5c9a1f4ad821012b9266b9696c1a07015d3ca2f9be
MD5 87c08f77606a68e06c351f17743bd5bb
BLAKE2b-256 ff18e73fb89d9dafd9f595f1e6ac2ce89844892eb6dc4ea6732118683ad71b13

See more details on using hashes here.

Provenance

The following attestation bundles were made for bigquery_agent_analytics-0.2.0-py3-none-any.whl:

Publisher: release.yml on GoogleCloudPlatform/BigQuery-Agent-Analytics-SDK

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page