Drop-in observability for Gemini agents: traces, cost, drift, egress allowlist. Ships exporters for Arize Phoenix, Splunk HEC, Elastic, GitLab Observability, MongoDB Atlas, Dynatrace, TrueFoundry.
Project description
GeminiLens
Drop-in observability for Gemini agents. Wrap any Vertex AI Gemini call and get traces, cost, latency, drift, and tool-call audit out of the box. Ships with a Streamlit dashboard.
Live demo: https://geminilens-1029931682737.us-central1.run.app Demo video: https://storage.googleapis.com/geminilens-demo-mukunda/geminilens-demo.mp4 (2:13, 3 MB) License: Apache 2.0
What it does
- Traces every Gemini call: prompt, response, token usage, latency.
- Per-call cost in USD for Gemini 2.5 Pro/Flash/Flash-Lite and 2.0 Flash, with cached-input pricing.
- Drift signals: rolling p95 latency, mean cost, and output-length vs a baseline window, so you notice when the agent gets slower, pricier, or more verbose without you changing anything.
- Egress allowlist: tool calls go through an
httpxclient that throws if the agent tries to reach a host outside your allowlist. Useful for research and data-extraction agents. - JSONL trace store plus an in-memory ring buffer, so traces are durable and the dashboard is fast.
Install
git clone https://github.com/MukundaKatta/geminilens
cd geminilens
python3 -m venv .venv && source .venv/bin/activate
pip install -e .
30-second demo (no GCP needed)
PYTHONPATH=src python examples/quickstart.py
This writes 120 synthetic traces to ~/.geminilens/traces.jsonl and prints
a drift report. Then start the dashboard:
PYTHONPATH=src streamlit run app/dashboard.py
Real Gemini call (Vertex AI)
export GOOGLE_CLOUD_PROJECT=your-project-id
gcloud auth application-default login
PYTHONPATH=src streamlit run app/dashboard.py
The dashboard's sidebar will call gemini-2.5-flash via Vertex AI and the
trace, cost, and latency will land in the table.
Use as a library
from geminilens import GeminiObserver, TraceStore, EgressGuard
from google import genai
observer = GeminiObserver(store=TraceStore("traces.jsonl"))
guard = EgressGuard(allow=["en.wikipedia.org", "arxiv.org"])
client = genai.Client(vertexai=True, project="my-proj", location="us-central1")
with observer.trace(model="gemini-2.5-pro", prompt="...", agent="researcher") as tr:
# Tool calls go through the guard; egress to other hosts will raise.
summary = observer.run_tool(
tr, "wiki", lambda t: guard.client().get(f"https://en.wikipedia.org/wiki/{t}").text,
"Vertex_AI",
)
response = client.models.generate_content(model="gemini-2.5-pro", contents="...")
observer.record_response(tr, response)
Tests
PYTHONPATH=src pytest -q
Azure OpenAI
For Azure agents, swap GeminiObserver for AzureObserver:
from geminilens.azure import AzureObserver, make_azure_client
observer = AzureObserver()
client = make_azure_client()
with observer.trace("gpt-4o-mini", prompt="...") as tr:
resp = client.chat.completions.create(
model="gpt-4o-mini",
messages=[{"role": "user", "content": "..."}],
)
observer.record_response(tr, resp)
Cost is computed from the published Azure OpenAI pricing for gpt-4.1 / gpt-4o / o3 / o4 families.
Export to Dynatrace
from geminilens.exporters import DynatraceExporter
exporter = DynatraceExporter() # reads DT_ENV_URL + DT_API_TOKEN
observer = GeminiObserver(on_trace=exporter.export_one)
Set the env vars, give the API token logs.ingest scope, and every trace
lands as a structured log event with gen_ai.usage.* attributes.
Deploy
See docs/deploy.md for a one-shot gcloud run deploy.
The included Dockerfile is Cloud Run compatible.
Repo layout
src/geminilens/ core library
observer.py trace context manager + tool-call recorder
cost.py Gemini USD cost calculator
drift.py rolling-window drift report
store.py JSONL + in-memory trace store
guard.py egress allowlist (httpx transport)
agent.py reference ResearchAgent using all of the above
azure.py Azure OpenAI adapter (same Trace shape, MS pricing)
exporters/
dynatrace.py ship traces to Dynatrace Log Ingestion API
jsonl_file.py append traces to a JSONL file
app/dashboard.py Streamlit dashboard
examples/ quickstart + Dynatrace export demo
docs/ deploy guide, demo script, per-hackathon submission copy
Dockerfile Cloud Run / Azure Container Apps image
tests/ pytest suite (19 tests)
Why this exists
Most LLM observability tools are either Python SDKs that lock you into one vendor's hosted backend or full APM platforms that ask for an agent. GeminiLens is the smallest piece in between: a single file you can drop into a Gemini project and get the four numbers that matter (cost, latency, drift, tool audit) without leaving your machine. The dashboard is local Streamlit, the data is JSONL, and the cost table lives in the repo so you can audit it.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file geminilens-0.2.0.tar.gz.
File metadata
- Download URL: geminilens-0.2.0.tar.gz
- Upload date:
- Size: 30.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e095be9df8f90fedb4324feabeef28e980cf18ea409dbadf0362a92ab1ec0e23
|
|
| MD5 |
24b52e937f2e3bb29343d45236442f93
|
|
| BLAKE2b-256 |
1fcc700f8fd935e348342780746f8d5a83b924ec1eda8b6fe275b22a0096689a
|
File details
Details for the file geminilens-0.2.0-py3-none-any.whl.
File metadata
- Download URL: geminilens-0.2.0-py3-none-any.whl
- Upload date:
- Size: 32.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9795dc35a064bac9b0522aa21bffa23a12902b2e08a23c46cd6e9c1a2b7fbc7a
|
|
| MD5 |
48732da32b2104810adeb787f068a07a
|
|
| BLAKE2b-256 |
f3f6a5df62a8b66b214946cfcdb049f0a49b70bbe43d359ce563f34c38c78705
|