Skip to main content

LLM usage telemetry SDK for Cloptima reporting, attribution, and analytics

Project description

Cloptima LLM Observability Python SDK

Capture LLM usage telemetry from your application and send it to Cloptima for cost reporting, attribution, and usage analytics.

This SDK is designed for teams that want observability without replacing their existing provider clients, wrappers, retries, auth, or application security controls.

Install

pip install cloptima-llm-observability

If you want the httpx transport helpers:

pip install "cloptima-llm-observability[httpx]"

Quick start

Required configuration:

  • CLOPTIMA_LLM_OBSERVABILITY_API_KEY
  • CLOPTIMA_LLM_OBSERVABILITY_APP_ID

Recommended while testing:

  • CLOPTIMA_LLM_OBSERVABILITY_ENVIRONMENT=dev
from cloptima_llm_observability import extract_openai_usage, init_from_env

cloptima = init_from_env()

result = cloptima.observe_call(
    provider="openai",
    model="gpt-4.1-mini",
    call=lambda: summary_service.generate(prompt),
    extract_usage=extract_openai_usage,
    feature_id="summary_generation",
    workflow_id="support_agent",
    fire_and_forget=False,
)

By default, the SDK sends bearer-authenticated HTTPS requests to Cloptima at https://api.cloptima.ai/v1/ai/integrations/sdk/events.

If the required configuration is missing, init_from_env() returns a disabled pass-through client so local development and tests do not break.

Choose your integration path

Call-site or wrapper boundary

This is the default path for most teams.

Use it when you already know the provider, model, and business context at the point where your code calls an LLM or an existing AI wrapper.

  • observe_call(...) for direct integration
  • create_observed_call(...) for reusable wrappers
  • wrap_observed_service(...) to instrument customer-owned service classes
from cloptima_llm_observability import (
    extract_openai_usage,
    init_from_env,
    wrap_observed_service,
)


class SummaryService:
    def generate_summary(self, prompt: str):
        return openai.responses.create(model="gpt-4.1-mini", input=prompt)


cloptima = init_from_env()
summary_service = wrap_observed_service(
    cloptima,
    SummaryService(),
    {
        "generate_summary": {
            "kind": "call",
            "options": {
                "provider": "openai",
                "model": "gpt-4.1-mini",
                "extract_usage": extract_openai_usage,
                "fire_and_forget": False,
            },
            "resolve_overrides": lambda prompt: {
                "attribution": {
                    "feature_id": "summary_generation",
                },
            },
        }
    },
)

Context-first attribution

Use context helpers when you want workflow or feature attribution to apply across nested calls without threading more parameters through your own service signatures.

  • with_attribution(...)
  • run_with_attribution(...)
  • with_workflow(...)
  • with_task(...)
  • @workflow(...)
  • @task(...)
from cloptima_llm_observability import with_task, with_workflow

with with_workflow("support_agent", tenant_id="acme-prod"):
    with with_task("draft_reply", team_id="customer-support"):
        summary_service.generate_summary(prompt)

Per-call attribution still works and overrides context when needed.

Shared transport integration

If your application centralizes outbound LLM calls behind httpx, instrument that shared boundary:

import httpx

from cloptima_llm_observability import init_from_env, instrument_httpx_transport

cloptima = init_from_env()
transport = instrument_httpx_transport(
    httpx.HTTPTransport(),
    cloptima=cloptima,
    provider="openai",
    model="gpt-4o-mini",
    fire_and_forget=False,
)
client = httpx.Client(transport=transport)

This gives broad coverage, but it has less business context than call-site or wrapper-boundary integration.

OTLP delivery to Cloptima

Use otlp_http when your enterprise prefers OpenTelemetry-compatible payloads but still wants to send that telemetry to Cloptima.

  • cloptima_http is the default delivery mode
  • otlp_http sends OpenTelemetry-compatible payloads to Cloptima's OTLP receiver
CLOPTIMA_LLM_OBSERVABILITY_DELIVERY_MODE=otlp_http
CLOPTIMA_LLM_OBSERVABILITY_OTLP_SERVICE_NAME=agent-api
CLOPTIMA_LLM_OBSERVABILITY_OTLP_SERVICE_VERSION=2026.06.14

If you already operate an OTEL collector and emit GenAI spans, you can also send OTLP data to Cloptima without using this SDK. Use the SDK OTLP mode when you want application-managed instrumentation that still fits an OTLP-shaped delivery contract.

Built-in extractors and compatibility

Built-in usage extractors cover:

  • OpenAI
  • Azure OpenAI
  • Anthropic
  • Gemini
  • Vertex AI
  • Bedrock

If a provider reports image, audio, or video token usage, the built-in extractors capture those units in fields such as input_image, output_image, input_audio, and output_video. When Cloptima has pricing for that model, those units can be included in cost reporting.

If a provider returns a direct charge, pass or preserve it as vendor_reported_cost_usd.

The SDK does not invent media charges for providers that bill by image count, video duration, resolution, or other non-token measures when the provider response does not expose enough pricing data. In those cases, either:

  • preserve the provider-reported cost when available
  • map the provider's usage fields into extra_usage_units
  • or add your own custom extractor until the provider exposes a stable shape

If a provider response shape drifts, you do not need to replace the whole extractor path. Compose or patch it instead:

  • try_extract_usage(...)
  • compose_usage_extractors(...)
  • with_usage_overrides(...)
  • create_mapped_usage_extractor(...)
  • list_supported_providers()

Example:

from cloptima_llm_observability import create_mapped_usage_extractor, init_from_env

cloptima = init_from_env()

extract_usage = create_mapped_usage_extractor(
    defaults={
        "provider": "gemini",
    },
    fields={
        "model": "modelVersion",
        "provider_request_id": "responseId",
        "vendor_reported_cost_usd": "billing.costUsd",
    },
    number_fields={
        "input_tokens": "usage.promptTokenCount",
        "output_tokens": "usage.responseTokenCount",
        "total_tokens": "usage.totalTokenCount",
    },
    extra_usage_units={
        "output_image": "usage.outputImageTokenCount",
    },
)

Attribution fields

Common ownership and reporting fields:

  • app_id
  • environment
  • team_id
  • feature_id
  • workflow_id
  • cost_center
  • business_unit
  • product
  • tenant_id
  • end_customer_id
  • customer_segment
  • release

Set defaults once in default_attribution, set them in context, or override them per call.

Metadata and privacy

Use metadata_policy to control how custom metadata is retained:

  • metadata_only
  • allowlisted_metadata
  • strict_finops
  • debug_observability

Sensitive-looking keys such as prompts, messages, credentials, and secrets are treated conservatively by default.

Validation and local previews

Use these helpers in local tests, CI, or rollout checks:

  • preview_event_payload(...)
  • preview_batch_payload(...)
  • preview_otlp_request(...)
  • validate_payload(...)

They build or validate payloads in memory and do not send network traffic.

Examples

Public examples live in examples/:

  • basic.py: direct call-site integration
  • custom_wrapper.py: existing service wrapper integration
  • workflow_context.py: context-first attribution without signature bloat
  • httpx_transport.py: shared httpx integration
  • multimodal_tokens.py: token-based multimodal usage extraction for image, audio, and video inputs and outputs
  • mapped_extractor.py: adapt a provider or internal wrapper response without rewriting your integration
  • otlp_basic.py: OTLP-compatible delivery to Cloptima
  • openai_basic.py, anthropic_basic.py, gemini_basic.py: provider-specific extractor examples

Troubleshooting

No telemetry arrives:

  • verify the API key is valid for Cloptima telemetry ingestion
  • check client.is_enabled()
  • inspect a sample event with validate_payload(preview_event_payload(...))

Unexpected provider response shape:

  • start with the closest built-in extractor
  • patch field differences with with_usage_overrides(...) or create_mapped_usage_extractor(...)
  • compare against list_supported_providers() if you need a supported-provider snapshot

Support

  • Issues: https://github.com/cloptima/cloptima-llm-observability-python/issues
  • Security: see SECURITY.md
  • Product support: hello@cloptima.ai

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cloptima_llm_observability-0.2.0.tar.gz (51.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cloptima_llm_observability-0.2.0-py3-none-any.whl (31.0 kB view details)

Uploaded Python 3

File details

Details for the file cloptima_llm_observability-0.2.0.tar.gz.

File metadata

File hashes

Hashes for cloptima_llm_observability-0.2.0.tar.gz
Algorithm Hash digest
SHA256 1c669ae4aea517d8f065c1741484a20c167e1689fb849e7e323fc80d376d3b56
MD5 17778c00ead443f304ae3e9c4844aaf4
BLAKE2b-256 28a80f5dda79492469a7ef24a2099a5633ab6740e607addfee913eaba92b0096

See more details on using hashes here.

Provenance

The following attestation bundles were made for cloptima_llm_observability-0.2.0.tar.gz:

Publisher: release.yml on cloptima/cloptima-llm-observability-python

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file cloptima_llm_observability-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for cloptima_llm_observability-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a401573618f705c4ab3b6a017a249895bcbadec4809bba31d47f219ec744a92c
MD5 c96f693158b0b07466a4e6ddd5014296
BLAKE2b-256 8987dd897b4b45c41ee4a93e052b1f47982c195e9ff63d582a5a05db22690ae9

See more details on using hashes here.

Provenance

The following attestation bundles were made for cloptima_llm_observability-0.2.0-py3-none-any.whl:

Publisher: release.yml on cloptima/cloptima-llm-observability-python

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page