Grafana Sigil Python SDK

sigil-sdk records normalized LLM generation and tool-execution telemetry. It exports generation records to Sigil ingest and uses your application's OpenTelemetry tracer/meter setup for traces and metrics.

Use this package when you want:

  • A provider-agnostic generation record (same schema for OpenAI, Anthropic, Gemini, or custom adapters).
  • OTel-aligned tracing attributes for generation and tool spans.
  • Async export with retry/backoff, queueing, batching, and explicit shutdown semantics.
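The retry/backoff behavior mentioned above can be illustrated with a minimal exponential-backoff schedule. This is a standalone sketch of the general technique, not the SDK's actual implementation; `base`, `factor`, and `max_delay` are hypothetical parameters.

```python
import random


def backoff_delays(base: float = 0.5, factor: float = 2.0,
                   max_delay: float = 30.0, attempts: int = 5) -> list:
    """Exponential backoff with full jitter: each retry waits a random
    amount between 0 and min(max_delay, base * factor**attempt) seconds."""
    delays = []
    for attempt in range(attempts):
        cap = min(max_delay, base * (factor ** attempt))
        delays.append(random.uniform(0.0, cap))
    return delays
```

An exporter following this pattern would sleep for delays[n] before retry n and give up (surfacing the error locally) after the final attempt.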

Installation

pip install sigil-sdk

Validation

Run the shared core conformance suite for the Python SDK from the repo root:

mise run test:py:sdk-conformance

Run the cross-language aggregate core conformance suite from the repo root:

mise run sdk:conformance

Optional provider helper packages:

pip install sigil-sdk-openai
pip install sigil-sdk-anthropic
pip install sigil-sdk-gemini

Optional framework modules:

pip install sigil-sdk-langchain
pip install sigil-sdk-langgraph
pip install sigil-sdk-openai-agents
pip install sigil-sdk-llamaindex
pip install sigil-sdk-google-adk

Framework handler usage:

from sigil_sdk import Client
from sigil_sdk_langchain import with_sigil_langchain_callbacks
from sigil_sdk_langgraph import with_sigil_langgraph_callbacks
from sigil_sdk_openai_agents import with_sigil_openai_agents_hooks
from sigil_sdk_llamaindex import with_sigil_llamaindex_callbacks
from sigil_sdk_google_adk import with_sigil_google_adk_callbacks

client = Client()
chain_config = with_sigil_langchain_callbacks(None, client=client, provider_resolver="auto")
graph_config = with_sigil_langgraph_callbacks(None, client=client, provider_resolver="auto")
openai_agents_run_options = with_sigil_openai_agents_hooks(None, client=client, provider_resolver="auto")
llamaindex_config = with_sigil_llamaindex_callbacks(None, client=client, provider_resolver="auto")
google_adk_agent_config = with_sigil_google_adk_callbacks(None, client=client, provider_resolver="auto")

Framework handlers attach the following framework tags and metadata to recorded generations:

  • sigil.framework.name (langchain, langgraph, openai-agents, llamaindex, or google-adk)
  • sigil.framework.source=handler
  • sigil.framework.language=python
  • metadata["sigil.framework.run_id"]
  • metadata["sigil.framework.thread_id"] (when present)
  • metadata["sigil.framework.parent_run_id"] (when available)
  • metadata["sigil.framework.component_name"]
  • metadata["sigil.framework.run_type"]
  • metadata["sigil.framework.tags"]
  • metadata["sigil.framework.retry_attempt"] (when available)
  • metadata["sigil.framework.event_id"] (when available)
  • metadata["sigil.framework.langgraph.node"] (LangGraph when available)

Conversation mapping is conversation-first; identifiers are resolved in this order:

  • conversation_id / session_id / group_id from framework context first
  • then thread_id
  • deterministic fallback sigil:framework:<framework_name>:<run_id>

When present in generation metadata, low-cardinality framework keys are copied onto generation span attributes.
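The resolution order above can be sketched as a small pure function. This is illustrative only; the real handlers read these values from framework context, and `resolve_conversation_id` is a hypothetical name.

```python
from typing import Optional


def resolve_conversation_id(framework_name: str, run_id: str,
                            conversation_id: Optional[str] = None,
                            thread_id: Optional[str] = None) -> str:
    # 1. Explicit conversation/session/group id from framework context.
    if conversation_id:
        return conversation_id
    # 2. Fall back to the thread id when present.
    if thread_id:
        return thread_id
    # 3. Deterministic fallback in the documented format.
    return f"sigil:framework:{framework_name}:{run_id}"
```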

For LangGraph persistence, pass configurable.thread_id and reuse it across invocations:

thread_config = {
    **with_sigil_langgraph_callbacks(None, client=client, provider_resolver="auto"),
    "configurable": {"thread_id": "customer-42"},
}
graph.invoke({"prompt": "Remember my timezone is UTC+1.", "answer": ""}, config=thread_config)
graph.invoke({"prompt": "What timezone did I give you?", "answer": ""}, config=thread_config)

Full framework examples:

  • LangChain: ../python-frameworks/langchain/README.md
  • LangGraph: ../python-frameworks/langgraph/README.md
  • OpenAI Agents: ../python-frameworks/openai-agents/README.md
  • LlamaIndex: ../python-frameworks/llamaindex/README.md
  • Google ADK: ../python-frameworks/google-adk/README.md

Quick Start (Sync Generation)

from sigil_sdk import (
    Client,
    ClientConfig,
    GenerationStart,
    ModelRef,
    assistant_text_message,
    user_text_message,
)

client = Client(
    ClientConfig(
        generation_export_endpoint="http://localhost:8080/api/v1/generations:export",
    )
)

with client.start_generation(
    GenerationStart(
        conversation_id="conv-1",
        agent_name="my-service",
        agent_version="1.0.0",
        model=ModelRef(provider="openai", name="gpt-5"),
    )
) as rec:
    rec.set_result(
        input=[user_text_message("What is the weather in Paris?")],
        output=[assistant_text_message("It is 18C and sunny.")],
    )

    # Recorder errors are local SDK errors (validation/enqueue/shutdown),
    # not provider call failures.
    if rec.err() is not None:
        raise rec.err()

client.shutdown()

Configure OTel trace and metric exporters in your application's OpenTelemetry SDK setup. You can optionally pass a tracer and meter via ClientConfig.

Quick OTel setup pattern before creating the Sigil client:

from opentelemetry import metrics, trace
from opentelemetry.sdk.metrics import MeterProvider
from opentelemetry.sdk.trace import TracerProvider

trace.set_tracer_provider(TracerProvider())
metrics.set_meter_provider(MeterProvider())

Streaming Generation

Use start_streaming_generation(...) when the upstream provider call is streaming.

from sigil_sdk import GenerationStart, ModelRef

with client.start_streaming_generation(
    GenerationStart(
        conversation_id="conv-stream",
        model=ModelRef(provider="anthropic", name="claude-sonnet-4-5"),
    )
) as rec:
    rec.set_result(output=[assistant_text_message("partial stream summary")])

Embedding Observability

Use start_embedding(...) for embedding API calls. Embedding recording emits OTel spans and SDK metrics only, and does not enqueue generation exports.

import openai  # assumes the openai package is installed and configured

from sigil_sdk import EmbeddingResult, EmbeddingStart, ModelRef

with client.start_embedding(
    EmbeddingStart(
        agent_name="retrieval-worker",
        agent_version="1.0.0",
        model=ModelRef(provider="openai", name="text-embedding-3-small"),
    )
) as rec:
    response = openai.embeddings.create(model="text-embedding-3-small", input=["hello", "world"])
    rec.set_result(
        EmbeddingResult(
            input_count=2,
            input_tokens=response.usage.prompt_tokens,
            input_texts=["hello", "world"],  # captured only when embedding_capture.capture_input=True
            response_model=response.model,
        )
    )

Input text capture is opt-in:

from sigil_sdk import ClientConfig, EmbeddingCaptureConfig

cfg = ClientConfig(
    embedding_capture=EmbeddingCaptureConfig(
        capture_input=True,
        max_input_items=20,
        max_text_length=1024,
    )
)

capture_input may expose PII/document content in spans. Keep it disabled by default and enable only for scoped debugging.

TraceQL examples:

  • traces{gen_ai.operation.name="embeddings"}
  • traces{gen_ai.operation.name="embeddings" && gen_ai.request.model="text-embedding-3-small"}
  • traces{gen_ai.operation.name="embeddings" && error.type!=""}

Tool Execution Span Recording

Tool spans are recorded independently of generation export.

from sigil_sdk import ToolExecutionStart

with client.start_tool_execution(
    ToolExecutionStart(
        tool_name="weather",
        tool_call_id="call_weather_1",
        tool_type="function",
        include_content=True,
    )
) as rec:
    rec.set_result(arguments={"city": "Paris"}, result={"temp_c": 18})

SDK identity attributes

  • Generation and tool spans always include:
    • sigil.sdk.name=sdk-python
  • Normalized generation metadata always includes the same key.
  • If caller metadata provides a conflicting value for this key, the SDK overwrites it.
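The overwrite rule can be illustrated with a trivial merge. This is a sketch; `finalize_metadata` is a hypothetical name, not an SDK function.

```python
SDK_NAME_KEY = "sigil.sdk.name"


def finalize_metadata(caller_metadata: dict) -> dict:
    """Copy caller metadata, then force the SDK identity value.

    A conflicting caller-supplied value for sigil.sdk.name is overwritten.
    """
    merged = dict(caller_metadata)
    merged[SDK_NAME_KEY] = "sdk-python"
    return merged
```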

Context Defaults

Use context helpers to set defaults once per request/task boundary.

from sigil_sdk import with_agent_name, with_agent_version, with_conversation_id

with with_conversation_id("conv-ctx"), with_agent_name("planner"), with_agent_version("2026.02"):
    with client.start_generation(
        GenerationStart(model=ModelRef(provider="gemini", name="gemini-2.5-pro"))
    ) as rec:
        rec.set_result(output=[assistant_text_message("ok")])

Content Capture Mode

ContentCaptureMode controls what content is included in exported generation payloads and OTel span attributes. Use it to prevent sensitive text (prompts, tool I/O, model responses) from leaving the process.

  • FULL: all content exported; tool arguments and results included in span attributes.
  • NO_TOOL_CONTENT (SDK default): all content exported; tool arguments and results excluded from span attributes.
  • METADATA_ONLY: structure preserved and all text stripped from exports; tool arguments and results excluded from span attributes.

The default is NO_TOOL_CONTENT, which matches the SDK's behavior before this feature was added.

Client-level default

from sigil_sdk import Client, ClientConfig, ContentCaptureMode

client = Client(ClientConfig(
    content_capture=ContentCaptureMode.METADATA_ONLY,
))

Per-generation override

from sigil_sdk import ContentCaptureMode, GenerationStart, ModelRef

with client.start_generation(
    GenerationStart(
        model=ModelRef(provider="openai", name="gpt-5"),
        content_capture=ContentCaptureMode.FULL,
    )
) as rec:
    rec.set_result(
        input=[user_text_message("What is the weather?")],
        output=[assistant_text_message("18C and sunny.")],
    )

Context propagation

Child tool executions inherit the active capture mode from the parent generation via ContextVar. You can also set it explicitly for a block:

from sigil_sdk import ContentCaptureMode, with_content_capture_mode

with with_content_capture_mode(ContentCaptureMode.METADATA_ONLY):
    with client.start_tool_execution(
        ToolExecutionStart(tool_name="search")
    ) as rec:
        rec.set_result(arguments={"q": "weather"}, result={"temp_c": 18})

Dynamic resolution via resolver

content_capture_resolver is a callback on ClientConfig that resolves the capture mode per recording at runtime. It is useful for feature flags, per-tenant policies, or other context-dependent decisions:

from sigil_sdk import Client, ClientConfig, ContentCaptureMode

def resolve_capture(metadata: dict) -> ContentCaptureMode:
    if metadata.get("sigil.tenant") == "healthcare":
        return ContentCaptureMode.METADATA_ONLY
    return ContentCaptureMode.DEFAULT  # fall through to client default

client = Client(ClientConfig(
    content_capture_resolver=resolve_capture,
))

Resolution precedence (highest to lowest)

  1. Per-recording content_capture field (GenerationStart / ToolExecutionStart)
  2. content_capture_resolver return value
  3. ContextVar from with_content_capture_mode()
  4. ClientConfig.content_capture (defaults to NO_TOOL_CONTENT)

Exceptions in the resolver are caught and treated as METADATA_ONLY (fail-closed).
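The precedence rules above can be sketched as a pure function. This is illustrative: the enum stands in for the SDK's ContentCaptureMode, and returning None from the resolver here plays the role of ContentCaptureMode.DEFAULT falling through.

```python
from enum import Enum
from typing import Callable, Optional


class CaptureMode(Enum):  # stand-in for sigil_sdk.ContentCaptureMode
    FULL = "full"
    NO_TOOL_CONTENT = "no_tool_content"
    METADATA_ONLY = "metadata_only"


def resolve_capture_mode(
    per_recording: Optional[CaptureMode],
    resolver: Optional[Callable[[dict], Optional[CaptureMode]]],
    metadata: dict,
    context_mode: Optional[CaptureMode],
    client_default: CaptureMode = CaptureMode.NO_TOOL_CONTENT,
) -> CaptureMode:
    # 1. Per-recording field wins outright.
    if per_recording is not None:
        return per_recording
    # 2. Resolver result; exceptions fail closed to METADATA_ONLY.
    if resolver is not None:
        try:
            resolved = resolver(metadata)
        except Exception:
            return CaptureMode.METADATA_ONLY
        if resolved is not None:
            return resolved
    # 3. ContextVar value set via with_content_capture_mode().
    if context_mode is not None:
        return context_mode
    # 4. Client-level default.
    return client_default
```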

Export Configuration

HTTP generation export

from sigil_sdk import ApiConfig, AuthConfig, ClientConfig, GenerationExportConfig

cfg = ClientConfig(
    generation_export=GenerationExportConfig(
        protocol="http",
        endpoint="http://localhost:8080/api/v1/generations:export",
        auth=AuthConfig(mode="tenant", tenant_id="dev-tenant"),
    ),
    api=ApiConfig(endpoint="http://localhost:8080"),
)

gRPC generation export

cfg = ClientConfig(
    generation_export=GenerationExportConfig(
        protocol="grpc",
        endpoint="localhost:50051",
        insecure=True,
        auth=AuthConfig(mode="tenant", tenant_id="dev-tenant"),
    ),
    api=ApiConfig(endpoint="http://localhost:8080"),
)

Generation export auth modes

Auth for generation export is resolved from generation_export.auth.

  • mode="none"
  • mode="tenant" (requires tenant_id, injects X-Scope-OrgID)
  • mode="bearer" (requires bearer_token, injects Authorization: Bearer <token>)
  • mode="basic" (requires basic_password plus either basic_user or tenant_id; injects Authorization: Basic <base64(user:password)>. When tenant_id is set it also injects X-Scope-OrgID, which is for self-hosted multi-tenancy only and not needed for Grafana Cloud)

Invalid mode/field combinations fail fast in resolve_config(...).

If explicit headers already include Authorization or X-Scope-OrgID, the explicit headers win. Example tenant-mode configuration:

from sigil_sdk import ApiConfig, AuthConfig, ClientConfig, GenerationExportConfig

cfg = ClientConfig(
    generation_export=GenerationExportConfig(
        protocol="http",
        endpoint="http://localhost:8080/api/v1/generations:export",
        auth=AuthConfig(mode="tenant", tenant_id="prod-tenant"),
    ),
    api=ApiConfig(endpoint="http://localhost:8080"),
)
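The Basic header described above is standard HTTP basic auth. A minimal sketch of its construction (illustrative only, not SDK code; `basic_auth_headers` is a hypothetical helper):

```python
import base64


def basic_auth_headers(user: str, password: str, tenant_id: str = "") -> dict:
    """Build Authorization: Basic <base64(user:password)>, plus
    X-Scope-OrgID when a tenant id is supplied (self-hosted only)."""
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    headers = {"Authorization": f"Basic {token}"}
    if tenant_id:
        headers["X-Scope-OrgID"] = tenant_id
    return headers
```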

Grafana Cloud auth (basic)

For Grafana Cloud, use basic auth mode. The username is your Grafana Cloud instance/tenant ID and the password is your Grafana Cloud API key:

import os
from sigil_sdk import AuthConfig, ClientConfig, GenerationExportConfig

cfg = ClientConfig(
    generation_export=GenerationExportConfig(
        protocol="http",
        endpoint="https://<your-stack>.grafana.net/api/v1/generations:export",
        auth=AuthConfig(
            mode="basic",
            tenant_id=os.environ["GRAFANA_CLOUD_INSTANCE_ID"],
            basic_password=os.environ["GRAFANA_CLOUD_API_KEY"],
        ),
    ),
)

If your deployment requires a distinct username, set basic_user explicitly:

auth=AuthConfig(
    mode="basic",
    tenant_id=os.environ["GRAFANA_CLOUD_INSTANCE_ID"],
    basic_user=os.environ["GRAFANA_CLOUD_INSTANCE_ID"],
    basic_password=os.environ["GRAFANA_CLOUD_API_KEY"],
)

Env-secret wiring example

The SDK does not auto-load env vars. Resolve env values in your application and pass them into config explicitly.

import os
from sigil_sdk import AuthConfig, ClientConfig

cfg = ClientConfig()

gen_token = (os.getenv("SIGIL_GEN_BEARER_TOKEN") or "").strip()
if gen_token:
    cfg.generation_export.auth = AuthConfig(mode="bearer", bearer_token=gen_token)

Common topology:

  • Grafana Cloud: generation basic mode with instance ID and API key.
  • Self-hosted direct to Sigil: generation tenant mode.
  • Traces/metrics via the OpenTelemetry Collector or Alloy: configure exporters in your application's OTel SDK setup.
  • Enterprise proxy: generation bearer mode to proxy; proxy authenticates and forwards tenant header upstream.

Conversation Ratings

Use the SDK helper to submit user-facing ratings:

from sigil_sdk import ConversationRatingInput, ConversationRatingValue

result = client.submit_conversation_rating(
    "conv-123",
    ConversationRatingInput(
        rating_id="rat-123",
        rating=ConversationRatingValue.BAD,
        comment="Answer ignored user context",
        metadata={"channel": "assistant-ui"},
        source="sdk-python",
    ),
)

print(result.rating.rating, result.summary.has_bad_rating)

submit_conversation_rating(...) sends requests to ClientConfig.api.endpoint (default http://localhost:8080) and uses the same generation-export auth headers (tenant or bearer) already configured on the SDK client.

Instrumentation-only mode (no generation send)

Set generation_export.protocol="none" to keep generation/tool instrumentation and spans while disabling generation transport.

from sigil_sdk import Client, ClientConfig, GenerationExportConfig

cfg = ClientConfig(
    generation_export=GenerationExportConfig(
        protocol="none",
    ),
)

client = Client(cfg)

Lifecycle and Error Semantics

  • flush() forces immediate export of queued generations.
  • shutdown() flushes pending generations, then closes generation exporters.
  • Always call shutdown() during process teardown to avoid dropped telemetry.
  • recorder.set_call_error(exc) marks provider-call failures on the generation payload and span status.
  • recorder.err() is for local SDK runtime errors only (validation, queue full, payload too large, shutdown).

SDK metrics

The SDK emits these OTel histograms through your configured meter provider:

  • gen_ai.client.operation.duration
  • gen_ai.client.token.usage
  • gen_ai.client.time_to_first_token
  • gen_ai.client.tool_calls_per_operation

Public API Overview

Core client and lifecycle:

  • Client
  • Client.start_generation(...)
  • Client.start_streaming_generation(...)
  • Client.start_tool_execution(...)
  • Client.flush()
  • Client.shutdown()

Typed payloads:

  • GenerationStart, Generation, ModelRef
  • Message, Part, ToolDefinition, TokenUsage
  • ToolExecutionStart, ToolExecutionEnd
  • ContentCaptureMode

Helpers:

  • user_text_message(...), assistant_text_message(...)
  • with_conversation_id(...), with_agent_name(...), with_agent_version(...)
  • with_content_capture_mode(...)

Validation:

  • validate_generation(...)

Provider Helper Packages

Provider wrappers are wrapper-first and mapper-explicit:

  • sigil-sdk-openai
  • sigil-sdk-anthropic
  • sigil-sdk-gemini

Each package exposes sync + async wrappers and explicit mapper functions for custom integration points.

Regenerating gRPC Stubs

Install dev dependencies once:

python3 -m pip install -e 'sdks/python[dev]'

Then regenerate:

./sdks/python/scripts/generate_proto.sh

This regenerates sigil_sdk/internal/gen/sigil/v1/*_pb2*.py from sigil/proto/sigil/v1/generation_ingest.proto.
