Skip to main content

Core SDK for genblaze media generation orchestration

Project description

genblaze-core

Python SDK for building generative AI pipelines across video, image, and audio — with built-in SHA-256 provenance.

genblaze-core is the core of genblaze, an open-source orchestration framework by Backblaze for composing multi-step AI media generation workflows. It gives you a single, provider-agnostic Pipeline API for text-to-video, text-to-image, text-to-speech, image-to-video, and audio generation — so you can swap models (Sora, Veo, Runway, Luma, Flux, DALL·E, ElevenLabs, Stable Audio, LMNT, GMICloud) without rewriting pipeline logic.

Every pipeline run emits a canonical, hash-verified provenance manifest — a tamper-evident JSON document capturing the provider, model, prompt, parameters, timestamps, and SHA-256 hash of every generated asset. Manifests can be embedded directly into PNG, JPEG, WebP, MP4, MP3, and WAV files, uploaded alongside assets to S3-compatible storage, or exported to Parquet for analytics.

Why genblaze-core

  • One API for every generative AI provider — Pipelines, not per-vendor SDK glue. Fluent, composable, chainable.
  • Built-in provenance — Every asset gets a SHA-256–verified manifest. Prove how media was made; detect tampering.
  • Production-ready — Retries, timeouts, progress streaming, moderation hooks, OpenTelemetry tracing, step caching.
  • Storage-agnostic sinks — Drop into Backblaze B2, AWS S3, Cloudflare R2, MinIO, Parquet, or local disk.
  • Policy + privacy controls — Redact prompts, strip params, pointer-mode for sensitive content.
  • Agent loops + templates — Evaluator-driven iteration, reusable pipeline and step templates.
  • Zero lock-in — MIT licensed, typed, lazy imports, provider adapters are separate packages.

Features

Capability What you get
Pipeline API Fluent multi-step generation, fan-in (input_from), AV compositing via FFmpeg
Provider discovery Entry-point–based registry — pip install genblaze-<provider> and it's available
Manifest (Pydantic) Run, Step, Asset models with canonical JSON hashing and .verify()
Media embedding PngHandler, Mp4Handler, Mp3Handler, etc. — embed + extract manifests in-file
Storage sink ObjectStorageSink with hierarchical or content-addressable key layout
Parquet sink Partitioned run/step/asset tables for downstream analytics
Observability OTelTracer, LoggingTracer, CompositeTracer, structured events
Agents AgentLoop with pluggable Evaluator for iterative refinement
Moderation Pre/post moderation hooks, configurable embed policies
Testing MockProvider, MockVideoProvider, MockAudioProvider for offline tests

Install

pip install genblaze-core

Optional extras:

pip install "genblaze-core[parquet]"   # ParquetSink for analytics
pip install "genblaze-core[audio]"     # Audio metadata embedding (mutagen)

Add provider adapters separately:

pip install genblaze-openai genblaze-google genblaze-runway genblaze-luma \
            genblaze-decart genblaze-replicate genblaze-elevenlabs \
            genblaze-stability-audio genblaze-lmnt genblaze-gmicloud

pip install genblaze-s3    # Storage backend for Backblaze B2 / AWS S3 / R2 / MinIO
pip install genblaze-cli   # Extract / verify / replay / index manifests

Quickstart — local, zero API keys

from genblaze_core import Modality, Pipeline
from genblaze_core.testing import MockVideoProvider

run, manifest = (
    Pipeline("hello-genblaze")
    .step(MockVideoProvider(), model="mock-v1",
          prompt="A drone shot over a coastal city at golden hour",
          modality=Modality.VIDEO)
    .run()
)

print(manifest.canonical_hash)   # deterministic SHA-256 of the run
print(manifest.verify())         # True

Quickstart — Sora + Backblaze B2 storage

Generate a video, upload it + its manifest to Backblaze B2, verify the hash:

pip install genblaze-core genblaze-openai genblaze-s3
export OPENAI_API_KEY="sk-..."
export B2_KEY_ID="..."
export B2_APP_KEY="..."
from genblaze_core import KeyStrategy, Modality, ObjectStorageSink, Pipeline
from genblaze_openai import SoraProvider
from genblaze_s3 import S3StorageBackend

storage = ObjectStorageSink(
    S3StorageBackend.for_backblaze("my-bucket"),
    key_strategy=KeyStrategy.HIERARCHICAL,
)

result = (
    Pipeline("hero-reel")
    .step(SoraProvider(), model="sora-2",
          prompt="Aerial flyover of a mountain lake at sunrise",
          modality=Modality.VIDEO, seconds=4, size="1280x720")
    .run(sink=storage, timeout=300)
)

print(result.run.steps[0].assets[0].url)   # durable B2 URL
print(result.manifest.canonical_hash)      # SHA-256 of the full run
assert result.manifest.verify()

Storage — Backblaze B2 recommended

Backblaze B2 is the recommended default sink for genblaze — purpose-built for large AI-generated media with S3-compatible APIs, resilient multipart uploads, Object Lock for immutable manifests, and strong cost economics at scale. One-liner credentials from B2_KEY_ID / B2_APP_KEY. See the genblaze-s3 backend for the full recipe plus AWS S3, Cloudflare R2, and MinIO variants.

Documentation

Related packages

Provider adapters: genblaze-openai · genblaze-google · genblaze-runway · genblaze-luma · genblaze-decart · genblaze-replicate · genblaze-elevenlabs · genblaze-stability-audio · genblaze-lmnt · genblaze-gmicloud

Storage + tooling: genblaze-s3 · genblaze-cli · genblaze-langsmith

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

genblaze_core-0.3.0.tar.gz (325.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

genblaze_core-0.3.0-py3-none-any.whl (232.8 kB view details)

Uploaded Python 3

File details

Details for the file genblaze_core-0.3.0.tar.gz.

File metadata

  • Download URL: genblaze_core-0.3.0.tar.gz
  • Upload date:
  • Size: 325.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for genblaze_core-0.3.0.tar.gz
Algorithm Hash digest
SHA256 920400f77b3cc15e4d1f5839ba35b0d56223f2c56bb3d92fbc6182a89a96d462
MD5 8c275619ca654ae70f167ee21b122f4d
BLAKE2b-256 6526bf01cd6b11d9bef3b5ebf59b1b374d598bdfc60b738cd363ab9fadfcd1db

See more details on using hashes here.

Provenance

The following attestation bundles were made for genblaze_core-0.3.0.tar.gz:

Publisher: release.yml on backblaze-labs/genblaze

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file genblaze_core-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: genblaze_core-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 232.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for genblaze_core-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 5f8d00c2f5c3561dfbc11e12539001a9d886313febbd080f70eb057d59d44f62
MD5 0509f653bd1c93101921030e8a6c572b
BLAKE2b-256 687fe87d0d787866adf0013572f3f618a181a2a18cf16773e8568e308c59afb4

See more details on using hashes here.

Provenance

The following attestation bundles were made for genblaze_core-0.3.0-py3-none-any.whl:

Publisher: release.yml on backblaze-labs/genblaze

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page