
genblaze-nvidia

NVIDIA NIM / build.nvidia.com provider adapters for genblaze. Covers four modalities on one nvapi- key: video (Cosmos, Edify), image (SDXL, SD 3.5, FLUX), audio (Fugatto, Riva TTS), and chat (Nemotron, Llama, Mistral, Qwen, …).

Install

pip install genblaze-nvidia            # video/image/audio providers
pip install "genblaze-nvidia[chat]"    # + the OpenAI SDK for LLM calls

Auth

Sign up at build.nvidia.com and create an API key (starts with nvapi-). Export it:

export NVIDIA_API_KEY=nvapi-...

The free tier is rate-limited (~40 requests/minute per model) with no per-token billing. Some models (Cosmos video) are still enterprise-gated as of 2026-04 and will return AUTH_FAILURE for free-tier keys until you have access.
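A quick sanity check on the key shape can save a confusing AUTH_FAILURE later. This is a hypothetical helper, not part of genblaze-nvidia (the provider classes read NVIDIA_API_KEY themselves):

```python
import os

def resolve_api_key(explicit=None):
    """Return the NVIDIA API key, preferring an explicit argument.

    Hypothetical convenience helper: only validates the expected
    nvapi- prefix before you hand the key to a provider.
    """
    key = explicit or os.environ.get("NVIDIA_API_KEY", "")
    if not key.startswith("nvapi-"):
        raise ValueError("expected a build.nvidia.com key starting with 'nvapi-'")
    return key
```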

Base URLs

NVIDIA's API spans three public hosts, all served by the same key:

| Surface | Base URL | Used by |
| --- | --- | --- |
| OpenAI-compatible chat / embeddings | https://integrate.api.nvidia.com/v1 | chat, achat |
| Model-specific generation | https://ai.api.nvidia.com/v1/genai/{vendor}/{slug} | NvidiaVideoProvider, NvidiaImageProvider, NvidiaAudioProvider |
| NVCF async status | https://api.nvcf.nvidia.com/v2/nvcf/pexec/status | Async video polling |

All three are overridable per-constructor for self-hosted NIM deployments:

NvidiaImageProvider(
    api_key="...",
    gen_base_url="https://self-hosted.internal/v1",
    nvcf_status_url="https://self-hosted.internal/v2/nvcf/pexec/status",
)

Video — NvidiaVideoProvider

Cosmos and Edify video models respond asynchronously (202 Accepted with an NVCF-REQID header), and the provider polls NVCF for completion. Some fast models return inline synchronous responses; both paths converge on the same lifecycle.

from genblaze_core.models.step import Step
from genblaze_nvidia import NvidiaVideoProvider

provider = NvidiaVideoProvider()  # reads NVIDIA_API_KEY
step = Step(
    provider="nvidia-video",
    model="nvidia/cosmos-1.0-7b-diffusion-text2world",
    prompt="a drone flight over a coastal cliff at sunset",
)
result = provider.invoke(step)
print(result.assets[0].url)  # file:// or https:// depending on response shape
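The NVCF polling loop can be sketched roughly like this. It is illustrative only: the real loop lives inside NvidiaVideoProvider, and `fetch` here stands in for an HTTP GET against `{nvcf_status_url}/{request_id}`:

```python
import time

def poll_nvcf(request_id, fetch, interval=2.0, max_polls=300):
    """Poll an NVCF job until it leaves 202 Accepted.

    `fetch(request_id)` returns (status_code, payload), e.g. from an
    httpx GET. Sketch of what the provider does internally.
    """
    for _ in range(max_polls):
        status, payload = fetch(request_id)
        if status == 200:          # finished; payload carries the result
            return payload
        if status != 202:          # anything else is a hard error
            raise RuntimeError(f"NVCF poll failed with HTTP {status}")
        time.sleep(interval)
    raise TimeoutError(f"NVCF job {request_id} still pending after {max_polls} polls")
```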

Image — NvidiaImageProvider

Synchronous inline base64 response. If an endpoint occasionally returns 202, the provider short-polls NVCF inside generate() so the caller still sees one blocking call.

from genblaze_core.models.step import Step
from genblaze_nvidia import NvidiaImageProvider

provider = NvidiaImageProvider()
step = Step(
    provider="nvidia-image",
    model="stabilityai/stable-diffusion-3-5-large",
    prompt="a studio photo of a brass teapot",
    params={"cfg_scale": 4.5, "aspect_ratio": "1:1"},
)
result = provider.invoke(step)

SDXL's schema differs from SD 3.5 / FLUX — the registry handles that transparently, rewriting prompt + negative_prompt into the text_prompts array SDXL expects.
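That rewrite looks roughly like the sketch below. The weight values are illustrative conventions from SDXL's API, not necessarily the exact payload genblaze-nvidia sends:

```python
def to_sdxl_payload(prompt, negative_prompt=None):
    """Rewrite canonical prompt fields into SDXL's text_prompts array.

    Sketch of the registry's per-model shape rewrite; positive prompts
    conventionally carry weight 1.0 and negatives -1.0.
    """
    text_prompts = [{"text": prompt, "weight": 1.0}]
    if negative_prompt:
        text_prompts.append({"text": negative_prompt, "weight": -1.0})
    return {"text_prompts": text_prompts}
```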

Audio — NvidiaAudioProvider

from genblaze_core.models.step import Step
from genblaze_nvidia import NvidiaAudioProvider

provider = NvidiaAudioProvider()

# TTS (mono)
step = Step(provider="nvidia-audio", model="nvidia/riva-tts", prompt="Hello, world.")

# Music / SFX (stereo)
step = Step(provider="nvidia-audio", model="nvidia/fugatto", prompt="upbeat synthwave intro")

result = provider.invoke(step)

Chat — chat / achat

OpenAI-wire-compatible. Any model NIM currently serves works as a plain string — no enumeration.

from genblaze_nvidia import chat

resp = chat(
    "nvidia/nemotron-4-340b-instruct",
    prompt="Summarize the Cosmos world foundation model in one sentence.",
)
print(resp.text)

import asyncio
from genblaze_nvidia import achat

async def main():
    r = await achat("meta/llama-3.3-70b-instruct", prompt="hi")
    print(r.text)

asyncio.run(main())
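Because chat/achat are OpenAI-wire-compatible, the request body is presumably equivalent to a plain chat.completions payload POSTed to https://integrate.api.nvidia.com/v1/chat/completions. A sketch of that assembly; which extra parameters genblaze actually forwards is an assumption:

```python
def chat_payload(model, prompt, system=None, **params):
    """Assemble an OpenAI-wire chat.completions body.

    Sketch of what chat()/achat() presumably send; passthrough of
    params like temperature or max_tokens is assumed, not confirmed.
    """
    messages = []
    if system:
        messages.append({"role": "system", "content": system})
    messages.append({"role": "user", "content": prompt})
    return {"model": model, "messages": messages, **params}
```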

Models

Curated entries encode per-model behavior (SDXL's text_prompts shape, canonical→native param aliases). Any model NVIDIA ships that isn't listed still works via the permissive fallback spec — no release needed.

| Modality | Curated defaults |
| --- | --- |
| Video | nvidia/cosmos-1.0-7b-diffusion-text2world, .../video2world, nvidia/cosmos-2.0-diffusion-* |
| Image | stabilityai/stable-diffusion-xl, .../stable-diffusion-3-5-{large,large-turbo,medium}, black-forest-labs/flux.1-{schnell,dev} |
| Audio | nvidia/fugatto, nvidia/riva-tts, nvidia/maxine-voice-font |
| Chat | None — pure pass-through. Use any NIM model id. |

To discover the live catalog at runtime, query the OpenAI-compatible /v1/models endpoint:

import httpx, os

r = httpx.get(
    "https://integrate.api.nvidia.com/v1/models",
    headers={"Authorization": f"Bearer {os.environ['NVIDIA_API_KEY']}"},
)
r.raise_for_status()
for m in r.json()["data"]:
    print(m["id"])

Error handling

NIM returns safety refusals as HTTP 400 with Nemoguard / safety markers in the body. map_nvidia_error classifies these as CONTENT_POLICY (non-retryable) instead of INVALID_INPUT — pipelines don't burn retries on a deterministic refusal.

| HTTP / message | ProviderErrorCode |
| --- | --- |
| 401, 403 | AUTH_FAILURE |
| 404 | MODEL_ERROR |
| 429 | RATE_LIMIT |
| 400 with safety marker | CONTENT_POLICY |
| 400 plain | INVALID_INPUT |
| 5xx | SERVER_ERROR |
| transport timeout | TIMEOUT |
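The mapping above can be mirrored in a few lines. This is a sketch, not the real map_nvidia_error; the safety-marker strings and the enum's home module are assumptions:

```python
from enum import Enum

class ProviderErrorCode(Enum):
    AUTH_FAILURE = "auth_failure"
    MODEL_ERROR = "model_error"
    RATE_LIMIT = "rate_limit"
    CONTENT_POLICY = "content_policy"
    INVALID_INPUT = "invalid_input"
    SERVER_ERROR = "server_error"
    TIMEOUT = "timeout"

SAFETY_MARKERS = ("nemoguard", "content safety", "guardrail")  # illustrative

def classify(status, body=""):
    """Mirror of the error table; status=None means a transport timeout."""
    if status is None:
        return ProviderErrorCode.TIMEOUT
    if status in (401, 403):
        return ProviderErrorCode.AUTH_FAILURE
    if status == 404:
        return ProviderErrorCode.MODEL_ERROR
    if status == 429:
        return ProviderErrorCode.RATE_LIMIT
    if status == 400:
        lowered = body.lower()
        if any(m in lowered for m in SAFETY_MARKERS):
            return ProviderErrorCode.CONTENT_POLICY
        return ProviderErrorCode.INVALID_INPUT
    return ProviderErrorCode.SERVER_ERROR  # 5xx and anything unexpected
```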
