Python SDK for the 0G Compute Network - AI inference services on decentralized infrastructure

0G Compute Network — Python SDK

Python SDK for the 0G Compute Network — a decentralized marketplace for AI inference and fine-tuning.

📚 Full developer documentation: og-py.vercel.app

🤖 Agent-readable docs: the wheel also ships local, version-locked markdown guides at zerog_py_sdk/llms.txt and zerog_py_sdk/docs/llms/, so coding assistants can inspect the installed package and use APIs that match your exact SDK version.

What you get

Decentralized AI inference — chatbot, text-to-image, image-editing, speech-to-text providers, all OpenAI-compatible
Fine-tuning — upload datasets to TEEs, train LoRA adapters, deploy to inference GPUs
Pay-per-request billing — on-chain ledger with provider sub-accounts; no signups, no credit cards
TEE attestation — every response is cryptographically signed; verify on-chain
API keys — persistent, individually revocable tokens for server applications
Auto-funding — background top-ups keep long-running services alive, and header generation performs an inline balance check when background funding is not running
Read-only mode — list providers without connecting a wallet
Native Python crypto — Baby JubJub, EdDSA, Pedersen — no Node.js dependency

Installation

pip install 0g-inference-sdk

Requires Python 3.8+. pip also installs the SDK's dependencies: web3, eth-account, and requests.

Configure

# .env
PRIVATE_KEY=0xYOUR_PRIVATE_KEY
RPC_URL=https://evmrpc-testnet.0g.ai
NETWORK=testnet

⚠️ Never commit your .env file.
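The Quickstart reads PRIVATE_KEY from os.environ, so the values in .env need to reach the process environment first. The SDK reference lists create_broker_from_env(env_file=".env") for this. If you prefer to load the file yourself, the following is a minimal stdlib sketch; load_env_file is a hypothetical helper, not part of the SDK, and it handles only plain KEY=VALUE lines.

```python
import os
from pathlib import Path


def load_env_file(path=".env", override=False):
    """Parse KEY=VALUE lines from a dotenv-style file into os.environ.

    Hypothetical helper, not an SDK function. Blank lines, comment lines
    starting with '#', and lines without '=' are skipped. Surrounding
    single or double quotes are stripped from values.
    Returns a dict of every pair that was parsed.
    """
    parsed = {}
    for raw in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key, value = key.strip(), value.strip().strip("'\"")
        # Existing environment variables win unless override=True
        if override or key not in os.environ:
            os.environ[key] = value
        parsed[key] = value
    return parsed
```

Call load_env_file() once at startup, before create_broker(private_key=os.environ["PRIVATE_KEY"], ...).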

Quickstart

import os, requests
from zerog_py_sdk import create_broker
from zerog_py_sdk.utils import og_to_wei

# 1. Connect
broker = create_broker(
    private_key=os.environ["PRIVATE_KEY"],
    network="testnet",          # or "mainnet"
)

# 2. Discover providers
services = broker.inference.list_service()
provider = next(s.provider for s in services if s.service_type == "chatbot")

# 3. Fund (one-time setup)
try:
    broker.ledger.get_ledger()
except Exception:
    broker.ledger.add_ledger("3")               # contract minimum is 3 OG

broker.inference.acknowledge_provider_signer(provider)
broker.ledger.transfer_fund(provider, "inference", og_to_wei("1"))

# 4. Make an inference request
metadata = broker.inference.get_service_metadata(provider)
headers  = broker.inference.get_request_headers(provider)

response = requests.post(
    f"{metadata['endpoint']}/chat/completions",
    headers={"Content-Type": "application/json", **headers},
    json={
        "model":    metadata["model"],
        "messages": [{"role": "user", "content": "What is 2+2?"}],
    },
    timeout=60,                 # avoid hanging forever on an unresponsive provider
)
response.raise_for_status()     # surface 401/403 errors before parsing the body
print(response.json()["choices"][0]["message"]["content"])
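The Quickstart passes og_to_wei("1") to transfer_fund, which expects an integer amount in wei. As a rough illustration of the unit conversion, the sketch below assumes OG uses 18 decimals like other EVM native tokens; that figure is an assumption, and og_to_wei_sketch and wei_to_og_sketch are stand-ins, not the SDK's implementation.

```python
from decimal import Decimal, localcontext

# Assumption: 1 OG = 10**18 wei, as with other EVM native tokens
WEI_PER_OG = 10 ** 18


def og_to_wei_sketch(amount):
    """Convert an OG amount (str, int, or Decimal) to an integer wei value."""
    with localcontext() as ctx:
        ctx.prec = 60  # enough digits that large balances are not rounded
        wei = Decimal(str(amount)) * WEI_PER_OG
        if wei != wei.to_integral_value():
            raise ValueError("amount has more than 18 decimal places")
        return int(wei)


def wei_to_og_sketch(wei):
    """Convert an integer wei value back to a Decimal OG amount."""
    with localcontext() as ctx:
        ctx.prec = 60
        return Decimal(wei) / WEI_PER_OG
```

Passing amounts as strings, as the Quickstart does, avoids binary floating-point rounding before the conversion.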

Async image jobs

The network's async image APIs live under /v1/async instead of /v1/proxy. Following the TypeScript SDK pattern, the broker only signs requests via get_request_headers; callers issue the HTTP requests themselves:

import json, time, requests

metadata = broker.inference.get_service_metadata(provider)
base_url = metadata["endpoint"].replace("/v1/proxy", "")

# Submit
body = {
    "model": metadata["model"],
    "prompt": "A cute baby sea otter",
    "n": 1,
    "size": "512x512",
    "response_format": "b64_json",
}
headers = broker.inference.get_request_headers(provider, json.dumps(body))
submit = requests.post(
    f"{base_url}/v1/async/images/generations",
    headers={"Content-Type": "application/json", **headers},
    json=body,
)
job_id = submit.json()["jobId"]

# Poll
while True:
    poll_headers = broker.inference.get_request_headers(provider)
    resp = requests.get(f"{base_url}/v1/async/jobs/{job_id}", headers=poll_headers)
    payload = resp.json()
    if payload["status"] in ("completed", "failed"):
        break
    time.sleep(float(resp.headers.get("Retry-After", 5)))

if payload["status"] == "failed":
    raise RuntimeError(payload.get("errorMessage", "Async job failed"))

images = payload["data"]["data"]
print(f"Generated {len(images)} image(s)")

Image edits use the same flow against /v1/async/images/edits with multipart/form-data.
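The polling loop above runs until the job reaches a terminal status, so a stuck job would block forever, and float() would fail if Retry-After arrived as an HTTP date instead of a number of seconds. The sketch below adds a deadline and a fallback delay. poll_job and its parameters are hypothetical, and the caller supplies a fetch callable that performs the signed GET.

```python
import time


def poll_job(fetch, timeout_s=300.0, default_delay_s=5.0,
             sleep=time.sleep, clock=time.monotonic):
    """Call fetch() until the job is terminal or the deadline would be exceeded.

    fetch() must return (payload_dict, retry_after_value_or_None).
    Returns the terminal payload; raises TimeoutError otherwise.
    """
    deadline = clock() + timeout_s
    while True:
        payload, retry_after = fetch()
        if payload.get("status") in ("completed", "failed"):
            return payload
        try:
            delay = float(retry_after)
        except (TypeError, ValueError):
            # Header missing, or not expressed as a number of seconds
            delay = default_delay_s
        if clock() + delay > deadline:
            raise TimeoutError("async job did not finish before the deadline")
        sleep(delay)
```

With requests, fetch could be a small function that calls get_request_headers(provider), issues the GET against the job URL, and returns (resp.json(), resp.headers.get("Retry-After")).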

Browse without a wallet

from zerog_py_sdk import create_read_only_broker

browser = create_read_only_broker(network="testnet")
for s in browser.list_service():
    print(f"{s.model} — {s.provider}")

list_service_with_detail() adds health metrics (uptime, response time) from the network's monitoring API. When providers expose them, it also attaches status-API model metadata, parsed tiered_pricing and cache_token_billing fields, multi-model metadata, and per-model health details.
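Once you have a service list, grouping it by service type makes it easier to pick a provider. The sketch below uses a stand-in Service dataclass with only the three attributes this README reads (provider, model, service_type); the real objects returned by list_service() may carry more fields, and the "text-to-image" type string is an assumption.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Service:
    """Stand-in for an SDK service entry; not the SDK's own class."""
    provider: str
    model: str
    service_type: str


def group_by_type(services):
    """Return {service_type: [service, ...]}, preserving listing order."""
    grouped = defaultdict(list)
    for service in services:
        grouped[service.service_type].append(service)
    return dict(grouped)


if __name__ == "__main__":
    sample = [
        Service("0xA", "model-a", "chatbot"),
        Service("0xB", "model-b", "text-to-image"),  # type name assumed
    ]
    for kind, entries in group_by_type(sample).items():
        print(kind, [s.model for s in entries])
```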

Multi-model providers

Use get_provider_models() to fetch the provider's authoritative public catalog. For multi-model providers, pass one of the returned IDs to get_service_metadata():

catalog = broker.inference.get_provider_models(provider)

for model in catalog.models:
    print(model.id, model.canonical_id, model.pricing)

selected = catalog.models[0].id
metadata = broker.inference.get_service_metadata(provider, selected)
headers = broker.inference.get_request_headers(provider)

response = requests.post(
    f"{metadata['endpoint']}/chat/completions",
    headers={"Content-Type": "application/json", **headers},
    json={
        "model": metadata["model"],
        "messages": [{"role": "user", "content": "Hello!"}],
    },
)

The SDK forwards the selected model unchanged. The provider validates it and applies that model's price. Omitting the model preserves the on-chain default.

The wallet-free broker exposes the same catalog:

catalog = browser.get_provider_models(provider)

Use with the OpenAI Python SDK

Issue a persistent API key once, then use it like any OpenAI key:

from openai import OpenAI

secret   = broker.inference.get_secret(provider)
metadata = broker.inference.get_service_metadata(provider)

client = OpenAI(base_url=metadata["endpoint"], api_key=secret)

response = client.chat.completions.create(
    model    = metadata["model"],
    messages = [{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

Persistent keys can be revoked with revoke_api_key(provider, token_id), revoke_tokens(provider, [token_id, ...]), or revoke_all_tokens(provider).

Fine-tuning

PROVIDER = "0xFineTuningProvider..."
MODEL    = "Qwen2.5-0.5B-Instruct"

broker.fine_tuning.acknowledge_provider_signer(PROVIDER)
broker.ledger.transfer_fund(PROVIDER, "fine-tuning", og_to_wei("1"))

# Upload dataset and create task
upload  = broker.fine_tuning.upload_dataset_to_tee(
    PROVIDER,
    "./train.jsonl",
    max_file_size_mb=100,
    timeout_ms=180_000,
)
task_id = broker.fine_tuning.create_task(
    provider_address       = PROVIDER,
    pre_trained_model_name = MODEL,
    dataset_hash           = upload["datasetHash"],
    training_path          = "./training_params.json",
)

# After training completes, acknowledge the model and deploy LoRA
broker.fine_tuning.acknowledge_model(PROVIDER, task_id, "./lora.bin")

deploy = broker.inference.deploy_adapter(
    provider_address = PROVIDER,
    base_model       = MODEL,
    task_id          = task_id,
    wait             = True,
)

# Chat with the fine-tuned adapter
response = broker.inference.chat_with_fine_tuned_model(
    PROVIDER,
    deploy.adapter_name,
    "Who are you?",
)

acknowledge_model() mirrors the TypeScript SDK retrieval flow: by default it tries 0G Storage first, falls back to TEE download, verifies the TEE-downloaded artifact hash when available, then acknowledges the deliverable on-chain. You can force a path with download_method="tee" or download_method="0g-storage". For recovery cases where you already retrieved the artifact elsewhere, acknowledge_deliverable(PROVIDER, task_id) releases the on-chain queue without downloading.

Fine-tuning provider helpers are also exposed on the broker: get_provider_url(), get_quote(), get_pending_task_counter(), get_customized_models(), get_customized_model(), and download_model_usage().

Fine-tuning helper binaries

0G Storage operations use the official 0g-storage-client. Configure an executable explicitly for production deployments:

from zerog_py_sdk import BinaryConfig, create_broker

broker = create_broker(
    private_key="0x...",
    binary_config=BinaryConfig(
        storage_client_path="/opt/0g/bin/0g-storage-client",
        token_counter_path="/opt/0g/bin/token_counter",  # optional
    ),
)

The equivalent environment variables are ZG_STORAGE_CLIENT_PATH, ZG_TOKEN_COUNTER_PATH, and ZG_BINARY_CACHE_DIR. Without an explicit path, the SDK checks its user cache, packaged assets, and PATH, in that order.
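The lookup order described above can be sketched as a small resolver. This is an illustration only: resolve_binary is hypothetical, the precedence of an explicit argument over the environment variable is an assumption, and the actual cache and asset directories are internal to the SDK, so the caller passes them in.

```python
import os
import shutil
from pathlib import Path


def resolve_binary(name, explicit_path=None, env_var=None, search_dirs=()):
    """Return the first usable executable path for `name`, or None.

    Order: explicit argument, environment variable, each directory in
    search_dirs (for example user cache, then packaged assets), then PATH.
    Candidates that are missing or not executable are skipped.
    """
    candidates = [explicit_path]
    if env_var:
        candidates.append(os.environ.get(env_var))
    candidates.extend(Path(directory) / name for directory in search_dirs)
    for candidate in candidates:
        if candidate and os.path.isfile(candidate) and os.access(candidate, os.X_OK):
            return str(candidate)
    return shutil.which(name)  # None when the name is not on PATH
```

For example, resolve_binary("0g-storage-client", env_var="ZG_STORAGE_CLIENT_PATH", search_dirs=[cache_dir, assets_dir]) mirrors the documented order, with cache_dir and assets_dir supplied by you.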

Official 0g-storage-client GitHub releases do not currently publish cross-platform executable assets, so the Python SDK does not download an unverified build. Build the client from the official repository for your platform. When executable token counting is requested and no override is configured, token_counter is fetched through 0G Storage using the upstream Merkle root, SHA-256 verified, and atomically installed in the user cache.

Model decryption requires the fine-tuning extra:

pip install "0g-inference-sdk[fine-tuning]"

Decryption is fail-closed. The destination is replaced only after AES-GCM authentication and the provider TEE signer's raw tag signature both verify. Failures raise ModelVerificationError and leave any existing destination unchanged.
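To illustrate the fail-closed pattern, here is a minimal sketch that verifies data before touching the destination and then swaps it in with an atomic rename. It substitutes a plain SHA-256 comparison for the SDK's AES-GCM authentication and TEE signature checks, and VerificationFailed and install_verified are hypothetical names standing in for the SDK's own error and logic.

```python
import hashlib
import os
import tempfile


class VerificationFailed(Exception):
    """Stand-in for the SDK's ModelVerificationError."""


def install_verified(data: bytes, expected_sha256: str, destination: str) -> None:
    """Write data to destination only if its SHA-256 digest matches.

    On mismatch nothing is written and any existing file is left as-is.
    """
    if hashlib.sha256(data).hexdigest() != expected_sha256.lower():
        raise VerificationFailed("digest mismatch; destination left unchanged")

    # Stage the file next to the destination so the rename stays on one filesystem
    dest_dir = os.path.dirname(os.path.abspath(destination))
    fd, tmp_path = tempfile.mkstemp(dir=dest_dir)
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(data)
            tmp.flush()
            os.fsync(tmp.fileno())
        os.replace(tmp_path, destination)  # replaces the old file in one step
    except BaseException:
        # Clean up the staged file if writing or renaming failed
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise
```

Verifying first and renaming last means a reader of the destination path sees either the old file or the fully verified new one, never a partial write.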

See examples/fine_tuning/fine_tuning_example.py for a complete end-to-end script.

Verify TEE-signed responses

# `response` is the requests.Response from the Quickstart, not an OpenAI SDK object
data    = response.json()
chat_id = data["id"]
content = data["choices"][0]["message"]["content"]

is_valid = broker.inference.process_response(
    provider_address = provider,
    content          = content,
    chat_id          = chat_id,
)

For a full attestation check (TEE signer extraction, optional Automata contract verification, signed report), call verify_service. It is silent by default: pass on_log to stream progress, or iterate result.steps after the call. An explicit Automata rejection sets attestation_verified=False and fails the verification; an RPC outage sets it to None and records attestation_error.

# Silent — returns a typed VerificationResult
result = broker.inference.verify_service(provider, output_dir="./reports")
if result.success:
    print(result.tee_signer)

# Stream progress while running
result = broker.inference.verify_service(
    provider,
    output_dir="./reports",
    on_log=lambda step: print(step.message),
)

# Or replay the steps after the fact
for step in result.steps:
    print(f"[{step.type}] {step.message}")
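Because attestation_verified can be True, False, or None, a plain truthiness check such as `if not result.attestation_verified` would treat an RPC outage the same as an explicit rejection. The sketch below separates the three cases. VerificationResultStub is a stand-in holding only the fields named in this README, and the state labels are illustrative.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class VerificationResultStub:
    """Stand-in for the SDK's VerificationResult; fields taken from this README."""
    success: bool
    attestation_verified: Optional[bool] = None
    attestation_error: Optional[str] = None


def attestation_state(result):
    """Map the tri-state flag to a label using identity checks, not truthiness."""
    if result.attestation_verified is True:
        return "verified"
    if result.attestation_verified is False:
        return "rejected"
    # None: no verdict was reached; attestation_error may explain why
    return "inconclusive"
```

A caller might retry on "inconclusive" and refuse to use the provider on "rejected".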

SDK reference

The SDK exposes everything from the top-level zerog_py_sdk package:

Symbol	Purpose
`create_broker(private_key, network=None, rpc_url=None)`	Main factory — returns `ZGServingBroker`
`create_ledger_broker(...)`	Standalone ledger broker factory
`create_inference_broker(...)`	Standalone inference broker factory
`create_fine_tuning_broker(...)`	Standalone fine-tuning broker factory
`create_broker_from_env(env_file=".env")`	Load credentials from a `.env` file
`create_read_only_broker(network=None)`	No wallet required — for browsing services
`broker.ledger`	`LedgerManager` — deposits, transfers, refunds
`broker.inference`	`InferenceManager` — service discovery, sync/async requests, API keys
`broker.inference.get_provider_models(provider)`	Live provider model catalog and per-model health
`broker.inference.lora`	`LoRAProcessor` — deploy LoRA adapters, chat
`broker.inference.deploy_adapter(...)`	Direct TS-parity LoRA deployment method
`broker.fine_tuning`	`FineTuningBroker` — datasets, tasks, model delivery
`get_network_type(chain_id)`	Canonical network name, including `unknown`
`og_to_wei`, `wei_to_og`	OG ↔ wei conversion helpers (`zerog_py_sdk.utils`)
`ServiceMetadata`, `LedgerAccount`, `Account`, `ApiKeyInfo`	Data classes
`ZGServingBrokerError`, `InsufficientBalanceError`, `ContractError`, ...	Exception hierarchy

Decoded contract failures expose ContractError.error_name, ContractError.error_args, and ContractError.revert_data, while preserving the original exception as __cause__.
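As a rough sketch of how those attributes might be used in logging, the example below defines a stand-in exception with the same three attributes and chains it to an original error with `raise ... from`. ContractErrorStub, describe_contract_error, and the attribute types (a tuple of arguments, bytes for revert data) are assumptions made for illustration.

```python
class ContractErrorStub(Exception):
    """Stand-in carrying the attributes this README lists for ContractError."""

    def __init__(self, message, error_name=None, error_args=(), revert_data=b""):
        super().__init__(message)
        self.error_name = error_name
        self.error_args = tuple(error_args)
        self.revert_data = revert_data


def describe_contract_error(exc):
    """Build a one-line log message from a decoded contract failure."""
    args = ", ".join(repr(arg) for arg in exc.error_args)
    # __cause__ is set by `raise ... from original`
    cause = type(exc.__cause__).__name__ if exc.__cause__ else "none"
    return f"{exc.error_name or 'unknown'}({args}) [cause: {cause}]"
```

In real code you would catch the SDK's ContractError instead of the stub and pass it to the same kind of formatter.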

For the full per-class API, browse the Compute reference docs or read the source — every public method has a docstring.

Network configuration

Network	Chain ID	RPC
Mainnet	`16661`	`https://evmrpc.0g.ai`
Testnet (Galileo)	`16602`	`https://evmrpc-testnet.0g.ai`

Contract addresses are bundled with the SDK and resolved automatically from the chain ID. Override with create_broker(..., ledger_address=..., inference_address=...) if needed.
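The chain-ID lookup can be pictured as a small table keyed by the values above. This sketch mirrors the documented behavior of get_network_type(chain_id), including the unknown fallback, but NETWORKS and network_name are illustrative names, and the "mainnet" and "testnet" strings are taken from the network argument used elsewhere in this README.

```python
# Chain IDs and RPC URLs copied from the network table in this README
NETWORKS = {
    16661: {"name": "mainnet", "rpc_url": "https://evmrpc.0g.ai"},
    16602: {"name": "testnet", "rpc_url": "https://evmrpc-testnet.0g.ai"},
}


def network_name(chain_id):
    """Return the network name for a chain ID, or 'unknown' if unrecognized."""
    entry = NETWORKS.get(int(chain_id))
    return entry["name"] if entry else "unknown"


def default_rpc_url(chain_id):
    """Return the public RPC URL for a known chain ID, else None."""
    entry = NETWORKS.get(int(chain_id))
    return entry["rpc_url"] if entry else None
```

Comparing network_name(broker.web3.eth.chain_id) with the network you intended is a quick way to catch a mismatched RPC_URL.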

Get testnet tokens at faucet.0g.ai.

Examples

Runnable scripts in examples/:

examples/inference/inference_example.py — end-to-end inference flow
examples/inference/setup_account.py — create and fund a ledger
examples/inference/diagnose_account.py — inspect your account state
examples/fine_tuning/fine_tuning_example.py — full fine-tuning + deploy + chat

Troubleshooting

403 Forbidden — endpoint URL must include /v1/proxy. Use broker.inference.get_service_metadata(provider)["endpoint"], which appends it automatically.
401 Unauthorized — session token expired (24h) or revoked. Headers refresh on the next get_request_headers() call.
Insufficient balance — top up with broker.ledger.deposit_fund("1") and transfer to the provider with broker.ledger.transfer_fund(...).
Provider not acknowledged — call broker.inference.acknowledge_provider_signer(provider) once per provider.
Empty service list — check chain ID matches the network you intended (broker.web3.eth.chain_id).
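For the 401 case, one common approach is to regenerate the headers and retry once. The sketch below takes two callables so it stays independent of any HTTP library; post_with_refresh is a hypothetical helper, not an SDK function.

```python
def post_with_refresh(send, make_headers, max_attempts=2):
    """Send a request, regenerating headers and retrying when the reply is 401.

    send(headers) must return an object with a .status_code attribute.
    make_headers() must return a fresh header dict on every call.
    Returns the last response, whatever its status.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    response = None
    for _ in range(max_attempts):
        response = send(make_headers())
        if response.status_code != 401:
            break  # success, or an error that new headers will not fix
    return response
```

With requests, send could be `lambda h: requests.post(url, headers=h, json=body, timeout=60)` and make_headers could be `lambda: broker.inference.get_request_headers(provider)`.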

Contributing

PRs welcome. Please add tests and run them locally before opening a PR.

License

MIT

Release history

Version	Released
0.9.1 (this version)	Jun 16, 2026
0.9.0	Jun 15, 2026
0.7.0	May 18, 2026
0.6.0	Apr 5, 2026
0.5.0	Mar 23, 2026
0.4.0	Mar 18, 2026
0.2.0	Feb 17, 2026

Download files

Source Distribution

0g_inference_sdk-0.9.1.tar.gz (146.2 kB)

Uploaded Jun 16, 2026 Source

Built Distribution

0g_inference_sdk-0.9.1-py3-none-any.whl (136.6 kB)

Uploaded Jun 16, 2026 Python 3

File details

Details for the file 0g_inference_sdk-0.9.1.tar.gz.

File metadata

Download URL: 0g_inference_sdk-0.9.1.tar.gz
Upload date: Jun 16, 2026
Size: 146.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for 0g_inference_sdk-0.9.1.tar.gz
Algorithm	Hash digest
SHA256	`b1dd963de3429ba3a87caaf5eb030d8f663432ae0ef5eb6bdd40cda3a54c0077`
MD5	`42ee3cc5e7380bea5e6c94fb230977c0`
BLAKE2b-256	`7bfb2cb6054b16b21d5e0c96a37f5a41d405d46fa8c5d17d7882693d8cf433f1`

File details

Details for the file 0g_inference_sdk-0.9.1-py3-none-any.whl.

File metadata

Download URL: 0g_inference_sdk-0.9.1-py3-none-any.whl
Upload date: Jun 16, 2026
Size: 136.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for 0g_inference_sdk-0.9.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4381fc7dffa3237c3dd81497d1a08f3add2a352cec9533cfe4a54e7f725f1cd8`
MD5	`92d154d9782e6d8e4920f062679973b5`
BLAKE2b-256	`9c337a2077812c1270af9c83f6504e6559ce8555ead142bb298f6167ea86996e`
