Skip to main content

Python SDK for the 0G Compute Network - AI inference services on decentralized infrastructure

Project description

0G Compute Network — Python SDK

Python 3.8+ License: MIT PyPI

Python SDK for the 0G Compute Network — a decentralized marketplace for AI inference and fine-tuning.

📚 Full developer documentation: og-py.vercel.app

What you get

  • Decentralized AI inference — chatbot, text-to-image, image-editing, speech-to-text providers, all OpenAI-compatible
  • Fine-tuning — upload datasets to TEEs, train LoRA adapters, deploy to inference GPUs
  • Pay-per-request billing — on-chain ledger with provider sub-accounts; no signups, no credit cards
  • TEE attestation — every response is cryptographically signed; verify on-chain
  • API keys — persistent, individually revocable tokens for server applications
  • Auto-funding — background top-ups keep long-running services alive, and header generation performs an inline balance check when background funding is not running
  • Read-only mode — list providers without connecting a wallet
  • Native Python crypto — Baby JubJub, EdDSA, Pedersen — no Node.js dependency

Installation

pip install 0g-inference-sdk

Requires Python 3.8+. The SDK automatically installs web3, eth-account, requests.

Configure

# .env
PRIVATE_KEY=0xYOUR_PRIVATE_KEY
RPC_URL=https://evmrpc-testnet.0g.ai
NETWORK=testnet

⚠️ Never commit your .env file.

Quickstart

import os, requests
from zerog_py_sdk import create_broker
from zerog_py_sdk.utils import og_to_wei

# 1. Connect
broker = create_broker(
    private_key=os.environ["PRIVATE_KEY"],
    network="testnet",          # or "mainnet"
)

# 2. Discover providers
services = broker.inference.list_service()
provider = next(s.provider for s in services if s.service_type == "chatbot")

# 3. Fund (one-time setup)
try:
    broker.ledger.get_ledger()
except Exception:
    broker.ledger.add_ledger("3")               # contract minimum is 3 OG

broker.inference.acknowledge_provider_signer(provider)
broker.ledger.transfer_fund(provider, "inference", og_to_wei("1"))

# 4. Make an inference request
metadata = broker.inference.get_service_metadata(provider)
headers  = broker.inference.get_request_headers(provider)

response = requests.post(
    f"{metadata['endpoint']}/chat/completions",
    headers={"Content-Type": "application/json", **headers},
    json={
        "model":    metadata["model"],
        "messages": [{"role": "user", "content": "What is 2+2?"}],
    },
)
print(response.json()["choices"][0]["message"]["content"])

Async image jobs

The network's async image APIs live under /v1/async instead of /v1/proxy. Following the TypeScript SDK pattern, the broker only signs requests via get_request_headers; callers issue the HTTP requests themselves:

import json, time, requests

metadata = broker.inference.get_service_metadata(provider)
base_url = metadata["endpoint"].replace("/v1/proxy", "")

# Submit
body = {
    "model": metadata["model"],
    "prompt": "A cute baby sea otter",
    "n": 1,
    "size": "512x512",
    "response_format": "b64_json",
}
headers = broker.inference.get_request_headers(provider, json.dumps(body))
submit = requests.post(
    f"{base_url}/v1/async/images/generations",
    headers={"Content-Type": "application/json", **headers},
    json=body,
)
job_id = submit.json()["jobId"]

# Poll
while True:
    poll_headers = broker.inference.get_request_headers(provider)
    resp = requests.get(f"{base_url}/v1/async/jobs/{job_id}", headers=poll_headers)
    payload = resp.json()
    if payload["status"] in ("completed", "failed"):
        break
    time.sleep(float(resp.headers.get("Retry-After", 5)))

if payload["status"] == "failed":
    raise RuntimeError(payload.get("errorMessage", "Async job failed"))

images = payload["data"]["data"]
print(f"Generated {len(images)} image(s)")

Image edits use the same flow against /v1/async/images/edits with multipart/form-data.

Browse without a wallet

from zerog_py_sdk import create_read_only_broker

browser = create_read_only_broker(network="testnet")
for s in browser.list_service():
    print(f"{s.model}{s.provider}")

list_service_with_detail() adds health metrics (uptime, response time) from the network's monitoring API. It also enriches services with status-API model metadata plus parsed tiered_pricing and cache_token_billing details when providers expose them.

Use with the OpenAI Python SDK

Issue a persistent API key once, then use it like any OpenAI key:

from openai import OpenAI

secret   = broker.inference.get_secret(provider)
metadata = broker.inference.get_service_metadata(provider)

client = OpenAI(base_url=metadata["endpoint"], api_key=secret)

response = client.chat.completions.create(
    model    = metadata["model"],
    messages = [{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

Persistent keys can be revoked with revoke_api_key(provider, token_id), revoke_tokens(provider, [token_id, ...]), or revoke_all_tokens(provider).

Fine-tuning

PROVIDER = "0xFineTuningProvider..."
MODEL    = "Qwen2.5-0.5B-Instruct"

broker.fine_tuning.acknowledge_provider_signer(PROVIDER)
broker.ledger.transfer_fund(PROVIDER, "fine-tuning", og_to_wei("1"))

# Upload dataset and create task
upload  = broker.fine_tuning.upload_dataset_to_tee(PROVIDER, "./train.jsonl")
task_id = broker.fine_tuning.create_task(
    provider_address       = PROVIDER,
    pre_trained_model_name = MODEL,
    dataset_hash           = upload["datasetHash"],
    training_path          = "./training_params.json",
)

# After training completes, acknowledge the model and deploy LoRA
broker.fine_tuning.acknowledge_model(PROVIDER, task_id, "./lora.bin")

deploy = broker.inference.lora.deploy_adapter(
    provider_address = PROVIDER,
    base_model       = MODEL,
    task_id          = task_id,
    wait             = True,
)

# Chat with the fine-tuned adapter
response = broker.inference.lora.chat(PROVIDER, deploy.adapter_name, "Who are you?")

acknowledge_model() mirrors the TypeScript SDK retrieval flow: by default it tries 0G Storage first, falls back to TEE download, verifies the TEE-downloaded artifact hash when available, then acknowledges the deliverable on-chain. You can force a path with download_method="tee" or download_method="0g-storage". For recovery cases where you already retrieved the artifact elsewhere, acknowledge_deliverable(PROVIDER, task_id) releases the on-chain queue without downloading.

Fine-tuning provider helpers are also exposed on the broker: get_provider_url(), get_quote(), get_pending_task_counter(), get_customized_models(), get_customized_model(), and download_model_usage().

See examples/fine_tuning/fine_tuning_example.py for a complete end-to-end script.

Verify TEE-signed responses

chat_id = response.json()["id"]
content = response.json()["choices"][0]["message"]["content"]

is_valid = broker.inference.process_response(
    provider_address = provider,
    content          = content,
    chat_id          = chat_id,
)

For a full attestation check (TEE signer extraction, optional Automata contract verification, signed report). verify_service is silent by default — pass on_log to stream progress, or iterate result.steps after the call:

# Silent — returns a typed VerificationResult
result = broker.inference.verify_service(provider, output_dir="./reports")
if result.success:
    print(result.tee_signer)

# Stream progress while running
result = broker.inference.verify_service(
    provider,
    output_dir="./reports",
    on_log=lambda step: print(step.message),
)

# Or replay the steps after the fact
for step in result.steps:
    print(f"[{step.type}] {step.message}")

SDK reference

The SDK exposes everything from the top-level zerog_py_sdk package:

Symbol Purpose
create_broker(private_key, network=None, rpc_url=None) Main factory — returns ZGServingBroker
create_broker_from_env(env_file=".env") Load credentials from a .env file
create_read_only_broker(network=None) No wallet required — for browsing services
broker.ledger LedgerManager — deposits, transfers, refunds
broker.inference InferenceManager — service discovery, sync/async requests, API keys
broker.inference.lora LoRAProcessor — deploy LoRA adapters, chat
broker.fine_tuning FineTuningBroker — datasets, tasks, model delivery
og_to_wei, wei_to_og OG ↔ wei conversion helpers (zerog_py_sdk.utils)
ServiceMetadata, LedgerAccount, Account, ApiKeyInfo Data classes
ZGServingBrokerError, InsufficientBalanceError, ContractError, ... Exception hierarchy

For the full per-class API, browse the Compute reference docs or read the source — every public method has a docstring.

Network configuration

Network Chain ID RPC
Mainnet 16661 https://evmrpc.0g.ai
Testnet (Galileo) 16602 https://evmrpc-testnet.0g.ai

Contract addresses are bundled with the SDK and resolved automatically from the chain ID. Override with create_broker(..., ledger_address=..., inference_address=...) if needed.

Get testnet tokens at faucet.0g.ai.

Examples

Runnable scripts in examples/:

  • examples/inference/inference_example.py — end-to-end inference flow
  • examples/inference/setup_account.py — create and fund a ledger
  • examples/inference/diagnose_account.py — inspect your account state
  • examples/fine_tuning/fine_tuning_example.py — full fine-tuning + deploy + chat

Troubleshooting

  • 403 Forbidden — endpoint URL must include /v1/proxy. Use broker.inference.get_service_metadata(provider)["endpoint"], which appends it automatically.
  • 401 Unauthorized — session token expired (24h) or revoked. Headers refresh on the next get_request_headers() call.
  • Insufficient balance — top up with broker.ledger.deposit_fund("1") and transfer to the provider with broker.ledger.transfer_fund(...).
  • Provider not acknowledged — call broker.inference.acknowledge_provider_signer(provider) once per provider.
  • Empty service list — check chain ID matches the network you intended (broker.web3.eth.chain_id).

Contributing

PRs welcome. Please add tests and run them locally before opening a PR.

License

MIT

Links

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

0g_inference_sdk-0.7.0.tar.gz (111.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

0g_inference_sdk-0.7.0-py3-none-any.whl (112.9 kB view details)

Uploaded Python 3

File details

Details for the file 0g_inference_sdk-0.7.0.tar.gz.

File metadata

  • Download URL: 0g_inference_sdk-0.7.0.tar.gz
  • Upload date:
  • Size: 111.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for 0g_inference_sdk-0.7.0.tar.gz
Algorithm Hash digest
SHA256 9a8b513a028adaa3969c55c1f55acb6bb8c8c420f564922f420248db9c14c1c8
MD5 f8cbbbc1103632794a1be8340155b882
BLAKE2b-256 073828c95bb17b5f0b3d571c512c73cb3164d52f0c38b72351c824c7549cae1e

See more details on using hashes here.

File details

Details for the file 0g_inference_sdk-0.7.0-py3-none-any.whl.

File metadata

File hashes

Hashes for 0g_inference_sdk-0.7.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7c0da55597f52fc9e26696febaf8a408dad360999774a213d03fdb630517c1ea
MD5 b6530af49588081cc50847b2aeda5dfc
BLAKE2b-256 7a2bf7128590e8f05b07425cea2c7658e36cca9845de35da331ddac06f6deca6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page