Info: v0.7 talks to a new Supabase backend. v0.6 will remain online until at least December 1st, 2025.

This repo is research code. Please use GitHub issues, or contact me via email (niels dot warncke at gmail dot com) or Slack, when you encounter issues.
# OpenWeights

An OpenAI-like SDK with the flexibility of working on a local GPU: finetuning, inference, API deployments, and custom workloads on managed RunPod instances.
## Installation

Run `pip install openweights`, or install from source via `pip install -e .`.
## Quickstart

1. **Create an API key.** You can create one via `ow signup` or using the dashboard.

2. **Start the cluster manager** (skip this if you got an API key for a managed cluster). The cluster manager is the service that monitors the job queue and starts RunPod workers. You have several options to start the cluster:

   ```sh
   ow cluster --env-file path/to/env   # Run locally
   ow deploy --env-file path/to/env    # Run on a RunPod CPU instance

   # Or managed, if you trust us with your API keys
   # (usually a bad idea, but okay if you know us personally)
   ow env import path/to/env
   ow manage start
   ```

   In all cases, the env file needs at least all variables defined in `.env.worker.example`.
3. **Submit a job:**

   ```python
   from openweights import OpenWeights

   ow = OpenWeights()

   training_file = ow.files.upload("data/train.jsonl", purpose="conversations")["id"]

   job = ow.fine_tuning.create(
       model="unsloth/Qwen3-4B",
       training_file=training_file,
       loss="sft",
       epochs=1,
       learning_rate=1e-4,
       r=32,
   )
   ```
For more examples, check out the cookbook.
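A file uploaded with `purpose="conversations"` holds chat-format JSONL. The exact schema below is an assumption based on the common OpenAI-style format (verify against the cookbook); a minimal sketch of writing such a file:

```python
import json

# Hypothetical chat-format rows; the real required fields may differ.
rows = [
    {"messages": [
        {"role": "user", "content": "What is 2 + 2?"},
        {"role": "assistant", "content": "4"},
    ]},
    {"messages": [
        {"role": "user", "content": "Name a prime number."},
        {"role": "assistant", "content": "7"},
    ]},
]

# JSONL: one JSON object per line.
with open("train.jsonl", "w") as f:
    for row in rows:
        f.write(json.dumps(row) + "\n")

# Read it back to confirm each line is valid JSON.
with open("train.jsonl") as f:
    parsed = [json.loads(line) for line in f]
print(len(parsed))
```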
## Overview

openweights lets you submit jobs that run on managed RunPod instances. It supports a range of built-in jobs, but is built for custom workloads.
### Custom jobs

A custom job lets you run a script that you would normally run on a single GPU as a managed job. Example:
```python
import json
from typing import Type

from pydantic import BaseModel

from openweights import OpenWeights, register, Jobs

ow = OpenWeights()


class MyParams(BaseModel):
    """Your Pydantic model for params (fields here are just an example)."""
    learning_rate: float = 1e-4


@register('my_custom_job')
class MyCustomJob(Jobs):
    mount = {
        'local/path/to/script.py': 'script.py',
        'local/path/to/dir/': 'dirname/',
    }
    params: Type[BaseModel] = MyParams
    requires_vram_gb: int = 24
    base_image: str = 'nielsrolf/ow-default'  # optional

    def get_entrypoint(self, validated_params: BaseModel) -> str:
        """Get the entrypoint command for the job."""
        return f'python script.py {json.dumps(validated_params.model_dump())}'
```
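The `get_entrypoint` pattern serializes the validated params into the command line. A self-contained sketch of that serialization, with a plain dict standing in for the Pydantic model (whether the worker requires shell quoting is an assumption; `shlex.quote` is used here defensively):

```python
import json
import shlex

def build_entrypoint(params: dict) -> str:
    # Serialize params to JSON and shell-quote the result so spaces and
    # quotes inside the JSON survive the command line intact.
    return f"python script.py {shlex.quote(json.dumps(params))}"

cmd = build_entrypoint({"learning_rate": 1e-4, "epochs": 1})
print(cmd)
```

The script can then recover the params with `json.loads(sys.argv[1])`.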
### Built-in jobs

#### Inference
```python
from openweights import OpenWeights

ow = OpenWeights()

model = 'unsloth/llama-3-8b-Instruct'

file = ow.files.create(
    file=open("mydata.jsonl", "rb"),
    purpose="conversations"
)

job = ow.inference.create(
    model=model,
    input_file_id=file['id'],
    max_tokens=1000,
    temperature=1,
    min_tokens=600,
)

# Wait or poll until the job is done, then:
if job.status == 'completed':
    output_file_id = job['outputs']['file']
    output = ow.files.content(output_file_id).decode('utf-8')
    print(output)
```
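The downloaded output can then be parsed line by line. The row schema below is an assumption for illustration (the real field names may differ; check the cookbook), with a literal string standing in for `ow.files.content(...)`:

```python
import json

# Stand-in for ow.files.content(output_file_id).decode("utf-8");
# the per-line schema here is hypothetical.
output = (
    '{"input": "Hi", "completion": "Hello!"}\n'
    '{"input": "Bye", "completion": "Goodbye!"}\n'
)

# JSONL: parse each non-empty line as its own JSON object.
rows = [json.loads(line) for line in output.splitlines() if line.strip()]
for row in rows:
    print(row["completion"])
```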
#### OpenAI-like vLLM API

```python
from openweights import OpenWeights

ow = OpenWeights()

model = 'unsloth/llama-3-8b-Instruct'

# `async with ow.api.deploy(model)` also works.
# Entering the context manager is equivalent to
# `api = ow.api.deploy(model); api.up()`.
with ow.api.deploy(model):
    completion = ow.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": "is 9.11 > 9.9?"}]
    )
    print(completion.choices[0].message)
# When the context manager exits, it calls `api.down()`.
```
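The lifecycle described in the comments (`up()` on enter, `down()` on exit) is the standard Python context-manager pattern. A minimal stand-in sketch with a dummy class (names are illustrative, not the openweights implementation):

```python
class DummyDeployment:
    """Stand-in for ow.api.deploy(model): up() on enter, down() on exit."""

    def __init__(self):
        self.running = False

    def up(self):
        self.running = True

    def down(self):
        self.running = False

    def __enter__(self):
        self.up()
        return self

    def __exit__(self, exc_type, exc, tb):
        self.down()
        return False  # do not swallow exceptions raised in the block

api = DummyDeployment()
with api:
    print(api.running)   # deployment is live inside the block
print(api.running)       # torn down on exit, even if the block raised
```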
#### Inspect-AI

```python
from openweights import OpenWeights

ow = OpenWeights()

job = ow.inspect_ai.create(
    model='meta-llama/Llama-3.3-70B-Instruct',
    eval_name='inspect_evals/gpqa_diamond',
    options='--top-p 0.9',  # Can be any options that `inspect eval` accepts - we simply pass them on without validation
)

if job.status == 'completed':
    job.download('output')
```
## CLI

Use `ow {cmd} --help` for more help on the available commands:

```
❯ ow --help
usage: ow [-h] {ssh,exec,signup,cluster,worker,token,ls,cancel,logs,fetch,serve,deploy,env,manage} ...

OpenWeights CLI for remote GPU operations

positional arguments:
  {ssh,exec,signup,cluster,worker,token,ls,cancel,logs,fetch,serve,deploy,env,manage}
    ssh        Start or attach to a remote shell with live file sync.
    exec       Execute a command on a remote GPU with file sync.
    signup     Create a new user, organization, and API key.
    cluster    Run the cluster manager locally with your own infrastructure.
    worker     Run a worker to execute jobs from the queue.
    token      Manage API tokens for organizations.
    ls         List job IDs.
    cancel     Cancel jobs by ID.
    logs       Display logs for a job.
    fetch      Fetch file content by ID.
    serve      Start the dashboard backend server.
    deploy     Deploy a cluster instance on RunPod.
    env        Manage organization secrets (environment variables).
    manage     Control managed cluster infrastructure.

options:
  -h, --help  show this help message and exit
```
For developing custom jobs, `ow ssh` is great: it starts a pod, connects via SSH, and live-syncs the local CWD into the remote. This allows editing finetuning code locally and testing it immediately.
## General notes

### Job and file IDs are content hashes

The `job_id` is derived from a hash of the job params, which means that if you submit the same job many times, it will only run once. If you resubmit a failed or canceled job, its status is reset to pending.
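Deduplication by params hash can be sketched as hashing a canonical serialization of the params; the scheme below (sorted-key JSON plus SHA-256) is an illustrative assumption, not the exact algorithm openweights uses:

```python
import hashlib
import json

def job_id(params: dict) -> str:
    # Canonicalize with sorted keys so semantically equal params
    # always produce the same hash, regardless of insertion order.
    canonical = json.dumps(params, sort_keys=True)
    return "job-" + hashlib.sha256(canonical.encode()).hexdigest()[:16]

a = job_id({"model": "unsloth/Qwen3-4B", "epochs": 1})
b = job_id({"epochs": 1, "model": "unsloth/Qwen3-4B"})
print(a == b)  # same params in any order give the same id, so the job runs once
```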
## Citation

Originally created by Niels Warncke (@nielsrolf). If you find this repo useful for your research and want to cite it, you can do so via:

```bibtex
@misc{warncke_openweights_2025,
  author       = {Niels Warncke},
  title        = {OpenWeights},
  howpublished = {\url{https://github.com/longtermrisk/openweights}},
  note         = {Commit abcdefg • accessed DD Mon YYYY},
  year         = {2025}
}
```