Skip to main content

CLI for running GPU workloads, managing remote workspaces, and evaluating/optimizing kernels

Project description

Wafer CLI

Wafer CLI gives coding agents direct access to GPU docs, trace analysis, and remote kernel evaluation. It helps you develop and optimize GPU kernels even when you are not working on a machine with a GPU.

Key features

  • Query GPU documentation with citations
  • Analyze GPU traces and profiles
  • Evaluate kernels on remote GPUs for correctness and performance
  • Run commands on GPU targets (remote or local)
  • Manage persistent workspaces

Quick start

uv tool install wafer-cli
wafer login
wafer remote-run -- nvidia-smi

Common commands

wafer workspaces list
wafer workspaces create my-workspace --wait
wafer agent -t ask-docs --corpus cuda "What causes shared memory bank conflicts?"
wafer agent -t trace-analyze --args trace=./profile.ncu-rep "Why is this kernel slow?"
wafer evaluate --impl kernel.py --reference ref.py --test-cases tests.json --benchmark
wafer nvidia ncu analyze profile.ncu-rep
wafer corpus list

Typical workflows

Query GPU documentation

Download a documentation corpus and ask questions with citations.

wafer corpus download cuda
wafer agent -t ask-docs --corpus cuda "What causes shared memory bank conflicts?"

Analyze performance traces

Use the trace analysis template or query trace data directly.

wafer agent -t trace-analyze --args trace=./profile.ncu-rep "Why is this kernel slow?"
wafer nvidia perfetto query trace.json \
  "SELECT name, dur/1e6 as ms FROM slice WHERE cat='kernel' ORDER BY dur DESC LIMIT 10"

Evaluate kernels on remote GPUs

Run correctness and performance checks on a remote target.

wafer evaluate \
  --impl ./kernel.py \
  --reference ./reference.py \
  --test-cases ./tests.json \
  --benchmark

Run commands on a remote GPU

wafer remote-run -- nvidia-smi
wafer remote-run --upload-dir ./my_code -- python3 train.py

Manage workspaces

wafer workspaces list
wafer workspaces create my-workspace --wait
wafer workspaces ssh <workspace-id>
wafer workspaces delete <workspace-id>

Install the CLI skill (optional)

wafer skill install
# or
wafer skill install -t <claude/codex>

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

wafer_cli-0.2.28.tar.gz (244.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

wafer_cli-0.2.28-py3-none-any.whl (225.1 kB view details)

Uploaded Python 3

File details

Details for the file wafer_cli-0.2.28.tar.gz.

File metadata

  • Download URL: wafer_cli-0.2.28.tar.gz
  • Upload date:
  • Size: 244.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for wafer_cli-0.2.28.tar.gz
Algorithm Hash digest
SHA256 8dea807624c1ce17143d34606fcc684d27b7efd71ea3e2e080523af416dd2efe
MD5 c829169b87ed4f40b31a2dd6cd2dfd9d
BLAKE2b-256 fb38a30c3138050e79349ece43ec745460a1a7e5ad682d4bce0fd95cf4b30335

See more details on using hashes here.

File details

Details for the file wafer_cli-0.2.28-py3-none-any.whl.

File metadata

  • Download URL: wafer_cli-0.2.28-py3-none-any.whl
  • Upload date:
  • Size: 225.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for wafer_cli-0.2.28-py3-none-any.whl
Algorithm Hash digest
SHA256 622f94b2b845f2746ca62b7024f25ae0f01244b66f2ee21e37fa9c123d8936d5
MD5 139c4e71acb9ad112376f24140dc710e
BLAKE2b-256 f068354187559f47ee2200a7f8dfad6bb95dddd421eaee9f281e47831362ea9e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page