Skip to main content

CLI for running GPU workloads, managing remote workspaces, and evaluating/optimizing kernels

Project description

Wafer CLI

Wafer CLI gives coding agents direct access to GPU docs, trace analysis, and remote kernel evaluation. It helps you develop and optimize GPU kernels even when you are not working on a machine with a GPU.

Key features

  • Query GPU documentation with citations
  • Analyze GPU traces and profiles
  • Evaluate kernels on remote GPUs for correctness and performance
  • Run commands on GPU targets (remote or local)
  • Manage persistent workspaces

Quick start

uv tool install wafer-cli
wafer login
wafer remote-run -- nvidia-smi

Common commands

wafer workspaces list
wafer workspaces create my-workspace --wait
wafer agent -t ask-docs --corpus cuda "What causes shared memory bank conflicts?"
wafer agent -t trace-analyze --args trace=./profile.ncu-rep "Why is this kernel slow?"
wafer evaluate --impl kernel.py --reference ref.py --test-cases tests.json --benchmark
wafer nvidia ncu analyze profile.ncu-rep
wafer corpus list

Typical workflows

Query GPU documentation

Download a documentation corpus and ask questions with citations.

wafer corpus download cuda
wafer agent -t ask-docs --corpus cuda "What causes shared memory bank conflicts?"

Analyze performance traces

Use the trace analysis template or query trace data directly.

wafer agent -t trace-analyze --args trace=./profile.ncu-rep "Why is this kernel slow?"
wafer nvidia perfetto query trace.json \
  "SELECT name, dur/1e6 as ms FROM slice WHERE cat='kernel' ORDER BY dur DESC LIMIT 10"

Evaluate kernels on remote GPUs

Run correctness and performance checks on a remote target.

wafer evaluate \
  --impl ./kernel.py \
  --reference ./reference.py \
  --test-cases ./tests.json \
  --benchmark

Run commands on a remote GPU

wafer remote-run -- nvidia-smi
wafer remote-run --upload-dir ./my_code -- python3 train.py

Manage workspaces

wafer workspaces list
wafer workspaces create my-workspace --wait
wafer workspaces ssh <workspace-id>
wafer workspaces delete <workspace-id>

Install the CLI skill (optional)

wafer skill install
# or
wafer skill install -t <claude/codex>

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

wafer_cli-0.2.26.tar.gz (238.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

wafer_cli-0.2.26-py3-none-any.whl (218.5 kB view details)

Uploaded Python 3

File details

Details for the file wafer_cli-0.2.26.tar.gz.

File metadata

  • Download URL: wafer_cli-0.2.26.tar.gz
  • Upload date:
  • Size: 238.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for wafer_cli-0.2.26.tar.gz
Algorithm Hash digest
SHA256 975e181451441a96ac0d5f759719f41735bca1332ab9248ba5de5282f44ed676
MD5 6ca7311e858786cd5a35cb2aabfed87e
BLAKE2b-256 44b6e79bf9961ea7516ff37e70dd40210521c3632c147e42b08f51ebc3d66fbb

See more details on using hashes here.

File details

Details for the file wafer_cli-0.2.26-py3-none-any.whl.

File metadata

  • Download URL: wafer_cli-0.2.26-py3-none-any.whl
  • Upload date:
  • Size: 218.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for wafer_cli-0.2.26-py3-none-any.whl
Algorithm Hash digest
SHA256 bc27a3276ed0105d40151337ce9c953365d3c005b5f67ca24c9c0f7b4086f5fb
MD5 631c5cea3dacc420949e725f2a448063
BLAKE2b-256 c0b1a04ce30194aeb6115399a4670a982d1451a24607a06d904c7011a733c53e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page