swm-gpu

One CLI to search, provision, and manage cloud GPUs across 10 providers

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

swmgpu

These details have not been verified by PyPI

Project links

Project description

swm

One CLI to rule all GPU clouds.
Search pricing across 10 providers, spin up a GPU in seconds, sync your workspace, and track every dollar.

$ swm gpus -g h200 --max-price 4

  Live GPU Availability
┏━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━┓
┃ Provider ┃ GPU              ┃ VRAM   ┃ $/hr     ┃ Stock   ┃
┡━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━┩
│ vastai   │ NVIDIA H200      │ 141 GB │ $2.89/hr │ 12 avl  │
│ runpod   │ NVIDIA H200      │ 141 GB │ $3.49/hr │ High    │
│ lambda   │ NVIDIA H200      │ 141 GB │ $3.99/hr │ 4 avl   │
│ vultr    │ NVIDIA H200      │ 141 GB │ $3.88/hr │ 8 avl   │
└──────────┴──────────────────┴────────┴──────────┴─────────┘

Install

# macOS (Homebrew)
brew tap swm-gpu/swm && brew install swm

# Python (3.11+)
pipx install swm-gpu

# From source
git clone https://github.com/swm-gpu/swm.git && cd swm && pip install -e .

Quick Start

# 1. Add your API key
swm config set runpod.api_key <your-key>

# 2. Find a GPU (the table tells you each GPU's minimum CUDA)
swm gpus -g h200

# 3. Create a pod (auto-picks a CUDA-compatible image; auto-saves to storage)
swm pod create -p runpod -g h200 -n my-session --cuda 12.8

# 4. Install a framework
swm setup install vllm runpod:<id>

# 5. Done — pushes workspace to storage and terminates
swm pod down runpod:<id>

Or just ask your agent

Don't want to learn the CLI? Install the SKILL.md and your AI agent manages GPUs for you:

# Universal (works with Cursor, Copilot, Windsurf, Amp, Devin)
mkdir -p .agents/skills/swm-gpu-workflow
curl -sL https://raw.githubusercontent.com/swm-gpu/swm/main/.agents/skills/swm-gpu-workflow/SKILL.md \
  -o .agents/skills/swm-gpu-workflow/SKILL.md

Works with Cursor, Claude Code, Codex, Copilot, Windsurf, Amp, Devin, and any agent that can run shell commands.

Supported Providers

Provider	GPU Search	Provision	Stop/Resume	Billing API
RunPod	Live	Yes	Yes	Full
Vast.ai	Live	Yes	Yes	Full
Lambda Labs	Live	Yes	—	—
Vultr	Live	Yes	Yes	—
TensorDock	Live	Yes	Yes	—
FluidStack	Live	Yes	Yes	—
AWS (EC2)	Live	Yes	Yes	—
GCP (Compute)	Live	Yes	Yes	—
Azure	Live	Yes	Yes	—
CoreWeave	Live	Yes	Yes	—

Key Features

GPU Search & Provisioning

swm gpus                            # all GPUs, all providers
swm gpus -g h200 -c 4              # 4×H200 configs
swm gpus --max-price 4 --secure    # under $4/hr, certified clouds
swm images list -p runpod --cuda 12.8  # see compatible Docker images
swm pod create -p runpod -g h200 -n train --cuda 12.8
swm pod down runpod:<id>            # sync + terminate

swm gpus reports each GPU's minimum CUDA toolkit. Pass --cuda <major.minor> to swm pod create to auto-pick the newest provider image that satisfies it.

Workspace Sync

Your /workspace directory follows you across clouds via S3-compatible storage (Backblaze B2, Amazon S3, Google GCS).

swm sync pull runpod:<id>           # storage → pod
swm sync push runpod:<id>           # pod → storage (incremental)
swm sync push runpod:<id> --delete  # also remove files deleted locally
swm sync watch runpod:<id>          # filesystem change watcher
swm sync auto runpod:<id>           # background daemon: push every 60s

Three-tier smart sync: inotify watcher tracks changes, incremental push uploads only what changed, tar mode packs 600k small files into one S3 object.

Continuous auto-sync. swm pod create starts an auto-sync daemon by default — it tails the watcher log and pushes every 60s with no manual intervention. Adopt an existing pod with swm setup workspace <pod> if you created it with --no-storage or the bootstrap was interrupted.

Frameworks

swm setup install vllm runpod:<id>       # vLLM inference server
swm setup install open-webui runpod:<id> # Open WebUI chat interface
swm setup install comfyui runpod:<id>    # ComfyUI image generation
swm setup install axolotl runpod:<id>    # Axolotl fine-tuning
swm setup install ollama runpod:<id>     # Ollama model runner
swm setup install swarmui runpod:<id>    # SwarmUI
swm setup install llm-studio runpod:<id> # H2O LLM Studio

Auto-detects GPU count for tensor parallelism, opens SSH tunnels for unexposed ports, probes health endpoints.

Lifecycle Guard

Monitors SSH sessions, GPU utilization, filesystem writes, and active processes. If nothing's happening, it saves your workspace and terminates the pod.

swm pod create -p runpod -g h200 -n train \
  --lifecycle auto-down --idle-timeout 30   # bake the policy into create
swm guard set runpod:<id> --mode auto-down --idle-timeout 30
swm guard list

No more $96 overnight H100 bills.

Cost Tracking

swm costs live                      # running cost right now
swm costs summary                   # spending breakdown
swm costs reconcile                 # verify against provider billing APIs
swm costs budget set 100            # $100/month alert

Model Management

swm models search qwen3                         # search HuggingFace Hub
swm models info civitai:101055                  # inspect HF / Civitai refs
swm models pull runpod:<id> Qwen/Qwen3-8B       # HuggingFace repo
swm models pull runpod:<id> deepseek-r1:14b     # Ollama ref
swm models pull runpod:<id> civitai:101055 --as checkpoint
swm models pull runpod:<id> https://example.com/style.safetensors --as lora
swm models list runpod:<id> --all               # tracked + untracked files
swm models link runpod:<id> /workspace/foo.safetensors --as lora
swm setup start vllm runpod:<id> --model Qwen/Qwen3-8B

Downloads land in a unified on-pod model store at /workspace/models/. Framework installs wire their expected paths into that store: ComfyUI and SwarmUI get bucket-style directories (checkpoints/, loras/, vae/, diffusion_models/, text_encoders/, ...), vLLM uses the shared HF cache, and Ollama uses the shared Ollama store. Every pull/link is recorded in /workspace/models/.manifest.json, so swm models list can show tracked, missing, and unmanaged files.

For gated repos or restricted Civitai models:

swm config set hf.api_key <huggingface-token>
swm config set civitai.api_key <civitai-token>

How It Works

Everything happens over SSH. No agents on the pod. No custom images. No webhooks.

┌──────────┐       SSH        ┌─────────────┐       S3 API      ┌───────────┐
│ Your Mac │ ───────────────> │  GPU Pod    │ ────────────────> │ B2 / S3   │
│   swm    │  exec, scp      │  (any       │  s5cmd sync       │ / GCS     │
│          │ <─────────────── │   provider) │ <──────────────── │(workspace)│
└──────────┘                  └─────────────┘                   └───────────┘

Credentials are never stored on the pod. Storage keys are passed as transient environment variables per command.

Security

SSH key authentication only — no passwords stored anywhere
No credentials on pods — storage keys passed transiently, never written to disk
Non-destructive by default — sync push, sync pull, and pod down never remove files from your storage bucket. Deletions are opt-in (sync push --delete) and the auto-sync daemon refuses to start unless a prior pull/push has confirmed pod ↔ bucket are in sync
Secure cloud default — swm pod create defaults to SOC 2 / HIPAA certified data centers

Documentation

Full docs at swmgpu.com.

Page	Description
Getting Started (CLI)	Install and create your first pod in 5 minutes
Getting Started (Agent)	Let your AI agent manage GPUs for you
Configuration	All config keys for providers and storage
Command Reference	Full reference for every swm command
Core Concepts	Providers, workspaces, frameworks, lifecycle guard

Requirements

macOS or Linux
Python 3.11+ (if not using Homebrew binary)
SSH client (ssh, scp)
An account with at least one GPU provider

Contributing

Bug reports, feature requests, and pull requests welcome. See CONTRIBUTING.md for scope, code style, and the PR workflow. The community is governed by our Code of Conduct.

Open-ended questions and design discussions belong in GitHub Discussions. Security reports go through private vulnerability reporting — see SECURITY.md.

License

Licensed under the Apache License, Version 2.0.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

swmgpu

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2.12

Jul 28, 2026

0.2.11

Jul 27, 2026

0.2.10

Jul 27, 2026

0.2.9

Jul 19, 2026

0.2.8

Jul 19, 2026

0.2.7

Jul 14, 2026

0.2.6

Jun 17, 2026

0.2.5

May 29, 2026

0.2.4

May 27, 2026

0.2.3

May 25, 2026

0.2.2

May 24, 2026

0.2.1

May 22, 2026

0.2.0

May 21, 2026

0.1.13

May 21, 2026

0.1.12

May 20, 2026

0.1.11

May 18, 2026

0.1.10

May 11, 2026

0.1.9

May 10, 2026

0.1.8

May 4, 2026

0.1.7

May 4, 2026

0.1.6

Apr 28, 2026

0.1.5

Apr 27, 2026

0.1.4

Apr 27, 2026

0.1.2

Apr 26, 2026

0.1.1

Apr 26, 2026

0.1.0

Apr 26, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

swm_gpu-0.2.12.tar.gz (198.2 kB view details)

Uploaded Jul 28, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

swm_gpu-0.2.12-py3-none-any.whl (180.0 kB view details)

Uploaded Jul 28, 2026 Python 3

File details

Details for the file swm_gpu-0.2.12.tar.gz.

File metadata

Download URL: swm_gpu-0.2.12.tar.gz
Upload date: Jul 28, 2026
Size: 198.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.14

File hashes

Hashes for swm_gpu-0.2.12.tar.gz
Algorithm	Hash digest
SHA256	`bf66386faa12a49f1e1c042885a48f5fe8e31f00aed2958bb9b1750f7175aedf`
MD5	`82db0cd626d26b15c75d87f76b8e1698`
BLAKE2b-256	`8159a173a76880a24651445f091bd7b9d3b21f0612584074f7d53debf51a63de`

See more details on using hashes here.

Provenance

The following attestation bundles were made for swm_gpu-0.2.12.tar.gz:

Publisher: release.yml on swm-gpu/swm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: swm_gpu-0.2.12.tar.gz
- Subject digest: bf66386faa12a49f1e1c042885a48f5fe8e31f00aed2958bb9b1750f7175aedf
- Sigstore transparency entry: 2264275526
- Sigstore integration time: Jul 28, 2026
Source repository:
- Permalink: swm-gpu/swm@ed26f28d115dd14cd1e4d4191b83312f5e5d8d0a
- Branch / Tag: refs/tags/v0.2.12
- Owner: https://github.com/swm-gpu
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@ed26f28d115dd14cd1e4d4191b83312f5e5d8d0a
- Trigger Event: push

File details

Details for the file swm_gpu-0.2.12-py3-none-any.whl.

File metadata

Download URL: swm_gpu-0.2.12-py3-none-any.whl
Upload date: Jul 28, 2026
Size: 180.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.14

File hashes

Hashes for swm_gpu-0.2.12-py3-none-any.whl
Algorithm	Hash digest
SHA256	`80ee56a46704bf22207ca54a645f39d46886f9714ef086a81f123492e1306ae2`
MD5	`d1e89e77163c512da69a3946dcd76742`
BLAKE2b-256	`f31a3d6611ada578b1a1e3fb3e9a88c1d75f31ca86d052696fafa76cd599cf69`

See more details on using hashes here.

Provenance

The following attestation bundles were made for swm_gpu-0.2.12-py3-none-any.whl:

Publisher: release.yml on swm-gpu/swm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: swm_gpu-0.2.12-py3-none-any.whl
- Subject digest: 80ee56a46704bf22207ca54a645f39d46886f9714ef086a81f123492e1306ae2
- Sigstore transparency entry: 2264275910
- Sigstore integration time: Jul 28, 2026
Source repository:
- Permalink: swm-gpu/swm@ed26f28d115dd14cd1e4d4191b83312f5e5d8d0a
- Branch / Tag: refs/tags/v0.2.12
- Owner: https://github.com/swm-gpu
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@ed26f28d115dd14cd1e4d4191b83312f5e5d8d0a
- Trigger Event: push

swm-gpu 0.2.12

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Install

Quick Start

Or just ask your agent

Supported Providers

Key Features

GPU Search & Provisioning

Workspace Sync

Frameworks

Lifecycle Guard

Cost Tracking

Model Management

How It Works

Security

Documentation

Requirements

Contributing

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance