Engineer-first training calibration: estimate VRAM fit, profile short runs, and pick GPU configs under real budget constraints.

alloc

Find and fix training bottlenecks. Zero code changes.

pip install alloc
alloc run python train.py

alloc v0.0.2 — Calibrate

 Run Summary
  Peak VRAM       31.2 GB / 40.0 GB (A100)
  VRAM used       78.0%
  Avg GPU util    72.3%
  Avg power       287 W
  Duration        24.1s (auto-stopped: metrics stable at 18.2s)
  Step time       148.5 ms (p50) / 152.1 ms (p90)
  Throughput      42.3 samples/sec

  Artifact: alloc_artifact.json.gz

That's it. No decorators, no config files, no code changes. Alloc wraps your command, profiles GPU usage, and tells you what's wrong.
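The run above leaves an artifact named alloc_artifact.json.gz. A minimal sketch for inspecting it with the Python standard library, assuming (from the file name only) that it is gzip-compressed JSON. The artifact's schema is not documented here, so the helper names `load_artifact` and `describe` are illustrative and the sketch only lists top-level keys rather than guessing at field names:

```python
import gzip
import json


def load_artifact(path):
    """Load a gzip-compressed JSON file and return the parsed object."""
    with gzip.open(path, "rt", encoding="utf-8") as fh:
        return json.load(fh)


def describe(obj):
    """Return the sorted top-level keys of a JSON object, or its type name."""
    if isinstance(obj, dict):
        return sorted(obj)
    return type(obj).__name__
```

Calling `describe(load_artifact("alloc_artifact.json.gz"))` is a reasonable first step before writing any code that depends on specific fields.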

What you get

alloc diagnose reads your training script and tells you exactly what to change:

alloc diagnose train.py

alloc diagnose — 3 findings in train.py

 CRITICAL  DL005 — DataLoader running in main thread
   train.py:47   num_workers=0 → num_workers=8
   num_workers=0 loads data in the main thread, blocking GPU computation entirely.
   Expected impact: ~30-50% faster training with parallel data loading

 WARNING   PREC002 — Using fp16, consider bf16
   train.py:56   dtype: float16 → dtype: bfloat16
   H100 supports bf16 natively — eliminates loss scaling overhead.
   Expected impact: ~5-10% speedup, eliminates GradScaler complexity

 INFO      THRU001 — cudnn.benchmark not enabled
   Add: torch.backends.cudnn.benchmark = True
   Expected impact: ~5-10% speedup for fixed-size inputs

Summary: 1 critical, 1 warning, 1 info
Run with --diff to generate patches | --json for CI output

alloc ghost estimates VRAM before you launch:

alloc ghost train.py --dtype bf16

 Ghost Scan — 7.0B params (bf16)

  Model weights       13.04 GB
  Gradients           13.04 GB
  Optimizer (Adam)    78.23 GB
  Activations (est.)   0.50 GB
  Buffer (10%)        10.48 GB

  Total VRAM         115.28 GB
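
The breakdown above can be reproduced with simple per-parameter arithmetic. This is a sketch under inferred assumptions, not Alloc's actual estimator: 2 bytes per parameter for bf16 weights and gradients, 12 bytes per parameter for Adam (an fp32 master copy plus two fp32 moments), a fixed activation estimate, a 10% buffer on the subtotal, and binary gigabytes (1024³ bytes). The function name and its parameters are hypothetical:

```python
GIB = 1024 ** 3  # the sample figures match binary gigabytes


def estimate_vram_gib(params, weight_bytes=2, grad_bytes=2,
                      optimizer_bytes=12, activations_gib=0.5,
                      buffer_frac=0.10):
    """Estimate training VRAM in GiB from a parameter count.

    Defaults are assumptions chosen to match the sample output:
    bf16 weights/gradients and 12 bytes/param of Adam state.
    """
    weights = params * weight_bytes / GIB
    gradients = params * grad_bytes / GIB
    optimizer = params * optimizer_bytes / GIB
    subtotal = weights + gradients + optimizer + activations_gib
    buffer = subtotal * buffer_frac
    return {
        "weights": weights,
        "gradients": gradients,
        "optimizer": optimizer,
        "activations": activations_gib,
        "buffer": buffer,
        "total": subtotal + buffer,
    }
```

With `params=7.0e9` the per-line figures round to the values shown in the scan, and the total agrees to within display rounding. The optimizer state dominates, which is why the total is far larger than the weights alone.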

alloc scan ranks GPU configs without a GPU:

alloc scan --model llama-3-70b --gpu H100-80GB --num-gpus 8

Works with everything

Alloc wraps your launch command. No framework-specific setup required.

alloc run python train.py
alloc run torchrun --nproc_per_node=4 train.py
alloc run accelerate launch train.py
alloc run srun python train.py           # Slurm
alloc run ray job submit -- python train.py

Multi-GPU detection is automatic (discovers all GPUs in the process tree).
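
To make the wrapping idea concrete, here is a minimal sketch of an external monitor: it launches a command as a child process, samples a metric from outside that process, and stops once recent samples are stable. This is not Alloc's implementation. The names `is_stable` and `run_until_stable`, the window size, the tolerance, and the stability rule are all assumptions, and the metric source is a caller-supplied function rather than a real GPU query:

```python
import subprocess
import sys
import time
from collections import deque


def is_stable(samples, rel_tol=0.05):
    """True when the spread of the samples is within rel_tol of their mean."""
    mean = sum(samples) / len(samples)
    if mean == 0:
        return max(samples) == min(samples)
    return (max(samples) - min(samples)) / abs(mean) <= rel_tol


def run_until_stable(cmd, sample_metric, window=5, interval=0.05, rel_tol=0.05):
    """Run cmd, sample a metric externally, and stop once it is stable.

    Returns (exit_code, samples, stopped_early).
    """
    recent = deque(maxlen=window)
    samples = []
    stopped_early = False
    proc = subprocess.Popen(cmd)
    try:
        while proc.poll() is None:
            try:
                value = float(sample_metric())
            except Exception:
                # A failing sampler must never take the wrapped job down.
                value = None
            if value is not None:
                samples.append(value)
                recent.append(value)
                if len(recent) == window and is_stable(recent, rel_tol):
                    stopped_early = True
                    proc.terminate()
                    break
            time.sleep(interval)
    finally:
        proc.wait()
    return proc.returncode, samples, stopped_early


if __name__ == "__main__":
    # Stand-in workload and a constant fake metric, so this runs without a GPU.
    child = [sys.executable, "-c", "import time; time.sleep(30)"]
    print(run_until_stable(child, lambda: 72.3, window=3))
```

Because the monitor only observes the child from outside, the training script itself needs no changes, and sampler errors are swallowed rather than propagated.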

Deeper signals (optional)

Add a one-line callback for step-level timing:

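# HuggingFace
from transformers import Trainer
from alloc import HuggingFaceCallback
trainer = Trainer(..., callbacks=[HuggingFaceCallback()])

# Lightning
from lightning.pytorch import Trainer  # or pytorch_lightning, depending on your install
from alloc import LightningCallback
trainer = Trainer(..., callbacks=[LightningCallback()])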

This unlocks step time p50/p90, throughput, and dataloader bottleneck detection.
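
For readers unfamiliar with these metrics, the sketch below shows one common way to derive p50/p90 step time and throughput from a list of per-step durations. It uses the nearest-rank percentile definition, which may differ from whatever method Alloc uses. The function names and the `batch_size` parameter are illustrative:

```python
import math


def percentile(values, pct):
    """Nearest-rank percentile of a non-empty list of numbers."""
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def summarize_steps(step_ms, batch_size):
    """Summarize step durations (milliseconds) for a fixed batch size."""
    total_seconds = sum(step_ms) / 1000.0
    return {
        "p50_ms": percentile(step_ms, 50),
        "p90_ms": percentile(step_ms, 90),
        "samples_per_sec": batch_size * len(step_ms) / total_seconds,
    }
```

A wide gap between p50 and p90 usually points to intermittent stalls, for example a data loader that cannot keep up, rather than uniformly slow steps.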

All commands

Command	What it does
`alloc run <cmd>`	Profile a training run (auto-stops when stable)
`alloc diagnose <script>`	AST analysis with specific fix suggestions
`alloc ghost <script>`	Estimate VRAM before launching
`alloc scan --model <name>`	Rank GPU configs remotely (no GPU needed)
`alloc catalog list`	Browse 13 GPUs with specs and pricing
`alloc init`	Configure GPU fleet and budget (`.alloc.yaml`)
`alloc login`	Authenticate for dashboard + auto-upload

Every command supports --json for CI/CD integration.
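
A minimal sketch of consuming that JSON from a CI script. The output schema is not documented here, and it is not stated whether the exit code reflects findings, so the helper (a hypothetical `run_json`) only parses stdout and returns it alongside the exit code. Inspect the structure before asserting on any field:

```python
import json
import subprocess
import sys


def run_json(cmd):
    """Run a command, parse its stdout as JSON, and return (exit_code, data)."""
    proc = subprocess.run(cmd, capture_output=True, text=True)
    data = json.loads(proc.stdout) if proc.stdout.strip() else None
    return proc.returncode, data


if __name__ == "__main__":
    # Stand-in command so the sketch runs without Alloc installed.
    # In CI this would be something like ["alloc", "diagnose", "train.py", "--json"].
    demo = [sys.executable, "-c", "import json; print(json.dumps({'ok': True}))"]
    print(run_json(demo))
```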

Dashboard

alloc login --browser
alloc run python train.py    # auto-uploads when logged in

Dashboard at alloclabs.com

Design principles

Zero config — alloc run python train.py works out of the box
Never crash training — all Alloc failures are caught silently
No monkey-patching — external monitoring only, deeper signals opt-in
Local-first — works in air-gapped environments, no internet required

Release history

Version	Released
0.0.16	Mar 20, 2026
0.0.15	Mar 19, 2026
0.0.14	Mar 17, 2026
0.0.13	Mar 17, 2026
0.0.12	Mar 17, 2026
0.0.11	Mar 17, 2026
0.0.10	Mar 16, 2026
0.0.9	Mar 14, 2026
0.0.8	Mar 14, 2026
0.0.7	Mar 14, 2026
0.0.6	Mar 14, 2026
0.0.5	Mar 9, 2026
0.0.4	Mar 9, 2026
0.0.3 (this version)	Feb 22, 2026
0.0.1	Feb 21, 2026

Download files

Source Distribution

alloc-0.0.3.tar.gz (108.8 kB)

Uploaded Feb 22, 2026 Source

Built Distribution

alloc-0.0.3-py3-none-any.whl (84.9 kB)

Uploaded Feb 22, 2026 Python 3

File details

Details for the file alloc-0.0.3.tar.gz.

File metadata

Download URL: alloc-0.0.3.tar.gz
Upload date: Feb 22, 2026
Size: 108.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for alloc-0.0.3.tar.gz
Algorithm	Hash digest
SHA256	`5f5edb23a0baec296a238f9a1fbc0039c62846b8a04d6c522b5e3b0b3ca7c7ca`
MD5	`e6ac01e703d1a1b144c13d7d0853b141`
BLAKE2b-256	`1652d48afb6698525ba9ab6fa179f1bf1f968ad744ae9d4307f4e65af2effb9e`

Provenance

The following attestation bundles were made for alloc-0.0.3.tar.gz:

Publisher: publish-pypi.yml on alloc-labs/platform

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: alloc-0.0.3.tar.gz
- Subject digest: 5f5edb23a0baec296a238f9a1fbc0039c62846b8a04d6c522b5e3b0b3ca7c7ca
- Sigstore transparency entry: 976718985
- Sigstore integration time: Feb 22, 2026
Source repository:
- Permalink: alloc-labs/platform@96591b48ea384fd716d8f3486f3dcbd9bf9376ef
- Branch / Tag: refs/tags/alloc-v0.0.3
- Owner: https://github.com/alloc-labs
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@96591b48ea384fd716d8f3486f3dcbd9bf9376ef
- Trigger Event: push

File details

Details for the file alloc-0.0.3-py3-none-any.whl.

File metadata

Download URL: alloc-0.0.3-py3-none-any.whl
Upload date: Feb 22, 2026
Size: 84.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for alloc-0.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7238d662601c6a5d7d91658aab60e02e386065a3d4ec8d277993c0719cc6fc5c`
MD5	`94d3fe7281f52fe541afd69a17d2359a`
BLAKE2b-256	`3a96ef7976fc411959522580fe42a460494b23bafa9414eec8b494f30c70f255`

Provenance

The following attestation bundles were made for alloc-0.0.3-py3-none-any.whl:

Publisher: publish-pypi.yml on alloc-labs/platform

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: alloc-0.0.3-py3-none-any.whl
- Subject digest: 7238d662601c6a5d7d91658aab60e02e386065a3d4ec8d277993c0719cc6fc5c
- Sigstore transparency entry: 976718986
- Sigstore integration time: Feb 22, 2026
Source repository:
- Permalink: alloc-labs/platform@96591b48ea384fd716d8f3486f3dcbd9bf9376ef
- Branch / Tag: refs/tags/alloc-v0.0.3
- Owner: https://github.com/alloc-labs
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@96591b48ea384fd716d8f3486f3dcbd9bf9376ef
- Trigger Event: push
