NVIDIA GPU inspection and synthetic benchmark CLI for GPUBoost.

These details have not been verified by PyPI

Project links

Project description

GPUBoost 0.2.0

GPUBoost is a local-first Python CLI for inspecting NVIDIA GPU environments, running bounded synthetic benchmarks, analyzing PyTorch scripts, and preparing reviewable optimization advice.

Release status. GPUBoost 0.2.0 includes human-approved agentic optimization: it proposes an exact deterministic diff, requires explicit approval, applies only approved deterministic edits, creates backups, validates the result, and can roll back automatically. Model output cannot directly modify files, unapproved patching is forbidden, Git commits/pushes are never automatic, and fully unattended autonomous patching is not supported.

PyPI: https://pypi.org/project/gpuboost/
Repository: https://github.com/its-arnavtech/GPU-Boost
License: MIT License; see LICENSE
Maturity: pre-alpha local tooling. Useful today, but performance results are hardware and workload specific.

What It Does

Reports Python, OS, CPU, RAM, NVIDIA GPU, CUDA, cuDNN, and PyTorch details.
Runs quick synthetic benchmark suites for matrix multiplication, mixed precision, batch-size sweeps, and DataLoader behavior.
Produces deterministic rule-based recommendations from benchmark metrics.
Statically analyzes PyTorch-like source files without importing or executing them.
Builds reviewable patch plans and unified diffs for conservative suggestions.
Validates generated patch plans in temporary trial workspaces when requested.
Prepares approval-gated agentic optimization runs that can apply selected deterministic edits only after explicit human approval.
Compares supported GPUBoost benchmark JSON files.
Stores local history only when explicitly requested.
Provides local dataset, model artifact, and advisory model workflows.
Describes synthetic real-world demo workloads that use local data only.

What It Does Not Do

No automatic patch application to original source files; no unapproved patch application is allowed.
Approved source edits require an immutable plan digest, the original file hash, and an explicit human confirmation phrase.
No guaranteed speedups.
Model output is advisory; it cannot apply patches or approve changes.
Deterministic checks remain authoritative.
patch_application_allowed=false is part of the model safety contract.
GPU/CUDA is optional for core workflows such as help, doctor, static analysis, compare, history, demo discovery, and safety checks.
It does not call external APIs, upload source, download datasets, train models, or run heavy benchmark grids during normal use.

Installation

Base install:

pip install gpuboost

Full ML/benchmark extras:

pip install "gpuboost[all]"

Targeted extras:

pip install "gpuboost[benchmark]"
pip install "gpuboost[model]"

The base package depends on psutil, nvidia-ml-py, and rich. The benchmark, model, and all extras add PyTorch and NumPy.

Quickstart

These commands were validated during the post-release audit:

python -m gpuboost --version
python -m gpuboost doctor --json
python -m gpuboost info --json
python -m gpuboost analyze examples\bad_train_sample.txt --json
python -m gpuboost agent optimize examples\bad_train_sample.txt --json
python -m gpuboost agent optimize examples\bad_train_sample.txt --trial --json
python -m gpuboost benchmark --quick --json --recommend
python -m gpuboost demo real-world-info --json
python -m gpuboost demo real-world-pairs --json

examples/bad_train_sample.txt is intentionally a .txt file so formatters and linters do not treat it as project Python code.

Core Workflows

Inspect the environment:

python -m gpuboost doctor
python -m gpuboost doctor --json
python -m gpuboost info
python -m gpuboost info --json

Analyze a script:

python -m gpuboost analyze train.py
python -m gpuboost analyze train.py --json
python -m gpuboost analyze train.py --patch

Run the deterministic agent:

python -m gpuboost agent optimize train.py --json
python -m gpuboost agent optimize train.py --trial --json
python -m gpuboost agent optimize train.py --save-history

Prepare a human-approved agentic optimization run:

python -m gpuboost agent optimize train.py --prepare
python -m gpuboost agent show-plan <run_id>
python -m gpuboost agent approve <run_id> --confirm "APPLY <plan>"
python -m gpuboost agent apply <run_id>
python -m gpuboost agent rollback <run_id>

This lifecycle is included in GPUBoost 0.2.0.

--prepare writes only an ignored .gpuboost/runs/<run_id>.json audit record and never modifies source.
agent show-plan displays the exact reviewable diff before any approval.
agent approve is mandatory: it records a confirmation phrase bound to the immutable plan digest, the original file hash, and the selected action IDs.
agent apply re-validates the original SHA-256 hash, creates a backup, applies only approved deterministic edits, validates the result, and rolls back automatically if validation fails.
agent rollback restores the pre-application backup and verifies the restored hash matches the original.
Model output is advisory only and can never modify files directly.
No Git commit or push is ever performed automatically.
Unattended or autonomous patching is not supported; a human must approve every applied change.

Compare supported GPUBoost benchmark JSON files:

python -m gpuboost compare baseline.json optimized.json
python -m gpuboost compare baseline.json optimized.json --json

Use local history:

python -m gpuboost history list
python -m gpuboost history show <run_id>
python -m gpuboost history compare <baseline_run_id> <optimized_run_id>

Use dataset and model commands:

python -m gpuboost dataset collect-outcomes data\gpuboost\experiments\pairs.json
python -m gpuboost model evaluate-baselines --json
python -m gpuboost model train-neural --json
python -m gpuboost model list-artifacts
python -m gpuboost model safety-check --json

Model training is experimental and local. Saved model artifacts are generated files and remain advisory only.

The model workflow uses safe training data loading, safe feature extraction, baseline model comparison, MLP training, explicit artifact save steps, direct artifact prediction, and advisory-only agent integration. It does not fine-tune an LLM and does not call external services.

Use demo discovery:

python -m gpuboost demo real-world-info --json
python -m gpuboost demo real-world-pairs --json

Demo workloads use synthetic local data and are not proof that a change will speed up a production workload.

Published 0.1.2 Audit Evidence

Validated on one local machine.

Fresh audit evidence is in docs/post-release-audit.md. The table below is the published 0.1.2 post-release audit evidence. It is hardware/workload specific and should not be generalized.

Item	Result
GPUBoost version	0.1.2
Python	3.12.10
OS	Windows 11
GPU	NVIDIA GeForce RTX 4060 Laptop GPU, 8188 MB VRAM
CUDA available	true
PyTorch	2.11.0+cu128
Ruff	all checks passed
Pytest	1044 passed
Static analysis sample	5 findings
Agent optimize sample	5 completed actions
Agent trial sample	passed; original file unchanged
Trial syntax-check duration	0.024519 seconds
Quick benchmark sections	4 successful sections
Best FP32 matmul	7.167453 TFLOPS
Best FP16 matmul	31.337318 TFLOPS
FP16/FP32 matmul ratio	4.372169x
AMP synthetic throughput	24402.147 samples/sec
FP32 synthetic throughput	15938.796 samples/sec
AMP synthetic ratio	1.530991x
Best batch sweep result	batch size 8, 2607.100 images/sec
CNN demo baseline	353.727 samples/sec
CNN demo optimized	862.134 samples/sec

The benchmark and demo results used synthetic or controlled data on one laptop. They are evidence that the commands ran successfully on that machine, not a promise that other workloads will see the same ratios.

Current 0.2.0 Validation

Current release validation after the human-approved agentic apply changes:

Item	Result
Package version	0.2.0
Ruff	all checks passed
Pytest	1060 passed, 1 skipped
Expected skip	Windows symlink privilege-dependent test

The published 0.1.2 audit remains above as historical evidence for the prior review-first release. The 0.2.0 validation count is separate and includes the human-approved agentic apply workflow.

Safety Model

Patch suggestions are reviewable diffs.
Trial mode applies patch plans only to temporary copies.
The original source file is not modified by trial mode.
Human-approved agentic optimization never mutates source during --prepare.
agent apply requires a persisted approval tied to the plan digest and original file hash, creates a pre-application backup, validates the result, and rolls back automatically on validation or acceptance-policy failure.
No unapproved patch application is allowed.
Model output cannot directly modify files; only approved deterministic plan edits are ever written.
No Git commit or push is performed automatically by any agent command.
Unattended autonomous patching is not supported; every applied change requires an explicit human confirmation phrase.
Static analysis parses code without importing or executing the target script.
Raw diffs and trial stdout/stderr are redacted from agent JSON by default.
Model output is advisory and cannot override tests, trials, or benchmark evidence.
Advisory-only model output is a triage aid only.
Deterministic checks authoritative: deterministic GPUBoost checks remain authoritative.
Generated raw data, generated datasets, local DBs, model weights, build artifacts, temp validation environments, and local reports are ignored.
Generated artifacts ignored: generated artifacts are ignored by default.

Documentation

Limitations

Synthetic benchmarks are signals, not production performance proof.
GPU results vary with GPU model, driver, CUDA/PyTorch versions, thermals, power mode, background load, and workload shape.
The local model workflow is experimental and has no bundled production model.
Real-world demo JSON currently records useful workload metrics, but gpuboost compare did not compare those demo JSON files during the audit.
Raw third-party intake data is intentionally ignored and should be reviewed before being kept in or copied from a local checkout.
PowerShell Tee-Object can write UTF-16 JSON on some systems; use UTF-8 when saving JSON intended for gpuboost compare.
Agent benchmark acceptance policies require an explicit benchmark command that emits JSON metrics such as speedup_percent or regression_percent.

Development

python -m ruff check .
python -m pytest -q
python -m build
python -m twine check --strict dist/*

Do not commit generated artifacts, raw intake data, model weights, local databases, .env files, temp validation environments, or build outputs.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2.0

Jun 25, 2026

0.1.2

Jun 24, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gpuboost-0.2.0.tar.gz (362.6 kB view details)

Uploaded Jun 25, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

gpuboost-0.2.0-py3-none-any.whl (215.8 kB view details)

Uploaded Jun 25, 2026 Python 3

File details

Details for the file gpuboost-0.2.0.tar.gz.

File metadata

Download URL: gpuboost-0.2.0.tar.gz
Upload date: Jun 25, 2026
Size: 362.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for gpuboost-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`61b8d3501694ffea4029ebb984ddb3c7686cccc9385801750bb7bb3113cc4ea5`
MD5	`f5db3ca7d79575954bd9638b64365924`
BLAKE2b-256	`20c16abdecd7f49a72d313be4c2faaeb0858c6e54a41de88e2e45bcc78163b4d`

See more details on using hashes here.

File details

Details for the file gpuboost-0.2.0-py3-none-any.whl.

File metadata

Download URL: gpuboost-0.2.0-py3-none-any.whl
Upload date: Jun 25, 2026
Size: 215.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for gpuboost-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bd6a2b80886d7637e904e7d19568363c6f187b9f68cdaf3fc30d36a174f8992f`
MD5	`a0474c16c0c0f00967fd3fb974ee1fbe`
BLAKE2b-256	`6fda7232daaa264bbc667c507af353fd6570e4fa88c9fe49bad3adb7cb3b5238`

See more details on using hashes here.

gpuboost 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

GPUBoost 0.2.0

What It Does

What It Does Not Do

Installation

Quickstart

Core Workflows

Published 0.1.2 Audit Evidence

Current 0.2.0 Validation

Safety Model

Documentation

Limitations

Development

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes