Skip to main content

Active GPU diagnostic tool that identifies performance bottlenecks

Project description

NVSonar

PyPI version Python License Downloads

Active GPU diagnostic tool that identifies performance bottlenecks, detects anomalous patterns over time, and gives you actionable recommendations.

GPU monitoring tools show utilization percentages, but this can be misleading. A GPU reporting 100% utilization may actually be computing useful work, or wastefully stalled waiting on memory transfers, thermal throttling, or power limits. NVSonar analyzes real-time patterns from NVML metrics to identify what's actually limiting your GPU performance.

Features

  • Bottleneck classification (compute-bound, memory-bandwidth-bound, memory-capacity-bound, power-limited, thermal-throttled, data-starved)
  • Temporal pattern detection (clock oscillation, temperature trends, utilization dips, memory leaks)
  • Multi-GPU outlier detection via Z-scores
  • Health scoring with A-F grades (0-100 per GPU)
  • Throttle bitmask decoder with severity levels
  • PCIe link degradation and ECC error monitoring
  • Actionable recommendations with specific commands
  • JSON output for automation and scripting

Example

╭──── GPU 0: NVIDIA GeForce RTX 4090    Health: B (82/100) ────╮
│  GPU utilization       95%                                    │
│  Memory controller     45%                                    │
│  VRAM                  18432MB / 24576MB (75%)                │
│  Clocks                2520 / 2520 MHz                        │
│  Temperature           78C                                    │
│  Power                 380W / 450W (84%)                      │
│  PCIe                  Gen4 x16                               │
│  Throttle              Clock state: Software Power Cap        │
│                                                               │
│  Bottleneck: power_limited (90% confidence)                   │
│  Power draw at 84% of limit, clocks reduced 0%               │
│                                                               │
│  Recommendations:                                             │
│    [P1] GPU is power limited                                  │
│      - Raise power limit: nvidia-smi -pl <watts>              │
│      - Check that all GPU power cables are connected          │
╰───────────────────────────────────────────────────────────────╯

Requirements

  • Python 3.10+
  • NVIDIA GPU with driver installed
  • Linux

Installation and Usage

pip install nvsonar
nvsonar                  # interactive TUI
nvsonar report           # one-shot diagnostic
nvsonar report --json    # structured output for scripts/LLMs
nvsonar report --gpu 0   # specific GPU

License

Apache License 2.0

Author

Bekmukhamed Tursunbayev

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nvsonar-2.0.0.tar.gz (27.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nvsonar-2.0.0-py3-none-any.whl (30.6 kB view details)

Uploaded Python 3

File details

Details for the file nvsonar-2.0.0.tar.gz.

File metadata

  • Download URL: nvsonar-2.0.0.tar.gz
  • Upload date:
  • Size: 27.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for nvsonar-2.0.0.tar.gz
Algorithm Hash digest
SHA256 2b4a90a8a4131a43fb8c5b536b0cba2db29d1b160c7b5cf64144120776759154
MD5 2028ca54c09ab4a558bfd6f634e89ccb
BLAKE2b-256 7c6016ca81a19ee88b5a60a694828ca4a4c1415833941a6c3aa8f23894c24eb6

See more details on using hashes here.

Provenance

The following attestation bundles were made for nvsonar-2.0.0.tar.gz:

Publisher: ci.yml on btursunbayev/nvsonar

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file nvsonar-2.0.0-py3-none-any.whl.

File metadata

  • Download URL: nvsonar-2.0.0-py3-none-any.whl
  • Upload date:
  • Size: 30.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for nvsonar-2.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f132b5a25728d16d061a2d3fff2ef8e0d0a3ae5ce1e15e29e789860117c16f80
MD5 47e5263c007056bea7adf4f648089215
BLAKE2b-256 b3cc01d099d7026cd8623f6b0096d3031a99f71a07afddb39b83098c6a4e83f0

See more details on using hashes here.

Provenance

The following attestation bundles were made for nvsonar-2.0.0-py3-none-any.whl:

Publisher: ci.yml on btursunbayev/nvsonar

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page