SLURM Fragmentation Analyzer - Visualize and analyze resource fragmentation across SLURM clusters
Project description
slurmfrag ๐งฉ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ GPU: 25/32 used (78.1%) โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโค
โ NODE USED TOTAL USE% FRAGMENTATION โ
โ gpunode01 7 8 87.5% [โโโโโโโโโโโโโโโโโโโโ] โ
โ gpunode02 8 8 100.0% [โโโโโโโโโโโโโโโโโโโโ] โ
โ gpunode03 4 8 50.0% [โโโโโโโโโโโโโโโโโโโโ] โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโค
โ CPU: 374/1536 used (24.3%) โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโค
โ gpunode01 124 384 32.3% [โโโโโโโโโโโโโโโโโโโโ] โ
โ gpunode02 98 384 25.5% [โโโโโโโโโโโโโโโโโโโโ] โ
โ gpunode03 108 384 28.1% [โโโโโโโโโโโโโโโโโโโโ] โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโค
โ Memory: 4.3/6.0 TB used (71.5%) โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโค
โ gpunode01 1091 1500 72.7% [โโโโโโโโโโโโโโโโโโโโ] โ
โ gpunode02 1070 1500 71.3% [โโโโโโโโโโโโโโโโโโโโ] โ
โ gpunode03 1074 1500 71.6% [โโโโโโโโโโโโโโโโโโโโ] โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
Visualize SLURM cluster fragmentation โ GPU, CPU, and memory usage per node, plus queue depth and pending requests.
Features
- All-in-one โ GPU, CPU, memory fragmentation per node in a single view
- Queue status โ see pending jobs and total resource requests waiting
- JSON export โ for automation and historical analysis
- Logging daemon โ run
slurmfrag --logto periodically capture snapshots for trend analysis - Configurable โ YAML config for cluster-specific GRES patterns
Quick start
From PyPI:
pip install slurmfrag
slurmfrag
From source:
git clone https://github.com/ferreirafabio/slurmfrag
cd slurmfrag
python -m slurmfrag
Usage
slurmfrag # interactive partition picker
slurmfrag -p gpu-partition # direct analysis
slurmfrag -p gpu-partition --json
slurmfrag --log # logging daemon mode
Config
Optional ~/.config/slurmfrag/config.yaml:
gres_patterns:
gpu:
pattern: "gpu[^:]*:(\\d+)" # adjust for your cluster
logging:
output_dir: "./slurmfrag_logs"
interval_minutes: 5
Requirements
- Python 3.8+
- SLURM (
sinfo,squeuecommands) - PyYAML
License
Apache-2.0
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
slurmfrag-0.1.0.tar.gz
(17.8 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
slurmfrag-0.1.0-py3-none-any.whl
(18.6 kB
view details)
File details
Details for the file slurmfrag-0.1.0.tar.gz.
File metadata
- Download URL: slurmfrag-0.1.0.tar.gz
- Upload date:
- Size: 17.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.19
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e5cef073d8e7d14130497841585a7e060cf9865b239d0146b200dc4ad3f2e7bb
|
|
| MD5 |
0d91a90351fe77794eae2884e4307235
|
|
| BLAKE2b-256 |
16fff86f796e04fa867e828eb254ebae67a3c286bfceb4c9bebc2f6a4ea31fa7
|
File details
Details for the file slurmfrag-0.1.0-py3-none-any.whl.
File metadata
- Download URL: slurmfrag-0.1.0-py3-none-any.whl
- Upload date:
- Size: 18.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.19
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b927d6fc33885b99b052ec6682e41757b7d28538fc3027d06958914c59a29846
|
|
| MD5 |
507f0d64f4878c8d8ab20b59d4bceb94
|
|
| BLAKE2b-256 |
8cfb623899d39c0f23d9ba7e3ecfb7c6bc9481d5a9565bd663ebf93cfeb7d6f4
|