Script running tool for optimizing GPU memory utilization.
Project description
TRIAGE
Intended use: running a multitude of GPU-intensive scripts in a way that optimizes GPU memory utilization. Great for ML/DL based experiments on servers shared between several users.
Installation
pip install triage-runner
Usage
See --help option for extended list of possible arguments.
Running one config:
triage run_config.json
Running several configs:
triage run_config1.json run_config2.json run_config3.json
Patterns can be used for config discovery as well:
triage run_config*.json
More on pattern syntax can be found here: https://docs.python.org/3.10/library/pathlib.html#pathlib.Path.glob
Run configurations
Stored in JSON format. The sample run configuration looks like this:
{
"memory_needed": 10.0,
"config_name": "sample_config",
"command": "python3 train.py",
"args": [
"arg1",
"--arg2",
["--seed=1", "--seed=2", "--seed=3"],
"--arg3=3"
]
}
Every entry in args list is an argument for command. An entry can be a list - in which case TRIAGE will iterate through all the possible combinations of all values in list entries. The example script above will be run 3 times with an argument --seed set to 1, 2 and 3.
Parameter config_name is optional and is used for logging the results (see --logfile option TODO). Based on this parameter environment variable TASK_NAME is set in order to be used by running script.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file triage-runner-0.4.5.tar.gz.
File metadata
- Download URL: triage-runner-0.4.5.tar.gz
- Upload date:
- Size: 9.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.0b5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
dee812293656a034d9f0d35b506e036b8ed808ebebeb1a17f8f2256a7c03b6d6
|
|
| MD5 |
20f088527cda84bdc024599df8c7d02a
|
|
| BLAKE2b-256 |
82e12cc54f5ceaf965d928f52a98463d456351f617bbc5df59ecd86b62f26ed9
|
File details
Details for the file triage_runner-0.4.5-py3-none-any.whl.
File metadata
- Download URL: triage_runner-0.4.5-py3-none-any.whl
- Upload date:
- Size: 10.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.0b5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a131acf9a5966d159efc51139d98f536ea3bc579585dededac25147b655fddd9
|
|
| MD5 |
c440fb54c2d7ab7eb6d7812b52e52a0d
|
|
| BLAKE2b-256 |
c3efed1faccdb3a374a30e2fe2961369cc708bd2c33e11c4f2fd43e2d160c8ca
|