Script running tool for optimizing GPU memory utilization.
Project description
TRIAGE
Intended use: running a multitude of GPU-intensive scripts in a way that optimizes GPU memory utilization. Great for ML/DL based experiments on servers shared between several users.
Installation
pip install triage-runner
Usage
See --help
option for extended list of possible arguments.
Running one config:
triage run_config.json
Running several configs:
triage run_config1.json run_config2.json run_config3.json
Patterns can be used for config discovery as well:
triage run_config*.json
More on pattern syntax can be found here: https://docs.python.org/3.10/library/pathlib.html#pathlib.Path.glob
Run configurations
Stored in JSON format. The sample run configuration looks like this:
{
"memory_needed": 10.0,
"config_name": "sample_config",
"command": "python3 train.py",
"args": [
"arg1",
"--arg2",
["--seed=1", "--seed=2", "--seed=3"],
"--arg3=3"
]
}
Every entry in args
list is an argument for command
. An entry can be a list - in which case TRIAGE will iterate through all the possible combinations of all values in list entries. The example script above will be run 3 times with an argument --seed
set to 1, 2 and 3.
Parameter config_name
is optional and is used for logging the results (see --logfile
option TODO). Based on this parameter environment variable TASK_NAME
is set in order to be used by running script.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file triage-runner-0.4.5.tar.gz
.
File metadata
- Download URL: triage-runner-0.4.5.tar.gz
- Upload date:
- Size: 9.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.0b5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | dee812293656a034d9f0d35b506e036b8ed808ebebeb1a17f8f2256a7c03b6d6 |
|
MD5 | 20f088527cda84bdc024599df8c7d02a |
|
BLAKE2b-256 | 82e12cc54f5ceaf965d928f52a98463d456351f617bbc5df59ecd86b62f26ed9 |
File details
Details for the file triage_runner-0.4.5-py3-none-any.whl
.
File metadata
- Download URL: triage_runner-0.4.5-py3-none-any.whl
- Upload date:
- Size: 10.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.0b5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a131acf9a5966d159efc51139d98f536ea3bc579585dededac25147b655fddd9 |
|
MD5 | c440fb54c2d7ab7eb6d7812b52e52a0d |
|
BLAKE2b-256 | c3efed1faccdb3a374a30e2fe2961369cc708bd2c33e11c4f2fd43e2d160c8ca |