A python template
Project description
gpu_tester
Gpu tester finds all your bad gpus.
Works on slurm.
Install
pip3 install torch --extra-index-url https://download.pytorch.org/whl/cu116
then
pip install gpu_tester
Python examples
Checkout these examples to call this as a lib:
API
This module exposes a single function gpu_tester
which takes the same arguments as the command line tool:
- cluster the cluster. (default slurm)
- job_name slurm job name. (default gpu_tester)
- partition slurm partition. (default compute-od-gpu)
- gpu_per_node numbe of gpu per node. (default 8)
- nodes number of gpu nodes. (default 1)
- output_folder the output folder. (default None which means current folder / results)
- job_timeout job timeout (default 300 seconds)
For development
Either locally, or in gitpod (do export PIP_USER=false
there)
Setup a virtualenv:
python3 -m venv .env
source .env/bin/activate
pip install -e .
to run tests:
pip install -r requirements-test.txt
then
make lint
make test
You can use make black
to reformat the code
python -m pytest -x -s -v tests -k "dummy"
to run a specific test
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
gpu_tester-1.0.0.tar.gz
(5.4 kB
view hashes)
Built Distribution
Close
Hashes for gpu_tester-1.0.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 50e65156f8b4b613a075e090aac58e257a5ac957079420b0009f4349ee52663a |
|
MD5 | f6c84643d1f7e6f311a093dc5e928192 |
|
BLAKE2b-256 | 30cd4ad2350bc1a9f502acaa81e568a2c4b2e9816294cc53d6c806832c18203f |