GPU job scheduler using filelock

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

🚀 GPU Runner (`gpurun`)

Overview

gpurun is a sophisticated command-line tool designed to simplify scheduling GPU jobs on a single machine for machine learning, data science, and computational tasks. It provides intelligent GPU allocation, concurrency control, and seamless integration with CUDA applications using CUDA_VISIBLE_DEVICES. It is inspired by the sem tool from GNU parallel.

Why Use GPU Runner?

Imagine you are on an 8-GPU machine, and you want to run 100 inference jobs. You want to run them in parallel. Each job needs 2 GPUs and each GPU can only run three jobs at a time to avoid GPU memory overflow. You think you need a cluster scheduler like sbatch but for a single machine on GPUs. If this is your problem, GPU Runner is the perfect solution for you!

Think of GPU Runner as a local GPU scheduler that intelligently coordinates job execution, preventing memory overflow and maximizing computational efficiency.

🌟 Key Features

Flexible GPU Allocation: Automatically assign jobs to GPUs with CUDA_VISIBLE_DEVICES
Simple Interface: Minimalistic command-line interface
Concurrent Job Management: Limit and control GPU usage across multiple jobs
Automatic GPU Detection: Automatically identifies available GPUs
GNU Parallel Compatibility: Easy to be used with GNU parallel
Session-based Locking: Prevent GPU contention between different computational tasks

Installation

pip install git+https://github.com/zhou13/gpurun

Usage Examples

Basic Usage

# 1. Run infer.py (needs 1 GPU) on 100 images with all the GPUs in parallel, 1 job per GPU.
for i in $(seq 1 100); do gpurun python infer.py $i.jpg & done

# 2. Same as 1, but put 2 jobs per GPU at the same time.
for i in $(seq 1 100); do gpurun -j2 python infer.py $i.jpg & done

# 3. Same as 2, but use gnu-parallel to simplify the command.
parallel -j0 gpurun -j2 python infer.py {} ::: $(seq 1 100)

# 4. Same as 1, but infer.py now will see 2 GPUs.
parallel -j0 gpurun -g2 python infer.py {} ::: $(seq 1 100)

# 5. You can customize the GPUs to be used with --gpus.
parallel -j0 gpurun --gpus 0,1 python infer.py {} ::: $(seq 1 100)

# 6. You can customize the name of lockfile with --session.
parallel -j0 gpurun --session ml-seesion python infer.py {} ::: $(seq 1 100)

Configuration Options

$ gpurun --help
usage: gpurun [-h] [--session SESSION] [--gpus GPUS] [-j MAX_JOBS_PER_GPU] [-g GPUS_PER_JOB] ...

Run a command with GPU management and scheduling

positional arguments:
  command               Command to run (e.g., "python run.py")

options:
  -h, --help            show this help message and exit
  --session SESSION     String representing the session name used in locks (default: "default-session")
  --gpus GPUS           Comma-separated list of GPU IDs to use (e.g., "0,1,2") or "auto" for autodetection
  -j MAX_JOBS_PER_GPU, --max-jobs-per-gpu MAX_JOBS_PER_GPU
                        Maximum number of concurrent jobs allowed per GPU group (default: 1)
  -g GPUS_PER_JOB, --gpus-per-job GPUS_PER_JOB
                        Number of GPUs needed for each job (default: 1)

Contributing

Contributions welcome! Feel free to open issues and submit pull requests.

License

This project is licensed under the MIT License.

Project details

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

0.1.1

Nov 23, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gpurun-0.1.1.tar.gz (7.5 kB view details)

Uploaded Nov 23, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

gpurun-0.1.1-py3-none-any.whl (5.6 kB view details)

Uploaded Nov 23, 2024 Python 3

File details

Details for the file gpurun-0.1.1.tar.gz.

File metadata

Download URL: gpurun-0.1.1.tar.gz
Upload date: Nov 23, 2024
Size: 7.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.12.6

File hashes

Hashes for gpurun-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`ca989529407521e6fdff4ebafd6108c4edb5c76330d13aec36f3e5248ec509ab`
MD5	`3f27c3a158dc02f37578fe53cdf93c1e`
BLAKE2b-256	`9ef2d62b64803e528cef8feb33c916077de487e9d757e28c75252ae2e327fa34`

See more details on using hashes here.

File details

Details for the file gpurun-0.1.1-py3-none-any.whl.

File metadata

Download URL: gpurun-0.1.1-py3-none-any.whl
Upload date: Nov 23, 2024
Size: 5.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.12.6

File hashes

Hashes for gpurun-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9a6780d3189a50c5bd7432fc1c8a13af9e06303e91a5a7bddcce5f20bfc0d347`
MD5	`2385a843e3e9c58336d5979cc0086bbf`
BLAKE2b-256	`2f3846479885d3640f5a52bdf45006366cf37780b019dabe1d62faa320ec6034`

See more details on using hashes here.

gpurun 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

🚀 GPU Runner (`gpurun`)

Overview

Why Use GPU Runner?

🌟 Key Features

Installation

Usage Examples

Basic Usage

Configuration Options

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

gpurun 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

🚀 GPU Runner (gpurun)

Overview

Why Use GPU Runner?

🌟 Key Features

Installation

Usage Examples

Basic Usage

Configuration Options

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

🚀 GPU Runner (`gpurun`)