Skip to main content

Watch for idle GPUs and run your jobs: launches jobs in tmux, keeps logs/status and sends start/finish emails..

Project description

GPUSitter

Watch for idle GPUs and run your jobs: launches jobs in tmux, keeps logs/status and sends start/finish emails..

Features

  • Real-time GPU usage monitoring
  • Command-line interface, easy to integrate into workflows
  • Email notifications
  • Scheduled automatic job running

Installation

pip install gpusitter

Usage

gpust --job="python train.py"  # with 1 gpu

gpust --job="python train.py:4"  # with 4 gpus

Parameter description:

class ConfigData:
    """Configuration data for GPU Snatcher."""

    gpu_free_memory_ratio_threshold: float
    friendly_min: float
    email_host: str
    email_user: str
    email_pwd: str
    email_sender: str
    email_receivers: list[str]
  • gpu_free_memory_ratio_threshold: The minimum free GPU memory ratio required to consider a GPU available. Only GPUs with free memory above this threshold will be used.
  • friendly_min: Waiting time (in seconds) before allocating GPUs. Helps prevent OOM from previous jobs.
  • email_host: Email server, e.g., smtp.qq.com
  • email_user: Email address
  • email_pwd: SMTP authorization code
  • email_sender: Sender
  • email_receivers: Recipients

Contribution

Issues and pull requests are welcome. Please follow the project's code style guidelines.

License

This project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gpusitter-2.0.1.tar.gz (8.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gpusitter-2.0.1-py3-none-any.whl (9.9 kB view details)

Uploaded Python 3

File details

Details for the file gpusitter-2.0.1.tar.gz.

File metadata

  • Download URL: gpusitter-2.0.1.tar.gz
  • Upload date:
  • Size: 8.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.17

File hashes

Hashes for gpusitter-2.0.1.tar.gz
Algorithm Hash digest
SHA256 16561200676a51203dedc2035cd060f028015183b593b111e3875972035a6042
MD5 48f87ca9066642571d5fd04172d78296
BLAKE2b-256 2b9add7a94562f412a585add83699c4612263300c6cc7dc452dca8e8c3a22695

See more details on using hashes here.

File details

Details for the file gpusitter-2.0.1-py3-none-any.whl.

File metadata

  • Download URL: gpusitter-2.0.1-py3-none-any.whl
  • Upload date:
  • Size: 9.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.17

File hashes

Hashes for gpusitter-2.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 0c3e957c7186b1b3f1aa730774c207c831f927b6194dccd9e3aa9cd58d5ef16c
MD5 4fdd03249d715c401ddc89304b059b46
BLAKE2b-256 4de3ced991828d1e6889fc3723f321ee7e8c0a10647cb291bfaab63bf2e2dfd4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page