Skip to main content

Monitor idle GPU usage.

Project description

GPU Sentinel

A Moonshine Labs tool

If you're automating training your large models in the cloud, cost control is critial. How many times have you accidentally left an expensive GPU instance running when the underlying job had crashed, costing you money or capacity with no benefit?

GPU Sentinel is a simple tool that will watch your instance and automatically trigger when GPU utilization drops below a certain amount for a period of time. GPU Sentinel can automatically shutdown or reboot the instance, or simply end its own process so you can do an action yourself.

Constraints:

  • To shutdown/reboot the machine, GPU Sentinel requires sudo permissions.
  • Currently only working on Linux, Windows support coming soon.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gpu_sentinel-0.1.2.tar.gz (3.9 kB view details)

Uploaded Source

Built Distribution

gpu_sentinel-0.1.2-py3-none-any.whl (5.1 kB view details)

Uploaded Python 3

File details

Details for the file gpu_sentinel-0.1.2.tar.gz.

File metadata

  • Download URL: gpu_sentinel-0.1.2.tar.gz
  • Upload date:
  • Size: 3.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.0

File hashes

Hashes for gpu_sentinel-0.1.2.tar.gz
Algorithm Hash digest
SHA256 f043968beb2ede701c5c136aa3dfc537276ef509965b8780b74a724fa76f0c8d
MD5 bbe9654e875dfbc93c8b4de9b438d666
BLAKE2b-256 c38f11eb91affdc4ff2fca0c55080b3e8f8434b008472b3b02fe0176b5cb174e

See more details on using hashes here.

File details

Details for the file gpu_sentinel-0.1.2-py3-none-any.whl.

File metadata

File hashes

Hashes for gpu_sentinel-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 3ff9355b541ec1d8f57a9495a9a2be3631f463ea91d538d28a47f8dcd0ec6bac
MD5 3008036a8ac6faa4019ef716f663bb43
BLAKE2b-256 a85e5c44141dfff24fce8317b88c46db86c72a5e1712d43dca8be670fa6906d7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page