Skip to main content

Prometheus metrics collectors for Keras

Project description

Gangplank

Export Keras metrics to Prometheus

Prometheus is a monitoring system that pulls metrics from applications and infrastructure. While polling works for applications that are continuously running, scraping metrics does not work well with batch jobs such as machine learning training or evaluation jobs. The Prometheus Pushgateway is middleware that connects batch jobs to Prometheus.

Gangplank is a Keras callback for pushing Keras training and testing metrics to Prometheus via a pushgateway.

What metrics are exported?

During training, the following metrics are exported:

  • The number of completed training epochs
  • The time spent training
  • The number of model weights (both trainable and non-trainable)
  • The model's loss
  • All metrics configured for the model (e.g. accuracy for a classification model or mean absolute error for a regression model)
  • (Optionally) A histogram of the model's trainable weights at the end of the training run

For testing (i.e. evaluation), the following metrics are exported:

  • The time spent testing
  • The model's loss
  • All metrics configured for the model (accuracy, mean absolute error, etc.)
  • (Optionally) A histogram of the model's trainable weights

Installing Gangplank

Gangplank can be installed from PyPI

pip install gangplank

The installation will also install Keras. Keras needs a tensor arithmetic backend like tensorflow, JAX or PyTorch. You can install a backend at the same time as installing Gangplank by running one of the following

pip install gangplank[tensorflow]
pip install gangplank[jax]
pip install gangplank[torch]

Note: Running, e.g., pip install gangplank[jax] will install a CPU-only version of JAX. If you want, say, CUDA support you should install JAX separately

pip install gangplank
pip install jax[cuda12]

Similar comments apply to tensorflow and PyTorch.

Examples

An example of using Gangplank can be found in the github repository.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gangplank-0.2.2.tar.gz (6.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gangplank-0.2.2-py3-none-any.whl (5.4 kB view details)

Uploaded Python 3

File details

Details for the file gangplank-0.2.2.tar.gz.

File metadata

  • Download URL: gangplank-0.2.2.tar.gz
  • Upload date:
  • Size: 6.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for gangplank-0.2.2.tar.gz
Algorithm Hash digest
SHA256 ea61a1af2f33e65a7dd431f9443300dfa63c39a27e8ff6e191998c87a97e9656
MD5 7efacd7d300ebafab8e854ea98dce078
BLAKE2b-256 0a930f3651846a4e3ec2540ef87d50ed6487be5b120e07951c1014906d7eab95

See more details on using hashes here.

Provenance

The following attestation bundles were made for gangplank-0.2.2.tar.gz:

Publisher: publish-to-pypi.yml on hammingweight/gangplank

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file gangplank-0.2.2-py3-none-any.whl.

File metadata

  • Download URL: gangplank-0.2.2-py3-none-any.whl
  • Upload date:
  • Size: 5.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for gangplank-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 6b8af20c45902807075404eb7bc4e35d7f183d4b01860164a9b11100bb2e337c
MD5 34ef9d02f2de221d1a16d5dfec2a78be
BLAKE2b-256 d73e95f6342f26b9b54c99bc5b2fd1781e0fd38d845464f42c2af9ebf0555afc

See more details on using hashes here.

Provenance

The following attestation bundles were made for gangplank-0.2.2-py3-none-any.whl:

Publisher: publish-to-pypi.yml on hammingweight/gangplank

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page