Skip to main content

Utility library for easily distributing code execution on clusters

Project description

Cluster Tools

Build Status Code Style

This package provides python Executor classes for distributing tasks on a Slurm cluster, Kubernetes, Dask or via multi processing.

Example

import cluster_tools

def square(n):
  return n * n

if __name__ == '__main__':
  strategy = "slurm"  # other valid values are "multiprocessing" and "sequential"
  with cluster_tools.get_executor(strategy) as executor:
    result = list(executor.map(square, [2, 3, 4]))
    assert result == [4, 9, 16]

Installation

The cluster_tools package requires at least Python 3.9.

You can install it from pypi, e.g. via pip:

pip install cluster_tools

By default only the dependencies for running jobs on Slurm and via multiprocessing are installed. For Kubernetes and Dask run:

pip install cluster_tools[kubernetes]
pip install cluster_tools[dask]

Configuration

Slurm

The cluster_tools automatically determine the slurm limit for maximum array job size and split up larger job batches into multiple smaller batches. Also, the slurm limit for the maximum number of jobs which are allowed to be submitted by a user at the same time is honored by looking up the number of currently submitted jobs and only submitting new batches if they fit within the limit.

If you would like to configure these limits independently, you can do so by setting the SLURM_MAX_ARRAY_SIZE and SLURM_MAX_SUBMIT_JOBS environment variables. You can also limit the maximum number of simultaneously running tasks within the slurm array job(s) by using the SLURM_MAX_RUNNING_SIZE environment variable.

Kubernetes

Resource configuration

Key Description Example
namespace Kubernetes namespace for the resources to be created. Will be created if not existent. cluster-tools
node_selector Which nodes to utilize for the processing. Needs to be a Kubernetes nodeSelector object. {"kubernetes.io/hostname": "node001"}
image The docker image for the containerized jobs to run in. The image needs to have the same version of cluster_tools and the code to run installed and in the PYTHONPATH. scalableminds/voxelytics:latest
mounts Additional mounts for the containerized jobs. The current working directory and the .cfut directory are automatically mounted. ["/srv", "/data"]
cpu CPU requirements for this job. 4
memory Memory requirements for this job. Not required, but highly recommended to avoid congestion. Without resource requirements, all jobs will be run in parallel and RAM will run out soon. 16G
python_executable The python executable may differ in the docker image from the one in the current environment. For images based of FROM python, it should be python. Defaults to python. python3.8
umask umask for the jobs. 0002

Notes

  • The jobs are run with the current uid:gid.
  • The jobs are removed 7 days after completion (successful or not).
  • The logs are stored in the .cfut directory. This is actually redundant, because Kubernetes also stores them.
  • Pods are not restarted upon error.
  • Requires Kubernetes ≥ 1.23.
  • Kubernetes cluster configuration is expected to be the same as for kubectl, i.e. in ~/.kube/config or similar.

Dev Setup

# See ./dockered-slurm/README.md for troubleshooting
cd dockered-slurm
docker compose up -d
docker exec -it slurmctld bash
docker exec -it c1 bash

Make sure to install all extra dependencies, such as Kubernetes, with uv sync --all-extras.

Tests can be executed with cd tests && uv run pytest -s tests.py after entering the container. Linting can be run with ./lint.sh. Code formatting (black) can be run with ./format.sh.

Credits

Thanks to sampsyo/clusterfutures for providing the slurm core abstraction and giovtorres/slurm-docker-cluster for providing the slurm docker environment which we use for CI based testing.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cluster_tools-3.4.0.tar.gz (49.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cluster_tools-3.4.0-py3-none-any.whl (48.5 kB view details)

Uploaded Python 3

File details

Details for the file cluster_tools-3.4.0.tar.gz.

File metadata

  • Download URL: cluster_tools-3.4.0.tar.gz
  • Upload date:
  • Size: 49.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.7 {"installer":{"name":"uv","version":"0.11.7","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for cluster_tools-3.4.0.tar.gz
Algorithm Hash digest
SHA256 9513f3573ea1afd1d153ce2f91d59e648bfc4ed1dee1c2d2d3ca1d9db0befc6c
MD5 2793da21f2c204e1766104edfa1762c1
BLAKE2b-256 73da7d54fe0f250ad5c8941016d492845efffbc01e1f9b7c080fd8793709d763

See more details on using hashes here.

File details

Details for the file cluster_tools-3.4.0-py3-none-any.whl.

File metadata

  • Download URL: cluster_tools-3.4.0-py3-none-any.whl
  • Upload date:
  • Size: 48.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.7 {"installer":{"name":"uv","version":"0.11.7","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for cluster_tools-3.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 4c7546b6656362886549fffff4a8915fbf851f1396ae39f8b0a2c1385f62902f
MD5 f98c20b053ec8c19deb2b5a259270800
BLAKE2b-256 df9706d3e1cd7d556587bbe8d9b10cf094807dcacca675a6ea4ad0db7f58d02f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page