Simple slurm resource estimation

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

iparask troycomi

These details have not been verified by PyPI

Project description

slurmise

Installation
License

Installation

pip install slurmise

Usage

Configuration

Slurmise requires a configuration file for every command which tells slurmise where to store important files and how to process input files. A general CLI parser is hard to get correct and still doesn't account for all use cases, for example is the input file size important or number of lines? In addition to some built-in file parsers, you have the option of utilizing awk to extract more complex information from a file.

Example toml

[slurmise]
# base directory to store database and optimized models
base_dir = "slurmise_dir"

# default resources to return if a model doesn't exist or isn't trained
# can be overwritten by subsequent jobs
# if not set, will use 1000 for default memory and 60 for default runtime
default_mem = 2000
default_time = 70

# for each job you want to track, give a unique job name
[slurmise.job.job_name]
# the job spec determines how to parse commands to extract their relevant,
# dependent variables
job_spec = "subcommand -T {threads:numeric} -C {complexity:category}"
# jobs of `job_name` will now return default memory of 3000 and time of 80
default_mem = 3000
default_time = 80

Matching job names

The job name of a command can be set in various ways. First, if the command starts with the job name, the job name will be detected and removed from the command.

# slurmise.toml
[slurmise.job.git]
job_spec = "checkout {branch:category}"

This job specification will match any of the following with the branch set to my_branch

# infer from command
slurmise record "git checkout my_branch"
# tell explicitly
slurmise record --job-name git "checkout my_branch"

Since the job name cannot have spaces and some commands have several subcommands, you can set unique prefixes for certain jobs.

# slurmise.toml
[slurmise.job.git_checkout]
job_prefix = "git checkout"
job_spec = "{branch:category}"

[slurmise.job.git_merge]
job_prefix = "git merge"
job_spec = "{branch:category}"

With a job prefix, slurmise will use the prefix instead of job name to infer job names.

slurmise record "git checkout my_branch"
# explicit job name
slurmise record -j git_checkout "my_branch"

Note that the job name and prefix should not be included in the job specification. When a job name is explicitly given to slurmise, the corresponding command should not have the prefix or job name included.

Job specifications

When constructing the job specification, tokens that should be recorded use curly braces as placeholders:

{variable_name:variable_type}

The name should be unique within a job and contain no spaces. The type can be one of:

numeric: A single number, used in regression as an independent variable. Examples include the number of threads, epochs, or replicates to perform.
category: A string which is used to select the correct model. Examples include what algorithm to choose, switches or flags. Note that a category can be a number, but will be stored as a string, e.g. "1.0" is different from "1". For inference, the categories will be matched to particular, independent model.
ignore: A placeholder for a token that shouldn't be considered. Ignored tokens do not require a variable name.
file: An input file in plain text. Can be processed further as described below.
gzip_file: An input file in gzip format. During processing, the file will be decompressed to read it's contents, note this can incur memory and cpu drain.
file_list: An input file that contains a list of files to process in turn.

File Parsers

Each file can have one or more parsers associated with its variable name. Slurmise comes with several built-in options for parsing files:

file_size: The size of the file on disk, in bytes, numeric
file_lines: The number of lines (newlines) in the file, numeric
file_basename: The base filename, category
file_md5: The md5 digest of the file contents, category

Additionally, custom file parsers can be made using awk. While somewhat limited, awk prevents security issues with running arbitrary code. File parsers require a unique name in the slurmise.file_parsers collection. The return type is categorical by default. The awk command can be supplied as a string or file path, which is used with the -f flag of awk. Here are some examples:

[slurmise.file_parsers.epochs]
return_type = "numerical"
awk_script = "/^epochs:/ {print $2}"

[slurmise.file_parsers.network]
return_type = "categorical"
awk_script = "/^network type:/ {print $3}"

[slurmise.file_parsers.fasta_length]
return_type = "numerical"
awk_script = "/path/to/awk/file.awk"
script_is_file = True

# contents of file.awk
# /^>/ {if (seq) print seq; seq=0} 
# /^>/ {next} 
# {seq = seq + length($0)} 
# END {if (seq) print seq}

The first extracts the token after epochs: as a number and could be used for getting metadata from a configuration file. Similarly, the network parser extracts the network type but this time returns the result as a category.

Finally, the fasta_length parser takes an awk script file that prints the length of each sequence in a fasta file, returning the list of numbers as numerics.

To specify which parser a file uses, add them to the job entry:

[slurmise.job.sample_files]
job_spec = "--reference {reference:file} {fasta:file}"
file_parsers.reference = "file_md5,file_lines"
file_parsers.fasta = "file_size,fasta_length"

Each file name (reference and fasta) needs a file_parsers entry within the job specification. The name can take a comma separated list of parsers. Here the reference file is parsed with the md5 and number of lines. The md5 will create a new category based on the file contents while the lines will be an independent variable. In practice, matching md5 will ensure the same number of lines which doesn't provide additional information to the mode. The fasta file returns the file size in bytes and the number of nucleotides in each fasta entry.

License

slurmise is distributed under the terms of the MIT license.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

iparask troycomi

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.0.3

Jul 9, 2025

This version

0.0.2

Jul 9, 2025

0.0.1

Jun 10, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

slurmise-0.0.2.tar.gz (37.5 kB view details)

Uploaded Jul 9, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

slurmise-0.0.2-py3-none-any.whl (25.7 kB view details)

Uploaded Jul 9, 2025 Python 3

File details

Details for the file slurmise-0.0.2.tar.gz.

File metadata

Download URL: slurmise-0.0.2.tar.gz
Upload date: Jul 9, 2025
Size: 37.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for slurmise-0.0.2.tar.gz
Algorithm	Hash digest
SHA256	`818f3e1d4ce7812c1a9a384dfce0d8b84e0f8b6303e120e1b04f01f791239006`
MD5	`3f330123f391050a957c71f6e45310bf`
BLAKE2b-256	`2589ea029fce3490d1f2051716fb6ef1733576cca66b50d996d981c27257965b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for slurmise-0.0.2.tar.gz:

Publisher: release.yaml on PrincetonUniversity/slurmise

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: slurmise-0.0.2.tar.gz
- Subject digest: 818f3e1d4ce7812c1a9a384dfce0d8b84e0f8b6303e120e1b04f01f791239006
- Sigstore transparency entry: 268707576
- Sigstore integration time: Jul 9, 2025
Source repository:
- Permalink: PrincetonUniversity/slurmise@07080ed34c85260ba2e0091a81672053102f06da
- Branch / Tag: refs/tags/v0.0.2
- Owner: https://github.com/PrincetonUniversity
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yaml@07080ed34c85260ba2e0091a81672053102f06da
- Trigger Event: release

File details

Details for the file slurmise-0.0.2-py3-none-any.whl.

File metadata

Download URL: slurmise-0.0.2-py3-none-any.whl
Upload date: Jul 9, 2025
Size: 25.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for slurmise-0.0.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`63e60fc58ffdb55e81adbcf217a3d2408e51f0772b842162d2123db083b1ed6c`
MD5	`13b85a5690c73adc0551c3276f36bc25`
BLAKE2b-256	`c7c60fba5ea2987e2daf501dab51fe485b21bd5fb4a56bd344871bb1084a3ea2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for slurmise-0.0.2-py3-none-any.whl:

Publisher: release.yaml on PrincetonUniversity/slurmise

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: slurmise-0.0.2-py3-none-any.whl
- Subject digest: 63e60fc58ffdb55e81adbcf217a3d2408e51f0772b842162d2123db083b1ed6c
- Sigstore transparency entry: 268707580
- Sigstore integration time: Jul 9, 2025
Source repository:
- Permalink: PrincetonUniversity/slurmise@07080ed34c85260ba2e0091a81672053102f06da
- Branch / Tag: refs/tags/v0.0.2
- Owner: https://github.com/PrincetonUniversity
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yaml@07080ed34c85260ba2e0091a81672053102f06da
- Trigger Event: release

slurmise 0.0.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

slurmise

Table of Contents

Installation

Usage

Configuration

Example toml

Matching job names

Job specifications

File Parsers

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance