Nomad job management tool
Project description
Overview
nmgr
is a utility program for managing jobs in a Nomad cluster according to certain specific needs and preferences of mine. For a basic orientation in what it does and why it does it, see Rationale. The type of jobs it is designed to operate on can be gleaned from my homelab repository.
It started as a set of Bash convenience functions which, in time, slowly but surely threatened to evolve into an unmaintainable monstrosity. This Python rewrite, consequently, represents a more or less desperate attempt to tame the beast before it would be too late—or perhaps more accurately, a way of trading one set of complexities for another that nevertheless feels a bit more structured and robustly extensible. In any case, it's fun sometimes to seek out a dubious break from the purity of UNIX pipes to get tangled up in some overengineered OOP for a bit instead. Misery needs variety if it is to be enjoyable.
Installation
nmgr
is packaged on PyPi and can be installed using, for example, pipx
:
pipx install nmgr
Usage
usage: nmgr [options] [action] [target]
Nomad job manager
positional arguments:
action up, down, find, list, image, logs, exec, reconcile
target infra, services, all, a custom filter, a specific job name, or a string (for "find")
options:
-h, --help show this help message and exit
-c, --config CONFIG path to config file (default: /home/<user>/.config/nmgr/config.toml)
-n, --dry-run dry-run mode
-d, --detach start jobs in detached mode
-p, --purge purge jobs when stopping
-v, --verbose verbose output
--completion install autocompletion for Bash and exit
--version show program's version number and exit
Rationale
Consider the following use-cases:
-
You're using something like Renovate to manage updates to container image versions. Now one fine day, a whole bunch of these comes in as a PR, so you merge, pull locally—and then what? Do you manually hunt down all the jobs needing to be updated and restart them one by one? Well, now you can do this instead:
nmgr reconcile all
Or, if you still would like to preserve some manual control:
nmgr reconcile my-specific-job
Also, just for fun, you might first want to compare a job's currently running images against those in its specification:
$ nmgr image forgejo Live images: forgejo = "codeberg.org/forgejo/forgejo:9.0.3-rootless" valkey = "docker.io/valkey/valkey:7.2-alpine" Spec images: forgejo = "codeberg.org/forgejo/forgejo:10.0.0-rootless" valkey = "docker.io/valkey/valkey:8.0-alpine"
-
You're about to perform a server upgrade that requires a restart. Instead of manually coddling every one of those 50+ running jobs first, it sure would be handy to be able to do this:
nmgr down all sudo apt update && sudo apt upgrade sudo reboot now [...] nmgr up all
-
Nextcloud's PHP spaghetti has decided to crap the bed, and you have no choice but to start tailing the logs. "What's the syntax again?
nomad logs -f -job nextcloud
? Wait, no, that errors out. Oh, that's right: I have to specify a 'task' to get the logs from. But what did I name the Nextcloud job tasks? I better check the job specification..." No! Stop right there.$ nmgr logs nextcloud Tasks for job nextcloud: 1. server 2. cron 3. redis 4. push 5. collabora Select a task (number):
And off you go.
-
You find yourself wanting to break all the rules of application containers by looking to shell in and execute some command. Now what was it,
nomad alloc exec -job immich
? Apparently not:Please specify the task
. Ah, right:nomad alloc -job immich -task server
. What the hell?Please specify the task
again? Perhaps-task
has to precede-job
? At this point you might feel like giving up. But fear not!$ nmgr exec immich Tasks for job immich: 1. server 2. machine-learning 3. redis Select a task (number): 1 Command to execute in server: ls bin get-cpus.sh package-lock.json resources upload dist node_modules package.json start.sh
-
At random parts of the day, your heart will sink when you suddenly remember you probably still have some jobs running with a
latest
image tag. After some time, you have had enough of these crises of conscience, so you roll up your sleeves,ssh
into the server, and–what's that? You were going to go look for all those image specifications manually? Don't be silly:nmgr find :latest
-
You're about to upgrade or otherwise mess with, say, a NAS on which a host of currently running jobs depend. Do you now wade through each and every job specification to remind yourself which jobs you would need to stop before making your changes? Instead, you could do this:
nmgr down nas
And then, after you're done messing with the NAS:
nmgr up nas
You could do the same thing for jobs that depend on e.g. a database job (
nmgr {up,down} db
), a JuiceFS mount (nmgr {up,down} jfs
), and so forth.
-
Before blindly tearing down a bunch of jobs as in the example above, you would like to know exactly which jobs are going to be impacted. Hence, nervous Nellie that you are, you run:
nmgr list nas
Or, if you could muster up just a bit more courage, you might perform a dry-run:
nmgr -n down nas
NOTE: Some of these examples make use of custom target filters (nas
, jfs
, db
). These can be defined in the configuration file that will be generated on first run.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file nmgr-0.3.1.tar.gz
.
File metadata
- Download URL: nmgr-0.3.1.tar.gz
- Upload date:
- Size: 25.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.4 CPython/3.13.1 Linux/6.8.0-1021-azure
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
05324c95abb20d915f2f2a9c1c87809f8d2e94b4d5f45466f39585f7ae901bc5
|
|
MD5 |
a2356a3c47cd31f66e9af7c44cd0024e
|
|
BLAKE2b-256 |
e126da1d1c190604ec668172a2de47151c2b2f5961a38dd653de9161f9fd2740
|
File details
Details for the file nmgr-0.3.1-py3-none-any.whl
.
File metadata
- Download URL: nmgr-0.3.1-py3-none-any.whl
- Upload date:
- Size: 26.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.4 CPython/3.13.1 Linux/6.8.0-1021-azure
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
1739ad3fa1c82e32390cbfe60350079fa55e47212785a5b3327eec2647f740f0
|
|
MD5 |
c7d2d4617ed002e1299ef2ace73ca039
|
|
BLAKE2b-256 |
f11282009280deda504753a69e2bcb63f9876d1357173af7a9d8dd9979650d67
|