Skip to main content

A utility for splitting tasks up into batches

Project description

JobBatch: Run batches of commands

Background

JobBatch is a simple tool for queuing up batches of commands and running N of them at a time.

The initial use case was scheduling software updates on a large number of remote devices, without getting all the support queries back in the same day! So we wanted to update 100 devices each night. But you might want to send 10 emails per minute, transcode 5 videos per hour, or whatever. You can set up a cron job or similar to run jobbatch, and it will split the tasks into these chunks for you and run them.

Here's how it works.

The flow is managed using the filesystem. You have a set of subdirectories:

  • queue
  • batch
  • successes
  • failures
  • output

You can put all of the jobs you want to run as individual files into the 'queue' directory. If you have them as lines in a single file, you can use jobbatch split to turn it into one file per job. Often, each job file is a shell script, but it doesn't have to be (see below).

Each time you want to run the next chunk of jobs, you do jobbatch select to take a selection of the files in the queue directory and move them into the batch directory. You can then take a look to check that things look right, and if you don't like what you see, you can just move all the files back into the queue directory. If the queue directory is empty, this command will do nothing.

Then you run jobbatch run, which will execute those the jobs in the batch directory one at a time. After execution, each file is moved to either successes or failures depending on its exit code. If the batch directory is empty, this command will do nothing.

Any standard output or error output from a job will be put in a file with a '.stdout' or '.stderr' extension within the output directory.

So, to summarise, each file eventually goes from:

queue -> batch -> successes or failures (possibly plus output)

Not just for scripts

Typically, each job file will be a shell script or program to be executed.

If the file has the 'executable' flag bit set, it will be executed directly. If it doesn't, it will be passed as the argument to a processor such as /bin/bash, but you can change this with a command-line option.

This means, therefore, that the jobs do not have to be scripts. Suppose you had a script for emailing thumbnails of your photographs to your friends, you could put the photographs into the 'queue' directory and specify your script as the processor, and then send them eight thumbnails per day.

For some tasks, the job files may not even need to have any contents, because the filename itself is the parameter. You might just created empty files named after the serial numbers of the devices to be updated. Note that jobbatch run will normally send the full pathname of the file to the processor, but there is an option just to send the base name within the subdirectory, if you wish to use it this way.

Author

Quentin Stafford-Fraser June 2022

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jobbatch-1.0.5.tar.gz (11.4 kB view hashes)

Uploaded Source

Built Distribution

jobbatch-1.0.5-py3-none-any.whl (12.0 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page