Skip to main content

Python Bash Script Helpers

Project description

Pyscriptlib - Python Bash Script Helpers

Whats new in version 2.3.0

  • Added shbg(cmd, **kwargs) to run a command in the background and read its stdout in a separate thread.
    • This creates a BackgroundProcess(cmd, **kwargs) class to manage the background process
    • The get_stdout(sentinel=None) method will return immediately, meanwhile reading the stdout of the background process in a separate thread up to the sentinel string.
    • The join_stdout(timeout=None) method will wait up to timeout seconds for the IO to complete and returns the stdout lines read.
  • Improved humanize(seconds, style='compact', days='days', day='day', *, zerodays=True) to use day instead of days when style is compact and there is only one day.

Introducing pyscriptlib

pyscriptlib is a collection of helper functions that make it easy to invoke bash shell commands from a python script.

These helpers simplify the use of the sys, os, and subprocess modules for common cases.

From sys we get:

  • arg(n:int) -> str
    • a simple parse of sys.argv that always returns a string
    • also consider argparse or click-shell if you are building a complex CLI
  • nargs(n:int) -> List(str)
    • retrieve the remaining command line args starting with arg(n)
  • shift(n:int=1) -> List[str]
    • shifts sys.argv by n preserving sys.argv[0] returning removed args
  • exit(return_code:int=0)
    • exeunt, pursued by a bear (WS)

From os we get:

  • env(var:str) -> str|None
    • easy retrieval of environment variables
  • kill(pid:int, signal:int=9)
    • this subprocess is killing me, so kill or be killed

From subprocess we get several useful variations of sh*(cmd:str, **kwargs) to invoke the /bin/bash shell for us (including subprocess.Popen(**kwargs) if necessary):

  • sh(cmd:str, **kwargs) -> (stdout+stderr).strip()
    • output, output, gimme output
  • shl(cmd:str, **kwargs) -> list(rc, stdout, stderr)
    • ok, ok, maybe we should take a look first: if rc == 0
  • shk(cmd:str, **kwargs) -> list(is_ok, rc, stdout, stderr)
    • let's test with a concise boolean is_ok instead of rc == 0
  • sho(cmd:str, **kwargs) -> CompletedProcess(is_ok, rc, stdout, stderr)
    • cp = sho(cmd) is the full Monty
    • customized with cp.is_ok and cp.rc
  • shx(cmd:str, **kwargs) -> None
    • displays live output to terminal just like you want it to in living color
  • shb(cmd:str, **kwargs) -> int(pid)
    • runs cmd in background, returning immediately
    • use pid to kill(pid), or not, you daemon you
  • shbg(cmd:str, **kwargs) -> BackgroundProcess(pid, stdout, stderr)
    • runs cmd in background, returning immediately
    • returns a BackgroundProcess object with get_stdout() and join_stdout() methods
    • usage:
    bg = shbg('echo -n "hello\\nfrom\\nbackground\\nprocess"; sleep 3') 
    bg.get_stdout(sentinel='back')
    print('waiting for output...')
    stdout = bg.join_stdout(timeout=3)
    print(stdout)
    
  • class Shell can be used to select your preferred shell
    • Shell.shell is the variable used to access the current shell value
    • Shell.shell = '/bin/bash' is the default bash shell used by sh*() functions

And, from nowhere in particular, we get

  • humanize(seconds:int|float, style='compact', days='days', day='day', *, zerodays=True) -> str()
    • returns a human readable form of elapsed seconds such as cat /proc/uptime
    • style = full: '05d 03h 59m 27s' or compact: '05 days 03:59:27'
    • days = days or day
    • zerodays = True or False
  • joinlines(lines:list) -> str
    • this is the inverse of str.splitlines()
    • returns list as a string concatenated with LF's
  • Constants: SP, TAB2, TAB = TAB4, LF, CRLF
    • spaces and line feed constants useful in string formats

Installation

pyscriptlib is available at pypi.org

pip install pyscriptlib

Example Usage

See pyscriptlib/example_script.py below

#! /usr/bin/env python3
'''
Usage: python3 /path/to/site-packages/pyscriptlib/example_script.py ~/some/example/dir arg2 arg3
'''

from pathlib import Path
import time
from pyscriptlib import (arg, nargs, shift, env, 
                         sh, shx, shl, shk, sho, shb, shp, shbg,
                         kill, humanize)

def title(descr, code):
    print(f'\n>>> {descr}\n>>> {code}')
    

title('Retrieve first sys.argv or os.environ variable MY_DIR_PATH or None if not present',
      'dir_path = arg(1) or env("MY_DIR_PATH")')
dir_path = arg(1) or env('MY_DIR_PATH')
print(f'{dir_path = }')


title('Get remaining args',
      'remaining_args = nargs(1)')
remaining_args = nargs(2)
print(f'{remaining_args = }')


title('Shift args',
      'removed = shift()); arg1 = arg(1)')
removed = shift()
arg1 = arg(1)
print(f'{removed = }  {arg1 = }')


title('Open the dir_path and verify it is a directory',
      'if Path(dir_path).expanduser().is_dir() else exit(2)')
if dir_path:
    my_dir = Path(dir_path).expanduser()
    if not my_dir.is_dir():
        print(f'Exiting: cannot find {dir_path}')
        exit(2)
else:
    print('usage: ./myscript dir_path') 
    print('       -- or -- ')
    print('       MY_DIR_PATH=~/git/pyscriptlib; ./myscript')
    exit(1)
print(f'{my_dir = }')


title('Capture the stripped output from stdout+stderr',
      'output = sh(cmd)')
output = sh(f'ls -alh {my_dir}')
print(output)


title('Execute the command directly sending output to the terminal',
      'shx(cmd)')
shx(f'tree {my_dir}')


title('Get a list of return values from subprocess.CompletedProcess object -- test with rc == 0',
      'rc, stdout, stderr = shl(cmd); if rc == 0:')
rc, stdout, stderr = shl(f'ls -alh {my_dir}')
if rc == 0:
    print(f'{rc = }\n{stdout = }')
else:
    print(f'{rc = }\n{stderr = }')


title('Get a list of return values from subprocess.CompletedProcess object -- test with boolean is_ok',
      'is_ok, rc, stdout, stderr = shk(cmd); if is_ok:')
is_ok, rc, stdout, stderr = shk(f'ls -alh {my_dir}')
if is_ok:
    print(f'{is_ok = } {rc = }\n{stdout = }')
else:
    print(f'{is_ok = } {rc = }\n{stderr = }')


title('Get the customized subprocess.CompletedProcess object -- test with boolean cp.is_ok',
      'cp = sho(cmd); if cp.is_ok:')
cmd = f'''
cd {my_dir}
grep -r \
    --exclude='*.pyc' \
    --exclude-dir='.git' --exclude-dir='dist' --exclude-dir='*.egg-info' \
    pyscriptlib
'''
print('cmd = ', cmd)
cp = sho(cmd)
if cp.is_ok:
    text = cp.stdout.splitlines()
    print(text)
else:
    print(f'grep failed: {cp.rc = } {cp.stderr = }')
    exit(cp.rc)


title('Create and kill background process with pid',
      'pid = shb(cmd); kill(pid)')
pid = shb(f'echo "hello from background process"; sleep 10')
print(f'{pid = }')
time.sleep(1)
kill(pid)


title('Get the stdout from a background process in a separate thread',
      'proc = shp(cmd); print(proc.stdout)')
proc = shp(f'echo "hello from background process"; sleep 5')
print(proc.stdout) 


title('Get the stdout from a background process in a separate thread',
'''bg = shbg(cmd)
bg.get_stdout(sentinel="proc")
print('waiting for output...')
stdout = bg.join_stdout(timeout=1)
print(stdout)''')

bg = shbg(f'echo -n "hello\nfrom\nbackground\nprocess"; sleep 3')
bg.get_stdout(sentinel='back')
print('waiting for output...')
stdout = bg.join_stdout(timeout=3)
print(stdout)


title('Humanize the uptime for this host',
      'uptimes = sh("cat /proc/uptime"); humanize(uptimes[0]))')
uptimes = sh('cat /proc/uptime')
if uptimes:
    uptime = float(uptimes.split(' ')[0]) 
    print(f'{humanize(uptime) = }')
    print(f'{humanize(uptime, style="full") = }')


```python
#! /usr/bin/env python3
'''
Usage: python3 /path/to/site-packages/pyscriptlib/example_script.py ~/some/example/dir arg2 arg3
'''


from pathlib import Path
import time
from pyscriptlib import (arg, nargs, shift, env, 
                         sh, shx, shl, shk, sho, shb, shp, shbg,
                         kill, humanize)

def title(descr, code):
    print(f'\n>>> {descr}\n>>> {code}')
    

title('Retrieve first sys.argv or os.environ variable MY_DIR_PATH or None if not present',
      'dir_path = arg(1) or env("MY_DIR_PATH")')
dir_path = arg(1) or env('MY_DIR_PATH')
print(f'{dir_path = }')


title('Get remaining args',
      'remaining_args = nargs(1)')
remaining_args = nargs(2)
print(f'{remaining_args = }')


title('Shift args',
      'removed = shift()); arg1 = arg(1)')
removed = shift()
arg1 = arg(1)
print(f'{removed = }  {arg1 = }')


title('Open the dir_path and verify it is a directory',
      'if Path(dir_path).expanduser().is_dir() else exit(2)')
if dir_path:
    my_dir = Path(dir_path).expanduser()
    if not my_dir.is_dir():
        print(f'Exiting: cannot find {dir_path}')
        exit(2)
else:
    print('usage: ./myscript dir_path') 
    print('       -- or -- ')
    print('       MY_DIR_PATH=~/git/pyscriptlib; ./myscript')
    exit(1)
print(f'{my_dir = }')


title('Capture the stripped output from stdout+stderr',
      'output = sh(cmd)')
output = sh(f'ls -alh {my_dir}')
print(output)


title('Execute the command directly sending output to the terminal',
      'shx(cmd)')
shx(f'tree {my_dir}')


title('Get a list of return values from subprocess.CompletedProcess object -- test with rc == 0',
      'rc, stdout, stderr = shl(cmd); if rc == 0:')
rc, stdout, stderr = shl(f'ls -alh {my_dir}')
if rc == 0:
    print(f'{rc = }\n{stdout = }')
else:
    print(f'{rc = }\n{stderr = }')


title('Get a list of return values from subprocess.CompletedProcess object -- test with boolean is_ok',
      'is_ok, rc, stdout, stderr = shk(cmd); if is_ok:')
is_ok, rc, stdout, stderr = shk(f'ls -alh {my_dir}')
if is_ok:
    print(f'{is_ok = } {rc = }\n{stdout = }')
else:
    print(f'{is_ok = } {rc = }\n{stderr = }')


title('Get the customized subprocess.CompletedProcess object -- test with boolean cp.is_ok',
      'cp = sho(cmd); if cp.is_ok:')
cmd = f'''
cd {my_dir}
grep -r \
    --exclude='*.pyc' \
    --exclude-dir='.git' --exclude-dir='dist' --exclude-dir='*.egg-info' \
    pyscriptlib
'''
print('cmd = ', cmd)
cp = sho(cmd)
if cp.is_ok:
    text = cp.stdout.splitlines()
    print(text)
else:
    print(f'grep failed: {cp.rc = } {cp.stderr = }')
    exit(cp.rc)


title('Create and kill background process with pid',
      'pid = shb(cmd); kill(pid)')
pid = shb(f'echo "hello from background process"; sleep 10')
print(f'{pid = }')
time.sleep(1)
kill(pid)


title('Get the stdout from a completed background process',
      'proc = shp(cmd); print(proc.stdout.read())')
proc = shp(f'echo "hello from background process"; sleep 3')
print(proc.stdout.read()) 


title('Get the stdout from a background process in a separate thread',
'''bg = shbg('echo -n "hello\\nfrom\\nbackground\\nprocess"; sleep 3')
bg.get_stdout(sentinel="back")
print('waiting for output...')
stdout = bg.join_stdout(timeout=3)
print(stdout)''')

bg = shbg('echo -n "hello\nfrom\nbackground\nprocess"; sleep 3')
bg.get_stdout(sentinel='back')
print('waiting for output...')
stdout = bg.join_stdout(timeout=3)
print(stdout)


title('Humanize the uptime for this host',
      'uptimes = sh("cat /proc/uptime"); humanize(uptimes[0]))')
uptimes = sh('cat /proc/uptime')
if uptimes:
    uptime = float(uptimes.split(' ')[0]) 
    print(f'{humanize(uptime) = }')
    print(f'{humanize(uptime, style="full") = }')

A Personal Note

I've been using bash for over 35 years (since the days of Bell Labs Unix where it was invented by Stephen Bourne), but I never really felt very confident with its (to me) arcane syntax.

No, I don't mean just the things like this

# I'm not always sure which condition form to use 
# and watch those spaces around the brackets!!!  
# not to mention tests like -x, -n, -z and $? == 0 is success?
if <condition> | [ <condition> ] | [[ <condition> ]] 
then
    <statements>
else 
    <statements> 
fi

# seriously, case );; );; esac anyone?
case <value> in 
    <match> ) <statements> ;;
    <match> ) <statements> ;;
    * ) <statements>;;
esac

But mostly the magic stuff you can do with array notation that is so cryptic it makes my eyes water.

# I'm still not sure what all this means :-(

locations=( "New York" Chicago Atlanta Miami )
for val in ${!locations[@]}
do
   echo "index = ${val} , value = ${locations[$val]}"
done

Sigh, and I thought that perl was noisy ...

So I started looking for a better shell and what I actually found was python. It's a more powerful interpreter with a much more comfortable and much less cryptic syntax than bash.

Along the way, I also considered the python conch shell, but it wants to be a new hybrid language in a REPL, and I don't really want the overhead of another layer of shell even if it is ipython at its heart.

In particular, all I really want to do is easily invoke the bash shell to run useful tools like grep, ls -alh, tree, ip address, et alia, but otherwise I'm happy with python syntax as the medium in a script file.

Unfortunately, the down-side to using python for scripting is that it is still a real programming language -- so creating bash equivalent scripts can get a bit hairy, to say the least.

This is especially true when you start using subprocess.Popen() and its incredibly powerful capabilities to fork any process you want with any options you want. But the price of power can be overwhealming complexity.

And so, I finally realized that all I really need is a concise set of wrapper functions that I can use to embed bash commands in my python code and let them actually do all the heavy lifting using subprocess.Popen() -- and why not throw in some sys and os sugar as well?

And, thus, complexity begat pyscriptlib and here we are.

A Testimonial to pathlib.Path -- Can I have an Amen?

I am also highly impressed with the Path class from the standard pathlib module. The Path object model helps my python scripts become much more concise as I'm often navigating the file system such as Path.home()/'subdir' before I launch some sh*(cmd) that accesses the files.

Ooh, ooh, Mr. Kotter, Mr. Kotter: this is not a typo, rather, it is the coolest use of the class dunder method __truediv__ to create a Path slash(/) join operator that I have seen:

from pathlib import Path

# using the Path slash(/) join operator
file_path = Path.home() / 'subdir' / 'file'

# is the same as using the Path.joinpath() method
file_path = Path.home().joinpath('subdir', 'file')

# and the Path instance has simple direct access methods to the files
if file_path.is_file():
  text = file_path.read_text()

# instead of more complexity using a context manager such as with open():
with open(file_path, 'r') as file:
    text = file.read()

The old way of doing this with os.path was only slightly less painful than subprocess.Popen. I leave that as a heartless exercise for the reader :-).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyscriptlib-2.3.0.tar.gz (13.0 kB view details)

Uploaded Source

Built Distribution

pyscriptlib-2.3.0-py3-none-any.whl (11.4 kB view details)

Uploaded Python 3

File details

Details for the file pyscriptlib-2.3.0.tar.gz.

File metadata

  • Download URL: pyscriptlib-2.3.0.tar.gz
  • Upload date:
  • Size: 13.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.12.1

File hashes

Hashes for pyscriptlib-2.3.0.tar.gz
Algorithm Hash digest
SHA256 abe6854f40c81face785e4c4fd9f0abdc186481361e40063aea5a7bf168a32bd
MD5 0cc89e1014bd3283fce9951aaa5be4ae
BLAKE2b-256 238822fc7e414efb873518dd483749dbf100a77ccc1fd172efb57276d5af749a

See more details on using hashes here.

File details

Details for the file pyscriptlib-2.3.0-py3-none-any.whl.

File metadata

  • Download URL: pyscriptlib-2.3.0-py3-none-any.whl
  • Upload date:
  • Size: 11.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.12.1

File hashes

Hashes for pyscriptlib-2.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 bb6affb5cedb033b7ae643b6aeb6c47eb00bad3f5ae16964686188b0bbab5a5b
MD5 12f32d1fc00b9c2b8db38e0dd9be9967
BLAKE2b-256 33d7f5b412d870e401a6cca50d4919e9a3d3f19160a81133cd3052ea079c3db7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page