simple commands execution pipeline, coupled with unified command-line-interface entry-point
Project description
simppl
Package for writing simple command-line pipelines, and organized command-line toolboxes.
The package is composed of two separate but intertwined modules:
- simple_pipeline - aids writing command-line pipelines with minimal overhead. Can be integrated into any python script.
- cli - aids converting a collection of cli python scripts into an organized toolbox cli
These two modules are packed together, and can be naturally used together. Each module has functionalities that use the other.
simple_pipeline
The simple_pipeline module defines the SimplePipeline class.
SimplePipeline conveniently enables turning a python cli script (executed from terminal) into a pipeline of os commands.
- It enables running os commands sequentially / concurrently.
- Uses multiprocessing to easily imple:wqment scatter-gather pattern
- Each command / commands-batch is given an index.
- The user can run a sub-sequence of commands by specifying -fc (first_command) and -lc (last_command) flags.
- Has option to dry_run the pipeline using -d flag.
- Each command is printed before execution, and is also optionally timed.
- outputs/errors from sub-commands are collected and logged.
Using simple_pipeline
The simplest usage will look like this:
from simppl.simple_pipeline import SimplePipeline
sp = SimplePipeline(start=0, end=100):
sp.print_and_run('<YOUR_FIRST_OS_COMMAND>')
sp.print_and_run('<YOUR_SECOND_OS_COMMAND>')
To run multiple commands concurrently use:
commands = ['<YOUR_FIRST_OS_COMMAND>', '<YOUR_SECOND_OS_COMMAND>']
max_number_of_processes = 4
sp.run_parallel(commands, max_number_of_processes)
Finally if your project uses the cli module, you can run directly another command_line_tool as part of a pipeline. The other tool will be run from the same process, but it will appear from the logs as another command in the pipeline. This enables smoother debugging and refactoring of tools calling other tools.
from example_module import example_tool
sp.print_and_run_clt(example_tool.run, ['first_number', 'second_nmber'],
{'-key1': 'val1', '-key2': 'val2'},
{'--flag'})
Note that in order to see the commands printed, you will need to configure logging. See example_module/logging_config.ini for example.
cli
cli enables turning a collection of python executable scripts into a unified cli.
- Creates a single entrypoint for running the command-line tools
- Standardized tool development and documentation
- Adds a manual printout listing all available tools and packages with minimal development overhead
Using cli - developer side:
- example_module gives an example of how to use CommandLineInterface in your project
- requirements:
- main.py - define toolbox logo, constructs and runs the CommandLineInterface.
- init.py - set logging configuration
cli supports two modes of operation:
- explicit-tool-loading:
- tools list is explicitly passed as argument in the constructor.
- good for project with few changes in tool content.
- should be used in projects where runtime overhead is critical.
- automatic-tool-loading:
- tools are annotated and dynamically searched for in project files.
- good for projects with many changes in tool content / many collaborators
- easy tool addition - tool addition/removal don't require touching the toolbox main module.
- adds runtime overhead before every tool execution (depends on project sources size)
Explicit tool loading mode:
import package1.module1
import package1.modele2
import pakage2.module1
if __name__ == '__main__':
modules_list = [package1.module1, package1.module2, package2.module1]
cli = CommandLineInterface(__file__, ascii_logo, modules_list)
cli.run(sys.argv)
Automatic tool loading mode:
if __name__ == '__main__':
cli = CommandLineInterface(__file__, ascii_logo)
cli.run(sys.argv)
This mode requires to annotate tool 'run' methods with command_line_tool decorator
Defining a script as command_line_tool:
from simppl.cli import command_line_tool
@command_line_tool
def run(argv):
"""
Tool description that will appear in main man printout
"""
# Do something here using any python code
Using cli - user side
Printing manual (run the package with no arguments):
python -m your_toolbox_package_name
Where your_toolbox_package_name is the name of the folder containing main.py
Running a specific tool:
python -m your_toolbox_package_name tool_name <tool_args>
Where tool_name is the name of the py file which contains the @command_line_tool definition
Examples
Command-line-tool example:
- See example_module/add_two_numbers.py
python -m example_module add_two_numbers 5 6
- Should print 11.0 to stdout
Example for running command-line-tool using SimplePipeline
python -m example_module analyze_file_pipeline resources/analyze_file_pipeline_input.txt test_outputs
- Should print the following (except date-time) to stdout:
python -m <module_name> analyze_file_pipeline resources/analyze_file_pipeline_input.txt test_outputs
2020-09-11 14:31:05,639 - analyze_file_pipeline - INFO - 1) wc resources/analyze_file_pipeline_input.txt > test_outputs/wc.txt
2020-09-11 14:31:05,643 - analyze_file_pipeline - INFO - Time elapsed wc: 0 s
2020-09-11 14:31:05,643 - analyze_file_pipeline - INFO - 2) ls -l resources/analyze_file_pipeline_input.txt > test_outputs/ls.txt
2020-09-11 14:31:05,648 - analyze_file_pipeline - INFO - Time elapsed ls: 0 s
2020-09-11 14:31:05,649 - analyze_file_pipeline - INFO - 3) sed 's/\s/\n/g' resources/analyze_file_pipeline_input.txt | sort | uniq -c | sort -n > test_outputs/word_count.txt
2020-09-11 14:31:05,653 - analyze_file_pipeline - INFO - Time elapsed sed: 0 s
Distribution
Distribution to pypi was done by following this manual: https://packaging.python.org/tutorials/packaging-projects/
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file simppl-1.0.7.tar.gz
.
File metadata
- Download URL: simppl-1.0.7.tar.gz
- Upload date:
- Size: 13.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7c5401e6061c3504d29fc06c9fc130141a983d608a9af8363c8817c773403e8f |
|
MD5 | 8823f2dc037b392038a88d0ff51a49b3 |
|
BLAKE2b-256 | 67cce99fff1595ba3b544282fedc3749303979baf9a1531a870abcd02102902f |
File details
Details for the file simppl-1.0.7-py3-none-any.whl
.
File metadata
- Download URL: simppl-1.0.7-py3-none-any.whl
- Upload date:
- Size: 14.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c4b940a2b91339b91e1590f4988fc42d3c36048efabb7b82fcbe1dfc186c7bd9 |
|
MD5 | 379ea32ba3e99bc0cc735b37e4f999ef |
|
BLAKE2b-256 | 74323b5aa5a8a8d6311fdbf8ab0de4d52fbbb15ae48fcbf0d555d61542b7f6ee |