streamline
CLI tool for doing async tasks and transformations
The goal of this project is to make data accessible and actionable on the CLI. Streamline does this by implementing the utility functions necessary for taking an input stream and parallelizing work done on it.
The sequence of operations in an invocation:
- The "Generator" object loads entries from a source data stream (usually stdin).
- The "Streamers" selected are sequentially given each input value and are exepcted to filter and make modifications to those values and yield them for the next streamer.
- The "Consumer" object takes the entries yielded from the last streamer and usually outputs them for storage or viewing (usually stdout).
Installation
- Requires Python 3.6+ and pip
pip install streamline
Guide
The simplest call specifies no streaming operations; it just reads from stdin and writes exactly what it read to stdout:
$ printf "foo\nbar" | streamline
foo
bar
By default streamline takes input from stdin and writes to stdout. This is very flexible, as it makes the tool composable with other CLI tools. However, you can also use the --input and --output flags to read from and write to files. Let's assume that you have a file with the same data you just sent in with printf:
$ streamline --input my_source_file.txt --output my_target_file.txt
$ cat my_target_file.txt
foo
bar
Now let's do something a little less useless. Let's use the "shell" streamer to execute a shell command for each entry and check whether each host is listening for HTTPS traffic:
$ printf "www.google.com\nslashdot.org" | streamline -s shell -- "nc -zv {value} 443"
{"stdout": "", "stderr": "Connection to www.google.com 443 port [tcp/https] succeeded!\n", "exit_code": 0}
{"stdout": "", "stderr": "Connection to slashdot.org 443 port [tcp/https] succeeded!\n", "exit_code": 0}
Streamline modules aim to provide all the useful information in object form, so output can then be customized with other streaming modules. For example, to take the above output and get just the exit code that tells us whether the port is open, we can add the --headers option to prefix each output with the original input and the --extract exit_code option to set the value to the exit_code property of each result:
$ printf "www.google.com\nslashdot.org" | streamline --extract exit_code --headers -s shell -- "nc -zv {value} 443"
www.google.com: 0
slashdot.org: 0
Everything before the -- is an option for the streamline command, and options after that are for the particular module we're using (shell in this case).
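For instance, with the py module listed below, everything after the -- (here, a Python expression) is passed to that module rather than to streamline itself, so each value is replaced by the expression's result, which should give:

$ printf "foo\nbar" | streamline -s py -- "value.upper()"
FOO
BAR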
Built-in Modules
There are many modules available that do asynchronous jobs and transformations on input. To see all available modules, use the main help option to list them with examples:
$ streamline --help
::extract::
Description: Set each value to a particular property of the current result (chosen with --selector)
Example: streamline -s extract -- --selector exit_code
::py::
Description: Translate each value by assigning it to the result of a python expression
Example: streamline -s py -- "value.upper()"
::pyfilter::
Description: Filter out values that don't have a truthy result for a particular Python expression
Example: streamline -s pyfilter -- "'foobar' in value"
::truthy::
Description: Filter out values that are not truthy
Example: streamline -s truthy --
::noop::
Description: No operation. Just for testing.
Example: streamline -s noop --
::split::
Description: Take any values that are arrays and treat each element of the array as a separate input
Example: streamline -s split --
::breakdown::
Description: Show a report of how many input values ended up with a particular result value
Example: streamline -s breakdown --
::headers::
Description: Force each value to a string and prefix each with the original input value
Example: streamline -s headers --
::filter_out_errors::
Description: Filter out any entries that have produced an error
Example: streamline -s filter_out_errors --
::errors::
Description: Use the latest error on the entry as the value
Example: streamline -s errors --
::buffer::
Description: Hold entries in memory until a certain number is reached (give no args to buffer all)
Example: streamline -s buffer -- --buffer 20
::http::
Description: Use a template to execute an HTTP request for each value
Example: streamline -s http -- "https://{value}/"
::ssh::
Description: Treat each value as a host to connect to. SSH in and run a command returning the output
Example: streamline -s ssh -- "uptime"
::shell::
Description: Run a shell command for each value
Example: streamline -s shell -- "nc -zv {value} 22"
::scp::
Description: Treat each value as a host to connect to. Copy a file to or from this host
Example: streamline -s scp -- "/tmp/file.txt" "{value}:/tmp/file.txt"
::sleep::
Description: Sleep for each entry making no change to its value
Example: streamline -s sleep --
To get the available options for a particular module, run the following (substituting the module you're interested in for "http"):
streamline -s http --help
Technical Vocabulary
- Entry: A small wrapper around a value being passed along through the stream. Commonly a single line of input.
- Generator: An asynchronous generator function that takes no input and yields Entry objects.
- Executor: An asynchronous function that takes a single value and returns a new value. Usually some unit of work is done and the result of that work is returned as the new value.
- Streamer: An asynchronous generator function that takes as an argument an asynchronous source iterable that yields Entry objects. Commonly a streamer does some manipulation of each Entry it gets from the source iterable and then sets a new value on the entry.
- Consumer: An asynchronous function that reads all entries of an asynchronous source iterable. Usually this function writes to some output (such as stdout).
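Extending the sketch near the top of this document with the Entry wrapper and an Executor (again, illustrative names only, not streamline's actual classes):

import asyncio

class Entry:
    # Wraps one value (commonly a single line of input) as it moves through the stream.
    def __init__(self, value):
        self.original_value = value
        self.value = value

async def generator():
    # Generator: takes no input, yields Entry objects.
    for value in ("foo", "bar"):
        yield Entry(value)

async def executor(value):
    # Executor: does one unit of work on a single value and returns the new value.
    return value.upper()

async def streamer(source):
    # Streamer: iterates a source of entries, sets a new value on each, and yields it on.
    async for entry in source:
        entry.value = await executor(entry.value)
        yield entry

async def consumer(source):
    # Consumer: reads every entry and writes it to some output (here, stdout).
    async for entry in source:
        print(entry.value)

asyncio.run(consumer(streamer(generator())))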