Skip to main content

Python I/O pipe utilities

Project description

https://imgur.com/Q9Lv0xo.png
https://travis-ci.org/dokipen/tubing.svg?branch=master https://coveralls.io/repos/github/dokipen/tubing/badge.svg?branch=master https://img.shields.io/pypi/v/tubing.svg https://img.shields.io/pypi/pyversions/tubing.svg https://img.shields.io/pypi/dd/tubing.svg https://img.shields.io/pypi/l/tubing.svg https://img.shields.io/pypi/wheel/tubing.svg

Tubing is a Python I/O library. What makes tubing so freakin’ cool is the gross abuse of the bitwise OR operator (|). Have you ever been writing python code and thought to yourself, “Man, this is great, but I really wish it was a little more like bash.” Welp, we’ve made python a little more like bash.If you are a super lame nerd-kid, you can replace any of the bitwise ORs with the pipe() function and pray we don’t overload any other operators in future versions. If you do avoid the bitwise OR, we don’t know if we want to hang out with you.

Tubing is pretty bare-bones at the moment. We’ve tried to make it easy to add your own functionality. Hopefully you find it not all that unpleasant. There are three sections below for adding sources, pipes and sink. If you do make some additions, think about committing them back upstream. We’d love to have a full suite of tools.

Now for an example to show the true power of this fully operational I/O library:

from tubing import sources, pipes, sinks

objs = [
    dict(
        name="Bob Corsaro",
        birthdate="08/03/1977",
        alignment="evil",
    ),
    dict(
        name="Tom Brady",
        birthdate="08/03/1977",
        alignment="good",
    ),
]
sources.Objects(objs) \
     | pipes.JSONSerializer() \
     | pipes.Joined(by=b"\n") \
     | pipes.Gzip() \
     | sinks.File("output.gz", "wb")

Then in our old friend bash:

$ zcat output.gz
{"alignment": "evil", "birthdate": "08/03/1977", "name": "Bob Corsaro"}
{"alignment": "good", "birthdate": "08/03/1977", "name": "Tom Brady"}
$

We need to seriously think about renaming pipes to tubes.. man, what was I thinking?

Sources

To make your own source, create a Reader class with the following interface:

class MyReader(object):
    """
    MyReader returns count instances of data.
    """
    def __init__(self, data="hello world\n", count=10):
        self.data = data
        self.count = count

    def read(self, amt):
        """
        read(amt) returns $amt of data and a boolean indicating EOF.
        """
        if not amt:
            amt = self.count
        r = self.data * min(amt, self.count)
        self.count -= amt
        return r, self.count <= 0

The important thing to remember is that your read function should return an iterable of units of data, not a single piece of data. Then wrap your reader in the loving embrace of MakeSource:

from tubing import sources

MySource = sources.MakeSource(MyReader)

Now it can be used in a pipeline!:

from __future__ import print_function

from tubing import pipes
sink = MySource(data="goodbye cruel world!", count=1) \
     | pipes.Joined(by=b"\n") \
     | sinks.Bytes()

print(sinks.result)
# Output: goodby cruel world!

Pipes

Making your own pipe is a lot more fun, trust me. First make a Transformer:

class OptimusPrime(object):
    def transform(self, chunk):
        return list(reversed(chunk))

chunk is an iterable with a len() of whatever type of data the stream is working with. In Transformers, you don’t need to worry about buffer size or closing or exception, just transform an iterable to another iterable. There are lots of examples in pipes.py.

Next give Optimus Prime a hug:

from tubing import pipes

AllMixedUp = pipes.MakePipe(OptimusPrime)

Ready to mix up some data?:

from __future__ import print_function

import json
from tubing import sources, sinks

objs = [{"number": i} for i in range(0, 10)]

sink = sources.Objects(objs) \
     | AllMixedUp(chunk_size=2) \
     | sinks.Objects()

print(json.dumps(sink))
# Output: [{"number": 1}, {"number": 0}, {"number": 3}, {"number": 2}, {"number": 5}, {"number": 4}, {"number": 7}, {"number": 6}, {"number": 9}, {"number": 8}]

Sinks

Really getting tired of making documentation… Maybe I’ll finish later. I have real work to do.

Well.. I’m this far, let’s just push through:

from __future__ import print_function
from tubing import sources, pipes, sinks

class StdoutWriter(object):
    def write(self, chunk):
        for part in chunk:
            print(part)

    def close(self):
        # this function is optional
        print("That's all folks!")

    def abort(self):
        # this is also optional
        print("Something terrible has occurred.")

Debugger = sinks.MakeSink(StdoutWriter)

objs = [{"number": i} for i in range(0, 10)]

sink = sources.Objects(objs) \
     | AllMixedUp(chunk_size=2) \
     | pipes.JSONSerializer() \
     | pipes.Joined(by=b"\n") \
     | Debugger()
# Output:
#{"number": 1}
#{"number": 0}
#{"number": 3}
#{"number": 2}
#{"number": 5}
#{"number": 4}
#{"number": 7}
#{"number": 6}
#{"number": 9}
#{"number": 8}
#That's all folks!

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tubing-0.0.1.post94.tar.gz (9.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tubing-0.0.1.post94-py2.py3-none-any.whl (13.5 kB view details)

Uploaded Python 2Python 3

File details

Details for the file tubing-0.0.1.post94.tar.gz.

File metadata

  • Download URL: tubing-0.0.1.post94.tar.gz
  • Upload date:
  • Size: 9.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for tubing-0.0.1.post94.tar.gz
Algorithm Hash digest
SHA256 31af4816c190df7289f0dbe0f7bc1646ae4e1c125a624de493c42c274cf442ce
MD5 4839878d02a0d8dc98cf59e305f66fa7
BLAKE2b-256 e00d34fb959e79e288bb477a2ee01881782033f725f33ef557044beda58c1472

See more details on using hashes here.

File details

Details for the file tubing-0.0.1.post94-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for tubing-0.0.1.post94-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 e630ab3cb21bf725edd0d3cd5601c2cd79fd8962ba6a3788522a1b5212d0dd5f
MD5 9bb963c72216fcad7f96b103e714211a
BLAKE2b-256 90d84c2ed0e9d25299b95778943ede0770f5a8c7101f48adbdfa444db9c16e4a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page