Skip to main content

More Streams! Chained function calls

Project description

More Streams!!

Python code is more elegant with method chaining!

Overview

There are two families of "streams" in this library, both are lazy:

  1. ByteStream - a traditional stream of bytes intended to pipe bytes through various byte transformers, like compression, encoding and encyrption.
  2. ObjectStream: An iterator/generator with a number of useful methods.

Example

In this case I am iterating through all files in a tar and parsing them:

results = (
    File("tests/so_queries/so_queries.tar.zst")
    .content()
    .content()
    .exists()
    .utf8()
    .to_str()
    .map(parse)
    .to_list()
)

Each of the steps constructs a generator, and no work is done until the last step

  • File().content() - will unzst and untar the file content to an ObjectStream of file-like objects. It is short form for stream(File().read_bytes()).from_zst().from_tar()
  • The second .content() is applied to each of the file-like objects, returning ByteStream of the content for each
  • .exists() - some of the files (aka directories) in the tar file do not have content, we only include content that exists.
  • .utf8 - convert to a StringStream
  • .to_str - convert to a Python str, we trust the content is not too large
  • .map(parse) - run the parser on each string
  • .to_list() - a "terminator", which executes the chain and returns a Python list with the results

Project Status

Alive and in use, but

  • basic functions missing
  • inefficient - written using generators
  • generators not properly closed

Optional Reading

The method chaining style has two distinct benefits

  • functions are in the order they are applied
  • intermediate values need no temporary variables

The detriments are the same that we find in any declarative language: Incorrect code can be difficult to debug because you can not step through it to isolate the problem. For this reason, the majority of the code in this library is dedicated to validating the links in the function chain before they are run.

Lessons

The function chaining style, called "streams" in Java or "linq" in C#, leans heavly on the strict typed nature of those langauges. This is missing in Python, but type annotations help support this style of programming.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mo-streams-1.312.22362.tar.gz (17.6 kB view details)

Uploaded Source

File details

Details for the file mo-streams-1.312.22362.tar.gz.

File metadata

  • Download URL: mo-streams-1.312.22362.tar.gz
  • Upload date:
  • Size: 17.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.6

File hashes

Hashes for mo-streams-1.312.22362.tar.gz
Algorithm Hash digest
SHA256 24bc24e25e473039e03cce739a36807fd8db31f09349e2fa4edf44bcb583100a
MD5 6e969a8fcf4e604d65562a8fa7b89d93
BLAKE2b-256 030c4ac73e7ecdfc63a914ccc1af2652a68674974a0543bce4004027aad35bc4

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page