Skip to main content

Method chaining for iterables and dictionaries in Python.

Project description

pyochain ⛓️

Functional-style method chaining for Python data structures.

pyochain brings a fluent, declarative API inspired by Rust's Iterator and DataFrame libraries like Polars to your everyday Python iterables and dictionaries.

Manipulate data through composable chains of operations, enhancing readability and reducing boilerplate.

Notice on Stability ⚠️

pyochain is currently in early development (< 1.0), and the API may undergo significant changes multiple times before reaching a stable 1.0 release.

Installation

uv add pyochain

API Reference 📖

The full API reference can be found at: https://outsquarecapital.github.io/pyochain/

Overview

Philosophy

  • Declarative over Imperative: Replace explicit for and while loops with sequences of high-level operations (map, filter, group, join...).
  • Fluent Chaining: Each method transforms the data and returns a new wrapper instance, allowing for seamless chaining.
  • Lazy and Eager: Iter operates lazily for efficiency on large or infinite sequences, while Seq represents materialized sequences for eager operations.
  • 100% Type-safe: Extensive use of generics and overloads ensures type safety and improves developer experience.
  • Documentation-first: Each method is thoroughly documented with clear explanations, and usage examples. Before any commit is made, each docstring is automatically tested to ensure accuracy. This also allows for a convenient experience in IDEs, where developers can easily access documentation with a simple hover of the mouse.
  • Functional paradigm: Design encourages building complex data transformations by composing simple, reusable functions on known buildings blocks, rather than implementing customs classes each time.

Inspirations

  • Rust's language and Rust Iterator Trait: Emulate naming conventions (from_(), into()) and leverage concepts from Rust's powerful iterator traits (method chaining, lazy evaluation) to bring similar expressiveness to Python.
  • Python iterators libraries: Libraries like rolling, cytoolz, and more-itertools provided ideas, inspiration, and implementations for many of the iterator methods.
  • PyFunctional: Although not directly used (because I started writing pyochain before discovering it), also shares similar goals and ideas.

Core Components

Iter[T]

A wrapper for any Iterator or Generator. All operations are lazy, consuming the underlying iterator only when needed.

This allows for efficient processing of large or even infinite sequences.

To create an Iter, you can:

  • Wrap an existing iterator/generator: pc.Iter(my_iterator)
  • Convert any iterable: pc.Iter.from_(my_list)
  • Wrap unpacked values: pc.Iter.from_(1, 2, 3)
  • Use built-in constructors like pc.Iter.from_count() for infinite sequences.

Seq[T]

A wrapper for a Sequence (like a list or tuple), representing an eagerly evaluated collection of data. Seq is useful when you need to store results in memory, access elements by index, or reuse the data multiple times.

It shares many methods with Iter but performs operations immediately. You can switch between lazy and eager evaluation by using my_seq.iter() and my_iter.collect().

Dict[K, V]

A wrapper for a dict, providing a rich, chainable API for dictionary manipulation. It simplifies common tasks like filtering, mapping, and transforming dictionary keys and values.

Key features include:

  • Immutability: Most methods return a new Dict instance, preventing unintended side effects.
  • Nested Data Utilities: Easily work with complex, nested dictionaries using methods like pluck and flatten.
  • Flexible Instantiation: Create a Dict from mappings, iterables of pairs, or even object attributes with Dict.from_object().

Result[T, E]

A type for functions that can fail, inspired by Rust's Result. It represents either a success (Ok[T]) containing a value or an error (Err[E]) containing an error. It forces you to handle potential failures explicitly, leading to more robust code.

Option[T]

A type for values that may be absent, inspired by Rust's Option. It represents either the presence of a value (Some[T]) or its absence (NONE). It provides a safe and expressive way to handle optional values without resorting to None checks everywhere.

Core Piping Methods

All wrappers provide a set of common methods for chaining and data manipulation:

  • into(func, *args, **kwargs): Passes the unwrapped data to func and returns the raw result. This is a terminal operation that ends the chain.
  • apply(func, *args, **kwargs): Passes the unwrapped data to func and re-wraps the result in the same wrapper type for continued chaining.
  • pipe(func, *args, **kwargs): Passes the wrapped instance (self) to func. This allows you to insert custom functions into the chain that operate on the wrapper itself.
  • println(): Prints the unwrapped data to the console for debugging and returns self to continue the chain.
  • inner(): Returns the underlying wrapped data.

Rich Lazy Iteration (Iter)

Leverage dozens of methods inspired by Rust's Iterator, itertools, cytoolz, and more-itertools.

import pyochain as pc

result = (
    pc.Iter.from_count(1)  # Infinite iterator: 1, 2, 3, ...
    .filter(lambda x: x % 2 != 0)  # Keep odd numbers
    .map(lambda x: x * x)  # Square them
    .take(5)  # Take the first 5
    .into(list)  # Consume into a list
)
# result: [1, 9, 25, 49, 81]

Type-Safe Error Handling (Result and Option)

Write robust code by handling potential failures explicitly.

import pyochain as pc

def divide(a: int, b: int) -> pc.Result[float, str]:
    if b == 0:
        return pc.Err("Cannot divide by zero")
    return pc.Ok(a / b)

# --- With Result ---
res1 = divide(10, 2)  # Ok(5.0)
res2 = divide(10, 0)  # Err("Cannot divide by zero")

# Safely unwrap or provide a default
value = res2.unwrap_or(0.0)  # 0.0

# Map over a successful result
squared = res1.map(lambda x: x * x)  # Ok(25.0)

# --- With Option ---
def find_user(user_id: int) -> pc.Option[str]:
    users = {1: "Alice", 2: "Bob"}
    return pc.Some(users.get(user_id)) if user_id in users else pc.NONE

user = find_user(1).map(str.upper).unwrap_or("Not Found")  # "ALICE"
not_found = find_user(3).unwrap_or("Not Found")  # "Not Found"

Typing enforcement

Each method and class make extensive use of generics, type hints, and overloads (when necessary) to ensure type safety and improve developer experience.

Since there's much less need for intermediate variables, the developper don't have to annotate them as much, whilst still keeping a type-safe codebase.

Convenience mappers: itr and struct

Operate on iterables of iterables or iterables of dicts without leaving the chain.

import pyochain as pc

nested = pc.Iter.from_([[1, 2, 3], [4, 5]])
totals = nested.itr(lambda it: it.sum()).into(list)
# [6, 9]

records = pc.Iter.from_(
    [
        {"name": "Alice", "age": 30},
        {"name": "Bob", "age": 25},
    ]
)
names = records.struct(lambda d: d.pluck("name").unwrap()).into(list)
# ['Alice', 'Bob']

Key Dependencies and credits

Most of the computations are done with implementations from the cytoolz, more-itertools, and rolling libraries.

An extensive use of the itertools stdlib module is also to be noted.

pyochain acts as a unifying API layer over these powerful tools.

https://github.com/pytoolz/cytoolz

https://github.com/more-itertools/more-itertools

https://github.com/ajcr/rolling

The stubs used for the developpement, made by the maintainer of pyochain, can be found here:

https://github.com/py-stubs/cytoolz-stubs


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyochain-0.5.5.tar.gz (50.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pyochain-0.5.5-py3-none-any.whl (67.6 kB view details)

Uploaded Python 3

File details

Details for the file pyochain-0.5.5.tar.gz.

File metadata

  • Download URL: pyochain-0.5.5.tar.gz
  • Upload date:
  • Size: 50.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.9.13 {"installer":{"name":"uv","version":"0.9.13"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for pyochain-0.5.5.tar.gz
Algorithm Hash digest
SHA256 d013f74b60622a037264c47b58b31c03448498924a93e5a33cf83e140b436d12
MD5 bb0128f2780b7b44f607490b513add6e
BLAKE2b-256 3f9b3c5d1a355df2a219ae069ad61eba38f61baafa9d42e905519063219f5251

See more details on using hashes here.

File details

Details for the file pyochain-0.5.5-py3-none-any.whl.

File metadata

  • Download URL: pyochain-0.5.5-py3-none-any.whl
  • Upload date:
  • Size: 67.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.9.13 {"installer":{"name":"uv","version":"0.9.13"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for pyochain-0.5.5-py3-none-any.whl
Algorithm Hash digest
SHA256 de5cc52c41207d1f24ca5c6633bcbc390a86b70e02cd984c331dc8294dbf3314
MD5 5627c599edd955dbc359c3fb247450b9
BLAKE2b-256 78e70801e7b690efb12743818ecffe41480075c2c76a7462a9d9a93799e85d94

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page