A framework for data piping in python

Project description

pipda

Docs

A framework for data piping in Python.

Inspired by siuba, dfply, plydata, and dplython. Provides simple yet powerful APIs to mimic dplyr and tidyr in Python.

API | Changelog | Documentation

Installation

pip install -U pipda

Usage

Verbs

A verb is pipeable (able to be called like data >> verb(...))
A verb is dispatchable by the type of its first argument
A verb evaluates other arguments using the first one
A verb is passing down the context if not specified in the arguments

import pandas as pd
from pipda import (
    register_verb,
    register_func,
    register_operator,
    evaluate_expr,
    Operator,
    Symbolic,
    Context
)

f = Symbolic()

df = pd.DataFrame({
    'x': [0, 1, 2, 3],
    'y': ['zero', 'one', 'two', 'three']
})

df

#      x    y
# 0    0    zero
# 1    1    one
# 2    2    two
# 3    3    three

@register_verb(pd.DataFrame)
def head(data, n=5):
    return data.head(n)

df >> head(2)
#      x    y
# 0    0    zero
# 1    1    one

@register_verb(pd.DataFrame, context=Context.EVAL)
def mutate(data, **kwargs):
    data = data.copy()
    for key, val in kwargs.items():
        data[key] = val
    return data

df >> mutate(z=1)
#    x      y  z
# 0  0   zero  1
# 1  1    one  1
# 2  2    two  1
# 3  3  three  1

df >> mutate(z=f.x)
#    x      y  z
# 0  0   zero  0
# 1  1    one  1
# 2  2    two  2
# 3  3  three  3

Functions used as verb arguments

# verb can be used as an argument passed to another verb
# dependent=True makes the `data` argument invisible while calling
@register_verb(pd.DataFrame, context=Context.EVAL, dependent=True)
def if_else(data, cond, true, false):
    cond.loc[cond.isin([True]), ] = true
    cond.loc[cond.isin([False]), ] = false
    return cond

# The function is then also a singledispatch generic function

df >> mutate(z=if_else(f.x>1, 20, 10))
#    x      y   z
# 0  0   zero  10
# 1  1    one  10
# 2  2    two  20
# 3  3  three  20

# function without data argument
@register_func
def length(strings):
    return [len(s) for s in strings]

df >> mutate(z=length(f.y))

#    x     y    z
# 0  0  zero    4
# 1  1   one    3
# 2  2   two    3
# 3  3 three    5

Context

The context defines how a reference (f.A, f['A'], f.A.B) is evaluated

@register_verb(pd.DataFrame, context=Context.SELECT)
def select(df, *columns):
    return df[list(columns)]

df >> select(f.x, f.y)
#    x     y
# 0  0  zero
# 1  1   one
# 2  2   two
# 3  3 three

How it works

data %>% verb(arg1, ..., key1=kwarg1, ...)

The above is a typical dplyr/tidyr data piping syntax.

The Python counterpart is:

data >> verb(arg1, ..., key1=kwarg1, ...)

To implement this, execution of the verb must be deferred by turning it into a VerbCall object that holds the function and its arguments. The VerbCall is not evaluated until data is piped in via >>. This detection is made possible by the executing package, which inspects the AST to determine whether a function call appears on the right-hand side of a pipe operator.

Arguments that reference columns of the data must also be deferred. For example, in dplyr (R):

data %>% mutate(z = a)

This adds a column z with values from column a. In Python, the equivalent is:

data >> mutate(z=f.a)

Here f.a is a Reference object that captures the column name without immediately fetching the data.

The Symbolic object f acts as a proxy, chaining attribute/item accesses and operator expressions into a single Expression tree. That tree is later evaluated when data and context become available.

Documentation

https://pwwang.github.io/pipda/

See datar for real-world usage.

Project details

Release history Release notifications | RSS feed

0.14.1

May 4, 2026

This version

0.14.0

Apr 30, 2026

0.13.2

Apr 15, 2026

0.13.1

Oct 10, 2023

0.13.0

Oct 5, 2023

0.12.0

Apr 13, 2023

0.11.1

Jan 18, 2023

0.11.0

Dec 8, 2022

0.10.0

Dec 1, 2022

0.9.0

Oct 28, 2022

0.8.2

Oct 17, 2022

0.8.1

Oct 15, 2022

0.8.0

Oct 8, 2022

0.7.6

Oct 6, 2022

0.7.5

Oct 5, 2022

0.7.4

Oct 5, 2022

0.7.3

Sep 23, 2022

0.7.2

Sep 20, 2022

0.7.1

Sep 13, 2022

0.7.0

Sep 4, 2022

0.6.0

May 13, 2022

0.5.9

Mar 30, 2022

0.5.8

Mar 17, 2022

0.5.7

Mar 6, 2022

0.5.6

Mar 2, 2022

0.5.5

Mar 1, 2022

0.5.4

Mar 1, 2022

0.5.3

Feb 17, 2022

0.5.2

Feb 15, 2022

0.5.1

Feb 14, 2022

0.5.0

Feb 12, 2022

0.4.5

Aug 4, 2021

0.4.4

Aug 3, 2021

0.4.3

Jul 27, 2021

0.4.2

Jul 16, 2021

0.4.1

Jul 13, 2021

0.4.0

Jul 7, 2021

0.3.0

Jul 1, 2021

0.2.9

Jun 21, 2021

0.2.8

Jun 15, 2021

0.2.7

Jun 11, 2021

0.2.6

May 28, 2021

0.2.5

May 18, 2021

0.2.4

Apr 29, 2021

0.2.3

Apr 10, 2021

0.2.2

Apr 7, 2021

0.2.1

Apr 6, 2021

0.2.0

Mar 30, 2021

0.1.5

Mar 13, 2021

0.1.4

Mar 5, 2021

0.1.3

Mar 2, 2021

0.1.2

Mar 1, 2021

0.1.1

Feb 28, 2021

0.1.0

Feb 17, 2021

0.0.6

Dec 5, 2020

0.0.4

Dec 1, 2020

0.0.3

Nov 30, 2020

0.0.1

Nov 30, 2020

0.0.0

Nov 27, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pipda-0.14.0.tar.gz (150.6 kB view details)

Uploaded Apr 30, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pipda-0.14.0-py3-none-any.whl (21.1 kB view details)

Uploaded Apr 30, 2026 Python 3

File details

Details for the file pipda-0.14.0.tar.gz.

File metadata

Download URL: pipda-0.14.0.tar.gz
Upload date: Apr 30, 2026
Size: 150.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.8 {"installer":{"name":"uv","version":"0.11.8","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for pipda-0.14.0.tar.gz
Algorithm	Hash digest
SHA256	`03d2f4e9e9e24ed3976342205d6f64c014918fc26acc0b5d9f496a5192e393eb`
MD5	`af7d120a2d4f27efd07764ce6483a105`
BLAKE2b-256	`789ab3a6deb309a73ee978a02f260284dbdce7a94b76a528278dac5cc1f2d4a9`

See more details on using hashes here.

File details

Details for the file pipda-0.14.0-py3-none-any.whl.

File metadata

Download URL: pipda-0.14.0-py3-none-any.whl
Upload date: Apr 30, 2026
Size: 21.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.8 {"installer":{"name":"uv","version":"0.11.8","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for pipda-0.14.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cf16b7d67fc52d5cb1be16688fee41886ffc8044df73a4a4a8fb4b7709c9f3d1`
MD5	`f083211b9867aa7756a3b61c1dd9381a`
BLAKE2b-256	`19405e34e34d38f1a3f5ba013fdae3789fc2fd3985ac1cb87da6153bb66235d2`

See more details on using hashes here.

pipda 0.14.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

pipda

Installation

Usage

Verbs

Functions used as verb arguments

Context

How it works

Documentation

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes