Skip to main content

Simple, expressive pipeline syntax to transform and manipulate data with ease

Project description

viper

Simple, expressive pipeline syntax to transform and manipulate data with ease

Overview

viper is a Python package that provides a simple, expressive way to work with data. It allows you to easily manipulate and transform data using a pipeline syntax similar to that of dplyr.

Pipelining your DataFrame manipulation operations offers several benefits:

  • improved code readability (no need to 'comment the what')
  • no need to save intermediate dataframes
  • ability to chain a long sequence of operations in a single command
  • thinking of coding as a series of transformations between the input and the desired output can improve the design and make it less coupled

Docs

Complete documentation and reference are available on the package's site.

Quick Start

Installation:

pip install viper

Here is an example of how to use viper to analyze the famed mtcars dataset.

We want to find:

  • the average consumption, expressed in Miles/(US) gallon
  • the average power

Furthermore:

  • only consider those cars that weigh more than 2000lbs
  • group the results by the number of cylinders and number of gears
  • arrange in descending orders by the grouping variables
from viper.main import *
from viper.data import mtcars

pipeline(
    mtcars,
    rename(
        "hp = power",
        "mpg = consumption",
    ),
    mutate(
        consumption=lambda r: 1 / r["consumption"]
    ),
    filter(
        lambda r: r["wt"] > 2
    ),
    group_by("cyl", "gear"),
    summarize(
        "power = mean()",
        "consumption = mean()"
    ),
    arrange(
        "cyl desc",
        "gear desc"
    ),
)
#                power  consumption
# cyl gear
# 8   5     299.500000     0.064979
#     3     194.166667     0.068824
# 6   5     175.000000     0.050761
#     4     116.500000     0.050875
#     3     107.500000     0.050989
# 4   5      91.000000     0.038462
#     4      85.000000     0.041259
#     3      97.000000     0.046512

Here you can find more examples, particularly on joins.

Roadmap

The future development of the package will probably focus on:

  • adding pivot_longerand pivot_wider functions
  • adding more join_* functions

Contributions

You are welcome to contribute to the project or open issues if you have any ideas.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

viper_test-0.0.1.tar.gz (5.7 kB view details)

Uploaded Source

Built Distribution

viper_test-0.0.1-py3-none-any.whl (6.8 kB view details)

Uploaded Python 3

File details

Details for the file viper_test-0.0.1.tar.gz.

File metadata

  • Download URL: viper_test-0.0.1.tar.gz
  • Upload date:
  • Size: 5.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for viper_test-0.0.1.tar.gz
Algorithm Hash digest
SHA256 6da2f37c227ca7bfa43267079ed988802f614359c407b6b8d32b4c07c39a2022
MD5 be49efe263e40f64de7ed4f9b6f715eb
BLAKE2b-256 d4bf4c914db66c566181077e4455d2e11150aee7e068a22aa91706412b67460c

See more details on using hashes here.

File details

Details for the file viper_test-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: viper_test-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 6.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for viper_test-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 091c8eb4647deb2197fbeedb1f0be7fa346ddeb0b39250bb2b7e8ba09e758af4
MD5 d6b7f89bf9020066f9896a5bf2e68b71
BLAKE2b-256 b74f4fe2c31b8d6291222b8a74b4e1dd33548ed8a2ae5bdb2deb8c09590e7ceb

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page