Skip to main content

A pythonic dplyr clone

Project description

dppd

Build status: Build Status
Documentation https://dppd.readthedocs.io/en/latest/

| Code style | Code style: black](https://github.com/ambv/black)

Dppd is a python dplyr clone.

It allows you to use code like this

   >>> from plotnine.data import mtcars
   >>> from dppd import dppd
   >>> dp, X = dppd()
   >>> dp(mtcars).mutate(kwh = X.hp * 0.74).groupby('cyl').filter_by(X.kwh.rank() < 2).ungroup().pd
      cyl              name   mpg   disp   hp  drat     wt   qsec  vs  am  gear  carb     kwh
   5     6           Valiant  18.1  225.0  105  2.76  3.460  20.22   1   0     3     1   77.70
   18    4       Honda Civic  30.4   75.7   52  4.93  1.615  18.52   1   1     4     2   38.48
   21    8  Dodge Challenger  15.5  318.0  150  2.76  3.520  16.87   0   0     3     2  111.00
   22    8       AMC Javelin  15.2  304.0  150  3.15  3.435  17.30   0   0     3     2  111.00

Briefly, it uses a data-manipulater instance (dp above) together with a proxied reference to the latest created DataFrame (the X above) to achive for pandas what dpylr's non-standard-evaluation based verbs does for R.

Please see our full documentation at https://dppd.readthedocs.io/en/latest/ for more details and a list of the supported verbs.

Also check out dppd_plotnine

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for dppd, version 0.22
Filename, size File type Python version Upload date Hashes
Filename, size dppd-0.22.tar.gz (57.9 kB) File type Source Python version None Upload date Hashes View

Supported by

Pingdom Pingdom Monitoring Google Google Object Storage and Download Analytics Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page