Skip to main content

A pythonic dplyr clone

Project description

dppd

Build status: Build Status
Documentation https://dppd.readthedocs.io/en/latest/

| Code style | Code style: black](https://github.com/ambv/black)

Dppd is a python dplyr clone.

It allows you to use code like this

   >>> from plotnine.data import mtcars
   >>> from dppd import dppd
   >>> dp, X = dppd()
   >>> dp(mtcars).mutate(kwh = X.hp * 0.74).groupby('cyl').filter_by(X.kwh.rank() < 2).ungroup().pd
      cyl              name   mpg   disp   hp  drat     wt   qsec  vs  am  gear  carb     kwh
   5     6           Valiant  18.1  225.0  105  2.76  3.460  20.22   1   0     3     1   77.70
   18    4       Honda Civic  30.4   75.7   52  4.93  1.615  18.52   1   1     4     2   38.48
   21    8  Dodge Challenger  15.5  318.0  150  2.76  3.520  16.87   0   0     3     2  111.00
   22    8       AMC Javelin  15.2  304.0  150  3.15  3.435  17.30   0   0     3     2  111.00

Briefly, it uses a data-manipulater instance (dp above) together with a proxied reference to the latest created DataFrame (the X above) to achive for pandas what dpylr's non-standard-evaluation based verbs does for R.

Please see our full documentation at https://dppd.readthedocs.io/en/latest/ for more details and a list of the supported verbs.

Also check out dppd_plotnine

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
dppd-0.18.tar.gz (57.2 kB) Copy SHA256 hash SHA256 Source None

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page