A pythonic dplyr clone

dppd

Documentation: https://dppd.readthedocs.io/en/latest/

Code style: black (https://github.com/ambv/black)

Dppd is a Python dplyr clone.

It allows you to write code like this:

   >>> from plotnine.data import mtcars
   >>> from dppd import dppd
   >>> dp, X = dppd()
   >>> dp(mtcars).mutate(kwh = X.hp * 0.74).groupby('cyl').filter_by(X.kwh.rank() < 2).ungroup().pd
      cyl              name   mpg   disp   hp  drat     wt   qsec  vs  am  gear  carb     kwh
   5     6           Valiant  18.1  225.0  105  2.76  3.460  20.22   1   0     3     1   77.70
   18    4       Honda Civic  30.4   75.7   52  4.93  1.615  18.52   1   1     4     2   38.48
   21    8  Dodge Challenger  15.5  318.0  150  2.76  3.520  16.87   0   0     3     2  111.00
   22    8       AMC Javelin  15.2  304.0  150  3.15  3.435  17.30   0   0     3     2  111.00

Briefly, it uses a data-manipulator instance (dp above) together with a proxied reference to the most recently created DataFrame (the X above) to achieve for pandas what dplyr's non-standard-evaluation-based verbs do for R.
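
The manipulator-plus-proxy idea can be illustrated with a small toy. This is a sketch of the general pattern only, not dppd's actual implementation: the names LatestProxy, Table, Manipulator and make_pair are hypothetical, and plain Python lists stand in for pandas columns so that the example runs with the standard library alone.

```python
class Table:
    """Toy stand-in for a DataFrame: a dict of column name -> list of values."""

    def __init__(self, columns):
        self.columns = dict(columns)

    def __getattr__(self, name):
        # Only called when normal lookup fails, so `table.hp` returns the column.
        try:
            return self.__dict__["columns"][name]
        except KeyError:
            raise AttributeError(name)


class LatestProxy:
    """Forwards attribute access to whichever table was registered last."""

    def __init__(self):
        self._target = None

    def _set_target(self, target):
        self._target = target

    def __getattr__(self, name):
        if self._target is None:
            raise AttributeError("no table has been registered yet")
        return getattr(self._target, name)


class Manipulator:
    """Hypothetical manipulator: each verb builds a new table and re-points the proxy."""

    def __init__(self, proxy):
        self._proxy = proxy
        self._table = None

    def __call__(self, table):
        self._table = table
        self._proxy._set_target(table)
        return self

    def mutate(self, **new_columns):
        # Copy the existing columns, add the new ones, and register the result.
        return self(Table({**self._table.columns, **new_columns}))

    @property
    def result(self):
        return self._table


def make_pair():
    """Return a (manipulator, proxy) pair that share state, like `dp, X` above."""
    proxy = LatestProxy()
    return Manipulator(proxy), proxy


dp, X = make_pair()
cars = Table({"name": ["Valiant", "Honda Civic"], "hp": [105, 52]})

# In a method chain, each call's arguments are evaluated only after the
# previous call has returned, so X already points at the newest table.
out = (
    dp(cars)
    .mutate(kw=[hp * 0.74 for hp in X.hp])
    .mutate(high=[kw > 50 for kw in X.kw])
    .result
)
print(out.columns["high"])
```

The second mutate can read X.kw even though the original table has no such column, because the first mutate re-pointed the proxy before the second call's arguments were evaluated. That ordering is what lets a chained expression refer to columns created earlier in the same chain.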

Please see our full documentation at https://dppd.readthedocs.io/en/latest/ for more details and a list of the supported verbs.

Also check out dppd_plotnine.
