Skip to main content

pd_multiprocessing provides a simple, parallelized function to apply a user defined function rowwise on a Pandas Dataframe.

Project description

Build Status Coverage Status Documentation Status

pd_multiprocessing

pd_multiprocessing provides a simple, parallelized function to apply a user defined function rowwise on a Pandas Dataframe.

Requirements

Documentation

If you want to build the documentation, you need the following packages:

  • Sphinx
  • sphinx_rtd_theme
  • m2r

Installation

You can easily install pd_multiprocessing via

pip install pd-multiprocessing

Usage

A typical usage looks like this

import pandas as pd
from pd_multiprocessing.map import df_map


def twotimes(row):
    row['col2'] = row['col1']*2
    return row


if __name__ == '__main__':
    df = pd.DataFrame.from_dict({'col1': range(100)})
    print(df_map(twotimes, df))

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pd_multiprocessing-1.0.4.tar.gz (4.9 kB view hashes)

Uploaded Source

Built Distributions

pd_multiprocessing-1.0.4-py3.6.egg (3.8 kB view hashes)

Uploaded Source

pd_multiprocessing-1.0.4-py3-none-any.whl (4.8 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page