Skip to main content

pd_multiprocessing provides a simple, parallelized function to apply a user defined function rowwise on a Pandas Dataframe.

Project description

Build Status Coverage Status Documentation Status

pd_multiprocessing

pd_multiprocessing provides a simple, parallelized function to apply a user defined function rowwise on a Pandas Dataframe.

Requirements

Documentation

If you want to build the documentation, you need the following packages:

  • Sphinx
  • sphinx_rtd_theme
  • m2r

Installation

You can easily install pd_multiprocessing via

pip install pd-multiprocessing

Usage

A typical usage looks like this

import pandas as pd
from pd_multiprocessing.map import df_map


def twotimes(row):
    row['col2'] = row['col1']*2
    return row


if __name__ == '__main__':
    df = pd.DataFrame.from_dict({'col1': range(100)})
    print(df_map(twotimes, df))

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pd_multiprocessing-1.0.4.tar.gz (4.9 kB view hashes)

Uploaded source

Built Distributions

pd_multiprocessing-1.0.4-py3.6.egg (3.8 kB view hashes)

Uploaded 3 6

pd_multiprocessing-1.0.4-py3-none-any.whl (4.8 kB view hashes)

Uploaded py3

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page