A simple function that parallelizes row-wise mapping over a Pandas DataFrame
pd_multiprocessing
pd_multiprocessing provides a simple, parallelized function that applies a user-defined function row-wise to a Pandas DataFrame.
Usage
A typical usage looks like this:
import pandas as pd
from pd_multiprocessing.map import df_map

def twotimes(row):
    row['col2'] = row['col1']*2
    return row

if __name__ == '__main__':
    df = pd.DataFrame.from_dict({'col1': range(100)})
    print(df_map(twotimes, df))
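For readers curious how a parallel row-wise map of this kind can be built, the following is a minimal sketch and not the package's actual implementation. It assumes the general approach of splitting the DataFrame into contiguous row chunks, applying the function to each chunk in a separate worker, and concatenating the results. The names `chunked_row_map`, `_apply_rows`, `n_workers`, and `executor_cls` are hypothetical and are not part of pd_multiprocessing.

```python
from concurrent.futures import ProcessPoolExecutor
from functools import partial

import pandas as pd


def _apply_rows(func, chunk):
    # Runs inside a worker: apply func to every row of one chunk.
    return chunk.apply(func, axis=1)


def chunked_row_map(func, df, n_workers=2, executor_cls=ProcessPoolExecutor):
    """Hypothetical helper (not the pd_multiprocessing API).

    Splits df into contiguous row chunks, applies func row-wise to each
    chunk in a pool of workers, and concatenates the results in order.
    """
    if df.empty:
        return df.copy()
    # Never create more chunks than there are rows.
    n_chunks = max(1, min(n_workers, len(df)))
    size = -(-len(df) // n_chunks)  # ceiling division
    # iloc slicing keeps the original index labels on every chunk.
    chunks = [df.iloc[i:i + size] for i in range(0, len(df), size)]
    with executor_cls(max_workers=n_chunks) as pool:
        # pool.map preserves chunk order, so concat restores the row order.
        results = list(pool.map(partial(_apply_rows, func), chunks))
    return pd.concat(results)
```

With the default process pool, `func` must be defined at module level so it can be pickled, and the call should sit under an `if __name__ == '__main__':` guard, as in the usage example above. For small DataFrames the cost of starting workers and copying chunks can outweigh any speedup.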