pd_multiprocessing provides a simple, parallelized function to apply a user defined function rowwise on a Pandas Dataframe.
Project description
pd_multiprocessing
pd_multiprocessing provides a simple, parallelized function to apply a user defined function rowwise on a Pandas Dataframe.
Requirements
Documentation
If you want to build the documentation, you need the following packages:
- Sphinx
- sphinx_rtd_theme
- m2r
Installation
You can easily install pd_multiprocessing via
pip install pd-multiprocessing
Usage
A typical usage looks like this
import pandas as pd
from pd_multiprocessing.map import df_map
def twotimes(row):
row['col2'] = row['col1']*2
return row
if __name__ == '__main__':
df = pd.DataFrame.from_dict({'col1': range(100)})
print(df_map(twotimes, df))
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
Close
Hashes for pd_multiprocessing-1.0.4-py3.6.egg
Algorithm | Hash digest | |
---|---|---|
SHA256 | c52f5f22f0d94ba873a81af326d9500c06780522765cbfe5aabe9bf92422aaed |
|
MD5 | c98aa8bd8a3c7f2608d6dbfdc8fbaab3 |
|
BLAKE2b-256 | eff0071c34e49955182e9a24a57275c48de4359c45f15fb39a722ed914c74ee3 |
Close
Hashes for pd_multiprocessing-1.0.4-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 28d0b5a763fe6c64d7b38de19fbfedb9e80bce6d45767942c34ce27919be11cb |
|
MD5 | 0bec8e497b8af05717433ae0ceadabf2 |
|
BLAKE2b-256 | b2d766035883f8b8a87efe784ac488446c9c74f3436a80b48c8f729c86aa5b10 |