Sailfish provides helpful utilities and pipelines to optimize Pandas dataframes. This Sailfish project replaces pd-helper 1.0.0, which is deprecated, with sailfish 1.0.0.
Project description
sailfish
Sailfish provides data engineering and management utilities that apply common optimizations to pandas DataFrames.
Sailfish primarily works by identifying the optimal data type for each column. For example, an integer-only column may fit in int32 rather than int64, which roughly halves the memory that column uses. On big datasets, these kinds of optimizations can yield substantial savings.
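The idea can be illustrated with plain pandas. The sketch below does not use the sailfish API, and it is not claimed to be how sailfish works internally: `downcast_integers` is a hypothetical helper written for this example, and the sample column names are made up. It relies only on `pd.to_numeric(..., downcast="integer")`, which picks the smallest signed integer type that can hold every value in the column.

```python
import numpy as np
import pandas as pd
from pandas.api.types import is_integer_dtype


def downcast_integers(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of df with each integer column stored in the
    smallest signed integer dtype that holds all of its values.

    Hypothetical helper for illustration; not part of sailfish.
    """
    out = df.copy()
    for col in out.columns:
        if is_integer_dtype(out[col]):
            out[col] = pd.to_numeric(out[col], downcast="integer")
    return out


# Made-up sample data: two int64 columns whose values fit in int32
# but are too large for int16.
df = pd.DataFrame(
    {
        "user_id": np.arange(100_000, dtype="int64"),
        "clicks": np.full(100_000, 70_000, dtype="int64"),
    }
)

small = downcast_integers(df)

# Compare the memory used by the column data (index excluded).
before = df.memory_usage(index=False, deep=True).sum()
after = small.memory_usage(index=False, deep=True).sum()
print(small.dtypes)
print(f"before: {before} bytes, after: {after} bytes")
```

Here both columns end up as int32, so the column data takes half the space. Columns with smaller values would shrink further (to int16 or int8), while columns that genuinely need the int64 range are left unchanged.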
This project replaces the pd-helper project as of pd-helper 1.0.1; sailfish is basically pd-helper 2.0.0.
Install
pip install sailfish