Save and load numeric pandas data frames as numpy array and pickle their row and column names and types info for performance reasons.
Project description
Install
pip install numpickle
Usage
import pandas as pd
import numpickle as npl
# create example data frame with non-numeric and numeric columns
df = pd.DataFrame([[1, 2,'a'], [3, 4, 'b']])
df.columns = ["A", "B", "C"]
df.index = ["row1", "row2"]
df
# A B C
# row1 1 2 a
# row2 3 4 b
df.dtypes
# A int64
# B int64
# C object
# dtype: object
# save data frame as numpy array and pickle row and column names
# into helper pickle file "/home/user/test.npy.pckl"
npl.save_numpickle(df, "/home/user/test.npy")
# load the saved data
df_ = npl.load_numpickle("/home/user/test.npy")
df_
# A B C
# row1 1 2 a
# row2 3 4 b
df_.dtypes
# A int64
# B int64
# C object
# dtype: object
all(df == df_)
# True
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
numpickle-0.1.2.post5.tar.gz
(2.1 kB
view hashes)
Built Distribution
Close
Hashes for numpickle-0.1.2.post5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 46869540efbe4bfc7fe43b5a000ada1830a43fe26053909add8b88127e23733b |
|
MD5 | f5f20e5c32a8df8a5d76b786e0bf7a37 |
|
BLAKE2b-256 | e87213317be5de1d6202aedb264030bad0ed30a0150e03f799b04a3a36aafd92 |