Helpers for doing IO with Pandas DataFrames
Project description
df_io
Python helpers for doing IO with Pandas DataFrames
Available methods
read_df
- bzip2/gzip/zstandard compression
- passing parameters to Pandas' readers
- reading from anything, which
smart_open
supports (local files, AWS S3 etc) - most of the available formats, Pandas supports
write_df
This method supports:
- streaming writes
- chunked writes
- bzip2/gzip/zstandard compression
- passing parameters to Pandas' writers
- writing to anything, which
smart_open
supports (local files, AWS S3 etc) - most of the available formats, Pandas supports
Documentation
Examples
Write a Pandas DataFrame (df) to an S3 path in CSV format (the default):
import df_io
df_io.write_df(df, 's3://bucket/dir/mydata.csv')
The same with gzip compression:
df_io.write_df(df, 's3://bucket/dir/mydata.csv.gz')
With zstandard compression using pickle:
df_io.write_df(df, 's3://bucket/dir/mydata.pickle.zstd', fmt='pickle')
Using JSON lines:
df_io.write_df(df, 's3://bucket/dir/mydata.json.gz', fmt='json')
Passing writer parameters:
df_io.write_df(df, 's3://bucket/dir/mydata.json.gz', fmt='json', writer_options={'lines': False})
Chunked write (splitting the df into equally sized parts and creating/writing outputs for them):
df_io.write_df(df, 's3://bucket/dir/mydata.json.gz', fmt='json', chunksize=10000)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
df_io-0.0.12.tar.gz
(6.7 kB
view details)
Built Distribution
File details
Details for the file df_io-0.0.12.tar.gz
.
File metadata
- Download URL: df_io-0.0.12.tar.gz
- Upload date:
- Size: 6.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.0 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6009b8f03a6bb35e57ce25033957b0ec6b8d9438977cb07e46de2256669d28d1 |
|
MD5 | 6828520dc83f70930c9425e28767fd6a |
|
BLAKE2b-256 | 3eda8248de8f141611e199599bb3e389d9e99d5d1e866c5bd8d1731a9bf70173 |
File details
Details for the file df_io-0.0.12-py2.py3-none-any.whl
.
File metadata
- Download URL: df_io-0.0.12-py2.py3-none-any.whl
- Upload date:
- Size: 7.3 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.0 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2195bc9d1606a763c64c58fe4b48a3a06d9a526e0c25932ba2eb00a73fb1eebe |
|
MD5 | 379757bda3889550571b475a68eaddea |
|
BLAKE2b-256 | 1798300958e6c7c1cbac4997437105a4cd889926dfdcf29af3161ac95a458ba2 |