Helpers for doing IO with Pandas DataFrames
df_io
Python helpers for doing IO with Pandas DataFrames
Available methods
read_df
This method supports:
- bzip2/gzip/zstandard compression
- passing parameters to Pandas' readers
- reading from anything that smart_open supports (local files, AWS S3, etc.)
- most of the formats that Pandas supports
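To make the compression support listed above concrete, here is a minimal standard-library sketch of how a reader could pick a decompressor from the file suffix. It is not df_io's implementation: `open_by_suffix` and `read_rows` are hypothetical helpers, plain dicts stand in for a DataFrame, and zstandard is left out because it needs a third-party package.

```python
import bz2
import csv
import gzip
import os
import tempfile

# Hypothetical mapping (not part of df_io): file suffix -> opener function.
OPENERS = {".gz": gzip.open, ".bz2": bz2.open}


def open_by_suffix(path, mode="rt"):
    """Open `path` in text mode, (de)compressing when the suffix is known."""
    suffix = os.path.splitext(path)[1]
    opener = OPENERS.get(suffix, open)  # fall back to the builtin open()
    return opener(path, mode, encoding="utf-8", newline="")


def read_rows(path):
    """Read a (possibly compressed) CSV file into a list of dicts."""
    with open_by_suffix(path) as handle:
        return list(csv.DictReader(handle))


# Round trip through a temporary gzip-compressed CSV file.
tmp_dir = tempfile.mkdtemp()
sample_path = os.path.join(tmp_dir, "mydata.csv.gz")
with open_by_suffix(sample_path, "wt") as handle:
    writer = csv.DictWriter(handle, fieldnames=["id", "name"])
    writer.writeheader()
    writer.writerow({"id": 1, "name": "a"})

rows = read_rows(sample_path)
```

Note that the csv module returns every value as a string, whereas a Pandas reader would infer column types.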
write_df
This method supports:
- streaming writes
- chunked writes
- bzip2/gzip/zstandard compression
- passing parameters to Pandas' writers
- writing to anything that smart_open supports (local files, AWS S3, etc.)
- most of the formats that Pandas supports
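As an illustration of what a streaming, compressed write means in general, the sketch below pushes CSV rows one at a time through a gzip stream, so the full output never has to be held in memory. It uses only the standard library and is an assumption about the general technique, not a description of how df_io does it; `stream_csv` and `generate_rows` are hypothetical names.

```python
import csv
import gzip
import io


def stream_csv(rows, fileobj, header):
    """Write `rows` one by one as gzip-compressed CSV into `fileobj`."""
    count = 0
    # gzip.open accepts an existing binary file object as well as a path.
    with gzip.open(fileobj, "wt", encoding="utf-8", newline="") as text:
        writer = csv.writer(text)
        writer.writerow(header)
        for row in rows:
            writer.writerow(row)
            count += 1
    return count


def generate_rows(n):
    """Yield rows lazily so that only one row is in memory at a time."""
    for i in range(n):
        yield (i, i * i)


buffer = io.BytesIO()  # stands in for a local file or a remote object
written = stream_csv(generate_rows(3), buffer, header=("id", "value"))
decoded = gzip.decompress(buffer.getvalue()).decode("utf-8").splitlines()
```

The same pattern works with any writable binary file object, which is the kind of handle smart_open returns for remote paths.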
Documentation
Examples
Write a Pandas DataFrame (df) to an S3 path in CSV format (the default):
import df_io
df_io.write_df(df, 's3://bucket/dir/mydata.csv')
The same with gzip compression:
df_io.write_df(df, 's3://bucket/dir/mydata.csv.gz')
With zstandard compression using pickle:
df_io.write_df(df, 's3://bucket/dir/mydata.pickle.zstd', fmt='pickle')
Using JSON lines:
df_io.write_df(df, 's3://bucket/dir/mydata.json.gz', fmt='json')
Passing writer parameters:
df_io.write_df(df, 's3://bucket/dir/mydata.json.gz', fmt='json', writer_options={'lines': False})
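The two JSON examples above differ in layout: JSON lines puts one object per line, while `'lines': False` asks the Pandas writer for a single JSON document. The sketch below shows the two layouts with the standard `json` module only. The records are made up for illustration, and the exact shape Pandas produces with `lines=False` also depends on its `orient` setting, so treat the array form here as one possible outcome.

```python
import json

# Made-up records standing in for DataFrame rows.
records = [{"id": 1, "name": "a"}, {"id": 2, "name": "b"}]

# JSON lines: one self-contained JSON object per line.
json_lines = "\n".join(json.dumps(record) for record in records) + "\n"

# Single document: the whole list serialized as one JSON array.
json_array = json.dumps(records)

# JSON lines can be parsed line by line; the array must be parsed as a whole.
parsed_lines = [json.loads(line) for line in json_lines.splitlines()]
parsed_array = json.loads(json_array)
```

Because each line is independent, the JSON lines layout suits streaming and chunked processing better than a single array.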
Chunked write (splits the df into equally sized parts and writes a separate output for each part):
df_io.write_df(df, 's3://bucket/dir/mydata.json.gz', fmt='json', chunksize=10000)
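The splitting step behind a chunked write can be sketched in plain Python as follows. This is an illustration under assumptions, not df_io's code: a list stands in for the DataFrame rows, and the `.partNNNN` naming in `chunk_path` is purely hypothetical, since the output names df_io actually generates are not described here.

```python
def split_into_chunks(rows, chunksize):
    """Split `rows` into consecutive parts of at most `chunksize` items."""
    if chunksize <= 0:
        raise ValueError("chunksize must be positive")
    return [rows[i:i + chunksize] for i in range(0, len(rows), chunksize)]


def chunk_path(path, index):
    """Hypothetical naming scheme for per-chunk outputs."""
    return f"{path}.part{index:04d}"


rows = list(range(25))  # stand-in for 25 DataFrame rows
chunks = split_into_chunks(rows, chunksize=10)
paths = [chunk_path("mydata.json.gz", i) for i in range(len(chunks))]
sizes = [len(chunk) for chunk in chunks]
```

When the row count is not a multiple of the chunk size, the last part is smaller than the others, as with the 10, 10 and 5 rows in this example.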
Download files
File details
Details for the file df_io-0.0.12.tar.gz.
File metadata
- Download URL: df_io-0.0.12.tar.gz
- Upload date:
- Size: 6.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.0 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 6009b8f03a6bb35e57ce25033957b0ec6b8d9438977cb07e46de2256669d28d1 |
| MD5 | 6828520dc83f70930c9425e28767fd6a |
| BLAKE2b-256 | 3eda8248de8f141611e199599bb3e389d9e99d5d1e866c5bd8d1731a9bf70173 |
File details
Details for the file df_io-0.0.12-py2.py3-none-any.whl.
File metadata
- Download URL: df_io-0.0.12-py2.py3-none-any.whl
- Upload date:
- Size: 7.3 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.0 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 2195bc9d1606a763c64c58fe4b48a3a06d9a526e0c25932ba2eb00a73fb1eebe |
| MD5 | 379757bda3889550571b475a68eaddea |
| BLAKE2b-256 | 1798300958e6c7c1cbac4997437105a4cd889926dfdcf29af3161ac95a458ba2 |