Read and write dataframes anywhere.
Project description
dataframe-io
Read and write dataframes from and to any storage.
- Documentation: https://chr1st1ank.github.io/dataframe-io/
- License: Apache-2.0
- Status: Initial development
Features
Dataframes types supported:
- pandas DataFrame
- Python dictionary
Supported storage backends:
- Parquet files
- PostgreSQL database
More backends will come. Open an issue if you are interested in a particular backend.
Implementation status for reading data:
Storage | Select columns | Filter rows | Max rows | Sampling | Drop duplicates |
---|---|---|---|---|---|
Parquet files | ✔️ | ✔️ | ✔️ | ✔️ | ✔ ¹ |
PostgreSQL | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ |
¹ only for pandas DataFrames
Implementation status for writing data:
Storage | write append | write replace |
---|---|---|
Parquet files | ✔️ | ✔️ |
PostgreSQL | ✔️ | ✔️ |
Installation
pip install dframeio
# Including pyarrow to read/write parquet files:
pip install dframeio[parquet]
# Including PostgreSQL support:
pip install dframeio[postgres]
Show installed backends:
>>> import dframeio
>>> dframeio.backends
[<class 'dframeio.parquet.ParquetBackend'>]
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
dframeio-0.3.0.tar.gz
(932.5 kB
view hashes)
Built Distribution
dframeio-0.3.0-py3-none-any.whl
(16.8 kB
view hashes)