Persistent Pandas DataFrame storage and retrieval using a SQL database, CSV files, HDF5, or pickle files.
Project description
TrashPandas: Persistent Pandas DataFrame Storage and Retrieval
What is it?
TrashPandas is a Python package that provides persistent Pandas DataFrame storage and retrieval using a SQL database, CSV files, HDF5, or pickle files.
Main Features
Here are just a few of the things that TrashPandas does well:
- Store Pandas DataFrames in your choice of format. (SQL, CSV, HDF5, pickle)
- Retrieve the Pandas DataFrame in the same format you stored.
- Transfer your DataFrames between storage formats.
Where to get it
The source code is currently hosted on GitHub at: https://github.com/eddiethedean/trashpandas
# PyPI
pip install trashpandas
Dependencies
- pandas - a Python package that provides fast, flexible, and expressive data structures designed to make working with "relational" or "labeled" data both easy and intuitive.
- sqlalchemy - Python SQL toolkit and Object Relational Mapper that gives application developers the full power and flexibility of SQL
- h5py - a Python package with a Pythonic interface to the HDF5 binary data format.]
Example
import pandas as pd
import sqlalchemy as sa
import trashpandas as tp
df = pd.DataFrame({'name': ['Joe', 'Bob', 'John'], 'age': [23, 34, 44]})
# Create SqlStorage object with sqlite database connection string.
storage = tp.SqlStorage('sqlite:///test.db')
# or create an engine with SQLAlchemy and pass it into SqlStorage
engine = sa.create_engine('sqlite:///test.db')
storage = tp.SqlStorage(engine)
# Store DataFrame in database as table named 'people'
# and store metadata as table named '_people_metadata'
storage.store(df, 'people')
# or assign DataFrame to item
storage['people'] = df
# Retrieve DataFrame from SqlStorage object.
df = storage.load('people')
# or use table name key
df = storage['people']
# Delete stored sql table using SqlStorage delete method.
storage.delete('people')
# or use del on table name key
del storage['people']
# Or use functions instead of SqlStorage class
tp.store_df_sql(df, 'people', engine)
df = tp.load_df_sql('people', engine)
tp.delete_table_sql('people', engine)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
trashpandas-0.0.4.tar.gz
(7.4 kB
view hashes)
Built Distribution
Close
Hashes for trashpandas-0.0.4-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9c05bac8eb9d21ec9d5e41ba242dc1df276f694d0c8070dbe719907020cff0a4 |
|
MD5 | f939ed771d708f7090578da6c02d70fe |
|
BLAKE2b-256 | ea352610d239683d3544a130ca0eb9311a918776650cd19a34ee9878a185dba9 |