Persistent Pandas DataFrame storage and retrieval using a SQL database, HDF5, CSV files, or pickle files.
Project description
TrashPandas: Persistent Pandas DataFrame Storage and Retrieval
What is it?
TrashPandas is a Python package that provides persistent Pandas DataFrame storage and retrieval using a SQL database, CSV files, HDF5, or pickle files.
Main Features
Here are just a few of the things that TrashPandas does well:
- Store Pandas DataFrames in your choice of format. (SQL, CSV, HDF5, pickle)
- Retrieve the Pandas DataFrame with the same same indexes and data types you stored.
- Transfer your DataFrames between storage formats.
Where to get it
The source code is currently hosted on GitHub at: https://github.com/eddiethedean/trashpandas
# PyPI
pip install trashpandas
Dependencies
- pandas - a Python package that provides fast, flexible, and expressive data structures designed to make working with "relational" or "labeled" data both easy and intuitive.
- sqlalchemy - Python SQL toolkit and Object Relational Mapper that gives application developers the full power and flexibility of SQL
- h5py - a Python package with a Pythonic interface to the HDF5 binary data format.]
Example
import pandas as pd
import sqlalchemy as sa
import trashpandas as tp
df = pd.DataFrame({'name': ['Joe', 'Bob', 'John'], 'age': [23, 34, 44]})
# Create SqlStorage object with sqlite database connection string.
storage = tp.SqlStorage('sqlite:///test.db')
# or create an engine with SQLAlchemy and pass it into SqlStorage
engine = sa.create_engine('sqlite:///test.db')
storage = tp.SqlStorage(engine)
# Store DataFrame in database as table named 'people'
# and store metadata as table named '_people_metadata'
storage.store(df, 'people')
# or assign DataFrame to item
storage['people'] = df
# Retrieve DataFrame from SqlStorage object.
df = storage.load('people')
# or use table name key
df = storage['people']
# Delete stored sql table using SqlStorage delete method.
storage.delete('people')
# or use del on table name key
del storage['people']
# Or use functions instead of SqlStorage class
tp.store_df_sql(df, 'people', engine)
df = tp.load_df_sql('people', engine)
tp.delete_table_sql('people', engine)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
trashpandas-0.0.7.tar.gz
(8.0 kB
view hashes)
Built Distribution
Close
Hashes for trashpandas-0.0.7-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5c3239cebe9f71c7c454c4572174f4ebfd5127f3a61368ca858cb996db15e17e |
|
MD5 | 891f757c899589f8cd5378a1892c197d |
|
BLAKE2b-256 | db1e6f9642344482075b6910560a6429416d4f1a1ae5e596885ea46dda2a8c08 |