Persistent Pandas DataFrame storage and retrieval using a SQL database, CSV files, HDF5, or pickle files.
Project description
TrashPandas: Persistent Pandas DataFrame Storage and Retrieval
What is it?
TrashPandas is a Python package that provides persistent Pandas DataFrame storage and retrieval using a SQL database, CSV files, HDF5, or pickle files.
Main Features
Here are just a few of the things that TrashPandas does well:
- Store Pandas DataFrames in your choice of format. (SQL, CSV, HDF5, pickle)
- Retrieve the Pandas DataFrame with the same same indexes and data types you stored.
- Transfer your DataFrames between storage formats.
Where to get it
The source code is currently hosted on GitHub at: https://github.com/eddiethedean/trashpandas
# PyPI
pip install trashpandas
Dependencies
- pandas - a Python package that provides fast, flexible, and expressive data structures designed to make working with "relational" or "labeled" data both easy and intuitive.
- sqlalchemy - Python SQL toolkit and Object Relational Mapper that gives application developers the full power and flexibility of SQL
- h5py - a Python package with a Pythonic interface to the HDF5 binary data format.]
Example
import pandas as pd
import sqlalchemy as sa
import trashpandas as tp
df = pd.DataFrame({'name': ['Joe', 'Bob', 'John'], 'age': [23, 34, 44]})
# Create SqlStorage object with sqlite database connection string.
storage = tp.SqlStorage('sqlite:///test.db')
# or create an engine with SQLAlchemy and pass it into SqlStorage
engine = sa.create_engine('sqlite:///test.db')
storage = tp.SqlStorage(engine)
# Store DataFrame in database as table named 'people'
# and store metadata as table named '_people_metadata'
storage.store(df, 'people')
# or assign DataFrame to item
storage['people'] = df
# Retrieve DataFrame from SqlStorage object.
df = storage.load('people')
# or use table name key
df = storage['people']
# Delete stored sql table using SqlStorage delete method.
storage.delete('people')
# or use del on table name key
del storage['people']
# Or use functions instead of SqlStorage class
tp.store_df_sql(df, 'people', engine)
df = tp.load_df_sql('people', engine)
tp.delete_table_sql('people', engine)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
trashpandas-0.0.5.tar.gz
(7.4 kB
view hashes)
Built Distribution
Close
Hashes for trashpandas-0.0.5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 21bd540de4a8daef416afffcce887e6ed2936282f68f906a840ead5701e1a725 |
|
MD5 | dab23718954045261eb263c96ff41251 |
|
BLAKE2b-256 | 07d7b7a90c43924fc68671dc6d07f2a6a0d851fc236f00cc4e635bd924c6b4ae |