A package to upload Pandas DataFrames to Redshift
df_to_rs
df_to_rs is a Python package that provides convenient methods to upload and upsert Pandas DataFrames to Amazon Redshift.
Installation
Install the package using pip:
pip install df_to_rs
Usage
Initialization
from df_to_rs import df_to_rs
import psycopg2
# Replace the placeholder values below with your own settings.
uploader = df_to_rs(
    region_name='ap-south-1',
    s3_bucket='your-s3-bucket',
    aws_access_key_id='your-access-key-id',
    aws_secret_access_key='your-secret-access-key',
    # Redshift listens on port 5439 by default; use your cluster's port if it differs.
    redshift_c=psycopg2.connect(
        dbname='more',
        host='hostname.ap-south-1.redshift.amazonaws.com',
        port=5439,
        user='username',
        password='password',
    ),
)
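Hard-coding credentials as in the snippet above is fine for a quick test, but they are usually better kept out of source files. The following is a minimal sketch of one way to do that, not part of df_to_rs itself. It assumes the settings live in environment variables; the helper name redshift_settings_from_env and the variable name DF_TO_RS_S3_BUCKET are hypothetical, while the keyword names match the constructor arguments shown above.

```python
import os


def redshift_settings_from_env(env=None):
    """Collect the AWS-related df_to_rs constructor arguments from the environment.

    Hypothetical helper for illustration; df_to_rs does not ship it.
    """
    env = os.environ if env is None else env
    # Constructor argument -> environment variable that holds its value.
    # DF_TO_RS_S3_BUCKET is an assumed name, not something the package defines.
    sources = {
        "region_name": "AWS_DEFAULT_REGION",
        "s3_bucket": "DF_TO_RS_S3_BUCKET",
        "aws_access_key_id": "AWS_ACCESS_KEY_ID",
        "aws_secret_access_key": "AWS_SECRET_ACCESS_KEY",
    }
    missing = [var for var in sources.values() if var not in env]
    if missing:
        raise KeyError("missing environment variables: " + ", ".join(missing))
    return {arg: env[var] for arg, var in sources.items()}
```

The returned dictionary can then be unpacked into the constructor, for example df_to_rs(redshift_c=conn, **redshift_settings_from_env()), where conn is a psycopg2 connection opened as shown above.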
Upload DataFrame to Redshift
uploader.upload_to_redshift(df, dest='analytics.ship_pen')
Upsert DataFrame to Redshift
Upsert (insert or update) the DataFrame into the destination table in Redshift. Rows already in the table whose values in upsert_columns match a row in the DataFrame are deleted, and then the DataFrame rows are inserted.
uploader.upsert_to_redshift(df, dest_table='analytics.ship_pen', upsert_columns=['id', 'name'])
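To make the delete-then-insert behaviour concrete, here is a minimal sketch of the SQL such an upsert could issue. It illustrates the general pattern and is not the package's actual implementation. It assumes the DataFrame has already been loaded into a staging table; the function name build_upsert_sql and the staging table name are hypothetical.

```python
def build_upsert_sql(dest_table, staging_table, upsert_columns):
    """Return (delete_sql, insert_sql) for a delete-then-insert upsert.

    Illustrative only: identifiers are interpolated without quoting, so this
    sketch should only be used with trusted table and column names.
    """
    if not upsert_columns:
        raise ValueError("upsert_columns must not be empty")
    # A destination row matches when every upsert column equals the staging row's value.
    match = " AND ".join(
        f"{dest_table}.{col} = {staging_table}.{col}" for col in upsert_columns
    )
    # Step 1: delete destination rows that have a matching staging row.
    delete_sql = f"DELETE FROM {dest_table} USING {staging_table} WHERE {match};"
    # Step 2: insert every staging row into the destination table.
    insert_sql = f"INSERT INTO {dest_table} SELECT * FROM {staging_table};"
    return delete_sql, insert_sql


delete_sql, insert_sql = build_upsert_sql(
    "analytics.ship_pen", "ship_pen_staging", ["id", "name"]
)
print(delete_sql)
print(insert_sql)
```

Running both statements inside a single transaction keeps readers from ever seeing the table with the old rows deleted but the new rows not yet inserted.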
License
This project is licensed under the MIT License - see the LICENSE.txt file for details.