A package to upload Pandas DataFrame to Redshift
Project description
df_to_rs
df_to_rs
is a Python package that provides convenient methods to upload and upsert Pandas DataFrames to Amazon Redshift.
Installation
Install the package using pip:
pip install df_to_rs
Usage
Initialization
from df_to_rs import df_to_rs
import psycopg2
uploader = df_to_rs(
region_name='ap-south-1',
s3_bucket='your-s3-bucket',
aws_access_key_id='your-access-key-id',
aws_secret_access_key='your-secret-access-key',
redshift_c=psycopg2.connect(dbname='more', host="hostname.ap-south-1.redshift.amazonaws.com", port=1433, user='username', password='password')
)
Upload DataFrame to Redshift
uploader.upload_to_redshift(df, dest='analytics.ship_pen')
Upsert DataFrame to Redshift
Upsert (insert or update) the DataFrame into a specified destination table in Redshift. Matching rows are identified by the specified columns, and existing rows are deleted before new rows are inserted.
uploader.upsert_to_redshift(df, dest_table='analytics.ship_pen', upsert_columns=['id', 'name'])
License
This project is licensed under the MIT License - see the LICENSE.txt file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
df_to_rs-0.1.16.tar.gz
(4.6 kB
view hashes)
Built Distribution
Close
Hashes for df_to_rs-0.1.16-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e2359d340a73ade81801851cbc5002948b0d18a62ac817482f1120c94988a1e1 |
|
MD5 | c15e870a6722c51a1720a81778da2d3f |
|
BLAKE2b-256 | e661ba4517847018b7290d5d7f33a051e8c525464f58a5b436d70093b9bc0dd3 |