AWS Data Wrangler

Pandas on AWS

Source   Downloads         Page   Installation Command
PyPI     PyPI Downloads    Link   pip install awswrangler
Conda    Conda Downloads   Link   conda install -c conda-forge awswrangler

Quick Start

Install the Wrangler with: pip install awswrangler

import awswrangler as wr
import pandas as pd

df = pd.DataFrame({"id": [1, 2], "value": ["foo", "boo"]})

# Storing the data in the data lake (Parquet dataset on S3, registered as a Glue Catalog table)
wr.s3.to_parquet(
    df=df,
    path="s3://bucket/dataset/",
    dataset=True,
    database="my_db",
    table="my_table"
)

# Retrieving the data directly from Amazon S3
df = wr.s3.read_parquet("s3://bucket/dataset/", dataset=True)

# Retrieving the data from Amazon Athena
df = wr.athena.read_sql_query("SELECT * FROM my_table", database="my_db")

# Getting a Redshift connection (SQLAlchemy engine) from the Glue Catalog and retrieving data from Redshift Spectrum
engine = wr.catalog.get_engine("my-redshift-connection")
df = wr.db.read_sql_query("SELECT * FROM external_schema.my_table", con=engine)

# Creating QuickSight Data Source and Dataset to reflect our new table
wr.quicksight.create_athena_data_source("athena-source", allowed_to_manage=["username"])
wr.quicksight.create_athena_dataset(
    name="my-dataset",
    database="my_db",
    table="my_table",
    data_source_name="athena-source",
    allowed_to_manage=["username"]
)

# Getting a MySQL connection (SQLAlchemy engine) from the Glue Catalog and loading the data into MySQL
engine = wr.catalog.get_engine("my-mysql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")

# Getting a PostgreSQL connection (SQLAlchemy engine) from the Glue Catalog and loading the data into PostgreSQL
engine = wr.catalog.get_engine("my-postgresql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")
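
The S3 write and read calls above need AWS credentials and a real bucket, which makes them awkward to exercise in a unit test. One possible approach, sketched below, is to pass the Wrangler module in as a parameter so a stand-in can be substituted. The names `round_trip` and `make_fake_wrangler` are hypothetical helpers, not part of awswrangler, and the fake only mimics the two calls used in the Quick Start.

```python
from types import SimpleNamespace


def round_trip(wr, df, path, database, table):
    """Write df as a Parquet dataset registered in the catalog, then read it back."""
    wr.s3.to_parquet(df=df, path=path, dataset=True, database=database, table=table)
    return wr.s3.read_parquet(path, dataset=True)


def make_fake_wrangler():
    """Hypothetical in-memory stand-in exposing only s3.to_parquet and s3.read_parquet."""
    store = {}

    def to_parquet(df, path, dataset, database, table):
        # Remember the data under its path instead of writing to S3.
        store[path] = df

    def read_parquet(path, dataset):
        # Return whatever was stored for this path.
        return store[path]

    return SimpleNamespace(
        s3=SimpleNamespace(to_parquet=to_parquet, read_parquet=read_parquet)
    )
```

In real use the first argument would be the imported module, for example `round_trip(wr, df, "s3://bucket/dataset/", "my_db", "my_table")`. In a test, `make_fake_wrangler()` takes its place and no AWS call is made.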

Read The Docs

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

awswrangler-1.6.1.tar.gz (74.8 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.
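
As a rough illustration of the wheel file name format, the sketch below splits a name into the fields of the wheel naming convention: `{distribution}-{version}(-{build tag})?-{python tag}-{abi tag}-{platform tag}.whl`. The `WheelName` type and `parse_wheel_filename` function are hypothetical helpers written for this example. The sketch assumes a well-formed name and does no further validation.

```python
from typing import NamedTuple, Optional


class WheelName(NamedTuple):
    distribution: str
    version: str
    build: Optional[str]
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_filename(filename: str) -> WheelName:
    """Split a wheel file name into its dash-separated components."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        # No optional build tag present.
        distribution, version, python_tag, abi_tag, platform_tag = parts
        build = None
    elif len(parts) == 6:
        distribution, version, build, python_tag, abi_tag, platform_tag = parts
    else:
        raise ValueError(f"unexpected number of fields in {filename!r}")
    return WheelName(distribution, version, build, python_tag, abi_tag, platform_tag)
```

For the wheel listed on this page, `parse_wheel_filename("awswrangler-1.6.1-py3-none-any.whl")` yields distribution `awswrangler`, version `1.6.1`, Python tag `py3`, ABI tag `none`, and platform tag `any`, meaning a pure-Python wheel for any Python 3 interpreter on any platform.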

awswrangler-1.6.1-py3.6.egg (198.7 kB view details)

Uploaded Egg

awswrangler-1.6.1-py3-none-any.whl (90.2 kB view details)

Uploaded Python 3

File details

Details for the file awswrangler-1.6.1.tar.gz.

File metadata

  • Download URL: awswrangler-1.6.1.tar.gz
  • Upload date:
  • Size: 74.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.24.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.6.10

File hashes

Hashes for awswrangler-1.6.1.tar.gz
Algorithm     Hash digest
SHA256        c8a8883a7c935ee2704ffefb80904bb813d438449e0b4d4ea1b8a302621aecbc
MD5           03484a0dad490d709668151127c9c699
BLAKE2b-256   daf21d74805f8d9fb8d08ccffd4858bd6640f6bbcf8bfa8f041fe5c18adf5550

See more details on using hashes here.
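
A downloaded file can be checked against a listed digest with the standard library alone. The following is a minimal sketch: `sha256_of_file` and `verify_sha256` are hypothetical helper names, and the local file path is an assumption about where the download was saved.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops when read() returns empty bytes at EOF.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_sha256(path, expected_hex):
    """Compare the file's digest with the expected one, ignoring case and whitespace."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, `verify_sha256("awswrangler-1.6.1.tar.gz", "c8a8883a7c935ee2704ffefb80904bb813d438449e0b4d4ea1b8a302621aecbc")` should return `True` only if the local copy matches the SHA256 digest listed for the source distribution.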

File details

Details for the file awswrangler-1.6.1-py3.6.egg.

File metadata

  • Download URL: awswrangler-1.6.1-py3.6.egg
  • Upload date:
  • Size: 198.7 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.24.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.6.10

File hashes

Hashes for awswrangler-1.6.1-py3.6.egg
Algorithm     Hash digest
SHA256        a76c2db117df8dab606ec9f73deb817c161e2b3b23904b7ee0bdd83550911c85
MD5           742f67a9d03d8d19859bb676e4c39a24
BLAKE2b-256   8b60bba3dd0a4737fa49901c3dd2fd0cc797614fe400c97bbdd006c7a762fb57

File details

Details for the file awswrangler-1.6.1-py3-none-any.whl.

File metadata

  • Download URL: awswrangler-1.6.1-py3-none-any.whl
  • Upload date:
  • Size: 90.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.24.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.6.10

File hashes

Hashes for awswrangler-1.6.1-py3-none-any.whl
Algorithm     Hash digest
SHA256        020259751425ca41f94824afa5e180b07e5d62389765009cf542c411e4987998
MD5           48063c62da144e1742ab47886d418af0
BLAKE2b-256   78b2fec014441579b3e8de128a77014e684b7063f7b256fdef94cd46ff8dd6ca
