Skip to main content

Pandas on AWS.

Project description

AWS Data Wrangler

Pandas on AWS

AWS Data Wrangler

Release Python Version Code style: black License

Checked with mypy Coverage Static Checking Documentation Status

Source Downloads Page Installation Command
PyPi PyPI Downloads Link pip install awswrangler
Conda Conda Downloads Link conda install -c conda-forge awswrangler

Quick Start

Install the Wrangler with: pip install awswrangler

import awswrangler as wr
import pandas as pd

df = pd.DataFrame({"id": [1, 2], "value": ["foo", "boo"]})

# Storing data on Data Lake
wr.s3.to_parquet(
    df=df,
    path="s3://bucket/dataset/",
    dataset=True,
    database="my_db",
    table="my_table"
)

# Retrieving the data directly from Amazon S3
df = wr.s3.read_parquet("s3://bucket/dataset/", dataset=True)

# Retrieving the data from Amazon Athena
df = wr.athena.read_sql_query("SELECT * FROM my_table", database="my_db")

# Get Redshift connection (SQLAlchemy) from Glue and retrieving data from Redshift Spectrum
engine = wr.catalog.get_engine("my-redshift-connection")
df = wr.db.read_sql_query("SELECT * FROM external_schema.my_table", con=engine)

# Creating QuickSight Data Source and Dataset to reflect our new table
wr.quicksight.create_athena_data_source("athena-source", allowed_to_manage=["username"])
wr.quicksight.create_athena_dataset(
    name="my-dataset",
    database="my_db",
    table="my_table",
    data_source_name="athena-source",
    allowed_to_manage=["username"]
)

# Get MySQL connection (SQLAlchemy) from Glue Catalog and LOAD the data into MySQL
engine = wr.catalog.get_engine("my-mysql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")

# Get PostgreSQL connection (SQLAlchemy) from Glue Catalog and LOAD the data into PostgreSQL
engine = wr.catalog.get_engine("my-postgresql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")

Read The Docs

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

awswrangler-1.5.0.tar.gz (70.8 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

awswrangler-1.5.0-py3.6.egg (188.0 kB view details)

Uploaded Egg

awswrangler-1.5.0-py3-none-any.whl (85.2 kB view details)

Uploaded Python 3

File details

Details for the file awswrangler-1.5.0.tar.gz.

File metadata

  • Download URL: awswrangler-1.5.0.tar.gz
  • Upload date:
  • Size: 70.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.6.10

File hashes

Hashes for awswrangler-1.5.0.tar.gz
Algorithm Hash digest
SHA256 8ee64ca3bbebe0c42b59d51b9367d49717a8230bf08b075be605347016d43df8
MD5 41742a45082876d9fa7c8c9fcb60de40
BLAKE2b-256 139689aa083b41c9b55d036b9b73bbc5b73ca2da74e898b0dd478aef63351b14

See more details on using hashes here.

File details

Details for the file awswrangler-1.5.0-py3.6.egg.

File metadata

  • Download URL: awswrangler-1.5.0-py3.6.egg
  • Upload date:
  • Size: 188.0 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.6.10

File hashes

Hashes for awswrangler-1.5.0-py3.6.egg
Algorithm Hash digest
SHA256 4a51d45601855a87c86f9912adec6ccb674d96a7cf76ad6fd1b1749aebcbbcc4
MD5 507540f99eeb57e94c2212f2d219abee
BLAKE2b-256 3719eb3ab1832a3ecd9944e718e740a4e832217547e82bf8d9e5d42e28f99a10

See more details on using hashes here.

File details

Details for the file awswrangler-1.5.0-py3-none-any.whl.

File metadata

  • Download URL: awswrangler-1.5.0-py3-none-any.whl
  • Upload date:
  • Size: 85.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.6.10

File hashes

Hashes for awswrangler-1.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e7bb1d84793dc0cdeeb4f15d2ff303f79b8919ee80b2087b4aa1ac10c65eb20f
MD5 1973f071722d860f87ffb8d675f2eaae
BLAKE2b-256 f63d8f08c333ebdd1725fc9e4636ff85182eb8165e5d1b6ad94282009f480fc8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page