Skip to main content

Pandas on AWS.

Project description

AWS Data Wrangler

Pandas on AWS

AWS Data Wrangler

An AWS Professional Service open source initiative | aws-proserve-opensource@amazon.com

Release Python Version Code style: black License

Checked with mypy Coverage Static Checking Documentation Status

Source Downloads Installation Command
PyPi PyPI Downloads pip install awswrangler
Conda Conda Downloads conda install -c conda-forge awswrangler

Powered By

Table of contents

Quick Start

Installation command: pip install awswrangler

import awswrangler as wr
import pandas as pd

df = pd.DataFrame({"id": [1, 2], "value": ["foo", "boo"]})

# Storing data on Data Lake
wr.s3.to_parquet(
    df=df,
    path="s3://bucket/dataset/",
    dataset=True,
    database="my_db",
    table="my_table"
)

# Retrieving the data directly from Amazon S3
df = wr.s3.read_parquet("s3://bucket/dataset/", dataset=True)

# Retrieving the data from Amazon Athena
df = wr.athena.read_sql_query("SELECT * FROM my_table", database="my_db")

# Get Redshift connection (SQLAlchemy) from Glue and retrieving data from Redshift Spectrum
engine = wr.catalog.get_engine("my-redshift-connection")
df = wr.db.read_sql_query("SELECT * FROM external_schema.my_table", con=engine)

# Get MySQL connection (SQLAlchemy) from Glue Catalog and LOAD the data into MySQL
engine = wr.catalog.get_engine("my-mysql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")

# Get PostgreSQL connection (SQLAlchemy) from Glue Catalog and LOAD the data into PostgreSQL
engine = wr.catalog.get_engine("my-postgresql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")

Read The Docs

Community Resources

Please send a Pull Request with your resource reference and @githubhandle.

Who uses AWS Data Wrangler?

Knowing which companies are using this library is important to help prioritize the project internally.

Please send a Pull Request with your company name and @githubhandle if you may.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

awswrangler-1.9.5.tar.gz (110.4 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

awswrangler-1.9.5-py3.6.egg (288.8 kB view details)

Uploaded Egg

awswrangler-1.9.5-py3-none-any.whl (134.8 kB view details)

Uploaded Python 3

File details

Details for the file awswrangler-1.9.5.tar.gz.

File metadata

  • Download URL: awswrangler-1.9.5.tar.gz
  • Upload date:
  • Size: 110.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.49.0 CPython/3.6.11

File hashes

Hashes for awswrangler-1.9.5.tar.gz
Algorithm Hash digest
SHA256 7c031c379800ffda571a745f045a7b2e164f8d4a9a1bc3896cc53a7f2ba85e46
MD5 55ea095722f097bc192ec9a0e7d9e193
BLAKE2b-256 191e3ec7009833f5f012844350febe77263f3977a4f66fea4e201a2c0aec90c3

See more details on using hashes here.

File details

Details for the file awswrangler-1.9.5-py3.6.egg.

File metadata

  • Download URL: awswrangler-1.9.5-py3.6.egg
  • Upload date:
  • Size: 288.8 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.49.0 CPython/3.6.11

File hashes

Hashes for awswrangler-1.9.5-py3.6.egg
Algorithm Hash digest
SHA256 3b84936fc1872d94ea6b273169736580137423f456682d70006ec037ce0baf64
MD5 e9c4765dba4b116f2b00dbbcccf3796d
BLAKE2b-256 a20dc7e83e276b552e4bd13319f243aec4737c8b95053a59353077655a0ae58e

See more details on using hashes here.

File details

Details for the file awswrangler-1.9.5-py3-none-any.whl.

File metadata

  • Download URL: awswrangler-1.9.5-py3-none-any.whl
  • Upload date:
  • Size: 134.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.49.0 CPython/3.6.11

File hashes

Hashes for awswrangler-1.9.5-py3-none-any.whl
Algorithm Hash digest
SHA256 fb9a645711bb3ccf91ce5ea380410aac5b66389442da86e68206b71aca0fb43f
MD5 a3045dd160a46c670a8308d45a598c9e
BLAKE2b-256 d86bb57580d95c9345063c9aedc74644ea1067c7d8933dcec339c68b5a4f0d54

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page