Skip to main content

Pandas on AWS.

Project description

AWS Data Wrangler

Pandas on AWS

AWS Data Wrangler

An AWS Professional Service open source initiative | aws-proserve-opensource@amazon.com

Release Python Version Code style: black License

Checked with mypy Coverage Static Checking Documentation Status

Source Downloads Installation Command
PyPi PyPI Downloads pip install awswrangler
Conda Conda Downloads conda install -c conda-forge awswrangler

Powered By

Table of contents

Quick Start

Installation command: pip install awswrangler

import awswrangler as wr
import pandas as pd

df = pd.DataFrame({"id": [1, 2], "value": ["foo", "boo"]})

# Storing data on Data Lake
wr.s3.to_parquet(
    df=df,
    path="s3://bucket/dataset/",
    dataset=True,
    database="my_db",
    table="my_table"
)

# Retrieving the data directly from Amazon S3
df = wr.s3.read_parquet("s3://bucket/dataset/", dataset=True)

# Retrieving the data from Amazon Athena
df = wr.athena.read_sql_query("SELECT * FROM my_table", database="my_db")

# Get Redshift connection (SQLAlchemy) from Glue and retrieving data from Redshift Spectrum
engine = wr.catalog.get_engine("my-redshift-connection")
df = wr.db.read_sql_query("SELECT * FROM external_schema.my_table", con=engine)

# Get MySQL connection (SQLAlchemy) from Glue Catalog and LOAD the data into MySQL
engine = wr.catalog.get_engine("my-mysql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")

# Get PostgreSQL connection (SQLAlchemy) from Glue Catalog and LOAD the data into PostgreSQL
engine = wr.catalog.get_engine("my-postgresql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")

Read The Docs

Who uses AWS Data Wrangler?

Knowing which companies are using this library is important to help prioritize the project internally.

Please send a PR with your company name and @githubhandle if you may.

Project details


Release history Release notifications | RSS feed

This version

1.9.1

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

awswrangler-1.9.1.tar.gz (107.3 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

awswrangler-1.9.1-py3.6.egg (284.8 kB view details)

Uploaded Egg

awswrangler-1.9.1-py3-none-any.whl (132.7 kB view details)

Uploaded Python 3

File details

Details for the file awswrangler-1.9.1.tar.gz.

File metadata

  • Download URL: awswrangler-1.9.1.tar.gz
  • Upload date:
  • Size: 107.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.2.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.11

File hashes

Hashes for awswrangler-1.9.1.tar.gz
Algorithm Hash digest
SHA256 2a89f15a63e13417487588447e61a3a17e375af0ce57c38e9e7822cffcac55b5
MD5 d3b9881b881724c45fd5fea0265ab286
BLAKE2b-256 c53fe4c0900fc044f09fcda8d70d5162444dd297fa0c0b1b2a0222b2a24b6c6e

See more details on using hashes here.

File details

Details for the file awswrangler-1.9.1-py3.6.egg.

File metadata

  • Download URL: awswrangler-1.9.1-py3.6.egg
  • Upload date:
  • Size: 284.8 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.2.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.11

File hashes

Hashes for awswrangler-1.9.1-py3.6.egg
Algorithm Hash digest
SHA256 db549ed99d8fd3e801136eab1f12ad5adc7ef6655bbb12762e6dae5c78e54df3
MD5 71319f3061250a1d4dc6c6c2d3d4ab64
BLAKE2b-256 aedbfa606c7f9ccafc311669a2c28bc359d31c7c43761ca5904721221c5f7d1c

See more details on using hashes here.

File details

Details for the file awswrangler-1.9.1-py3-none-any.whl.

File metadata

  • Download URL: awswrangler-1.9.1-py3-none-any.whl
  • Upload date:
  • Size: 132.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.2.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.11

File hashes

Hashes for awswrangler-1.9.1-py3-none-any.whl
Algorithm Hash digest
SHA256 c4060311d7d9904d2442ed2252441747016d8f59665f59a705e9c5438053c5e5
MD5 81f73cbad6718725c5548d754b54dea1
BLAKE2b-256 15a6b96018af476ed8699c4b3d04205c051fd552a0364cc36e9b23c33b2ffff0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page