Pandas on AWS.

AWS Data Wrangler

An AWS Professional Service open source initiative | aws-proserve-opensource@amazon.com

Source    Installation Command
PyPI      pip install awswrangler
Conda     conda install -c conda-forge awswrangler

Quick Start

Installation command: pip install awswrangler

import awswrangler as wr
import pandas as pd

df = pd.DataFrame({"id": [1, 2], "value": ["foo", "boo"]})

# Store the data in the data lake and register it as a table
wr.s3.to_parquet(
    df=df,
    path="s3://bucket/dataset/",
    dataset=True,
    database="my_db",
    table="my_table"
)

# Retrieving the data directly from Amazon S3
df = wr.s3.read_parquet("s3://bucket/dataset/", dataset=True)

# Retrieving the data from Amazon Athena
df = wr.athena.read_sql_query("SELECT * FROM my_table", database="my_db")

# Get a Redshift connection (SQLAlchemy) from the Glue Catalog and retrieve data from Redshift Spectrum
engine = wr.catalog.get_engine("my-redshift-connection")
df = wr.db.read_sql_query("SELECT * FROM external_schema.my_table", con=engine)

# Get a MySQL connection (SQLAlchemy) from the Glue Catalog and load the data into MySQL
engine = wr.catalog.get_engine("my-mysql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")

# Get a PostgreSQL connection (SQLAlchemy) from the Glue Catalog and load the data into PostgreSQL
engine = wr.catalog.get_engine("my-postgresql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")

Read The Docs

Who uses AWS Data Wrangler?

Knowing which companies use this library helps prioritize the project internally.

If you are able to, please send a PR with your company name and @githubhandle.

This version

1.9.2

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

awswrangler-1.9.2.tar.gz (107.5 kB)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

awswrangler-1.9.2-py3.6.egg (284.9 kB)

Uploaded Egg

awswrangler-1.9.2-py3-none-any.whl (132.7 kB)

Uploaded Python 3
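
The wheel file name above follows the standard wheel naming convention: distribution, version, an optional build tag, then Python tag, ABI tag, and platform tag, separated by hyphens. As a minimal sketch (the `WheelName` type and `parse_wheel_filename` helper are hypothetical names introduced here for illustration, not part of awswrangler or pip), the name can be split like this:

```python
from typing import NamedTuple, Optional


class WheelName(NamedTuple):
    """Fields of a wheel file name (hypothetical helper type)."""
    distribution: str
    version: str
    build: Optional[str]
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_filename(filename: str) -> WheelName:
    """Split a wheel file name into its hyphen-separated fields."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        # No build tag present.
        dist, version, py, abi, plat = parts
        build = None
    elif len(parts) == 6:
        # Optional build tag sits between the version and the Python tag.
        dist, version, build, py, abi, plat = parts
    else:
        raise ValueError(f"unexpected number of fields in {filename!r}")
    return WheelName(dist, version, build, py, abi, plat)


info = parse_wheel_filename("awswrangler-1.9.2-py3-none-any.whl")
print(info.python_tag, info.abi_tag, info.platform_tag)
```

Here `py3-none-any` indicates a pure-Python wheel: any Python 3 interpreter, no ABI requirement, any platform.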

File details

Details for the file awswrangler-1.9.2.tar.gz.

File metadata

  • Download URL: awswrangler-1.9.2.tar.gz
  • Upload date:
  • Size: 107.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.2.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.11

File hashes

Hashes for awswrangler-1.9.2.tar.gz
Algorithm Hash digest
SHA256 0d0277adf31d57d154795f4330f055c1b55db482be5515ef52ad5fe806bce6a6
MD5 9542a1fe5ee2db522d7524b1bd76151f
BLAKE2b-256 8663f9e267ca5441fac2d6cdaff5a2cc7d055ae2984b215bc5fcbda243a3d5eb

See more details on using hashes here.

File details

Details for the file awswrangler-1.9.2-py3.6.egg.

File metadata

  • Download URL: awswrangler-1.9.2-py3.6.egg
  • Upload date:
  • Size: 284.9 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.2.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.11

File hashes

Hashes for awswrangler-1.9.2-py3.6.egg
Algorithm Hash digest
SHA256 7e66c035862ece0441dcf27b335c1bd6b6dd32010a332a32d09791125004bef8
MD5 371939344658e9ef5eda4527714145b5
BLAKE2b-256 587908d09795eaa16c67e526a126702bf2674a0ab5c149c48334f019da3dfdfc

File details

Details for the file awswrangler-1.9.2-py3-none-any.whl.

File metadata

  • Download URL: awswrangler-1.9.2-py3-none-any.whl
  • Upload date:
  • Size: 132.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.2.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.11

File hashes

Hashes for awswrangler-1.9.2-py3-none-any.whl
Algorithm Hash digest
SHA256 de294908e78466b983d041777ecda8d7ddf954ce5c0c3ad1f57ac143c71771c7
MD5 a9e216e7ee319a02f374b48d47bbc992
BLAKE2b-256 24d3649703398a427ef736313652aa61b1e3daf6d12a2b5083e457c1bcfc53e0
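
The SHA256 digests listed for each file can be compared against a local download before installing it. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of_file` and `matches_published_digest` are hypothetical, and the local file path is an assumption about where the archive was saved.

```python
import hashlib
import hmac
from pathlib import Path

# SHA256 digest listed above for awswrangler-1.9.2.tar.gz
EXPECTED_SHA256 = "0d0277adf31d57d154795f4330f055c1b55db482be5515ef52ad5fe806bce6a6"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.lower())


if __name__ == "__main__":
    # Assumed location of the downloaded source distribution.
    archive = Path("awswrangler-1.9.2.tar.gz")
    if archive.exists():
        print(matches_published_digest(archive, EXPECTED_SHA256))
```

Reading in chunks keeps memory use flat regardless of file size. pip can perform a comparable check itself when a requirements file pins hashes and `--require-hashes` is used.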
