Pandas on AWS.

AWS Data Wrangler

An AWS Professional Service open source initiative | aws-proserve-opensource@amazon.com

Source | Installation Command
PyPI   | pip install awswrangler
Conda  | conda install -c conda-forge awswrangler

Quick Start

Installation command: pip install awswrangler

import awswrangler as wr
import pandas as pd

df = pd.DataFrame({"id": [1, 2], "value": ["foo", "boo"]})

# Storing data on Data Lake
wr.s3.to_parquet(
    df=df,
    path="s3://bucket/dataset/",
    dataset=True,
    database="my_db",
    table="my_table"
)

# Retrieving the data directly from Amazon S3
df = wr.s3.read_parquet("s3://bucket/dataset/", dataset=True)

# Retrieving the data from Amazon Athena
df = wr.athena.read_sql_query("SELECT * FROM my_table", database="my_db")

# Get a Redshift connection (SQLAlchemy engine) from the Glue Catalog and retrieve data from Redshift Spectrum
engine = wr.catalog.get_engine("my-redshift-connection")
df = wr.db.read_sql_query("SELECT * FROM external_schema.my_table", con=engine)

# Get MySQL connection (SQLAlchemy) from Glue Catalog and LOAD the data into MySQL
engine = wr.catalog.get_engine("my-mysql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")

# Get PostgreSQL connection (SQLAlchemy) from Glue Catalog and LOAD the data into PostgreSQL
engine = wr.catalog.get_engine("my-postgresql-connection")
wr.db.to_sql(df, engine, schema="test", name="my_table")

Who uses AWS Data Wrangler?

Knowing which companies are using this library is important to help prioritize the project internally.

If you can, please send a PR with your company name and @githubhandle.

This version

1.9.0

Download files

Source Distribution

awswrangler-1.9.0.tar.gz (107.9 kB)

Uploaded Source

Built Distributions

awswrangler-1.9.0-py3.6.egg (286.8 kB)

Uploaded Egg

awswrangler-1.9.0-py3-none-any.whl (133.4 kB)

Uploaded Python 3

File details

Details for the file awswrangler-1.9.0.tar.gz.

File metadata

  • Download URL: awswrangler-1.9.0.tar.gz
  • Size: 107.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.6.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.11

File hashes

Hashes for awswrangler-1.9.0.tar.gz
Algorithm Hash digest
SHA256 82a0195add178196ed5c0019eafa589721710fa5ecc8924c0298d598b6733eea
MD5 2acba581f8eee9d3bfc9ab0feb8d0ab1
BLAKE2b-256 197dc8b440d9565ce42711025efb743c65c7d20165cf9033f67b5631f4a6c403
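
The SHA256 digest listed above can be checked against a downloaded copy of the archive. The following is a minimal sketch using only the Python standard library. The helper names `sha256_of_file` and `matches_expected` and the local file location are assumptions made for illustration; they are not part of awswrangler.

```python
import hashlib
from pathlib import Path

# SHA256 digest published above for awswrangler-1.9.0.tar.gz
EXPECTED_SHA256 = "82a0195add178196ed5c0019eafa589721710fa5ecc8924c0298d598b6733eea"


def sha256_of_file(path, chunk_size=1024 * 1024):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archives are never fully in memory
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Compare the file's digest with a published one, ignoring case."""
    return sha256_of_file(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Hypothetical location: the archive is assumed to sit in the working directory
    archive = Path("awswrangler-1.9.0.tar.gz")
    if archive.exists():
        print("OK" if matches_expected(archive, EXPECTED_SHA256) else "MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

The same helpers should work for the egg and wheel files by substituting the corresponding file name and SHA256 value from their sections.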

File details

Details for the file awswrangler-1.9.0-py3.6.egg.

File metadata

  • Download URL: awswrangler-1.9.0-py3.6.egg
  • Size: 286.8 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.6.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.11

File hashes

Hashes for awswrangler-1.9.0-py3.6.egg
Algorithm Hash digest
SHA256 f5d05872796057dcc82ff94262e591a33bf2fdbe9964cdec6c3dcab0b11ae2fc
MD5 aad97f667ba4c0bc4e87a5a3d7066a3c
BLAKE2b-256 9bf8abc22ac0965c571269f3df61b59f1485c2df50e2ded7f73700195896b9b5

File details

Details for the file awswrangler-1.9.0-py3-none-any.whl.

File metadata

  • Download URL: awswrangler-1.9.0-py3-none-any.whl
  • Size: 133.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.6.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.6.11

File hashes

Hashes for awswrangler-1.9.0-py3-none-any.whl
Algorithm Hash digest
SHA256 311d926d3eacbed93239eebbc89a12210b508157b371ba08db15cc94ee8d28cc
MD5 fba9b10dc09015b18ddb1a85c6869a9e
BLAKE2b-256 d0c268a31caea0bf3b078154fb9cbc2e0abc8bd81f6c9297c5319af4409d3689
